Design of probabilistic random forests with applications to anticancer drug sensitivity prediction

Raziur Rahman, Saad Haider, Souparno Ghosh, Ranadip Pal

Research output: Contribution to journalArticlepeer-review

12 Scopus citations

Abstract

Random forests consisting of an ensemble of regression trees with equal weights are frequently used for design of predictive models. In this article, we consider an extension of the methodology by representing the regression trees in the form of probabilistic trees and analyzing the nature of heteroscedasticity. The probabilistic tree representation allows for analytical computation of confidence intervals (CIs), and the tree weight optimization is expected to provide stricter CIs with comparable performance in mean error. We approached the ensemble of probabilistic trees’ prediction from the perspectives of a mixture distribution and as a weighted sum of correlated random variables. We applied our methodology to the drug sensitivity predic-tion problem on synthetic and cancer cell line encyclopedia dataset and illustrated that tree weights can be selected to reduce the average length of the CI without increase in mean error.

Original languageEnglish
Pages (from-to)57-73
Number of pages17
JournalCancer Informatics
Volume15
DOIs
StatePublished - Mar 31 2016

Keywords

  • Drug sensitivity prediction
  • Heteroscedasticity
  • Probabilistic random forests
  • Variance analysis of random forests

Fingerprint

Dive into the research topics of 'Design of probabilistic random forests with applications to anticancer drug sensitivity prediction'. Together they form a unique fingerprint.

Cite this