Sstack: An R package for stacking with applications to scenarios involving sequential addition of samples and features

Kevin Matlock; Raziur Rahman; Souparno Ghosh; Ranadip Pal

doi:10.1093/bioinformatics/btz010

Research output: Contribution to journal › Article › peer-review

Abstract

Biological processes are characterized by a variety of different genomic feature sets. However, often times when building models, portions of these features are missing for a subset of the dataset. We provide a modeling framework to effectively integrate this type of heterogeneous data to improve prediction accuracy. To test our methodology, we have stacked data from the Cancer Cell Line Encyclopedia to increase the accuracy of drug sensitivity prediction. The package addresses the dynamic regime of information integration involving sequential addition of features and samples.

Original language	English
Pages (from-to)	3143-3145
Number of pages	3
Journal	Bioinformatics
Volume	35
Issue number	17
DOIs	https://doi.org/10.1093/bioinformatics/btz010
State	Published - Sep 1 2019

Cite this

@article{c870d14a306749d18c04a151ad18cdfd,
title = "Sstack: An R package for stacking with applications to scenarios involving sequential addition of samples and features",
abstract = "Biological processes are characterized by a variety of different genomic feature sets. However, often times when building models, portions of these features are missing for a subset of the dataset. We provide a modeling framework to effectively integrate this type of heterogeneous data to improve prediction accuracy. To test our methodology, we have stacked data from the Cancer Cell Line Encyclopedia to increase the accuracy of drug sensitivity prediction. The package addresses the dynamic regime of information integration involving sequential addition of features and samples.",
author = "Kevin Matlock and Raziur Rahman and Souparno Ghosh and Ranadip Pal",
note = "Publisher Copyright: {\textcopyright} 2019 The Author(s).",
year = "2019",
month = sep,
day = "1",
doi = "10.1093/bioinformatics/btz010",
language = "English",
volume = "35",
pages = "3143--3145",
journal = "Bioinformatics",
issn = "1367-4803",
number = "17",
}

TY - JOUR
T1 - Sstack
T2 - An R package for stacking with applications to scenarios involving sequential addition of samples and features
AU - Matlock, Kevin
AU - Rahman, Raziur
AU - Ghosh, Souparno
AU - Pal, Ranadip
PY - 2019/9/1
Y1 - 2019/9/1
N2 - Biological processes are characterized by a variety of different genomic feature sets. However, often times when building models, portions of these features are missing for a subset of the dataset. We provide a modeling framework to effectively integrate this type of heterogeneous data to improve prediction accuracy. To test our methodology, we have stacked data from the Cancer Cell Line Encyclopedia to increase the accuracy of drug sensitivity prediction. The package addresses the dynamic regime of information integration involving sequential addition of features and samples.
AB - Biological processes are characterized by a variety of different genomic feature sets. However, often times when building models, portions of these features are missing for a subset of the dataset. We provide a modeling framework to effectively integrate this type of heterogeneous data to improve prediction accuracy. To test our methodology, we have stacked data from the Cancer Cell Line Encyclopedia to increase the accuracy of drug sensitivity prediction. The package addresses the dynamic regime of information integration involving sequential addition of features and samples.
UR - http://www.scopus.com/inward/record.url?scp=85072053642&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btz010
DO - 10.1093/bioinformatics/btz010
M3 - Article
C2 - 30649230
AN - SCOPUS:85072053642
SN - 1367-4803
VL - 35
SP - 3143
EP - 3145
JO - Bioinformatics
JF - Bioinformatics
IS - 17
ER -
