A new analytical framework for missing data imputation and classification with uncertainty: Missing data imputation and heart failure readmission prediction

Zhiyong Hu, Dongping Du

Research output: Contribution to journalArticlepeer-review

Abstract

Background The wide adoption of electronic health records (EHR) system has provided vast opportunities to advance health care services. However, the prevalence of missing values in EHR system poses a great challenge on data analysis to support clinical decision-making. The objective of this study is to develop a new methodological framework that can address the missing data challenge and provide a reliable tool to predict the hospital readmission among Heart Failure patients. Methods We used Gaussian Process Latent Variable Model (GPLVM) to impute the missing values. Specifically, a lower dimensional embedding was learned from a small complete dataset and then used to impute the missing values in the incomplete dataset. The GPLVM-based missing data imputation can provide both the mean estimate and the uncertainty associated with the mean estimate. To incorporate the uncertainty in prediction, a constrained support vector machine (cSVM) was developed to obtain robust predictions. We fi
Original languageEnglish
Pages (from-to)1-15
JournalDefault journal
DOIs
StatePublished - Sep 2020

Fingerprint

Dive into the research topics of 'A new analytical framework for missing data imputation and classification with uncertainty: Missing data imputation and heart failure readmission prediction'. Together they form a unique fingerprint.

Cite this