Abstract
Background Prognosis (survival) prediction of patients is important for disease management. Multi-omics data are good resources for survival prediction, however, difficult to integrate computationally.
Results We introduce DeepProg, a new computational framework that robustly predicts patient survival subtypes based on multiple types of omic data. It employs an ensemble of deep-learning and machine-learning approaches to achieve high performance. We apply DeepProg on 32 cancer datasets from TCGA and discover that most cancers have two optimal survival subtypes. Patient survival risk-stratification using DeepProg is significantly better than another multi-omics data integration method called Similarity Network Fusion (p-value=7.9e-7). DeepProg shows excellent predictive accuracy in external validation cohorts, exemplified by 2 liver cancer (C-index 0.73 and 0.80) and five breast cancer datasets (C-index 0.68-0.73). Further comprehensive pan-cancer analysis unveils the genomic signatures common among all the poorest survival subtypes, with genes enriched in extracellular matrix modeling, immune deregulation, and mitosis processes.
Conclusions DeepProg is a powerful and generic computational framework to predict patient survival risks. DeepProg is freely available for non-commercial use at: http://garmiregroup.org/DeepProg
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This research was supported by grants K01ES025434 awarded by NIEHS through funds provided by the trans-NIH Big Data to Knowledge (BD2K) initiative (www.bd2k.nih.gov), P20 COBRE GM103457 awarded by NIH/NIGMS, R01 LM012373 and R01 LM012907 awarded by NLM, and R01 HD084633 awarded by NICHD to L.X. Garmire.
Author Declarations
All relevant ethical guidelines have been followed and any necessary IRB and/or ethics committee approvals have been obtained.
Not Applicable
All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.
Not Applicable
Any clinical trials involved have been registered with an ICMJE-approved registry such as ClinicalTrials.gov and the trial ID is included in the manuscript.
Not Applicable
I have followed all appropriate research reporting guidelines and uploaded the relevant Equator, ICMJE or other checklist(s) as supplementary files, if applicable.
Not Applicable
Data Availability
Data are available upon request