Marie Denis1, Mahlet G Tadesse2. 1. UMR AGAP, CIRAD, Montpellier, France, Department of Epidemiology, Harvard School of Public Health, Boston, MA, USA and. 2. Department of Mathematics and Statistics, Georgetown University, Washington, DC, USA.
Abstract
MOTIVATION: Advances in high-throughput technologies have led to the acquisition of various types of -omic data on the same biological samples. Each data type gives independent and complementary information that can explain the biological mechanisms of interest. While several studies performing independent analyses of each dataset have led to significant results, a better understanding of complex biological mechanisms requires an integrative analysis of different sources of data. RESULTS: Flexible modeling approaches, based on penalized likelihood methods and expectation-maximization (EM) algorithms, are studied and tested under various biological relationship scenarios between the different molecular features and their effects on a clinical outcome. The models are applied to genomic datasets from two cancer types in the Cancer Genome Atlas project: glioblastoma multiforme and ovarian serous cystadenocarcinoma. The integrative models lead to improved model fit and predictive performance. They also provide a better understanding of the biological mechanisms underlying patients' survival. AVAILABILITY AND IMPLEMENTATION: Source code implementing the integrative models is freely available at https://github.com/mgt000/IntegrativeAnalysis along with example datasets and sample R script applying the models to these data. The TCGA datasets used for analysis are publicly available at https://tcga-data.nci.nih.gov/tcga/tcgaDownload.jsp CONTACT: marie.denis@cirad.fr or mgt26@georgetown.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Advances in high-throughput technologies have led to the acquisition of various types of -omic data on the same biological samples. Each data type gives independent and complementary information that can explain the biological mechanisms of interest. While several studies performing independent analyses of each dataset have led to significant results, a better understanding of complex biological mechanisms requires an integrative analysis of different sources of data. RESULTS: Flexible modeling approaches, based on penalized likelihood methods and expectation-maximization (EM) algorithms, are studied and tested under various biological relationship scenarios between the different molecular features and their effects on a clinical outcome. The models are applied to genomic datasets from two cancer types in the Cancer Genome Atlas project: glioblastoma multiforme and ovarian serous cystadenocarcinoma. The integrative models lead to improved model fit and predictive performance. They also provide a better understanding of the biological mechanisms underlying patients' survival. AVAILABILITY AND IMPLEMENTATION: Source code implementing the integrative models is freely available at https://github.com/mgt000/IntegrativeAnalysis along with example datasets and sample R script applying the models to these data. The TCGA datasets used for analysis are publicly available at https://tcga-data.nci.nih.gov/tcga/tcgaDownload.jsp CONTACT: marie.denis@cirad.fr or mgt26@georgetown.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Atila van Nas; Leslie Ingram-Drake; Janet S Sinsheimer; Susanna S Wang; Eric E Schadt; Thomas Drake; Aldons J Lusis Journal: Genetics Date: 2010-05-03 Impact factor: 4.562
Authors: Barbara E Stranger; Matthew S Forrest; Mark Dunning; Catherine E Ingle; Claude Beazley; Natalie Thorne; Richard Redon; Christine P Bird; Anna de Grassi; Charles Lee; Chris Tyler-Smith; Nigel Carter; Stephen W Scherer; Simon Tavaré; Panagiotis Deloukas; Matthew E Hurles; Emmanouil T Dermitzakis Journal: Science Date: 2007-02-09 Impact factor: 47.728
Authors: James R Wagner; Stephan Busche; Bing Ge; Tony Kwan; Tomi Pastinen; Mathieu Blanchette Journal: Genome Biol Date: 2014-02-20 Impact factor: 13.583
Authors: Yusha Liu; Keith A Baggerly; Elias Orouji; Ganiraju Manyam; Huiqin Chen; Michael Lam; Jennifer S Davis; Michael S Lee; Bradley M Broom; David G Menter; Kunal Rai; Scott Kopetz; Jeffrey S Morris Journal: Bioinformatics Date: 2021-06-12 Impact factor: 6.931
Authors: Rency S Varghese; Yuan Zhou; Megan Barefoot; Yifan Chen; Cristina Di Poto; Abdalla Kara Balla; Everett Oliver; Zaki A Sherif; Deepak Kumar; Alexander H Kroemer; Mahlet G Tadesse; Habtom W Ressom Journal: BMC Med Genomics Date: 2020-03-30 Impact factor: 3.063