W Jenny Shi1, Yonghua Zhuang2, Pamela H Russell2, Brian D Hobbs3,4, Margaret M Parker3, Peter J Castaldi3, Pratyaydipta Rudra2,5, Brian Vestal6, Craig P Hersh3,4, Laura M Saba7, Katerina Kechris2. 1. Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. 2. Department of Biostatistics and Informatics, University of Colorado Anschutz Medical Campus, Aurora, CO, USA. 3. Channing Division of Network Medicine, Brigham and Women's Hospital, Boston, MA, USA. 4. Division of Pulmonary and Critical Care Medicine, Brigham and Women's Hospital, Boston, MA, USA. 5. Department of Statistics, Oklahoma State University, Stillwater, OK. 6. Center for Genes, Environment & Health, National Jewish Health, Denver, CO, USA. 7. Department of Pharmaceutical Sciences, University of Colorado, Aurora, CO, USA.
Abstract
MOTIVATION: Complex diseases often involve a wide spectrum of phenotypic traits. Better understanding of the biological mechanisms relevant to each trait promotes understanding of the etiology of the disease and the potential for targeted and effective treatment plans. There have been many efforts towards omics data integration and network reconstruction, but limited work has examined the incorporation of relevant (quantitative) phenotypic traits. RESULTS: We propose a novel technique, sparse multiple canonical correlation network analysis (SmCCNet), for integrating multiple omics data types along with a quantitative phenotype of interest, and for constructing multi-omics networks that are specific to the phenotype. As a case study, we focus on miRNA-mRNA networks. Through simulations, we demonstrate that SmCCNet has better overall prediction performance compared to popular gene expression network construction and integration approaches under realistic settings. Applying SmCCNet to studies on chronic obstructive pulmonary disease (COPD) and breast cancer, we found enrichment of known relevant pathways (e.g. the Cadherin pathway for COPD and the interferon-gamma signaling pathway for breast cancer) as well as less known omics features that may be important to the diseases. Although those applications focus on miRNA-mRNA co-expression networks, SmCCNet is applicable to a variety of omics and other data types. It can also be easily generalized to incorporate multiple quantitative phenotype simultaneously. The versatility of SmCCNet suggests great potential of the approach in many areas. AVAILABILITY AND IMPLEMENTATION: The SmCCNet algorithm is written in R, and is freely available on the web at https://cran.r-project.org/web/packages/SmCCNet/index.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: Complex diseases often involve a wide spectrum of phenotypic traits. Better understanding of the biological mechanisms relevant to each trait promotes understanding of the etiology of the disease and the potential for targeted and effective treatment plans. There have been many efforts towards omics data integration and network reconstruction, but limited work has examined the incorporation of relevant (quantitative) phenotypic traits. RESULTS: We propose a novel technique, sparse multiple canonical correlation network analysis (SmCCNet), for integrating multiple omics data types along with a quantitative phenotype of interest, and for constructing multi-omics networks that are specific to the phenotype. As a case study, we focus on miRNA-mRNA networks. Through simulations, we demonstrate that SmCCNet has better overall prediction performance compared to popular gene expression network construction and integration approaches under realistic settings. Applying SmCCNet to studies on chronic obstructive pulmonary disease (COPD) and breast cancer, we found enrichment of known relevant pathways (e.g. the Cadherin pathway for COPD and the interferon-gamma signaling pathway for breast cancer) as well as less known omics features that may be important to the diseases. Although those applications focus on miRNA-mRNA co-expression networks, SmCCNet is applicable to a variety of omics and other data types. It can also be easily generalized to incorporate multiple quantitative phenotype simultaneously. The versatility of SmCCNet suggests great potential of the approach in many areas. AVAILABILITY AND IMPLEMENTATION: The SmCCNet algorithm is written in R, and is freely available on the web at https://cran.r-project.org/web/packages/SmCCNet/index.html. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Charles J Vaske; Stephen C Benz; J Zachary Sanborn; Dent Earl; Christopher Szeto; Jingchun Zhu; David Haussler; Joshua M Stuart Journal: Bioinformatics Date: 2010-06-15 Impact factor: 6.937
Authors: Yuanbin Ru; Katerina J Kechris; Boris Tabakoff; Paula Hoffman; Richard A Radcliffe; Russell Bowler; Spencer Mahaffey; Simona Rossi; George A Calin; Lynne Bemis; Dan Theodorescu Journal: Nucleic Acids Res Date: 2014-07-24 Impact factor: 16.971
Authors: Paula L Farré; Georgina D Scalise; Rocío B Duca; Guillermo N Dalton; Cintia Massillo; Juliana Porretti; Karen Graña; Kevin Gardner; Paola De Luca; Adriana De Siervi Journal: Oncotarget Date: 2018-02-13
Authors: Peter Langfelder; Jeffrey P Cantle; Doxa Chatzopoulou; Nan Wang; Fuying Gao; Ismael Al-Ramahi; Xiao-Hong Lu; Eliana Marisa Ramos; Karla El-Zein; Yining Zhao; Sandeep Deverasetty; Andreas Tebbe; Christoph Schaab; Daniel J Lavery; David Howland; Seung Kwak; Juan Botas; Jeffrey S Aaronson; Jim Rosinski; Giovanni Coppola; Steve Horvath; X William Yang Journal: Nat Neurosci Date: 2016-02-22 Impact factor: 24.884
Authors: Gabriella B Oliveira; Luciana C A Regitano; Aline S M Cesar; James M Reecy; Karina Y Degaki; Mirele D Poleti; Andrezza M Felício; James E Koltes; Luiz L Coutinho Journal: BMC Genomics Date: 2018-02-07 Impact factor: 3.969
Authors: Mohamed Abdel-Hafiz; Mesbah Najafi; Shahab Helmi; Katherine A Pratte; Yonghua Zhuang; Weixuan Liu; Katerina J Kechris; Russell P Bowler; Leslie Lange; Farnoush Banaei-Kashani Journal: Front Big Data Date: 2022-06-22
Authors: John B O'Connor; Madison Mottlowitz; Monica E Kruk; Alan Mickelson; Brandie D Wagner; Jonathan Kirk Harris; Christine H Wendt; Theresa A Laguna Journal: Front Cell Infect Microbiol Date: 2022-03-10 Impact factor: 6.073