Literature DB >> 31281919

Consistency and overfitting of multi-omics methods on experimental data.

Sean D McCabe1, Dan-Yu Lin2, Michael I Love1.   

Abstract

Knowledge on the relationship between different biological modalities (RNA, chromatin, etc.) can help further our understanding of the processes through which biological components interact. The ready availability of multi-omics datasets has led to the development of numerous methods for identifying sources of common variation across biological modalities. However, evaluation of the performance of these methods, in terms of consistency, has been difficult because most methods are unsupervised. We present a comparison of sparse multiple canonical correlation analysis (Sparse mCCA), angle-based joint and individual variation explained (AJIVE) and multi-omics factor analysis (MOFA) using a cross-validation approach to assess overfitting and consistency. Both large and small-sample datasets were used to evaluate performance, and a permuted null dataset was used to identify overfitting through the application of our framework and approach. In the large-sample setting, we found that all methods demonstrated consistency and lack of overfitting; however, in the small-sample size setting, AJIVE provided the most stable results. We provide an R package so that our framework and approach can be applied to evaluate other methods and datasets. © The authors 2019. Published by Oxford University Press on behalf of the Institute of Mathematics and its Applications. All rights reserved.

Entities:  

Keywords:  angle-based joint and individual variation explained; cross-validation; evaluation; multi-omics; multi-omics factor analysis; sparse canonical correlation analysis

Year:  2020        PMID: 31281919      PMCID: PMC7373174          DOI: 10.1093/bib/bbz070

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  18 in total

1.  Multi-omics integration-a comparison of unsupervised clustering methodologies.

Authors:  Giulia Tini; Luca Marchetti; Corrado Priami; Marie-Pier Scott-Boyer
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

Review 2.  Comparison and evaluation of integrative methods for the analysis of multilevel omics data: a study based on simulated and experimental cancer data.

Authors:  Bettina M Pucher; Oana A Zeleznik; Gerhard G Thallinger
Journal:  Brief Bioinform       Date:  2019-03-25       Impact factor: 11.622

3.  Integrative analysis of gene expression and copy number alterations using canonical correlation analysis.

Authors:  Charlotte Soneson; Henrik Lilljebjörn; Thoas Fioretos; Magnus Fontes
Journal:  BMC Bioinformatics       Date:  2010-04-15       Impact factor: 3.169

4.  Estrogen receptor alpha (ESR1) gene amplification is frequent in breast cancer.

Authors:  Frederik Holst; Phillip R Stahl; Christian Ruiz; Olaf Hellwinkel; Zeenath Jehan; Marc Wendland; Annette Lebeau; Luigi Terracciano; Khawla Al-Kuraya; Fritz Jänicke; Guido Sauter; Ronald Simon
Journal:  Nat Genet       Date:  2007-04-08       Impact factor: 38.330

5.  Multi-Omics Factor Analysis-a framework for unsupervised integration of multi-omics data sets.

Authors:  Ricard Argelaguet; Britta Velten; Damien Arnol; Sascha Dietrich; Thorsten Zenz; John C Marioni; Florian Buettner; Wolfgang Huber; Oliver Stegle
Journal:  Mol Syst Biol       Date:  2018-06-20       Impact factor: 11.429

6.  Expression reflects population structure.

Authors:  Brielin C Brown; Nicolas L Bray; Lior Pachter
Journal:  PLoS Genet       Date:  2018-12-19       Impact factor: 5.917

7.  RNA splicing is a primary link between genetic variation and disease.

Authors:  Yang I Li; Bryce van de Geijn; Anil Raj; David A Knowles; Allegra A Petti; David Golan; Yoav Gilad; Jonathan K Pritchard
Journal:  Science       Date:  2016-04-28       Impact factor: 47.728

8.  Sparse canonical methods for biological data integration: application to a cross-platform study.

Authors:  Kim-Anh Lê Cao; Pascal G P Martin; Christèle Robert-Granié; Philippe Besse
Journal:  BMC Bioinformatics       Date:  2009-01-26       Impact factor: 3.169

9.  Integrative analysis of 111 reference human epigenomes.

Authors:  Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal:  Nature       Date:  2015-02-19       Impact factor: 69.504

10.  A multivariate approach to the integration of multi-omics datasets.

Authors:  Chen Meng; Bernhard Kuster; Aedín C Culhane; Amin Moghaddas Gholami
Journal:  BMC Bioinformatics       Date:  2014-05-29       Impact factor: 3.169

View more
  6 in total

1.  EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models.

Authors:  Xiangju Liu; Yu Zhang; Chunli Fu; Ruochi Zhang; Fengfeng Zhou
Journal:  Front Genet       Date:  2021-04-27       Impact factor: 4.599

2.  Integrative Network Fusion: A Multi-Omics Approach in Molecular Profiling.

Authors:  Marco Chierici; Nicole Bussola; Alessia Marcolini; Margherita Francescatto; Alessandro Zandonà; Lucia Trastulla; Claudio Agostinelli; Giuseppe Jurman; Cesare Furlanello
Journal:  Front Oncol       Date:  2020-06-30       Impact factor: 6.244

3.  DNA methylation and gene expression integration in cardiovascular disease.

Authors:  Alba Fernández-Sanlés; Roberto Elosua; Guillermo Palou-Márquez; Isaac Subirana; Lara Nonell
Journal:  Clin Epigenetics       Date:  2021-04-09       Impact factor: 6.551

4.  Multi-Omics Approach to Elucidate Cerebrospinal Fluid Changes in Dogs with Intervertebral Disc Herniation.

Authors:  Anita Horvatić; Andrea Gelemanović; Boris Pirkić; Ozren Smolec; Blanka Beer Ljubić; Ivana Rubić; Peter David Eckersall; Vladimir Mrljak; Mark McLaughlin; Marko Samardžija; Marija Lipar
Journal:  Int J Mol Sci       Date:  2021-10-28       Impact factor: 5.923

Review 5.  Integrated Multi-Omics Analyses in Oncology: A Review of Machine Learning Methods and Tools.

Authors:  Giovanna Nicora; Francesca Vitali; Arianna Dagliati; Nophar Geifman; Riccardo Bellazzi
Journal:  Front Oncol       Date:  2020-06-30       Impact factor: 6.244

Review 6.  Incorporating Machine Learning into Established Bioinformatics Frameworks.

Authors:  Noam Auslander; Ayal B Gussow; Eugene V Koonin
Journal:  Int J Mol Sci       Date:  2021-03-12       Impact factor: 5.923

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.