Literature DB >> 30184058

Bayesian integrative model for multi-omics data with missingness.

Zhou Fang1, Tianzhou Ma2, Gong Tang1, Li Zhu1, Qi Yan3, Ting Wang3, Juan C Celedón3, Wei Chen1,3, George C Tseng1.   

Abstract

Motivation: Integrative analysis of multi-omics data from different high-throughput experimental platforms provides valuable insight into regulatory mechanisms associated with complex diseases, and gains statistical power to detect markers that are otherwise overlooked by single-platform omics analysis. In practice, a significant portion of samples may not be measured completely due to insufficient tissues or restricted budget (e.g. gene expression profile are measured but not methylation). Current multi-omics integrative methods require complete data. A common practice is to ignore samples with any missing platform and perform complete case analysis, which leads to substantial loss of statistical power.
Methods: In this article, inspired by the popular Integrative Bayesian Analysis of Genomics data (iBAG), we propose a full Bayesian model that allows incorporation of samples with missing omics data.
Results: Simulation results show improvement of the new full Bayesian approach in terms of outcome prediction accuracy and feature selection performance when sample size is limited and proportion of missingness is large. When sample size is large or the proportion of missingness is low, incorporating samples with missingness may introduce extra inference uncertainty and generate worse prediction and feature selection performance. To determine whether and how to incorporate samples with missingness, we propose a self-learning cross-validation (CV) decision scheme. Simulations and a real application on child asthma dataset demonstrate superior performance of the CV decision scheme when various types of missing mechanisms are evaluated. Availability and implementation: Freely available on the GitHub at https://github.com/CHPGenetics/FBM. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Year:  2018        PMID: 30184058      PMCID: PMC6223369          DOI: 10.1093/bioinformatics/bty775

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


  12 in total

1.  Detecting differential gene expression with a semiparametric hierarchical mixture method.

Authors:  Michael A Newton; Amine Noueiry; Deepayan Sarkar; Paul Ahlquist
Journal:  Biostatistics       Date:  2004-04       Impact factor: 5.899

2.  Biological impact of missing-value imputation on downstream analyses of gene expression profiles.

Authors:  Sunghee Oh; Dongwan D Kang; Guy N Brock; George C Tseng
Journal:  Bioinformatics       Date:  2010-11-02       Impact factor: 6.937

3.  Integrative Sparse K-Means With Overlapping Group Lasso in Genomic Applications for Disease Subtype Discovery.

Authors:  Zhiguang Huo; George Tseng
Journal:  Ann Appl Stat       Date:  2017-07-20       Impact factor: 2.083

4.  Statistical Methods in Integrative Genomics.

Authors:  Sylvia Richardson; George C Tseng; Wei Sun
Journal:  Annu Rev Stat Appl       Date:  2016-04-18       Impact factor: 5.810

5.  Comparison of Beta-value and M-value methods for quantifying methylation levels by microarray analysis.

Authors:  Pan Du; Xiao Zhang; Chiang-Ching Huang; Nadereh Jafari; Warren A Kibbe; Lifang Hou; Simon M Lin
Journal:  BMC Bioinformatics       Date:  2010-11-30       Impact factor: 3.169

6.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis.

Authors:  Ronglai Shen; Adam B Olshen; Marc Ladanyi
Journal:  Bioinformatics       Date:  2009-09-16       Impact factor: 6.937

7.  Which missing value imputation method to use in expression profiles: a comparative study and two selection schemes.

Authors:  Guy N Brock; John R Shaffer; Richard E Blakesley; Meredith J Lotz; George C Tseng
Journal:  BMC Bioinformatics       Date:  2008-01-10       Impact factor: 3.169

8.  iBAG: integrative Bayesian analysis of high-dimensional multiplatform genomics data.

Authors:  Wenting Wang; Veerabhadran Baladandayuthapani; Jeffrey S Morris; Bradley M Broom; Ganiraju Manyam; Kim-Anh Do
Journal:  Bioinformatics       Date:  2012-11-09       Impact factor: 6.937

9.  Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework.

Authors:  Valentin Voillet; Philippe Besse; Laurence Liaubet; Magali San Cristobal; Ignacio González
Journal:  BMC Bioinformatics       Date:  2016-10-03       Impact factor: 3.169

10.  Integrative clustering of multi-level omics data for disease subtype discovery using sequential double regularization.

Authors:  Sunghwan Kim; Steffi Oesterreich; Seyoung Kim; Yongseok Park; George C Tseng
Journal:  Biostatistics       Date:  2016-08-22       Impact factor: 5.899

View more
  4 in total

1.  Considerations of biomarker application for cancer continuum in the era of precision medicine.

Authors:  Rayjean J Hung; Elham Khodayari Moez; Shana J Kim; Sanjeev Budhathoki; Jennifer D Brooks
Journal:  Curr Epidemiol Rep       Date:  2022-07-09

Review 2.  Application of Bayesian genomic prediction methods to genome-wide association analyses.

Authors:  Anna Wolc; Jack C M Dekkers
Journal:  Genet Sel Evol       Date:  2022-05-13       Impact factor: 5.100

Review 3.  Data-Driven Modeling of Pregnancy-Related Complications.

Authors:  Camilo Espinosa; Martin Becker; Ivana Marić; Ronald J Wong; Gary M Shaw; Brice Gaudilliere; Nima Aghaeepour; David K Stevenson
Journal:  Trends Mol Med       Date:  2021-02-08       Impact factor: 15.272

4.  TiMEG: an integrative statistical method for partially missing multi-omics data.

Authors:  Sarmistha Das; Indranil Mukhopadhyay
Journal:  Sci Rep       Date:  2021-12-15       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.