Literature DB >> 32691502

A general framework for integrative analysis of incomplete multiomics data.

Dan-Yu Lin1, Donglin Zeng1, David Couper1.   

Abstract

There is a tremendous current interest in measuring multiple types of omics features (e.g., DNA sequences, RNA expressions, methylation profiles, metabolic profiles, protein expressions) on a large number of subjects. Although genotypes are typically available for all study subjects, other data types may be measured only on a subset of subjects due to cost or other constraints. In addition, quantitative omics measurements, such as metabolite levels and protein expressions, are subject to detection limits in that the measurements below (or above) certain thresholds are not detectable. In this article, we propose a rigorous and powerful approach to handle missing values and detection limits in integrative analysis of multiomics data. We relate quantitative omics variables to genetic variants and other variables through linear regression models and relate phenotypes to quantitative omics variables and other variables through generalized linear models. We derive the joint-likelihood for the two sets of models by allowing arbitrary patterns of missing values and detection limits for quantitative omics variables. We carry out maximum-likelihood estimation through computationally fast and stable algorithms. The resulting estimators are approximately unbiased and statistically efficient. An application to a major study on chronic obstructive lung disease yielded new biological insights.
© 2020 Wiley Periodicals LLC.

Entities:  

Keywords:  complex diseases; data integration; detection limits; genetic association; mediation analysis; missing data; quantitative trait loci; trans-omics studies

Mesh:

Year:  2020        PMID: 32691502      PMCID: PMC7951090          DOI: 10.1002/gepi.22328

Source DB:  PubMed          Journal:  Genet Epidemiol        ISSN: 0741-0395            Impact factor:   2.135


  25 in total

1.  A tobit variance-component method for linkage analysis of censored trait data.

Authors:  Michael P Epstein; Xihong Lin; Michael Boehnke
Journal:  Am J Hum Genet       Date:  2003-02-13       Impact factor: 11.025

2.  Structured Matrix Completion with Applications to Genomic Data Integration.

Authors:  Tianxi Cai; T Tony Cai; Anru Zhang
Journal:  J Am Stat Assoc       Date:  2016-08-18       Impact factor: 5.033

3.  Estimating the odds ratio when exposure has a limit of detection.

Authors:  Stephen R Cole; Haitao Chu; Lei Nie; Enrique F Schisterman
Journal:  Int J Epidemiol       Date:  2009-08-10       Impact factor: 7.196

4.  MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes.

Authors:  Yun Li; Cristen J Willer; Jun Ding; Paul Scheet; Gonçalo R Abecasis
Journal:  Genet Epidemiol       Date:  2010-12       Impact factor: 2.135

5.  Linear regression with an independent variable subject to a detection limit.

Authors:  Lei Nie; Haitao Chu; Chenglong Liu; Stephen R Cole; Albert Vexler; Enrique F Schisterman
Journal:  Epidemiology       Date:  2010-07       Impact factor: 4.822

6.  Data integration in genetics and genomics: methods and challenges.

Authors:  Jemila S Hamid; Pingzhao Hu; Nicole M Roslin; Vicki Ling; Celia M T Greenwood; Joseph Beyene
Journal:  Hum Genomics Proteomics       Date:  2009-01-12

7.  Integrative genomic testing of cancer survival using semiparametric linear transformation models.

Authors:  Yen-Tsung Huang; Tianxi Cai; Eunhee Kim
Journal:  Stat Med       Date:  2016-02-16       Impact factor: 2.373

8.  Comparison of serum, EDTA plasma and P100 plasma for luminex-based biomarker multiplex assays in patients with chronic obstructive pulmonary disease in the SPIROMICS study.

Authors:  Wanda K O'Neal; Wayne Anderson; Patricia V Basta; Elizabeth E Carretta; Claire M Doerschuk; R Graham Barr; Eugene R Bleecker; Stephanie A Christenson; Jeffrey L Curtis; Meilan K Han; Nadia N Hansel; Richard E Kanner; Eric C Kleerup; Fernando J Martinez; Bruce E Miller; Stephen P Peters; Stephen I Rennard; Mary Beth Scholand; Ruth Tal-Singer; Prescott G Woodruff; David J Couper; Sonia M Davis
Journal:  J Transl Med       Date:  2014-01-08       Impact factor: 5.531

9.  Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework.

Authors:  Valentin Voillet; Philippe Besse; Laurence Liaubet; Magali San Cristobal; Ignacio González
Journal:  BMC Bioinformatics       Date:  2016-10-03       Impact factor: 3.169

10.  Common Genetic Polymorphisms Influence Blood Biomarker Measurements in COPD.

Authors:  Wei Sun; Katerina Kechris; Sean Jacobson; M Bradley Drummond; Gregory A Hawkins; Jenny Yang; Ting-Huei Chen; Pedro Miguel Quibrera; Wayne Anderson; R Graham Barr; Patricia V Basta; Eugene R Bleecker; Terri Beaty; Richard Casaburi; Peter Castaldi; Michael H Cho; Alejandro Comellas; James D Crapo; Gerard Criner; Dawn Demeo; Stephanie A Christenson; David J Couper; Jeffrey L Curtis; Claire M Doerschuk; Christine M Freeman; Natalia A Gouskova; MeiLan K Han; Nicola A Hanania; Nadia N Hansel; Craig P Hersh; Eric A Hoffman; Robert J Kaner; Richard E Kanner; Eric C Kleerup; Sharon Lutz; Fernando J Martinez; Deborah A Meyers; Stephen P Peters; Elizabeth A Regan; Stephen I Rennard; Mary Beth Scholand; Edwin K Silverman; Prescott G Woodruff; Wanda K O'Neal; Russell P Bowler
Journal:  PLoS Genet       Date:  2016-08-17       Impact factor: 5.917

View more
  1 in total

1.  Strengthening Causal Inference in Exposomics Research: Application of Genetic Data and Methods.

Authors:  Christy L Avery; Annie Green Howard; Anna F Ballou; Victoria L Buchanan; Jason M Collins; Carolina G Downie; Stephanie M Engel; Mariaelisa Graff; Heather M Highland; Moa P Lee; Adam G Lilly; Kun Lu; Julia E Rager; Brooke S Staley; Kari E North; Penny Gordon-Larsen
Journal:  Environ Health Perspect       Date:  2022-05-09       Impact factor: 11.035

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.