
Accounting for unobserved covariates with varying degrees of estimability in high-dimensional biological data.

Chris McKennan, Dan Nicolae.

Abstract

An important phenomenon in high-throughput biological data is the presence of unobserved covariates that can have a significant impact on the measured response. When these covariates are also correlated with the covariate of interest, ignoring or improperly estimating them can lead to inaccurate estimates of and spurious inference on the corresponding coefficients of interest in a multivariate linear model. We first prove that existing methods to account for these unobserved covariates often inflate Type I error for the null hypothesis that a given coefficient of interest is zero. We then provide alternative estimators for the coefficients of interest that correct the inflation, and prove that our estimators are asymptotically equivalent to the ordinary least squares estimators obtained when every covariate is observed. Lastly, we use previously published DNA methylation data to show that our method can more accurately estimate the direct effect of asthma on DNA methylation levels compared to existing methods, the latter of which likely fail to recover and account for latent cell type heterogeneity.
© 2019 Biometrika Trust.
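
The inflation of Type I error described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' estimator; it only contrasts a naive ordinary least squares fit that ignores a latent covariate with the "oracle" fit in which every covariate is observed. The sample size, number of responses, correlation `rho`, and the helper names `simulate` and `t_stats` are all assumptions chosen for illustration.

```python
import numpy as np

def simulate(n=200, p=2000, k=2, rho=0.6, seed=0):
    """Simulate Y = C L^T + E, where the latent C is correlated with x.

    The true effect of x on every response is zero, so any rejection
    of the null hypothesis is a false positive.
    """
    rng = np.random.default_rng(seed)
    x = rng.standard_normal(n)                 # covariate of interest
    C = rng.standard_normal((n, k))            # unobserved covariates
    C[:, 0] = rho * x + np.sqrt(1 - rho**2) * C[:, 0]  # correlate with x
    L = rng.standard_normal((p, k))            # effects of C on each response
    E = rng.standard_normal((n, p))            # noise
    Y = C @ L.T + E
    return x, C, Y

def t_stats(design, Y):
    """OLS t statistics for the first design column, one per column of Y."""
    n, d = design.shape
    coef, _, _, _ = np.linalg.lstsq(design, Y, rcond=None)
    resid = Y - design @ coef
    sigma2 = (resid**2).sum(axis=0) / (n - d)  # residual variance per response
    xtx_inv = np.linalg.inv(design.T @ design)
    se = np.sqrt(sigma2 * xtx_inv[0, 0])
    return coef[0] / se

x, C, Y = simulate()
ones = np.ones((len(x), 1))
naive = np.column_stack([x, ones])       # latent covariates ignored
oracle = np.column_stack([x, ones, C])   # latent covariates observed

# Fraction of null hypotheses rejected at roughly the 5% level.
naive_rate = np.mean(np.abs(t_stats(naive, Y)) > 1.96)
oracle_rate = np.mean(np.abs(t_stats(oracle, Y)) > 1.96)
print(f"naive rejection rate:  {naive_rate:.3f}")
print(f"oracle rejection rate: {oracle_rate:.3f}")
```

Under these assumptions, the naive fit rejects far more often than the nominal level, while the oracle fit stays close to it. The oracle fit is the benchmark the abstract refers to when it says the proposed estimators are asymptotically equivalent to ordinary least squares with every covariate observed.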

Keywords:  Batch effect; Cell type heterogeneity; Confounding; High-dimensional factor analysis; Unobserved covariates; Unwanted variation

Year:  2019        PMID: 31754283      PMCID: PMC6845853          DOI: 10.1093/biomet/asz037

Source DB:  PubMed          Journal:  Biometrika        ISSN: 0006-3444            Impact factor:   3.028


  10 in total

1.  ESTIMATION AND INFERENCE IN METABOLOMICS WITH NON-RANDOM MISSING DATA AND LATENT FACTORS.

Authors:  Chris McKennan; Carole Ober; Dan Nicolae
Journal:  Ann Appl Stat       Date:  2020-06-29       Impact factor: 2.083

2.  Epigenetic landscape links upper airway microbiota in infancy with allergic rhinitis at 6 years of age.

Authors:  Andréanne Morin; Chris G McKennan; Casper-Emil T Pedersen; Jakob Stokholm; Bo L Chawes; Ann-Marie Malby Schoos; Katherine A Naughton; Jonathan Thorsen; Martin S Mortensen; Donata Vercelli; Urvish Trivedi; Søren J Sørensen; Hans Bisgaard; Dan L Nicolae; Klaus Bønnelykke; Carole Ober
Journal:  J Allergy Clin Immunol       Date:  2020-07-18       Impact factor: 10.793

3.  Estimating and accounting for unobserved covariates in high-dimensional correlated data.

Authors:  Chris McKennan; Dan Nicolae
Journal:  J Am Stat Assoc       Date:  2020-06-30       Impact factor: 4.369

4.  Expression quantitative trait locus fine mapping of the 17q12-21 asthma locus in African American children: a genetic association and gene expression study.

Authors:  Carole Ober; Chris G McKennan; Kevin M Magnaye; Matthew C Altman; Charles Washington; Catherine Stanhope; Katherine A Naughton; Mario G Rosasco; Leonard B Bacharier; Dean Billheimer; Diane R Gold; Lisa Gress; Tina Hartert; Suzanne Havstad; Gurjit K Khurana Hershey; Brian Hallmark; D Kyle Hogarth; Daniel J Jackson; Christine C Johnson; Meyer Kattan; Robert F Lemanske; Susan V Lynch; Eneida A Mendonca; Rachel L Miller; Edward T Naureckas; George T O'Connor; Christine M Seroogy; Ganesa Wegienka; Steven R White; Robert A Wood; Anne L Wright; Edward M Zoratti; Fernando D Martinez; Dennis Ownby; Dan L Nicolae; Albert M Levin; James E Gern
Journal:  Lancet Respir Med       Date:  2020-05       Impact factor: 30.700

5.  Data-based RNA-seq simulations by binomial thinning.

Authors:  David Gerard
Journal:  BMC Bioinformatics       Date:  2020-05-24       Impact factor: 3.169

6.  Asthma-associated genetic variants induce IL33 differential expression through an enhancer-blocking regulatory region.

Authors:  Ivy Aneas; Donna C Decker; Anne I Sperling; Marcelo A Nóbrega; Chanie L Howard; Débora R Sobreira; Noboru J Sakabe; Kelly M Blaine; Michelle M Stein; Cara L Hrusch; Lindsey E Montefiori; Juan Tena; Kevin M Magnaye; Selene M Clay; James E Gern; Daniel J Jackson; Matthew C Altman; Edward T Naureckas; Douglas K Hogarth; Steven R White; Jose Luis Gomez-Skarmeta; Nathan Schoetler; Carole Ober
Journal:  Nat Commun       Date:  2021-10-21       Impact factor: 17.694

7.  DNA methylation signatures in airway cells from adult children of asthmatic mothers reflect subtypes of severe asthma.

Authors:  Kevin M Magnaye; Selene M Clay; Jessie Nicodemus-Johnson; Katherine A Naughton; Janel Huffman; Matthew C Altman; Daniel J Jackson; James E Gern; Douglas K Hogarth; Edward T Naureckas; Steven R White; Carole Ober
Journal:  Proc Natl Acad Sci U S A       Date:  2022-06-06       Impact factor: 12.779

8.  African-specific alleles modify risk for asthma at the 17q12-q21 locus in African Americans.

Authors:  Charles Washington; Matthew Dapas; Arjun Biddanda; Kevin M Magnaye; Ivy Aneas; Britney A Helling; Brooke Szczesny; Meher Preethi Boorgula; Margaret A Taub; Eimear Kenny; Rasika A Mathias; Kathleen C Barnes; Gurjit K Khurana Hershey; Carolyn M Kercsmar; Jessica D Gereige; Melanie Makhija; Rebecca S Gruchalla; Michelle A Gill; Andrew H Liu; Deepa Rastogi; William Busse; Peter J Gergen; Cynthia M Visness; Diane R Gold; Tina Hartert; Christine C Johnson; Robert F Lemanske; Fernando D Martinez; Rachel L Miller; Dennis Ownby; Christine M Seroogy; Anne L Wright; Edward M Zoratti; Leonard B Bacharier; Meyer Kattan; George T O'Connor; Robert A Wood; Marcelo A Nobrega; Matthew C Altman; Daniel J Jackson; James E Gern; Christopher G McKennan; Carole Ober
Journal:  Genome Med       Date:  2022-09-29       Impact factor: 15.266

9.  Longitudinal data reveal strong genetic and weak non-genetic components of ethnicity-dependent blood DNA methylation levels.

Authors:  Chris McKennan; Katherine Naughton; Catherine Stanhope; Meyer Kattan; George T O'Connor; Megan T Sandel; Cynthia M Visness; Robert A Wood; Leonard B Bacharier; Avraham Beigelman; Stephanie Lovinsky-Desir; Alkis Togias; James E Gern; Dan Nicolae; Carole Ober
Journal:  Epigenetics       Date:  2020-09-30       Impact factor: 4.528

10.  A comparison of methods accounting for batch effects in differential expression analysis of UMI count based single cell RNA sequencing.

Authors:  Wenan Chen; Silu Zhang; Justin Williams; Bensheng Ju; Bridget Shaner; John Easton; Gang Wu; Xiang Chen
Journal:  Comput Struct Biotechnol J       Date:  2020-03-30       Impact factor: 6.155
