Literature DB >> 35615339

Estimating and accounting for unobserved covariates in high-dimensional correlated data.

Chris McKennan1, Dan Nicolae2.   

Abstract

Many high dimensional and high-throughput biological datasets have complex sample correlation structures, which include longitudinal and multiple tissue data, as well as data with multiple treatment conditions or related individuals. These data, as well as nearly all high-throughput 'omic' data, are influenced by technical and biological factors unknown to the researcher, which, if unaccounted for, can severely obfuscate estimation of and inference on the effects of interest. We therefore developed CBCV and CorrConf: provably accurate and computationally efficient methods to choose the number of and estimate latent confounding factors present in high dimensional data with correlated or nonexchangeable residuals. We demonstrate each method's superior performance compared to other state of the art methods by analyzing simulated multi-tissue gene expression data and identifying sex-associated DNA methylation sites in a real, longitudinal twin study.

Entities:  

Keywords:  Batch effects; cell-type heterogeneity; confounding; correlation; multi-tissue; unwanted variation

Year:  2020        PMID: 35615339      PMCID: PMC9126075          DOI: 10.1080/01621459.2020.1769635

Source DB:  PubMed          Journal:  J Am Stat Assoc        ISSN: 0162-1459            Impact factor:   4.369


  30 in total

1.  Remarks on Parallel Analysis.

Authors:  A Buja; N Eyuboglu
Journal:  Multivariate Behav Res       Date:  1992-10-01       Impact factor: 5.923

2.  An improved and explicit surrogate variable analysis procedure by coefficient adjustment.

Authors:  Seunggeun Lee; Wei Sun; Fred A Wright; Fei Zou
Journal:  Biometrika       Date:  2017-04-21       Impact factor: 2.445

3.  CONFOUNDER ADJUSTMENT IN MULTIPLE HYPOTHESIS TESTING.

Authors:  Jingshu Wang; Qingyuan Zhao; Trevor Hastie; Art B Owen
Journal:  Ann Stat       Date:  2017-10-31       Impact factor: 4.028

4.  Nasal DNA methylation is associated with childhood asthma.

Authors:  Xue Zhang; Jocelyn M Biagini Myers; J D Burleson; Ashley Ulm; Kelly S Bryan; Xiaoting Chen; Matthew T Weirauch; Theresa A Baker; Melinda S Butsch Kovacic; Hong Ji
Journal:  Epigenomics       Date:  2018-04-25       Impact factor: 4.778

5.  Reproducibility and concordance of differential DNA methylation and gene expression in cancer.

Authors:  Chen Yao; Hongdong Li; Xiaopei Shen; Zheng He; Lang He; Zheng Guo
Journal:  PLoS One       Date:  2012-01-03       Impact factor: 3.240

6.  Accounting for unobserved covariates with varying degrees of estimability in high-dimensional biological data.

Authors:  Chris McKennan; Dan Nicolae
Journal:  Biometrika       Date:  2019-09-16       Impact factor: 3.028

7.  Reference-free cell mixture adjustments in analysis of DNA methylation data.

Authors:  Eugene Andres Houseman; John Molitor; Carmen J Marsit
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

8.  How data analysis affects power, reproducibility and biological insight of RNA-seq studies in complex datasets.

Authors:  Lucia Peixoto; Davide Risso; Shane G Poplawski; Mathieu E Wimmer; Terence P Speed; Marcelo A Wood; Ted Abel
Journal:  Nucleic Acids Res       Date:  2015-07-21       Impact factor: 16.971

9.  Sex differences in DNA methylation of the cord blood are related to sex-bias psychiatric diseases.

Authors:  Mariana Maschietto; Laura Caroline Bastos; Ana Carolina Tahira; Elen Pereira Bastos; Veronica Luiza Vale Euclydes; Alexandra Brentani; Günther Fink; Angelica de Baumont; Aloísio Felipe-Silva; Rossana Pulcineli Vieira Francisco; Gisele Gouveia; Sandra Josefina Ferraz Ellero Grisi; Ana Maria Ulhoa Escobar; Carlos Alberto Moreira-Filho; Guilherme Vanoni Polanczyk; Euripedes Constantino Miguel; Helena Brentani
Journal:  Sci Rep       Date:  2017-03-17       Impact factor: 4.379

10.  Longitudinal, genome-scale analysis of DNA methylation in twins from birth to 18 months of age reveals rapid epigenetic change in early life and pair-specific effects of discordance.

Authors:  David Martino; Yuk Jin Loke; Lavinia Gordon; Miina Ollikainen; Mark N Cruickshank; Richard Saffery; Jeffrey M Craig
Journal:  Genome Biol       Date:  2013-05-22       Impact factor: 13.583

View more
  1 in total

1.  Accounting for unobserved covariates with varying degrees of estimability in high-dimensional biological data.

Authors:  Chris McKennan; Dan Nicolae
Journal:  Biometrika       Date:  2019-09-16       Impact factor: 3.028

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.