Literature DB >> 27460538

Principal component of explained variance: An efficient and optimal data dimension reduction framework for association studies.

Maxime Turgeon1,2,3, Karim Oualkacha4, Antonio Ciampi1,3, Hanane Miftah4, Golsa Dehghan1, Brent W Zanke5, Andréa L Benedet6, Pedro Rosa-Neto6,7,8,9, Celia Mt Greenwood1,2,3,10,11, Aurélie Labbe1,3,7,9.   

Abstract

The genomics era has led to an increase in the dimensionality of data collected in the investigation of biological questions. In this context, dimension-reduction techniques can be used to summarise high-dimensional signals into low-dimensional ones, to further test for association with one or more covariates of interest. This paper revisits one such approach, previously known as principal component of heritability and renamed here as principal component of explained variance (PCEV). As its name suggests, the PCEV seeks a linear combination of outcomes in an optimal manner, by maximising the proportion of variance explained by one or several covariates of interest. By construction, this method optimises power; however, due to its computational complexity, it has unfortunately received little attention in the past. Here, we propose a general analytical PCEV framework that builds on the assets of the original method, i.e. conceptually simple and free of tuning parameters. Moreover, our framework extends the range of applications of the original procedure by providing a computationally simple strategy for high-dimensional outcomes, along with exact and asymptotic testing procedures that drastically reduce its computational cost. We investigate the merits of the PCEV using an extensive set of simulations. Furthermore, the use of the PCEV approach is illustrated using three examples taken from the fields of epigenetics and brain imaging.

Keywords:  DNA methylation; Dimension reduction; brain imaging; exact test; multivariate analysis; region-based analysis

Mesh:

Year:  2016        PMID: 27460538     DOI: 10.1177/0962280216660128

Source DB:  PubMed          Journal:  Stat Methods Med Res        ISSN: 0962-2802            Impact factor:   3.021


  4 in total

1.  The pleiotropic effect of rs7903146 on type 2 diabetes and ischemic stroke: a family-based study in a Chinese population.

Authors:  Jing Song; Yiqun Wu; Juan Juan; Yaying Cao; Tao Wu; Yonghua Hu
Journal:  J Thromb Thrombolysis       Date:  2019-08       Impact factor: 2.300

2.  CpG-set association assessment of lipid concentration changes and DNA methylation.

Authors:  Kaiqiong Zhao; Lai Jiang; Kathleen Klein; Celia M T Greenwood; Karim Oualkacha
Journal:  BMC Proc       Date:  2018-09-17

3.  Application of novel and existing methods to identify genes with evidence of epigenetic association: results from GAW20.

Authors:  Angga M Fuady; Samantha Lent; Chloé Sarnowski; Nathan L Tintle
Journal:  BMC Genet       Date:  2018-09-17       Impact factor: 2.797

4.  Whole-genome bisulfite sequencing in systemic sclerosis provides novel targets to understand disease pathogenesis.

Authors:  Tianyuan Lu; Kathleen Oros Klein; Inés Colmegna; Maximilien Lora; Celia M T Greenwood; Marie Hudson
Journal:  BMC Med Genomics       Date:  2019-10-24       Impact factor: 3.063

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.