Literature DB >> 34849857

Disentangling genetic feature selection and aggregation in transcriptome-wide association studies.

Chen Cao1, Pathum Kossinna1, Devin Kwok2, Qing Li1, Jingni He1, Liya Su3, Xingyi Guo4, Qingrun Zhang1,2, Quan Long1,2,5,6.   

Abstract

The success of transcriptome-wide association studies (TWAS) has led to substantial research toward improving the predictive accuracy of its core component of genetically regulated expression (GReX). GReX links expression information with genotype and phenotype by playing two roles simultaneously: it acts as both the outcome of the genotype-based predictive models (for predicting expressions) and the linear combination of genotypes (as the predicted expressions) for association tests. From the perspective of machine learning (considering SNPs as features), these are actually two separable steps-feature selection and feature aggregation-which can be independently conducted. In this study, we show that the single approach of GReX limits the adaptability of TWAS methodology and practice. By conducting simulations and real data analysis, we demonstrate that disentangled protocols adapting straightforward approaches for feature selection (e.g., simple marker test) and aggregation (e.g., kernel machines) outperform the standard TWAS protocols that rely on GReX. Our development provides more powerful novel tools for conducting TWAS. More importantly, our characterization of the exact nature of TWAS suggests that, instead of questionably binding two distinct steps into the same statistical form (GReX), methodological research focusing on optimal combinations of feature selection and aggregation approaches will bring higher power to TWAS protocols.
© The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  feature selection; kernel machine; statistical genetics; statistical power; transcriptome-wide association studies

Mesh:

Year:  2022        PMID: 34849857      PMCID: PMC9208638          DOI: 10.1093/genetics/iyab216

Source DB:  PubMed          Journal:  Genetics        ISSN: 0016-6731            Impact factor:   4.402


  48 in total

1.  Powerful SNP-set analysis for case-control genome-wide association studies.

Authors:  Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin
Journal:  Am J Hum Genet       Date:  2010-06-11       Impact factor: 11.025

2.  Prime suspect: the TCF7L2 gene and type 2 diabetes risk.

Authors:  Andrew T Hattersley
Journal:  J Clin Invest       Date:  2007-08       Impact factor: 14.808

3.  TIGAR: An Improved Bayesian Tool for Transcriptomic Data Imputation Enhances Gene Mapping of Complex Traits.

Authors:  Sini Nagpal; Xiaoran Meng; Michael P Epstein; Lam C Tsoi; Matthew Patrick; Greg Gibson; Philip L De Jager; David A Bennett; Aliza P Wingo; Thomas S Wingo; Jingjing Yang
Journal:  Am J Hum Genet       Date:  2019-06-20       Impact factor: 11.025

Review 4.  Opportunities and challenges for transcriptome-wide association studies.

Authors:  Michael Wainberg; Nasa Sinnott-Armstrong; Nicholas Mancuso; Alvaro N Barbeira; David A Knowles; David Golan; Raili Ermel; Arno Ruusalepp; Thomas Quertermous; Ke Hao; Johan L M Björkegren; Hae Kyung Im; Bogdan Pasaniuc; Manuel A Rivas; Anshul Kundaje
Journal:  Nat Genet       Date:  2019-03-29       Impact factor: 38.330

5.  Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors:  Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal:  J Stat Softw       Date:  2010       Impact factor: 6.440

6.  New novel non-MHC genes were identified for cervical cancer with an integrative analysis approach of transcriptome-wide association study.

Authors:  Haimiao Chen; Ting Wang; Shuiping Huang; Ping Zeng
Journal:  J Cancer       Date:  2021-01-01       Impact factor: 4.207

7.  Polygenic modeling with bayesian sparse linear mixed models.

Authors:  Xiang Zhou; Peter Carbonetto; Matthew Stephens
Journal:  PLoS Genet       Date:  2013-02-07       Impact factor: 5.917

8.  Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights.

Authors:  Alexander Gusev; Nicholas Mancuso; Hyejung Won; Maria Kousi; Hilary K Finucane; Yakir Reshef; Lingyun Song; Alexias Safi; Steven McCarroll; Benjamin M Neale; Roel A Ophoff; Michael C O'Donovan; Gregory E Crawford; Daniel H Geschwind; Nicholas Katsanis; Patrick F Sullivan; Bogdan Pasaniuc; Alkes L Price
Journal:  Nat Genet       Date:  2018-04-09       Impact factor: 38.330

9.  PTWAS: investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis.

Authors:  Yuhua Zhang; Corbin Quick; Ketian Yu; Alvaro Barbeira; Francesca Luca; Roger Pique-Regi; Hae Kyung Im; Xiaoquan Wen
Journal:  Genome Biol       Date:  2020-09-11       Impact factor: 13.583

10.  Integrating predicted transcriptome from multiple tissues improves association detection.

Authors:  Alvaro N Barbeira; Milton Pividori; Jiamao Zheng; Heather E Wheeler; Dan L Nicolae; Hae Kyung Im
Journal:  PLoS Genet       Date:  2019-01-22       Impact factor: 5.917

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.