Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Disentangling genetic feature selection and aggregation in transcriptome-wide association studies.

Literature DB >> 34849857

Disentangling genetic feature selection and aggregation in transcriptome-wide association studies.

Chen Cao¹, Pathum Kossinna¹, Devin Kwok², Qing Li¹, Jingni He¹, Liya Su³, Xingyi Guo⁴, Qingrun Zhang^1,2, Quan Long^1,2,5,6.

Abstract

The success of transcriptome-wide association studies (TWAS) has led to substantial research toward improving the predictive accuracy of its core component of genetically regulated expression (GReX). GReX links expression information with genotype and phenotype by playing two roles simultaneously: it acts as both the outcome of the genotype-based predictive models (for predicting expressions) and the linear combination of genotypes (as the predicted expressions) for association tests. From the perspective of machine learning (considering SNPs as features), these are actually two separable steps-feature selection and feature aggregation-which can be independently conducted. In this study, we show that the single approach of GReX limits the adaptability of TWAS methodology and practice. By conducting simulations and real data analysis, we demonstrate that disentangled protocols adapting straightforward approaches for feature selection (e.g., simple marker test) and aggregation (e.g., kernel machines) outperform the standard TWAS protocols that rely on GReX. Our development provides more powerful novel tools for conducting TWAS. More importantly, our characterization of the exact nature of TWAS suggests that, instead of questionably binding two distinct steps into the same statistical form (GReX), methodological research focusing on optimal combinations of feature selection and aggregation approaches will bring higher power to TWAS protocols.

Entities: Chemical

Keywords: feature selection; kernel machine; statistical genetics; statistical power; transcriptome-wide association studies

Mesh：

Year: 2022 PMID： 34849857 PMCID： PMC9208638 DOI： 10.1093/genetics/iyab216

Source DB: PubMed Journal: Genetics ISSN： 0016-6731 Impact factor: 4.402

Keyword Cloud
References

48 in total

1. Powerful SNP-set analysis for case-control genome-wide association studies.

Authors: Michael C Wu; Peter Kraft; Michael P Epstein; Deanne M Taylor; Stephen J Chanock; David J Hunter; Xihong Lin
Journal: Am J Hum Genet Date: 2010-06-11 Impact factor: 11.025

2. Prime suspect: the TCF7L2 gene and type 2 diabetes risk.

Authors: Andrew T Hattersley
Journal: J Clin Invest Date: 2007-08 Impact factor: 14.808

3. TIGAR: An Improved Bayesian Tool for Transcriptomic Data Imputation Enhances Gene Mapping of Complex Traits.

Authors: Sini Nagpal; Xiaoran Meng; Michael P Epstein; Lam C Tsoi; Matthew Patrick; Greg Gibson; Philip L De Jager; David A Bennett; Aliza P Wingo; Thomas S Wingo; Jingjing Yang
Journal: Am J Hum Genet Date: 2019-06-20 Impact factor: 11.025

Review 4. Opportunities and challenges for transcriptome-wide association studies.

Authors: Michael Wainberg; Nasa Sinnott-Armstrong; Nicholas Mancuso; Alvaro N Barbeira; David A Knowles; David Golan; Raili Ermel; Arno Ruusalepp; Thomas Quertermous; Ke Hao; Johan L M Björkegren; Hae Kyung Im; Bogdan Pasaniuc; Manuel A Rivas; Anshul Kundaje
Journal: Nat Genet Date: 2019-03-29 Impact factor: 38.330

5. Regularization Paths for Generalized Linear Models via Coordinate Descent.

Authors: Jerome Friedman; Trevor Hastie; Rob Tibshirani
Journal: J Stat Softw Date: 2010 Impact factor: 6.440

6. New novel non-MHC genes were identified for cervical cancer with an integrative analysis approach of transcriptome-wide association study.

Authors: Haimiao Chen; Ting Wang; Shuiping Huang; Ping Zeng
Journal: J Cancer Date: 2021-01-01 Impact factor: 4.207

7. Polygenic modeling with bayesian sparse linear mixed models.

Authors: Xiang Zhou; Peter Carbonetto; Matthew Stephens
Journal: PLoS Genet Date: 2013-02-07 Impact factor: 5.917

8. Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights.

Authors: Alexander Gusev; Nicholas Mancuso; Hyejung Won; Maria Kousi; Hilary K Finucane; Yakir Reshef; Lingyun Song; Alexias Safi; Steven McCarroll; Benjamin M Neale; Roel A Ophoff; Michael C O'Donovan; Gregory E Crawford; Daniel H Geschwind; Nicholas Katsanis; Patrick F Sullivan; Bogdan Pasaniuc; Alkes L Price
Journal: Nat Genet Date: 2018-04-09 Impact factor: 38.330

9. PTWAS: investigating tissue-relevant causal molecular mechanisms of complex traits using probabilistic TWAS analysis.

Authors: Yuhua Zhang; Corbin Quick; Ketian Yu; Alvaro Barbeira; Francesca Luca; Roger Pique-Regi; Hae Kyung Im; Xiaoquan Wen
Journal: Genome Biol Date: 2020-09-11 Impact factor: 13.583

10. Integrating predicted transcriptome from multiple tissues improves association detection.

Authors: Alvaro N Barbeira; Milton Pividori; Jiamao Zheng; Heather E Wheeler; Dan L Nicolae; Hae Kyung Im
Journal: PLoS Genet Date: 2019-01-22 Impact factor: 5.917