Literature DB >> 36063453

TVAR: assessing tissue-specific functional effects of non-coding variants with deep learning.

Hai Yang1,2, Rui Chen2,3, Quan Wang2,3, Qiang Wei2,3, Ying Ji2,3, Xue Zhong3,4, Bingshan Li2,3.   

Abstract

MOTIVATION: Analysis of whole-genome sequencing (WGS) for genetics is still a challenge due to the lack of accurate functional annotation of non-coding variants, especially the rare ones. As eQTLs have been extensively implicated in the genetics of human diseases, we hypothesize that rare non-coding variants discovered in WGS play a regulatory role in predisposing disease risk.
RESULTS: With thousands of tissue- and cell-type-specific epigenomic features, we propose TVAR. This multi-label learning-based deep neural network predicts the functionality of non-coding variants in the genome based on eQTLs across 49 human tissues in the GTEx project. TVAR learns the relationships between high-dimensional epigenomics and eQTLs across tissues, taking the correlation among tissues into account to understand shared and tissue-specific eQTL effects. As a result, TVAR outputs tissue-specific annotations, with an average AUROC of 0.77 across these tissues. We evaluate TVAR's performance on four complex diseases (coronary artery disease, breast cancer, Type 2 diabetes and Schizophrenia), using TVAR's tissue-specific annotations, and observe its superior performance in predicting functional variants for both common and rare variants, compared with five existing state-of-the-art tools. We further evaluate TVAR's G-score, a scoring scheme across all tissues, on ClinVar, fine-mapped GWAS loci, Massive Parallel Reporter Assay (MPRA) validated variants and observe the consistently better performance of TVAR compared with other competing tools.
AVAILABILITY AND IMPLEMENTATION: The TVAR source code and its scores on the ClinVar catalog, fine mapped GWAS Loci, high confidence eQTLs from GTEx dataset, and MPRA validated functional variants are available at https://github.com/haiyang1986/TVAR. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2022        PMID: 36063453      PMCID: PMC9563698          DOI: 10.1093/bioinformatics/btac608

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


  42 in total

1.  Presenting ENCODE.

Authors:  Magdalena Skipper; Ritu Dhand; Philip Campbell
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

2.  Super-enhancers in the control of cell identity and disease.

Authors:  Denes Hnisz; Brian J Abraham; Tong Ihn Lee; Ashley Lau; Violaine Saint-André; Alla A Sigova; Heather A Hoke; Richard A Young
Journal:  Cell       Date:  2013-10-10       Impact factor: 41.582

3.  An evolutionary framework for measuring epigenomic information and estimating cell-type-specific fitness consequences.

Authors:  Brad Gulko; Adam Siepel
Journal:  Nat Genet       Date:  2018-12-17       Impact factor: 38.330

4.  Identifying a high fraction of the human genome to be under selective constraint using GERP++.

Authors:  Eugene V Davydov; David L Goode; Marina Sirota; Gregory M Cooper; Arend Sidow; Serafim Batzoglou
Journal:  PLoS Comput Biol       Date:  2010-12-02       Impact factor: 4.475

5.  Integrative analysis of haplotype-resolved epigenomes across human tissues.

Authors:  Danny Leung; Inkyung Jung; Nisha Rajagopal; Anthony Schmitt; Siddarth Selvaraj; Ah Young Lee; Chia-An Yen; Shin Lin; Yiing Lin; Yunjiang Qiu; Wei Xie; Feng Yue; Manoj Hariharan; Pradipta Ray; Samantha Kuan; Lee Edsall; Hongbo Yang; Neil C Chi; Michael Q Zhang; Joseph R Ecker; Bing Ren
Journal:  Nature       Date:  2015-02-19       Impact factor: 49.962

6.  A spectral approach integrating functional genomic annotations for coding and noncoding variants.

Authors:  Iuliana Ionita-Laza; Kenneth McCallum; Bin Xu; Joseph D Buxbaum
Journal:  Nat Genet       Date:  2016-01-04       Impact factor: 38.330

7.  Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes.

Authors:  Angli Xue; Yang Wu; Zhihong Zhu; Futao Zhang; Kathryn E Kemper; Zhili Zheng; Loic Yengo; Luke R Lloyd-Jones; Julia Sidorenko; Yeda Wu; Allan F McRae; Peter M Visscher; Jian Zeng; Jian Yang
Journal:  Nat Commun       Date:  2018-07-27       Impact factor: 14.919

8.  NCBoost classifies pathogenic non-coding variants in Mendelian diseases through supervised learning on purifying selection signals in humans.

Authors:  Barthélémy Caron; Yufei Luo; Antonio Rausell
Journal:  Genome Biol       Date:  2019-02-11       Impact factor: 13.583

9.  ClinVar: public archive of interpretations of clinically relevant variants.

Authors:  Melissa J Landrum; Jennifer M Lee; Mark Benson; Garth Brown; Chen Chao; Shanmuga Chitipiralla; Baoshan Gu; Jennifer Hart; Douglas Hoffman; Jeffrey Hoover; Wonhee Jang; Kenneth Katz; Michael Ovetsky; George Riley; Amanjeev Sethi; Ray Tully; Ricardo Villamarin-Salomon; Wendy Rubinstein; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2015-11-17       Impact factor: 16.971

10.  Deep learning sequence-based ab initio prediction of variant effects on expression and disease risk.

Authors:  Jian Zhou; Chandra L Theesfeld; Kevin Yao; Kathleen M Chen; Aaron K Wong; Olga G Troyanskaya
Journal:  Nat Genet       Date:  2018-07-16       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.