Literature DB >> 33300042

Predicting regulatory variants using a dense epigenomic mapped CNN model elucidated the molecular basis of trait-tissue associations.

Guangsheng Pei1, Ruifeng Hu1, Yulin Dai1, Astrid Marilyn Manuel1, Zhongming Zhao1,2,3,4, Peilin Jia1.   

Abstract

Assessing the causal tissues of human complex diseases is important for the prioritization of trait-associated genetic variants. Yet, the biological underpinnings of trait-associated variants are extremely difficult to infer due to statistical noise in genome-wide association studies (GWAS), and because >90% of genetic variants from GWAS are located in non-coding regions. Here, we collected the largest human epigenomic map from ENCODE and Roadmap consortia and implemented a deep-learning-based convolutional neural network (CNN) model to predict the regulatory roles of genetic variants across a comprehensive list of epigenomic modifications. Our model, called DeepFun, was built on DNA accessibility maps, histone modification marks, and transcription factors. DeepFun can systematically assess the impact of non-coding variants in the most functional elements with tissue or cell-type specificity, even for rare variants or de novo mutations. By applying this model, we prioritized trait-associated loci for 51 publicly-available GWAS studies. We demonstrated that CNN-based analyses on dense and high-resolution epigenomic annotations can refine important GWAS associations in order to identify regulatory loci from background signals, which yield novel insights for better understanding the molecular basis of human complex disease. We anticipate our approaches will become routine in GWAS downstream analysis and non-coding variant evaluation.
© The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33300042      PMCID: PMC7797043          DOI: 10.1093/nar/gkaa1137

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  57 in total

1.  Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning.

Authors:  Babak Alipanahi; Andrew Delong; Matthew T Weirauch; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2015-07-27       Impact factor: 54.908

2.  Gender differences in muscle sympathetic nerve activity: effect of body fat distribution.

Authors:  P P Jones; S Snitker; J S Skinner; E Ravussin
Journal:  Am J Physiol       Date:  1996-02

Review 3.  Deep learning.

Authors:  Yann LeCun; Yoshua Bengio; Geoffrey Hinton
Journal:  Nature       Date:  2015-05-28       Impact factor: 49.962

4.  Androgen, estrogen and progesterone receptor expression in the human uterus during the menstrual cycle.

Authors:  H J Mertens; M J Heineman; P H Theunissen; F H de Jong; J L Evers
Journal:  Eur J Obstet Gynecol Reprod Biol       Date:  2001-09       Impact factor: 2.435

5.  Predicting effects of noncoding variants with deep learning-based sequence model.

Authors:  Jian Zhou; Olga G Troyanskaya
Journal:  Nat Methods       Date:  2015-08-24       Impact factor: 28.547

6.  Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder.

Authors:  Ditte Demontis; Raymond K Walters; Joanna Martin; Manuel Mattheisen; Thomas D Als; Esben Agerbo; Gísli Baldursson; Rich Belliveau; Jonas Bybjerg-Grauholm; Marie Bækvad-Hansen; Felecia Cerrato; Kimberly Chambert; Claire Churchhouse; Ashley Dumont; Nicholas Eriksson; Michael Gandal; Jacqueline I Goldstein; Katrina L Grasby; Jakob Grove; Olafur O Gudmundsson; Christine S Hansen; Mads Engel Hauberg; Mads V Hollegaard; Daniel P Howrigan; Hailiang Huang; Julian B Maller; Alicia R Martin; Nicholas G Martin; Jennifer Moran; Jonatan Pallesen; Duncan S Palmer; Carsten Bøcker Pedersen; Marianne Giørtz Pedersen; Timothy Poterba; Jesper Buchhave Poulsen; Stephan Ripke; Elise B Robinson; F Kyle Satterstrom; Hreinn Stefansson; Christine Stevens; Patrick Turley; G Bragi Walters; Hyejung Won; Margaret J Wright; Ole A Andreassen; Philip Asherson; Christie L Burton; Dorret I Boomsma; Bru Cormand; Søren Dalsgaard; Barbara Franke; Joel Gelernter; Daniel Geschwind; Hakon Hakonarson; Jan Haavik; Henry R Kranzler; Jonna Kuntsi; Kate Langley; Klaus-Peter Lesch; Christel Middeldorp; Andreas Reif; Luis Augusto Rohde; Panos Roussos; Russell Schachar; Pamela Sklar; Edmund J S Sonuga-Barke; Patrick F Sullivan; Anita Thapar; Joyce Y Tung; Irwin D Waldman; Sarah E Medland; Kari Stefansson; Merete Nordentoft; David M Hougaard; Thomas Werge; Ole Mors; Preben Bo Mortensen; Mark J Daly; Stephen V Faraone; Anders D Børglum; Benjamin M Neale
Journal:  Nat Genet       Date:  2018-11-26       Impact factor: 38.330

7.  Association analyses identify 38 susceptibility loci for inflammatory bowel disease and highlight shared genetic risk across populations.

Authors:  Jimmy Z Liu; Suzanne van Sommeren; Hailiang Huang; Siew C Ng; Rudi Alberts; Atsushi Takahashi; Stephan Ripke; James C Lee; Luke Jostins; Tejas Shah; Shifteh Abedian; Jae Hee Cheon; Judy Cho; Naser E Dayani; Lude Franke; Yuta Fuyuno; Ailsa Hart; Ramesh C Juyal; Garima Juyal; Won Ho Kim; Andrew P Morris; Hossein Poustchi; William G Newman; Vandana Midha; Timothy R Orchard; Homayon Vahedi; Ajit Sood; Joseph Y Sung; Reza Malekzadeh; Harm-Jan Westra; Keiko Yamazaki; Suk-Kyun Yang; Jeffrey C Barrett; Behrooz Z Alizadeh; Miles Parkes; Thelma Bk; Mark J Daly; Michiaki Kubo; Carl A Anderson; Rinse K Weersma
Journal:  Nat Genet       Date:  2015-07-20       Impact factor: 41.307

8.  TSEA-DB: a trait-tissue association map for human complex traits and diseases.

Authors:  Peilin Jia; Yulin Dai; Ruifeng Hu; Guangsheng Pei; Astrid Marilyn Manuel; Zhongming Zhao
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

9.  Combining genomewide association study and lung eQTL analysis provides evidence for novel genes associated with asthma.

Authors:  M A Nieuwenhuis; M Siedlinski; M van den Berge; R Granell; X Li; M Niens; P van der Vlies; J Altmüller; P Nürnberg; M Kerkhof; O C van Schayck; R A Riemersma; T van der Molen; J G de Monchy; Y Bossé; A Sandford; C A Bruijnzeel-Koomen; R Gerth van Wijk; N H Ten Hacken; W Timens; H M Boezen; J Henderson; M Kabesch; J M Vonk; D S Postma; G H Koppelman
Journal:  Allergy       Date:  2016-08-22       Impact factor: 13.146

10.  Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics.

Authors:  David Lamparter; Daniel Marbach; Rico Rueedi; Zoltán Kutalik; Sven Bergmann
Journal:  PLoS Comput Biol       Date:  2016-01-25       Impact factor: 4.475

View more
  3 in total

1.  TVAR: assessing tissue-specific functional effects of non-coding variants with deep learning.

Authors:  Hai Yang; Rui Chen; Quan Wang; Qiang Wei; Ying Ji; Xue Zhong; Bingshan Li
Journal:  Bioinformatics       Date:  2022-10-14       Impact factor: 6.931

2.  Designing optimal convolutional neural network architecture using differential evolution algorithm.

Authors:  Arjun Ghosh; Nanda Dulal Jana; Saurav Mallik; Zhongming Zhao
Journal:  Patterns (N Y)       Date:  2022-08-24

3.  DeepFun: a deep learning sequence-based model to decipher non-coding variant effect in a tissue- and cell type-specific manner.

Authors:  Guangsheng Pei; Ruifeng Hu; Peilin Jia; Zhongming Zhao
Journal:  Nucleic Acids Res       Date:  2021-07-02       Impact factor: 16.971

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.