Literature DB >> 33686085

Prioritizing non-coding regions based on human genomic constraint and sequence context with deep learning.

Dimitrios Vitsios1, Ryan S Dhindsa2, Lawrence Middleton2, Ayal B Gussow3, Slavé Petrovski4.   

Abstract

Elucidating functionality in non-coding regions is a key challenge in human genomics. It has been shown that intolerance to variation of coding and proximal non-coding sequence is a strong predictor of human disease relevance. Here, we integrate intolerance to variation, functional genomic annotations and primary genomic sequence to build JARVIS: a comprehensive deep learning model to prioritize non-coding regions, outperforming other human lineage-specific scores. Despite being agnostic to evolutionary conservation, JARVIS performs comparably or outperforms conservation-based scores in classifying pathogenic single-nucleotide and structural variants. In constructing JARVIS, we introduce the genome-wide residual variation intolerance score (gwRVIS), applying a sliding-window approach to whole genome sequencing data from 62,784 individuals. gwRVIS distinguishes Mendelian disease genes from more tolerant CCDS regions and highlights ultra-conserved non-coding elements as the most intolerant regions in the human genome. Both JARVIS and gwRVIS capture previously inaccessible human-lineage constraint information and will enhance our understanding of the non-coding genome.

Entities:  

Year:  2021        PMID: 33686085     DOI: 10.1038/s41467-021-21790-4

Source DB:  PubMed          Journal:  Nat Commun        ISSN: 2041-1723            Impact factor:   14.919


  37 in total

1.  Initial sequencing and comparative analysis of the mouse genome.

Authors:  Robert H Waterston; Kerstin Lindblad-Toh; Ewan Birney; Jane Rogers; Josep F Abril; Pankaj Agarwal; Richa Agarwala; Rachel Ainscough; Marina Alexandersson; Peter An; Stylianos E Antonarakis; John Attwood; Robert Baertsch; Jonathon Bailey; Karen Barlow; Stephan Beck; Eric Berry; Bruce Birren; Toby Bloom; Peer Bork; Marc Botcherby; Nicolas Bray; Michael R Brent; Daniel G Brown; Stephen D Brown; Carol Bult; John Burton; Jonathan Butler; Robert D Campbell; Piero Carninci; Simon Cawley; Francesca Chiaromonte; Asif T Chinwalla; Deanna M Church; Michele Clamp; Christopher Clee; Francis S Collins; Lisa L Cook; Richard R Copley; Alan Coulson; Olivier Couronne; James Cuff; Val Curwen; Tim Cutts; Mark Daly; Robert David; Joy Davies; Kimberly D Delehaunty; Justin Deri; Emmanouil T Dermitzakis; Colin Dewey; Nicholas J Dickens; Mark Diekhans; Sheila Dodge; Inna Dubchak; Diane M Dunn; Sean R Eddy; Laura Elnitski; Richard D Emes; Pallavi Eswara; Eduardo Eyras; Adam Felsenfeld; Ginger A Fewell; Paul Flicek; Karen Foley; Wayne N Frankel; Lucinda A Fulton; Robert S Fulton; Terrence S Furey; Diane Gage; Richard A Gibbs; Gustavo Glusman; Sante Gnerre; Nick Goldman; Leo Goodstadt; Darren Grafham; Tina A Graves; Eric D Green; Simon Gregory; Roderic Guigó; Mark Guyer; Ross C Hardison; David Haussler; Yoshihide Hayashizaki; LaDeana W Hillier; Angela Hinrichs; Wratko Hlavina; Timothy Holzer; Fan Hsu; Axin Hua; Tim Hubbard; Adrienne Hunt; Ian Jackson; David B Jaffe; L Steven Johnson; Matthew Jones; Thomas A Jones; Ann Joy; Michael Kamal; Elinor K Karlsson; Donna Karolchik; Arkadiusz Kasprzyk; Jun Kawai; Evan Keibler; Cristyn Kells; W James Kent; Andrew Kirby; Diana L Kolbe; Ian Korf; Raju S Kucherlapati; Edward J Kulbokas; David Kulp; Tom Landers; J P Leger; Steven Leonard; Ivica Letunic; Rosie Levine; Jia Li; Ming Li; Christine Lloyd; Susan Lucas; Bin Ma; Donna R Maglott; Elaine R Mardis; Lucy Matthews; Evan Mauceli; John H Mayer; Megan McCarthy; W Richard McCombie; Stuart McLaren; Kirsten McLay; John D McPherson; Jim Meldrim; Beverley Meredith; Jill P Mesirov; Webb Miller; Tracie L Miner; Emmanuel Mongin; Kate T Montgomery; Michael Morgan; Richard Mott; James C Mullikin; Donna M Muzny; William E Nash; Joanne O Nelson; Michael N Nhan; Robert Nicol; Zemin Ning; Chad Nusbaum; Michael J O'Connor; Yasushi Okazaki; Karen Oliver; Emma Overton-Larty; Lior Pachter; Genís Parra; Kymberlie H Pepin; Jane Peterson; Pavel Pevzner; Robert Plumb; Craig S Pohl; Alex Poliakov; Tracy C Ponce; Chris P Ponting; Simon Potter; Michael Quail; Alexandre Reymond; Bruce A Roe; Krishna M Roskin; Edward M Rubin; Alistair G Rust; Ralph Santos; Victor Sapojnikov; Brian Schultz; Jörg Schultz; Matthias S Schwartz; Scott Schwartz; Carol Scott; Steven Seaman; Steve Searle; Ted Sharpe; Andrew Sheridan; Ratna Shownkeen; Sarah Sims; Jonathan B Singer; Guy Slater; Arian Smit; Douglas R Smith; Brian Spencer; Arne Stabenau; Nicole Stange-Thomann; Charles Sugnet; Mikita Suyama; Glenn Tesler; Johanna Thompson; David Torrents; Evanne Trevaskis; John Tromp; Catherine Ucla; Abel Ureta-Vidal; Jade P Vinson; Andrew C Von Niederhausern; Claire M Wade; Melanie Wall; Ryan J Weber; Robert B Weiss; Michael C Wendl; Anthony P West; Kris Wetterstrand; Raymond Wheeler; Simon Whelan; Jamey Wierzbowski; David Willey; Sophie Williams; Richard K Wilson; Eitan Winter; Kim C Worley; Dudley Wyman; Shan Yang; Shiaw-Pyng Yang; Evgeny M Zdobnov; Michael C Zody; Eric S Lander
Journal:  Nature       Date:  2002-12-05       Impact factor: 49.962

2.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

3.  [Somatic polysaccharide-containing antigens of the plague bacterium].

Authors:  E E Bakhrakh; V I Veĭnblat
Journal:  Zh Mikrobiol Epidemiol Immunobiol       Date:  1972-03

4.  The human noncoding genome defined by genetic diversity.

Authors:  Julia di Iulio; Istvan Bartha; Emily H M Wong; Hung-Chun Yu; Victor Lavrenko; Dongchan Yang; Inkyung Jung; Michael A Hicks; Naisha Shah; Ewen F Kirkness; Martin M Fabani; William H Biggs; Bing Ren; J Craig Venter; Amalio Telenti
Journal:  Nat Genet       Date:  2018-02-26       Impact factor: 38.330

5.  Human genome ultraconserved elements are ultraselected.

Authors:  Sol Katzman; Andrew D Kern; Gill Bejerano; Ginger Fewell; Lucinda Fulton; Richard K Wilson; Sofie R Salama; David Haussler
Journal:  Science       Date:  2007-08-17       Impact factor: 47.728

6.  Predicting Splicing from Primary Sequence with Deep Learning.

Authors:  Kishore Jaganathan; Sofia Kyriazopoulou Panagiotopoulou; Jeremy F McRae; Siavash Fazel Darbandi; David Knowles; Yang I Li; Jack A Kosmicki; Juan Arbelaez; Wenwu Cui; Grace B Schwartz; Eric D Chow; Efstathios Kanterakis; Hong Gao; Amirali Kia; Serafim Batzoglou; Stephan J Sanders; Kyle Kai-How Farh
Journal:  Cell       Date:  2019-01-17       Impact factor: 41.582

7.  The intolerance to functional genetic variation of protein domains predicts the localization of pathogenic mutations within genes.

Authors:  Ayal B Gussow; Slavé Petrovski; Quanli Wang; Andrew S Allen; David B Goldstein
Journal:  Genome Biol       Date:  2016-01-18       Impact factor: 13.583

8.  Complexity and conservation of regulatory landscapes underlie evolutionary resilience of mammalian gene expression.

Authors:  Camille Berthelot; Diego Villar; Julie E Horvath; Duncan T Odom; Paul Flicek
Journal:  Nat Ecol Evol       Date:  2017-11-27       Impact factor: 15.460

Review 9.  A genomic perspective on HLA evolution.

Authors:  Diogo Meyer; Vitor R C Aguiar; Bárbara D Bitarello; Débora Y C Brandt; Kelly Nunes
Journal:  Immunogenetics       Date:  2017-07-07       Impact factor: 2.846

10.  UCNEbase--a database of ultraconserved non-coding elements and genomic regulatory blocks.

Authors:  Slavica Dimitrieva; Philipp Bucher
Journal:  Nucleic Acids Res       Date:  2012-11-27       Impact factor: 16.971

View more
  6 in total

1.  Predicting Nanoparticle Delivery to Tumors Using Machine Learning and Artificial Intelligence Approaches.

Authors:  Zhoumeng Lin; Wei-Chun Chou; Yi-Hsien Cheng; Chunla He; Nancy A Monteiro-Riviere; Jim E Riviere
Journal:  Int J Nanomedicine       Date:  2022-03-24

2.  An Ensemble Deep Learning Model with a Gene Attention Mechanism for Estimating the Prognosis of Low-Grade Glioma.

Authors:  Minhyeok Lee
Journal:  Biology (Basel)       Date:  2022-04-12

3.  Evaluation of chromatin mesoscale organization.

Authors:  Dana Lorber; Talila Volk
Journal:  APL Bioeng       Date:  2022-01-12

4.  Cancer-driving mutations are enriched in genic regions intolerant to germline variation.

Authors:  Dimitrios Vitsios; Ryan S Dhindsa; Dorota Matelska; Jonathan Mitchell; Xuequing Zou; Joshua Armenia; Fengyuan Hu; Quanli Wang; Ben Sidders; Andrew R Harper; Slavé Petrovski
Journal:  Sci Adv       Date:  2022-08-26       Impact factor: 14.957

5.  Testing for association with rare variants in the coding and non-coding genome: RAVA-FIRST, a new approach based on CADD deleteriousness score.

Authors:  Ozvan Bocher; Thomas E Ludwig; Marie-Sophie Oglobinsky; Gaëlle Marenne; Jean-François Deleuze; Suryakant Suryakant; Jacob Odeberg; Pierre-Emmanuel Morange; David-Alexandre Trégouët; Hervé Perdry; Emmanuelle Génin
Journal:  PLoS Genet       Date:  2022-09-16       Impact factor: 6.020

6.  Whole genome sequence analysis of blood lipid levels in >66,000 individuals.

Authors:  Margaret Sunitha Selvaraj; Xihao Li; Zilin Li; Akhil Pampana; David Y Zhang; Joseph Park; Stella Aslibekyan; Joshua C Bis; Jennifer A Brody; Brian E Cade; Lee-Ming Chuang; Ren-Hua Chung; Joanne E Curran; Lisa de Las Fuentes; Paul S de Vries; Ravindranath Duggirala; Barry I Freedman; Mariaelisa Graff; Xiuqing Guo; Nancy Heard-Costa; Bertha Hidalgo; Chii-Min Hwu; Marguerite R Irvin; Tanika N Kelly; Brian G Kral; Leslie Lange; Xiaohui Li; Martin Lisa; Steven A Lubitz; Ani W Manichaikul; Preuss Michael; May E Montasser; Alanna C Morrison; Take Naseri; Jeffrey R O'Connell; Nicholette D Palmer; Patricia A Peyser; Muagututia S Reupena; Jennifer A Smith; Xiao Sun; Kent D Taylor; Russell P Tracy; Michael Y Tsai; Zhe Wang; Yuxuan Wang; Wei Bao; John T Wilkins; Lisa R Yanek; Wei Zhao; Donna K Arnett; John Blangero; Eric Boerwinkle; Donald W Bowden; Yii-Der Ida Chen; Adolfo Correa; L Adrienne Cupples; Susan K Dutcher; Patrick T Ellinor; Myriam Fornage; Stacey Gabriel; Soren Germer; Richard Gibbs; Jiang He; Robert C Kaplan; Sharon L R Kardia; Ryan Kim; Charles Kooperberg; Ruth J F Loos; Karine A Viaud-Martinez; Rasika A Mathias; Stephen T McGarvey; Braxton D Mitchell; Deborah Nickerson; Kari E North; Bruce M Psaty; Susan Redline; Alexander P Reiner; Ramachandran S Vasan; Stephen S Rich; Cristen Willer; Jerome I Rotter; Daniel J Rader; Xihong Lin; Gina M Peloso; Pradeep Natarajan
Journal:  Nat Commun       Date:  2022-10-11       Impact factor: 17.694

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.