Literature DB >> 34707284

Disease variant prediction with deep generative models of evolutionary data.

Jonathan Frazer1, Pascal Notin2, Mafalda Dias1, Aidan Gomez2, Joseph K Min1, Kelly Brock1, Yarin Gal3, Debora S Marks4,5.   

Abstract

Quantifying the pathogenicity of protein variants in human disease-related genes would have a marked effect on clinical decisions, yet the overwhelming majority (over 98%) of these variants still have unknown consequences1-3. In principle, computational methods could support the large-scale interpretation of genetic variants. However, state-of-the-art methods4-10 have relied on training machine learning models on known disease labels. As these labels are sparse, biased and of variable quality, the resulting models have been considered insufficiently reliable11. Here we propose an approach that leverages deep generative models to predict variant pathogenicity without relying on labels. By modelling the distribution of sequence variation across organisms, we implicitly capture constraints on the protein sequences that maintain fitness. Our model EVE (evolutionary model of variant effect) not only outperforms computational approaches that rely on labelled data but also performs on par with, if not better than, predictions from high-throughput experiments, which are increasingly used as evidence for variant classification12-16. We predict the pathogenicity of more than 36 million variants across 3,219 disease genes and provide evidence for the classification of more than 256,000 variants of unknown significance. Our work suggests that models of evolutionary information can provide valuable independent evidence for variant interpretation that will be widely useful in research and clinical settings.
© 2021. The Author(s), under exclusive licence to Springer Nature Limited.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 34707284     DOI: 10.1038/s41586-021-04043-8

Source DB:  PubMed          Journal:  Nature        ISSN: 0028-0836            Impact factor:   49.962


  40 in total

1.  M-CAP eliminates a majority of variants of uncertain significance in clinical exomes at high sensitivity.

Authors:  Karthik A Jagadeesh; Aaron M Wenger; Mark J Berger; Harendra Guturu; Peter D Stenson; David N Cooper; Jonathan A Bernstein; Gill Bejerano
Journal:  Nat Genet       Date:  2016-10-24       Impact factor: 38.330

2.  ClinVar at five years: Delivering on the promise.

Authors:  Melissa J Landrum; Brandi L Kattman
Journal:  Hum Mutat       Date:  2018-11       Impact factor: 4.878

3.  REVEL: An Ensemble Method for Predicting the Pathogenicity of Rare Missense Variants.

Authors:  Nilah M Ioannidis; Joseph H Rothstein; Vikas Pejaver; Sumit Middha; Shannon K McDonnell; Saurabh Baheti; Anthony Musolf; Qing Li; Emily Holzinger; Danielle Karyadi; Lisa A Cannon-Albright; Craig C Teerlink; Janet L Stanford; William B Isaacs; Jianfeng Xu; Kathleen A Cooney; Ethan M Lange; Johanna Schleutker; John D Carpten; Isaac J Powell; Olivier Cussenot; Geraldine Cancel-Tassin; Graham G Giles; Robert J MacInnis; Christiane Maier; Chih-Lin Hsieh; Fredrik Wiklund; William J Catalona; William D Foulkes; Diptasri Mandal; Rosalind A Eeles; Zsofia Kote-Jarai; Carlos D Bustamante; Daniel J Schaid; Trevor Hastie; Elaine A Ostrander; Joan E Bailey-Wilson; Predrag Radivojac; Stephen N Thibodeau; Alice S Whittemore; Weiva Sieh
Journal:  Am J Hum Genet       Date:  2016-09-22       Impact factor: 11.025

4.  A method and server for predicting damaging missense mutations.

Authors:  Ivan A Adzhubei; Steffen Schmidt; Leonid Peshkin; Vasily E Ramensky; Anna Gerasimova; Peer Bork; Alexey S Kondrashov; Shamil R Sunyaev
Journal:  Nat Methods       Date:  2010-04       Impact factor: 28.547

5.  PERCH: A Unified Framework for Disease Gene Prioritization.

Authors:  Bing-Jian Feng
Journal:  Hum Mutat       Date:  2017-01-28       Impact factor: 4.878

6.  A spectral approach integrating functional genomic annotations for coding and noncoding variants.

Authors:  Iuliana Ionita-Laza; Kenneth McCallum; Bin Xu; Joseph D Buxbaum
Journal:  Nat Genet       Date:  2016-01-04       Impact factor: 38.330

7.  DEOGEN2: prediction and interactive visualization of single amino acid variant deleteriousness in human proteins.

Authors:  Daniele Raimondi; Ibrahim Tanyalcin; Julien Ferté; Andrea Gazzo; Gabriele Orlando; Tom Lenaerts; Marianne Rooman; Wim Vranken
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

8.  CADD: predicting the deleteriousness of variants throughout the human genome.

Authors:  Philipp Rentzsch; Daniela Witten; Gregory M Cooper; Jay Shendure; Martin Kircher
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

9.  The mutational constraint spectrum quantified from variation in 141,456 humans.

Authors:  Konrad J Karczewski; Laurent C Francioli; Grace Tiao; Beryl B Cummings; Jessica Alföldi; Qingbo Wang; Ryan L Collins; Kristen M Laricchia; Andrea Ganna; Daniel P Birnbaum; Laura D Gauthier; Harrison Brand; Matthew Solomonson; Nicholas A Watts; Daniel Rhodes; Moriel Singer-Berk; Eleina M England; Eleanor G Seaby; Jack A Kosmicki; Raymond K Walters; Katherine Tashman; Yossi Farjoun; Eric Banks; Timothy Poterba; Arcturus Wang; Cotton Seed; Nicola Whiffin; Jessica X Chong; Kaitlin E Samocha; Emma Pierce-Hoffman; Zachary Zappala; Anne H O'Donnell-Luria; Eric Vallabh Minikel; Ben Weisburd; Monkol Lek; James S Ware; Christopher Vittal; Irina M Armean; Louis Bergelson; Kristian Cibulskis; Kristen M Connolly; Miguel Covarrubias; Stacey Donnelly; Steven Ferriera; Stacey Gabriel; Jeff Gentry; Namrata Gupta; Thibault Jeandet; Diane Kaplan; Christopher Llanwarne; Ruchi Munshi; Sam Novod; Nikelle Petrillo; David Roazen; Valentin Ruano-Rubio; Andrea Saltzman; Molly Schleicher; Jose Soto; Kathleen Tibbetts; Charlotte Tolonen; Gordon Wade; Michael E Talkowski; Benjamin M Neale; Mark J Daly; Daniel G MacArthur
Journal:  Nature       Date:  2020-05-27       Impact factor: 69.504

10.  Exome sequencing and characterization of 49,960 individuals in the UK Biobank.

Authors:  Cristopher V Van Hout; Ioanna Tachmazidou; Joshua D Backman; Joshua D Hoffman; Daren Liu; Ashutosh K Pandey; Claudia Gonzaga-Jauregui; Shareef Khalid; Bin Ye; Nilanjana Banerjee; Alexander H Li; Colm O'Dushlaine; Anthony Marcketta; Jeffrey Staples; Claudia Schurmann; Alicia Hawes; Evan Maxwell; Leland Barnard; Alexander Lopez; John Penn; Lukas Habegger; Andrew L Blumenfeld; Xiaodong Bai; Sean O'Keeffe; Ashish Yadav; Kavita Praveen; Marcus Jones; William J Salerno; Wendy K Chung; Ida Surakka; Cristen J Willer; Kristian Hveem; Joseph B Leader; David J Carey; David H Ledbetter; Lon Cardon; George D Yancopoulos; Aris Economides; Giovanni Coppola; Alan R Shuldiner; Suganthi Balasubramanian; Michael Cantor; Matthew R Nelson; John Whittaker; Jeffrey G Reid; Jonathan Marchini; John D Overton; Robert A Scott; Gonçalo R Abecasis; Laura Yerges-Armstrong; Aris Baras
Journal:  Nature       Date:  2020-10-21       Impact factor: 69.504

View more
  28 in total

1.  Machine-learning of complex evolutionary signals improves classification of SNVs.

Authors:  Sapir Labes; Doron Stupp; Naama Wagner; Idit Bloch; Michal Lotem; Ephrat L Lahad; Paz Polak; Tal Pupko; Yuval Tabach
Journal:  NAR Genom Bioinform       Date:  2022-04-07

2.  Democratizing the mapping of gene mutations to protein biophysics.

Authors:  Debora S Marks; Stephen W Michnick
Journal:  Nature       Date:  2022-04       Impact factor: 49.962

3.  Proactive functional classification of all possible missense single-nucleotide variants in KCNQ4.

Authors:  Honglan Zheng; Xinhao Yan; Guanluan Li; Hengwei Lin; Siqi Deng; Wenhui Zhuang; Fuqiang Yao; Yu Lu; Xin Xia; Huijun Yuan; Li Jin; Zhiqiang Yan
Journal:  Genome Res       Date:  2022-06-27       Impact factor: 9.438

Review 4.  Artificial Intelligence Applied to Cardiomyopathies: Is It Time for Clinical Application?

Authors:  Kyung-Hee Kim; Joon-Myung Kwon; Tara Pereira; Zachi I Attia; Naveen L Pereira
Journal:  Curr Cardiol Rep       Date:  2022-09-01       Impact factor: 3.955

5.  Determinants of trafficking, conduction, and disease within a K+ channel revealed through multiparametric deep mutational scanning.

Authors:  Willow Coyote-Maestas; David Nedrud; Yungui He; Daniel Schmidt
Journal:  Elife       Date:  2022-05-31       Impact factor: 8.713

6.  The Location of Missense Variants in the Human GIP Gene Is Indicative for Natural Selection.

Authors:  Peter Lindquist; Lærke Smidt Gasbjerg; Jacek Mokrosinski; Jens Juul Holst; Alexander Sebastian Hauser; Mette Marie Rosenkilde
Journal:  Front Endocrinol (Lausanne)       Date:  2022-06-29       Impact factor: 6.055

7.  The impact of rare germline variants on human somatic mutation processes.

Authors:  Mischan Vali-Pour; Ben Lehner; Fran Supek
Journal:  Nat Commun       Date:  2022-06-28       Impact factor: 17.694

Review 8.  Interpreting protein variant effects with computational predictors and deep mutational scanning.

Authors:  Benjamin J Livesey; Joseph A Marsh
Journal:  Dis Model Mech       Date:  2022-06-23       Impact factor: 5.732

Review 9.  How Functional Genomics Can Keep Pace With VUS Identification.

Authors:  Corey L Anderson; Saba Munawar; Louise Reilly; Timothy J Kamp; Craig T January; Brian P Delisle; Lee L Eckhardt
Journal:  Front Cardiovasc Med       Date:  2022-07-04

Review 10.  Genetic load: genomic estimates and applications in non-model animals.

Authors:  Giorgio Bertorelle; Francesca Raffini; Hernán E Morales; Cock van Oosterhout; Mirte Bosse; Chiara Bortoluzzi; Alessio Iannucci; Emiliano Trucchi
Journal:  Nat Rev Genet       Date:  2022-02-08       Impact factor: 59.581

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.