
Quantitative Missense Variant Effect Prediction Using Large-Scale Mutagenesis Data.

Vanessa E Gray, Ronald J Hause, Jens Luebeck, Jay Shendure, Douglas M Fowler.

Abstract

Large datasets describing the quantitative effects of mutations on protein function are becoming increasingly available. Here, we leverage these datasets to develop Envision, which predicts the magnitude of a missense variant's molecular effect. Envision combines 21,026 variant effect measurements from nine large-scale experimental mutagenesis datasets, a hitherto untapped training resource, with a supervised, stochastic gradient boosting learning algorithm. Envision outperforms other missense variant effect predictors both on large-scale mutagenesis data and on an independent test dataset comprising 2,312 TP53 variants whose effects were measured using a low-throughput approach. This dataset was never used for hyperparameter tuning or model training and thus serves as an independent validation set. Envision prediction accuracy is also more consistent across amino acids than other predictors. Finally, we demonstrate that Envision's performance improves as more large-scale mutagenesis data are incorporated. We precompute Envision predictions for every possible single amino acid variant in human, mouse, frog, zebrafish, fruit fly, worm, and yeast proteomes (https://envision.gs.washington.edu/).
Copyright © 2017 Elsevier Inc. All rights reserved.
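
The approach summarized in the abstract, a supervised stochastic gradient boosting regressor evaluated on data withheld from training, can be illustrated with a minimal sketch. This is not Envision's implementation. The two features (`conservation`, `hydrophobicity_shift`), the synthetic scoring formula, and all function names and hyperparameter values are assumptions made only for illustration. The sketch fits depth-1 regression trees to residuals on a random subsample at each round, then compares held-out error against a mean-only baseline.

```python
import random

def fit_stump(rows, residuals):
    """Return (feature, threshold, left_mean, right_mean) minimizing squared error."""
    best = None
    for j in range(len(rows[0])):
        values = sorted(set(r[j] for r in rows))
        for t in values[:-1]:  # split as "<= t" vs "> t"; both sides are non-empty
            left = [e for r, e in zip(rows, residuals) if r[j] <= t]
            right = [e for r, e in zip(rows, residuals) if r[j] > t]
            lm, rm = sum(left) / len(left), sum(right) / len(right)
            sse = sum((e - lm) ** 2 for e in left) + sum((e - rm) ** 2 for e in right)
            if best is None or sse < best[0]:
                best = (sse, j, t, lm, rm)
    if best is None:  # no valid split: fall back to a constant prediction
        mean = sum(residuals) / len(residuals)
        return (0, float("inf"), mean, mean)
    return best[1:]

def stump_predict(stump, row):
    j, t, lm, rm = stump
    return lm if row[j] <= t else rm

def fit_sgb(rows, ys, n_rounds=60, learning_rate=0.2, subsample=0.5, seed=0):
    """Stochastic gradient boosting for squared loss (hypothetical settings)."""
    rng = random.Random(seed)
    base = sum(ys) / len(ys)
    preds = [base] * len(ys)
    stumps = []
    k = max(2, int(subsample * len(rows)))
    for _ in range(n_rounds):
        # The "stochastic" part: each round sees a random subsample, without replacement.
        idx = rng.sample(range(len(rows)), k)
        # For squared loss, the negative gradient is the current residual.
        stump = fit_stump([rows[i] for i in idx], [ys[i] - preds[i] for i in idx])
        stumps.append(stump)
        preds = [p + learning_rate * stump_predict(stump, r) for p, r in zip(preds, rows)]
    return base, learning_rate, stumps

def predict(model, row):
    base, lr, stumps = model
    return base + lr * sum(stump_predict(s, row) for s in stumps)

def mse(model, rows, ys):
    return sum((predict(model, r) - y) ** 2 for r, y in zip(rows, ys)) / len(ys)

def make_variants(n, rng):
    """Synthetic 'variants' with two made-up features and a made-up effect score."""
    rows, scores = [], []
    for _ in range(n):
        conservation = rng.random()
        hydrophobicity_shift = rng.random()
        score = 1.0 - 0.8 * conservation - 0.3 * hydrophobicity_shift ** 2 + rng.gauss(0, 0.05)
        rows.append((conservation, hydrophobicity_shift))
        scores.append(score)
    return rows, scores

rng = random.Random(42)
train_x, train_y = make_variants(120, rng)
test_x, test_y = make_variants(60, rng)  # held out: never used for fitting or tuning

model = fit_sgb(train_x, train_y)
train_mean = sum(train_y) / len(train_y)
baseline_mse = sum((train_mean - y) ** 2 for y in test_y) / len(test_y)
test_mse = mse(model, test_x, test_y)
print(f"baseline MSE: {baseline_mse:.4f}  boosted MSE: {test_mse:.4f}")
```

Keeping the test set entirely outside both fitting and hyperparameter selection mirrors the role the abstract assigns to the TP53 dataset: the reported error then reflects generalization rather than fit to the training data.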

Keywords:  large-scale mutagenesis; machine learning; variant effect prediction

Year: 2017; PMID: 29226803; PMCID: PMC5799033; DOI: 10.1016/j.cels.2017.11.003

Source DB: PubMed; Journal: Cell Syst; ISSN: 2405-4712; Impact factor: 10.304


