Literature DB >> 33676930

The DBSAV Database: Predicting Deleteriousness of Single Amino Acid Variations in the Human Proteome.

Jimin Pei1, Nick V Grishin2.   

Abstract

Deleterious single amino acid variation (SAV) is one of the leading causes of human diseases. Evaluating the functional impact of SAVs is crucial for diagnosis of genetic disorders. We previously developed a deep convolutional neural network predictor, DeepSAV, to evaluate the deleterious effects of SAVs on protein function based on various sequence, structural, and functional properties. DeepSAV scores of rare SAVs observed in the human population are aggregated into a gene-level score called GTS (Gene Tolerance of rare SAVs) that reflects a gene's tolerance to deleterious missense mutations and serves as a useful tool to study gene-disease associations. In this study, we aim to enhance the performance of DeepSAV by using expanded datasets of pathogenic and benign variants, more features, and neural network optimization. We found that multiple sequence alignments built from vertebrate-level orthologs yield better prediction results compared to those built from mammalian-level orthologs. For multiple sequence alignments built from BLAST searches, optimal performance was achieved with a sequence identify cutoff of 50% to remove distant homologs. The new version of DeepSAV exhibits the best performance among standalone predictors of deleterious effects of SAVs. We developed the DBSAV database (http://prodata.swmed.edu/DBSAV) that reports GTS scores of human genes and DeepSAV scores of SAVs in the human proteome, including pathogenic and benign SAVs, population-level SAVs, and all possible SAVs by single nucleotide variations. This database serves as a useful resource for research of human SAVs and their relationships with protein functions and human diseases.
Copyright © 2021 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  benign variants; genetic variations; neural network predictor; pathogenic variants; variant deleteriousness prediction

Mesh:

Substances:

Year:  2021        PMID: 33676930      PMCID: PMC8119332          DOI: 10.1016/j.jmb.2021.166915

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   6.151


  47 in total

1.  SPIDER2: A Package to Predict Secondary Structure, Accessible Surface Area, and Main-Chain Torsional Angles by Deep Neural Networks.

Authors:  Yuedong Yang; Rhys Heffernan; Kuldip Paliwal; James Lyons; Abdollah Dehzangi; Alok Sharma; Jihua Wang; Abdul Sattar; Yaoqi Zhou
Journal:  Methods Mol Biol       Date:  2017

2.  ClinPred: Prediction Tool to Identify Disease-Relevant Nonsynonymous Single-Nucleotide Variants.

Authors:  Najmeh Alirezaie; Kristin D Kernohan; Taila Hartley; Jacek Majewski; Toby Dylan Hocking
Journal:  Am J Hum Genet       Date:  2018-09-13       Impact factor: 11.025

3.  ClinVar: improvements to accessing data.

Authors:  Melissa J Landrum; Shanmuga Chitipiralla; Garth R Brown; Chao Chen; Baoshan Gu; Jennifer Hart; Douglas Hoffman; Wonhee Jang; Kuljeet Kaur; Chunlei Liu; Vitaly Lyoshin; Zenith Maddipatla; Rama Maiti; Joseph Mitchell; Nuala O'Leary; George R Riley; Wenyao Shi; George Zhou; Valerie Schneider; Donna Maglott; J Bradley Holmes; Brandi L Kattman
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

4.  IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding.

Authors:  Bálint Mészáros; Gábor Erdos; Zsuzsanna Dosztányi
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

5.  dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs.

Authors:  Xiaoming Liu; Chunlei Wu; Chang Li; Eric Boerwinkle
Journal:  Hum Mutat       Date:  2016-01-05       Impact factor: 4.878

6.  A spectral approach integrating functional genomic annotations for coding and noncoding variants.

Authors:  Iuliana Ionita-Laza; Kenneth McCallum; Bin Xu; Joseph D Buxbaum
Journal:  Nat Genet       Date:  2016-01-04       Impact factor: 38.330

7.  OrthoDB v10: sampling the diversity of animal, plant, fungal, protist, bacterial and viral genomes for evolutionary and functional annotations of orthologs.

Authors:  Evgenia V Kriventseva; Dmitry Kuznetsov; Fredrik Tegenfeldt; Mosè Manni; Renata Dias; Felipe A Simão; Evgeny M Zdobnov
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

8.  Identifying Mendelian disease genes with the variant effect scoring tool.

Authors:  Hannah Carter; Christopher Douville; Peter D Stenson; David N Cooper; Rachel Karchin
Journal:  BMC Genomics       Date:  2013-05-28       Impact factor: 3.969

9.  The SAAP pipeline and database: tools to analyze the impact and predict the pathogenicity of mutations.

Authors:  Nouf S Al-Numair; Andrew C R Martin
Journal:  BMC Genomics       Date:  2013-05-28       Impact factor: 3.969

10.  FATHMM-XF: accurate prediction of pathogenic point mutations via extended features.

Authors:  Mark F Rogers; Hashem A Shihab; Matthew Mort; David N Cooper; Tom R Gaunt; Colin Campbell
Journal:  Bioinformatics       Date:  2018-02-01       Impact factor: 6.937

View more
  5 in total

Review 1.  Advances and Trends in Omics Technology Development.

Authors:  Xiaofeng Dai; Li Shen
Journal:  Front Med (Lausanne)       Date:  2022-07-01

2.  GWYRE: A Resource for Mapping Variants onto Experimental and Modeled Structures of Human Protein Complexes.

Authors:  Sukhaswami Malladi; Harold R Powell; Alessia David; Suhail A Islam; Matthew M Copeland; Petras J Kundrotas; Michael J E Sternberg; Ilya A Vakser
Journal:  J Mol Biol       Date:  2022-04-27       Impact factor: 6.151

3.  Accurate prediction of protein structures and interactions using a three-track neural network.

Authors:  Minkyung Baek; Frank DiMaio; Ivan Anishchenko; Justas Dauparas; Sergey Ovchinnikov; Gyu Rie Lee; Jue Wang; Qian Cong; Lisa N Kinch; R Dustin Schaeffer; Claudia Millán; Hahnbeom Park; Carson Adams; Caleb R Glassman; Andy DeGiovanni; Jose H Pereira; Andria V Rodrigues; Alberdina A van Dijk; Ana C Ebrecht; Diederik J Opperman; Theo Sagmeister; Christoph Buhlheller; Tea Pavkov-Keller; Manoj K Rathinaswamy; Udit Dalwadi; Calvin K Yip; John E Burke; K Christopher Garcia; Nick V Grishin; Paul D Adams; Randy J Read; David Baker
Journal:  Science       Date:  2021-07-15       Impact factor: 47.728

4.  Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs.

Authors:  Neeladri Sen; Ivan Anishchenko; Nicola Bordin; Ian Sillitoe; Sameer Velankar; David Baker; Christine Orengo
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

5.  Pathogenic variation types in human genes relate to diseases through Pfam and InterPro mapping.

Authors:  Giulia Babbi; Castrense Savojardo; Davide Baldazzi; Pier Luigi Martelli; Rita Casadio
Journal:  Front Mol Biosci       Date:  2022-09-16
  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.