Literature DB >> 30796987

DAMpred: Recognizing Disease-Associated nsSNPs through Bayes-Guided Neural-Network Model Built on Low-Resolution Structure Prediction of Proteins and Protein-Protein Interactions.

Lijun Quan1, Hongjie Wu2, Qiang Lyu3, Yang Zhang4.   

Abstract

Nearly one-third of non-synonymous single-nucleotide polymorphism (nsSNPs) are deleterious to human health, but recognition of the disease-associated mutations remains a significant unsolved problem. We proposed a new algorithm, DAMpred, to identify disease-causing nsSNPs through the coupling of evolutionary profiles with structure predictions of proteins and protein-protein interactions. The pipeline was trained by a novel Bayes-guided artificial neural network algorithm that incorporates posterior probabilities of distinct feature classifiers with the network training process. DAMpred was tested on a large-scale data set involving 10,635 nsSNPs from 2154 ORFs in the human genome and recognized disease-associated nsSNPs with an accuracy 0.80 and a Matthews correlation coefficient of 0.601, which is 9.1% higher than the best of other state-of-the-art methods. In the blind test on the TP53 gene, DAMpred correctly recognized the mutations causative of Li-Fraumeni-like syndrome with a Matthews correlation coefficient that is 27% higher than the control methods. The study demonstrates an efficient avenue to quantitatively model the association of nsSNPs with human diseases from low-resolution protein structure prediction, which should find important usefulness in diagnosis and treatment of genetic diseases.
Copyright © 2019 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Bayes-guided artificial neural network algorithm; non-synonymous single nucleotide polymorphisms; p53 protein; protein structure prediction; protein–protein interaction

Mesh:

Substances:

Year:  2019        PMID: 30796987      PMCID: PMC6589125          DOI: 10.1016/j.jmb.2019.02.017

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  39 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  How significant is a protein structure similarity with TM-score = 0.5?

Authors:  Jinrui Xu; Yang Zhang
Journal:  Bioinformatics       Date:  2010-02-17       Impact factor: 6.937

3.  Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm.

Authors:  Prateek Kumar; Steven Henikoff; Pauline C Ng
Journal:  Nat Protoc       Date:  2009-06-25       Impact factor: 13.491

4.  Predicting the insurgence of human genetic diseases associated to single point protein mutations with support vector machines and evolutionary information.

Authors:  E Capriotti; R Calabrese; R Casadio
Journal:  Bioinformatics       Date:  2006-08-07       Impact factor: 6.937

5.  I-TASSER: a unified platform for automated protein structure and function prediction.

Authors:  Ambrish Roy; Alper Kucukural; Yang Zhang
Journal:  Nat Protoc       Date:  2010-03-25       Impact factor: 13.491

6.  A method and server for predicting damaging missense mutations.

Authors:  Ivan A Adzhubei; Steffen Schmidt; Leonid Peshkin; Vasily E Ramensky; Anna Gerasimova; Peer Bork; Alexey S Kondrashov; Shamil R Sunyaev
Journal:  Nat Methods       Date:  2010-04       Impact factor: 28.547

7.  Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

Authors:  John A Capra; Roman A Laskowski; Janet M Thornton; Mona Singh; Thomas A Funkhouser
Journal:  PLoS Comput Biol       Date:  2009-12-04       Impact factor: 4.475

8.  Mendelian Inheritance in Man and its online version, OMIM.

Authors:  Victor A McKusick
Journal:  Am J Hum Genet       Date:  2007-03-08       Impact factor: 11.025

9.  The Universal Protein Resource (UniProt) in 2010.

Authors: 
Journal:  Nucleic Acids Res       Date:  2009-10-20       Impact factor: 16.971

10.  LOMETS: a local meta-threading-server for protein structure prediction.

Authors:  Sitao Wu; Yang Zhang
Journal:  Nucleic Acids Res       Date:  2007-05-03       Impact factor: 16.971

View more
  5 in total

1.  Prediction of disease-associated nsSNPs by integrating multi-scale ResNet models with deep feature fusion.

Authors:  Fang Ge; Ying Zhang; Jian Xu; Arif Muhammad; Jiangning Song; Dong-Jun Yu
Journal:  Brief Bioinform       Date:  2022-01-17       Impact factor: 11.622

2.  ADDRESS: A Database of Disease-associated Human Variants Incorporating Protein Structure and Folding Stabilities.

Authors:  Jaie Woodard; Chengxin Zhang; Yang Zhang
Journal:  J Mol Biol       Date:  2021-02-02       Impact factor: 6.151

3.  MutTMPredictor: Robust and accurate cascade XGBoost classifier for prediction of mutations in transmembrane proteins.

Authors:  Fang Ge; Yi-Heng Zhu; Jian Xu; Arif Muhammad; Jiangning Song; Dong-Jun Yu
Journal:  Comput Struct Biotechnol J       Date:  2021-11-19       Impact factor: 7.271

4.  Circuit topology predicts pathogenicity of missense mutations.

Authors:  Jaie Woodard; Sumaiya Iqbal; Alireza Mashaghi
Journal:  Proteins       Date:  2022-04-23

Review 5.  Evolution of Sequence-based Bioinformatics Tools for Protein-protein Interaction Prediction.

Authors:  Mst Shamima Khatun; Watshara Shoombuatong; Md Mehedi Hasan; Hiroyuki Kurata
Journal:  Curr Genomics       Date:  2020-09       Impact factor: 2.236

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.