Literature DB >> 23357174

Proteins and domains vary in their tolerance of non-synonymous single nucleotide polymorphisms (nsSNPs).

Christopher M Yates1, Michael J E Sternberg.   

Abstract

The widespread application of whole-genome sequencing is identifying numerous non-synonymous single nucleotide polymorphisms (nsSNPs), many of which are associated with disease. We analyzed nsSNPs from Humsavar and the 1000 Genomes Project to investigate why some proteins and domains are more tolerant of mutations than others. We identified 311 proteins and 112 Pfam families, corresponding to 2910 domains, as diseasesusceptible and 32 proteins and 67 Pfam families (10,783 domains) as diseaseresistant based on the relative numbers of disease-associated and neutral polymorphisms. Proteins with no significant difference from expected numbers of disease and polymorphism nsSNPs are classified as other. This classification takes into account the phenotypes of all known mutations in the protein or domain rather than simply classifying based on the presence or absence of disease nsSNPs. Of the two hypotheses suggested, our results support the model that disease-resistant domains and proteins are more able to tolerate mutations rather than having more lethal mutations that are not observed. Disease-resistant proteins and domains show significantly higher mutation rates and lower sequence conservation than disease-susceptible proteins and domains. Disease-susceptible proteins are more likely to be encoded by essential genes, are more central in protein-protein interaction networks and are less likely to contain loss-of-function mutations in healthy individuals. We use this classification for nsSNP phenotype prediction, predicting nsSNPs in disease-susceptible domains to be disease and those in disease-resistant domains to be polymorphism. In this way, we achieve higher accuracy than SIFT, a state-of-the-art algorithm.
Copyright © 2013 Elsevier Ltd. All rights reserved.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23357174     DOI: 10.1016/j.jmb.2013.01.026

Source DB:  PubMed          Journal:  J Mol Biol        ISSN: 0022-2836            Impact factor:   5.469


  17 in total

1.  DAMpred: Recognizing Disease-Associated nsSNPs through Bayes-Guided Neural-Network Model Built on Low-Resolution Structure Prediction of Proteins and Protein-Protein Interactions.

Authors:  Lijun Quan; Hongjie Wu; Qiang Lyu; Yang Zhang
Journal:  J Mol Biol       Date:  2019-02-21       Impact factor: 5.469

2.  Prediction and expression analysis of deleterious nonsynonymous SNPs of Arabidopsis ACD11 gene by combining computational algorithms and molecular docking approach.

Authors:  Mahmudul Hasan Rifat; Jamil Ahmed; Milad Ahmed; Foeaz Ahmed; Airin Gulshan; Mahmudul Hasan
Journal:  PLoS Comput Biol       Date:  2022-06-16       Impact factor: 4.779

3.  The structure-based cancer-related single amino acid variation prediction.

Authors:  Jia-Jun Liu; Chin-Sheng Yu; Hsiao-Wei Wu; Yu-Jen Chang; Chih-Peng Lin; Chih-Hao Lu
Journal:  Sci Rep       Date:  2021-06-30       Impact factor: 4.379

Review 4.  Genetic variations and diseases in UniProtKB/Swiss-Prot: the ins and outs of expert manual curation.

Authors:  Maria Livia Famiglietti; Anne Estreicher; Arnaud Gos; Jerven Bolleman; Sébastien Géhant; Lionel Breuza; Alan Bridge; Sylvain Poux; Nicole Redaschi; Lydie Bougueleret; Ioannis Xenarios
Journal:  Hum Mutat       Date:  2014-06-24       Impact factor: 4.878

5.  Insight into neutral and disease-associated human genetic variants through interpretable predictors.

Authors:  Bastiaan A van den Berg; Marcel J T Reinders; Dick de Ridder; Tjaart A P de Beer
Journal:  PLoS One       Date:  2015-03-31       Impact factor: 3.240

6.  PinSnps: structural and functional analysis of SNPs in the context of protein interaction networks.

Authors:  Hui-Chun Lu; Julián Herrera Braga; Franca Fraternali
Journal:  Bioinformatics       Date:  2016-03-24       Impact factor: 6.937

7.  In Silico identification of SNP diversity in cultivated and wild tomato species: insight from molecular simulations.

Authors:  Archana Bhardwaj; Yogeshwar Vikram Dhar; Mehar Hasan Asif; Sumit K Bag
Journal:  Sci Rep       Date:  2016-12-08       Impact factor: 4.379

8.  SuSPect: enhanced prediction of single amino acid variant (SAV) phenotype using network features.

Authors:  Christopher M Yates; Ioannis Filippis; Lawrence A Kelley; Michael J E Sternberg
Journal:  J Mol Biol       Date:  2014-05-05       Impact factor: 5.469

Review 9.  The functional diversity of essential genes required for mammalian cardiac development.

Authors:  Christopher Clowes; Michael G S Boylan; Liam A Ridge; Emma Barnes; Jayne A Wright; Kathryn E Hentges
Journal:  Genesis       Date:  2014-06-24       Impact factor: 2.487

10.  Different evolutionary patterns of SNPs between domains and unassigned regions in human protein-coding sequences.

Authors:  Erli Pang; Xiaomei Wu; Kui Lin
Journal:  Mol Genet Genomics       Date:  2016-01-30       Impact factor: 3.291

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.