Literature DB >> 15746281

Prediction of the phenotypic effects of non-synonymous single nucleotide polymorphisms using structural and evolutionary information.

Lei Bao1, Yan Cui.   

Abstract

MOTIVATION: There has been great expectation that the knowledge of an individual's genotype will provide a basis for assessing susceptibility to diseases and designing individualized therapy. Non-synonymous single nucleotide polymorphisms (nsSNPs) that lead to an amino acid change in the protein product are of particular interest because they account for nearly half of the known genetic variations related to human inherited diseases. To facilitate the identification of disease-associated nsSNPs from a large number of neutral nsSNPs, it is important to develop computational tools to predict the phenotypic effects of nsSNPs.
RESULTS: We prepared a training set based on the variant phenotypic annotation of the Swiss-Prot database and focused our analysis on nsSNPs having homologous 3D structures. Structural environment parameters derived from the 3D homologous structure as well as evolutionary information derived from the multiple sequence alignment were used as predictors. Two machine learning methods, support vector machine and random forest, were trained and evaluated. We compared the performance of our method with that of the SIFT algorithm, which is one of the best predictive methods to date. An unbiased evaluation study shows that for nsSNPs with sufficient evolutionary information (with not <10 homologous sequences), the performance of our method is comparable with the SIFT algorithm, while for nsSNPs with insufficient evolutionary information (<10 homologous sequences), our method outperforms the SIFT algorithm significantly. These findings indicate that incorporating structural information is critical to achieving good prediction accuracy when sufficient evolutionary information is not available. AVAILABILITY: The codes and curated dataset are available at http://compbio.utmem.edu/snp/dataset/

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 15746281     DOI: 10.1093/bioinformatics/bti365

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  57 in total

1.  Testing computational prediction of missense mutation phenotypes: functional characterization of 204 mutations of human cystathionine beta synthase.

Authors:  Qiong Wei; Liqun Wang; Qiang Wang; Warren D Kruger; Roland L Dunbrack
Journal:  Proteins       Date:  2010-07

2.  Broiler chickens can benefit from machine learning: support vector machine analysis of observational epidemiological data.

Authors:  Philip J Hepworth; Alexey V Nefedov; Ilya B Muchnik; Kenton L Morgan
Journal:  J R Soc Interface       Date:  2012-02-08       Impact factor: 4.118

3.  Functional hot spots in human ATP-binding cassette transporter nucleotide binding domains.

Authors:  Libusha Kelly; Hisayo Fukushima; Rachel Karchin; Jason M Gow; Leslie W Chinn; Ursula Pieper; Mark R Segal; Deanna L Kroetz; Andrej Sali
Journal:  Protein Sci       Date:  2010-11       Impact factor: 6.725

4.  Protein stability and in vivo concentration of missense mutations in phenylalanine hydroxylase.

Authors:  Zhen Shi; Jenn Sellers; John Moult
Journal:  Proteins       Date:  2011-09-21

5.  Sequence-based prioritization of nonsynonymous single-nucleotide polymorphisms for the study of disease mutations.

Authors:  Rui Jiang; Hua Yang; Linqi Zhou; C-C Jay Kuo; Fengzhu Sun; Ting Chen
Journal:  Am J Hum Genet       Date:  2007-06-22       Impact factor: 11.025

6.  Prediction of functional nonsynonymous single nucleotide polymorphisms in human G-protein-coupled receptors.

Authors:  Dan Xue; Jingyuan Yin; Mingfeng Tan; Junjie Yue; Yuelan Wang; Long Liang
Journal:  J Hum Genet       Date:  2008-02-26       Impact factor: 3.172

7.  Position-specific residue preference features around the ends of helices and strands and a novel strategy for the prediction of secondary structures.

Authors:  Mojie Duan; Min Huang; Chuang Ma; Lun Li; Yanhong Zhou
Journal:  Protein Sci       Date:  2008-06-02       Impact factor: 6.725

Review 8.  Hypothesis-driven candidate gene association studies: practical design and analytical considerations.

Authors:  Timothy J Jorgensen; Ingo Ruczinski; Bailey Kessing; Michael W Smith; Yin Yao Shugart; Anthony J Alberg
Journal:  Am J Epidemiol       Date:  2009-09-17       Impact factor: 4.897

Review 9.  Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data.

Authors:  Gregory M Cooper; Jay Shendure
Journal:  Nat Rev Genet       Date:  2011-08-18       Impact factor: 53.242

10.  A new disease-specific machine learning approach for the prediction of cancer-causing missense variants.

Authors:  Emidio Capriotti; Russ B Altman
Journal:  Genomics       Date:  2011-07-07       Impact factor: 5.736

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.