Literature DB >> 15759645

Improving functional annotation of non-synonomous SNPs with information theory.

R Karchin1, L Kelly, A Sali.   

Abstract

Automated functional annotation of nsSNPs requires that amino-acid residue changes are represented by a set of descriptive features, such as evolutionary conservation, side-chain volume change, effect on ligand-binding, and residue structural rigidity. Identifying the most informative combinations of features is critical to the success of a computational prediction method. We rank 32 features according to their mutual information with functional effects of amino-acid substitutions, as measured by in vivo assays. In addition, we use a greedy algorithm to identify a subset of highly informative features. The method is simple to implement and provides a quantitative measure for selecting the best predictive features given a set of features that a human expert believes to be informative. We demonstrate the usefulness of the selected highly informative features by cross-validated tests of a computational classifier, a support vector machine (SVM). The SVM's classification accuracy is highly correlated with the ranking of the input features by their mutual information. Two features describing the solvent accessibility of "wild-type" and "mutant" amino-acid residues and one evolutionary feature based on superfamily-level multiple alignments produce comparable overall accuracy and 6% fewer false positives than a 32-feature set that considers physiochemical properties of amino acids, protein electrostatics, amino-acid residue flexibility, and binding interactions.

Entities:  

Mesh:

Year:  2005        PMID: 15759645     DOI: 10.1142/9789812702456_0038

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  17 in total

Review 1.  Using bioinformatics to predict the functional impact of SNVs.

Authors:  Melissa S Cline; Rachel Karchin
Journal:  Bioinformatics       Date:  2010-12-15       Impact factor: 6.937

2.  Functional hot spots in human ATP-binding cassette transporter nucleotide binding domains.

Authors:  Libusha Kelly; Hisayo Fukushima; Rachel Karchin; Jason M Gow; Leslie W Chinn; Ursula Pieper; Mark R Segal; Deanna L Kroetz; Andrej Sali
Journal:  Protein Sci       Date:  2010-11       Impact factor: 6.725

3.  A composite score for predicting errors in protein structure models.

Authors:  David Eramian; Min-yi Shen; Damien Devos; Francisco Melo; Andrej Sali; Marc A Marti-Renom
Journal:  Protein Sci       Date:  2006-06-02       Impact factor: 6.725

4.  A novel computational and structural analysis of nsSNPs in CFTR gene.

Authors:  C George Priya Doss; R Rajasekaran; C Sudandiradoss; K Ramanathan; R Purohit; R Sethumadhavan
Journal:  Genomic Med       Date:  2008-05-14

5.  Computational analysis of missense mutations causing Snyder-Robinson syndrome.

Authors:  Zhe Zhang; Shaolei Teng; Liangjiang Wang; Charles E Schwartz; Emil Alexov
Journal:  Hum Mutat       Date:  2010-09       Impact factor: 4.878

6.  Analysis of a set of missense, frameshift, and in-frame deletion variants of BRCA1.

Authors:  Marcelo Carvalho; Maria A Pino; Rachel Karchin; Jennifer Beddor; Martha Godinho-Netto; Rafael D Mesquita; Renato S Rodarte; Danielle C Vaz; Viviane A Monteiro; Siranoush Manoukian; Mara Colombo; Carla B Ripamonti; Richard Rosenquist; Graeme Suthers; Ake Borg; Paolo Radice; Scott A Grist; Alvaro N A Monteiro; Blase Billack
Journal:  Mutat Res       Date:  2008-10-17       Impact factor: 2.433

7.  In silico functional profiling of human disease-associated and polymorphic amino acid substitutions.

Authors:  Matthew Mort; Uday S Evani; Vidhya G Krishnan; Kishore K Kamati; Peter H Baenziger; Angshuman Bagchi; Brandon J Peters; Rakesh Sathyesh; Biao Li; Yanan Sun; Bin Xue; Nigam H Shah; Maricel G Kann; David N Cooper; Predrag Radivojac; Sean D Mooney
Journal:  Hum Mutat       Date:  2010-03       Impact factor: 4.878

8.  Comparing Conformational Ensembles Using the Kullback-Leibler Divergence Expansion.

Authors:  Christopher L McClendon; Lan Hua; Abriela Barreiro; Matthew P Jacobson
Journal:  J Chem Theory Comput       Date:  2012-04-13       Impact factor: 6.006

Review 9.  Connecting protein interaction data, mutations, and disease using bioinformatics.

Authors:  Jake Y Chen; Eunseog Youn; Sean D Mooney
Journal:  Methods Mol Biol       Date:  2009

10.  Cancer-specific high-throughput annotation of somatic mutations: computational prediction of driver missense mutations.

Authors:  Hannah Carter; Sining Chen; Leyla Isik; Svitlana Tyekucheva; Victor E Velculescu; Kenneth W Kinzler; Bert Vogelstein; Rachel Karchin
Journal:  Cancer Res       Date:  2009-08-04       Impact factor: 12.701

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.