Literature DB >> 17935148

Use of estimated evolutionary strength at the codon level improves the prediction of disease-related protein mutations in humans.

Emidio Capriotti1, Leonardo Arbiza, Rita Casadio, Joaquín Dopazo, Hernán Dopazo, Marc A Marti-Renom.   

Abstract

Predicting the functional impact of protein variation is one of the most challenging problems in bioinformatics. A rapidly growing number of genome-scale studies provide large amounts of experimental data, allowing the application of rigorous statistical approaches for predicting whether a given single point mutation has an impact on human health. Up until now, existing methods have limited their source data to either protein or gene information. Novel in this work, we take advantage of both and focus on protein evolutionary information by using estimated selective pressures at the codon level. Here we introduce a new method (SeqProfCod) to predict the likelihood that a given protein variant is associated with human disease or not. Our method relies on a support vector machine (SVM) classifier trained using three sources of information: protein sequence, multiple protein sequence alignments, and the estimation of selective pressure at the codon level. SeqProfCod has been benchmarked with a large dataset of 8,987 single point mutations from 1,434 human proteins from SWISS-PROT. It achieves 82% overall accuracy and a correlation coefficient of 0.59, indicating that the estimation of the selective pressure helps in predicting the functional impact of single-point mutations. Moreover, this study demonstrates the synergic effect of combining two sources of information for predicting the functional effects of protein variants: protein sequence/profile-based information and the evolutionary estimation of the selective pressures at the codon level. The results of large-scale application of SeqProfCod over all annotated point mutations in SWISS-PROT (available for download at http://sgu.bioinfo.cipf.es/services/Omidios/; last accessed: 24 August 2007), could be used to support clinical studies. (c) 2007 Wiley-Liss, Inc.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 17935148     DOI: 10.1002/humu.20628

Source DB:  PubMed          Journal:  Hum Mutat        ISSN: 1059-7794            Impact factor:   4.878


  18 in total

Review 1.  Bioinformatics for personal genome interpretation.

Authors:  Emidio Capriotti; Nathan L Nehrt; Maricel G Kann; Yana Bromberg
Journal:  Brief Bioinform       Date:  2012-01-13       Impact factor: 11.622

2.  Meet me halfway: when genomics meets structural bioinformatics.

Authors:  Sungsam Gong; Catherine L Worth; Tammy M K Cheng; Tom L Blundell
Journal:  J Cardiovasc Transl Res       Date:  2011-02-25       Impact factor: 4.132

3.  A new disease-specific machine learning approach for the prediction of cancer-causing missense variants.

Authors:  Emidio Capriotti; Russ B Altman
Journal:  Genomics       Date:  2011-07-07       Impact factor: 5.736

Review 4.  Bioinformatics and variability in drug response: a protein structural perspective.

Authors:  Jennifer L Lahti; Grace W Tang; Emidio Capriotti; Tianyun Liu; Russ B Altman
Journal:  J R Soc Interface       Date:  2012-05-02       Impact factor: 4.118

Review 5.  Computational methods and resources for the interpretation of genomic variants in cancer.

Authors:  Rui Tian; Malay K Basu; Emidio Capriotti
Journal:  BMC Genomics       Date:  2015-06-18       Impact factor: 3.969

6.  In silico analysis of missense substitutions using sequence-alignment based methods.

Authors:  Sean V Tavtigian; Marc S Greenblatt; Fabienne Lesueur; Graham B Byrnes
Journal:  Hum Mutat       Date:  2008-11       Impact factor: 4.878

7.  Classification of rare missense substitutions, using risk surfaces, with genetic- and molecular-epidemiology applications.

Authors:  Sean V Tavtigian; Graham B Byrnes; David E Goldgar; Alun Thomas
Journal:  Hum Mutat       Date:  2008-11       Impact factor: 4.878

8.  In silico functional profiling of human disease-associated and polymorphic amino acid substitutions.

Authors:  Matthew Mort; Uday S Evani; Vidhya G Krishnan; Kishore K Kamati; Peter H Baenziger; Angshuman Bagchi; Brandon J Peters; Rakesh Sathyesh; Biao Li; Yanan Sun; Bin Xue; Nigam H Shah; Maricel G Kann; David N Cooper; Predrag Radivojac; Sean D Mooney
Journal:  Hum Mutat       Date:  2010-03       Impact factor: 4.878

9.  MuD: an interactive web server for the prediction of non-neutral substitutions using protein structural data.

Authors:  Gilad Wainreb; Haim Ashkenazy; Yana Bromberg; Alina Starovolsky-Shitrit; Turkan Haliloglu; Eytan Ruppin; Karen B Avraham; Burkhard Rost; Nir Ben-Tal
Journal:  Nucleic Acids Res       Date:  2010-06-11       Impact factor: 16.971

10.  WS-SNPs&GO: a web server for predicting the deleterious effect of human protein variants using functional annotation.

Authors:  Emidio Capriotti; Remo Calabrese; Piero Fariselli; Pier Luigi Martelli; Russ B Altman; Rita Casadio
Journal:  BMC Genomics       Date:  2013-05-28       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.