Ali Torkamani1, Nicholas J Schork. 1. Department of Medicine, Center for Human Genetics and Genomics, The Scripps Research Institute, University of California, San Diego, La Jolla, CA 92093, USA.
Abstract
MOTIVATION: Contemporary, high-throughput sequencing efforts have identified a rich source of naturally occurring single nucleotide polymorphisms (SNPs), a subset of which occur in the coding region of genes and result in a change in the encoded amino acid sequence (non-synonymous coding SNPs or 'nsSNPs'). It is hypothesized that a subset of these nsSNPs may underlie common human disease. Testing all these polymorphisms for disease association would be time consuming and expensive. Thus, computational methods have been developed to both prioritize candidate nsSNPs and make sense of their likely molecular physiologic impact. RESULTS: We have developed a method to prioritize nsSNPs and have applied it to the human protein kinase gene family. The results of our analyses provide high quality predictions and outperform available whole genome prediction methods (74% versus 83% prediction accuracy). Our analyses and methods consider both DNA sequence conservation, which most traditional methods are based on, as well unique structural and functional features of kinases. We provide a ranked list of common kinase nsSNPs that have a higher probability of impacting human disease based on our analyses.
MOTIVATION: Contemporary, high-throughput sequencing efforts have identified a rich source of naturally occurring single nucleotide polymorphisms (SNPs), a subset of which occur in the coding region of genes and result in a change in the encoded amino acid sequence (non-synonymous coding SNPs or 'nsSNPs'). It is hypothesized that a subset of these nsSNPs may underlie common human disease. Testing all these polymorphisms for disease association would be time consuming and expensive. Thus, computational methods have been developed to both prioritize candidate nsSNPs and make sense of their likely molecular physiologic impact. RESULTS: We have developed a method to prioritize nsSNPs and have applied it to the human protein kinase gene family. The results of our analyses provide high quality predictions and outperform available whole genome prediction methods (74% versus 83% prediction accuracy). Our analyses and methods consider both DNA sequence conservation, which most traditional methods are based on, as well unique structural and functional features of kinases. We provide a ranked list of common kinase nsSNPs that have a higher probability of impacting human disease based on our analyses.
Authors: Biao Li; Vidhya G Krishnan; Matthew E Mort; Fuxiao Xin; Kishore K Kamati; David N Cooper; Sean D Mooney; Predrag Radivojac Journal: Bioinformatics Date: 2009-09-03 Impact factor: 6.937
Authors: Jose M G Izarzugaza; Anja Baresic; Lisa E M McMillan; Corin Yeats; Andrew B Clegg; Christine A Orengo; Andrew C R Martin; Alfonso Valencia Journal: BMC Bioinformatics Date: 2009-08-27 Impact factor: 3.169
Authors: Anshuman Dixit; Lin Yi; Ragul Gowthaman; Ali Torkamani; Nicholas J Schork; Gennady M Verkhivker Journal: PLoS One Date: 2009-10-16 Impact factor: 3.240