| Literature DB >> 23431257 |
Abstract
The identification of genetic variants that are responsible for human inherited diseases is a fundamental problem in human and medical genetics. As a typical type of genetic variation, nonsynonymous single-nucleotide polymorphisms (nsSNPs) occurring in protein coding regions may alter the encoded amino acid, potentially affect protein structure and function, and further result in human inherited diseases. Therefore, it is of great importance to develop computational approaches to facilitate the discrimination of deleterious nsSNPs from neutral ones. In this paper, we review databases that collect nsSNPs and summarize computational methods for the identification of deleterious nsSNPs. We classify the existing methods for characterizing nsSNPs into three categories (sequence based, structure based, and annotation based), and we introduce machine learning models for the prediction of deleterious nsSNPs. We further discuss methods for identifying deleterious nsSNPs in noncoding variants and those for dealing with rare variants.Entities:
Mesh:
Year: 2013 PMID: 23431257 PMCID: PMC3572689 DOI: 10.1155/2013/675851
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Database for collecting nsSNP data.
| Database | Website | Reference ID |
|---|---|---|
| Online Mendelian Inheritance in Man (OMIM) |
| [ |
| Human Gene Mutation Database (HGMD) |
| [ |
| UniPROT/SWISS-PROT database |
| [ |
| Human Genome Variation database (HGVbase) |
| [ |
| Single-nucleotide polymorphism database (dbSNP) |
| [ |
| Protein Mutant Database (PMD) |
| [ |
| Database for nonsynonymous SNPs' functional predictions (dbNSFP) |
| [ |
Tools for deleterious variant detection.
| Method | Website | Features | Method description | Reference ID |
|---|---|---|---|---|
| SIFT |
| Sequence based | Statistical method using PSSM with Dirichlet priors | [ |
| PolyPhen |
| Sequence based, structure based, annotation | Rule-based model | [ |
| SNAP |
| Sequence based, annotation | Standard feed-forward neural networks with momentum term | [ |
| MSRV |
| Sequence based | Multiple selection rule voting strategy using random forest | [ |
| LRT |
| Sequence based | Log ratio test | [ |
| PolyPhen-2 |
| Sequence based, structure based | Naïve Bayes approach coupled with entropy-based discretization | [ |
| MutationTaster |
| Sequence based, annotation | Naïve bayes model based on integrated data source | [ |
| KGGSeq |
| Sequence based, annotation | A three-level framework to combine a number of filtration and prioritization functions | [ |
| SInBaD |
| Sequence based | Separate mathematical models for promoters, exons, and introns, using logistic regression algorithm | [ |
| GERP (score) |
| Sequence based | A “Rejected Substitutions” score computation to infer the constrained region | [ |
| PhyloP (score) |
| Sequence based | An exact | [ |
Figure 1Web interface of MSRV.
Figure 2Typical procedure for deleterious nsSNPs detection.