| Literature DB >> 16845089 |
Hsiang-Yu Yuan1, Jen-Jie Chiou, Wen-Hsien Tseng, Chia-Hung Liu, Chuan-Kun Liu, Yi-Jung Lin, Hui-Hung Wang, Adam Yao, Yuan-Tsong Chen, Chun-Nan Hsu.
Abstract
Single nucleotide polymorphism (SNP) prioritization based on the phenotypic risk is essential for association studies. Assessment of the risk requires access to a variety of heterogeneous biological databases and analytical tools. FASTSNP (function analysis and selection tool for single nucleotide polymorphisms) is a web server that allows users to efficiently identify and prioritize high-risk SNPs according to their phenotypic risks and putative functional effects. A unique feature of FASTSNP is that the functional effect information used for SNP prioritization is always up-to-date, because FASTSNP extracts the information from 11 external web servers at query time using a team of web wrapper agents. Moreover, FASTSNP is extendable by simply deploying more Web wrapper agents. To validate the results of our prioritization, we analyzed 1569 SNPs from the SNP500Cancer database. The results show that SNPs with a high predicted risk exhibit low allele frequencies for the minor alleles, consistent with a well-known finding that a strong selective pressure exists for functional polymorphisms. We have been using FASTSNP for 2 years and FASTSNP enables us to discover a novel promoter polymorphism. FASTSNP is available at http://fastsnp.ibms.sinica.edu.tw.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16845089 PMCID: PMC1538865 DOI: 10.1093/nar/gkl236
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Decision tree for prioritizing a SNP based on its functional effects. The diamonds represent decision points and the ovals represent terminal points with the risk and class assignments. Given an input SNP at a decision point, if the answer to the question in the diamond is ‘yes,’ then the vertical arrow should be followed. Otherwise, the horizontal arrow should be followed.
Definitions of the function types, their effects and predicted risks of SNPs
| Coding type | Function type | Possible effects | Risk (ranking) |
|---|---|---|---|
| Coding | Non-sense | Causes premature termination of an amino-acid sequence | Very high (5) |
| Splicing regulation (abolishing protein domain) | Breaks the exonic splicing enhancer/silencer binding site in a coding sequence, leading to abolished protein domain | Moderate to high (3∼4) | |
| Splicing regulation | Breaks the exonic splicing enhancer/silencer binding site in a coding sequence containing the same protein domains | Low to moderate (2∼3) | |
| Mis-sense (non-conservative change) | Alters an amino acid in a protein to one with different structure characteristics | Moderate to high (3∼4) | |
| Mis-sense (conservative change) | Alters an amino acid in a protein to one with similar structure characteristics | Low to moderate (2∼3) | |
| Sense/synonymous | Does not alter an amino acid in a sequence | Very low (1) | |
| Non-coding | Downstream with no known effect | No known effect | No known effect (0) |
| Upstream with no known effect | No known effect | No known effect (0) | |
| Splicing site | Breaks a consensus splicing site sequence | Moderate to high (3∼4) | |
| Promoter/regulatory region | Does not alter an amino acid, but can affect the level, location or timing of a gene expression | Very low to moderate (1∼3) | |
| Intronic enhancer | Alters a binding site of a transcription factor in an intronic region | Very low to low (1∼2) | |
| Untranslated | Changes an UTR in a sequence | No known effect to very low (0∼1) | |
| 3′utr post-transcriptional regulation | Breaks motifs likely to be involved in post-transcriptional regulation | Very low to moderate (1∼3) |
External web-based services accessed by FASTSNP
| Name/URL | Usage |
|---|---|
| NCBI dbSNP ( | Provides the location of a SNP in a gene and its alleles, allele frequency, and context sequence |
| Ensembl ( | Provides a cross-reference/alternative data source for dbSNP |
| TFSearch ( | Predicts if a non-coding SNP alters the transcription factor-binding site of a gene |
| PolyPhen ( | Predicts if a non-synonymous SNP alters an amino acid in a protein resulting in structural changes (damaged or benign) in a protein |
| ESEfinder ( | Predicts if a synonymous SNP is located in a exonic splicing enhancer motif, which would diminish the motif with a different allele |
| RescueESE ( | Provides a cross-reference/alternative data source for ESEfinder |
| FAS-ESS ( | Predicts exonic splicing silencer for each SNP allele |
| SwissProt ( | Provides the information about protein domains to determine if a SNP causes an alternative splicing that leads to a protein domain being abolished |
| UCSC Golden Path ( | Provides information about the final draft assembly of the genome sequence (i.e. Golden Path) for quality control of candidate SNPs |
| NCBI Blast ( | Sequence comparison and search tool for quality control of candidate SNPs |
| HapMap ( | Provides information about the haplotype and linkage disequilibrium around a SNP |
The first eight services provide databases and analytical tools to predict functional effects for SNP prioritization. UCSC Golden Path and NCBI Blast are used for quality control of candidate SNPs, while the haplotype database from HapMap is useful for further reducing the number of candidate genes for genotyping.
Figure 2FastSNP component modules and data flow. Each gray box represents an external web server accessed by a web wrapper agent, and each blue box represents a local Java program. Open boxes are data and the arrows represent the data flow.
Figure 3Comparison of SNP's risk predicted by FASTSNP and their MAF. The results are obtained by analyzing 1569 SNPs from the SNP500Cancer database with FASTSNP.