| Literature DB >> 17537826 |
Alper Uzun1, Chesley M Leslin, Alexej Abyzov, Valentin Ilyin.
Abstract
SNPs located within the open reading frame of a gene that result in an alteration in the amino acid sequence of the encoded protein [nonsynonymous SNPs (nsSNPs)] might directly or indirectly affect functionality of the protein, alone or in the interactions in a multi-protein complex, by increasing/decreasing the activity of the metabolic pathway. Understanding the functional consequences of such changes and drawing conclusions about the molecular basis of diseases, involves integrating information from multiple heterogeneous sources including sequence, structure data and pathway relations between proteins. The data from NCBI's SNP database (dbSNP), gene and protein databases from Entrez, protein structures from the PDB and pathway information from KEGG have all been cross referenced into the StSNP web server, in an effort to provide combined integrated, reports about nsSNPs. StSNP provides 'on the fly' comparative modeling of nsSNPs with links to metabolic pathway information, along with real-time visual comparative analysis of the modeled structures using the Friend software application. The use of metabolic pathways in StSNP allows a researcher to examine possible disease-related pathways associated with a particular nsSNP(s), and link the diseases with the current available molecular structure data. The server is publicly available at http://glinka.bio.neu.edu/StSNP/.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17537826 PMCID: PMC1933130 DOI: 10.1093/nar/gkm232
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Representing query and modeling options for resources
| SNPs3D | PolyPhen | topoSNP | SNPeffect | LS-SNP | MutDB | SNP@Domain | Snap | StSNP | |
|---|---|---|---|---|---|---|---|---|---|
| Keyword search is available | Yes | No | No | Yes | No | Yes | No | Yes | Yes |
| Modeling is available | Yes (precomputed) | No | Yes (precomputed) | No | Yes (precomputed) | No, mutations are highlighted in the structure | No, highlighting the amino acids affected by SNPs. | No | Yes (on the fly) |
| Pathway Information | Yes | No | No | No | Yes | No | No | Gene relations are available | Yes |
| Search by Protein ID | Yes | Yes | No | Yes | Yes | Yes | No | Yes | Yes |
| Search by FASTA sequence | No | Yes | Yes | No | No | No | No | No | No |
| Search by PDB ID | Yes | No | No | Yes | No | No | No | No | Yes |
| Analysis of nsSNPs | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | No |
| Connection with OMIM or nsSNP info | Yes | Yes | Yes | Yes | Yes | No | Yes | Yes | No |
| Graphic display of the nsSNPs on protein sequence | No | No | No | No | No | Yes | Yes | Yes | Yes |
| Search by pathways | No | No | No | No | Yes | No | No | No | Yes |
| Disease Related info available | Yes | Yes | Yes | Yes | Yes | No | Yes(link to OMIM) | Yes | No |
| Domain info for the proteins | No | Yes (links available) | No | Yes (links available) | Yes (link to SCOP) | No | Yes | Yes | No |
| Search by rs# | Yes | Yes | No | Yes | Yes | Yes | Yes | Yes | Yes |
| Search by gene | Yes | No | No | Yes | Yes | Yes | Yes | Yes | Yes |
Table shows the differences and the similarities of the resources for their search options and background information
| SNPs3D | PolyPhen | TopoSNP | SNPeffect | LS-SNP | MutDB | SNP@Domain | Snap | StSNP | |
|---|---|---|---|---|---|---|---|---|---|
| Description | Website which assigns molecular functional effects of nsSNPs based on structure and sequence analysis. | An automatic tool for prediction of possible impact of an amino acid substitution on the structure and function of a human protein. | Provides an online resource for analyzing nsSNPs that can be mapped onto known 3D structures of proteins. These include disease associated nsSNPs derived from OMIM db and other nsSNPs derived from dbSNP. | an online resource of human ns SNPs (nsSNPs) mapping phenotypic effects of allelic variation in human genes. | Maps nsSNPs onto protein sequences, functional pathways and comparative protein structure models and predicts positions where nsSNPs destabilize proteins, interfere with the formation of domain-domain interfaces, have an effect on protein-ligand binding, or severely impact human health. | MutDB is to annotate human variation data with protein structural information and other functionally relevant information. | SNP@Domain is a web resource, to identify SNPs within human protein domains. | A medical-focus database which can provide user comprehensive information of a single gene and relationship between genes based on SNPs. | Compare structural nsSNP distributions in many proteins or protein complexes. StSNP enables to researchers to map nsSNPs onto protein structures and visualize their structural locations by using the multiple structure-sequence viewer Friend. StSNP includes human nsSNPs. |
| Sources | dbSNP, HGMD, PDB, PQS, OMIM, SwissPro, LocusLink, GO, KEGG, Mouse Knockout | HGVbase,PDB, dbSNP | dbSNP, OMIM, PFAM | dbSNP, NCBI-Protein, PDB, GO, Pfam, OMIM, CSA, SwitchPDB | dbSNP, SwissProt/TrEMBL, LocusLink, PDB | Swiss-Prot and dbSNP, PDB | SCOP, Pfam, Ensembl database, dbSNP | EnsEMBL, UCSC, Swiss-Prot, Pfam, DAS-CBS, KEGG, MINT, BIND, OMIM | dbSNP, KEGG, PDB, HapMap, LocusLink, NCBI-Protein |
| Links | HGMD, dbSNP, OMIM, PubMed, KEGG | almost all the possible links after the selection of a structure | PDB | Several links (almost all the possible links). | SwissProt/TrEMBL, USCSGenome_browser for SNPs, PDBsum, SCOP | dbSNP, NCBI-Protein, -mrna,-nucleotide, Swiss-Prot | Ensemble, SCOP, Pfam, SIFT, OMIM | EnsEMBL, OMIM | dbSNP, PDB, NCBI Protein, NCBI Chromosome Map |
| Display Info | Gene info, SNP info from HGDV, visualization by Jmol/RasMol Applet | Info for the query only entered SNP location and substitutions, Prediction (benign or damaging), scoring | nsSNP info, available PDB info, Visualization of mapping of nsSNPs is available with MDL's Chime Plugin | nsSNP info, rs#, functional sites, cellular processing of the protein, PDB structure, disease relation, structure information of the protein with wild type and changed aa. | SwissPro ID, rs#, nsSNP info, interface domain info. | Graphic display of exons, SNPs and related data. | Graphic display of domain and SNPs, 2D and 3D visualization, links to external sources. | Gene relation view, disease list, polymorphism statistics, gene sequence, primer parameters, domain info, sequence primer design, gene info | Graphic display of alignment, Protein and nsSNP info, rs#, pathway info, graphic display of pathways. Visualization of modeling is available with Friend software. |
| Visualization | Viewable with Jmol | Not available | Viewable in the Chime window. It will need MDL's Chime Plugin to view. | Not available | RasMol | PyMOL, UCSF's Chimera | MDL Chime Plugin | Gene relation view by GraphViz | Visualization of modeling is available with Friend viewer. |
| User Friendliness | Easy to use, visualization of the pages easy, everything can be seen all at once in a summary | Difficult and results are coming in too many pages | Easy to use, query page is straight forward. | Easy to use, everything can be seen in a summary, detail information is available. | Easy to use and very explicit | Easy to use, results page is compact enough to reach data conveniently | Clear and well summarized results page | Well summarized but it still takes time to navigate. | Easy to use, Steps are easy to identify and understand. (biased by AU) |
| Update Status | Not available | No info | Regular intervals (no specific timing) depending on the sources | Regulary updated (no specific info) | Once or two times a year | Regular intervals, depending on the sources | - (no info) | Updated on a regular basis following the updates on the major sources, dbSNP, PDB, KEGG and others. | |
| How many nsSNPs exist in db? | 50 772, 29 485 can be modeled. | 50 919, 44 005 (unique rs entries) | 27 417 | 23 426 proteins with SNPs | 28 000 validated SNP | 1487 SNPs annotated with protein structures (based on related article) | 17 639 SNPs within SCOP and 28 238 SNPs within Pfam domains were identified. | 68 072 | 33 692 |
| Query options | rs#, gene name, protein accession id, keyword | Fasta input with position and substitution info, PDB,PQS,sorting by | Fasta format of protein sequence, options disease and nondisease associated nsSNPs. | rs#, RefSeq, SwissProt/TrEMBL, PDB, EC#, OMIM#, text search Filters with PDB or Disease Related | SwissPro, rs#, KEGG, HUGO gene, chromosome range | Keyword, gene symbol, refseq protein-id, rna-id, Swiss-Pro Gene Name | rs#, gene name/symbol, domain name/id | Keywords, rs#, protein,accession code from Ensembl and Uniprot/SWISSPROT, chromosome areas, Markers or clones | Protein Accesion id, PDB ID, keyword, rs#, metabolic pathways,gene, length, substitution search |
| Modeling | In house comparative modeling program (based on Modeller's approach) | Not applicable | Not mentioned | Not applicable | MODELLER | The mutations are highlighted in the structure. | Highlighting the amino acids affected by SNPs. | Not applicable | MODELLER |
| Cross References | AVAILABLE | AVAILABLE | AVAILABLE | AVAILABLE | AVAILABLE | AVAILABLE | AVAILABLE | AVAILABLE | AVAILABLE |
| Validated nsSNPs only | BOTH | BOTH | BOTH | OPTIONAL | BOTH | BOTH | BOTH | BOTH | BOTH |
| How many nsSNPs available to map by one search? | all available nsSNP can be mapped. | Not applicable | User dependable | Not applicable | 1 nsSNP per model | The mutations are highlighted in the structure. | Highlighting the amino acids affected by SNPs. | Not applicable | All available nsSNPs can be mapped. |
Figure 1.StSNP is an interactive web server, which utilizes several heterogeneous data sources.
Figure 2.Data generation in StSNP. (A) Main query page, (B) Formatted data for nsSNPs along with graphical alignment representation, (C) nsSNP(s) selection for modeling, (D) Output page, and (E) Visualization in the Friend applet.
Figure 3.(A) Glutathione S Transferase is shown with nsSNP locations displayed in ball and stick representation, with I105V marked with a black circle. The reference residues are shown in blue, nonsynonymous residues in red and the substrate glutathione is displayed in space fill representation (yellow). The query for the example was Protein ID NP_000843 and template PDB ID 1aqv chain B. (B) The Results section also provides a user with a link to glutathione metabolism in order to view other members found in the pathway.
Figure 4.Aldehyde dehydrogenase-2 is shown with nsSNP locations displayed in ball and stick representation, with E504K marked with a black circle. The reference residues are shown in blue, nonsynonymous residues in red and the substrate NAD is displayed in space fill representation (green). The query for the example was Protein ID NP_000681 and template PDB ID 1ag8 chain A.