| Literature DB >> 17921498 |
Hidetoshi Kono1, Tomo Yuasa, Shinya Nishiue, Kei Yura.
Abstract
We have developed coliSNP, a database server (http://yayoi.kansai.jaea.go.jp/colisnp) that maps non-synonymous single nucleotide polymorphisms (nsSNPs) on the three-dimensional (3D) structure of proteins. Once a week, the SNP data from the dbSNP database and the protein structure data from the Protein Data Bank (PDB) are downloaded, and the correspondence of the two data sets is automatically tabulated in the coliSNP database. Given an amino acid sequence, protein name or PDB ID, the server will immediately provide known nsSNP information, including the amino acid mutation caused by the nsSNP, the solvent accessibility, the secondary structure and the flanking residues of the mutated residue in a single page. The position of the nsSNP within the amino acid sequence and on the 3D structure of the protein can also be observed. The database provides key information with which to judge whether an observed nsSNP critically affects protein function and/or stability. As far as we know, this is the only web-based nsSNP database that automatically compiles SNP and protein information in a concise manner.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17921498 PMCID: PMC2238833 DOI: 10.1093/nar/gkm801
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.The coliSNP search interface. The user can use the protein section, SNP section or both to set the search conditions.
Figure 2.A typical search result. nsSNP information is provided with structural information on the mutated amino acid residue—e.g. the secondary structure and solvent accessibility.
Figure 3.Cumulative plots of tolerant, partially tolerant and intolerant sites in Lac repressor against the solvent accessibility. In the experiment (17), 12 or 13 mutations (depending on the identity of the wild-type residue) were tested at 124 sites. We defined the tolerance at each site as follows: tolerant, <5 of the mutations cause loss of function (45 sites); intolerant, >8 of the mutations cause loss of function (69 sites); and partially tolerant, 5–8 of the mutations cause loss of function (10 sites). The solvent accessibility was calculated using the program ASC (19) with a protein–DNA complex form (PDB:1EFA) or a tetrameric form (PDB:1LBI), depending on the site considered.