| Literature DB >> 22069583 |
Florence Jungo1, Anne Estreicher, Amos Bairoch, Lydie Bougueleret, Ioannis Xenarios.
Abstract
Peptide toxins synthesized by venomous animals have been extensively studied in the last decades. To be useful to the scientific community, this knowledge has been stored, annotated and made easy to retrieve by several databases. The aim of this article is to present what type of information users can access from each database. ArachnoServer and ConoServer focus on spider toxins and cone snail toxins, respectively. UniProtKB, a generalist protein knowledgebase, has an animal toxin-dedicated annotation program that includes toxins from all venomous animals. Finally, the ATDB metadatabase compiles data and annotations from other databases and provides toxin ontology.Entities:
Keywords: ATDB; ArachnoServer; ConoServer; Tox-Prot; UniProtKB/Swiss-Prot; animal toxin; database; venom protein
Mesh:
Substances:
Year: 2010 PMID: 22069583 PMCID: PMC3202812 DOI: 10.3390/toxins2020262
Source DB: PubMed Journal: Toxins (Basel) ISSN: 2072-6651 Impact factor: 4.546
Figure 1Venom protein entries in UniProtKB/Swiss-Prot (UniProt release 15.10 of 3 November 2009). Out of a total of 3’579 venom proteins, the sequence of 1’375 (38%) was obtained by translation of a nucleotide sequence (cDNA or genomic DNA). 1’528 sequences (43%) were obtained by direct protein sequencing, and 669 toxins (19%) were sequenced at both protein and nucleotide levels.
Summary table.
| ArachnoServer | ConoServer | UniProtKB/Swiss-Prot | ATDB (metadatabase) | |
|---|---|---|---|---|
| Topic | Spider toxins | Conotoxins | General interest, includes animal toxins and their targets | Animal toxins and their targets |
| Mature peptides and precursors | Mature peptides and precursors | Longest available protein sequences (precursors or mature sequences) (nucleotide sequences available via cross-references to EMBL/GenBank/DDBJ) | Protein sequences (as in source database), and some nucleotide sequences | |
| Nucleotide sequences | Nucleotide sequences | |||
| Tools | BLAST | Sequence search, Fingerprint Search | BLAST, ProtParam, Compute pI/MW, ProtScale | BLAST |
| Clustal alignment, Digest Peptide, Analyse prosequence | PeptideMass, PeptideCutter | |||
| Cross-references | EMBL, UniProtKB | GenBank, GenPept, UniProtKB | EMBL/GenBank/DDBJ | GenBank, GenPept, UniProtKB, SCORPION2 (obsolete) |
| ArachnoServer | Snake neurotoxin database (obsolete) | |||
| ConoServer (work-in-process) | MOLLUSK (obsolete) | |||
| Sequence download | In FASTA format | In FASTA format | In FASTA format | In FASTA format |
| Experimental PTMs (direct and indirect evidences indicated) | Experimental PTMs (disulfide bonds only indicated in cysteine framework) | Experimental PTMs (direct and indirect evidences indicated) | As in source database. | |
| Search | Via the search tool or the browser | Only disulfide bonds (via the cysteine framework) | Via the search tool | Only disulfide bonds |
| Shown if available | Shown if available | Available via cross-references to PDBsum and SMR (3D modeling) | Shown if available | |
| Cross-references | PDB | PDB | PDB | PDB |
| BMRB | PDBsum | |||
| SMR | ||||
| In ‘Description’ (free text) and ‘Biological Activity’ (controlled vocabulary) | In ‘Classification’, a Greek letter indicates the target | In ‘General annotation (free text) and ‘Ontologies’ (controlled vocabulary) | In ‘Description’ using ‘Toxin ontology’ terms (controlled vocabulary). | |
| As in source database (free text) | ||||
| Toxin effectiveness | In ‘Biological activity’. | Not indicated | In ‘General annotation’ | As in source database |
| Animal testing (LD(50), ED(50), PD(50)) | In ‘Biological activity’. | Not indicated | In ‘General annotation’ | As in source database |
| Channel and receptor nomenclature | IUPHAR | Not indicated | IUPHAR | IUPHAR |
| HGNC (only for human proteins and propagated to mammalian orthologs) | HGNC(only for mammalian proteins and propagated to mammalian orthologs) | |||
| In ‘Description’ (free text) and ‘Biological Activity’. | none | In ‘General annotation’ | As in source database | |
| (EC number in ‘Protein names’) | ||||
| Active site and Metal binding | Not indicated | In ‘Sequence annotation (Features)’ | As in source database | |
| Cross-references | MEROPS | MEROPS | ||
| BRENDA | ||||
| Creation of new names and names from the literature | Names from the literature | Names from the literature | As in source database |
Figure 2ArachnoServer entry of U2-agatoxin-Ao1j. Mature toxin with three disulfide bonds and amidated C-terminus are graphically shown in the first box. The sequence of the precursor protein is shown in the ‘Protein Information’, signal and propeptide sequences are colored in blue and red, respectively.
Figure 3Parts of ConoServer toxin entries. (a) Ac1.1a precursor entry. Signal peptides and mature peptide ranges are indicated in ‘Sequence region’. (b) Mature toxin Ac1.1a entry. The C-terminal modified residue is reported in ‘Sequence’. Disulfide connectivity is indicated in ‘Classification’. A description of the cystine framework and pattern is provided in the documentation of the database available from the homepage (See ‘Useful URLs’).
Figure 4Highlight of the ‘Sequence annotation’ section of the scorpion potassium channel toxin alpha-KTx 3.6 entry in UniProtKB (UniProtKB accession number Q9NII7).
Figure 5The ArachnoServer browser tool allows toxin retrieval based on their molecular targets.
Figure 6‘General annotation’ section of the L-amino-acid oxidase entry in UniProtKB (accession number P81382). The enzyme-specific annotation is found in the ‘Catalytic activity’, ‘Cofactor’, and ‘Biophysicochemical properties’ subsections.
Figure 7ATDB entry of Guangxitoxin-1E (ATDB accession number AT0001661).
Figure 8The ATDB browser allows the retrieval of toxins on the basis of their disulfide framework and on the taxonomical group of interest ().