| Literature DB >> 17921502 |
Gábor E Tusnády1, Lajos Kalmár, István Simon.
Abstract
The Topology Data Bank of Transmembrane Proteins (TOPDB) is the most complete and comprehensive collection of transmembrane protein datasets containing experimentally derived topology information currently available. It contains information gathered from the literature and from public databases available on the internet for more than a thousand transmembrane proteins. TOPDB collects details of various experiments that were carried out to learn about the topology of particular transmembrane proteins. In addition to experimental data from the literature, an extensive collection of structural data was also compiled from PDB and from PDBTM. Because topology information is often incomplete, for each protein in the database the most probable topology that is consistent with the collected experimental constraints was also calculated using the HMMTOP transmembrane topology prediction algorithm. Each record in TOPDB also contains information on the given protein sequence, name, organism and cross references to various other databases. The web interface of TOPDB includes tools for searching, relational querying and data browsing as well as for visualization. TOPDB is designed to bridge the gap between the number of transmembrane proteins available in sequence databases and the publicly accessible topology information of experimentally or computationally studied transmembrane proteins. TOPDB is available at http://topdb.enzim.hu.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17921502 PMCID: PMC2238857 DOI: 10.1093/nar/gkm751
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Distribution of experiment types over the TOPDB entries and the total topology data
| Experimental type | Entry counts | Topology data counts |
|---|---|---|
| Fusion | 647 | 3859 |
| Post-translational modification | 31 | 134 |
| Protease | 63 | 259 |
| Immunolocalization | 66 | 253 |
| Chemical modification | 21 | 167 |
| Structure | 820 | 18 405 |
| Other | 22 | 88 |
| Total | 1497 | 23 162 |
A detailed description of the various experiment types may be found under the documents section of the TOPDB website.
Figure 1.A representation of the interactive flash movie found under the documents section of the TOPDB website. It serves to depict eukaryotic, prokaryotic and special membrane types. The animation assists the determination of interior and exterior membrane faces. In the given image the chloroplast membranes can be seen.
Figure 2.The graph shows the topology of TOPDB entry of mouse multidrug resistance protein 1 (AP00199). The distinct experiments have unique identifiers (e.g. PDB ID, PUBMED ID, etc.), and if the topology data are derived from homologous proteins the identifier is extended with the TOPDB ID. The graph is colour coded to show protein segment localization: membrane interior (red), membrane exterior (blue) and transmembrane (yellow). In the interactive view, additional information can be obtained by rolling the mouse over the individual bars, e.g. experiment type, residue position and cross references.
Comparison of TOPDB topology data with other datasets
| Name | Nentry | TMHTOPDB | TMHX | TMHsame | AllTMHsame | TOPsame |
|---|---|---|---|---|---|---|
| UniProt | 696 | 2655 | 2669 (97%) | 2579 (97%) | 619 (89%) | 614 (88%) |
| TMHMM | 63 | 285 | 284 (99%) | 282 (99%) | 60 (95%) | 60 (95%) |
| Moller | 85 | 366 | 371 (98%) | 360 (98%) | 80 (94%) | 79 (93%) |
| TMPDB | 154 | 796 | 809 (97%) | 779 (98%) | 154 (90%) | 125 (81%) |
Abbreviations: Nentry: number of entries with larger than 80% reliability in TOPDB and were also found in the dataset/database compared; TMHTOPDB: number of transmembrane helices of the common entries in TOPDB; TMHx: number of transmembrane helices in the dataset/database compared; TMHsame: number of transmembrane helices that have the same sequential position in TOPDB and the database/dataset compared; AllTMHsame: number of entries where all transmembrane helices positions are the same in; and TOPsame: number of entries whose topologies are the same in TOPDB and in the dataset/database compared.