| Literature DB >> 15608192 |
S Velankar1, P McNeil, V Mittard-Runte, A Suarez, D Barrell, R Apweiler, K Henrick.
Abstract
The Macromolecular Structure Database (MSD) group (http://www.ebi.ac.uk/msd/) continues to enhance the quality and consistency of macromolecular structure data in the worldwide Protein Data Bank (wwPDB) and to work towards the integration of various bioinformatics data resources. One of the major obstacles to the improved integration of structural databases such as MSD and sequence databases like UniProt is the absence of up to date and well-maintained mapping between corresponding entries. We have worked closely with the UniProt group at the EBI to clean up the taxonomy and sequence cross-reference information in the MSD and UniProt databases. This information is vital for the reliable integration of the sequence family databases such as Pfam and Interpro with the structure-oriented databases of SCOP and CATH. This information has been made available to the eFamily group (http://www.efamily.org.uk/) and now forms the basis of the regular interchange of information between the member databases (MSD, UniProt, Pfam, Interpro, SCOP and CATH). This exchange of annotation information has enriched the structural information in the MSD database with annotation from wider sequence-oriented resources. This work was carried out under the 'Structure Integration with Function, Taxonomy and Sequences (SIFTS)' initiative (http://www.ebi.ac.uk/msd-srv/docs/sifts) in the MSD group.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15608192 PMCID: PMC540012 DOI: 10.1093/nar/gki058
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1The database schema supporting the MSD to UniProt residue-level mapping. MSD components are in white, external database components in yellow and the cross-reference components in green.
Current status of the MSD to Uniprot residue-level mappings
| Total MSD entries | 27 259 |
| Entries with no possible Uniprot cross-reference | 2 196 |
| Entries with UniProt cross-reference | 24 665 (98%) |
| Entries with residue-level mapping | 24 218 (97%) |
| Entries awaiting mapping | 845 |