Literature DB >> 18811932

SSMap: a new UniProt-PDB mapping resource for the curation of structural-related information in the UniProt/Swiss-Prot Knowledgebase.

Fabrice P A David1, Yum L Yip.   

Abstract

BACKGROUND: Sequences and structures provide valuable complementary information on protein features and functions. However, it is not always straightforward for users to gather information concurrently from the sequence and structure levels. The UniProt knowledgebase (UniProtKB) strives to help users on this undertaking by providing complete cross-references to Protein Data Bank (PDB) as well as coherent feature annotation using available structural information. In this study, SSMap - a new UniProt-PDB residue-residue level mapping - was generated. The primary objective of this mapping is not only to facilitate the two tasks mentioned above, but also to palliate a number of shortcomings of existent mappings. SSMap is the first isoform sequence-specific mapping resource and is up-to-date for UniProtKB annotation tasks. The method employed by SSMap differs from the other mapping resources in that it stresses on the correct reconstruction of the PDB sequence from structures, and on the correct attribution of a UniProtKB entry to each PDB chain by using a series of post-processing steps.
RESULTS: SSMap was compared to other existing mapping resources in terms of the correctness of the attribution of PDB chains to UniProtKB entries, and of the quality of the pairwise alignments supporting the residue-residue mapping. It was found that SSMap shared about 80% of the mappings with other mapping sources. New and alternative mappings proposed by SSMap were mostly good as assessed by manual verification of data subsets. As for local pairwise alignments, it was shown that major discrepancies (both in terms of alignment lengths and boundaries), when present, were often due to differences in methodologies used for the mappings.
CONCLUSION: SSMap provides an independent, good quality UniProt-PDB mapping. The systematic comparison conducted in this study allows the further identification of general problems in UniProt-PDB mappings so that both the coverage and the quality of the mappings can be systematically improved for the benefit of the scientific community. SSMap mapping is currently used to provide PDB cross-references in UniProtKB.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18811932      PMCID: PMC2567350          DOI: 10.1186/1471-2105-9-391

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  13 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Molecular basis of inherited diseases: a structural perspective.

Authors:  Robert E Steward; Malcolm W MacArthur; Roman A Laskowski; Janet M Thornton
Journal:  Trends Genet       Date:  2003-09       Impact factor: 11.639

Review 3.  A structural perspective on protein-protein interactions.

Authors:  Robert B Russell; Frank Alber; Patrick Aloy; Fred P Davis; Dmitry Korkin; Matthieu Pichaud; Maya Topf; Andrej Sali
Journal:  Curr Opin Struct Biol       Date:  2004-06       Impact factor: 6.809

4.  iMolTalk: an interactive, internet-based protein structure analysis server.

Authors:  Alexander V Diemand; Holger Scheib
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

Review 5.  Protein variety and functional diversity: Swiss-Prot annotation in its biological context.

Authors:  Brigitte Boeckmann; Marie-Claude Blatter; Livia Famiglietti; Ursula Hinz; Lydie Lane; Bernd Roechert; Amos Bairoch
Journal:  C R Biol       Date:  2005-07-28       Impact factor: 1.583

6.  Mapping PDB chains to UniProtKB entries.

Authors:  Andrew C R Martin
Journal:  Bioinformatics       Date:  2005-09-27       Impact factor: 6.937

Review 7.  Human sulfatases: a structural perspective to catalysis.

Authors:  D Ghosh
Journal:  Cell Mol Life Sci       Date:  2007-08       Impact factor: 9.261

8.  Seq2Struct: a resource for establishing sequence-structure links.

Authors:  Allegra Via; Andreas Zanzoni; Manuela Helmer-Citterich
Journal:  Bioinformatics       Date:  2004-09-28       Impact factor: 6.937

9.  E-MSD: an integrated data resource for bioinformatics.

Authors:  S Velankar; P McNeil; V Mittard-Runte; A Suarez; D Barrell; R Apweiler; K Henrick
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

10.  The Universal Protein Resource (UniProt).

Authors:  Amos Bairoch; Rolf Apweiler; Cathy H Wu; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Darren A Natale; Claire O'Donovan; Nicole Redaschi; Lai-Su L Yeh
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

View more
  8 in total

1.  Proteomes of host cell membranes modified by intracellular activities of Salmonella enterica.

Authors:  Stephanie Vorwerk; Viktoria Krieger; Jörg Deiwick; Michael Hensel; Nicole Hansmeier
Journal:  Mol Cell Proteomics       Date:  2014-10-27       Impact factor: 5.911

2.  A method for supporting retrieval of articles on protein structure analysis considering users' intention.

Authors:  Riku Kyogoku; Ryo Fujimoto; Tomonobu Ozaki; Takenao Ohkawa
Journal:  BMC Bioinformatics       Date:  2011-02-15       Impact factor: 3.169

3.  Systematic analysis of short internal indels and their impact on protein folding.

Authors:  RyangGuk Kim; Jun-tao Guo
Journal:  BMC Struct Biol       Date:  2010-08-04

4.  A generic approach to evaluate how B-cell epitopes are surface-exposed on protein structures.

Authors:  Virginie Lollier; Sandra Denery-Papini; Colette Larré; Dominique Tessier
Journal:  Mol Immunol       Date:  2010-12-15       Impact factor: 4.407

5.  From chemoproteomic-detected amino acids to genomic coordinates: insights into precise multi-omic data integration.

Authors:  Maria F Palafox; Heta S Desai; Valerie A Arboleda; Keriann M Backus
Journal:  Mol Syst Biol       Date:  2021-02       Impact factor: 11.429

6.  A database for allergenic proteins and tools for allergenicity prediction.

Authors:  ChangKug Kim; SooJin Kwon; GangSeob Lee; HwanKi Lee; JiWeon Choi; YongHwan Kim; JangHo Hahn
Journal:  Bioinformation       Date:  2009-04-21

7.  Easy retrieval of single amino-acid polymorphisms and phenotype information using SwissVar.

Authors:  Anaïs Mottaz; Fabrice P A David; Anne-Lise Veuthey; Yum L Yip
Journal:  Bioinformatics       Date:  2010-01-26       Impact factor: 6.937

8.  SNP2Structure: A Public and Versatile Resource for Mapping and Three-Dimensional Modeling of Missense SNPs on Human Protein Structures.

Authors:  Difei Wang; Lei Song; Varun Singh; Shruti Rao; Lin An; Subha Madhavan
Journal:  Comput Struct Biotechnol J       Date:  2015-09-30       Impact factor: 7.271

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.