| Literature DB >> 33237313 |
Lisanna Paladin1, Martina Bevilacqua1, Sara Errigo1, Damiano Piovesan1, Ivan Mičetić1, Marco Necci1, Alexander Miguel Monzon1, Maria Laura Fabre2, Jose Luis Lopez2, Juliet F Nilsson2, Javier Rios3, Pablo Lorenzano Menna3, Maia Cabrera3, Martin Gonzalez Buitron3, Mariane Gonçalves Kulik4, Sebastian Fernandez-Alberti3, Maria Silvina Fornasari3, Gustavo Parisi3, Antonio Lagares2, Layla Hirsh5, Miguel A Andrade-Navarro4, Andrey V Kajava6, Silvio C E Tosatto1.
Abstract
The RepeatsDB database (URL: https://repeatsdb.org/) provides annotations and classification for protein tandem repeat structures from the Protein Data Bank (PDB). Protein tandem repeats are ubiquitous in all branches of the tree of life. The accumulation of solved repeat structures provides new possibilities for classification and detection, but also increasing the need for annotation. Here we present RepeatsDB 3.0, which addresses these challenges and presents an extended classification scheme. The major conceptual change compared to the previous version is the hierarchical classification combining top levels based solely on structural similarity (Class > Topology > Fold) with two new levels (Clan > Family) requiring sequence similarity and describing repeat motifs in collaboration with Pfam. Data growth has been addressed with improved mechanisms for browsing the classification hierarchy. A new UniProt-centric view unifies the increasingly frequent annotation of structures from identical or similar sequences. This update of RepeatsDB aligns with our commitment to develop a resource that extracts, organizes and distributes specialized information on tandem repeat protein structures.Entities:
Year: 2021 PMID: 33237313 PMCID: PMC7778985 DOI: 10.1093/nar/gkaa1097
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971