Literature DB >> 33641184

DbStRiPs: Database of structural repeats in proteins.

Broto Chakrabarty1, Nita Parekh1.   

Abstract

Recent interest in repeat proteins has arisen due to stable structural folds, high evolutionary conservation and repertoire of functions provided by these proteins. However, repeat proteins are poorly characterized because of high sequence variation between repeating units and structure-based identification and classification of repeats is desirable. Using a robust network-based pipeline, manual curation and Kajava's structure-based classification schema, we have developed a database of tandem structural repeats, Database of Structural Repeats in Proteins (DbStRiPs). A unique feature of this database is that available knowledge on sequence repeat families is incorporated by mapping Pfam classification scheme onto structural classification. Integration of sequence and structure-based classifications help in identifying different functional groups within the same structural subclass, leading to refinement in the annotation of repeat proteins. Analysis of complete Protein Data Bank revealed 16,472 repeat annotations in 15,141 protein chains, one previously uncharacterized novel protein repeat family (PRF), named left-handed beta helix, and 33 protein repeat clusters (PRCs). Based on their unique structural motif, ~79% of these repeat proteins are classified in one of the 14 PRFs or 33 PRCs, and the remaining are grouped as unclassified repeat proteins. Each repeat protein is provided with a detailed annotation in DbStRiPs that includes start and end boundaries of repeating units, copy number, secondary and tertiary structure view, repeat class/subclass, disease association, MSA of repeating units and cross-references to various protein pattern databases, human protein atlas and interaction resources. DbStRiPs provides easy search and download options to high-quality annotations of structural repeat proteins (URL: http://bioinf.iiit.ac.in/dbstrips/).
© 2021 The Protein Society.

Entities:  

Keywords:  protein repeat database; proteins repeats; structural repeat proteins; tandem repeats

Mesh:

Year:  2021        PMID: 33641184      PMCID: PMC8740836          DOI: 10.1002/pro.4052

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  58 in total

1.  A census of protein repeats.

Authors:  E M Marcotte; M Pellegrini; T O Yeates; D Eisenberg
Journal:  J Mol Biol       Date:  1999-10-15       Impact factor: 5.469

2.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  An efficient algorithm for large-scale detection of protein families.

Authors:  A J Enright; S Van Dongen; C A Ouzounis
Journal:  Nucleic Acids Res       Date:  2002-04-01       Impact factor: 16.971

Review 4.  Comparison of ARM and HEAT protein repeats.

Authors:  M A Andrade; C Petosa; S I O'Donoghue; C W Müller; P Bork
Journal:  J Mol Biol       Date:  2001-05-25       Impact factor: 5.469

Review 5.  Consensus design of repeat proteins.

Authors:  Patrik Forrer; H Kaspar Binz; Michael T Stumpp; Andreas Plückthun
Journal:  Chembiochem       Date:  2004-02-06       Impact factor: 3.164

6.  NGL viewer: web-based molecular graphics for large complexes.

Authors:  Alexander S Rose; Anthony R Bradley; Yana Valasatava; Jose M Duarte; Andreas Prlic; Peter W Rose
Journal:  Bioinformatics       Date:  2018-11-01       Impact factor: 6.937

7.  New and continuing developments at PROSITE.

Authors:  Christian J A Sigrist; Edouard de Castro; Lorenzo Cerutti; Béatrice A Cuche; Nicolas Hulo; Alan Bridge; Lydie Bougueleret; Ioannis Xenarios
Journal:  Nucleic Acids Res       Date:  2012-11-17       Impact factor: 16.971

8.  Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega.

Authors:  Fabian Sievers; Andreas Wilm; David Dineen; Toby J Gibson; Kevin Karplus; Weizhong Li; Rodrigo Lopez; Hamish McWilliam; Michael Remmert; Johannes Söding; Julie D Thompson; Desmond G Higgins
Journal:  Mol Syst Biol       Date:  2011-10-11       Impact factor: 11.429

9.  ProRepeat: an integrated repository for studying amino acid tandem repeats in proteins.

Authors:  Hong Luo; Ke Lin; Audrey David; Harm Nijveen; Jack A M Leunissen
Journal:  Nucleic Acids Res       Date:  2011-11-18       Impact factor: 16.971

10.  Identifying tandem Ankyrin repeats in protein structures.

Authors:  Broto Chakrabarty; Nita Parekh
Journal:  BMC Bioinformatics       Date:  2014-12-30       Impact factor: 3.169

View more
  2 in total

1.  DbStRiPs: Database of structural repeats in proteins.

Authors:  Broto Chakrabarty; Nita Parekh
Journal:  Protein Sci       Date:  2021-03-06       Impact factor: 6.725

2.  Sequence and Structure-Based Analyses of Human Ankyrin Repeats.

Authors:  Broto Chakrabarty; Nita Parekh
Journal:  Molecules       Date:  2022-01-10       Impact factor: 4.411

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.