Literature DB >> 16381843

CyBase: a database of cyclic protein sequence and structure.

Jason P Mulvenna1, Conan Wang, David J Craik.   

Abstract

CyBase is a curated database and information source for backbone-cyclized proteins. The database incorporates naturally occurring cyclic proteins as well as synthetic derivatives, grafted analogues and acyclic permutants. The database provides a centralized repository of information on all aspects of cyclic protein biology and addresses issues pertaining to the management and searching of topologically circular sequences. The database is freely available at http://research.imb.uq.edu.au/cybase.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 16381843      PMCID: PMC1347368          DOI: 10.1093/nar/gkj005

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION AND MOTIVATION

In recent years a number of proteins have been discovered that contain a macrocyclic backbone consisting of a continuous cycle of peptide bonds. Such macrocyclic proteins were unknown a decade ago but have now been discovered in bacteria, plants and animals (1). Unlike bacterial polyketides and small cyclic peptides such as cyclosporin that are constructed by peptide synthetases (2), these new cyclic proteins are ribosomally produced gene products, with backbone cyclization occurring as a post-translational modification. This new class of protein has excited interest because circular proteins have a range of advantages over conventional proteins (3,4). At least one class of cyclic proteins, the cyclotides, has been shown to be resistant to proteolysis and to a wide variety of adverse thermal and chemical conditions (5,6). Furthermore, since the termini of conventional proteins are often flexible, and as the degree of flexibility can be reduced by cyclization, entropic factors can lead to improved receptor binding affinities of circular proteins over corresponding acyclic proteins (7,8). Five classes of naturally occurring proteins contain cyclic examples. These include the cyclic sex-pilin (9) and three bacteriocins (10–12) from bacterial sources, trypsin inhibitors from sunflower seeds (SFTI-1) (7,8) and the squash plant Momordica cochinchinensis (MCoTI-II) (13), the θ-defensins from macaque monkeys (14) and the cyclotides from plants of the Violaceae and Rubiaceae (15–17). The cyclotides are by far the largest family of circular proteins and ∼60 cyclotide sequences have been reported thus far. Screening programs suggest that the number of sequences may soon number in the thousands (18,19). In addition to this natural diversity, a large number of synthetic mutants, grafted analogues and acyclic permutants of cyclic proteins have been reported, and several proteins of biological interest have been artificially cyclized (20–22). This growth in information necessitates a collation of sequence/structure/function data and the development of a uniform nomenclature to prevent duplication of research and multiple naming schemes. Furthermore, sequence searching of cyclic proteins adds an extra layer of complexity, with most sequence searching tools assuming a linear sequence of amino acids. Because backbone cyclization is a ‘seamless’ post-translational modification, the location of the N- and C-termini cannot be determined from the mature sequence alone. Consequently, when searching cyclic sequences, the point at which the sequence begins and ends in the database is often arbitrary and may confound traditional searching techniques. The special considerations needed for dealing with cyclic sequences and the rapidly expanding data on their structure and function has led us to develop a curated database and web-based information source for cyclic proteins called CyBase.

APPLICATION AND DISCUSSION

CyBase incorporates a MySQL database that contains a repository of information on the amino acid and nucleic acid sequence, structure and activity of cyclic proteins. The layout of the database is shown in Figure 1. At the core of the database is the protein table, which contains information on each cyclic protein characterized. Related to each protein table entry are tables containing information on nucleic acid sequences, structure, activity and literature references. A web-based interface provides access to the information and allows for text-based searching on all data fields and filtering of results by class, source, activity or other attributes. To account for the cyclic nature of the sequences any sequence search uses a concatenation of two copies of the linear representation of the sequence to simulate a cyclic protein. Each protein has a dynamically produced sequence card, which provides cross-links to activity, nucleic acid sequence and structural information contained in the database. As the database is intended to supplement existing biological databases, links to UniProt Knowledgebase, Genbank and PDB are available for each entry and linkage with the KNOTTIN website (23) is planned. A number of other tools are provided, including coloured alignments and calculation of amino acid frequencies of selected sequences.
Figure 1

Schematic of the relational database underlying CyBase.

In general, naturally occurring cyclic proteins are small, with the largest possessing 78 amino acids. This small size makes the combination of mass spectrometry, to obtain sequence information, and NMR, to determine 3D structures, ideal for characterizing these proteins. Accordingly, to facilitate the rapid characterization of newly discovered proteins the database can be queried on molar mass, and for cyclotides, the capability exists for searching on the mass of fragments corresponding to particular inter-cysteine loops, facilitating sequence determination when utilizing reduction/alkylation of cysteine residues and tandem mass spectrometry (24,25). Analysis of NMR-derived data such as chemical shifts and patterns of NOE connectivity can provide an early indication of the structure of a protein. To facilitate rapid structural characterization of newly discovered cyclic proteins chemical shift and restraint data from NMR-derived structures are included in the database, along with dihedral angle information. From these data, distances and regions containing defined secondary structure are calculated and stored in the database. These data can be presented visually for the analysis of short- and long-range NOE patterns, the backbone dihedral angles and chemical shift patterns. Although NMR is the most common technique for analysing these proteins, X-ray structures are also incorporated into the database and sets of inter-atom distances calculated for comparative purposes. As with the protein and nucleic acid entries, each structure possesses an information card, which contains cross-links to protein and nucleic acid entries. Updating of the database is facilitated by a range of PERL and PHP scripts. These provide for the automated searching of sequence databases, using BLAST, to provide examples of novel cyclic proteins, and ensure quality control by preventing duplication of sequence data and renaming of already characterized sequences, a particularly important consideration for the cyclotide family, which contains potentially many sequences that may occur in a number of different species. These scripts also provide for the standardizing of residue numbering in new cyclotide structures. Despite these automations the addition of new entries is performed manually to ensure maximum quality of database entries. We plan to extend the database by utilizing the growing number of cyclotide structures to provide predictions of cyclotide secondary structures based on primary sequence and to develop methods to search the structures in the database based on the similarity between selected inter-atomic distances and NOE connectivities. We also plan to improve the information content of the database by including hydrogen bond and other structural information as well as homology models for cyclotide sequences that have not yet been structurally characterized. CyBase is available at and given the growing interest in backbone cyclization, it is hoped that CyBase will prove to be a useful resource in the field of structural biology. Suggestions should be directed to D.C.
  25 in total

1.  Plant cyclotides: A unique family of cyclic and knotted proteins that defines the cyclic cystine knot structural motif.

Authors:  D J Craik; N L Daly; T Bond; C Waine
Journal:  J Mol Biol       Date:  1999-12-17       Impact factor: 5.469

2.  Acyclic permutants of naturally occurring cyclic proteins. Characterization of cystine knot and beta-sheet formation in the macrocyclic polypeptide kalata B1.

Authors:  N L Daly; D J Craik
Journal:  J Biol Chem       Date:  2000-06-23       Impact factor: 5.157

3.  The effect of backbone cyclization on the thermodynamics of beta-sheet unfolding: stability optimization of the PIN WW domain.

Authors:  Songpon Deechongkit; Jeffery W Kelly
Journal:  J Am Chem Soc       Date:  2002-05-08       Impact factor: 15.419

Review 4.  Circular proteins--no end in sight.

Authors:  Manuela Trabi; David J Craik
Journal:  Trends Biochem Sci       Date:  2002-03       Impact factor: 13.807

Review 5.  Oldenlandia affinis (R&S) DC. A plant containing uteroactive peptides used in African traditional medicine.

Authors:  L Gran; F Sandberg; K Sletten
Journal:  J Ethnopharmacol       Date:  2000-06       Impact factor: 4.360

6.  Squash trypsin inhibitors from Momordica cochinchinensis exhibit an atypical macrocyclic structure.

Authors:  J F Hernandez; J Gagnon; L Chiche; T M Nguyen; J P Andrieu; A Heitz; T Trinh Hong; T T Pham; D Le Nguyen
Journal:  Biochemistry       Date:  2000-05-16       Impact factor: 3.162

7.  In vivo protein cyclization promoted by a circularly permuted Synechocystis sp. PCC6803 DnaB mini-intein.

Authors:  Neal K Williams; Pavel Prosselkov; Edvards Liepinsh; Inara Line; Anatoly Sharipo; Dene R Littler; Paul M G Curmi; Gottfried Otting; Nicholas E Dixon
Journal:  J Biol Chem       Date:  2001-12-12       Impact factor: 5.157

8.  Circular proteins in plants: solution structure of a novel macrocyclic trypsin inhibitor from Momordica cochinchinensis.

Authors:  M E Felizmenio-Quimio; N L Daly; D J Craik
Journal:  J Biol Chem       Date:  2001-04-05       Impact factor: 5.157

9.  Solution structures by 1H NMR of the novel cyclic trypsin inhibitor SFTI-1 from sunflower seeds and an acyclic permutant.

Authors:  M L Korsinczky; H J Schirra; K J Rosengren; J West; B A Condie; L Otvos; M A Anderson; D J Craik
Journal:  J Mol Biol       Date:  2001-08-17       Impact factor: 5.469

10.  Biosynthesis and insecticidal properties of plant cyclotides: the cyclic knotted proteins from Oldenlandia affinis.

Authors:  C Jennings; J West; C Waine; D Craik; M Anderson
Journal:  Proc Natl Acad Sci U S A       Date:  2001-09-04       Impact factor: 11.205

View more
  36 in total

1.  Retrocyclins kill bacilli and germinating spores of Bacillus anthracis and inactivate anthrax lethal toxin.

Authors:  Wei Wang; Chandrika Mulakala; Sabrina C Ward; Grace Jung; Hai Luong; Duy Pham; Alan J Waring; Yiannis Kaznessis; Wuyuan Lu; Kenneth A Bradley; Robert I Lehrer
Journal:  J Biol Chem       Date:  2006-06-21       Impact factor: 5.157

2.  A novel suite of cyclotides from Viola odorata: sequence variation and the implications for structure, function and stability.

Authors:  David C Ireland; Michelle L Colgrave; David J Craik
Journal:  Biochem J       Date:  2006-11-15       Impact factor: 3.857

Review 3.  AS-48 bacteriocin: close to perfection.

Authors:  Marina Sánchez-Hidalgo; Manuel Montalbán-López; Rubén Cebrián; Eva Valdivia; Manuel Martínez-Bueno; Mercedes Maqueda
Journal:  Cell Mol Life Sci       Date:  2011-05-17       Impact factor: 9.261

4.  Two Blast-independent tools, CyPerl and CyExcel, for harvesting hundreds of novel cyclotides and analogues from plant genomes and protein databases.

Authors:  Jun Zhang; Zhengshuang Hua; Zebo Huang; QiZhu Chen; Qingyun Long; David J Craik; Alan J M Baker; Wensheng Shu; Bin Liao
Journal:  Planta       Date:  2014-12-21       Impact factor: 4.116

5.  Molecular cloning and in silico characterization of knottin peptide, U2-SCRTX-Lit2, from brown spider (Loxosceles intermedia) venom glands.

Authors:  Gabriel Otto Meissner; Pedro Túlio de Resende Lara; Luis Paulo Barbour Scott; Antônio Sérgio Kimus Braz; Daniele Chaves-Moreira; Fernando Hitomi Matsubara; Eduardo Mendonça Soares; Dilza Trevisan-Silva; Luiza Helena Gremski; Silvio Sanches Veiga; Olga Meiri Chaim
Journal:  J Mol Model       Date:  2016-08-03       Impact factor: 1.810

6.  Gas-Phase Sequencing of Cyclotides: Introduction of Selective Ring Opening at Dehydroalanine via Ion/Ion Reaction.

Authors:  David J Foreman; Nicole C Parsley; John T Lawler; Uma K Aryal; Leslie M Hicks; Scott A McLuckey
Journal:  Anal Chem       Date:  2019-12-03       Impact factor: 6.986

7.  Expression of fluorescent cyclotides using protein trans-splicing for easy monitoring of cyclotide-protein interactions.

Authors:  Krishnappa Jagadish; Radhika Borra; Vanessa Lacey; Subhabrata Majumder; Alexander Shekhtman; Lei Wang; Julio A Camarero
Journal:  Angew Chem Int Ed Engl       Date:  2013-01-15       Impact factor: 15.336

Review 8.  Design of cyclized selective melanotropins.

Authors:  Minying Cai; Victor J Hruby
Journal:  Biopolymers       Date:  2016-11       Impact factor: 2.505

9.  Distribution and evolution of circular miniproteins in flowering plants.

Authors:  Christian W Gruber; Alysha G Elliott; David C Ireland; Piero G Delprete; Steven Dessein; Ulf Göransson; Manuela Trabi; Conan K Wang; Andrew B Kinghorn; Elmar Robbrecht; David J Craik
Journal:  Plant Cell       Date:  2008-09-30       Impact factor: 11.277

Review 10.  Cyclotides, a versatile ultrastable micro-protein scaffold for biotechnological applications.

Authors:  Julio A Camarero
Journal:  Bioorg Med Chem Lett       Date:  2017-10-21       Impact factor: 2.823

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.