Literature DB >> 18948296

PeroxiBase: a database with new tools for peroxidase family classification.

Dominique Koua1, Lorenzo Cerutti, Laurent Falquet, Christian J A Sigrist, Grégory Theiler, Nicolas Hulo, Christophe Dunand.   

Abstract

Peroxidases (EC 1.11.1.x), which are encoded by small or large multigenic families, are involved in several important physiological and developmental processes. They use various peroxides as electron acceptors to catalyse a number of oxidative reactions and are present in almost all living organisms. We have created a peroxidase database (http://peroxibase.isb-sib.ch) that contains all identified peroxidase-encoding sequences (about 6000 sequences in 940 organisms). They are distributed between 11 superfamilies and about 60 subfamilies. All the sequences have been individually annotated and checked. PeroxiBase can be consulted using six major interlink sections 'Classes', 'Organisms', 'Cellular localisations', 'Inducers', 'Repressors' and 'Tissue types'. General documentation on peroxidases and PeroxiBase is accessible in the 'Documents' section containing 'Introduction', 'Class description', 'Publications' and 'Links'. In addition to the database, we have developed a tool to classify peroxidases based on the PROSITE profile methodology. To improve their specificity and to prevent overlaps between closely related subfamilies the profiles were built using a new strategy based on the silencing of residues. This new profile construction method and its discriminatory capacity have been tested and validated using the different peroxidase families and subfamilies present in the database. The peroxidase classification tool called PeroxiScan is accessible at the following address: http://peroxibase.isb-sib.ch/peroxiscan.php.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 18948296      PMCID: PMC2686439          DOI: 10.1093/nar/gkn680

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Peroxidases are enzymes that use various peroxides (ROOH) as electron acceptors to catalyze a number of oxidative reactions. These peroxidases can be haem and non-haem proteins. They are extremely widespread and present in all living organisms. In mammals, they are implicated in biological processes as various as immune system or hormone regulation. In plants, they are involved in auxin metabolism, lignin and suberin formation, cross-linking of cell wall components, defense against pathogens or cell elongation. Humans contain more than 30 peroxidases whereas Arabidopsis thaliana has about 130 peroxidases that are grouped in 13 different families and nine subfamilies. There has been increased interest over the last few years in the role that mammalian haem peroxidase enzymes may play in both disease prevention and human pathologies. In general, haem peroxidases tend to promote rather than inhibit oxidative damage. Some mammalian haem peroxidases use H2O2 to generate more aggressive oxidants to fight intruding micro-organisms (1). Peroxidase families from prokaryotic organisms, protists and fungi have been shown to promote virulence (2–5). At the biochemical level, peroxidases can be found in the same enzyme sub-subclass E.C.1.11.1.x, donor:hydrogen-peroxide oxidoreductase (6). Currently, 15 different EC numbers have been ascribed to peroxidase: from EC 1.11.1.1 to EC 1.11.1.16 (EC 1.11.1.4 was removed) (7). Other peroxidase families with dual enzymatic domains were classified with the following numbers: EC 1.13. 11.44, EC 1.14.99.1, EC 1.6.3.1 and EC 4.1.1.44 (7). The two independent EC numbers (1.11.1.9 and 1.11.1.12) both correspond to glutathione peroxidase and are based on the electron acceptor (hydrogen peroxide or lipid peroxide, respectively). Two particular cases are also observed for numbers EC 1.11.1.2 (NADPH peroxidase) and 1.11.1.3 (fatty acid peroxidase) and no known peroxidase sequence has been assigned to NADPH peroxidase. Peroxidasins, peroxinectins, other non-animal peroxidases, Dyp-type peroxidases, hybrid ascorbate-cytochrome C peroxidases and other Class II peroxidases do not possess their own EC number and can only be classified in EC 1.11.1.7. At the sequence level, most haem peroxidases belong to two large families, one mainly found in plants and also in bacteria and fungi (7, 8), and a second found mostly in animals (but also occasionally in some fungal and bacterial species) (9, 10). These two independent groups, though possessing weak sequence homology, can still be identified with a common signature (see InterPro entry IPR010255). In addition to these two large superfamilies, four smaller protein families are indexed as capable of reducing peroxide molecules with the help of haem. Catalases (Kat), which can also oxidize hydrogen peroxide (unique feature); Di-haem cytochrome C peroxidases (DiHCcP); Dyp-type peroxidases (DypPrx); and haem Haloperoxidases (HalPrx). These families display no sequence homology between each other. Non-haem peroxidases are not evolutionarily linked and form five independent families. The largest one is the thiol peroxidase, which currently contains more than 1000 members grouped in two different subfamilies (Glutathione peroxidases and Peroxiredoxines). Alkylhydroperoxidase, non-haem haloperoxidase, manganese catalase and NADH peroxidase are the remaining other four non haem peroxidase families. According to the phylogenetic trees these 11 major groups can be subdivided in 60 subfamilies (Figure 1). These subdivisions based on evolution describe quite well the variety of peroxidase functions and can thus be used to predict the function of newly characterized proteins.
Figure 1.

Schematic representation of the phylogenic relationships between the different protein classes and families found in PeroxiBase.

Schematic representation of the phylogenic relationships between the different protein classes and families found in PeroxiBase. Due to the high diversity of peroxidase functions and increased interest of the medical research in pathologies related to the role of peroxidases there is an urgent need to federate and organize data on peroxidases. The goal of our database is to centralize most sequences that belong to peroxidase superfamilies, to follow the evolution of peroxidase among living organism and to compile the information concerning putative functions and transcriptional regulation. Currently, PeroxiBase is a unique repository exclusively dedicated to peroxidase families and superfamilies from both Eukaryotes and Prokaryotes. It includes 6000 peroxidases encoding sequences from 940 organisms, and each sequence is individually annotated. We have also developed a new tool to facilitate the classification of new peroxidase members.

DATABASE INTERFACE ORGANIZATION

The PeroxiBase toolbar is divided into eight sections (Figure 2). The ‘Documents’ tab gives access to general information: ‘Introduction’, ‘Class description’, ‘Publications’ and ‘Links’. Several useful tools are available (‘Tools’) to classify and analyse peroxidases: ‘Search’ permits complex text queries on the database, ‘Blast’ allows a comparison between a query sequence and the peroxidases stored in PeroxiBase and, ‘FingerPrintscan’ and ‘PeroxiScan’ help classify a query sequence in the right group. The six following sections named ‘Classes’, ‘Organisms’, ‘Cellular localisations’, ‘Inducers’, ‘Repressors’ and ‘Tissue types’ permit the user to navigate within PeroxiBase using the specified criteria. Individual data sheets have been largely redesigned since the previous PeroxiBase publication (Figure 2). Last sequence changes, Reviewer and Last annotation changes fields exhibit the date of first entry (or of last sequence modification) with name of the contributor; the name of the curator who checked the entry, and the date of the last modification in any sections with name of the contributor, respectively. In an attempt to set up a unified nomenclature (Name field), we introduced a simple nomenclature based on species and class acronyms. The various original appellations have been conserved as synonyms in PeroxiBase. Class field refers to the class the peroxidase belongs based on the new PeroxiScan tool. Cellular localisation, Tissue type, Inducer and Repressor fields present data concerning the gene and protein expressions. These fields use fixed terms. Best BLASTp hits field reports the five closest hits to this entry obtained from daily updated BLAST searches. Protein ref, DNA ref, mRNA ref and Cluster/prediction ref fields refer to hyperlinks protein, DNA, mRNA sequences and cluster respectively. PeroxiBase entries are cross-referenced in UniProtKB (SwissProt/TREMBL). The data are stored in a MySQL relational database and the web interface is made of PHP and CGI/Perl scripts.
Figure 2.

Screenshot of one entry and description of the toolbar. The toolbar includes various sections. ‘Documents’ contains ‘Introduction’, ‘Class description’, ‘Publications’ (related to PeroxiBase) and ‘Links’ (specific and general databases). ‘Tools’ menu contains the following sections: ‘Search’ (multi criteria), ‘Blast’, ‘PeroxiScan’ and ‘FingerPrintscan’.

Screenshot of one entry and description of the toolbar. The toolbar includes various sections. ‘Documents’ contains ‘Introduction’, ‘Class description’, ‘Publications’ (related to PeroxiBase) and ‘Links’ (specific and general databases). ‘Tools’ menu contains the following sections: ‘Search’ (multi criteria), ‘Blast’, ‘PeroxiScan’ and ‘FingerPrintscan’.

DATA ACQUISITION AND INTEGRATION

The automatic annotation of the complete genomes of numerous organisms and the automatic clustering and assembling of EST sequences led to the identification of numerous sequences coding for different peroxidase families and superfamilies. However, the automatic processing of the sequences is known to be of poor quality or not as specific as expected. Using the highly conserved motifs of each peroxidase class, manual annotation and editing can clearly identify the correct sequences even in low-quality sequences. In order to increase data reliability, each new entry is individually controlled by a database curator. Each cross-reference is verified by the reviewer. The quality of the sequence is also examined by performing a sequence alignment with the other homologous sequences. Thank to the continuous release of numerous genome sequencing projects (525 in March 2007 and 843 in August 2008 according to the Genomes OnLine Database (11)) and EST libraries, existing entries can be updated and, as more annotated sequences are integrated, the organism coverage is also increased. Existing entries are frequently verified and updated if any changes have occurred.

NEW CLASSIFICATION TOOLS FOR PEROXIDASES

To facilitate the classification of newly sequenced peroxidase proteins, we have developed a tool, based on PROSITE profile methodology that takes advantage of the manually curated hierarchical classification of PeroxiBase. One major problem with subfamily classification is the difficulty in separating proteins due to their high degree of similarity at the sequence level. The main principle of our new approach is to build a PROSITE profile on the whole conserved region of each subfamily, but to make the profile more specific, residues that are conserved in the whole family are lightened and residues specific to each subfamily are emphasized. We started by merging all families that overlap to construct general alignments. In these alignments, we specifically tag well-conserved residues. The family alignments are then simply split (without modifying the alignment of residues) in several sub-alignments according to our subfamily classification. Each subfamily alignment now contains an annotation line where residues conserved in the whole family and residues specific to the subfamily are tagged. This annotation line is then used by our profile construction program to down weigh family-conserved columns and over weight subfamily-specific ones (see http://www.expasy.org/tools/subprofiler/subprofiler_help.html for more details). We first built profiles or used existing PROSITE profiles for the 11 major families that do not overlap. We used these profiles to build multiple sequence alignments (MSA) and integrate the PeroxiBase classification into the MSA. These 11 families were then split into 60 subfamilies according to the PeroxiBase classification. For each of the subfamilies the MSA contains annotation of residues conserved in the whole family and residues specific to the subfamily. This information was used to build 60 sub-profiles specific to each subfamily, which cover all the diversity of peroxidases. During the scanning process the various sub-profiles are in competition and only the best score is reported as is done for overlapping profiles in the PROSITE database (12). This sub-profile classification allows the identification of wrongly annotated sequences in PeroxiBase and reassignment of them to their correct sub-families. It has also improved the classification of some classes of peroxidases that were difficult to distinguish with classical tools. For example the classification of the Vanadium peroxidase has been separated into three subcategories (bromoperoxidase, chloroperoxidase and iodoperoxidase). Each profile is associated to a specific function or to a biological process in order to facilitate functional classification of newly discovered proteins. New sequences can be scanned against the subfamily profile peroxidases at the following address: http://peroxibase.isb-sib.ch/peroxiscan.php (Figure 3). Fine descriptions of the matching residues as well as matching scores can be obtained from a direct submission through MyHits web site (http://.myhits.isb-sib.ch) (13).
Figure 3.

The new PeroxiScan interface and result. PeroxiScan tool enables the identification of a given peroxidase sequence. PeroxiScan can be performed directly from one entry or independently from the ‘Tools’ section for an unknown sequence. Fine descriptions of the matching scores are available from a direct submission through MyHits web site.

The new PeroxiScan interface and result. PeroxiScan tool enables the identification of a given peroxidase sequence. PeroxiScan can be performed directly from one entry or independently from the ‘Tools’ section for an unknown sequence. Fine descriptions of the matching scores are available from a direct submission through MyHits web site.

FUTURE DEVELOPMENTS

The PeroxiBase is a unique, powerful and reliable database dedicated to a large superfamily composed of several families (multigenic or not) and present in all kingdoms. The database currently contains over 6000 complete or partial peroxidase-encoding sequences distributed among 60 different protein classes. The number of peroxidase families should not undergo major changes in the future. We expect only minor modifications in the sub-classification of a few classes due to better coverage and to the biochemical characterization of the enzymes. Profiles will be updated continuously to account for such modifications, thus maintaining high quality discriminators to pursue our effort in data mining of non-annotated sequences. Even with the large extension of the database (from 4700 in March 2007 (14) to 6026 in August 2008), it is still mainly composed of sequences originated from Viridiplantae (68%). The next step forward is to extend the coverage and to increase the number of sequences from exotic and poorly represented organisms. As the number of new sequences increases rapidly, the subsequent expansion of PeroxiBase will facilitate peroxidase gene-family studies. Even if the manual integration of sequences is a guarantee of quality we need automatic methods to speed up the annotation of new sequences. Our classification method will help curators to rapidly integrate new peroxidases and assign them to the correct sub-families. To make the PeroxiBase more user-friendly to anyone who would like to add new entries or to modify present entries, a Wiki page is in development. It will surely create more collaborative interactions for the peroxidase scientific community.

FUNDING

Swiss National Science Foundation (31-068003.02 to C.D., 315200-116864 to L.C. and N.H.). Conflict of interest statement. None declared.
  13 in total

1.  IntEnz, the integrated relational enzyme database.

Authors:  Astrid Fleischmann; Michael Darsow; Kirill Degtyarenko; Wolfgang Fleischmann; Sinéad Boyce; Kristian B Axelsen; Amos Bairoch; Dietmar Schomburg; Keith F Tipton; Rolf Apweiler
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  PeroxiBase: the peroxidase database.

Authors:  Filippo Passardi; Grégory Theiler; Marcel Zamocky; Claudia Cosio; Nicolas Rouhier; Felipe Teixera; Marcia Margis-Pinheiro; Vassilios Ioannidis; Claude Penel; Laurent Falquet; Christophe Dunand
Journal:  Phytochemistry       Date:  2007-06-04       Impact factor: 4.072

Review 3.  Active site structure and catalytic mechanisms of human peroxidases.

Authors:  Paul G Furtmüller; Martina Zederbauer; Walter Jantschko; Jutta Helm; Martin Bogner; Christa Jakopitsch; Christian Obinger
Journal:  Arch Biochem Biophys       Date:  2005-10-26       Impact factor: 4.013

4.  Prokaryotic origins of the non-animal peroxidase superfamily and organelle-mediated transmission to eukaryotes.

Authors:  Filippo Passardi; Nenad Bakalovic; Felipe Karam Teixeira; Marcia Margis-Pinheiro; Claude Penel; Christophe Dunand
Journal:  Genomics       Date:  2007-03-13       Impact factor: 5.736

5.  Effects of overexpression of the alkyl hydroperoxide reductase AhpC on the virulence and isoniazid resistance of Mycobacterium tuberculosis.

Authors:  B Heym; E Stavropoulos; N Honoré; P Domenech; B Saint-Joanis; T M Wilson; D M Collins; M J Colston; S T Cole
Journal:  Infect Immun       Date:  1997-04       Impact factor: 3.441

6.  Contribution of glutathione peroxidase to the virulence of Streptococcus pyogenes.

Authors:  Audrey Brenot; Katherine Y King; Blythe Janowiak; Owen Griffith; Michael G Caparon
Journal:  Infect Immun       Date:  2004-01       Impact factor: 3.441

7.  Two glutathione peroxidases in the fungal pathogen Cryptococcus neoformans are expressed in the presence of specific substrates.

Authors:  Tricia A Missall; Jocie F Cherry-Harris; Jennifer K Lodge
Journal:  Microbiology       Date:  2005-08       Impact factor: 2.777

8.  Peroxiredoxins from Trypanosoma cruzi: virulence factors and drug targets for treatment of Chagas disease?

Authors:  María Dolores Piñeyro; Adriana Parodi-Talice; Talia Arcari; Carlos Robello
Journal:  Gene       Date:  2007-10-22       Impact factor: 3.688

9.  Peroxidase: a term of many meanings.

Authors:  Leopold Flohé; Fulvio Ursini
Journal:  Antioxid Redox Signal       Date:  2008-09       Impact factor: 8.401

10.  MyHits: improvements to an interactive resource for analyzing protein sequences.

Authors:  Marco Pagni; Vassilios Ioannidis; Lorenzo Cerutti; Monique Zahn-Zabal; C Victor Jongeneel; Jörg Hau; Olivier Martin; Dmitri Kuznetsov; Laurent Falquet
Journal:  Nucleic Acids Res       Date:  2007-06-01       Impact factor: 16.971

View more
  33 in total

1.  Ascorbate peroxidase-related (APx-R) is not a duplicable gene.

Authors:  Christophe Dunand; Catherine Mathé; Fernanda Lazzarotto; Rogério Margis; Marcia Margis-Pinheiro
Journal:  Plant Signal Behav       Date:  2011-12

Review 2.  Overview of peroxiredoxins in oxidant defense and redox regulation.

Authors:  Leslie B Poole; Andrea Hall; Kimberly J Nelson
Journal:  Curr Protoc Toxicol       Date:  2011-08

3.  Identification of Surface-Exposed Protein Radicals and A Substrate Oxidation Site in A-Class Dye-Decolorizing Peroxidase from Thermomonospora curvata.

Authors:  Ruben Shrestha; Xuejie Chen; Kasra X Ramyar; Zahra Hayati; Eric A Carlson; Stefan H Bossmann; Likai Song; Brian V Geisbrecht; Ping Li
Journal:  ACS Catal       Date:  2016-10-12       Impact factor: 13.084

4.  Characterization of Dye-decolorizing Peroxidase (DyP) from Thermomonospora curvata Reveals Unique Catalytic Properties of A-type DyPs.

Authors:  Chao Chen; Ruben Shrestha; Kaimin Jia; Philip F Gao; Brian V Geisbrecht; Stefan H Bossmann; Jishu Shi; Ping Li
Journal:  J Biol Chem       Date:  2015-07-23       Impact factor: 5.157

5.  Predicting the functionally distinct residues in the heme, cation, and substrate-binding sites of peroxidase from stress-tolerant mangrove specie, Avicennia marina.

Authors:  Uzma Jabeen; Atiya Abbasi; Asmat Salim
Journal:  Cell Stress Chaperones       Date:  2011-06-10       Impact factor: 3.667

6.  Ectopic overexpression of vacuolar and apoplastic Catharanthus roseus peroxidases confers differential tolerance to salt and dehydration stress in transgenic tobacco.

Authors:  Santosh Kumar; Monika Jaggi; Alok Krishna Sinha
Journal:  Protoplasma       Date:  2011-06-04       Impact factor: 3.356

Review 7.  Non-heme manganese catalase--the 'other' catalase.

Authors:  James W Whittaker
Journal:  Arch Biochem Biophys       Date:  2011-12-16       Impact factor: 4.013

8.  PROSITE, a protein domain database for functional characterization and annotation.

Authors:  Christian J A Sigrist; Lorenzo Cerutti; Edouard de Castro; Petra S Langendijk-Genevaux; Virginie Bulliard; Amos Bairoch; Nicolas Hulo
Journal:  Nucleic Acids Res       Date:  2009-10-25       Impact factor: 16.971

9.  Genomewide analysis of the CIII peroxidase family in sweet orange (Citrus sinensis) and expression profiles induced by Xanthomonas citri subsp. citri and hormones.

Authors:  Qiang Li; Wanfu Dou; Jingjing Qi; Xiujuan Qin; Shanchun Chen; Yongrui He
Journal:  J Genet       Date:  2020       Impact factor: 1.166

10.  Biodegradation of a mixture of PAHs by non-ligninolytic fungal strains isolated from crude oil-contaminated soil.

Authors:  Anaisell Reyes-César; Ángel E Absalón; Francisco J Fernández; Juan Manuel González; Diana V Cortés-Espinosa
Journal:  World J Microbiol Biotechnol       Date:  2013-10-17       Impact factor: 3.312

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.