
PeroxisomeDB: a database for the peroxisomal proteome, functional genomics and disease.

Agatha Schlüter, Stéphane Fourcade, Enric Domènech-Estévez, Toni Gabaldón, Jaime Huerta-Cepas, Guillaume Berthommier, Raymond Ripp, Ronald J A Wanders, Olivier Poch, Aurora Pujol.

Abstract

Peroxisomes are essential organelles of eukaryotic origin, ubiquitously distributed in cells and organisms, playing key roles in lipid and antioxidant metabolism. Loss or malfunction of peroxisomes causes more than 20 fatal inherited conditions. We have created a peroxisomal database (http://www.peroxisomeDB.org) that includes the complete peroxisomal proteome of Homo sapiens and Saccharomyces cerevisiae, by gathering, updating and integrating the available genetic and functional information on peroxisomal genes. PeroxisomeDB is structured in the interrelated sections 'Genes', 'Functions', 'Metabolic pathways' and 'Diseases', which include hyperlinks to selected features of the NCBI, ENSEMBL and UCSC databases. We have designed graphical depictions of the main peroxisomal metabolic routes and have included updated flow charts for diagnosis. Precomputed BLAST, PSI-BLAST, multiple sequence alignments (MUSCLE) and phylogenetic trees are provided to assist in direct multispecies comparison to study evolutionarily conserved functions and pathways. Highlights of PeroxisomeDB include new tools developed to facilitate (i) identification of novel peroxisomal proteins, by detecting proteins carrying peroxisome targeting signal (PTS) motifs, and (ii) detection of peroxisomes in silico, which is particularly useful for screening the deluge of newly sequenced genomes. PeroxisomeDB should contribute to the systematic characterization of the peroxisomal proteome and facilitate systems biology approaches to the organelle.

Year:  2006        PMID: 17135190      PMCID: PMC1747181          DOI: 10.1093/nar/gkl935

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Peroxisomes were first identified by de Duve in 1966 (1). Only a few months ago, the open debate on their ontogenetic as well as evolutionary origin was settled: the organelle is derived from ER membranes (2–4). This is perhaps a modern illustration, in an organelle context, of the ‘ontogeny recapitulates phylogeny’ principle. Peroxisomes are indispensable for development, morphogenesis and differentiation, and play roles of paramount importance in hydrogen peroxide detoxification and fatty acid metabolism, the hallmark functions of the organelle. The plasticity of their protein composition and biochemical function is remarkable and may vary according to the organism, cell type and/or environmental condition. Depending on the species, peroxisomes play key roles in the degradation of amino acids, methanol and purines, or in the synthesis of bile acids, essential polyunsaturated fatty acids or penicillin [see reviews (5–7)]. Peroxisomes belong to the microbody family together with the glyoxysomes of plants and the glycosomes of trypanosomes, which are related organelles specialized in the glyoxylate and glycolysis pathways, respectively. In higher eukaryotes, peroxisomes play a key role in lipid homeostasis, adapting their copy number to cellular requirements imposed by conditions such as cold exposure, nutrients or drugs. At first glance, some peroxisomal routes such as the β-oxidation of fatty acids could seem redundant with mitochondrial β-oxidation, but they are in fact complementary, as the two organelles have different substrate specificities. Peroxisomal β-oxidation is uncoupled from ATP synthesis, thus leading to heat production in thermogenesis. Gaining knowledge of peroxisome components, their metabolic functions in different organisms and their key players remains an important challenge. We wish to contribute to the field with PeroxisomeDB.
This relational database has been created by joining the expertise of peroxisome biologists and bioinformaticians, and provides a comprehensive reference list of peroxisomal proteins of human and Saccharomyces cerevisiae with extensive hyperlinks to the databases that we believe to be among the most useful currently available. In addition to combining these links in a single location, we have included tools for in silico detection of the organelle, based on the presence of four peroxins that we recently found to be peroxisomal markers (2), and for detection of peroxisome targeting signal (PTS) motifs for PTS1, PTS2 and Pex19 binding sites, thereby facilitating the identification of novel candidate peroxisomal proteins.

RESULTS

The peroxisomal proteome

The available metabolic databases (e.g. KEGG) usually depict metabolic pathways regardless of the subcellular compartmentalization of the different enzymatic steps. PeroxisomeDB is helpful in this regard, because we integrate the different routes and specify which steps take place in peroxisomes. Further, current annotation of peroxisomal proteins in the databases is frequently incomplete and often does not distinguish between proteins that have a confirmed peroxisomal localization and those that are only candidates according to in silico predictions. Using annotated data derived from one of the most broadly used bio-ontology resources, Gene Ontology (GO), supplemented with curated searches of the experimental literature, we have built a complete peroxisomal proteome of Homo sapiens (encoded by 85 genes) and S.cerevisiae (encoded by 61 genes). To this core ensemble, we have added peroxisomal proteins from the model organisms Mus musculus and Yarrowia lipolytica that lack human orthologues, for instance the mouse urate oxidase or the several Pex proteins that are mostly restricted to Y.lipolytica (Table 1).
Table 1

PeroxisomeDB contents

Database contents                                   Number of entries
Peroxisomal genes                                   157
    Homo sapiens [GO:77(6)]                         85
    Saccharomyces cerevisiae (GO:51)                61
    Present in M.musculus and not in human          7
    Present in Y.lipolytica and not in human        4
Functions and metabolic pathways                    50
Diseases                                            22
Interactive metabolic pathway schemes               6
Peroxisomal tools                                   2
    Peroxisome identification tool
    Target signal predictor

Number of entries refers to the number of genes manually annotated in peroxisomeDB. Gene Ontology (GO) followed by number indicates the number of peroxisomal entries previously annotated as such in GO; the number in brackets indicates misannotated entries identified in GO and not included in peroxisomeDB.
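As a quick consistency check on Table 1, the four gene counts can be summed and compared with the reported total. The sketch below only restates numbers from the table; the dictionary keys are shorthand labels introduced here for illustration.

```python
# Gene counts copied from Table 1; the keys are shorthand labels, not database fields.
gene_counts = {
    "Homo sapiens": 85,
    "Saccharomyces cerevisiae": 61,
    "M.musculus, not in human": 7,
    "Y.lipolytica, not in human": 4,
}

# The per-source counts should add up to the 'Peroxisomal genes' row.
total_genes = sum(gene_counts.values())
print(total_genes)  # 157
```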

From gene to pathway, from metabolite to disease

The proteomes are organized in interrelated sections: ‘Genes’, ‘Functions’, ‘Diseases’ and ‘Metabolic pathways’. The information is integrated in a section called ‘The Peroxisome at a Glance’, which includes four interactive schemes depicting the main peroxisomal functions: Lipid Metabolism, Glyoxylate and Dicarboxylate Metabolism, Amino acid and Purine Degradation and Antioxidant Metabolism. Two additional schemes are focused on the Peroxins and other peroxisomal membrane proteins (PMP) (Figure 1), and on Peroxisomal Disease.
Figure 1

One of the six interactive schemes displayed in PeroxisomeDB: ‘Peroxins and other PMPs (peroxisome membrane proteins)’.

In the ‘Genes’ section, we have included: (i) a description and localization of the individual protein, (ii) its functional roles, (iii) the corresponding disease caused by protein malfunction and (iv) selected links to reference databases (Gene Info). We found the following resources of particular relevance: (i) from NCBI, the gene summary, the chromosomal localization, the predicted intron/exon structure at AceView, the Single Nucleotide Polymorphism (SNP) collection, the ortholog prediction and the conserved domains section; (ii) from ENSEMBL, the gene summary page, which includes information on regions of synteny; (iii) from UCSC, the Proteome and Genome Browsers with extensive expression data derived from microarray experiments; (iv) from the Weizmann Institute, the gene summary page with a focus on expression data derived from microarrays, electronic northern and SAGE (Figure 2). Based on the biochemical and physiological context of protein function, we have organized the retrieved information into 50 different Functional Categories (Table 2). Notably, 54 proteins belong to the lipid metabolism category and 10 to the antioxidant category.
Figure 2

Gene Page for a given peroxisomal gene, including a brief description, localization, functional role, disease caused by malfunction if any, tools for functional genomics and selected links to reference databases (Gene Info).
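The interrelated ‘Genes’, ‘Functions’ and ‘Diseases’ sections suggest a relational layout with link tables between them. The following is a minimal sketch of such a layout using an in-memory SQLite database. All table and column names are assumptions made for illustration and are not the actual PeroxisomeDB schema; the two example genes and their diseases are taken from the disease list in the text.

```python
import sqlite3

# Hypothetical schema: entity tables plus link tables for many-to-many relations.
SCHEMA = """
CREATE TABLE genes (
    id INTEGER PRIMARY KEY,
    symbol TEXT NOT NULL UNIQUE,
    species TEXT NOT NULL
);
CREATE TABLE categories (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE diseases (
    id INTEGER PRIMARY KEY,
    name TEXT NOT NULL UNIQUE
);
CREATE TABLE gene_category (
    gene_id INTEGER NOT NULL REFERENCES genes(id),
    category_id INTEGER NOT NULL REFERENCES categories(id)
);
CREATE TABLE gene_disease (
    gene_id INTEGER NOT NULL REFERENCES genes(id),
    disease_id INTEGER NOT NULL REFERENCES diseases(id)
);
"""


def build_sketch_db():
    """Create the illustrative schema and load two example genes."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.executemany(
        "INSERT INTO genes (symbol, species) VALUES (?, ?)",
        [("SOD1", "Homo sapiens"), ("MLYCD", "Homo sapiens")],
    )
    conn.execute(
        "INSERT INTO categories (name) VALUES (?)", ("Superoxide dismutases",)
    )
    conn.executemany(
        "INSERT INTO diseases (name) VALUES (?)",
        [("Amyotrophic lateral sclerosis",), ("Malonic aciduria",)],
    )
    # Link SOD1 to its functional category.
    conn.execute(
        """INSERT INTO gene_category
           SELECT g.id, c.id FROM genes g, categories c
           WHERE g.symbol = 'SOD1' AND c.name = 'Superoxide dismutases'"""
    )
    # Link each gene to its disease.
    conn.executemany(
        """INSERT INTO gene_disease
           SELECT g.id, d.id FROM genes g, diseases d
           WHERE g.symbol = ? AND d.name = ?""",
        [("SOD1", "Amyotrophic lateral sclerosis"), ("MLYCD", "Malonic aciduria")],
    )
    return conn


def genes_for_disease(conn, disease_name):
    """Return the gene symbols linked to a disease, sorted alphabetically."""
    rows = conn.execute(
        """SELECT g.symbol FROM genes g
           JOIN gene_disease gd ON gd.gene_id = g.id
           JOIN diseases d ON d.id = gd.disease_id
           WHERE d.name = ?
           ORDER BY g.symbol""",
        (disease_name,),
    )
    return [symbol for (symbol,) in rows]


conn = build_sketch_db()
print(genes_for_disease(conn, "Malonic aciduria"))
```

Link tables keep the sections independent, so a gene can belong to several functional categories or diseases without duplicating its entry.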

Table 2

Classification of the peroxisomal proteome into 50 different functional categories; the number of proteins classified according to functional category is listed

Functional categories                                           Number of entries
Metabolism                                                      87
    Antioxidant                                                 10
        Antiinflammatory-antimicrobial                          1
        Catalases                                               2
        Epoxides/Isochorismatase hydrolases                     1
        Glutathione peroxidase/Thioredoxins                     1
        Microsomal detoxification system related                1
        Peroxiredoxins                                          2
        Superoxide dismutases                                   2
    Glycerol synthesis                                          1
    Glyoxylate and dicarboxylate metabolism                     10
    Lipid metabolism                                            54
        Etherlipid and plasmalogen synthesis                    4
        Fatty acid oxidation                                    24
            Branched chain fatty acid beta-oxidation            9
            alpha-oxidation                                     4
            Branched chain fatty beta-oxidation                 4
        di-trihydroxycholestanoic acid oxidation/bile acid      6
            di-trihydroxycholestanoic acid beta-oxidation       3
        Long-chain dicarboxylic acid oxidation                  5
        Straight chain fatty acid oxidation                     13
            Straight chain fatty acid beta-oxidation            7
        Fatty acid synthesis/PUFA synthesis                     14
            Fatty acid chain elongation                         1
            Unsaturated fatty acid beta-oxidation               12
        Long/very fatty acid activation                         9
        Regulation of acyl-CoA/CoA ratio                        11
    Nicotinate and nicotinamide metabolism                      3
    Protein/Amino acid metabolism                               10
        d-amino acid degradation                                2
        l-Lysine degradation                                    3
        Polyamines degradation                                  1
        Proteases                                               2
        Transaminases                                           2
    Purine metabolism                                           2
    Retinoid metabolism                                         1
Peroxisomal membrane proteins (PMP)                             13
        ABC transporters                                        6
        PXMP 2/4 family proteins                                5
        PXMP 34 family proteins                                 2
Peroxisome biogenesis proteins (peroxins)                       47
        Peroxisomal AAA-ATPases proteins                        4
        Peroxisomal division-proliferation                      12
        Peroxisome docking                                      5
        Peroxisome matrix protein import                        13
            Zn Ring proteins                                    6
        Peroxisome membrane assembly                            7
        Peroxisome targeting sequence binding                   8
Peroxisome organization                                         9
Unknown                                                         3
The indispensable role of the organelle is underscored by the fatal consequences of the absence of peroxisomes in the human diseases known as Peroxisome Biogenesis Disorders (PBD). Peroxisomal disorders are classified into two generic groups: (i) peroxisome biogenesis disorders (PBD) and (ii) single peroxisomal enzyme deficiencies (8). Mutations in peroxisomal proteins essential for biogenesis and for matrix and membrane protein import (called peroxins or Pex genes) invariably lead to PBD: Zellweger Syndrome (ZS), Neonatal Adrenoleukodystrophy (NALD), Infantile Refsum disease (IRD) and Rhizomelic Chondrodysplasia Punctata (RCDP), which are characterized by a strong reduction in the number and size of peroxisomes, or even their complete absence. Phenotypically, patients suffer from hepatic failure, neurodevelopmental delay, retinopathy and deafness, with onset in utero or in the first months of life and a fatal outcome within the first years (8). In the Disease Catalog, we have listed a total of 22 disorders, each with a brief disease description and links to the Online Mendelian Inheritance in Man (OMIM) database, to specific disease-related mutation databases, to The Human Gene Mutation Database (HGMD) and to the SNP database, which facilitate access to polymorphic markers useful for molecular genetic diagnosis. Of particular relevance for clinicians are new peroxisomal disorders not described in previous reviews on peroxisomal disease.
These include the contiguous ABCD1/DXS1375E deletion syndrome and other disorders involving proteins that are not exclusively peroxisomal, such as malonyl-CoA decarboxylase (MLYCD), causing malonic aciduria; xanthine oxidase (XDH), causing xanthinuria; acyl-CoA synthetase long-chain family member 4 (FACL4), causing X-linked mental retardation; superoxide dismutase 1 (SOD1), mutated in amyotrophic lateral sclerosis (1% of SOD1 protein is located in peroxisomes); and fatty aldehyde dehydrogenase (FALDH), encoded by ALDH3A2, which has a splice variant with peroxisomal localization and causes Sjögren–Larsson syndrome.

Tools to identify peroxisomal proteins and peroxisomes

As an increasing number of genomic sequences has become available, reverse genetics approaches have become very popular for the identification of novel peroxisomal proteins, either by identifying consensus PTSs or by identifying orthologs of proteins determined to be peroxisomal in other species (homology probing). In PeroxisomeDB, we provide tools to facilitate both strategies, and also a simple approach for detecting the presence of the organelle in a given genome.

Comparative genomics tools

To facilitate identification of paralogs, orthologs and the main domains conserved throughout evolution, in-house routines provide the best BLAST and PSI-BLAST (9) results for each protein, in the form of precomputed homolog sequence results. Peroxisomal protein homologs with a region of similarity covering >50% of the query sequence are aligned using MUSCLE (10). Maximum Likelihood (ML) trees were built as implemented in PhyML (11). The tree built with the best-fitting model was further refined in a Bayesian analysis as implemented in MrBayes (12) (Figure 3). Both phylogenetic trees are provided in PeroxisomeDB.
Figure 3

Bayesian tree of a peroxisomal protein (PEX12) displayed in ATV [A Tree Viewer (21)]. Numbers at the nodes indicate the posterior probability of the corresponding partition. *** indicates the query sequence.
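The >50% coverage criterion used to select homologs for alignment can be illustrated with a short filter. This is a sketch under stated assumptions, not the in-house routine: the `Hit` record and its field names are hypothetical, and coverage is computed from a single aligned region per hit using 1-based inclusive query coordinates.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    """Hypothetical record for one homology hit against the query."""
    subject_id: str
    query_start: int  # 1-based, inclusive
    query_end: int    # 1-based, inclusive


def query_coverage(hit, query_length):
    """Fraction of the query sequence spanned by the aligned region."""
    return (hit.query_end - hit.query_start + 1) / query_length


def select_for_alignment(hits, query_length, min_coverage=0.5):
    """Keep hits whose similarity region covers more than min_coverage of the query."""
    return [h for h in hits if query_coverage(h, query_length) > min_coverage]


# Toy hits against a 100-residue query (synthetic values).
hits = [Hit("a", 1, 60), Hit("b", 10, 40), Hit("c", 1, 50)]
selected = select_for_alignment(hits, query_length=100)
print([h.subject_id for h in selected])  # ['a']
```

Hit "c" covers exactly 50% of the query and is excluded, because the criterion is strictly greater than 50%. The retained sequences would then be passed to an aligner such as MUSCLE.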


Peroxisomal targeting signal (PTS) predictor

Three different signals targeting proteins to the peroxisome have been identified and characterized to date: (i) PTS1, comprising the 12 C-terminal residues (13), (ii) the N-terminal PTS2 for matrix proteins (14,15) and, more recently, (iii) the Pex19 binding sites (Pex19 BS) for membrane proteins (16,17). We manually selected the PTS1 motifs from 99 bona fide PTS1 proteins in H.sapiens, M.musculus, S.cerevisiae and Arabidopsis thaliana; the PTS2 motifs of the proteins ACAA1, PHYH and AGPS from H.sapiens, M.musculus, Rattus norvegicus, S.cerevisiae, Kluyveromyces lactis and A.thaliana; and, for the Pex19 BS, the best hits from yeast and humans (16,17). All these motifs have been experimentally characterized. On this manually curated selection of PTS-carrying proteins, we first applied Multiple EM for Motif Elicitation (MEME) (18) to refine a consensus for each peroxisomal motif. We then submitted the three consensus PTS motifs to the BLOCKS multiple alignment processor (19) in order to create three independent BLOCKS (ungapped multiple sequence alignments) of the highly conserved PTSs. Query sequence searches are carried out using ‘Do-It-YourSelf Block Search’ on the BLOCKS server. Results are returned with measures of probability, showing the pairwise sequence alignment (Figure 4). The resulting tool is a PTS predictor that integrates the three PTS identification algorithms in a single location and allows in silico identification of novel candidate peroxisomal proteins. This is of particular interest for PTS2- and PEX19BS-containing proteins, since no web-based prediction tool is available for detection of those motifs.
Figure 4

Target Signal Predictor. Tool for the prediction of the three peroxisomal target signals: PTS1, PTS2 and PEX19BS. Motifs within the query sequence are identified using ‘Do-It-YourSelf Block Search’ in BLOCKS server.
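To give a feel for motif-based PTS screening, the sketch below scans a protein sequence with two regular expressions. It is a deliberate simplification and not the BLOCKS-based method described above: the patterns are the commonly cited textbook consensus forms, (S/A/C)(K/R/H)(L/M) at the extreme C-terminus for PTS1 and (R/K)(L/V/I)X5(H/Q)(L/A) near the N-terminus for PTS2, and the 40-residue N-terminal window is an arbitrary assumption. The function names and test sequences are invented for illustration, and Pex19 binding sites are not covered.

```python
import re

# Simplified textbook consensus patterns (assumptions, not the curated BLOCKS).
PTS1_PATTERN = re.compile(r"[SAC][KRH][LM]$")       # last three residues only
PTS2_PATTERN = re.compile(r"[RK][LVI].{5}[HQ][LA]")  # nonapeptide near the N-terminus


def has_pts1(sequence):
    """True if the C-terminal tripeptide matches the simplified PTS1 consensus."""
    return bool(PTS1_PATTERN.search(sequence.upper()))


def find_pts2(sequence, n_terminal_window=40):
    """Return the 0-based start of a PTS2-like match in the N-terminal window, or None."""
    match = PTS2_PATTERN.search(sequence.upper()[:n_terminal_window])
    return match.start() if match else None


# Synthetic sequences, not real proteins.
print(has_pts1("MAAAAAAAAASKL"))         # True: ends in SKL
print(has_pts1("MSKLAAAA"))              # False: SKL is not at the C-terminus
print(find_pts2("MAARLQAAAAHLAAAAAAA"))  # 3: RLQAAAAHL starts at index 3
```

A regular expression gives only a yes/no answer, whereas a profile search of the kind used by the predictor returns a probability measure for each match, which is more informative for ranking candidates.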


Peroxisome presence predictor

Detecting peroxisomes is not a trivial issue. Experimental peroxisome detection relies on direct visualization by electron microscopy or on immunohistochemistry against peroxisomal protein markers. Catalase is the most broadly used marker, although it can also be located in the cytosol. The common assumption that peroxisomes are present in all eukaryotic cells is contradicted, on an experimental basis, in amitochondriates such as Encephalitozoon cuniculi, Giardia lamblia or Entamoeba histolytica (20). Using a molecular phylogenetic approach, we have very recently identified four membrane proteins as peroxisomal markers (Pex3, Pex10, Pex12 and Pex19) (2), which allowed us to demonstrate the absence of the organelle in the human pathogens Plasmodium falciparum and Toxoplasma gondii, parasites that do contain mitochondria and other organelles such as apicoplasts, micronemes or rhoptries. We have created a tool that allows automatic detection of peroxisomes by connecting the human sequences of the four marker peroxins to an NCBI page that compiles an increasing number of complete eukaryotic genomes (119 eukaryotic genomes to date on the site ‘Genomic Blast with eukaryotic genomes’, July 2006, Figure 5). Thus, direct BLAST searches against the most recent genomes and proteomes available, in the search for peroxisome presence, become quick and simple. Absence of the organelle might provide insight into the metabolic status of an organism, and might even open new therapeutic strategies against human or animal pathogens. Indeed, using this tool we have substantiated our former findings by identifying another apicomplexan devoid of peroxisomes, Theileria parva. Interestingly, the ciliates Tetrahymena thermophila and Paramecium tetraurelia do contain the markers. Ciliates and Apicomplexa belong to the Alveolata lineage, together with the phylum Dinoflagellata, from which no complete genomes are yet available.
Thus, it will be of interest to revisit the question of peroxisome loss within Alveolata once new genomes become available.
Figure 5

Peroxisome Identification Tool. It is based on the detection of four peroxisomal markers, by launching BLAST processes. The 119 different eukaryotic genomes provided by ‘genomic Blast with eukaryotic genomes page’ from NCBI, can be automatically blasted against the four markers.
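The marker-based logic of the tool can be sketched as a simple decision over BLAST results for the four peroxins. The decision rule below (all four markers must have a hit at or below an E-value cutoff of 1e-5) and the function name are assumptions made for illustration; the text does not state a threshold, and in practice borderline hits would need manual inspection. The E-values in the example are made up.

```python
# The four marker peroxins named in the text.
MARKERS = ("Pex3", "Pex10", "Pex12", "Pex19")


def predict_peroxisome(best_evalues, max_evalue=1e-5):
    """Call peroxisome presence from the best BLAST E-value per marker.

    best_evalues maps a marker name to its best E-value in the target genome,
    or to None (or a missing key) when no hit was found. The cutoff and the
    all-four-markers rule are illustrative assumptions.
    """
    found = {}
    for marker in MARKERS:
        evalue = best_evalues.get(marker)
        found[marker] = evalue is not None and evalue <= max_evalue
    return all(found.values()), found


# Hypothetical genomes with invented E-values.
present, _ = predict_peroxisome(
    {"Pex3": 1e-30, "Pex10": 2e-18, "Pex12": 5e-22, "Pex19": 1e-12}
)
absent, detail = predict_peroxisome({"Pex3": 0.8, "Pex10": None})
print(present, absent)  # True False
```

Returning the per-marker detail alongside the overall call makes it easy to see which markers were missing when a genome is scored as lacking the organelle.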


Perspectives: maintenance and growth

Our first priority is to update PeroxisomeDB with new genomes and new peroxisomal entries curated manually by experts in peroxisome biology. Precomputed homology search results (BLAST and PSI-BLAST) will be automatically relaunched every 2 months. We have planned extensions to the complete, annotated genomes of the principal model organisms, such as mouse, rat and A.thaliana. In a second step, we will add the remaining complete vertebrate and fungal genomes, after inferring the components of their peroxisomal proteomes by homology probing and PTS prediction, plus experimental literature when available. Most importantly, we wish to keep PeroxisomeDB highly dynamic by actively encouraging users to submit their contributions through a web interface that allows automatic editing.
  21 in total

Review 1.  The life cycle of the peroxisome.

Authors:  V I Titorenko; R A Rachubinski
Journal:  Nat Rev Mol Cell Biol       Date:  2001-05       Impact factor: 94.444

2.  Motif refinement of the peroxisomal targeting signal 1 and evaluation of taxon-specific differences.

Authors:  Georg Neuberger; Sebastian Maurer-Stroh; Birgit Eisenhaber; Andreas Hartig; Frank Eisenhaber
Journal:  J Mol Biol       Date:  2003-05-02       Impact factor: 5.469

3.  MrBayes 3: Bayesian phylogenetic inference under mixed models.

Authors:  Fredrik Ronquist; John P Huelsenbeck
Journal:  Bioinformatics       Date:  2003-08-12       Impact factor: 6.937

4.  Amino acid substitution matrices from protein blocks.

Authors:  S Henikoff; J G Henikoff
Journal:  Proc Natl Acad Sci U S A       Date:  1992-11-15       Impact factor: 11.205

5.  Identification of PAHX, a Refsum disease gene.

Authors:  S J Mihalik; J C Morrell; D Kim; K A Sacksteder; P A Watkins; S J Gould
Journal:  Nat Genet       Date:  1997-10       Impact factor: 38.330

Review 6.  Kingdom protozoa and its 18 phyla.

Authors:  T Cavalier-Smith
Journal:  Microbiol Rev       Date:  1993-12

7.  ParaMEME: a parallel implementation and a web interface for a DNA and protein motif discovery tool.

Authors:  W N Grundy; T L Bailey; C P Elkan
Journal:  Comput Appl Biosci       Date:  1996-08

Review 8.  Peroxisomes (microbodies and related particles).

Authors:  C De Duve; P Baudhuin
Journal:  Physiol Rev       Date:  1966-04       Impact factor: 37.312

Review 9.  Peroxisome biogenesis: advances and conundrums.

Authors:  Paul B Lazarow
Journal:  Curr Opin Cell Biol       Date:  2003-08       Impact factor: 8.382

10.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity.

Authors:  Robert C Edgar
Journal:  BMC Bioinformatics       Date:  2004-08-19       Impact factor: 3.169
