| Literature DB >> 19208617 |
Anthony C Smith1, Alan J Robinson.
Abstract
Mitochondria are a vital component of eukaryotic cells with functions that extend beyond energy production to include metabolism, signaling, cell growth, and apoptosis. Their dysfunction is implicated in a large number of metabolic, degenerative, and age-related human diseases. Therefore, it is important to characterize and understand the mitochondrion. Many experiments have attempted to define the mitochondrial proteome, resulting in large and complex data sets that are difficult to analyze. To address this, we developed a new public resource for the storage and investigation of this mitochondrial proteomics data, called MitoMiner, that uses a model to describe the proteomics data and associated biological information. The proteomics data of 33 publications from both mass spectrometry and green fluorescent protein tagging experiments were imported and integrated with protein annotation from UniProt and genome projects, metabolic pathway data from Kyoto Encyclopedia of Genes and Genomes, homology relationships from HomoloGene, and disease information from Online Mendelian Inheritance in Man. We demonstrate the strengths of MitoMiner by investigating these data sets and show that the number of different mitochondrial proteins that have been reported is about 3700, although the number of proteins common to both animals and yeast is about 1400, and membrane proteins appear to be underrepresented. Furthermore analysis indicated that enzymes of some cytosolic metabolic pathways are regularly detected in mitochondrial proteomics experiments, suggesting that they are associated with the outside of the outer mitochondrial membrane. The data and advanced capabilities of MitoMiner provide a framework for further mitochondrial analysis and future systems level modeling of mitochondrial physiology.Entities:
Mesh:
Year: 2009 PMID: 19208617 PMCID: PMC2690483 DOI: 10.1074/mcp.M800373-MCP200
Source DB: PubMed Journal: Mol Cell Proteomics ISSN: 1535-9476 Impact factor: 5.911
Species and evidence types cataloged in public databases reporting the mitochondrial localization of proteins
HMPDb, human mitochondrial protein database; AMPDb, Arabidopsis mitochondrial protein database; AMPP, Arabidopsis mitochondrial proteome project; ORMD, organelle map database; YMP, yeast mitochondrial proteome database; YDPM, yeast deletion project and proteomics of mitochondria database.
| Database | Species | Evidence | Ref. | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Hs | Mm | Rn | Dm | Ce | Nc | Sc | At | |||
| MitoP2 | + | + | − | − | − | + | + | − | M, G, A | |
| MiGenes | + | + | + | + | + | − | + | − | A | |
| MitoRes | + | + | + | + | + | − | − | − | A | |
| MitoProteome | + | − | − | − | − | − | − | − | M, A | |
| HMPDb | + | − | − | − | − | − | − | − | A | |
| AMPDb | − | − | − | − | − | − | − | + | M, A | |
| AMPP | − | − | − | − | − | − | − | + | M | |
| ORMD | − | + | − | − | − | − | − | − | M, G | |
| YMP | − | − | − | − | − | − | + | − | M | |
| YDPM | − | − | − | − | − | − | + | − | M | |
| MitoMiner | + | + | + | + | + | − | + | − | M, G, A | |
Species: Hs, H. sapiens; Mm, M. musculus; Rn, R. norvegicus; Dm, D. melanogaster; Ce, Caenorhabditis elegans; Nc, Neurospora crassa; Sc, S. cerevisiae; and At, A. thaliana.
Evidence type reported for mitochondrial protein localization: identification from mass spectrometry of purified mitochondria (M), localization from GFP tagging (G), or curated annotation from public databases and literature (A).
MitoRes includes metazoan species from UniProt.
FThe process of extracting data from external public resources for loading and integration into MitoMiner. Data were downloaded from public sources, and the identifiers among the different sources were mapped by using either the MGI or PIR identifier mapping tools. XML-formatted data and configuration files were created from the data sources by using Perl scripts. The XML files were loaded into an underlying PostgreSQL database by the InterMine system. InterMine gives access to the data through a configurable Web application (Webapp) by using Apache Tomcat. GO, Gene Ontology.
FThe Web page of the data category for proteins. This page describes the data (top left) and provides bulk download options (top right) and template queries that perform common searches (bottom).
FThe protein report page for the oxysterol-binding protein homolog 7 (Osh7p) of This page shows properties of the protein, cross-references to other data including the UniProt entry, and the evidence for its mitochondrial localization from GFP tagging and mass spectrometry. Osh7p is not annotated as mitochondrial in UniProt, the Saccharomyces Genome Database, or the Gene Ontology.
FThe Web page of the Query Builder for building bespoke queries. The Web page has three components: the model browser (top left) from which data classes and attributes of the object model are selected for inclusion in the query; the data classes included in the query, the constraints on their attributes, and the Boolean logic used to combine them (top right); and the data columns to display and sort the output of the results (bottom). The query displayed is to “show all proteins that are present in KEGG pathway 00071 (fatty acid metabolism) in H. sapiens and have evidence of mitochondrial localization by either mass spectrometry or GFP experiments.”
FAn example of a Web page for the results for a query. The query was “show all proteins that are present in KEGG pathway 00071 (fatty acid metabolism) in H. sapiens and have evidence of mitochondrial localization by either mass spectrometry or GFP tagging experiments.” Data in columns can be sorted or summarized, the columns can be moved or hidden, identifiers can be saved to a list, and the results can be exported in different formats.
FA Web page for a predefined template query. The query is show all proteins that are present in KEGG pathway 00071 (fatty acid metabolism) in H. sapiens and have evidence of mitochondrial localization by either mass spectrometry or GFP experiments. Before the query is run, the user specifies the KEGG pathway by its identifier and picks the desired organism from a drop-down box.
FThe cumulative increase in the number of (i) proteins reported as mitochondrial ( Protein redundancy was removed by using HomoloGene to merge orthologs and duplicate proteins. The number of transmembrane proteins (dark gray) as annotated in UniProt that have been found is about 20% of the total (light gray).
FThe frequency distribution ( Protein redundancy was removed by using HomoloGene to combine orthologs and duplicate proteins. Proteins that have been identified in 10 or more studies are grouped as “10+.”
Experimental evidence for the mitochondrial localization of the subunits of the FoF1 ATP synthase in M. musculus and in other species
| Subunit | UniProt ID | GFP tagging study
| No. of mass spectrometry publications
| No. of TM helices | ||
|---|---|---|---|---|---|---|
| Other species | Other species | |||||
| a | P00848 | − | − | 1 | 8 | 6 |
| b | Q9CQQ7 | − | + | 5 | 13 | 0 |
| c | P48202 | − | − | 1 | 2 | 2 |
| c | P56383 | − | − | 0 | 0 | 2 |
| c | P56384 | − | − | 0 | 1 | 2 |
| d | Q9DCX2 | − | + | 5 | 18 | 0 |
| e | Q06185 | − | + | 3 | 7 | 0 |
| f | P56135 | − | + | 5 | 9 | 0 |
| g | Q9CPQ8 | − | + | 4 | 14 | 0 |
| F6 | P97450 | − | − | 5 | 5 | 0 |
| 8 (A6L) | P03930 | − | − | 1 | 4 | 1 |
| 6.8 kDa | P56379 | − | − | 2 | 3 | 0 |
| DAPIT | Q78IK2 | − | + | 2 | 7 | 1 |
| α | Q03265 | + | + | 7 | 18 | 0 |
| β | P56480 | + | + | 7 | 19 | 0 |
| γ | Q91VR2 | + | + | 6 | 14 | 0 |
| δ | Q9D3D9 | − | − | 6 | 11 | 0 |
| ε | P56382 | − | + | 4 | 9 | 0 |
| OSCP | Q9DB20 | − | + | 5 | 15 | 0 |
Species were H. sapiens, R. norvegicus, B. taurus, D. melanogaster, and S. cerevisiae.
Number of transmembrane helices (TM) in the subunit as reported in UniProt.
Isoform 1 of subunit c.
Isoform 2 of subunit c.
Isoform 3 of subunit c.
Numbers of non-redundant proteins (by using HomoloGene) described as mitochondrial in six species
| Species | No. of proteins in species
| No. with orthologous proteins in other species | Total | ||||||
|---|---|---|---|---|---|---|---|---|---|
| GFP | Mass spec | Annotated | Combined | GFP | Mass spec | Annotated | Combined | ||
| 142 | 1037 | 853 | 1551 | 243 | 2465 | 1109 | 2701 | 3025 | |
| 52 | 2411 | 942 | 2626 | 327 | 1436 | 1063 | 1896 | 3124 | |
| 0 | 533 | 413 | 754 | 207 | 1700 | 665 | 1835 | 1907 | |
| 0 | 18 | 415 | 416 | 207 | 1483 | 649 | 1595 | 1602 | |
| 0 | 37 | 236 | 255 | 271 | 1407 | 713 | 1520 | 1576 | |
| 574 | 1004 | 922 | 1196 | 46 | 584 | 251 | 620 | 1461 | |
GFP, number of proteins determined by GFP tagging and microscopy as having mitochondrial localization.
Mass spec, number of proteins determined by using mass spectrometry of purified mitochondrial fractions.
Annotated, number of proteins annotated by UniProt, genome databases, or the Gene Ontology as mitochondrial.
Combined, number of proteins described as mitochondrial by combining evidence from GFP, mass spectrometry, and annotation.
Number of proteins that have an ortholog (as defined by HomoloGene) described as mitochondrial in the other five species.
Number of proteins described as mitochondrial by combining direct evidence and that inferred from orthologs.
FThe numbers of orthologous proteins among three mitochondrial proteomes. A protein was assigned as mitochondrial by either experimental evidence (mass spectrometry or GFP tagging) or annotation or by the mitochondrial localization of an ortholog. Main numbers were calculated by using HomoloGene to determine redundancy and orthology among proteins. Numbers in parentheses were calculated by using BLAST (with a threshold of 10−35 for the expect score) to define orthologs.