Literature DB >> 30398659

MitoMiner v4.0: an updated database of mitochondrial localization evidence, phenotypes and diseases.

Anthony C Smith1, Alan J Robinson1.   

Abstract

Increasing numbers of diseases are associated with mitochondrial dysfunction. This is unsurprising given mitochondria have major roles in bioenergy generation, signalling, detoxification, apoptosis and biosynthesis. However, fundamental questions of mitochondrial biology remain, including: which nuclear genes encode mitochondrial proteins; how their expression varies with tissue; and which are associated with disease. But experiments to catalogue the mitochondrial proteome are incomplete and sometimes contradictory. This arises because the mitochondrial proteome has tissue- and stage-specific variability, plus differences among experimental techniques and localization evidence types used. This leads to limitations in each technique's coverage and inevitably conflicting results. To support identification of mitochondrial proteins, we developed MitoMiner (http://mitominer.mrc-mbu.cam.ac.uk/), a database combining evidence of mitochondrial localization with information from public resources. Here we report upgrades to MitoMiner, including its re-engineering to be gene-centric to enable easier sharing of evidence among orthologues and support next generation sequencing, plus new data sources, including expression in different tissues, information on phenotypes and diseases of genetic mutations and a new mitochondrial proteome catalogue. MitoMiner is a powerful platform to investigate mitochondrial localization by providing a unique combination of experimental sub-cellular localization datasets, tissue expression, predictions of mitochondrial targeting sequences, gene annotation and links to phenotype and disease.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30398659      PMCID: PMC6323904          DOI: 10.1093/nar/gky1072

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Mitochondria are associated with a wide spectrum of metabolic, degenerative and age-related human diseases as well as cancer. This has generated considerable interest in mitochondria from a wide range of researchers. However, the mitochondrial proteome has yet to be conclusively identified—as evidenced by the continued discovery of new mitochondrial disease genes—which hinders investigations into the organelle’s role. Many different approaches have been used to address this problem, but each has limitations and no single technique provides full coverage of the mitochondrial proteome. Furthermore, this problem is exacerbated by how the mitochondrial proteome varies with cell type, environment and development. To determine if proteins are mitochondrial, we believe it is necessary to cross-reference between many different evidence types. This has the additional benefit of providing data to apply machine-learning techniques to the problem of identifying novel mitochondrial proteins. We described previously how MitoMiner (1–3) incorporates mitochondrial localization data from a variety of resources—including mass spectrometry of purified cell fractions, GFP-tagging and microscopy, curated annotation in the Gene Ontology (4,5) and computational prediction of mitochondrial targeting sequences. Further, MitoMiner provides a biological context for candidate mitochondrial proteins by integrating information from other public resources, including the Gene Ontology, UniProt (6), OMIM (7) and the Human Protein Atlas (HPA; http://www.proteinatlas.org/) (8,9). Here we describe a major update to the MitoMiner database including: re-factoring MitoMiner from a protein-centric to gene-centric resource (substantially aiding cross-referencing data and application in analysis of disease gene candidates); new analysis tools and widgets for RNA expression data; new data sources such as a new mitochondrial proteome dataset derived by using machine learning on the data in MitoMiner; and new data types, including human RNA expression data and phenotype descriptions of diseases and genetic mutations.

SOFTWARE IMPLEMENTATION AND DATA IMPORT

MitoMiner continues to be built on the InterMine data-warehouse (10,11), which provides the infrastructure to keep MitoMiner up-to-date, as well as a powerful and flexible user interface, enabling options from simple text searches to complicated queries with multiple constraints spanning any of the included data types, as well as application programming interfaces for programmatic access with Perl, Python, Ruby and Java. MitoMiner's database schema is based on the InterMine core schema that specifies different types of biological data. To model MitoMiner-specific data types—e.g. mass spectrometry and GFP tagging datasets, metabolic pathway data and homology mappings—the InterMine core model has been extended with bespoke additions. Data were imported by using either InterMine-provided data loaders, or Perl scripts to convert data files to InterMine compatible XML data files. The MitoMiner data sources and InterMine infrastructure are updated on a 9–12-month cycle.

UPDATES TO DATA SOURCES

MitoMiner was conceived as a proteomics resource and annotation attached to protein entries. Thus, in earlier MitoMiner versions, a single gene could be represented by multiple protein entries—each with different data—that needed separate, manual evaluation. However, changing usage patterns—especially with analysis of disease gene candidates from next generation sequencing of mitochondrial disease patients—prompted a major re-factoring of MitoMiner 4.0 to be gene-centric, rather than protein-centric. Thus a new gene entry was created (Figure 1) with data from its protein entries attached directly to it, which can be searched by Ensembl gene identifier (12), Human Gene Organization (HUGO) gene symbol, genome project gene identifier (e.g. from Mouse Genome Informatics (MGI) (13) and Rat Genome Database (RGD) (14)) and NCBI gene identifier (15). By using this gene entry, protein data are merged and consolidated at the gene level, redundancy has been removed and it is easier to share data amongst homologues of the same gene in different species, as exemplified by the summary table and cross-references (Figure 1). If the proteins of a gene have different values for a property, then the value with the greatest level of evidence is added to the gene entry, e.g. the best mitochondrial targeting sequence prediction.
Figure 1.

Header of the MitoMiner gene entry for the human TIMM50 gene. MitoMiner's new gene-centric organization presents first an overview of primary mitochondrial localization evidence, then the gene’s homologues in other species and a table of their mitochondrial localization evidence, including experimental data, annotation and mutant phenotype. Links to this gene’s presence in catalogues of mitochondrial proteomes is provided, as well as links to external resources, e.g. OMIM and Ensembl. The remainder of the gene entry page (not shown) describes other information, including localization evidence, tissue expression data and links to metabolism.

Header of the MitoMiner gene entry for the human TIMM50 gene. MitoMiner's new gene-centric organization presents first an overview of primary mitochondrial localization evidence, then the gene’s homologues in other species and a table of their mitochondrial localization evidence, including experimental data, annotation and mutant phenotype. Links to this gene’s presence in catalogues of mitochondrial proteomes is provided, as well as links to external resources, e.g. OMIM and Ensembl. The remainder of the gene entry page (not shown) describes other information, including localization evidence, tissue expression data and links to metabolism. Mutant phenotype data for the mouse TIMM50 gene recorded in the phenotype section of its gene entry. Identifier names and descriptions are from the Mouse Genome Informatics (MGD) database. Cross-references between homologous genes means the existence of these mouse data are recorded in the ‘Homologue Phenotype Recorded?’ column in the summary table of the human TIMM50 gene entry (Figure 1). Next are provided descriptions of data sources new to MitoMiner since version 3.1 (3). MitoMiner retains data sources described in earlier releases (1–3), e.g. metabolic pathways, but these are not described again here.

Mitochondrial protein localization

MitoMiner’s focus continues to be collating evidence for the mitochondrial localization of proteins from large-scale studies. New mitochondrial localization datasets have been added—seven from human; one from mouse; one from rat and two from yeast—bringing the total to 56 datasets. The full list of datasets with references is available from the Data Sources page of the MitoMiner website. New additions to the protein annotation of mitochondrial localization include: mitochondrial targeting sequence predictions from MitoFates (16); cleavage site predictions from MitoFates and MitoProt (17); and confidence notation of prediction reliability from TargetP (18).

Phenotype and disease

A disease gene candidate is more credible if mutation of the homologous gene in a model organism causes a similar phenotype to the human disease. Thus, annotations and links of genes to phenotype and disease have been added to MitoMiner. This includes: human disease (from OMIM (7)) and mitochondrial disease (from the Washington University list of mitochondrial disorders (https://neuromuscular.wustl.edu/mitosyn.html)); phenotypes of spontaneous, induced and engineered mutations in mouse (from MGI); and knock-down and genetic mutant information in zebrafish (from ZFIN (19)) and yeast (from SGD (20)). This information is cross-referenced between homologous genes and available via the ‘Summary’ section of a gene page (Figure 2).
Figure 2.

Mutant phenotype data for the mouse TIMM50 gene recorded in the phenotype section of its gene entry. Identifier names and descriptions are from the Mouse Genome Informatics (MGD) database. Cross-references between homologous genes means the existence of these mouse data are recorded in the ‘Homologue Phenotype Recorded?’ column in the summary table of the human TIMM50 gene entry (Figure 1).

Mitochondrial proteomes

To decide whether a protein is mitochondrial based upon available data, various definitions of the mitochondrial proteome have been made by evaluating literature, experimental studies and computational predictions. MitoMiner provides two mitochondrial proteome datasets: the well-respected MitoCarta2 (21) and a new Integrated Mitochondrial Protein Index (IMPI), based upon applying machine learning to mitochondrial localization data in MitoMiner (http://impi.mrc-mbu.cam.ac.uk/). Each proteome has its own page (linked from the menu bar) and is provided with a set of template queries to analyse them. Users can evaluate the evidence for a protein being included in a mitochondrial proteome dataset by surveying its gene entry in MitoMiner.

Tissue RNA expression

Genes of mitochondrial proteins tend to be highly expressed in tissues having substantial energy or metabolic demands. For example, when evaluating candidate mitochondrial disease genes it is useful to relate the organ systems affected in the patient with the tissue-specific expression of the candidate gene. Thus, a candidate disease gene identified from a patient suffering from ‘mitochondrial encephalopathy, lactic acidosis and stroke-like episodes’ (MELAS) is unlikely to be disease-causing if its expression is undetected in brain. To support such queries, MitoMiner now includes RNA tissue expression data in addition to protein level detection from the Human Protein Atlas (9,22), and new display widgets for HPA’s tissue and cancer RNA expression were created and are available on gene entry pages.

Tutorials and user guides

The MitoMiner website is accompanied with a full set of support pages including FAQ’s, user guides, examples and tutorials (http://mitominer.mrc-mbu.cam.ac.uk/support/). Nine tutorials document new and existing features of MitoMiner, including uploading and analysing data, building bespoke queries by using the Query Builderweb site and storing private lists and queries by using MyMine.

DATA AVAILABILITY

MitoMiner is freely available at the Medical Research Council Mitochondrial Biology Unit website (http://mitominer.mrc-mbu.cam.ac.uk/).
  22 in total

1.  Gene ontology: tool for the unification of biology. The Gene Ontology Consortium.

Authors:  M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Locating proteins in the cell using TargetP, SignalP and related tools.

Authors:  Olof Emanuelsson; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

3.  Proteomics. Tissue-based map of the human proteome.

Authors:  Mathias Uhlén; Linn Fagerberg; Björn M Hallström; Cecilia Lindskog; Per Oksvold; Adil Mardinoglu; Åsa Sivertsson; Caroline Kampf; Evelina Sjöstedt; Anna Asplund; IngMarie Olsson; Karolina Edlund; Emma Lundberg; Sanjay Navani; Cristina Al-Khalili Szigyarto; Jacob Odeberg; Dijana Djureinovic; Jenny Ottosson Takanen; Sophia Hober; Tove Alm; Per-Henrik Edqvist; Holger Berling; Hanna Tegel; Jan Mulder; Johan Rockberg; Peter Nilsson; Jochen M Schwenk; Marica Hamsten; Kalle von Feilitzen; Mattias Forsberg; Lukas Persson; Fredric Johansson; Martin Zwahlen; Gunnar von Heijne; Jens Nielsen; Fredrik Pontén
Journal:  Science       Date:  2015-01-23       Impact factor: 47.728

4.  InterMine: a flexible data warehouse system for the integration and analysis of heterogeneous biological data.

Authors:  Richard N Smith; Jelena Aleksic; Daniela Butano; Adrian Carr; Sergio Contrino; Fengyuan Hu; Mike Lyne; Rachel Lyne; Alex Kalderimis; Kim Rutherford; Radek Stepan; Julie Sullivan; Matthew Wakeling; Xavier Watkins; Gos Micklem
Journal:  Bioinformatics       Date:  2012-09-27       Impact factor: 6.937

5.  MitoMiner: a data warehouse for mitochondrial proteomics data.

Authors:  Anthony C Smith; James A Blackshaw; Alan J Robinson
Journal:  Nucleic Acids Res       Date:  2011-11-24       Impact factor: 16.971

6.  MitoFates: improved prediction of mitochondrial targeting sequences and their cleavage sites.

Authors:  Yoshinori Fukasawa; Junko Tsuji; Szu-Chin Fu; Kentaro Tomii; Paul Horton; Kenichiro Imai
Journal:  Mol Cell Proteomics       Date:  2015-02-10       Impact factor: 5.911

7.  InterMine: extensive web services for modern biology.

Authors:  Alex Kalderimis; Rachel Lyne; Daniela Butano; Sergio Contrino; Mike Lyne; Joshua Heimbach; Fengyuan Hu; Richard Smith; Radek Stěpán; Julie Sullivan; Gos Micklem
Journal:  Nucleic Acids Res       Date:  2014-04-21       Impact factor: 16.971

8.  OMIM.org: Online Mendelian Inheritance in Man (OMIM®), an online catalog of human genes and genetic disorders.

Authors:  Joanna S Amberger; Carol A Bocchini; François Schiettecatte; Alan F Scott; Ada Hamosh
Journal:  Nucleic Acids Res       Date:  2014-11-26       Impact factor: 19.160

9.  The Rat Genome Database 2015: genomic, phenotypic and environmental variations and disease.

Authors:  Mary Shimoyama; Jeff De Pons; G Thomas Hayman; Stanley J F Laulederkind; Weisong Liu; Rajni Nigam; Victoria Petri; Jennifer R Smith; Marek Tutaj; Shur-Jen Wang; Elizabeth Worthey; Melinda Dwinell; Howard Jacob
Journal:  Nucleic Acids Res       Date:  2014-10-29       Impact factor: 19.160

10.  MitoMiner, an integrated database for the storage and analysis of mitochondrial proteomics data.

Authors:  Anthony C Smith; Alan J Robinson
Journal:  Mol Cell Proteomics       Date:  2009-02-09       Impact factor: 5.911

View more
  32 in total

1.  SynGO: An Evidence-Based, Expert-Curated Knowledge Base for the Synapse.

Authors:  Frank Koopmans; Pim van Nierop; Maria Andres-Alonso; Andrea Byrnes; Tony Cijsouw; Marcelo P Coba; L Niels Cornelisse; Ryan J Farrell; Hana L Goldschmidt; Daniel P Howrigan; Natasha K Hussain; Cordelia Imig; Arthur P H de Jong; Hwajin Jung; Mahdokht Kohansalnodehi; Barbara Kramarz; Noa Lipstein; Ruth C Lovering; Harold MacGillavry; Vittoria Mariano; Huaiyu Mi; Momchil Ninov; David Osumi-Sutherland; Rainer Pielot; Karl-Heinz Smalla; Haiming Tang; Katherine Tashman; Ruud F G Toonen; Chiara Verpelli; Rita Reig-Viader; Kyoko Watanabe; Jan van Weering; Tilmann Achsel; Ghazaleh Ashrafi; Nimra Asi; Tyler C Brown; Pietro De Camilli; Marc Feuermann; Rebecca E Foulger; Pascale Gaudet; Anoushka Joglekar; Alexandros Kanellopoulos; Robert Malenka; Roger A Nicoll; Camila Pulido; Jaime de Juan-Sanz; Morgan Sheng; Thomas C Südhof; Hagen U Tilgner; Claudia Bagni; Àlex Bayés; Thomas Biederer; Nils Brose; John Jia En Chua; Daniela C Dieterich; Eckart D Gundelfinger; Casper Hoogenraad; Richard L Huganir; Reinhard Jahn; Pascal S Kaeser; Eunjoon Kim; Michael R Kreutz; Peter S McPherson; Ben M Neale; Vincent O'Connor; Danielle Posthuma; Timothy A Ryan; Carlo Sala; Guoping Feng; Steven E Hyman; Paul D Thomas; August B Smit; Matthijs Verhage
Journal:  Neuron       Date:  2019-06-03       Impact factor: 17.173

2.  The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets.

Authors:  Damian Szklarczyk; Annika L Gable; Katerina C Nastou; David Lyon; Rebecca Kirsch; Sampo Pyysalo; Nadezhda T Doncheva; Marc Legeay; Tao Fang; Peer Bork; Lars J Jensen; Christian von Mering
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

3.  Quantitative high-confidence human mitochondrial proteome and its dynamics in cellular context.

Authors:  Marcel Morgenstern; Christian D Peikert; Philipp Lübbert; Ida Suppanz; Cinzia Klemm; Oliver Alka; Conny Steiert; Nataliia Naumenko; Alexander Schendzielorz; Laura Melchionda; Wignand W D Mühlhäuser; Bettina Knapp; Jakob D Busch; Sebastian B Stiller; Stefan Dannenmaier; Caroline Lindau; Mariya Licheva; Christopher Eickhorst; Riccardo Galbusera; Ralf M Zerbes; Michael T Ryan; Claudine Kraft; Vera Kozjak-Pavlovic; Friedel Drepper; Sven Dennerlein; Silke Oeljeklaus; Nikolaus Pfanner; Nils Wiedemann; Bettina Warscheid
Journal:  Cell Metab       Date:  2021-11-19       Impact factor: 27.287

Review 4.  Psychiatric drugs impact mitochondrial function in brain and other tissues.

Authors:  Shawna T Chan; Michael J McCarthy; Marquis P Vawter
Journal:  Schizophr Res       Date:  2019-11-16       Impact factor: 4.939

5.  N6-Deoxyadenosine Methylation in Mammalian Mitochondrial DNA.

Authors:  Ziyang Hao; Tong Wu; Xiaolong Cui; Pingping Zhu; Caiping Tan; Xiaoyang Dou; Kai-Wen Hsu; Yueh-Te Lin; Pei-Hua Peng; Li-Sheng Zhang; Yawei Gao; Lulu Hu; Hui-Lung Sun; Allen Zhu; Jianzhao Liu; Kou-Juey Wu; Chuan He
Journal:  Mol Cell       Date:  2020-03-16       Impact factor: 17.970

6.  High-throughput screening identifies suppressors of mitochondrial fragmentation in OPA1 fibroblasts.

Authors:  Emma Cretin; Priscilla Lopes; Elodie Vimont; Takashi Tatsuta; Thomas Langer; Anastasia Gazi; Martin Sachse; Patrick Yu-Wai-Man; Pascal Reynier; Timothy Wai
Journal:  EMBO Mol Med       Date:  2021-05-20       Impact factor: 12.137

7.  MIROs and DRP1 drive mitochondrial-derived vesicle biogenesis and promote quality control.

Authors:  Tim König; Hendrik Nolte; Mari J Aaltonen; Takashi Tatsuta; Michiel Krols; Thomas Stroh; Thomas Langer; Heidi M McBride
Journal:  Nat Cell Biol       Date:  2021-12-06       Impact factor: 28.824

8.  Ovarian carcinoma immunoreactive antigen-like protein 2 (OCIAD2) is a novel complex III-specific assembly factor in mitochondria.

Authors:  Katarzyna Justyna Chojnacka; Praveenraj Elancheliyan; Ben Hur Marins Mussulini; Karthik Mohanraj; Sylvie Callegari; Aleksandra Gosk; Tomasz Banach; Tomasz Góral; Karolina Szczepanowska; Peter Rehling; Remigiusz Adam Serwa; Agnieszka Chacinska
Journal:  Mol Biol Cell       Date:  2022-01-26       Impact factor: 3.612

9.  Comparative analysis of MACROD1, MACROD2 and TARG1 expression, localisation and interactome.

Authors:  R Žaja; G Aydin; B E Lippok; R Feederle; B Lüscher; K L H Feijs
Journal:  Sci Rep       Date:  2020-05-19       Impact factor: 4.379

10.  Large-scale prediction and analysis of protein sub-mitochondrial localization with DeepMito.

Authors:  Castrense Savojardo; Pier Luigi Martelli; Giacomo Tartari; Rita Casadio
Journal:  BMC Bioinformatics       Date:  2020-09-16       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.