| Literature DB >> 19948060 |
Xiujuan Wang1, Yong Huang, Dennis V Lavrov, Xun Gu.
Abstract
BACKGROUND: Gene and genome duplication is the principle creative force in evolution. Recently, protein subcellular relocalization, or neolocalization was proposed as one of the mechanisms responsible for the retention of duplicated genes. This hypothesis received support from the analysis of yeast genomes, but has not been tested thoroughly on animal genomes. In order to evaluate the importance of subcellular relocalizations for retention of duplicated genes in animal genomes, we systematically analyzed nuclear encoded mitochondrial proteins in the human genome by reconstructing phylogenies of mitochondrial multigene families.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19948060 PMCID: PMC2790464 DOI: 10.1186/1471-2148-9-275
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Summary of human mitochondrial multigene families
| Total number of proteins | Phylogenetic interval | ||||||
|---|---|---|---|---|---|---|---|
| Category | Multigene familiesa | Total number of family | mitochondrial | non-mitochondrial | Before vertebrates | At the root of vertebrates | After the emergence of fish |
| Mitochondrial- cytosolic families (mt-cy families) | ACAA2, ALDH1B1, CPT2, GPAM, HMGCS2, NDUFA4, SH3BP5, HIBADH, ABCB10, CABC1, ARG2, ATP5A1, BNIP3, CYP11B2, CA5A, CPS1, CDS2, CYB5B, DECR1, DGUOK, DLD, DNAJA3, GFM1, TUFM, FTMT, GK, SHMT2, GPX4, HSPA9, ALAS2, HTRA2, HK1, IDH2, MTIF2, PPA2, AK3, CKMT2, ME2, MFN2, MGST1, MIPEP, NME4, NFS1, OAS2, OPA1, SLC25A15, PCK2, PHB, PPIF, PRDX3, SIRT3, IARS2, SARS2, DNAJC19, TMLHE, TOP1MT, TRAP1, TST, OXR1 | 59 | 144 | 196 | 27b | 29b (49.2%) | 3b(5%) |
| Mitochondrial- mitochondrial families (mt-mt families) | AIFM1, MAOA, ATP5G2, BCL2L1, D2HGDH, GLUD2, GRPEL1, LETM1, MCART1, PMPCB, MTCH2, MTERF, ENDOG, OAT, BCKDHA, PDHB, PDK4, PDP2, MTRF1, RHOT1, SCO2, TIMM17A, TOMM40, VDAC3, COX4I2, COX7A1, COX6A1, COX6B1, MCCC2, OXCT2, MRPS18A, MRPS10, MRPS30 | 33 | 79 | - | 8 (24%) | 19 (57.6%) | 6 (18.4%) |
a Families are listed using gene names of mitochondrial members.
b For families that underwent multiple rounds of duplications, only the duplications followed by subcellular relocalizations were considered.
Figure 1Maximum likelihood phylogeny of the arginase family. Numbers indicate bootstrap support based on 100 replicates. ARG1: Type I arginases; ARG2: Type II arginases. Colored boxes indicate annotated and/or predicted subcellular locations of the proteins: cytoplasm (yellow) and mitochondria (green). There is no subcellular information for the proteins in Anopheles gambiae and Nematostella vectensis.
Figure 2Maximum likelihood phylogeny of the DNA topoisomerase typeIB family. Numbers indicate bootstrap support based on 100 replicates. TOP1: DNA topoisomerase 1; TOP1MT: mitochondrial DNA topoisomerase 1. Colored boxes indicate annotated and/or predicted subcellular locations of the proteins: nucleus/cytoplasm (blue) and mitochondria (green). There is no subcellular information for the proteins in Anopheles gambiae and Nematostella vectensis.
Figure 3Evolutionary rates in mitochondrial vs. non-mitochondrial proteins. (A) A schematic phylogeny of a mt-cy two gene family with gene duplication occurred in the vertebrate lineage. Branch lengths before the divergence between fish and tetrapods are marked as a and b for mitochondrial and cytosolic proteins, respectively. The corresponding average branch lengths after this divergence are marked as a' and b'. (B) The ratios of branch lengths for mitochondrial vs. nuclear paralogs (a/b, a'/b', and (a+a')/(b+b')) were calculated on the maximum likelihood topologies as illustrated in (A) with the exception of the TST family, for which the divergence between birds (chicken) and mammals was used. TOP1MT, TST, SHMT2 and CDS2 families had undergone relocalization from cytosol to mitochondria, while the remaining 6 families had the opposite direction of relocalizations.
Figure 4Maximum likelihood phylogeny of the class I sirtuin family. Numbers indicate bootstrap support based on 100 replicates. SIRT1: Sirtuin 1; SIRT2: Sirtuin 2; SIRT3: Sirtuin 3. Colored boxes indicate annotated and/or predicted subcellular locations of the proteins: nucleus (purple), cytoplasm (yellow) and mitochondria (green). SIRT3 in Rattus norvegicus and Mus musculus lost the mitochondrial N-terminal targeting signal and thus were retained in the cytoplasm.
Figure 5The presence of mitochondrial N-terminal targeting signals (A) and mitochondrial Pfam domains (B) for human mitochondrial (mt) and non-mitochondrial (nonmt) proteins. N-terminal mitochondrial targeting signals were inferred for proteins in mt-cy families based on targetP predictions [38]. Mitochondrial Pfam domains refer to those domains that were found only in eukaryotic (excluding human) mitochondrial proteins [40].
Figure 6Maestro score distributions for human mitochondrial and non-mitochondrial proteins. Maestro scoring system incorporates eight genomic-scale data sets (targeting sequence prediction, protein domain enrichment, presence of cis-regulatory motifs, yeast homology, ancestry, tandem-mass spectrometry, coexpression and transcriptional induction during mitochondrial biogenesis) for predicting nuclear encoded mitochondrial proteins [40]. The cutoff score of 5.65 is indicated as the vertical bar.