| Literature DB >> 20624730 |
Vicente Pérez-Brocal1, Revital Shahar-Golan, C Graham Clark.
Abstract
Mitochondrial evolution has given rise to a complex array of organelles, ranging from classical aerobic mitochondria to mitochondrial remnants known as hydrogenosomes and mitosomes. The latter are found in anaerobic eukaryotes, and these highly derived organelles often retain only scant evidence of their mitochondrial origins. Intermediate evolutionary stages have also been reported as facultatively or even strictly anaerobic mitochondria, and hydrogenosomes that still retain some mitochondrial features. However, the diversity among these organelles with transitional features remains rather unclear and barely studied. Here, we report the sequence, structure, and gene content of the mitochondrial DNA of the anaerobic stramenopile Proteromonas lacertae. It has a linear genome with a unique central region flanked by two identical large inverted repeats containing numerous genes and "telomeres" with short inverted repeats. Comparison with the organelle genome of the strictly anaerobic human parasite Blastocystis reveals that, despite the close similarity of the sequences, features such as the genome structure display striking differences. It remains unclear whether the virtually identical gene repertoires are the result of convergence or descent.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20624730 PMCID: PMC2997541 DOI: 10.1093/gbe/evq015
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FStructure of Proteromonas lacertae. Left panel: Light micrograph of P. lacertae. The Giemsa-stained cell has a size of ∼13 × 3 μm. The anterior of the cell bears two flagella, one thicker and longer than the other. The single nucleus (n) is visible at the anterior pole of the cell. Right panel: Transmission electron micrograph of P. lacertae. This section of the anterior of the cell shows the rhizoplast (r) passing through the Golgi apparatus (g) and into the nucleus (n). The single large mitochondrion (m), in which cristae are visible, is adjacent to the nucleus. For a more detailed description see Brugerolle and Mignot (1990).
FCircular representation of the gene and physical map of the linear Proteromonas lacertae organellar genome. Black blocks represent genes and ORFs and are transcribed from different strands in the right and left halves of the molecule. The interior arrows identify the two repeated regions and show their transcriptional direction. Gray blocks represent tRNA genes, and these are identified by their amino acid using the single-letter code. Mf and Me1/Me2 are initiator and elongator methionyl tRNAs, respectively. Short segments at each end of the molecule represent the putative “telomeric” region. The inner circle shows the size scale. The map was created using GenomeViz 1.1 (Ghai et al. 2004). The inner circle shows the size scale (kbp).
Characteristics of the Proteromonas lacertae mtDNA
| Other Stramenopiles | |||
| Total length (bp) | 48,663 | 27,719–29,270 | 31,617–58,507 |
| G + C content (%) | |||
| Total | 20.9 (22.7) | 19.9–21.9 | 22.3–38.0 |
| Intergenic regions | 13.2 (12.0) | 10.7–10.9 | 6.8–37.4 |
| Protein-coding regions | 18.0 (19.0) | 17.9–20.7 | 20.9–37.1 |
| Structural RNA genes | 35.4 (35.4) | 28.7–30.0 | 33.9–45.2 |
| Gene number | 52 (88) | 45 | 58–79 |
| Protein coding | 27 (39) | 27 | 34–52 |
| Structural RNAs | 25 (49) | 18 | 24–34 |
| Average length ORFs (nt) | 955 (886) | 793–835 | 645–1,049 |
| Average length IGRs (nt) | 34 (31) | 28–37 | 20–160 |
NOTE.—Various genome features of the complete organelle genome of P. lacertae, Blastocystis, and other stramenopiles are compared. IGR, intergenic region.
In parentheses are the values for G + C content, gene number etc., in P. lacertae when both copies of the repeated region are included.
Data for all other stramenopile complete mitochondrial genome sequences have been summarized in this column, namely those shown in figure 3 plus six additional Saccharina species.
FGene order comparison between the mtDNA of Blastocystis sp. (linearized) and Proteromonas lacertae. For simplicity, only one of the repeated regions in P. lacertae has been depicted, and for this comparison, the beginning of the Blastocystis sp. MLO genome has been placed at rps13. The Blastocystis sp. gene nomenclature of Wawrzyniak et al. (2008) was used. “tX” indicates a tRNA gene, identified by its amino acid using the single-letter code. Solid lines connect the conserved gene blocks. Green: conserved block of genes with identical order and forward orientation. Blue: conserved block of genes in inverted orientation. Red arrowheads: genes apparently absent in one of the species or with no clear homologue.
Comparative Gene Content in 25 Completely Sequenced Stramenopile mtDNAsa
| Genes | Bacillariophyta [2 | Peronosporomycetes [4] | Phaeophyceae [12] | |||||
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| 23–25 | 22 | 23 | 24 (29) | 25 (30) | 23–25 | 15 (16) | 23 (45) | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | □ | □ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | □ | □ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | □ | □ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| ▪ | ▪ | □ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| □ | ▪ | □ | □ | ▪ | □ | □ | □ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | □ | □ | |
| ▪□ | ▪ | ▪ | ▪ | ▪ | ▪ | □ | ▪ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | □ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| ▪ | □ | ▪ | ▪ | ▪ | ▪ | ▪ | □ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| ▪ | □ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| ▪ | □ | ▪ | ▪ | ▪ | ▪ | ▪ | □ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| ▪ | □ | □ | □ | ▪ | ▪ | ▪ | ▪ | |
| ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| ▪□ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | ▪ | |
| □ | □ | ▪ | □ | ▪□ | □ | □ | □ | |
| ORFs | 1–5 | 4 | 5 | 10 (12) | 4 | 3–16 | 1–4 (*) | 4 |
NOTE.—Filled boxes, genes present; open boxes, genes absent. Figures in parentheses represent the total gene number when genes present in two copies are included.
Based on the gene content reported by Pérez-Brocal and Clark (2008). In square brackets is the number of taxa within the group.
Genes identified as present in Blastocystis by Wawrzyniak et al. (2008) who report only one ORF (*) but considered as unidentified ORFs by Pérez-Brocal and Clark (2008) who therefore have 4 (*).
Square brackets indicate the number of taxa within the group.
FMaximum likelihood-based phylogenetic tree of nad proteins. This tree is based on the concatenated amino acid sequences of 9 NADH dehydrogenase genes present in all the stramenopiles. Numbers beside the internal nodes are the maximum likelihood bootstrap values from 500 resamplings obtained with Phyml and the Bayesian Markov chain Monte Carlo posterior probability values. Black circles indicate 100% bootstrap support and 1.0 posterior probability values. Support values over 50% are shown adjacent to the corresponding nodes. Values below 50% are omitted. The clade containing all the stramenopiles is shaded. The scale bar represents the number of amino acids substitutions per residue.