Mitochondrial genome diversity in closely related species provides an excellent platform for investigation of chromosome architecture and its evolution by means of comparative genomics. In this study, we determined the complete mitochondrial DNA sequences of eight Candida species and analyzed their molecular architectures. Our survey revealed a puzzling variability of genome architecture, including circular- and linear-mapping and multipartite linear forms. We propose that the arrangement of large inverted repeats identified in these genomes plays a crucial role in alterations of their molecular architectures. In specific arrangements, the inverted repeats appear to function as resolution elements, allowing genome conversion among different topologies, eventually leading to genome fragmentation into multiple linear DNA molecules. We suggest that molecular transactions generating linear mitochondrial DNA molecules with defined telomeric structures may parallel the evolutionary emergence of linear chromosomes and multipartite genomes in general and may provide clues for the origin of telomeres and pathways implicated in their maintenance.
Mitochondrial genome diversity in closely related species provides an excellent platform for investigation of chromosome architecture and its evolution by means of comparative genomics. In this study, we determined the complete mitochondrial DNA sequences of eight Candida species and analyzed their molecular architectures. Our survey revealed a puzzling variability of genome architecture, including circular- and linear-mapping and multipartite linear forms. We propose that the arrangement of large inverted repeats identified in these genomes plays a crucial role in alterations of their molecular architectures. In specific arrangements, the inverted repeats appear to function as resolution elements, allowing genome conversion among different topologies, eventually leading to genome fragmentation into multiple linear DNA molecules. We suggest that molecular transactions generating linear mitochondrial DNA molecules with defined telomeric structures may parallel the evolutionary emergence of linear chromosomes and multipartite genomes in general and may provide clues for the origin of telomeres and pathways implicated in their maintenance.
Genome fragmented into multiple linear chromosomes terminating with telomeric arrays is a hallmark of the eukaryotic cell. In contrast, molecular architectures of genomes in prokaryotes and organelles vary substantially (1). For instance, certain animal mitochondrial DNAs (mtDNAs) are monomeric circles (2), kinetoplastid protists have networks of catenated circles (3,4), and most plants and fungal mitochondria contain linear (circularly permuted) concatemers that are heterogeneous in size (termed polydisperse linear DNA) (5–7). Finally, uniform linear mtDNAs terminating with defined terminal structures (mitochondrial telomeres) are found in a number of phylogenetically diverse taxa (8–28). In addition, mitochondria of numerous organisms contain multipartite genome; i.e. fragmented into multiple (from few to several hundred) circular or linear chromosomes (29–39).The predominant genome architecture may even differ in closely related organisms, i.e. conceptually different (monomeric linear versus circular-mapping and linear polydisperse) or containing varying proportions of topologically different mtDNA molecules. For example, mitochondria of Candida glabrata and Saccharomyces cerevisiae contain polydisperse linear DNA molecules, with a minor fraction of circles and lariat structures generated by rolling circle replication (40). In contrast, a recent study revealed that mitochondria of C. albicans lack significant amounts of circular mtDNA molecules, containing predominantly a network of branched DNA structures with linear polydisperse mtDNA molecules—interpreted as recombination-driven replication (41,42). An alternative interpretation would be mitochondrial replication just like in S. cerevisiae, and a reduced level of circular replicative DNA molecules, due to more effective recombination that is also responsible for branched structures. Whatever the replication mechanism, physical mapping approaches such as restriction mapping, DNA sequencing or PCR amplification indicate that any of these populations of linear polydisperse mtDNA molecules have predominantly circularly permuted sequences (i.e. can be reasonably well represented as single sequence records as deposited in GenBank, but should not be labeled circular as enforced by the database management). Such genome architectures are in most cases illustrated as circular maps. In contrast, species containing linear mtDNA molecules terminating with specific telomeric structures are best depicted in linear maps (43). Accordingly, we term mitochondrial genomes as circular- or linear-mapping.However, the picture is more complex in some yeast species with multiple forms of mitochondrial genomes. In Pichia pijperi and Williopsis mrakii, the linear mtDNA molecules terminating with telomeric hairpins (t-hairpins) coexist with monomeric and dimeric circles and linear polydisperse mtDNAs (11,44,45). Two types of DNA replicons occur in mitochondria of C. parapsilosis. Namely, the linear mtDNA molecules with telomeric arrays (t-arrays) of tandem repeats and multimeric minicircles derived exclusively from the sequence of mitochondrial telomeres (telomeric circles, t-circles) implicated in the telomere maintenance pathway (16,20,46–48).At present, little is known about the biological roles of different mtDNA forms and molecular mechanisms leading to architectural alterations of the mitochondrial genomes. Our ambition is to identify these mechanisms and to uncover their role in the evolution of linear chromosomes and corresponding telomeric structures. Therefore, we initiated a large-scale comparative study of the mitochondrial genomes in yeast species closely related to C. parapsilosis, whose mitochondrial telomeres share a number of structural features with their counterparts at the ends of nuclear chromosomes (49,50). In previous reports, we described that strains of C. metapsilosis and C. orthopsilosis possess either linear-mapping mitochondrial genome, with similar architecture as found in C. parapsilosis, or a circularized (mutant) form of the genome (51,52). Moreover, we found that C. subhashii contains yet another type of linear mitochondrial genome, which does not come with any detectable circular or concatemeric form. Instead, its linear mtDNA terminates with invertron-like telomeres, with a protein covalently bound to both 5′ termini (10). The four Candida species containing linear mitochondrial genomes are classified within the monophyletic ‘CTG clade’ of Hemiascomycetes (53,54). The same phylogenetic group also contains species with circular-mapping mitochondrial genomes such as C. albicans (55), Debaryomyces hansenii (56) and Pichia sorbitophila (57). The occurrence of closely related organisms or even strains of the same species with different mitochondrial genome architecture is in line with the hypothesis that linear- and circular-mapping mitochondrial genomes do not exhibit a radical difference, but that the genome forms may sporadically interconvert via currently unknown molecular mechanism(s) (11,52).In this study, we analyze the complete mtDNA sequences of eight additional Candida species. Our survey reveals that their molecular forms vary dramatically providing a unique opportunity for identification of structural elements and molecular mechanisms affecting the genome architecture. At the same time, our analysis provides an insight on the evolution of linear chromosomes and their telomeric structures.
MATERIALS AND METHODS
Yeast strains and cultivation
Yeast strains analyzed in this study are listed in Table 1. Yeasts were grown in liquid YPDG media (1% (w/v) yeast extract, 1% (w/v) peptone, 0.5% (w/v) glucose and 3% (v/v) glycerol), with constant shaking at 25–30°C until the late exponential phase.
Table 1.
Summary of the mitochondrial genome mapping in yeast species investigated in this study
Species
Strain
Mitochondrial genome
Genome forma
Mitochondrial telomeresb
Genome sizea (bp)
% G + C
GenBank Acc. No.
Referencec
Candida alai
NRRL Y-27739T
Cd,e
30 368e
20.9
HQ267968
This study
Candida albicans
CBS 562NT
Cd
This study
SC 5314
Ce
40 420e
32.2
AF285261
(85)
WO-1
n.d.
55 284 (unfinished)e
32.6
Broad institute
Candida blackwellae
CBS 10843T (AS2.3639T)
Cd
This study
Candida bohiensis
NRRL Y-27737T
Cd
This study
Candida buenavistaensis
NRRL Y-27734T
Cd
This study
Candida chauliodes
NRRL Y-27909T
Cd
This study
Candida corydali
NRRL Y-27910T
Cd
This study
Candida dubliniensis
CD36T (CBS 7987T)
Cd,e
34 732e
30.6
Sanger Institute, this study
Candida frijolesensis
NRRL Y-48060T
3xL1d,e
t-hairpins
37 215 (master chromosome)e
21.6
HM594866
This study
Candida gigantensis
NRRL Y-27736T
Cd
This study
Candida guilliermondii (Pichia guilliermondii)
ATCC6260T (CBS 566T)f
n.d.
23 890 (unfinished)e
25.7
Broad Institute
Candida jiufengensis
CBS 10846T (AS2.3688T)
Cd,e
29 672e
28.4
GU136397
This study
Candida labiduridarum
NRRL Y-27940T
3xL1d
n.d.
∼38 000 (master chromosome)d
This study
Candida maltosa
CBS 5611T
Cd,e
62 949b
22.0
This study
Candida metapsilosis
MCO 448T
L2d,e
t-arrays + t-circles
23 062 + 2nx 620e
25.1
AY962591
(51,52)
PL 448
Cd,e
22 175e
25.8
AY391853
(52,60)
Candida neerlandica
NRRL Y-27057T (CBS434T)
Cd,e
32 141e
24.4
EU334437
(60), this study
Candida orthopsilosis
MCO 456
Cd,e
22 528e
25.0
AY962590
(51,52)
MCO 457T
Cd
∼25 000d
(52)
MCO 471
L2d,e
t-arrays + t-circles
24 697 + 2nx 777e
25.0
DQ026513
(52,60)
Candida parapsilosis
CLIB 214T (CBS 604T)
L2d,e
t-arrays + t-circles
30 928 + 2nx 738e
23.8
DQ376035
(60)
SR 23 (CBS 7157)
L2d,e
t-arrays + t-circles
30 923 + 2nx 738e
23.8
X74411
(20,46)
Candida pseudojiufengensis
CBS 10847T (AS2.3693T)
Cd
This study
Candida salmanticensis
CBS 5121T
L2d,e
t-arrays + t-circles
25 718 + 2nx 104e
20.5
HQ267969
(20), this study
Candida sojae
CBS 7871T
Cd,e
39 415e
29.1
EF468347
This study
Candida subhashii
FR-392-06T (CBS 10753T)
L3d,e
t-proteins
29 795e
52.7
GU126492
(10), this study
Candida tetrigidarum
NRRL Y-48142T
Cd
This study
Candida tropicalis
CBS 94T
Cd,e
This study
MYA-3404
Ce
50 304e
37.3
Broad institute
Candida viswanathii
CBS 1924g
C ↔ L1d
palindrome/t-hairpins
∼39 000
This study
CBS 4024T
C ↔ L1d,e
palindrome/t-hairpins
39 242e
26.2
EF536359
This study
Clavispora (Candida) lusitaniae
CBS 6936T (ATCC42720T)
Cd
18 942 (unfinished)e
27.4
Broad Institute, this study
Debaryomyces hansenii (Candida famata)
CBS 767T
Cd,e
29 462e
27.0
DQ508940
(56), this study
Lodderomyces elongisporus
NRRL YB-4239T (CBS 2605T)
Cd,e
35 601e
28.8
Broad Institute, (52)
Pichia guilliermondii (Candida guilliermondii)
CBS 2030T (ATCC46036T)f
Cd
This study
aGenome form and size were deduced from PFGE analysis and DNA sequencing (see also Figures 1–3 and Supplementary Figure S1 for more details). Note that the terminal nucleotides of the shortest linear mtDNA molecules were not precisely mapped in the case of C. salmanticensis; C, circular-mapping; L1, L2, L3, linear-mapping, the types of linear-mapping genomes are defined according to their terminal structures (10,11,20); 3xL1, tripartite type I linear-mapping.
bMitochondrial telomeres classified according to Tomaska et al. (50).
cmtDNA mapping or sequence data were determined in this study, published previously or downloaded from public databases of the Broad Institute (http://www.broad.mit.edu/) and the Wellcome Trust Sanger Institute (http://www.sanger.ac.uk/).
dPFGE analysis.
eDNA sequencing.
fCandida guilliermodnii ATCC6260 is the anamorphic strain of Pichia guilliermondii.
gCBS 1924 is the type strain of Candida lodderae, its mtDNA displays similar restriction enzyme profile as the mtDNA from C. viswanathii CBS4024T.
n.d.—not done.
T and NT indicate the type and the neotype strain of the species, respectively.
Summary of the mitochondrial genome mapping in yeast species investigated in this studyaGenome form and size were deduced from PFGE analysis and DNA sequencing (see also Figures 1–3 and Supplementary Figure S1 for more details). Note that the terminal nucleotides of the shortest linear mtDNA molecules were not precisely mapped in the case of C. salmanticensis; C, circular-mapping; L1, L2, L3, linear-mapping, the types of linear-mapping genomes are defined according to their terminal structures (10,11,20); 3xL1, tripartite type I linear-mapping.bMitochondrial telomeres classified according to Tomaska et al. (50).cmtDNA mapping or sequence data were determined in this study, published previously or downloaded from public databases of the Broad Institute (http://www.broad.mit.edu/) and the Wellcome Trust Sanger Institute (http://www.sanger.ac.uk/).dPFGE analysis.eDNA sequencing.fCandida guilliermodnii ATCC6260 is the anamorphic strain of Pichia guilliermondii.gCBS 1924 is the type strain of Candida lodderae, its mtDNA displays similar restriction enzyme profile as the mtDNA from C. viswanathii CBS4024T.n.d.—not done.T and NT indicate the type and the neotype strain of the species, respectively.
Pulsed field gel electrophoresis
Screening for linear mitochondrial genomes was performed by pulsed field gel electrophoresis (PFGE) approach (10,11). Briefly, whole-cell DNA samples were prepared in agarose blocks, and separated in a 1.5% (w/v) agarose gel using a CHEF Mapper XA Chiller System (Biorad) with pulse switching set at 5–20 s (linear ramping and 120° angle) for 42 h at 5 V/cm. All separations were performed in 0.5× TBE buffer (45 mM Tris–borate, 1 mM EDTA, pH 8.0) at 10°C.
DNA sequencing and mitochondrial genome annotation
The mtDNA used for DNA sequencing was purified from isolated mitochondria. Procedures for mtDNA preparation, DNA sequence analysis and contig assembly were described previously (10,46,51,58–60). Genome annotations were performed using the MFannot tool (http://megasun.bch.umontreal.ca/cgi-bin/mfannot/mfannotInterface.pl), and manually adjusted according to the sequence alignments of deduced protein products with their homologs from closely related yeast species. Intron sequences were identified using MFannot, RNAweasel (61) and Rfam (62). The precise boundaries were confirmed by alignments of the corresponding sequence with an intron-less gene from a related species. Putative protein products encoded by intronic open reading frames were identified by searching the Pfam database (63) and classified accordingly.
Phylogenetic analysis
For phylogenetic analysis, we have used amino acid sequences of a protein set (Atp6–Atp8–Atp9–Cob–Cox1–Cox2–Cox3–Nad1–Nad2–Nad3–Nad4–Nad4L–Nad5–Nad6) encoded by 23 mitochondrial genomes. The sequences were translated using translation table 4 (mold, protozoan and coelenterate mitochondrial code), except for S. cerevisiae, where translation table 3 (yeast mitochondrial code) applies. The multiple alignments were performed by MUSCLE (64) and concatenated to one alignment. Alignment columns with >50% of gaps were filtered out, resulting in an alignment with 3932 sites. The phylogenetic tree was built with three different programs: PhyloBayes with the CAT substitution model (65), MrBayes (66) with JTT model of amino acid substitution (67) and γ-distributed rate variation between sites, and PhyML (68) with JTT model. Application of all three programs gives the same tree topology. The only differences occur within C. metapsilosis–C. orthopsilosis–C. parapsilosis clade due to high sequence similarity among these species. In the rest of the tree, all branches are highly supported by posterior probabilities (above 0.9 in MrBayes and PhyloBayes), and most branches have bootstrap value of 100 in PhyML. Placement of C. alai in the tree has high posterior probability in both Bayesian programs, but low bootstrap values in PhyML.
Gene order comparison
To infer possible ancestral gene order, we used protein coding, rRNA and tRNA genes of 16 mitochondrial genomes from the ‘CTG clade’. Non-conserved genes that occur only in some species (i.e. trnM3 in D. hansenii and P. sorbitophila; dpoBa, dpoBb and orf756 in C. subhashii) were omitted in this analysis. We have reconstructed a possible history of rearrangements using a simple double cut and join model (DCJ) (69). The DCJ model is based on parsimony and includes commonly considered rearrangement operations, such as reversal, translocation, chromosome circularization, linearization, fusion and fission. There is an efficient algorithm to compute the parsimonious distance of two genomes in DCJ model (69); however, an exact algorithm for inferring ancestral gene orders for a given set of present day genomes is not known. To infer the ancestral gene order under DCJ, we have implemented a local optimization procedure that in each iteration attempts to improve the solution by choosing new ancestral gene orders from a local neighborhood using a dynamic programming algorithm (70). The DCJ model does not handle genomes with duplicated genes. To resolve recent duplications in some of the genomes (C. albicans, C. maltosa, C. sojae, C. viswanathii), we removed duplicated genes, and included both possible forms of the genomes as alternatives in the corresponding leaves. Similarly, both isomers are allowed in the genomes that include long inverted repeats (C. alai, C. albicans, C. maltosa, C. neerlandica, C. sojae, L. elongisporus). In each leaf, one of the alternative orders is chosen as a representative, so as to minimize the overall parsimony cost. Finally, we penalize occurrence of multiple circular chromosomes, or combinations of linear and circular chromosomes in ancestral genomes.
Enzymatic mapping of termini
Approximately 1 µg mtDNA aliquots were treated with exonuclease III (ExoIII; New England Biolabs) or BAL-31 nuclease (New England Biolabs) according to the manufacturer’s instructions, for increasing time periods. After enzyme inactivation (ExoIII for 20 min at 70°C; BAL-31 for 10 min at 65°C in the presence of EDTA), the mtDNA was precipitated with ethanol, dissolved in water, digested with a restriction endonuclease and electrophoretically separated in a 0.9% agarose gel. The labeling of mtDNA termini with T4 polynucleotide kinase has been performed essentially as described previously (20).
DNA hybridization probes
Southern hybridization of PFGE separated yeast DNA samples (Figure 1) was performed with a probe containing an equimolar mixture of PCR products derived from cox2 (345 bp) and nad4 (374 bp) of corresponding species. The following PCR primers were used: 5′-TAGATGTNCCWACWCCWTGAG-3′ and 5′-AYTCRTATTTTCAATATCATTG-3′ (cox2); 5′-AGGTATHWTGGTWAARACACC-3′ and 5′-CAGGWGAWACDAAWCCATG-3′ (nad4). For C. subhashii, the equivalent PCR primers were 5′-CGTCCCAACACCATGAGG-3′ and 5′-ACTCGTACTTCCAGTACCACTG-3′ (cox2); 5′-AGGGATCATGGTCAAGACG-3′ and 5′-CTGGTGAGACTAGCCCGTG-3′ (nad4). In subsequent experiments, we used the following probes: P-668 (668 bp fragment amplified by PCR from the C. frijolesensis mtDNA using primers 5′-ATAATGGGTCAGTGAGTT-3′ and 5′-ACGTTCTCTAGCAGTTGA-3′), EH-1350 (1350 bp EcoRV-HindIII fragment from C. frijolesensis mtDNA), H-1030 (1030 bp HindIII fragment from C. neerlandica mtDNA), and Oligo-32 (32 nt oligonucleotide 5′-AATGAGATGAGGAAGTAAAGGGATAAGGATAA-3′, corresponding to a palindrome sequence in C. viswanathii mtDNA).
Figure 1.
PFGE analysis of the yeast mtDNAs. The whole-cell DNA samples were separated by PFGE using a CHEF Mapper XA Chiller System (Biorad), blotted onto a nylon membrane and hybridized with mtDNA-derived probes as described in ‘Material and Methods’ section. Lane 1—C. viswanathii CBS 4024; lane 2—C. sojae CBS 7871; lane 3—C. maltosa CBS 5611; lane 4—C. neerlandica NRRL Y-27057; lane 5—C. alai NRRL Y-27739; lane 6—C. labiduridarum NRRL Y-27940; lane 7—C. frijolesensis NRRL Y-48060; lane 8—C. subhashii CBS 10753; lane 9—C. jiufengensis CBS 10846; lane 10—C. albicans CBS 562. Note that three discrete bands migrating in the region <50 kb represent three linear mitochondrial chromosomes in C. labiduridarum and C. frijolesensis (lanes 6 and 7). In contrast, four bands in C. alai (lane 5) do not hybridize with mtDNA probes and correspond to linear DNA plasmids (data not shown).
PFGE analysis of the yeast mtDNAs. The whole-cell DNA samples were separated by PFGE using a CHEF Mapper XA Chiller System (Biorad), blotted onto a nylon membrane and hybridized with mtDNA-derived probes as described in ‘Material and Methods’ section. Lane 1—C. viswanathiiCBS 4024; lane 2—C. sojaeCBS 7871; lane 3—C. maltosaCBS 5611; lane 4—C. neerlandicaNRRL Y-27057; lane 5—C. alai NRRL Y-27739; lane 6—C. labiduridarum NRRL Y-27940; lane 7—C. frijolesensisNRRL Y-48060; lane 8—C. subhashii CBS 10753; lane 9—C. jiufengensis CBS 10846; lane 10—C. albicansCBS 562. Note that three discrete bands migrating in the region <50 kb represent three linear mitochondrial chromosomes in C. labiduridarum and C. frijolesensis (lanes 6 and 7). In contrast, four bands in C. alai (lane 5) do not hybridize with mtDNA probes and correspond to linear DNA plasmids (data not shown).
DNA sequence accession numbers
Mitochondrial DNA sequences described in this work were deposited in the GenBank nucleotide sequence data library under following accession numbers: HQ267968 (C. alai NRRL Y-27739), HM594866 (C. frijolesensisNRRL Y-48060), GU136397 (C. jiufengensis CBS10846), EU267175 (C. maltosaCBS5611), EU334437 (C. neerlandicaNRRL Y-27057), EF468347 (C. sojaeCBS7871), EF536359 (C. viswanathiiCBS4024), HQ267969 (C. salmanticensisCBS5121).
RESULTS AND DISCUSSION
PFGE analysis of yeast mitochondrial genomes
We employed the PFGE approach (11,40) to distinguish between polydisperse and uniform linear mtDNA molecules in samples from 24 yeast species (Table 1 and Figure 1). In experimental conditions used for the screening (see ‘Materials and Methods’ section), the uniform linear mtDNA molecules from C. subhashii (10) migrate as a discrete band of ∼30 kb (Figure 1, lane 8). In contrast, a smear is typical for C. albicans [Figure 1, lane 10; (42)] and most other yeast species, with mtDNA-derived probes revealing a strong signal between ∼20 and 50 kb (Figure 1 and Table 1). This indicates that most examined species contain polydisperse linear mtDNAs. On average, their lengths are larger than the genome unit, apparently containing more than one genome equivalent per molecule. However, in C. labiduridarum and C. frijolesensis we detected three discrete bands migrating in the region between 15 and 50 kb (Figure 1, lanes 6–7). This points to a possibility that these species contain three uniform chromosomes in mitochondria (see below). Four distinct bands were also found in C. alai. However, these bands did not hybridize with the mtDNA probe (Figure 1, lane 5) and subsequent sequence analysis revealed that they correspond to linear DNA plasmids (to be described elsewhere).
Mitochondrial genome isomers in species with polydisperse mtDNA
Since the mitochondrial genome of C. albicans occurs in two isomers (42,71), we examined the presence of genome isomers also in other species with polydisperse mtDNAs. Restriction enzyme analysis of the mtDNAs from C. maltosa, C. neerlandica and C. sojae identified four minor mtDNA fragments (e.g. ∼15, ∼13, ∼9 and ∼7 kb in the case of C. neerlandica mtDNA digested with BamHI and PvuII) indicating that they contain a circular-mapping genome with large repeated regions generating two isomers that are present in a stoichiometric ratio (Figure 2A–C). Subsequent sequence analysis confirmed that all three genomes contain large inverted repeats (LIRs) that could be involved in the flip-flop recombination generating genome isomers. This is in line with the observation that the LIRs represent recombination hotspots in C. albicans mtDNA (42). The LIRs were also detected in the C. alai mtDNA sequence, but the presence of contaminating linear plasmids rendered the identification of isomers by restriction enzyme analysis inconclusive.
Figure 2.
Restriction enzyme analysis reveals circular-mapping genome isomers. Candida neerlandica (A), C. maltosa (B) and C. sojae (C) mtDNAs were digested with the restriction enzyme combinations BamHI + PvuII, ApaLI + MluI and AgeI + ApaLI, respectively, and electrophoretically separated in 0.9% (w/v) agarose gel. Black arrows indicate the DNA fragments present in both isomers, grey arrows label the pair of fragments specific to the isomers I or II. Schemes illustrate both isomers with the position of inverted repeats (shown bold within the inner circle) and corresponding restriction enzyme fragments (the outer circle).
Restriction enzyme analysis reveals circular-mapping genome isomers. Candida neerlandica (A), C. maltosa (B) and C. sojae (C) mtDNAs were digested with the restriction enzyme combinations BamHI + PvuII, ApaLI + MluI and AgeI + ApaLI, respectively, and electrophoretically separated in 0.9% (w/v) agarose gel. Black arrows indicate the DNA fragments present in both isomers, grey arrows label the pair of fragments specific to the isomers I or II. Schemes illustrate both isomers with the position of inverted repeats (shown bold within the inner circle) and corresponding restriction enzyme fragments (the outer circle).Physical mapping uncovered genome isomers also in mitochondria of C. viswanathii. However, in this case we observed two BamHI (∼7 and ∼3.5 kb) and Eco91I (∼5 and ∼2.5 kb) bands, with sizes corresponding to a monomer (lower faint band) and a dimer (upper band). Southern hybridization indicates that the two fragments have the same sequence (Figure 3A), and that the ratio between them varied in different preparations (data not shown). In this case, the complete mtDNA sequence of C. viswanathii contains LIRs arranged as a large palindrome (see below), suggesting that the palindrome (represented by the upper band) could be resolved into the smaller fragment (the lower band), i.e. a linear mtDNA with defined terminal sequences/structures. To support this idea we treated isolated mtDNA with BAL-31 nuclease prior to restriction enzyme analysis. This experiment demonstrated that the lower faint band was the only mtDNA fragment sensitive to BAL-31 nuclease activity (Figure 3B). On the other hand, this fragment seems to be refractory to both exonuclease III and T4 polynucleotide kinase (data not shown). This indicates that the termini of resolved linear mtDNA molecules are protected by a special arrangement. We presume that, similar to species from the genera Williopsis and Pichia (44), the linear mtDNA molecules terminate with single-stranded covalently closed telomeric hairpins (t-hairpins). In contrast, we did not identify genome isomers in C. jiufengensis, which lacks LIRs.
Figure 3.
Circular- and linear-mapping genome isomers in mitochondria of C. viswanathii. (A) The mtDNA samples were digested with BamHI (lane 1) or Eco91I (lane 2) and separated in 1% (w/v) agarose gel. The Southern blot was hybridized with radioactively labeled oligonucleotide probe Oligo-32 derived from the large palindrome (shown as dashed arrows). The solid arrows show positions of the palindrome and the presumed terminal fragments of resolved linear molecules capped with t-hairpins. Scheme shows presumed circular- (I) and linear-mapping (II) genome isomers. (B) Isolated mtDNA was treated or untreated with BAL-31 nuclease (0.2 U for 5 min). The mtDNA was then extracted from the reaction, digested with BamHI or Eco91I endonuclease, and electrophoretically separated. Note that the fragments containing presumed t-hairpins were sensitive to BAL-31 nuclease (indicated by asterisk).
Circular- and linear-mapping genome isomers in mitochondria of C. viswanathii. (A) The mtDNA samples were digested with BamHI (lane 1) or Eco91I (lane 2) and separated in 1% (w/v) agarose gel. The Southern blot was hybridized with radioactively labeled oligonucleotide probe Oligo-32 derived from the large palindrome (shown as dashed arrows). The solid arrows show positions of the palindrome and the presumed terminal fragments of resolved linear molecules capped with t-hairpins. Scheme shows presumed circular- (I) and linear-mapping (II) genome isomers. (B) Isolated mtDNA was treated or untreated with BAL-31 nuclease (0.2 U for 5 min). The mtDNA was then extracted from the reaction, digested with BamHI or Eco91I endonuclease, and electrophoretically separated. Note that the fragments containing presumed t-hairpins were sensitive to BAL-31 nuclease (indicated by asterisk).
Multipartite (fragmented) linear-mapping genomes
As mentioned above, PFGE analysis revealed the presence of three distinct mtDNA bands in C. labiduridarum and C. frijolesensis. The size of the largest band (chromosome I) corresponds approximately to the sum of the middle (chromosome II) and the smallest (chromosome III) bands (Figure 1, lane 6–7), suggesting that the longest molecules might represent a master chromosome split into two non-identical fragments. Therefore, we analyzed PFGE-separated mtDNA molecules of C. labiduridarum and C. frijolesensis by Southern hybridization using two probes derived from distant regions of chromosome I (Figure 4A). The probe P-668 hybridized with chromosomes I and III and also detected some mtDNA in wells and smears. The pattern detected by the probe H-1030 was similar, except that it hybridized with chromosome II instead of chromosome III. To confirm that all three chromosomes are linear, we treated C. frijolesensis mtDNA with BAL-31 nuclease, exonuclease III, and T4 polynucleotide kinase. We observed that the presumed terminal restriction enzyme fragments were all sensitive to BAL-31 nuclease (Figure 4B). Interestingly, the treatment with exonuclease III revealed two subpopulations of terminal fragments differing in their accessibility to the enzyme activity (Figure 4C). This indicates that the ends of the linear mtDNA molecules might adopt a covalently closed structure such as a t-hairpin, which in a fraction of molecules is opened and thus accessible to exonuclease III and T4 polynucleotide kinase (Figure 4D). Southern hybridization revealed four faint restriction fragments of the mtDNA (designated as LII + LIII, RII + RIII, LII + RIII, LIII + RII) that are refractory to all three enzymes (Figure 4B–D). The fragment sizes correspond to junctions between chromosomes II and III, and their presence shows that the master chromosome occurs in four flip-flop isomers (i.e. LIII − RIII − LII − RII, LIII − RIII − RII − LII, RIII − LIII − LII − RII and RIII − LIII − RII − LII). This suggests that fragmented linear-mapping genomes (i.e. uniform linear mtDNAs with resolved termini corresponding to chromosomes I–III) may coexist with circular-mapping genome forms (i.e. polydisperse linear mtDNAs lacking homogeneous terminal structures that correspond to the smear observed in PFGE).
Figure 4.
Multipartite linear-mapping genomes in C. labiduridarum and C. frijolesensis (A) PFGE separated samples of C. labiduridarum NRRL Y-27940 (lane 1) and C. frijolesensis NRRL Y-48060 (lane 2) were blotted onto a nylon membrane and hybridized with the radioactively labeled probes P-668 and H-1030 (regions hybridizing with both probes are shown as dashed lines). Presumed master (I) and two smaller chromosomes (II and III) are indicated. Note that the master chromosome occurs in four isomers (i.e. LIII − RIII − LII − RII (shown in the scheme), LIII − RIII − RII − LII, RIII − LIII − LII − RII and RIII − LIII − RII − LII. ‘L’ and ‘R’ indicate the left and the right telomere, respectively). The C. frijolesensis mtDNA (∼1 µg) was digested with BAL-31 nuclease (B) or exonuclease III (ExoIII) (C) as indicated. After nuclease inactivation, the DNA was digested with EcoRV, separated in 0.9% (w/v) agarose gel. The Southern blots were hybridized with the P-668 and EH-1350 probes specific for the left and the right arm of the master chromosome, respectively (see ‘Materials and Methods’ section). Arrows show the positions of the left (L) and right (R) terminal fragments and their fusions (R + R, R + L and L + L). Note that after ExoIII treatment the telomeric fragments form two subpopulations that differ in their sensitivity to the ExoIII treatment. This indicates that the linear mtDNA molecules possess an open structure with 5′ overhang or blunt end or covalently closed t-hairpin. (D) The C. frijolesensis mtDNA was treated with antarctic phosphatase and labeled with [γ32P]ATP and T4 polynucleotide kinase. The mtDNA was then digested with restriction endonuclease EcoRV (lane 1) or BglII (lane 2) and separated in 0.8% (w/v) agarose gel (left panel). The gel was fixed in 10% (v/v) methanol/10% (v/v) acetic acid for 30 min, dried overnight and autoradiographed (right panel). Arrows indicate the position of telomeric fragments containing the open structures accessible to terminal labeling.
Multipartite linear-mapping genomes in C. labiduridarum and C. frijolesensis (A) PFGE separated samples of C. labiduridarum NRRL Y-27940 (lane 1) and C. frijolesensisNRRL Y-48060 (lane 2) were blotted onto a nylon membrane and hybridized with the radioactively labeled probes P-668 and H-1030 (regions hybridizing with both probes are shown as dashed lines). Presumed master (I) and two smaller chromosomes (II and III) are indicated. Note that the master chromosome occurs in four isomers (i.e. LIII − RIII − LII − RII (shown in the scheme), LIII − RIII − RII − LII, RIII − LIII − LII − RII and RIII − LIII − RII − LII. ‘L’ and ‘R’ indicate the left and the right telomere, respectively). The C. frijolesensis mtDNA (∼1 µg) was digested with BAL-31 nuclease (B) or exonuclease III (ExoIII) (C) as indicated. After nuclease inactivation, the DNA was digested with EcoRV, separated in 0.9% (w/v) agarose gel. The Southern blots were hybridized with the P-668 and EH-1350 probes specific for the left and the right arm of the master chromosome, respectively (see ‘Materials and Methods’ section). Arrows show the positions of the left (L) and right (R) terminal fragments and their fusions (R + R, R + L and L + L). Note that after ExoIII treatment the telomeric fragments form two subpopulations that differ in their sensitivity to the ExoIII treatment. This indicates that the linear mtDNA molecules possess an open structure with 5′ overhang or blunt end or covalently closed t-hairpin. (D) The C. frijolesensis mtDNA was treated with antarctic phosphatase and labeled with [γ32P]ATP and T4 polynucleotide kinase. The mtDNA was then digested with restriction endonuclease EcoRV (lane 1) or BglII (lane 2) and separated in 0.8% (w/v) agarose gel (left panel). The gel was fixed in 10% (v/v) methanol/10% (v/v) acetic acid for 30 min, dried overnight and autoradiographed (right panel). Arrows indicate the position of telomeric fragments containing the open structures accessible to terminal labeling.The occurrence of multipartite genomes raises a question concerning the distribution of mtDNA molecules during cell division. Since we do not have any evidence for a specific segregation machinery analogous to the mitotic spindle ensuring the proper segregation of individual chromosomes in mitochondria, we propose that presumed circular-mapping genome and/or chromosome I may represent the ‘master copies’ playing a key role in the genome transmission.
Genetic organization of the mitochondrial genomes
With the aim to investigate the mitochondrial genome architecture in more detail and to identify sequence and/or structural features involved in the genome architecture alterations, we determined the complete mtDNA sequences of eight yeast species; C. alai, C. frijolesensis, C. jiufengensis, C. maltosa, C. neerlandica, C. salmanticensis, C. sojae and C. viswanathii (Supplementary Figure S1). The sizes of sequenced mtDNAs range from 25.7 (C. salmanticensis) to 62.9 kb (C. maltosa), and their G + C content varies from 20.5 (C. salmanticensis) to 29.1% (C. sojae) (Table 1). The genomes contain essentially the same set of conserved genes including the genes for subunits of ATP synthase (atp6,8,9), apocytochrome b (cob), cytochrome c oxidase (cox1,2,3), NADH:ubiquinone oxidoreductase (nad1,2,3,4,4L,5,6), large and small ribosomal RNA (rnl and rns) and a complete set of transfer RNAs (trn). The C. salmanticensis mtDNA has two additional genes: i.e. rps3/var1 coding for a subunit of the mitochondrial ribosome and rnpB/rpm1 for the RNA subunit of RNase P (Table 2). One or more genes are duplicated in C. maltosa, C. sojae and C. viswanathii mtDNAs as they are localized within the LIRs.
Table 2.
Genes present in the mitochondrial genomes sequenced in this study
Species
Protein subunits of oxidative phosphorylation complexes
Protein synthesis and RNA processing
I
III
IV
V
Ribosomal protein
rRNA
tRNA
RNase P RNA
nad1
nad2
nad3
nad4
nad4L
nad5
nad6
cob
cox1
cox2
cox3
atp6
atp8
atp9
rps3
rns
rnl
trn
rnpB
Candida alai
+
+
+
+
+
+
+
+
+
+
+
+a
+a
+
−
+
+
24 + 1a
−
Candida frijolesensis
+
+
+
+
+
+
+
+
+
+
+
+
+
+
−
+
+
24
−
Candida jiufengensis
+
+
+
+
+
+
+
+
+
+
+
+
+
+
−
+
+
24
−
Candida maltosa
+
+
+
+
+
+
+
+
+
+
+
+
+
+
−
+
+
24 + 2a
−
Candida neerlandica
+
+
+
+
+
+
+
+
+
+
+
+
+
+
−
+
+
24
−
Candida salmanticensis
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
24
+
Candida sojae
+
+a
+a
+
+
+
+
+a
+
+a
+
+
+
+a
−
+a
+
24 + 5a
−
Candida viswanathii
+
+
+
+
+
+
+
+
+
+
+
+
+
+
−
+
+
24 + 1a
−
aDuplicated within LIRs.
Genes present in the mitochondrial genomes sequenced in this studyaDuplicated within LIRs.The presence of tRNATrp with an UCA anticodon, and the absence of an S. cerevisiae homolog of the abnormal tRNAThr1 with 8-nt in the anticodon loop (decoding CUN as threonine), indicate that UGA and CUN codons are recognized as tryptophan and leucine, respectively. The codon assignment was verified by multiple alignments of protein sequences, which led us to the conclusion that UGA(Trp) is the only deviation from the standard genetic code.Introns were identified in cob, cox1, nad5 and rnl genes (Table 3). Their number and distribution among species vary from one (in C. alai) to 10 (in C. frijolesensis). The introns predominantly belong to group I and in many cases contain an open reading frame (ORF) coding for putative LAGLIDADG and GIY-YIG type endonucleases (group I) or reverse transcriptases/maturases (group II introns).
Identified intron sequencesOur previous studies (46,51) revealed that C. metapsilosis, C. orthopsilosis and C. parapsilosis have the same gene order in their mtDNAs. Likewise, C. frijolesensis, C. neerlandica and C. viswanathii have the same genetic organization, except that trnM1 has been duplicated and inverted in the latter species (Figure 5A). All other species examined exhibit unique gene arrangements, with synteny reduced to four conserved gene clusters (i.e. trnN-atp6, cox1-atp9, cob-nad3 and rnl-cox2) in C. jiufengensis versus C. parapsilosis (Figure 5B).
Figure 5.
Comparison of mitochondrial gene orders among species from C. neerlandica–C. tropicalis (A), L. elongisporus–C. parapsilosis (B) and C. subhashii–C. alai (C) lineages. Individual genes and blocks with conserved gene order are shown by identical colors. Duplicated regions are framed. The symbols wedge and caret indicate the orientation of genes, TEL (telomeres) and LIR. In L. elongisporus, both LIRs (LIR*) consist of two regions of ∼4 kb separated by 574 and 95 bp-long unique sequences.
Comparison of mitochondrial gene orders among species from C. neerlandica–C. tropicalis (A), L. elongisporus–C. parapsilosis (B) and C. subhashii–C. alai (C) lineages. Individual genes and blocks with conserved gene order are shown by identical colors. Duplicated regions are framed. The symbols wedge and caret indicate the orientation of genes, TEL (telomeres) and LIR. In L. elongisporus, both LIRs (LIR*) consist of two regions of ∼4 kb separated by 574 and 95 bp-long unique sequences.
LIRs and the genome architecture
Analysis of the collected yeast mtDNA sequences reveals that the most prominent feature of the genome architecture is the presence of relatively long duplications, arranged as inverted repeats (LIRs). These elements are present in all but one (C. jiufengensis) mtDNA, and their lengths vary from 109 bp (present as the subterminal repeats in the linear mtDNA of C. salmanticensis) to 14 379 bp (in C. maltosa) thus substantially expanding the genome length (Supplementary Figure S1). In most cases, LIRs comprise non-coding sequences or contain only a few genes or gene fragments. In C. sojae, the 8658 bp LIRs represent a block duplication of 11 genes (i.e. trnA, cox2, trnM1, cob, trnM2, rns, trnI, atp9, trnR1, nad2, nad3). In most cases, the pairs of LIRs are separated by long unique regions. However, we noticed two special arrangements of LIRs: (i) in C. viswanathii identical copies of 4162 bp inverted repeats separated by a 798 bp non-coding sequence form a large palindrome and (ii) the two different inverted repeats (734 and 1229 bp) are separated by a 228 bp non-coding sequence. The second arrangement is present in the region of C. frijolesensis chromosome I, corresponding to the junction of chromosomes II and III. Since the two smaller chromosomes possess different LIRs at their termini, the chromosome I contains different sequences at the opposite ends.As mentioned above, we demonstrate the presence of genome isomers in C. frijolesensis, C. labiduridarum, C. maltosa, C. neerlandica, C. sojae and C. viswanathii, but neither in C. jiufengensis lacking the LIRs nor in C. salmanticensis, which has the LIRs in subterminal regions of the linear mtDNA extended by t-arrays (2nx 104 bp).Since the LIRs represent a suitable substrate for homologous recombination generating the genome isomers, the recombination transactions may be implicated in alterations of the genome architecture, which may in turn depend on LIR arrangements. We notice that the arrangement of LIRs in the mtDNA sequences correlates with the mitochondrial genome architecture. While C. maltosa, C. neerlandica and C. sojae have two circular-mapping genome isomers, C. viswanathii contains circular- and linear-mapping isomers, and C. frijolesensis possesses circular- and multipartite linear-mapping genome forms. This suggests that specifically arranged LIR copies (such as in C. viswanathii and C. frijolesensis) play a role as resolution elements, allowing interconversion between the circular- and linear-mapping genome forms (C. viswanathii), eventually leading to genome fragmentation into multiple linear chromosomes (C. frijolesensis).The comparison of C. neerlandica, C. viswanathii and C. frijolesensis underlines the presumed role of LIRs in the genome architecture. All three species are phylogenetically closely related, with essentially the same mitochondrial genome organization. However, they differ in LIR arrangements and genome architecture (i.e. circular mapping; circular- and linear- or multipartite linear mapping).Large palindromes are structural elements suitable for resolution of uniform linear molecules from circular and/or linear polydisperse mtDNAs. In general, such sequences are known hotspots of genomic instability due to their inherent ability to form cruciform or hairpin structures resulting in DNA replication stall sites (72–74). While in Escherichia coli, palindromic sequences cause double-stranded breaks induced by SbcCD complex (75), in the spirochete Borrelia the palindromes are processed by telomere resolvase (ResT) into t-hairpins (76). The latter mechanism parallels the palindrome resolution involved in the conversions of circular replication intermediates into linear-mapping mtDNAs in Williopsis and Pichia species (44) as well as in the formation of linear mtDNA monomers from linear and circular dimeric replication intermediates in the cilliate Paramecium (77) and the crustacean Armadillidium vulgare (78), respectively. Our results indicate that the palindrome in the C. viswanathii mtDNA is resolved into t-hairpins suggesting that linear mtDNA molecules with defined termini are generated during this process. On the basis of PFGE analysis (Figure 1, lane 1), we assume that the fraction of polydisperse mtDNA molecules fully processed into uniform mtDNA monomers is relatively low. In contrast, we detected three linear chromosomes, a smear of polydisperse mtDNA molecules and flip-flop isomers of chromosome I in C. frijolesensis samples. This indicates that circular-mapping genome forms are processed into chromosome I and further resolved into two smaller chromosomes.Terminal inverted repeats appear to be a typical feature of linear-mapping mitochondrial genomes occurring in phylogenetically distant organisms (9,10,15,19–21,24,25,27,34,38,44,51,79,80) indicating that they arose by analogous evolutionary trajectories. These repeats usually consist of non-coding sequences, and sometimes a few genes. The linear-mapping mitochondrial genome of the stramenopile Proteromonas lacertae possesses even 15.6 kb-long terminal LIRs with about two-thirds of genes (21). Therefore, we assume that the terminal LIRs are remnants of resolution elements that emerge from segmental duplications of mitochondrial genome. Alternatively, the may derive from invertrons such as linear mitochondrial DNA plasmids that are known to integrate into mtDNAs (10).We took advantage of the mtDNA-derived data and analyzed phylogenetic relationship of investigated yeast species by Bayesian and maximum likelihood methods. All three methods resulted essentially in the same tree topology. The tree calculated by PhyloBayes (Figure 6) is supported by high posterior probabilities on most branches and is consistent with the study of Fitzpatrick et al. (53) indicating that the monophyletic ‘CTG clade’ splits into two major lineages: the first represented by D. hansenii and P. sorbitophila, and the second by the C. albicans–C. parapsilosis group. Incorporation of additional recently described species (81–84) in the phylogenetic analysis revealed more detailed relationship among species in the latter lineage. This lineage splits into three subgroups (i.e. L. elongisporus–C. parapsilosis, C. maltosa–C. tropicalis and C. subhashii–C. alai) each containing species with circular- and linear-mapping mtDNAs. The occurrence of different types of mitochondrial telomeres (i.e. t-arrays in C. metapsilosis, C. orthopsilosis and C. parapsilosis; t-hairpins in C. viswanathii and C. frijolesensis; inverton-like telomeres with a t-protein in C. subhashii) in each subgroup is consistent with the tree topology. Similar to C. parapsilosis, the linear mitochondrial genome of C. salmanticensis terminates with t-arrays, although the sequence of its mitochondrial telomeres is different. Since C. salmanticensis belongs to early branching hemiascomycete lineages this linear mitochondrial genome emerged independently on linear mtDNAs in species from the ‘CTG clade’, presumably by employing similar molecular mechanism(s).
Figure 6.
Phylogenetic tree based on mtDNA encoded proteins. Phylogenetic tree was calculated from multiple sequence alignments of mitochondrial proteins by PhyloBayes (65). Posterior probabilities are shown at corresponding branches. The mitochondrial genome forms were reported elsewhere (10,20,22,51,55–57,60,86–89) or analyzed in this study. C—circular-mapping genome; L1, L2 and L3 indicate the type of linear-mapping genomes according to the telomeric structures, i.e. t-hairpins, t-arrays and invertron like with t-proteins, respectively; 3xL1—tripartite linear-mapping genome with t-hairpins (see Table 1 for details).
Phylogenetic tree based on mtDNA encoded proteins. Phylogenetic tree was calculated from multiple sequence alignments of mitochondrial proteins by PhyloBayes (65). Posterior probabilities are shown at corresponding branches. The mitochondrial genome forms were reported elsewhere (10,20,22,51,55–57,60,86–89) or analyzed in this study. C—circular-mapping genome; L1, L2 and L3 indicate the type of linear-mapping genomes according to the telomeric structures, i.e. t-hairpins, t-arrays and invertron like with t-proteins, respectively; 3xL1—tripartite linear-mapping genome with t-hairpins (see Table 1 for details).
Reconstruction of ancestral mitochondrial genomes
Our previous reports (10,46) as well as the comparison of mtDNAs examined in this study revealed a number of conserved gene clusters. This prompted us to use the gene orders of 15 species from the ‘CTG clade’ for reconstruction of possible ancestral mitochondrial genomes in corresponding nodes of the phylogenetic tree (Figure 7), using the DCJ model (69) and local optimization procedures (70). For example, the presumed ancestor of C. parapsilosis and C. jiufengensis, which differ by the genome form, had a circular-mapping genome. We propose a simple evolutionary scenario leading to the linear-mapping mitochondrial genome (Figure 8). In this scenario, a resolution of a recombination transaction between the gene pairs cox2-trnN and cob-atp9 results in the formation of mtDNA with the gene order observed in circularized mutants of C. orthopsilosis and C. metapsilosis and its subsequent linearization between the genes nad3 and atp6 generates a linear mtDNA with genetic organization observed in the linear-mapping mitochondrial genomes of C. metapsilosis, C. orthopsilosis and C. parapsilosis. In contrast, recombination between the gene pairs rnl-cox1 and atp6-nad3 in the presumed ancestral genome leads to the identical arrangement of genes as is present in the C. jiufengensis mtDNA.
Figure 7.
Reconstruction of ancestral genomes. The figure shows possible ancestral gene orders and the number of events on each branch found by local optimization for the DCJ model. The intervals show range of numbers of events in equally parsimonious histories. Red connectors in the gene orders for present day and ancestral genomes represent inferred breakpoints on the branch to the nearest ancestor. Due to space constraints, the figure omits tRNA genes (even though the reconstructions were performed including tRNAs); full gene orders including tRNAs are shown in Supplementary Figure S2. The figure includes duplicated genes, which were restored after ancestral gene order reconstruction. Note that the linear and its circularized (mutant) mitochondrial genome forms of C. orthopsilosis were used in the analysis (51,60).
Figure 8.
A hypothetical pathway leading to mitochondrial genomes of C. parapsilosis and C. jiufengensis from the most recent common ancestor. We propose a simple scenario allowing delineation of the gene order found in both the circular-mapping genome of C. jiufengensis and the ‘true’ linear genome of C. parapsilosis from a reconstructed circular-mapping ancestor inferred by the analysis shown in Figure 7 and Supplementary Figure S2. The process includes reciprocal recombination events between the gene pairs (i) rnl/cox1 and nad3/atp6 or (ii) cox2/trnN and cob/atp9, followed by opening of the circular-mapping genome between the genes nad3 and atp6 in the latter case. Note that the circular-mapping genome intermediate prior its linearization has identical gene order as the mitochondrial telomere mutants of C. metapsilosis and C. orthopsilosis (51,52).
Reconstruction of ancestral genomes. The figure shows possible ancestral gene orders and the number of events on each branch found by local optimization for the DCJ model. The intervals show range of numbers of events in equally parsimonious histories. Red connectors in the gene orders for present day and ancestral genomes represent inferred breakpoints on the branch to the nearest ancestor. Due to space constraints, the figure omits tRNA genes (even though the reconstructions were performed including tRNAs); full gene orders including tRNAs are shown in Supplementary Figure S2. The figure includes duplicated genes, which were restored after ancestral gene order reconstruction. Note that the linear and its circularized (mutant) mitochondrial genome forms of C. orthopsilosis were used in the analysis (51,60).A hypothetical pathway leading to mitochondrial genomes of C. parapsilosis and C. jiufengensis from the most recent common ancestor. We propose a simple scenario allowing delineation of the gene order found in both the circular-mapping genome of C. jiufengensis and the ‘true’ linear genome of C. parapsilosis from a reconstructed circular-mapping ancestor inferred by the analysis shown in Figure 7 and Supplementary Figure S2. The process includes reciprocal recombination events between the gene pairs (i) rnl/cox1 and nad3/atp6 or (ii) cox2/trnN and cob/atp9, followed by opening of the circular-mapping genome between the genes nad3 and atp6 in the latter case. Note that the circular-mapping genome intermediate prior its linearization has identical gene order as the mitochondrial telomere mutants of C. metapsilosis and C. orthopsilosis (51,52).
On the origin of ‘true’ linear mitochondrial genomes
In a number of species, replication of linear-mapping genomes relies on circular intermediates (monomers or dimers) generating linear concatemers via rolling circle and/or recombination-dependent replication mechanisms (44,45). In contrast, no genome-sized circles or genome concatemers were detected in mitochondria of C. parapsilosis and C. subhashii, which harbor uniform linear mtDNA molecules terminating with t-arrays and t-proteins, respectively (10,20,46). Hence, these linear-mapping genomes can be considered as ‘true’ linear genomes. This is further underlined by the presence of active telomere maintenance pathways ensuring their complete replication. We posit that linear-mapping genomes with terminal structures such as t-hairpins correspond to a transient state between circular mapping and ‘truly’ monomeric linear mitochondrial genomes. T-hairpins formed at linear mtDNA termini provide substrates for terminus elongation by an active telomere maintenance pathway [e.g. recombination-dependent mechanism operating in C. parapsilosis mitochondria (49,50)]. Once this pathway ensures the stability of a linear genome, circular replication intermediates and/or polydisperse mtDNAs become dispensable for the system. Conversely, a defect in the telomere-maintenance pathway may result in intramolecular end-to-end fusion, thus re-establishing the original circular-mapping mitochondrial genome architecture (Figure 9).
Figure 9.
A hypothesis on the origin of linear chromosomes in yeast mitochondria. (A) A circular-mapping genome represented by linear polydisperse mtDNAs with [e.g. C. glabrata, S. cerevisiae (40)] or potentially without [e.g. C. albicans (42)] a fraction of circular molecules. (B) Genome rearrangements may result in the formation of a palindrome allowing the resolution of a circular-mapping genome into linear chromosomes with defined terminal structures such as t-hairpins. Such genomes were observed in several species [e.g. P. pijperi and W. mrakii (11,44), C. viswanathii] containing uniform linear mtDNAs, with t-hairpins resolved from circular molecules (monomers and dimers) and/or linear polydisperse mtDNAs. (C) Multiple resolution elements (i.e. two types of LIRs) allow the genome fragmentation into multiple linear chromosomes (e.g. C. frijolesensis, C. labiduridarum). (D) The termini of the linear chromosomes may provide a substrate for further elongation via active maintenance mechanisms, such as the t-circle dependent pathway observed in ‘true’ linear genomes (e.g. C. metapsilosis, C. orthopsilosis, C. parapsilosis, C. salmanticensis) (49,50). Defects in the telomere maintenance result in the genome circularization via end-to-end fusion, as in mitochondrial telomere mutants of C. metapsilosis and C. orthopsilosis containing circular-mapping genomes (51,52).
A hypothesis on the origin of linear chromosomes in yeast mitochondria. (A) A circular-mapping genome represented by linear polydisperse mtDNAs with [e.g. C. glabrata, S. cerevisiae (40)] or potentially without [e.g. C. albicans (42)] a fraction of circular molecules. (B) Genome rearrangements may result in the formation of a palindrome allowing the resolution of a circular-mapping genome into linear chromosomes with defined terminal structures such as t-hairpins. Such genomes were observed in several species [e.g. P. pijperi and W. mrakii (11,44), C. viswanathii] containing uniform linear mtDNAs, with t-hairpins resolved from circular molecules (monomers and dimers) and/or linear polydisperse mtDNAs. (C) Multiple resolution elements (i.e. two types of LIRs) allow the genome fragmentation into multiple linear chromosomes (e.g. C. frijolesensis, C. labiduridarum). (D) The termini of the linear chromosomes may provide a substrate for further elongation via active maintenance mechanisms, such as the t-circle dependent pathway observed in ‘true’ linear genomes (e.g. C. metapsilosis, C. orthopsilosis, C. parapsilosis, C. salmanticensis) (49,50). Defects in the telomere maintenance result in the genome circularization via end-to-end fusion, as in mitochondrial telomere mutants of C. metapsilosis and C. orthopsilosis containing circular-mapping genomes (51,52).
Howard Hughes Medical Institute (grant HHMI 55005622 to J.N.); the Fogarty International Research Collaboration Award (2-R03-TW005654-04A1 to L.T.); European Community FP7 (IRG-224885 to T.V. and IRG-231025 to B.B.); Slovak Research and Development Agency (APVV 0024-07 and LPP-0164-06 to J.N., 20-001604 to L.T.); Canadian Research Chair program (to B.F.L.); Scientific Grant Agency of the Ministry of Education of Slovak republic (VEGA 1/0219/08 to J.N., 1/0132/09 to L.T., 1/0210/10 to T.V.); Comenius University (218/2009 to M.V.); Hungarian Scholarship Board based on the bilateral agreement between Hungary and Slovakia (workplan 0.9 b to Z.F.); The Canadian Research Chair Program (to B.F.L.). Funding for open access charge: Howard Hughes Medical Institute (HHMI 55005622).Conflict of interest statement. None declared.
Authors: Paul P Jung; Joseph Schacherer; Jean-Luc Souciet; Serge Potier; Patrick Wincker; Jacky de Montigny Journal: FEMS Yeast Res Date: 2009-06-12 Impact factor: 2.796
Authors: Joachim M Gerhold; Tiina Sedman; Katarina Visacka; Judita Slezakova; Lubomir Tomaska; Jozef Nosek; Juhan Sedman Journal: J Biol Chem Date: 2014-06-20 Impact factor: 5.157
Authors: Karen L Haag; Timothy Y James; Jean-François Pombert; Ronny Larsson; Tobias M M Schaer; Dominik Refardt; Dieter Ebert Journal: Proc Natl Acad Sci U S A Date: 2014-10-13 Impact factor: 11.205
Authors: Liang Ma; Da-Wei Huang; Christina A Cuomo; Sean Sykes; Giovanna Fantoni; Biswajit Das; Brad T Sherman; Jun Yang; Charles Huber; Yun Xia; Emma Davey; Geetha Kutty; Lisa Bishop; Monica Sassi; Richard A Lempicki; Joseph A Kovacs Journal: FASEB J Date: 2013-02-07 Impact factor: 5.191
Authors: Emily K Herman; Alexander L Greninger; Govinda S Visvesvara; Francine Marciano-Cabral; Joel B Dacks; Charles Y Chiu Journal: J Eukaryot Microbiol Date: 2013-01-29 Impact factor: 3.346