Literature DB >> 25663488

Massive and widespread organelle genomic expansion in the green algal genus Dunaliella.

Michael Del Vasto1, Francisco Figueroa-Martinez2, Jonathan Featherston3, Mariela A González4, Adrian Reyes-Prieto2, Pierre M Durand5, David Roy Smith6.   

Abstract

The mitochondrial genomes of chlamydomonadalean green algae are renowned for their highly reduced and conserved gene repertoires, which are almost fixed at 12 genes across the entire lineage. The sizes of these genomes, however, are much more variable, with some species having small, compact mitochondrial DNAs (mtDNAs) and others having expanded ones. Earlier work demonstrated that the halophilic genus Dunaliella contains extremely inflated organelle genomes, but to date the mtDNA of only one isolate has been explored. Here, by surveying mtDNA architecture across the Chlamydomonadales, we show that various Dunaliella species have undergone massive levels of mitochondrial genomic expansion, harboring the most inflated, intron-dense mtDNAs available from chlorophyte green algae. The same also appears to be true for their plastid genomes, which are potentially among the largest of all plastid-containing eukaryotes. Genetic divergence data are used to investigate the underlying causes of such extreme organelle genomic architectures, and ultimately reveal order-of-magnitude differences in mitochondrial versus plastid mutation rates within Dunaliella.
© The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  Chlamydomonas; Polytoma; intron; mitochondrial DNA; plastid DNA

Mesh:

Substances:

Year:  2015        PMID: 25663488      PMCID: PMC5322560          DOI: 10.1093/gbe/evv027

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

The mitochondrial genomes of chlamydomonadalean green algae (Chlorophyta, Chlorophyceae) are somewhat of a contradiction (Leliaert et al. 2012). On the one hand, they have the smallest gene contents of any known organelle genomes from the Archaeplastida (Plantae sensu lato), encoding 7–8 proteins, 2 rRNAs, and 1–3 tRNAs (Smith, Hamaji, et al. 2013; Smith, Hua, et al. 2013), and they can also be very small (<13.5 kb) and compact (>80% coding) (Smith, Hua, et al. 2010). On the other hand, certain chlamydomonadalean mitochondrial DNAs (mtDNAs) are distended with repeats and introns, and composed almost entirely of noncoding nucleotides (Smith and Lee 2010). One species with a particularly bloated mitochondrial genome is Dunaliella salina—a unicellular biflagellate, which lives in hypersaline environments, can accumulate large amounts of β-carotene, and is a prime candidate for biofuel production (Oren 2005). Complete mtDNA sequencing of D. salina CCAP 19/18, isolated from the Hutt Lagoon in Western Australia, revealed an unprecedentedly high intron density for a green alga (∼1.5 intron per gene) as well as vast, repeat-rich intergenic regions (Smith, Lee, et al. 2010). An equally expanded genome was also uncovered in the plastid, implying that similar forces are shaping both organelle DNAs (Smith, Lee, et al. 2010). Various studies have used chlamydomonadalean algae, including the model organism Chlamydomonas reinhardtii and its close multicellular relative Volvox carteri, to explore the evolution of genome size (e.g., Smith and Lee 2010). These investigations suggest that mutation rate, DNA maintenance machineries, and random genetic drift have a major role in fashioning organelle chromosomes (Smith and Lee 2010; Hua et al. 2012; Smith, Hamaji, et al. 2013). Such studies, however, have yet to be applied to Dunaliella species, and it is still unknown if other members of the genus have inflated organelle genomes. Here, we survey mitochondrial genome size and content within and outside the Dunaliella lineage. We show that although the mtDNA gene repertoire is nearly fixed across the Chlamydomonadales, there is an approximately 4-fold variation in genome size and an 18-fold variation in intron content, with Dunaliella species having among the most expanded mtDNAs of all explored green algae. The same is likely true for their plastid genomes as well. The levels of organelle DNA divergence between distinct D. salina strains are used to investigate the potential forces underpinning such massive levels of genomic expansion.

Sequencing New Chlamydomonadalean Mitochondrial Genomes

As part of an ongoing, collaborative initiative, we have been sequencing and characterizing organelle genomes from diverse chlamydomonadalean species (Hamaji et al. 2013; Smith, Hamaji, et al. 2013). Among these species are two distinct Dunaliella isolates, which were collected from the La Rinconada hypersaline pond in the Atacama Desert in northern Chile: D. salina CONC-001 and Dunaliella viridis CONC-002 (Gómez-Silva et al. 1990; González et al. 1999; Gómez and González 2005). Closely related to Dunaliella are two other algae that we have also been investigating: the freshwater flagellate Chlamydomonas leiostraca SAG 11-49 and the free-living, nonphotosynthetic unicell Polytoma uvella UTEX 964, which are a model duo for studying the loss of photosynthesis (Figueroa-Martinez et al. 2015). Next-generation sequencing of total cellular DNA from these algae followed by mitochondrial genome assembly yielded both expected and unexpected results. At first glance, all four mtDNAs appear similar to one another and to those of various other chlamydomonadaleans: they map as single circular chromosomes (Bendich 1993), have identical gene compliments (representing 7 proteins, 3 tRNAs, and 2 rRNAs), and contain sections of overlapping gene order (fig. 1). Moreover, in each of the algae the mitochondrial large and small subunit rRNA genes are fragmented and scrambled into six and three coding modules, respectively (fig. 1), which is a common theme throughout the order, with species from the Reinhardtinia clade (Nakada et al. 2008) displaying even greater levels of rRNA gene fragmentation (Smith, Hamaji, et al. 2013). Three of the four genomes also have notably high guanine and cytosine (GC) contents: 39% (C. leiostraca), 46.7% (D. salina CONC-001), 47.1% (D. viridis CONC-002), and 55% (P. uvella). These elevated GC values are not entirely unexpected: The Chlamydomonadales is known to harbor species with exceptionally high mtDNA GC compositions, including the colorless alga Polytomella capuana (57.2%) as well as some members of the Lobochlamys genus (∼50–65%; Smith 2012). That said, D. salina CCAP 19/18, unlike its Chilean counterparts, has a low mitochondrial GC content (34.4%), underscoring that organelle nucleotide content can differ drastically even among closely related species and strains.
F

Mitochondrial genomic architecture and expansion within the Chlamydomonadales. Tree of chlamydomonadalean algae, showing mitochondrial genome conformation, size, intron content, and expansion (red); branching order based on the phylogenetic analyses of Nakada et al. (2008), González et al. (2009), Smith, Hua, et al. (2013), and Figueroa-Martinez et al. (2015), as well as that in supplementary figure S1, Supplementary Material online. Venn diagram highlighting shared and unique introns (based on insertion sites) among the three available Dunaliella mtDNAs. Mitochondrial genome maps for Dunaliella salina CONC-001 (outer) and Chlamydomonas leiostraca (inner). Dunaliella salina CONC-001 and D. salina CCAP 19/18 have identical mtDNA gene orders and contents (not including introns, intronic ORFs, or pseudogenes), and so do C. leiostraca and Polytoma uvella. Breakpoints in mitochondrial gene synteny between D. salina and D. viridis are marked with a double-diamond symbol (red). Note: the mitochondrial genome size and intron number for C. reinhardtii and V. carteri can vary due to optional introns in some strains (Smith, Hamaji, et al. 2013). Superscript 1 indicates Chlorogonium capillatum SAG 12-2e was formerly called Chlorogonium elongatum SAG 12-2e.

Mitochondrial genomic architecture and expansion within the Chlamydomonadales. Tree of chlamydomonadalean algae, showing mitochondrial genome conformation, size, intron content, and expansion (red); branching order based on the phylogenetic analyses of Nakada et al. (2008), González et al. (2009), Smith, Hua, et al. (2013), and Figueroa-Martinez et al. (2015), as well as that in supplementary figure S1, Supplementary Material online. Venn diagram highlighting shared and unique introns (based on insertion sites) among the three available Dunaliella mtDNAs. Mitochondrial genome maps for Dunaliella salina CONC-001 (outer) and Chlamydomonas leiostraca (inner). Dunaliella salina CONC-001 and D. salina CCAP 19/18 have identical mtDNA gene orders and contents (not including introns, intronic ORFs, or pseudogenes), and so do C. leiostraca and Polytoma uvella. Breakpoints in mitochondrial gene synteny between D. salina and D. viridis are marked with a double-diamond symbol (red). Note: the mitochondrial genome size and intron number for C. reinhardtii and V. carteri can vary due to optional introns in some strains (Smith, Hamaji, et al. 2013). Superscript 1 indicates Chlorogonium capillatum SAG 12-2e was formerly called Chlorogonium elongatum SAG 12-2e. Further inspection revealed even more differences among the mitochondrial genomes. The D. viridis CONC-002 and D. salina CONC-001 mtDNAs, with respective lengths of approximately 46 and 50 kb, are around 3–4-times larger than those of C. leiostraca (14 kb) and P. uvella (17.4 kb), and on average greater than 2-times larger than other available chlamydomonadalean mtDNAs, despite mitochondrial gene content being almost identical across the entire lineage. What’s more, the C. leiostraca and P. uvella mtDNAs contain no introns, are densely packed (≤30% noncoding), and have matching gene orders, whereas D. viridis CONC-002 has 13 introns and D. salina CONC-001 has 17, and both species are distended with noncoding mtDNA (>70%) and have differing gene orders (fig. 1). In fact, the mtDNA of the Chilean D. salina described here is almost twice as large as the previously reported mtDNA of the Australian D. salina CCAP 19/18 (∼50 kb vs. 28.3 kb), which, like the Chilean strain, also has an abundance of introns (18) (fig. 1; Smith, Lee, et al. 2010). In all, 48 putative introns are distributed among the three sequenced Dunaliella mitochondrial genomes, representing approximately 65% of all identified mitochondrial introns from the Chlamydomonadales. When looking at the location of the Dunaliella introns, 28 have unique insertion sites (situated within four different protein-coding genes and five different rRNA-coding modules), 17 are found in at least two of the isolates, 11 are located in only a single isolate, and 8 contain an intronic open reading frame (ORF; fig. 1). All but one of the introns appear to be of group I affiliation, and the decaying remnants of intronic ORFs were uncovered in the intergenic DNA of both D. salina CONC-001 and D. viridis CONC-002 (fig. 1), suggesting a complex history of intron loss and gain throughout the evolution of Dunaliella mitochondria.

Genomic Upheaval within Dunaliella Mitochondria

The mitochondrial genomes of Dunaliella algae seem to be in a state of upheaval (fig. 2). They are the most bloated and intron-rich mtDNAs observed from the Chlorophyta (fig. 2A), have undergone substantial genomic rearrangements, and are riddled with short, simple repeats, which have spread throughout the intergenic, intronic, and, in some cases, coding regions (figs. 2B and C). These repeats differ in sequence among the three Dunaliella mtDNAs, but can be folded into similar hairpin structures (Smith, Lee, et al. 2010), and in some respects resemble the palindromic organelle repeats from V. carteri (Smith and Lee 2009). Repeat-like insertions were also uncovered within coding regions of the Dunaliella mtDNAs, resulting in elongated exonic sequences relative to those from other closely related mtDNAs (fig. 2B). All of this is in stark contrast to the compact and “ordered” mtDNAs of C. leiostraca and P. uvella, which are devoid of introns and have few repeats (figs. 1, 2A, and 2C).
F

Mitochondrial genomic upheaval in Dunaliella. (A) Noncoding content (x axis) versus intron abundance (y axis) for chlamydomonadalean mtDNAs. Noncoding statistics were calculated following the methods of Smith, Lee, et al. (2010). Chlamydomonas leiostraca, Polytoma uvella, and the three Dunaliella isolates are marked on plot. (B) Insertions within Dunaliella salina CONC-001 (red) and D. viridis (blue) mtDNA protein-coding genes relative to those of D. salina CCAP 19/18. These insertions are absent from the C. leiostraca mtDNA. (C) Dot plot similarity matrices of chlamydomonadalean mitochondrial genomes. Each matrix contains an mtDNA sequence plotted against itself (size of the genome is marked in the bottom right corner). Dots within the matrix highlight regions of nucleotide sequence similarity. The main diagonal represents the mtDNA on the x axis matching against its partner on the y axis. Dots adjacent to the main diagonal correspond to repetitive DNA. Plots were generated with JDotter (Brodie et al. 2004), using a plot size of 1,000 bases/pixel and a sliding window size of 50.

Mitochondrial genomic upheaval in Dunaliella. (A) Noncoding content (x axis) versus intron abundance (y axis) for chlamydomonadalean mtDNAs. Noncoding statistics were calculated following the methods of Smith, Lee, et al. (2010). Chlamydomonas leiostraca, Polytoma uvella, and the three Dunaliella isolates are marked on plot. (B) Insertions within Dunaliella salina CONC-001 (red) and D. viridis (blue) mtDNA protein-coding genes relative to those of D. salina CCAP 19/18. These insertions are absent from the C. leiostraca mtDNA. (C) Dot plot similarity matrices of chlamydomonadalean mitochondrial genomes. Each matrix contains an mtDNA sequence plotted against itself (size of the genome is marked in the bottom right corner). Dots within the matrix highlight regions of nucleotide sequence similarity. The main diagonal represents the mtDNA on the x axis matching against its partner on the y axis. Dots adjacent to the main diagonal correspond to repetitive DNA. Plots were generated with JDotter (Brodie et al. 2004), using a plot size of 1,000 bases/pixel and a sliding window size of 50. Given what we know about the Chlamydomonadales, the most recent common ancestor of the group likely had a compact, circular-mapping, intron-poor mtDNA (fig. 1). However, at some point after the divergence of the Dunaliella and Polytoma/C. leiostraca lineages, the former experienced severe mtDNA inflation (fig. 1), characterized by the proliferation of intronic and repetitive DNA (figs. 2A and C). Mitochondrial genomic expansion is also observed in other chlorophyte lineages, including the volvocine line (Smith, Hamaji, et al. 2013) and certain members of the Sphaeropleales (Fučíková et al. 2014), but it is not as pronounced as that within Dunaliella, and is mostly a product of increases in intergenic sequence rather than a combination of repeats, introns, and other kinds of genomic embellishments. Although all three of the available Dunaliella mitochondrial genomes are expanded, there is, nonetheless, an impressive amount of variation in size, intron, and noncoding content among them (28–50 kb; 13–18 introns per genome; ∼60–75% noncoding DNA; figs. 1 and 2A). As is apparent from self-similarity dot plots (fig. 2C), repeat content scales positively with mitochondrial genome size across the Dunaliella genus, and the order as a whole (Smith, Hamaji, et al. 2013). Other types of genomic embellishments also go up in abundance relative to mtDNA size. For example, the Chilean D. salina has more nonstandard ORFs and pseudogenes than its Australian counterpart (fig. 1). The same, however, cannot be said for overall intron number, which again is highest in the Australian isolate, reinforcing that intron abundance alone does not account for the inflated architectures of the Dunaliella mitochondrial genomes.

Coexpansion of the Chloroplast Genomes

Using the same datasets employed for the mitochondrial genome assemblies, we explored the plastid genomic architectures of D. salina CONC-001 and D. viridis CONC-002 (those of C. leiostraca and P. uvella, which are relatively compact, will be described elsewhere). The plastid DNAs (ptDNAs) of CONC-001 and CONC-002 appear to be equally or even more expanded than the neighboring mitochondrial genomes. De novo assemblies of paired-end Illumina reads from each of the algae gave dozens of short (∼0.5–17 kb) ptDNA contigs (table 1). For both Dunaliella isolates, the plastid contigs were AT-rich, had one to a few genes apiece, harbored many introns, and contained extensive repeats, which were found in almost all of the identified intronic and intergenic regions (table 1). These repeats prevented the assembly of larger contigs and the bridging of smaller ones, which is a recurring problem in green algal plastid genomics—and one that is hampering the assembly of the D. salina CCAP 19/18 nuclear genome (United States Department of Energy Joint Genome Institute, DOE JGI). For example, palindromic repeats hindered the assembly of the approximately 525 kb plastid chromosome of V. carteri (Smith and Lee 2009), and the recently sequenced ptDNA of the ulvophyte Acetabularia acetabulum, which is approximately 1 Mb and repeat-rich, resulted in a highly fragmented assembly (>60 contigs; de Vries et al. 2013).
Table 1

Dunaliella salina CONC-001 and D. viridis CONC-002 Plastid Genome Assembly Statistics

Plastid DNA Contig StatisticsD. salina CONC-001D. viridis CONC-002
Total number8072
Size range (kb)0.6–8.90.5–16.8
Average size (kb)2.22.4
Average read coverage/base43.531.2
Average number genes/contiga0.81.2
Proportion of genes identified (%)b6588
Overall intron count3213
Accumulative size (kb)178.3171.8
Predicted plastid genome size (kb)c>370>280

aNumber is <1 because in some cases a single gene is distributed over multiple contigs.

bPercentage of plastid-encoded genes identified in the CONC-001 and CONC-002 assemblies relative to the genes found in the completely sequenced D. salina CCAP 19/18 ptDNA. Does not include nonstandard genes, such as intronic ORFs; duplicate genes, such as the rRNAs, were counted only once.

cAssuming missing genes are found on single approximately 2 kb contigs and an average gap between contigs of approximately 1 kb.

Dunaliella salina CONC-001 and D. viridis CONC-002 Plastid Genome Assembly Statistics aNumber is <1 because in some cases a single gene is distributed over multiple contigs. bPercentage of plastid-encoded genes identified in the CONC-001 and CONC-002 assemblies relative to the genes found in the completely sequenced D. salina CCAP 19/18 ptDNA. Does not include nonstandard genes, such as intronic ORFs; duplicate genes, such as the rRNAs, were counted only once. cAssuming missing genes are found on single approximately 2 kb contigs and an average gap between contigs of approximately 1 kb. Based on the number, proportion, and density of genes identified on the ptDNA contigs as well as the accumulative size of these contigs (table 1), we estimate that the Chilean D. salina and D. viridis plastid genomes are at least 370 kb and 280 kb, respectively, and are possibly much larger, which makes them giants among all available ptDNAs. This is consistent with the previously published D. salina CCAP 19/18 ptDNA (assembled with Sanger sequencing reads; Smith, Lee, et al. 2010), which at approximately 265 kb and greater than 65% noncoding, is one of the largest ptDNAs ever observed. Like the corresponding mitochondrial genomes, the CONC-001 and CONC-002 ptDNAs are intron-rich (CONC-001 likely has >30 introns), and repeats have infiltrated both the intronic and the intergenic regions. But unlike the mtDNAs, the plastid introns are mostly of group II affiliation and the plastid repeats do show sequence similarities across the three Dunaliella ptDNAs. Together, these data are an excellent illustration of how mitochondrial and plastid genomes can arrive at similar extremes in a single organism or cell—in this case, the coexpansion of mtDNA and ptDNA within members of the Dunaliella genus. Similar observations have come from other eukaryotes, including V. carteri for which both the mtDNA and the ptDNA have uncharacteristically long intergenic regions and large amounts of repetitive DNA (Smith and Lee 2009, 2010). The trend can also go in the opposite direction. For instance, the mitochondrial and plastid genomes of many prasinophyte algae, such as Ostreococcus tauri, are paragons of compactness (Robbens et al. 2007), as are those of the red alga Cyanidioschyzon merolae (Ohta et al. 2003). Convergent evolution between mtDNA and ptDNA can be seen throughout the eukaryotic tree, and in many cases both organelle genomes have independently evolved the same features and taken on similar genomic embellishments. However, when this is observed, the intensity of genomic embellishment is typically more pronounced in mitochondria than in plastids (Barbrook et al. 2010; Smith and Keeling 2015). But this does not appear to be true for Dunaliella: the ptDNA has been pushed to an equivalent or greater extreme than the mtDNA, at least in terms of noncoding content.

Genetic Divergence: High in the Mitochondrion, Low in the Plastid

To better understand the evolution of expanded Dunaliella organelle DNAs, we studied the levels of genetic divergence between them. Using a maximum-likelihood approach, we measured the rates of organelle nucleotide substitution between D. salina CONC-001 and CCAP 19/18, which revealed huge, unparalleled differences in mtDNA versus ptDNA divergence for green algae (table 2 and supplementary table S1, Supplementary Material online).
Table 2

mtDNA and ptDNA Substitution Rates for Two Geographically Distinct Isolates of Dunaliella salina: CONC-001 (Chile) and CCAP 19/18 (Australia)

Substitutions per Site
Substitution Rate Ratios (pt:mt)
ptDNAmtDNA
Synonymous sites
    Average (SD)0.09 (0.32)1.164 (0.52)1:12.9
    Concatenation0.0740.9891:13.4
Nonsynonymous sites
    Average (SD)0.005 (0.02)0.043 (0.02)1:8.6
    Concatenation0.0040.0411:10.3
    dN/dS (SD)0.084 (0.18)0.042 (0.02)––
rRNAsa0.0560.1071:1.9
Introns>>0.1Unalignable––
Intergenic regions>>0.1Unalignable––

Note.—SD, standard deviation; dN/dS, ratio of nonsynonymous to synonymous substitutions per site, based on averages of individual loci not concatenated datasets. The substitution rate statistics for the individual loci within mitochondrial and plastid genomes are shown in supplementary table S1, Supplementary Material online.

aFor mtDNA and ptDNA includes the concatenation of all rRNA-coding regions.

mtDNA and ptDNA Substitution Rates for Two Geographically Distinct Isolates of Dunaliella salina: CONC-001 (Chile) and CCAP 19/18 (Australia) Note.—SD, standard deviation; dN/dS, ratio of nonsynonymous to synonymous substitutions per site, based on averages of individual loci not concatenated datasets. The substitution rate statistics for the individual loci within mitochondrial and plastid genomes are shown in supplementary table S1, Supplementary Material online. aFor mtDNA and ptDNA includes the concatenation of all rRNA-coding regions. Substitution rates between CONC-01 and CCAP 19/18 were 2–13 times greater in the mtDNA than in the ptDNA (table 2). The average number of substitutions per synonymous site (dS) for mtDNA protein-coding genes (∼1.2) was about 13 times that of ptDNA proteins (0.09). Concatenated gene data sets gave an identical trend (table 2 and supplementary table S1, Supplementary Material online). The rates of substitution at nonsynonymous sites (dN) followed a similar pattern: dN of the mtDNA (0.04) was approximately an order of magnitude greater than that of the ptDNA (0.005); however, both organelle genomes had low dN/dS ratios (<0.1; table 2), indicating strong purifying selection at nonsynonymous sites. Substitution rates for rRNA genes were also greater for the mtDNA than for the ptDNA (0.1 vs. 0.06; table 2). We tried measuring substitutions within noncoding DNA as well. The mitochondrial introns and intergenic regions from CONC-001 and CCAP 19/18 were unalignable, implying very high levels of substitution (>>1 per site), much higher than those observed for the mtDNA synonymous sites. Although many of the plastid noncoding regions were also unalignable, we were able to align ten complete ptDNA intergenic spacers, which when concatenated (4.6 kb) harbored approximately 0.15 substitutions per site (table 2), which is greater than that observed for plastid synonymous sites. Like with the coding data, these findings are consistent with a higher rate of nucleotide substitution in the mitochondrion as compared with the plastid. Similar overall conclusions come from substitution rate analyses of D. salina versus D. viridis, but the levels of substitution are saturated making it difficult to gauge the relative rates of substitution between the mtDNA and ptDNA, and between synonymous and noncoding sites. Organelle substitution rate statistics are available for a number of plastid-bearing eukaryotes, including various green algae (supplementary table S2, Supplementary Material online; Hua et al. 2012). Compared with other species, the levels of synonymous-site divergence between CONC-001 and CCAP 19/18 are high for the mtDNA and low for the ptDNA. For example, the average synonymous-site divergence between Chlamydomonas globosa SAG 7.73 (formerly called C. incerta) and C. reinhardtii is approximately 0.30 for both mitochondrial- and plastid-located genes (Hua et al. 2012), which contrast sharply with the values from D. salina: approximately 1.2 for the mtDNA and 0.09 for the ptDNA. Moreover, the relative levels of dS between the mitochondrial and plastid compartments of D. salina (13:1) are among the highest yet observed from green, red, or glaucophyte algae (supplementary table S2, Supplementary Material online). What do these extreme differences in dS mean? At the very least, the high levels of substitution within the mtDNA suggest that D. salina CONC-001 and CCAP 19/18 represent distinct populations or “species” (González et al. 2009). They also point toward major differences in the organelle mutation rates. If synonymous nucleotide positions are assumed to be neutrally evolving, then the synonymous-site divergence between species or distinct populations can provide an entrée into mutation rate (Kimura 1983). For D. salina, there is a 13-fold difference in dS between mitochondrial-located versus plastid-located genes (table 2), indicating that there is much higher mutation rate in the mitochondrion than the plastid. These findings could also be a sign of high and low absolute mutation rates within mitochondrion and plastid, respectively, but this is speculative as we do not know how long ago CONC-001 and CCAP 19/18 shared a common ancestor. If these two compartments do have drastically different mutational patterns, then it would, on the face of it, conflict with their similarly expanded genomic architectures.

Unraveling the Roots of Organelle Genomic Expansion

In many respects, the Dunaliella organelle genomes have parallel architectures to land plant mtDNAs, which are renowned for their expansive intergenic regions, large densities of introns and repeats, and overall high levels of sequence upheaval (Sloan et al. 2012). Similar to the Dunaliella organelle genomes, land plant mtDNAs boast impressive variations in synonymous substitution rates, both within and among genomes (Sloan et al. 2012; Richardson et al. 2013; Zhu et al. 2014). For instance, the enormous, multichromosomal mtDNAs of Silene conica and S. noctiflora have extraordinarily high synonymous substitution rates (Sloan et al. 2012), whereas the tulip tree has one of the most mutationally quiescent mitochondrial genomes of any eukaryote (Richardson et al. 2013). Land plant mtDNAs, as with Dunaliella, also show vastly different rates of substitution in coding versus noncoding regions, a feature that that has provided major insights into the process of organelle genome expansion (Christensen 2013). After finding that the modes of molecular evolution differ between coding and noncoding regions in Arabidopsis mtDNA, Christensen (2013) proposed that land plants employ two types of mtDNA repair, each of which has shaped mitochondrial genomic architecture: “Within genes, a bias toward gene conversion would keep measured mutation rates low, whereas in noncoding regions, break-induced replication (BIR) explains the expansion[s] and rearrangements. Both processes are types of double-strand break repair, but enhanced second-strand capture in transcribed regions versus BIR in nontranscribed regions can explain the two seemingly contradictory features of plant mitochondrial genome evolution—the low mutation rates in genes and the striking expansions of noncoding sequences.” The same argument can be made for the Dunaliella mitochondrial and plastid chromosomes. Both organelle DNAs have much lower substitution rates within synonymous sites as compared with the noncoding regions. This is especially apparent for the ptDNA, which has an average dS of only 0.09 but for which most of the intergenic regions are unalignable (table 2). If the intergenic regions in the Dunaliella organelle genomes are repaired via BIR it would help explain the large amounts of genomic expansion and rearrangements observed between them (fig. 1) as well as the widespread intergenic turmoil (fig. 2B and C). Indeed, BIR within organelle systems is known to be inaccurate and cause rearrangements, chimeric genes, and expansions (Davila et al. 2011). Conversely, accurate repair of coding organelle DNA in Dunaliella, by homologous recombination or gene conversion, for example, would account for the comparatively low synonymous substation rates, particularly in the plastid. Nevertheless, there is still an order of magnitude variation in dS for the mitochondrial versus plastid protein-coding genes of D. salina, indicating that the efficiency of DNA repair, be it by BIR or gene conversion or homologous recombination, differs greatly between these compartments. In plants and algae, virtually all of the organelle DNA maintenance machineries are nuclear encoded, and their proficiency are known to vary between species and compartments (Sloan and Taylor 2012). A Dunaliella nuclear genome sequence is not yet available, but work is presently underway by the DOE JGI to generate one. Investigations of nuclear-encoded, organelle-targeted DNA repair proteins will likely give further insights into the evolution of the Dunaliella organelle genomes. There is also the potential that the extremely salty habitats in which many Dunaliella species reside is in some way impacting the molecular evolution of their organelle DNAs. Whatever the root cause of their inflated architectures, the Dunaliella mitochondrial and plastid genomes are veritable heavyweights among green algal organelle genomes.

Methods and Materials

Dunaliella salina CONC-001 and D. viridis CONC-002 (Microalgae Culture Collection, Universidad de Concepción, Chile; updated acronyms CCM-UDEC 001 and CCM-UDEC 002, respectively) were grown in J/1 medium, supplemented with 15% NaCl w/v, at 18 °C under a 14-h light/10-h dark cycle, and harvested as previously described (González et al. 1999; Gómez and González 2005). Total DNA from each isolate was extracted using the Qiagen DNeasy Plant Mini Kit (Qiagen, Venlo, Limburg, NL) with liquid nitrogen disruption. Illumina sequencing libraries were prepared with the Nextera DNA Sample Preparation Kit (Illumina, San Diego, CA) and sequenced on an Illumina MiSeq platform (v2 chemistry; 250 × 250 paired-end sequencing reads). Chlamydomonas leiostraca SAG 11-49 (Culture Collection of Algae at the University of Göttingen, Germany) was grown in Volvox medium (Provasoli and Pintner 1960) at 18 °C under a 14-h light/10-h dark cycle and with constant shaking (200 revolutions per minute [rpm]). Polytoma uvella UTEX 964 (Culture Collection of Algae at the University of Texas at Austin) was cultured in Polytoma medium under the same conditions, but without shaking. Cells were harvested in logarithmic growth by centrifugation at 4,500 rpm for 10 min, and whole-genomic DNA from each strain was extracted by standard phenolchloroform methods and ethanol precipitation. Library preparation and Illumina sequencing (HiSeq 2000) were performed at the Roy J. Carver Center for Genomics at the University of Iowa (100 × 100 paired-end sequencing reads). The organelle genomes of all four algae were assembled de novo with Ray v2.2.0 (Boisvert et al. 2010), using k-mers of 21, 27, 31, and 37, and separately with CLC Genomics Workbench v6.0.4 (Qiagen, Prismet, DK), using a word size of 20, bubble size of 50, and paired-end scaffolding. The resulting Ray and CLC contigs were scanned for organelle sequences using BLAST-based methods and the mitochondrial and plastid genomes of chlamydomonadalean algae (fig. 1) as queries. Hits to organelle DNA were assembled into larger contigs using read-mapping approaches with Geneious v7.1.4 (Biomatters Ltd., Auckland, NZ). Organelle introns were identified with RNAweasel (http://megasun.bch.umontreal.ca/RNAweasel/) and through alignments with other chlamydomonadalean organelle DNAs. Organelle genes were aligned with MUSCLE (Edgar 2004), implemented through Geneious, using default settings. Synonymous and nonsynonymous substitutions were measured with the CODEML program of PAML v4.3 (Yang 2007), employing the maximum likelihood method and the F3x4 codon model. Substitutions in nonprotein-coding regions were estimated with BASEML of PAML, using the HKY85 model. The mitochondrial genome data described here are deposited in GenBank under accession numbers KP691601 (D. salina CONC-001), KP691602 (D. viridis CONC-002), KP696389 (C. leiostraca), and KP696388 (P. uvella). The D. salina CONC-001 protein-coding ptDNA genes used to measure substitution rates are in supplementary table S2, Supplementary Material online.

Supplementary Material

Supplementary material is available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/).
  29 in total

1.  JDotter: a Java interface to multiple dotplots generated by dotter.

Authors:  Ryan Brodie; Rachel L Roper; Chris Upton
Journal:  Bioinformatics       Date:  2004-01-22       Impact factor: 6.937

2.  PAML 4: phylogenetic analysis by maximum likelihood.

Authors:  Ziheng Yang
Journal:  Mol Biol Evol       Date:  2007-05-04       Impact factor: 16.240

Review 3.  Organization and expression of organellar genomes.

Authors:  Adrian C Barbrook; Christopher J Howe; Davy P Kurniawan; Sarah J Tarr
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2010-03-12       Impact factor: 6.237

4.  Updating our view of organelle genome nucleotide landscape.

Authors:  David Roy Smith
Journal:  Front Genet       Date:  2012-09-11       Impact factor: 4.599

5.  The Dunaliella salina organelle genomes: large sequences, inflated with intronic and intergenic DNA.

Authors:  David Roy Smith; Robert W Lee; John C Cushman; Jon K Magnuson; Duc Tran; Jürgen E W Polle
Journal:  BMC Plant Biol       Date:  2010-05-07       Impact factor: 4.215

6.  Mitochondrial and plastid genomes of the colonial green alga Gonium pectorale give insights into the origins of organelle DNA architecture within the volvocales.

Authors:  Takashi Hamaji; David R Smith; Hideki Noguchi; Atsushi Toyoda; Masahiro Suzuki; Hiroko Kawai-Toyooka; Asao Fujiyama; Ichiro Nishii; Tara Marriage; Bradley J S C Olson; Hisayoshi Nozaki
Journal:  PLoS One       Date:  2013-02-26       Impact factor: 3.240

7.  The "fossilized" mitochondrial genome of Liriodendron tulipifera: ancestral gene content and order, ancestral editing sites, and extraordinarily low mutation rate.

Authors:  Aaron O Richardson; Danny W Rice; Gregory J Young; Andrew J Alverson; Jeffrey D Palmer
Journal:  BMC Biol       Date:  2013-04-15       Impact factor: 7.431

8.  Plant mitochondrial genome evolution can be explained by DNA repair mechanisms.

Authors:  Alan C Christensen
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

9.  The mitochondrial and plastid genomes of Volvox carteri: bloated molecules rich in repetitive DNA.

Authors:  David Roy Smith; Robert W Lee
Journal:  BMC Genomics       Date:  2009-03-26       Impact factor: 3.969

10.  Palindromic genes in the linear mitochondrial genome of the nonphotosynthetic green alga Polytomella magna.

Authors:  David Roy Smith; Jimeng Hua; John M Archibald; Robert W Lee
Journal:  Genome Biol Evol       Date:  2013       Impact factor: 3.416

View more
  14 in total

1.  Comparative analysis of Chlorosarcinopsis eremi mitochondrial genome with some Chlamydomonadales algae.

Authors:  Fatemeh Khani-Juyabad; Parisa Mohammadi; Mahbubeh Zarrabi
Journal:  Physiol Mol Biol Plants       Date:  2019-08-12

2.  The Plastid Genome of Polytoma uvella Is the Largest Known among Colorless Algae and Plants and Reflects Contrasting Evolutionary Paths to Nonphotosynthetic Lifestyles.

Authors:  Francisco Figueroa-Martinez; Aurora M Nedelcu; David R Smith; Adrian Reyes-Prieto
Journal:  Plant Physiol       Date:  2016-12-08       Impact factor: 8.340

3.  Mutation rates in plastid genomes: they are lower than you might think.

Authors:  David Roy Smith
Journal:  Genome Biol Evol       Date:  2015-04-13       Impact factor: 3.416

4.  Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae).

Authors:  Jean-Simon Brouard; Monique Turmel; Christian Otis; Claude Lemieux
Journal:  PeerJ       Date:  2016-10-25       Impact factor: 2.984

5.  Genetic Drift and Indel Mutation in the Evolution of Yeast Mitochondrial Genome Size.

Authors:  Shujie Xiao; Duong T Nguyen; Baojun Wu; Weilong Hao
Journal:  Genome Biol Evol       Date:  2017-11-01       Impact factor: 3.416

6.  The plastid genomes of nonphotosynthetic algae are not so small after all.

Authors:  Francisco Figueroa-Martinez; Aurora M Nedelcu; Adrian Reyes-Prieto; David R Smith
Journal:  Commun Integr Biol       Date:  2017-02-01

Review 7.  Haematococcus lacustris: the makings of a giant-sized chloroplast genome.

Authors:  David Roy Smith
Journal:  AoB Plants       Date:  2018-10-01       Impact factor: 3.276

8.  Auxenochlorella protothecoides and Prototheca wickerhamii plastid genome sequences give insight into the origins of non-photosynthetic algae.

Authors:  Dong Yan; Yun Wang; Tatsuya Murakami; Yue Shen; Jianhui Gong; Huifeng Jiang; David R Smith; Jean-Francois Pombert; Junbiao Dai; Qingyu Wu
Journal:  Sci Rep       Date:  2015-09-25       Impact factor: 4.379

9.  Complete Chloroplast and Mitochondrial Genome Sequences of the Hydrocarbon Oil-Producing Green Microalga Botryococcus braunii Race B (Showa).

Authors:  Olga Blifernez-Klassen; Daniel Wibberg; Anika Winkler; Jochen Blom; Alexander Goesmann; Jörn Kalinowski; Olaf Kruse
Journal:  Genome Announc       Date:  2016-06-09

10.  Exploring the Limits and Causes of Plastid Genome Expansion in Volvocine Green Algae.

Authors:  Hager Gaouda; Takashi Hamaji; Kayoko Yamamoto; Hiroko Kawai-Toyooka; Masahiro Suzuki; Hideki Noguchi; Yohei Minakuchi; Atsushi Toyoda; Asao Fujiyama; Hisayoshi Nozaki; David Roy Smith
Journal:  Genome Biol Evol       Date:  2018-09-01       Impact factor: 3.416

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.