Literature DB >> 30859201

Relative Mutation Rates in Nucleomorph-Bearing Algae.

Cameron J Grisdale1,2, David R Smith3, John M Archibald1,2.   

Abstract

Chlorarachniophyte and cryptophyte algae are unique among plastid-containing species in that they have a nucleomorph genome: a compact, highly reduced nuclear genome from a photosynthetic eukaryotic endosymbiont. Despite their independent origins, the nucleomorph genomes of these two lineages have similar genomic architectures, but little is known about the evolutionary pressures impacting nucleomorph DNA, particularly how their rates of evolution compare to those of the neighboring genetic compartments (the mitochondrion, plastid, and nucleus). Here, we use synonymous substitution rates to estimate relative mutation rates in the four genomes of nucleomorph-bearing algae. We show that the relative mutation rates of the host versus endosymbiont nuclear genomes are similar in both chlorarachniophytes and cryptophytes, despite the fact that nucleomorph gene sequences are notoriously highly divergent. There is some evidence, however, for slightly elevated mutation rates in the nucleomorph DNA of chlorarachniophytes-a feature not observed in that of cryptophytes. For both lineages, relative mutation rates in the plastid appear to be lower than those in the nucleus and nucleomorph (and, in one case, the mitochondrion), which is consistent with studies of other plastid-bearing protists. Given the divergent nature of nucleomorph genes, our finding of relatively low evolutionary rates in these genomes suggests that for both lineages a burst of evolutionary change and/or decreased selection pressures likely occurred early in the integration of the secondary endosymbiont.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  evolution; genetic diversity; mutation rate; nucleotide substitution; secondary endosymbiosis

Mesh:

Year:  2019        PMID: 30859201      PMCID: PMC6456004          DOI: 10.1093/gbe/evz056

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Endosymbiosis has defined eukaryotic evolution, generating both mitochondria and plastids, which arose by the primary endosymbiosis of an alpha-proteobacterium and a cyanobacterium, respectively (Gray 2012; Archibald 2015a). Primary plastids have subsequently spread to diverse lineages via eukaryote–eukaryote endosymbioses (Gould et al. 2008; Archibald 2009), giving rise to organelles with one or two additional membranes (Cavalier-Smith 2003). Consequently, nearly all eukaryotes contain multiple genetic compartments, each with its own distinct genome. Chlorarachniophyte and cryptophyte algae exemplify this point. These two lineages gained photosynthesis through the secondary endosymbiosis of a green alga and a red alga, respectively, and still retain the nuclear genomes of their eukaryotic endosymbionts (Douglas et al. 2001; Gilson et al. 2006; Archibald 2007). Despite their different evolutionary origins, the available nucleomorph genome sequences of chlorarachniophytes and cryptophytes have many similar features. They are made up of three linear chromosomes, are generally AT-rich, and encode ∼500 proteins, 31 or fewer of which function in the plastid (Douglas et al. 2001; Gilson et al. 2006; Archibald 2007). Although our understanding of nucleomorph genome architecture has improved in recent years (Tanifuji et al. 2011, 2014; Moore et al. 2012; Suzuki et al. 2015), little is known about the mutation rate of nucleomorph DNA (nmDNA), particularly how it compares to those of the mitochondrial, plastid, and host nuclear DNA (mtDNA, ptDNA, and nuDNA). Estimating mutation rate is not easy. But insights into this fundamental parameter can be gained by measuring the rate of synonymous nucleotide substitution (dS) between distinct species or strains (Kimura 1983). Under the assumption of neutrality, the relative values of dS for different genomes within a cell should reflect the relative mutation rates of those genomes (Kimura 1983). Indeed, relative substitution rate data have revealed that for most animals the mitochondrial mutation rate is much higher than that of the nucleus (Brown et al. 1979; Nabholz et al. 2008; James et al. 2016), whereas for land plants the opposite is true (Wolfe et al. 1987; Drouin et al. 2008; Richardson et al. 2013; Zhu et al. 2014). Substitution rate analyses of ptDNA have shown that land plant plastid genomes typically have higher and lower mutation rates, respectively, than their mitochondrial and nuclear counterparts (Wolfe et al. 1987; Drouin et al. 2008; Richardson et al. 2013; Zhu et al. 2014). But in plastid-bearing protists from diverse lineages, the ptDNA tends to have a lower dS than the mtDNA and nuDNA from the same cell, implying a low plastid mutation rate (Smith 2015). Together, these data have improved our understanding of organelle genetics (Sloan and Taylor 2012) and helped spur hypotheses on molecular evolution (Lynch et al. 2006). Detailed relative rate analyses of nmDNA have not yet been carried out, even though such data could provide valuable insights into eukaryotic mutation rates and genome evolution (Smith et al. 2012). Here, we do just that. Using transcriptome sequence data from various isolates of the chlorarachniophyte Bigelowiella natans and the cryptophyte Hemiselmis andersenii, we provide the first relative mutation rate estimates for nucleomorph, mitochondrial, plastid, and host nuclear genomes from a single cell. We find that within B. natans and H. andersenii, dS is similar for the nucleomorph and nuclear DNAs, suggesting that the host and endosymbiont nuclear genomes have similar mutation rates. For both lineages, dS in the plastid genome is low relative to the nuclear and nucleomorph genomes, implying that the ptDNA has a low relative mutation rate. The mitochondrial rates showed the largest variability: elevated in B. natans and very low in H. andersenii. Overall, these mutation rate ratios provide insights into the evolutionary forces at play in endosymbiont genomes of photosynthetic eukaryotes and how they compare to those of the host.

Materials and Methods

All sequence data were downloaded from the National Center for Genome Resources (NCGR) MMETSP project page (http://data.imicrobe.us/project/view/104). The following B. natans samples were used: MMETSP0045 (strain CCMP2755), MMETSP1052 (strain CCMP623), MMETSP1054 (strain CCMP1259), MMETSP1055 (strain CCMP1258), and MMETSP1358 (strain CCMP1242). The following H. andersenii samples were used: MMETSP0043 (strain CCMP644), MMETSP1041 (strain CCMP439), MMETSP1042 (strain CCMP1180), and MMETSP1043 (strain CCMP441). A reciprocal best BLAST hit (RBH) approach was used to identify putative orthologs between strains (Altschul et al. 1990; Tatusov et al. 1997; Bork et al. 1998; Camacho et al. 2009). An e-value cut-off of 1E-50 was used for nuclear genes. An additional step was required to identify nucleomorph genes prior to the RBH analysis. MMETSP sequences were used as query against either a curated set of known nucleomorph or nuclear genes in a competitive BLAST (e-value 1E-10), with the resulting files being processed via a custom Python script to compare alignment scores and assign sequences as nucleomorph or nuclear. The nucleomorph sequences were then used for RBH analysis with an e-value cut-off of 1E-20. As part of the pipeline to prepare files for input to the PAML package, custom Python scripts were written to 1) compile files of pairwise RBH for all strains into a single file, 2) retrieve relevant gene and protein sequences from assembly files from each strain into a single file per gene, 3) filter alignment files to exclude non block-selected residues, and 4) filter amino acid alignment files for improperly identified orthologs using a percent identity cut-off of 95% (Python Programming Language, www.python.org). MAFFT was used for multiple sequence alignment with maxiterate = 1,000 (Katoh et al. 2002). Clustal format alignment files were converted to FASTA format with seqret (Peter Rice; EMBOSS). The Gblocks software was used to identify aligned blocks of amino acids (options: -b4 = 4 -k = y) (Castresana 2000). Codon alignments were produced using pal2nal.pl (options: -output paml -blockonly -nogap -nomismatch) (Mikita et al. 2006). Since the MMETSP pipeline used poly-A selected mRNA libraries, very few plastid- and mitochondrial-derived sequences are present in the assembled data sets. While a small number of plastid RBH orthologs were identified, a different strategy was needed to find mitochondrial genes. MMETSP raw RNAseq reads were thus mapped to the B. natans and H. andersenii mitochondrial genomes using Bowtie2 (Langmead and Salzberg 2012). Regions of interest were identified visually using the Integrated Genomics Viewer (Robinson et al. 2011). A Python script was written to retrieve the nucleotide sequence of regions of interest and incorporate polymorphism information for each strain directly from a SAMtools pileup file (Li et al. 2009). A similar strategy was used to retrieve rRNA subunit sequences. Substitution rates were estimated using the CODEML and BASEML programs of PAML4.8 for the analysis of protein-coding sequences and noncoding sequences, respectively (Yang 2007). For CODEML the F3 x 4 codon model was used (Goldman and Yang 1994), while for BASEML the HKY85 nucleotide substitution model was used (Tamura and Nei 1993). A Python script was written to filter dS values of any gene with dS >1 under the assumption that the strains being compared are closely related, and therefore, an elevated dS value likely indicates an error in alignment or ortholog prediction. Following this filtering step, the script calculates the mean, median, and SD of all remaining dS values. Synonymous and nonsynonymous substitution values were averaged for all pairwise comparisons of each gene and those reported for each genome are the average across all genes. Statistical analyses were performed in R (R Core Team, 2015).

Results and Discussion

Similar Relative Mutation Rates in Nucleomorph and Host Nuclear Genomes

Phylogenetic analyses have consistently shown that chlorarachniophyte and cryptophyte nucleomorph gene sequences are highly divergent relative to their nuclear homologs in other species (Keeling et al. 1999; Archibald et al. 2001; Brinkmann et al. 2005; Marinov and Lynch 2016). These observations led to the assumption that nucleomorph genomes have high mutation rates, but concrete evidence supporting this prediction is lacking. Previous work comparing amino acid distances of plastid-targeted proteins encoded in the nucleus, nucleomorph, and plastid genomes of Guillardia theta and B. natans suggested that nucleomorph-encoded proteins are more divergent than nuclear- and plastid-encoded proteins (Patron et al. 2006). A similar trend was later seen when the amino acid sequences of 26 to 28 nucleus- and nucleomorph-encoded ribosomal proteins were compared, showing much higher substitution rates for the nucleomorph proteins (Hirakawa et al. 2014). Conversely, comparisons between two cryptophyte species (Rhodomonas salina and G. theta) and the red alga Cyanidioschyzon merolae provided evidence, albeit based on a small number of sequences, that cryptophyte genes in the nucleomorph are less divergent than those in the host nucleus (Patron et al. 2006). The same study also found nuclear and nucleomorph amino acid sequence distances between C. merolae and R. salina and G. theta to be larger than those between the two cryptophytes, implying that cryptophyte nucleomorph genomes are relatively stable. Thus, the current picture of evolutionary rates in nucleomorph genomes is complex and, at times, contradictory. To better understand nucleomorph mutational pressures, we employed large transcriptome (RNA-Seq) data sets (Keeling et al. 2014) to measure synonymous and nonsynonymous substitution rates (dS and dN) by obtaining the average of pairwise comparisons among five strains of the chlorarachniophyte B. natans and four strains of the cryptophyte H. andersenii (table 1). Our substitution rate data encompassed ∼6,000 and 10,000 nuclear genes and 100 and 200 nucleomorph genes in B. natans and H. andersenii, respectively. We found nuclear and nucleomorph synonymous substitution rates to be similar in magnitude in both lineages (fig. 1). In H. andersenii, the average dS of the nuclear genes was slightly higher than that of the nucleomorph genes (4.80 x 10−3 vs. 4.08 x 10−3), while in B. natans the nuclear rate was slightly lower (4.99 x 10−3 vs. 5.36 x 10−3) (table 2). In both species, the SDs were large (i.e., greater than the mean), which was because most analyzed genes had a dS of zero, positively skewing the distributions. The mean dS values for nuclear and nucleomorph genomes were not significantly different in B. natans or H. andersenii (t-test P value = 0.4684 and 0.05686, respectively). Analysis of concatenated sequences gave a near-identical pattern with the nuclear dS being higher than the nucleomorph rates for both B. natans and H. andersenii (table 3). These data suggest that the host and endosymbiont nuclear genomes have experienced similar mutation rates and provide further support for the hypothesis that nucleomorph genomes are evolutionarily stable and no longer under strong mutational pressure resulting from ongoing reductive evolution.
Table 1

Strain Numbers, Sampling Locations, and Transcriptome Sequence Data Sets for the Bigelowiella natans and Hemiselmis andersenii Strains Used in This Study

OrganismMMETSP Data SetCCMP StrainLocationLatitude:Longitude
B. natans00452755NW of Bermuda34.71 : −66.5
1052623Gulf of Maine43 : −69
10541259Gulf of Maine43 : −69
10551258Gulf of Maine43 : −69
13581242NW of Bermuda34.72 : −66.5
H. andersenii0043644Long Island, Bahamas23 : −75
1041439Panama city, Florida29.8 : −85.67
10421180Gulf of Mexico21 : −93
1043441Long Island, Bahamas23 : −75

MMETSP, Marine Microbial Eukaryote Transcriptome Sequencing Project; CCMP, Provasoli-Guillard Center for Culture of Marine Phytoplankton (now the National Center for Marine Algae and Microbiota [NCMA]).

. 1.

—Relative average synonymous (A) and nonsynonymous (B) substitution rates (dS and dN) between nuclear, nucleomorph, plastid, and mitochondrial genomes of the chlorarachniophyte Bigelowiella natans and the cryptophyte Hemiselmis andersenii. Values were calculated relative to the nuclear substitution rate. Substitution rates are based on averages of pairwise comparisons between five B. natans strains and four H. andersenii strains. Missing bars indicate a value of zero or data not available.

Table 2

Average Synonymous (dS) and Nonsynonymous (dN) Substitution Values between Five Strains of Bigelowiella natans and between Four Strains of Hemiselmis andersenii

dS Values (×10−3)nuDNAnmDNAptDNAmtDNARelative Rate Ratios (nu:nm:pt:mt)
B. natans4.985 (10.47)5.362 (17.13)2.623 (4.28)12.726 (18.06)1:1.08:0.53:2.55
H. andersenii4.824 (16.58)4.078 (13.24)1.735 (4.97)01:0.85:0.36:0

dN Values (×10−3)nuDNAnmDNAptDNAmtDNARelative Rate Ratios (nu:nm:pt:mt)

B. natans2.21 (5.46)1.7294 (6.42)0.9117 (3.41)0.7615 (1.96)1:0.78:0.41:0.35
H. andersenii1.981 (6.81)1.539 (4.77)0.781 (2.6)01:0.78:0.39:0

nu, nuclear; nm, nucleomorph; pt, plastid; mt, mitochondrial.

Note.—SDs are shown in brackets.

Table 3

Synonymous (dS) and Nonsynonymous (dN) Substitution Values of Concatenated Gene Sequences from the Nuclear (nu), Nucleomorph (nm), Plastid (pt), and Mitochondrial (mt) DNAs of Bigelowiella natans and Hemiselmis andersenii

dS Values (×10−3)nuDNAnmDNAptDNAmtDNARelative Rate Ratios (nu:nm:pt:mt)
B. natans5.23 (1.94)4.88 (1.85)2.86 (2.02)12.94 (10.08)1:0.93:0.55:2.47
H. andersenii5.117 (0.45)4.45 (1.24)3.483 (1.40)01:0.87:0.68:0

dN values (×10−3)nuDNAnmDNAptDNAmtDNARelative Rate Ratios (nu:nm:pt:mt)

B. natans2.26 (0.58)1.69 (0.52)1.19 (1.46)0.78 (0.64)1:0.75:0.53:0.35
H. andersenii2.1 (0.20)1.767 (0.56)1.75 (0.88)01:0.84:0.83:0

Note.—SDs are shown in brackets.

Strain Numbers, Sampling Locations, and Transcriptome Sequence Data Sets for the Bigelowiella natans and Hemiselmis andersenii Strains Used in This Study MMETSP, Marine Microbial Eukaryote Transcriptome Sequencing Project; CCMP, Provasoli-Guillard Center for Culture of Marine Phytoplankton (now the National Center for Marine Algae and Microbiota [NCMA]). Average Synonymous (dS) and Nonsynonymous (dN) Substitution Values between Five Strains of Bigelowiella natans and between Four Strains of Hemiselmis andersenii nu, nuclear; nm, nucleomorph; pt, plastid; mt, mitochondrial. Note.—SDs are shown in brackets. Synonymous (dS) and Nonsynonymous (dN) Substitution Values of Concatenated Gene Sequences from the Nuclear (nu), Nucleomorph (nm), Plastid (pt), and Mitochondrial (mt) DNAs of Bigelowiella natans and Hemiselmis andersenii Note.—SDs are shown in brackets. —Relative average synonymous (A) and nonsynonymous (B) substitution rates (dS and dN) between nuclear, nucleomorph, plastid, and mitochondrial genomes of the chlorarachniophyte Bigelowiella natans and the cryptophyte Hemiselmis andersenii. Values were calculated relative to the nuclear substitution rate. Substitution rates are based on averages of pairwise comparisons between five B. natans strains and four H. andersenii strains. Missing bars indicate a value of zero or data not available. We also investigated nuclear and nucleomorph substitution rates using a branch-based model, implemented through CODEML (Yang 2007). The observed Nu:Nm synonymous-site substitution rate ratios derived from a branch-based model were very similar to those obtained from the pairwise analyses: 1:1.18 for B. natans and 1:0.82 for H. andersenii. Pairwise comparisons of strains taken from the same or different sampling locations (table 1) showed different nucleus versus nucleomorph substitution rate patterns. When looking only at B. natans and H. andersenii strains from different locations, the nucleus-to-nucleomorph dS ratio was >1, whereas for strains from the same location the ratio was <1 (fig. 2). In other words, strains from the same location showed higher relative levels of nucleotide divergence in their nmDNA than those from different locations. However, since these substitution rate differences were not statistically significant (supplementary table 1, Supplementary Material online) and little is known about the population structures and recombination frequencies of these algae, we cannot draw meaningful conclusions from this comparison. Nevertheless, the nucleus-to-nucleomorph dS ratios for strains from the same or different locations mirrored the trends when all the strains were compared, which is consistent with the idea that nuclear and nucleomorph genomes are evolving at similar rates.
. 2.

—Average synonymous and nonsynonymous substitution rates (dS and dN) of nuclear, nucleomorph, plastid, and mitochondrial genomes for strains sampled at the same or different locations. Note the different y-axis ranges. (A) Pairwise comparison of Bigelowiella natans strains. (B) Pairwise comparison of Hemiselmis andersenii strains. Missing bars indicate a value of zero or data not available.

—Average synonymous and nonsynonymous substitution rates (dS and dN) of nuclear, nucleomorph, plastid, and mitochondrial genomes for strains sampled at the same or different locations. Note the different y-axis ranges. (A) Pairwise comparison of Bigelowiella natans strains. (B) Pairwise comparison of Hemiselmis andersenii strains. Missing bars indicate a value of zero or data not available. It needs to be emphasized that we cannot be certain that the strains used to measure dS in B. natans and H. andersenii are from distinct lineages or populations; indeed, many of the genes we analyzed from both B. natans and H. andersenii showed no substitutions across the entire aligned regions. If the B. natans and H. andersenii strains used in our relative rate analyses do come from the same population, we have measured and compared the relative levels of silent-site nucleotide diversity (πsilent) among the four genomes. reflects the effective population size and the mutation rate (Nμ). In this case, the effective population size of the various genetic compartments (also called the effective number of genes, N) should not be dramatically different. The nuclear genome of B. natans is haploid and that of H. andersenii is likely haploid as well (Hirakawa and Ishida 2014). The nucleomorph, mitochondrial, and plastid genomes in explored chlorarachniophytes and cryptophytes are polyploid and presumably uniparentally inherited (Hirakawa and Ishida 2014), but the latter two DNAs (and potentially the former) are effectively haploid (Palmer 1987). Thus, our comparative work should still provide key insights into the underlying mutation pressures impacting these genomes.

Are Chlorarachniophyte and Cryptophyte Nucleomorph Genomes Evolving at Different Rates?

The nucleomorphs of chlorarachniophytes and cryptophytes evolved from lineages of primary photosynthetic algae acquired in separate endosymbiotic events (Gould et al. 2008; Archibald 2015b). As noted earlier, comparisons of nucleomorph genomes within and between these two lineages have uncovered many remarkable examples of convergent evolution (Tanifuji et al. 2011, 2014; Moore et al. 2012; Suzuki et al. 2015). But there have also been differences in the way reductive evolution has shaped chlorarachniophyte and cryptophyte nucleomorph genomes, such as giving rise to contrasting gene and intron contents (Douglas et al. 2001; Gilson et al. 2006; Lane et al. 2007; Moore et al. 2012; Tanifuji et al. 2014). Given the similarities in nucleomorph genome architecture and levels of reductive evolution, one might expect them to display evidence of similar evolutionary pressures. Despite having a near-identical average nucleus-to-nucleomorph dS ratio, B. natans and H. andersenii differed in the distribution of dS values between the nucleus and nucleomorph (supplementary fig. 1, Supplementary Material online). A Kolmogorov–Smirnov (KS) test comparing the nucleomorph and nuclear dS values for each lineage showed a significant difference for B. natans (KS test P value = 1.734e-05) but not for H. andersenii (KS test P value = 0.8612). This implies that there are some underlying differences in the evolutionary rates of B. natans nuclear and nucleomorph genes. The raw substitution rate values are not directly comparable between B. natans and H. andersenii because the divergence times between the various strains are unknown, but it is noteworthy that dS of the nuclear genome was more similar between the two lineages (0.0049 vs. 0.0048) than that of the nucleomorph (0.0054 vs. 0.0041). Although speculative, the higher nucleomorph dS for B. natans relative to H. andersenii could be indicative of a slightly higher mutation rate in the former. These trends are consistent with previous work showing that for B. natans amino acid sequence divergence was higher for certain nucleomorph- versus nucleus-encoded proteins, but for the cryptophyte G. theta divergence levels were similar in both compartments (Patron et al. 2006). Analyses of B. natans ribosomal RNA (rRNA) sequences also suggest that its nucleomorph genome is evolving slightly faster than the nuclear genome: substitution rates of the concatenated rRNA genes from the nucleomorph were nearly double those of the nucleus (0.00044 vs. 0.00024) (supplementary table 2, Supplementary Material online). Conversely, not a single nucleotide substitution was found in the nuclear rRNA genes of H. andersenii, whereas the nucleomorph rRNA-coding regions had an overall substitution rate of 0.00035. Rates of nonsynonymous substitution (dN) across the nuclear and nucleomorph genomes of B. natans and H. andersenii did not follow the same pattern as the synonymous and rRNA substitution rate data. For both B. natans and H. andersenii, dN for the nucleomorph was lower than that of the nucleus, and this was true when looking at averages among genes and concatenated data sets (tables 2 and 3). This pattern could reflect high levels of purifying selection in nucleomorph genomes, potentially pointing to large evolutionary changes having taken place in the distant past. However, this idea is not necessarily supported by the comparison of the average dN and average dS because mean values can flatten the possible variation of the per-gene dN/dS ratio (see supplementary fig. S2, Supplementary Material online, for a cumulative distribution plot of dN).

Green- and Red-Algal Derived Secondary Plastid Genomes Have Low Relative Mutation Rates

Land plant plastid genomes tend to have higher synonymous substitution rates than their mitochondrial counterparts (Wolfe et al. 1987; Sloan et al. 2012), but in unicellular eukaryotes, the pattern seems to be reversed. Nearly all plastid-bearing protists examined thus far, including red algae, green algae, and species with secondary red-algal plastids, have a mitochondrion-to-plastid dS ratio ≥1, and in many cases dS in the ptDNA is lower than that of the nuDNA (Smith and Keeling 2012; Smith 2015). The reasons for these differences in relative mutation rate of mtDNA versus ptDNA are poorly understood, but are likely connected (at least in part) to differences in DNA maintenance and repair machinery (Moriyama et al. 2014; Zhang et al. 2016). No one, however, has explored relative-rates in the organelle genomes of species with secondarily derived green algal plastids, such as B. natans, making the data presented here more interesting. Indeed, consistent with previous studies of protists, we found low relative rates of synonymous substitution in the ptDNA of B. natans: dS of the plastid was, on average, about half that of the nucleus and nucleomorph and one-fifth that of the mitochondrion (fig. 1). An almost identical trend was observed for H. andersenii for which dS of the ptDNA was ∼0.5 times that of the nuclear and nucleomorph DNAs. Substitution rates in the rRNA-coding regions from the B. natans plastid were also low (between a half and one-twentieth of those of the nucleus, nucleomorph, mitochondrion), again, indicating that the plastid genome has a lower mutation rate than its neighboring genomes. Unfortunately, a similar comparison was not possible for H. andersenii as plastid rRNA data were not available. Low relative rates of nonsynonymous substitution were also observed in the plastid genomes of B. natans and H. andersenii (fig. 1), mirroring the dS data. Altogether, these results provide additional evidence that plastid genomes have low relative mutation rates across a wide diversity of photosynthetic organisms, now including species with green-algal derived secondary plastids, and reinforce the notion that land plants are outliers in terms of their relative plastid mutational rate patterns.

Extreme Differences in mtDNA Substitution Rates between Chlorarachniophytes and Cryptophytes

The nucleomorph and plastid genomes of chlorarachniophytes and cryptophytes are derived from green and red algae, respectively—organisms for which we have some expectations of relative substitution rates based on previous studies (Popescu and Lee 2007; Hua et al. 2012; Smith et al. 2012). However, the mitochondrial genomes of chlorarachniophytes and cryptophytes belong to the host lineages for which there is limited knowledge, as exemplified by the fact that the first nuclear genome sequences from these two lineages were described recently (Curtis et al. 2012). The mtDNAs of B. natans and H. andersenii have also been sequenced (Kim et al. 2008; Tanifuji et al. 2016), but until now little was known about substitution rates in these genomes. It has been proposed that mitochondrial genome architecture may be related to mutational patterns (Smith and Keeling 2015). Mitochondrial genomes are known to be highly variable in architecture, which is also true for plastid genomes, but to a lesser degree (Smith and Keeling 2015). Accordingly, mutation rate estimates for mitochondrial genomes are more variable and have a broader range than those of plastid genomes, which, as already mentioned, is thought to be related to differences in the underlying DNA maintenance and repair pathways of mitochondria relative to plastids (Smith and Keeling 2015). Our analyses of the H. andersenii mtDNA revealed not one nucleotide substitution among the four isolates, contrasting the high levels of substitution in the B. natans mitochondrial genome, and suggesting that these two lineages have contrasting mtDNA mutational patterns. This finding is made more remarkable by the fact that dS for the nucleus, nucleomorph, and plastid genomes were very similar between B. natans and H. andersenii (tables 2 and 3). The lack of observed mtDNA sequence divergence in H. andersenii can be interpreted in various ways. The mitochondrial mutation rate for H. andersenii might be genuinely lower than those of the nucleus, nucleomorph, and plastid. Alternatively, we might not have sampled enough mitochondrial genes to get an accurate picture of the synonymous substitution rate. And there is always the possibility that the synonymous sites we analyzed are not neutral, and are under selective constraints. Why this would be true for the mtDNA of H. andersenii and not the other genomes of this organism, or for the mtDNA of B. natans, is not obvious. Again, dS in the B. natans mtDNA was 2.5–5 times higher than the averages of the other three genomes in this organism (table 2). This is in line with relative-rate data from other secondary red-algal containing lineages (i.e., haptophytes and heterokonts), where mitochondrial substitution rates were predicted to be 5–30 times those of the plastid and thought to be connected to mtDNA genomic contraction (Smith and Keeling 2012). However, despite having the highest relative synonymous substitution rate, the B. natans mtDNA had the lowest relative dN among the four genomes, potentially reflecting strong purifying selection within the mitochondrion. There is reason to believe that the B. natans mitochondrial genome has undergone rapid evolution: it uses alternate initiation codons, and displays a low degree of gene synteny and gene content within mtDNAs from other chlorarachniophyte species (Tanifuji et al. 2016). Thus, it is possible that, like our prediction for the nucleomorph genomes, rapid evolution of the mtDNA occurred long ago when selection pressures were diminished, such as during the initial phase of secondary endosymbiont integration. Overall, our data point to a trend of a high mutation rate in the B. natans mitochondrial genome that is countered by strong purifying selection.

Conclusions

Genome-wide substitution rate data were used to estimate relative mutation rates in all four genomes within chlorarachniophyte and cryptophyte algae. On average, the relative synonymous substitution rates were similar between the nuclear and nucleomorph genomes in these two algal lineages. This suggests that in B. natans and H. andersenii the nucleomorph and nuclear genomes have similar mutation rates—a finding that appears to be at odds with the fact that nmDNAs display high levels of sequence divergence among lineages (Keeling et al. 1999; Archibald et al. 2001; Brinkmann et al. 2005; Marinov and Lynch 2016). Given these seemingly conflicting data, we argue that rapid evolutionary changes occurred in these endosymbiotically derived genomes in the distant evolutionary past but are no longer underway. There is some evidence that the B. natans nucleomorph genome mutation rate is slightly elevated relative to the nuclear rate and potentially evolving more rapidly than the nucleomorph genome of the cryptophyte H. andersenii, but the difference in rates is small, and in some instances not statistically significant. This is in line with previous work (Patron et al. 2006) suggesting that cryptophyte nucleomorph genomes are evolving more conservatively than those of chlorarachniophytes. We have provided the first evidence that secondary green algal plastids have low relative substitution rates, following the patterns seen in secondary red plastids and primary plastids outside the land plant lineage (Smith 2015). Relative mitochondrial substitution rates in B. natans were high, while those in H. andersenii were low. Our results run counter to the assumption that the highly divergent nucleomorph genomes of chlorarachniophytes and cryptophytes have high mutation rates; the data are instead more consistent with a burst of nucleomorph genome sequence divergence taking place as their respective secondary endosymbionts were being “converted” into organelles, followed by a reduction in evolutionary rates. It will be important to compare substitution rates in a much larger diversity of chlorarachniophytes and cryptophytes to see if these patterns hold in all members of these lineages. It should be stressed that synonymous sites can be under various selective constraints (Chamary et al. 2006; Meiklejohn et al. 2007), and factors such as compositional biases and codon usage can impact dS and the assumption of neutrality (Popescu and Lee 2007). In this context, it will be particularly interesting to have substitution rate data from the intergenic regions of the different genomes of these lineages for gauging the underlying mutational pressures. Finally, it remains to be determined whether the various B. natans and H. andersenii strains employed here represent members of distinct populations or individuals from the same population.

Supplementary Material

Supplementary data are available at Genome Biology and Evolution online. Click here for additional data file.
  57 in total

1.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

Authors:  J Castresana
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

2.  Molecular chaperones encoded by a reduced nucleus: the cryptomonad nucleomorph.

Authors:  J M Archibald; T Cavalier-Smith; U Maier; S Douglas
Journal:  J Mol Evol       Date:  2001-06       Impact factor: 2.395

Review 3.  Endosymbiosis and Eukaryotic Cell Evolution.

Authors:  John M Archibald
Journal:  Curr Biol       Date:  2015-10-05       Impact factor: 10.834

4.  An empirical assessment of long-branch attraction artefacts in deep eukaryotic phylogenomics.

Authors:  Henner Brinkmann; Mark van der Giezen; Yan Zhou; Gaëtan Poncelin de Raucourt; Hervé Philippe
Journal:  Syst Biol       Date:  2005-10       Impact factor: 15.683

Review 5.  Positive and negative selection on the mitochondrial genome.

Authors:  Colin D Meiklejohn; Kristi L Montooth; David M Rand
Journal:  Trends Genet       Date:  2007-04-05       Impact factor: 11.639

Review 6.  The puzzle of plastid evolution.

Authors:  John M Archibald
Journal:  Curr Biol       Date:  2009-01-27       Impact factor: 10.834

7.  A codon-based model of nucleotide substitution for protein-coding DNA sequences.

Authors:  N Goldman; Z Yang
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

8.  The secondary endosymbiont of the cryptomonad Guillardia theta contains alpha-, beta-, and gamma-tubulin genes.

Authors:  P J Keeling; J A Deane; C Hink-Schauer; S E Douglas; U G Maier; G I McFadden
Journal:  Mol Biol Evol       Date:  1999-09       Impact factor: 16.240

9.  Genomic reduction and evolution of novel genetic membranes and protein-targeting machinery in eukaryote-eukaryote chimaeras (meta-algae).

Authors:  T Cavalier-Smith
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2003-01-29       Impact factor: 6.237

10.  The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): illuminating the functional diversity of eukaryotic life in the oceans through transcriptome sequencing.

Authors:  Patrick J Keeling; Fabien Burki; Heather M Wilcox; Bassem Allam; Eric E Allen; Linda A Amaral-Zettler; E Virginia Armbrust; John M Archibald; Arvind K Bharti; Callum J Bell; Bank Beszteri; Kay D Bidle; Connor T Cameron; Lisa Campbell; David A Caron; Rose Ann Cattolico; Jackie L Collier; Kathryn Coyne; Simon K Davy; Phillipe Deschamps; Sonya T Dyhrman; Bente Edvardsen; Ruth D Gates; Christopher J Gobler; Spencer J Greenwood; Stephanie M Guida; Jennifer L Jacobi; Kjetill S Jakobsen; Erick R James; Bethany Jenkins; Uwe John; Matthew D Johnson; Andrew R Juhl; Anja Kamp; Laura A Katz; Ronald Kiene; Alexander Kudryavtsev; Brian S Leander; Senjie Lin; Connie Lovejoy; Denis Lynn; Adrian Marchetti; George McManus; Aurora M Nedelcu; Susanne Menden-Deuer; Cristina Miceli; Thomas Mock; Marina Montresor; Mary Ann Moran; Shauna Murray; Govind Nadathur; Satoshi Nagai; Peter B Ngam; Brian Palenik; Jan Pawlowski; Giulio Petroni; Gwenael Piganeau; Matthew C Posewitz; Karin Rengefors; Giovanna Romano; Mary E Rumpho; Tatiana Rynearson; Kelly B Schilling; Declan C Schroeder; Alastair G B Simpson; Claudio H Slamovits; David R Smith; G Jason Smith; Sarah R Smith; Heidi M Sosik; Peter Stief; Edward Theriot; Scott N Twary; Pooja E Umale; Daniel Vaulot; Boris Wawrik; Glen L Wheeler; William H Wilson; Yan Xu; Adriana Zingone; Alexandra Z Worden
Journal:  PLoS Biol       Date:  2014-06-24       Impact factor: 8.029

View more
  4 in total

1.  A molecular timescale for eukaryote evolution with implications for the origin of red algal-derived plastids.

Authors:  Jürgen F H Strassert; Iker Irisarri; Tom A Williams; Fabien Burki
Journal:  Nat Commun       Date:  2021-03-25       Impact factor: 14.919

2.  Comparative Plastid Genomics of Cryptomonas Species Reveals Fine-Scale Genomic Responses to Loss of Photosynthesis.

Authors:  Goro Tanifuji; Ryoma Kamikawa; Christa E Moore; Tyler Mills; Naoko T Onodera; Yuichiro Kashiyama; John M Archibald; Yuji Inagaki; Tetsuo Hashimoto
Journal:  Genome Biol Evol       Date:  2020-02-01       Impact factor: 3.416

3.  Tightly Constrained Genome Reduction and Relaxation of Purifying Selection during Secondary Plastid Endosymbiosis.

Authors:  Kavitha Uthanumallian; Cintia Iha; Sonja I Repetti; Cheong Xin Chan; Debashish Bhattacharya; Sebastian Duchene; Heroen Verbruggen
Journal:  Mol Biol Evol       Date:  2022-01-07       Impact factor: 16.240

4.  The economics of organellar gene loss and endosymbiotic gene transfer.

Authors:  Steven Kelly
Journal:  Genome Biol       Date:  2021-12-20       Impact factor: 13.583

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.