Literature DB >> 32645145

Evolution of Mutation Rate in Astronomically Large Phytoplankton Populations.

Marc Krasovec1, Rosalind E M Rickaby2, Dmitry A Filatov1.   

Abstract

Genetic diversity is expected to be proportional to population size, yet, there is a well-known, but unexplained lack of genetic diversity in large populations-the "Lewontin's paradox." Larger populations are expected to evolve lower mutation rates, which may help to explain this paradox. Here, we test this conjecture by measuring the spontaneous mutation rate in a ubiquitous unicellular marine phytoplankton species Emiliania huxleyi (Haptophyta) that has modest genetic diversity despite an astronomically large population size. Genome sequencing of E. huxleyi mutation accumulation lines revealed 455 mutations, with an unusual GC-biased mutation spectrum. This yielded an estimate of the per site mutation rate µ = 5.55×10-10 (CI 95%: 5.05×10-10 - 6.09×10-10), which corresponds to an effective population size Ne ∼ 2.7×106. Such a modest Ne is surprising for a ubiquitous and abundant species that accounts for up to 10% of global primary productivity in the oceans. Our results indicate that even exceptionally large populations do not evolve mutation rates lower than ∼10-10 per nucleotide per cell division. Consequently, the extreme disparity between modest genetic diversity and astronomically large population size in the plankton species cannot be explained by an unusually low mutation rate.
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  zzm321990 Emiliania huxleyizzm321990 ; Lewontin’s paradox; codon bias; effective population size; mutation accumulation; mutation rate; phytoplankton evolution

Mesh:

Year:  2020        PMID: 32645145      PMCID: PMC7486954          DOI: 10.1093/gbe/evaa131

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Significance

Surprisingly little is known about evolutionary genetic processes in astronomically large populations of marine phytoplankton. This study reports the first estimate of a key evolutionary parameter—spontaneous mutation rate—for a major domain of life, the Haptophytes. It also helps to shed light on the Lewontin’s paradox—disproportionately modest genetic diversity in large populations—the disparity that is particularly pronounced in large plankton populations.

Introduction

The level of genetic diversity in a population is determined by the balance between the new mutations occurring in the population and the loss of polymorphisms by stochastic processes (drift) and selection (Leffler et al. 2012; Ellegren and Galtier 2016). More mutations are expected to occur in a larger population because there are more individuals to mutate. In addition, drift is weaker in larger populations (Crow and Kimura 1970), thus larger populations are expected to contain more genetic diversity. On the other hand, selection is expected to be more powerful in populations of larger size, potentially allowing selection to reduce mutation rate to lower values in larger populations (Lynch 2010; Sung, Ackerman, et al. 2012; Lynch et al. 2016), which may explain the well-known phenomenon of relatively low genetic diversity in large populations (Lewontin 1974; Leffler et al. 2012; Corbett-Detig et al. 2015; Ellegren and Galtier 2016; Filatov 2019; Xu et al. 2019). Here, we test this idea by measuring the spontaneous mutation rate in a unicellular eukaryotic marine coccolithophore Emiliania huxleyi (Haptophyta) that has an astronomically large population size and thus would be expected to evolve to a very low mutation rate if this rate is determined by efficacy of selection. Emiliania huxleyi is ubiquitous and so abundant in modern oceans that its seasonal blooms are visible from space. Like most other coccolithophores, it produces coccoliths (calcite shields with species-specific shapes) at the cell surface. Due to its abundance, E. huxleyi is thought to be the main calcite producer on Earth (Paasche 2001; Daniels et al. 2014, 2016; Rembauville et al. 2016) affecting the global CO2 budget and carbon cycle. Due to its ecological importance and ease of culturing, E. huxleyi became a model phytoplankton species with a significant body of on-going work devoted to the interplay between coccolithophore abundance, climate change and the global carbon cycle (Rickaby et al. 2007; Iglesias-Rodriguez et al. 2008). However, surprisingly little is known about evolutionary genetic processes in populations of marine phytoplankton generally (Rengefors et al. 2017) and E. huxleyi populations in particular (Bendif et al. 2019; Filatov 2019). Based on the large population sizes of marine plankton species, their genetic diversity is often assumed to be very high (Read et al. 2013). Yet, recent studies of genetic diversity in a number of marine phytoplankton species revealed a surprisingly low level of single nucleotide polymorphism (Blanc-Mathieu et al. 2017; Filatov 2019; Rastogi et al. 2020). In particular, genetic diversity in a world-wide sample of E. huxleyi is only π∼0.006 per silent site across the genome (Filatov 2019)—similar to the level of polymorphism in Arabidopsis thaliana (1001 Genomes Consortium 2016), twice lower than in marine unicellular green algae Ostreococcus tauri (Blanc-Mathieu et al. 2017) and at least three times lower than in Drosophila melanogaster (Langley et al. 2012). Patterns of genetic diversity (e.g., very low linkage disequilibrium) in the E. huxleyi genome rule out any trivial explanations for this low diversity, such as high clonality or recent expansion from a very small population (Filatov 2019). However, an unusually low mutation rate may account, at least partly, for this low genetic diversity (Xu et al. 2019). For example, the mutation rate in another group, ciliates, was reported to be two to three orders of magnitude lower than in other studied eukaryotes (Sung, Tucker, et al. 2012; Long et al. 2018). No estimates of mutation rates are available for any Haptophyte species, making it difficult to assess E. huxleyi mutation rate even to an order of magnitude. To address this, we measured the spontaneous mutation rate in the diploid E. huxleyi strain RCC1242 (=CCMP1516) for which a ∼167-Mb long-genome sequence was published previously (Read et al. 2013).

Materials and Methods

Mutation Accumulation Experiment

We performed a mutation accumulation (MA) experiment with the diploid E. huxleyi strain RCC1242 (=CCMP1516). We followed the protocol previously developed by Krasovec (Krasovec et al. 2016) for MA experiments in a liquid medium. The MA experiment included 15 MA lines and lasted 8 months. The size of the MA experiment was planned, assuming the mutation rate is of the order of 10−10 per nucleotide per cell division, as found in a few other phytoplankton species (Ness et al. 2012; Krasovec et al. 2017, 2019), but with the possibility to expand the size of the experiment should the E. huxleyi mutation rate prove to be much lower. The initial line was obtained from a single cell by dilution and used to inoculate 15 MA lines kept in 24-well plates at 20 °C in F2 medium. Serial bottlenecks every 14 days were used to reduce the efficiency of selection by decreasing the population size of the MA lines. At each bottleneck, the cell culture was counted with a Beckman Multisizer Coulter Counter to calculate the number of cell divisions in the time interval and inoculate the MA lines in fresh media with one cell by dilution following a previously developed protocol (Krasovec et al. 2016, 2017; Krasovec, Sanchez-Brosseau, et al. 2018). The average cell division per day of the MA lines between each bottleneck could be used as a proxy of the fitness throughout the experiment. We used a linear correlation to test a change in fitness over the time of the experiment with R v3.5.1.

Genome Resequencing and Identification of De Novo Mutations

DNA of the 15 MA lines and the initial culture were extracted with the DNeasy Plant Mini Kit of QIAGEN following the standard instructions. Genomic libraries were prepared and sequenced at the Wellcome Trust Centre for Human Genetics (WTCHG) at the University of Oxford, UK. Genomic DNA was quantified using Qubit (Invitrogen) and the size profile analyzed using eGel (Thermo Fisher, 1% EX Agoarose). Input material was normalized to 300 ng prior to fragmentation and library preparation. Fragmentation was performed by mechanical shearing to an average size of 300 bp (Covaris S2 series; duty Cycle—10%, intensity—5.00, cycles/Bursts—200, time—60 s). Automated library preparation was performed using the Apollo 324 prep system (Wafergen, PrepX ILMN 32i, 96 sample kit) and standard Illumina multiplexing adapters following manufacturer’s protocol up to pre-PCR amplification. Libraries were PCR amplified (10 cycles) on a Tetrad (Bio-Rad) using the NEBNext High-Fidelity 2× PCR Master Mix (NEB) and in-house unique dual indexing primers (based on Lamble et al. [2013]). Post-PCR purification performed using Agencourt Ampure XP (Beckman Coulter; ratio 1:1) before combining. Individual libraries were normalized using Qubit, and the size profile was analyzed on the 2200 or 4200 TapeStation (Agilent). Individual libraries were normalized and pooled together accordingly. The pooled library was diluted to ∼10 nM for storage. The 10-nM library was denatured and further diluted prior to loading on the sequencer. Paired-end sequencing was performed using a HiSeq4000 150-bp platform (Illumina, HiSeq 3000/4000 PE Cluster Kit, and 300 cycle SBS Kit), generating a raw read count of >34 million reads per sample (table 1).
Table 1

Nuclear De Novo Mutations Identified in the 15 Emiliania huxleyi MA Lines

LinesNCBI Sample IDGbCov G*Callable SitesGen N bs
Eh_mut_ASAMN139325766.806570.9118,881,54321034
Eh_mut_ESAMN139325776.085871.0119,008,22620922
Eh_mut_GSAMN139325786.794871.7120,234,40420940
Eh_mut_HSAMN139325797.357070.0117,447,18022144
Eh_mut_JSAMN139325838.193270.2117,688,67026740
Eh_mut_MSAMN139325847.004071.7120,186,10828519
Eh_mut_NSAMN139325856.403370.2117,776,56427330
Eh_mut_OSAMN139325867.323169.8116,961,62629126
Eh_mut_RSAMN139325876.685171.6120,132,42720221
Eh_mut_SSAMN139325887.515969.5116,500,95521452
Eh_mut_TSAMN139325897.615471.1119,224,14820947
Eh_mut_USAMN139325907.775969.9117,252,42120020
Eh_mut_XSAMN139325806.762962.3104,532,86323223
Eh_mut_YSAMN139325816.404771.7120,278,08324025
Eh_mut_ZSAMN139325827.746171.3119,602,27521812
Eh_T0SAMN139325916.873273.1122,565,059

Note.—Sequence data are available from NCBI (bioproject PRJNA532543). Gb, the amount of sequence data generated (Gigabase); Cov, the average sequence coverage; G*, the % of callable genome; Gen, the number of MA generations; Nbs, the number of de novo nucleotide substitutions per line.

Nuclear De Novo Mutations Identified in the 15 Emiliania huxleyi MA Lines Note.—Sequence data are available from NCBI (bioproject PRJNA532543). Gb, the amount of sequence data generated (Gigabase); Cov, the average sequence coverage; G*, the % of callable genome; Gen, the number of MA generations; Nbs, the number of de novo nucleotide substitutions per line. PCR duplicates were removed with GATK v3.4-46 (McKenna et al. 2010) and reads were mapped against the reference genome of the RCC1242 strain (NCBI accession: GCA_000372725.1) (Read et al. 2013) with BWA mem v.0.7.12 (Li and Durbin 2009). For organelles, we used the reference mitochondrion NC_005332.1 (Sánchez Puerta et al. 2004) and the reference chloroplast NC_007288.1 (Sánchez Puerta et al. 2005) genomes. To calculate the mutation rate per haploid organelle genome, the average ploidy levels of the organelle genomes per cell (estimated from read coverage for organellar relative to nuclear genomes) were taken into account. The BAM files were sorted with Samtools v.1.2 (Li et al. 2009) and variants called with HaplotypeCaller from GATK v3.4-46 (McKenna et al. 2010) following the best practice recommendations (RealignerTargetCreator and IndelRealigner). To avoid false positives during de novo mutation identification, we apply several criteria used in previous studies with PCR confirmation of mutation candidates (Keightley et al. 2015; Krasovec, Chester, et al. 2018). Criteria were: 1) callable sites were defined with a threshold of 20 mapping quality and 2) a minimal coverage of 20 in the MA line and ancestral genomes; 3) sites covered by >150× were removed to exclude repetitive regions; 4) the alternative allele was supported with a minimal coverage of 1/3rd of the total coverage; 5) candidates within an insertion–deletion were removed with Bcftools v1.2 (options SnpGap = 5); and 6) all de novo mutations were manually checked in the mpileup files generated by Samtools v.1.2 for all MA lines and the ancestral genome. To test the reliability of in silico identification of de novo mutations, we randomly selected de novo mutation candidates for manual verification based on PCR and Sanger sequencing. All manually checked mutations were confirmed to be true positives (supplementary table S3, Supplementary Material online). The effect of de novo mutations was determined with snpEff v4.3 (Cingolani et al. 2012). To detect any bias in the mutation distribution, we compared the mutation distribution using χ2 and binomial tests against the null hypothesis assuming that mutations appear independently and randomly in the genome.

Mutation Bias and GC Content Evolution

To detect a potential mutation bias in the mutation spectrum, we calculated R1 and R2 mutation rates for GC to AT and AT to GC nucleotide mutations, respectively. With the numbers of mutations from GC to AT and AT to GC and the number of sites in the genome GCn and ATn, R1 = (GC to AT) / GCn and R2 = (AT to GC) / ATn. Equilibrium GC content (GCeq), was calculated as GCeq=R2 / (R1+R2). The equilibrium GC content is the GC content reached by the mutation process alone, that is, the GC content, where the numbers of mutations from GC to AT and AT to GC are equal.

Codon Bias

Codon bias in E. huxleyi was measured with the effective number of codons (ENCs; Wright 1990) as implemented in software CodonW (http://codonw.sourceforge.net). First, we ran a correspondence analysis to generate the hilo.coa files containing the preferred and unpreferred codons (based on a two-way χ2 contingency test) and the fop.coa files. Then, we calculated the frequency of optimal codons and the ENCs gene by gene using the files generated by the previous correspondence analysis. Last, we calculated the strength of selected codon usage bias (S) from the codon bias in the highly expressed E. huxleyi genes using the method of Sharp et al. (2005), but using the mutation spectrum observed in our MA experiment, as we did previously in the analysis of codon bias in Phaeodactylum tricornutum (Krasovec and Filatov 2019). S was calculated for the entire genome, as well as for 500 most actively expressed genes. Expression data for this analysis were obtained from a previous study (Huff and Zilberman 2014) (raw data SRR847300 from the bioproject PRJNA201680). Raw reads were aligned against the transcriptome with RSEM v.1.2.31 (Li and Dewey 2011) to obtain the fpkm values for each gene. S was calculated only for amino acids encoded by two codons where one is the preferred one (Phe, Tyr, His, Gln, Asn, Lys, Asp, and Glu). Furthermore, the fpkm values were used to test the effect of expression level on the mutation rate by comparing expression at the genes with and without a mutated site.

Results

Mutation Accumulation Experiment and E. huxleyi Mutation Rate

To measure the spontaneous mutation rate in E. huxleyi, we conducted a MA experiment (Halligan and Keightley 2009) that included 15 MA lines grown under standard lab conditions for 232 generations on an average, totaling 3,480 generations across all MA lines. To exclude selection and allow all mutations, including deleterious ones, to be fixed, the effective population size of MA lines was reduced by serial bottlenecking—reduction of the population to one cell every 2 weeks. From the cell counts at each bottleneck time, we estimated 1.17 generations per day on an average (supplementary table S1, Supplementary Material online) and an average effective population size of Ne = 7.6 (estimated from the harmonic mean of cell number) throughout the experiment. The generation time average did not change over the time of the experiment (Spearman correlation test, P value = 0.8355). In order to identify the mutations accumulated during the MA experiment, we used Illumina high-throughput sequencing to sequence the genomes of the MA lines at the beginning and the end of MA experiment (table 1). The analysis of nuclear genome sequence data from MA lines identified 455 de novo single nucleotide mutations (tables 1 and 2 and supplementary table S2, Supplementary Material online). All mutations that we verified manually with PCR and Sanger sequencing, were confirmed to be true positives (supplementary table S3, Supplementary Material online). With 117,713,833 callable sites per MA line on an average (table 1), the spontaneous mutation rate in the nuclear genome is µ = 5.55×10−10 (Poisson CI 95%: 5.05×10−10 – 6.09×10−10) per nucleotide per cell division. The mutation rate variation between the lines was significant (fig. 1, Pearson’s χ2 test, P value = 0.0006), such as observed, for example, in Chlamydomonas reinhardtii and Caenorhabditis elegans (Ness et al. 2015; Konrad et al. 2018). The per haploid genome per cell division mutation rate (U = genome size in bp×µ) in E. huxleyi is U = 167×106×5.55×10−10 = 0.092, whereas the mutation rate per coding sequence (39,635,709 nt of annotated CDS) (Read et al. 2013) is Ucds = 0.022. Based on µ and synonymous intraspecific polymorphism from 17 E. huxleyi strains (Filatov 2019) (πs ∼0.006), the estimate of effective population size in E. huxleyi is Ne = πs / (4×µ) ∼2.7 million. In addition, we identified seven and two de novo mutations in the mitochondrial and the chloroplast genomes, respectively (supplementary table S2, Supplementary Material online). The resulting estimates of per site per cell division mutation rates are µmt = 1.45×10−9 (Poisson CI 95%: 5.82×10−10 – 2.98× 10−9) and µcl = 1.76×10−10 (Poisson CI 95%: 2.13×10−11 – 6.43×10−10) for mitochondria and chloroplasts, respectively.
Table 2

Nuclear De Novo Mutations (Nmut) Affecting Different Types of Functional Regions

Mutation Effect Nmut
UTR15
Intergenic329
Intron7
Missense_variant67
Splice_region3
Start_lost1
Stop_gained1
Synonymous_variant32

Note.—The average number of callable sites per MA line was 30,362,830 for coding sequences and 87,351,003 for noncoding sequences.

Fig. 1.

Nuclear mutation rate per MA line. Average number of mutations per MA line is ∼30.33 (SD = 11.84). The number of mutations per MA line differs from the theoretical distribution assuming equal mutation rate across the lines (Pearson’s χ2 test, χ2 = 37.733, P value = 0.0006).

Nuclear mutation rate per MA line. Average number of mutations per MA line is ∼30.33 (SD = 11.84). The number of mutations per MA line differs from the theoretical distribution assuming equal mutation rate across the lines (Pearson’s χ2 test, χ2 = 37.733, P value = 0.0006). Nuclear De Novo Mutations (Nmut) Affecting Different Types of Functional Regions Note.—The average number of callable sites per MA line was 30,362,830 for coding sequences and 87,351,003 for noncoding sequences.

Distribution of De Novo Mutations in the Genome

We detected de novo mutations in all three genomic compartments—nuclear, mitochondrial, and chloroplast DNA, with a significantly higher mutation rate in mitochondria (µmt = 1.45×10−9) compared with the nuclear genome (2×2 contingency χ2 = 5.44, P value = 0.0196). Although the chloroplast mutation rate (µcl = 1.76×10−10) appears to be lower than the nuclear rate (µ = 5.55×10−10), this difference is not significant (2×2 contingency χ2 = 2.28, P value = 0.1309). Too few chloroplast and mitochondrial mutations were detected to analyze the distributions within these genomes. Most of the nuclear de novo mutations occurred in noncoding regions (354), whereas 32 and 69 mutations occurred at synonymous and nonsynonymous positions, respectively, in annotated coding regions. These numbers do not differ significantly from the expectations based on the proportions of callable noncoding, synonymous, and nonsynonymous positions in the E. huxleyi genome (Binomial test, ns). Furthermore, we did not detect any significant correlations of local mutation rate with genomic features. Finally, gene expression was not significantly different between genes with and without a mutated site (fpkm average of genes with a mutated site = 34.69, fpkm average of genes without a mutated site = 21.62, Student’s test, P value = 0.6628). DNA methylation (and associated deamination of methyl-cytosines) is a major contributor of mutations in the eukaryotic genomes (Holliday and Grigg 1993). If methylation contributes to the rate and pattern of mutations in E. huxleyi, we would expect to detect a lack of CpG dinucleotides in the genome (e.g., as is the case in mammals) and an excess of de novo C to T transitions at such dinucleotides. CpG represents 10.8% of all dinucleotides in the genome of E. huxleyi, which does not deviate from what is expected given the genomic GC-content (χ2 test, ns). Furthermore, the total number of de novo mutations from CpG to CpH is 10.3%, that is, there is no excess of mutations at CpG sites (χ2 test, ns). These results indicate that the effect of CpG methylation on mutation rate is low or absent in E. huxleyi. The analysis of mutational patterns (fig. 2) revealed significantly more AT to GC mutations compared with GC to AT mutations (207 vs. 151, respectively, binomial test, P value = 0.0036 with probability 50:50). This indicates that the E. huxleyi mutation spectrum has a significant GC-bias, which makes E. huxleyi the first eukaryotic species with a GC-biased mutation spectrum detected in a direct MA experiment, though, indirect inference of the mutation spectrum from sequence polymorphism and divergence indicated that the genome of plant Coffea canephora may also have a slightly GC-biased mutation spectrum (Clement et al. 2017). With an average of µGC->AT=0.381×µAT->GC, the equilibrium GC-content of the E. huxleyi genome is GCeq = 72.42%. The actual GC-content is slightly lower than the GCeq for the total genome (65.7%) and the noncoding regions (63.0%). In the coding sequence, the actual GC is 69.1%, close to GCeq due to the very high GC-content at third-codon positions (GC3s = 84.2%).
Fig. 2.

Mutation spectrum of Emiliania huxleyi. Transitions occur more frequently than transversions, with observed transition/transversion ratio k = 1.11.

Mutation spectrum of Emiliania huxleyi. Transitions occur more frequently than transversions, with observed transition/transversion ratio k = 1.11.

Codon Bias and Long-Term Effective Population Size

High GC-content at third-codon positions is due to strong codon usage bias in E. huxleyi. The preferred codons (listed in supplementary table S4, Supplementary Material online) almost always end with G or C and a widely used measure of codon bias – ENC (Wright 1990), averaged across all genes was ENC = 38.17, whereas for 500 strongest and weakest expressed genes ENC is 35.52 and 39.03, respectively. Using the method of Sharp et al. (2005) we estimated the strength of selected codon usage bias (S) for amino acids encoded by one preferred and one unpreferred codons (Phe, Tyr, His, Gln, Asn, Lys, Asp, and Glu; see supplementary table S4, Supplementary Material online). For 500 most actively expressed genes, where selection for codon usage is expected to be the strongest, the frequency of optimal codons was 0.8859, corresponding to S = 1.084, which is similar to the strength of selected codon bias in other organisms with large populations, such as Drosophila, nematodes, and bacteria (Sharp et al. 2010). This allows us to estimate long-term effective population size, given that evolution of codon bias is a slow process (Lawrence and Ochman 1997) that “averages” over short-term changes in population size and efficacy of selection (Sharp et al. 2010). Assuming selective advantage (s) of the preferred compared with unpreferred codons to be of the order of 10−6–10−7, as suggested by analyses of codon bias in Drosophila (Akashi 1997; Powell and Moriyama 1997) and bacteria (Sharp et al. 2010), the long-term effective population size in E. huxleyi and its ancestral species is of the order Ne = S / 4s ∼ 1.084 / 4×10−6 ∼271,000 to Ne ∼ 1.084 / 4×10−7 ∼2,710,000. The latter estimate is almost identical to our genetic diversity-based estimate Ne = πs / (4×µ) ∼2.7 million. Given that this estimate is independent of the mutation rate and genetic diversity values, the correspondence between the genetic diversity-based and codon bias-based estimates of Ne is reassuring.

Discussion

Emiliania huxleyi Mutation Rate

Here, we reported the first estimate of spontaneous mutation rate for a Haptophyte species. Emiliania huxleyi per-nucleotide per cell division mutation rate (µ = 5.55×10−10) is close to estimates for other eukaryotic plankton, such as the diatom P. tricornutum (Krasovec et al. 2019) (µ = 4.77×10−10) or unicellular green algal species O. tauri (Krasovec et al. 2017) (µ = 4.79×10−10) and Chlamydomonas reinhardtii (Ness et al. 2012) (µ = 3.23×10−10). This similarity of mutation rates across Haptophytes, Stramenopiles, and Chlorophyta suggests that the mutation rate of the order µ ∼ 5×10−10 is typical for unicellular eukaryotes regardless of their phylogenetic affinities. Ciliates represent a notable exception to this, with Paramecium tetraurelia having an order of magnitude lower mutation rate (Sung, Tucker, et al. 2012), possibly due to their peculiar life cycle and the presence of two genomes in macro- and micronuclei (Long et al. 2018). Our estimates of mutation rates in the nuclear, mitochondrial, and chloroplast genomes of E. huxleyi reveal that, similar to animals (Denver et al. 2000; Xu et al. 2012; Konrad et al. 2017) and diatoms (Krasovec et al. 2019), but contrary to plants (Drouin et al. 2008; Ossowski et al. 2010), haptophytes have a higher mitochondrial than nuclear mutation rate. The chloroplast mutation rate in the diatoms (Krasovec et al. 2019) and green plants (Smith 2015; Smith and Keeling 2015) is lower than that of the nuclear genome. The estimates of nuclear (µ = 5.55×10−10) and chloroplast (µcl = 1.76×10−10) mutation rates in E. huxleyi show a difference in the same direction, although the difference between the two rates is not significant.

Is the Lab-Based Mutation Rate Estimate Representative of That in the Open Ocean?

The spontaneous mutation rate may be affected by environmental conditions (Jiang et al. 2014; Liu and Zhang 2019). It is not possible to accurately reconstruct all the diversity of environmental conditions for a species that is ubiquitous in the world oceans and inhabits environments ranging from the tropics to the Arctic. However, it is possible to use the rich fossil record for this species (Raffi et al. 2006) to calibrate the rate of the molecular clock and compare it with the mutation rate found in the lab. Based on the fossil record, E. huxleyi evolved from the genus Gephyrocapsa ∼290 ka (Raffi et al. 2006). This is consistent with the conclusions of an integrated analysis of fossil and genome sequence data from E. huxleyi and four Gephyrocapsa species (Bendif et al. 2019), which demonstrated that 290 kyr corresponds to the divergence between E. huxleyi and Gephyrocapsa muellerae. Given sequence divergence ds ∼ 3% (Bendif et al. 2019) and the time of divergence (T∼290 kyr) between these species, the mutation rate in the open ocean can be estimated as µyear = ds / 2T ≈ 5.18×10−8 per year. Given the maximal rate of cell division achieved under optimal lab conditions in our experiment (∼1.17 per day), E. huxleyi in the wild should have <300 generations per year, providing the minimal estimate µgen = 5.18×10−8/ 300 = 1.73×10−10 for the per generation mutation rate in the wild. Using our MA-based estimate of µgen, we can infer the number of generations per year in the wild to be ∼93 (= 3.45×10−8 / 5.55×10−10) or one generation in ∼4 days. This estimate appears realistic because environmental conditions in nature are not always optimal for growth and E. huxleyi blooms require specific conditions (Tyrrell and Merico 2004). This suggests that the lab-based estimate of mutation rate (µgen = 5.55×10−10) accurately reflects E. huxleyi mutation rate in the wild.

Effective Population Size and Evolution of Mutation Rates

Mutation rates vary widely among organisms and their evolution is thought to be driven by natural selection (Lynch 2010; Sung, Ackerman, et al. 2012), which is reflected by the negative correlation between per-nucleotide mutation rates and effective population size (fig. 3). As most mutations are deleterious (Muller 1950), in most circumstances selection is expected to favor a reduction in the overall mutation rate (Lynch 2010). However, the strength of such selection is thought to be relatively weak (Drake et al. 1998) and unable to overcome genetic drift in small populations. Such weak selection to reduce mutation rate can be effective only in populations of large size, where drift is weak (Sung, Ackerman, et al. 2012). Here, we used a unicellular eukaryotic phytoplankton species with astronomically large population size to test whether selection in such a large population is able to push its mutation rate below the usual 10−9 – 10−10 range typical for unicellular eukaryotic species.
Fig. 3.

Effective population sizes (Ne) and per site mutation rates (µ) in Emiliania huxleyi (star) and other species (blue, animals; dark green, plants; light green, unicellular eukaryotes; red, bacteria) There is a strong negative correlation between Ne and µ (Pearson corr. test, ρ = −0.78 and P value = 5.706×10−7). Data used for this plot are listed in supplementary table S5, Supplementary Material online.

Effective population sizes (Ne) and per site mutation rates (µ) in Emiliania huxleyi (star) and other species (blue, animals; dark green, plants; light green, unicellular eukaryotes; red, bacteria) There is a strong negative correlation between Ne and µ (Pearson corr. test, ρ = −0.78 and P value = 5.706×10−7). Data used for this plot are listed in supplementary table S5, Supplementary Material online. Our analysis revealed that despite the astronomically large E. huxleyi populations, selection is unable to reduce the mutation rate <10−10 per nucleotide per cell division. One possibility to explain this apparent minimum to the mutation rate is that ∼10−10 represents the limit to how low the mutation rate can be reduced due to intrinsic biochemical constraints of replication and error correction cellular machinery (Kunkel and Erie 2015). Very few known species, regardless of their biology and ecology, can reach a per site mutation rate <10−10. The only known organisms with <10−10 mutation rate are ciliates (Sung, Tucker, et al. 2012; Long et al. 2018). Another possibility is that despite the astronomical census population size of E. huxleyi, its effective population size (Ne) is relatively modest, that is, drift is relatively high due to demography or other reasons, as reflected by the modest genetic diversity of this species (Filatov 2019). Indeed, the estimates of E. huxleyi Ne from its genetic diversity (Ne = πs / (4×µ) ∼2.7 million) and codon bias Ne ∼ S/4s = 1.084 / 4×10−7 ∼2.7 million are both smaller than the estimates of Ne in other marine phytoplankton species, including the green algae O. tauri (Ne ∼ 12 million; Blanc-Mathieu et al. 2017) and the diatom P. tricornutum (Ne ∼ 8.7 million; Krasovec et al. 2019). The reasons for extreme disparity between the astronomically large census population size and modest effective population size in E. huxleyi remain uncertain, though, it is clear that given very low linkage disequilibrium, this disparity is unlikely to be caused by periodic asexual reproduction of this species (Filatov 2019). Neither population size changes (Filatov 2019) nor speciation bottlenecks can explain the limited genetic diversity of E. huxleyi because the population size of this abundant species remained large even during its recent (∼290 ka) speciation from Gephyrocapsa (figure 2c in Bendif et al. [2019]). Linked selection (“genetic draft”) may potentially account for modest Ne and limited genetic diversity, though, the amount of selection needed for this appears very high (figure 7 in Filatov [2019]). Our study rules out the possibility that the extreme disparity between modest genetic diversity and astronomically large population size in the plankton species is due to an unusually low mutation rate that is expected to evolve in very large populations (fig. 3 and Sung, Ackerman, et al. 2012). The population genetic processes dominating huge populations of marine plankton species, such as E. huxleyi studied here, remain poorly understood (Rengefors et al. 2017) and deserve more attention from the research community. Click here for additional data file.
  52 in total

1.  Mitochondrial Mutation Rate, Spectrum and Heteroplasmy in Caenorhabditis elegans Spontaneous Mutation Accumulation Lines of Differing Population Size.

Authors:  Anke Konrad; Owen Thompson; Robert H Waterston; Donald G Moerman; Peter D Keightley; Ulfar Bergthorsson; Vaishali Katju
Journal:  Mol Biol Evol       Date:  2017-06-01       Impact factor: 16.240

2.  Evolution of codon usage bias in Drosophila.

Authors:  J R Powell; E N Moriyama
Journal:  Proc Natl Acad Sci U S A       Date:  1997-07-22       Impact factor: 11.205

3.  Extreme Lewontin's Paradox in Ubiquitous Marine Phytoplankton Species.

Authors:  Dmitry A Filatov
Journal:  Mol Biol Evol       Date:  2019-01-01       Impact factor: 16.240

4.  The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana.

Authors:  Stephan Ossowski; Korbinian Schneeberger; José Ignacio Lucas-Lledó; Norman Warthmann; Richard M Clark; Ruth G Shaw; Detlef Weigel; Michael Lynch
Journal:  Science       Date:  2010-01-01       Impact factor: 47.728

Review 5.  DNA methylation and mutation.

Authors:  R Holliday; G W Grigg
Journal:  Mutat Res       Date:  1993-01       Impact factor: 2.433

6.  RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome.

Authors:  Bo Li; Colin N Dewey
Journal:  BMC Bioinformatics       Date:  2011-08-04       Impact factor: 3.307

7.  Estimation of the spontaneous mutation rate in Heliconius melpomene.

Authors:  Peter D Keightley; Ana Pinharanda; Rob W Ness; Fraser Simpson; Kanchon K Dasmahapatra; James Mallet; John W Davey; Chris D Jiggins
Journal:  Mol Biol Evol       Date:  2014-11-03       Impact factor: 16.240

8.  Natural selection constrains neutral diversity across a wide range of species.

Authors:  Russell B Corbett-Detig; Daniel L Hartl; Timothy B Sackton
Journal:  PLoS Biol       Date:  2015-04-10       Impact factor: 8.029

9.  Evolutionary forces affecting synonymous variations in plant genomes.

Authors:  Yves Clément; Gautier Sarah; Yan Holtz; Felix Homa; Stéphanie Pointet; Sandy Contreras; Benoit Nabholz; François Sabot; Laure Sauné; Morgane Ardisson; Roberto Bacilieri; Guillaume Besnard; Angélique Berger; Céline Cardi; Fabien De Bellis; Olivier Fouet; Cyril Jourda; Bouchaib Khadari; Claire Lanaud; Thierry Leroy; David Pot; Christopher Sauvage; Nora Scarcelli; James Tregear; Yves Vigouroux; Nabila Yahiaoui; Manuel Ruiz; Sylvain Santoni; Jean-Pierre Labouisse; Jean-Louis Pham; Jacques David; Sylvain Glémin
Journal:  PLoS Genet       Date:  2017-05-22       Impact factor: 5.917

10.  A genomics approach reveals the global genetic polymorphism, structure, and functional diversity of ten accessions of the marine model diatom Phaeodactylum tricornutum.

Authors:  Achal Rastogi; Fabio Rocha Jimenez Vieira; Anne-Flore Deton-Cabanillas; Alaguraj Veluchamy; Catherine Cantrel; Gaohong Wang; Pieter Vanormelingen; Chris Bowler; Gwenael Piganeau; Hanhua Hu; Leila Tirichine
Journal:  ISME J       Date:  2019-10-17       Impact factor: 10.302

View more
  3 in total

1.  How Can We Resolve Lewontin's Paradox?

Authors:  Brian Charlesworth; Jeffrey D Jensen
Journal:  Genome Biol Evol       Date:  2022-07-02       Impact factor: 4.065

2.  Rampant nuclear-mitochondrial-plastid phylogenomic discordance in globally distributed calcifying microalgae.

Authors:  Tzu-Tong Kao; Tzu-Haw Wang; Chuan Ku
Journal:  New Phytol       Date:  2022-06-15       Impact factor: 10.323

3.  Temperature dependence of spontaneous mutation rates.

Authors:  Ann-Marie Waldvogel; Markus Pfenninger
Journal:  Genome Res       Date:  2021-07-23       Impact factor: 9.043

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.