Literature DB >> 31633794

Purifying Selection in Corvids Is Less Efficient on Islands.

Verena E Kutschera1,2, Jelmer W Poelstra3, Fidel Botero-Castro4, Nicolas Dussex5,6, Neil J Gemmell5, Gavin R Hunt7, Michael G Ritchie8, Christian Rutz8, R Axel W Wiberg8,9, Jochen B W Wolf1,4.   

Abstract

Theory predicts that deleterious mutations accumulate more readily in small populations. As a consequence, mutation load is expected to be elevated in species where life-history strategies and geographic or historical contingencies reduce the number of reproducing individuals. Yet, few studies have empirically tested this prediction using genome-wide data in a comparative framework. We collected whole-genome sequencing data for 147 individuals across seven crow species (Corvus spp.). For each species, we estimated the distribution of fitness effects of deleterious mutations and compared it with proxies of the effective population size Ne. Island species with comparatively smaller geographic range sizes had a significantly increased mutation load. These results support the view that small populations have an elevated risk of mutational meltdown, which may contribute to the higher extinction rates observed in island species.
© The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  avian genomics; comparative analysis; distribution of fitness effects; molecular evolution; mutation load; selection

Year:  2020        PMID: 31633794      PMCID: PMC6993847          DOI: 10.1093/molbev/msz233

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


Introduction

The fate of a mutation entering a population depends on the effect it exerts on the fitness of its carrier. Under the assumption that most changes to the DNA are detrimental, selection is expected to primarily act to remove deleterious alleles (Elena et al. 1998; Keightley and Lynch 2003). Lethal or strongly deleterious mutations have almost no possibility of becoming fixed in a population, but the retention of weakly deleterious mutations depends on the effective population size, Ne (Charlesworth 2009; Akashi et al. 2012). In populations where a large number of individuals contribute genes to the next generation, purifying selection is expected to be efficient relative to genetic drift. In small populations, however, weakly deleterious mutations can rise to high frequencies, elevating the risk of population extinction (Lande 1994; Soulé and Mills 1998). Island species are not only constrained by limited geographic range, and hence relatively small population size, but also by limited opportunities to compensate for local population crashes through immigration from larger source populations (Frankham 2015). Consistent with these theoretical expectations, extinction rates of island populations are elevated in comparison to their mainland counterparts (Frankham 1998). The distribution of fitness effects of deleterious mutations (hereafter simply referred to as DFE) is a key evolutionary parameter describing the interplay between purifying selection and genetic drift (Eyre-Walker and Keightley 2007). It quantifies the proportion of harmful mutations segregating in a population and thereby provides a useful measure of mutation load. By convention the DFE is conceptualized as discrete classes of mutations scaled by selection coefficients (s) and population size (N). Mutations in the ranges of Ns ∈ [1;10) and Ns > 10 include deleterious and strongly deleterious mutations, respectively. Mutations with Ns ∈ [0;1) include slightly deleterious mutations (Ns ∼ 1) as well as the class of neutral and effectively neutral mutations (Ns ≪ 1); we refer to this class of mutations jointly as “mildly deleterious” (cf. Deinum et al. 2015). Note that, we report the absolute value of population-scaled selection coefficients with higher values representing ever more deleterious effects. Despite the conceptual importance of the DFE, empirical data on its shape are rare (but see Huber et al. 2017; Castellano et al. 2019), and its biological determinants remain elusive. Insights come from mutagenesis or mutation accumulation experiments (Elena et al. 1998; Keightley and Lynch 2003) and indirect inference from sequencing data (Keightley and Eyre-Walker 2010; Racimo and Schraiber 2014; Deinum et al. 2015). Consistent with a central prediction of the nearly neutral theory, the proportion of mildly deleterious mutations tends to scale negatively with proxies for Ne (Slotte et al. 2010; Deinum et al. 2015; Chen et al. 2017). Because long-term Ne is generally unknown, it has been approximated by geographic range size (Leffler et al. 2012) or a number of life-history traits, including body mass, age of sexual maturity, fecundity, or propagule size (Romiguier et al. 2014; Figuet et al. 2016; Chen et al. 2017). Here, we estimated the DFE from whole-genome sequencing data of seven avian species within the genus Corvus (fig. 1). The data set includes five widely distributed mainland species and two species that are restricted to small tropical islands. Crows and jackdaws from North America (C. brachyrhynchos), Eurasia (C. (corone) spp., C. dauuricus, C. monedula), or the Indian subcontinent (C. splendens) inhabit large ranges, encompassing on an average 27×106 km2 (range: 5–82×106 km2). In contrast, New Caledonian and white-billed crows (C. moneduloides, C. woodfordi) evolved on small islands (0.02–0.05×106 km2) and remained separated from their closest mainland relatives for millions of years (Haring et al. 2012; Jønsson et al. 2012, 2016). The species also vary in body size (length: 36–53 cm, mass: 208–570 g) (Dunning 1992; del Hoyo et al. 2017), which may serve as an additional indicator of long-term Ne (Figuet et al. 2016). We expect from theory that mutation load should be elevated in large-bodied species or in species with small geographic ranges.
. 1

Study system and mutation load. (A) Phylogeny of the genus Corvus, redrawn after Jønsson et al. (2012). Species included in this study are highlighted in bold, and numbers in brackets indicate the number of whole-genome resequenced individuals. The geographic origin is indicated by color (blue, island; green, continent). For the purpose of this article, we treat the up to five taxonomic groups that have been recognized within the Corvus (corone) species complex (Parkin et al. 2003) as a single species sharing recent ancestry with a substantial amount of cosegregating genetic variation (Vijay et al. 2016). (B) Mutation load of a species, estimated as the proportion of mildly deleterious mutations Ns ∈ [0;1) differs by geographic origin (island vs. continent), and varies to a lesser extent with distributional range; the effect of body size is not statistically significant. (C) Species illustrations courtesy of Handbook of the Birds of the World Alive.

Study system and mutation load. (A) Phylogeny of the genus Corvus, redrawn after Jønsson et al. (2012). Species included in this study are highlighted in bold, and numbers in brackets indicate the number of whole-genome resequenced individuals. The geographic origin is indicated by color (blue, island; green, continent). For the purpose of this article, we treat the up to five taxonomic groups that have been recognized within the Corvus (corone) species complex (Parkin et al. 2003) as a single species sharing recent ancestry with a substantial amount of cosegregating genetic variation (Vijay et al. 2016). (B) Mutation load of a species, estimated as the proportion of mildly deleterious mutations Ns ∈ [0;1) differs by geographic origin (island vs. continent), and varies to a lesser extent with distributional range; the effect of body size is not statistically significant. (C) Species illustrations courtesy of Handbook of the Birds of the World Alive.

Results

We estimated the DFE for population samples of all seven species from segregating sites with reliable genotype information for least eight chromosomal copies per species (fig. 1). In order to control for demographic perturbation, we contrasted the site frequency spectrum (SFS) of neutral mutations at 4-fold degenerate sites with the frequency of mutations subject to selection at 0-fold degenerate sites (2-epoch model; Keightley and Eyre-Walker 2007). The inferred proportion of mildly deleterious sites Ns ∈ [0;1) ranged from 0.178 to 0.221 in the species with broad geographic ranges, and was higher in the two island species C. moneduloides (0.252) and C. woodfordi (0.332) (supplementary table S1, Supplementary Material online). Statistical models including origin (mainland vs. island) were most strongly supported, based on Akaike’s information criterion for small sample sizes (ΔAICc ≤ 2) (table 1 andfig. 1). Summing over the conditional model probabilities, origin had the strongest effect (wAICc = 0.665), followed by geographic range size (wAICc = 0.142), and body size with the least support (wAICc = 0.008). This result suggests that colonization of islands with subsequent persistence in a confined geographic area may indeed limit long-term effective size of a population, and hence reduce the efficacy of selection (Corbett-Detig et al. 2015). Consistent with this expectation, long-term Ne, as independently approximated by the harmonic mean of the cross-coalescence rate through time (Schiffels and Durbin 2014), was higher for individuals of the widely distributed species C. (c.) cornix (Ne = 88,219) and C. brachyrhynchos (Ne = 122,430) than for the island species C. moneduloides (Ne = 63,236).
Table 1

Summary of All Statistical Multiple Linear Regression Models (numbered 0–7) Exploring Relationships between Mutation Load Ns ∈ [0;1), As Estimated by the 2-Epoch Model of DFEalpha (Keightley and Eyre-Walker 2007), Geographic Range Size, Body Length, and the Mainland-Island Contrast.

ModelType I Error
k AICcΔAICcwAIC R 2 adj
InterceptGeographic Range SizeBody LengthOrigin [mainland/island]
30.0000.0133−17.93600.6630.693
00.0002−15.4042.5310.1870.000
10.0020.0413−14.8353.10.1410.521
20.1000.6103−8.8079.1290.007−0.133
50.9310.4830.1504−4.90813.0280.0010.665
40.0690.7820.0294−4.08413.8520.0010.624
60.0260.0790.9404−0.84717.08900.402
70.9350.3570.4610.153535.653.53600.640
Σ wAIC0.1420.0080.665

Note.—Type I errors are given for each model and response variable. The best supported model is highlighted in gray.

k, number of parameters; AICc, Akaike’s information criterion adjusted for small sample sizes; ΔAICc, difference in AICc relative to the best model; wAICc, Akaike weights for small sample sizes; ΣwAICc, summed Akaike weights; R2adj, squared correlation coefficient, adjusted for the number of parameters.

Summary of All Statistical Multiple Linear Regression Models (numbered 0–7) Exploring Relationships between Mutation Load Ns ∈ [0;1), As Estimated by the 2-Epoch Model of DFEalpha (Keightley and Eyre-Walker 2007), Geographic Range Size, Body Length, and the Mainland-Island Contrast. Note.—Type I errors are given for each model and response variable. The best supported model is highlighted in gray. k, number of parameters; AICc, Akaike’s information criterion adjusted for small sample sizes; ΔAICc, difference in AICc relative to the best model; wAICc, Akaike weights for small sample sizes; ΣwAICc, summed Akaike weights; R2adj, squared correlation coefficient, adjusted for the number of parameters. The reliance of the DFE on estimates of genetic variation at both selected as well as selectively neutral sites makes it sensitive to perturbation in the “neutral SFS” introduced by factors other than selection. Processes such as demographic change or population structure can accordingly modulate the SFS, mimicking the outcome of selection. For validation of the results above, we therefore explored the effects of sample size, population structure, and demographic population history using a large test data set of 118 individuals from the European crow species complex (Vijay et al. 2016). Without applying any correction for the distribution of the “neutral SFS” (1-epoch model; Keightley and Eyre-Walker 2007) the DFE varied substantially by sample size and population. However, applying model-based correction using an explicit demographic parametrization (2-epoch model) stabilized the DFE across a large range of sample sizes (3–118 individuals), demographic histories, and population stratification, including artificial admixture by pooling across all populations (supplementary text, table S2, and figs. S1 and S2, Supplementary Material online). Moreover, different methods controlling for perturbation in the “neutral SFS” yielded very similar estimates for the proportion of mildly deleterious sites in all seven species (supplementary table S1, Supplementary Material online). Results from the 2-epoch model, presented above, were near-identical to estimates using an unspecified “nuisance parameter” in a diffusion approximation framework (Eyre-Walker et al. 2006), or when simply considering the ratio of nucleotide diversity of 0- and 4-fold degenerate sites (π0/π4) (Pearson’s r = 0.95–0.99; supplementary table S3, Supplementary Material online). As for the test data set, applying model-based correction (2-epoch model) also generally improved the fit to the data for the remaining six species (supplementary table S1, Supplementary Material online). Estimates derived both from π0/π4 and the method of Eyre-Walker et al. (2006) (supplementary table S1, Supplementary Material online) likewise identified island origin and small geographic range size as the main factors associated with increased mutation load (supplementary table S4 and fig. S3, Supplementary Material online). Species samples were not evenly distributed across the phylogeny (fig. 1). Although phylogenetic conservatism for Ne is not expected, shared ancestry may contribute to explaining variation in the DFE across species and lead to overconfident statistical inference. We therefore quantified the degree of shared polymorphism as a proxy for the contribution of shared ancestral population signatures to the DFE of individual species. Proportions were nonzero for the closest species pairs, but overall low suggesting that the species under consideration have largely reached evolutionary independence (supplementary table S5, Supplementary Material online). Moreover, reanalyses of the best-supported models (table 1) with explicit inclusion of phylogenetic distance showed no evidence for a phylogenetic contribution to the inference. Models with parameter estimates of zero for the covariance structure of the error received the highest likelihood.

Discussion

In this study, we took a comparative genomic approach to address the relationship between mutation load and population size proxies. Using genome-wide data from 118 individuals of the Corvus (corone) spp. species complex, we established that a moderate sample of less than ten chromosomal copies contains reliable information characterizing the species’ DFE. Model-based correction of factors other than selection perturbing the “neutral SFS” stabilized the DFE across populations and demographic histories. Population stratification in the Corvus (corone) spp. species complex is moderate, and comparable to other species (FST range: autosomes, 0.02–0.25; sex chromosome, 0.04–0.51) (Poelstra et al. 2014). Populations within this species complex coalesce more recently with each other than with the American sister species C. brachyrhynchos (Haring et al. 2012; Vijay et al. 2016). It is thus reasonable to assume that representative population samples sharing recent common ancestry approximate the selective processes acting during much of the history of the species as a whole. Under this premise, we quantified mutation load as the proportion of mildly deleterious sites (Ns ∈ [0;1)), and related it to life-history traits for all seven species in our data set. The use of genome-scale data for multiple individuals of several species, though moderate in number, allowed for statistical treatment in a comparative framework. Moreover, in contrast to studies leveraging information on the degree of purifying selection in natural populations from broad taxonomic sampling (Chen et al. 2017), we focused our analyses on a single avian genus. This allowed us to explore the effects of Ne as approximated by variation in body size and geographic range size, while keeping variation in other relevant life-history traits—such as the genetic system or propagule size—to a minimum (Owens and Bennett 1995). It also circumvents phylogenetic inertia that may otherwise hamper the interpretation of differences in DFE across large evolutionary distances. With due caution concerning the relatively small number of species, we found a statically supported relationship between the mutation load of a species and its distributional range. This finding is in accordance with the observation that species with large population sizes appear to purge deleterious mutations more efficiently (Glémin 2003). Furthermore, our study provides evidence that species living on islands accumulate mildly deleterious mutations more readily than more widely distributed species (Johnson and Seger 2001; Woolfit and Bromham 2005; Gardiner et al. 2008). Similar to what has been reported elsewhere for the degree of genetic polymorphism (James et al. 2016), the difference between island and continental species with respect to the inferred mutation load was relatively small. Moreover, island species show a higher degree of variation (supplementary table S1, Supplementary Material online). Although this observation clearly needs to be substantiated for a larger sample of species, it may suggest a disproportionate effect of specific colonization history (bottleneck, degree of postcolonization gene flow) that is difficult to capture with simple proxies for Ne. There have been recent claims (Kern and Hahn 2018) and counter-claims (Jensen et al. 2019) about the extent to which comparative genomics studies are rejecting the explanatory power of the neutral theory. Our findings are consistent with the central prediction from nearly neutral theory of negative scaling between the proportion of deleterious mutations and effective population size (Ohta 1992). Our results further imply that populations of mainland species with large distributional ranges harbor fewer deleterious mutations than their island congeners making the latter potentially more vulnerable to mutational meltdown (Frankham 1998). To corroborate the generality of this observation studies exploring the distribution of fitness effects in a large number of closely related species across a variety of taxa are encouraged.

Materials and Methods

A detailed account of the methods can be found in the supplementary text, Supplementary Material online.

Data Collection and Processing

We collated whole-genome resequencing data for a total of 147 individuals from population samples of seven Corvus species (see fig. 1 andsupplementary table S6, Supplementary Material online). After quality assessment and adapter trimming, sequencing reads were aligned with bwa v0.7.13 (Li and Durbin 2009) to the closest available reference genome, either C. moneduloides (accession number: VRTO00000000) or C. (corone) cornix (Poelstra et al. 2014). Syntenic regions between the two reference assemblies were determined from lift-overs using SatsumaSynteny (Grabherr et al. 2010). Variant discovery and genotyping were performed using GATK v3.4.0 (Auwera et al. 2013). CpG-prone sites and sites with missing data were removed from vcf files.

Estimation of the DFE

We approximated the DFE by a gamma distribution using DFEalpha v.2.15 (Keightley and Eyre-Walker 2007) and DoFE v3.1 (Eyre-Walker et al. 2006). We further calculated the ratio of nucleotide diversity at nonsynonymous, 0-fold degenerate sites, and at synonymous, 4-fold degenerate sites (π0/π4), reflecting the strength of selection (Charlesworth and Eyre-Walker 2008).

Life-History Parameters, Demographic Inference, and Statistical Model Selection

For each species, we extracted data on geographical range size and mean body size (see supplementary table S6, Supplementary Material online, for data and references). Body length and body mass showed a high degree of collinearity (r = 0.88) and we report results for the former. Body mass, however, yielded qualitatively the same results (supplementary table S4d, Supplementary Material online). Statistical analyses of relationships between the DFE and life-history parameters were based on model selection of multiple linear regression models using Akaike’s information criterion (Burnham and Anderson 2002) and were performed in R version 3.5.1 (R Core Team 2018). To test for possible effects of genealogical nonindependence, we fitted a linear model allowing for correlation of errors as implemented in the package nlme v. 3.1-131.1 (Pinheiro et al. 2017). The correlation structure was specified by the phylogenetic distance matrix (based on both Jønsson et al. 2012, 2016) and modeled assuming Brownian motion (Freckleton et al. 2002). For all individuals with a minimum average sequencing coverage of 18×, we also estimated demographic history through time, using the program MSMC (Schiffels and Durbin 2014). Click here for additional data file.
  37 in total

1.  A supermatrix phylogeny of corvoid passerine birds (Aves: Corvides).

Authors:  Knud Andreas Jønsson; Pierre-Henri Fabre; Jonathan D Kennedy; Ben G Holt; Michael K Borregaard; Carsten Rahbek; Jon Fjeldså
Journal:  Mol Phylogenet Evol       Date:  2015-09-03       Impact factor: 4.286

2.  Population size and molecular evolution on islands.

Authors:  Megan Woolfit; Lindell Bromham
Journal:  Proc Biol Sci       Date:  2005-11-07       Impact factor: 5.349

3.  Determining the factors driving selective effects of new nonsynonymous mutations.

Authors:  Christian D Huber; Bernard Y Kim; Clare D Marsden; Kirk E Lohmueller
Journal:  Proc Natl Acad Sci U S A       Date:  2017-04-11       Impact factor: 11.205

4.  Phylogenetic analysis and comparative data: a test and review of evidence.

Authors:  R P Freckleton; P H Harvey; M Pagel
Journal:  Am Nat       Date:  2002-12       Impact factor: 3.926

5.  Distribution of fitness effects caused by random insertion mutations in Escherichia coli.

Authors:  S F Elena; L Ekunwe; N Hajela; S A Oden; R E Lenski
Journal:  Genetica       Date:  1998       Impact factor: 1.082

6.  Genetic rescue of small inbred populations: meta-analysis reveals large and consistent benefits of gene flow.

Authors:  Richard Frankham
Journal:  Mol Ecol       Date:  2015-03-23       Impact factor: 6.185

7.  Joint inference of the distribution of fitness effects of deleterious mutations and population demography based on nucleotide polymorphism frequencies.

Authors:  Peter D Keightley; Adam Eyre-Walker
Journal:  Genetics       Date:  2007-12       Impact factor: 4.562

8.  Recent Evolution in Rattus norvegicus Is Shaped by Declining Effective Population Size.

Authors:  Eva E Deinum; Daniel L Halligan; Rob W Ness; Yao-Hua Zhang; Lin Cong; Jian-Xu Zhang; Peter D Keightley
Journal:  Mol Biol Evol       Date:  2015-06-01       Impact factor: 16.240

9.  Molecular Evolutionary Consequences of Island Colonization.

Authors:  Jennifer E James; Robert Lanfear; Adam Eyre-Walker
Journal:  Genome Biol Evol       Date:  2016-07-02       Impact factor: 3.416

10.  Revisiting an old riddle: what determines genetic diversity levels within species?

Authors:  Ellen M Leffler; Kevin Bullaughey; Daniel R Matute; Wynn K Meyer; Laure Ségurel; Aarti Venkat; Peter Andolfatto; Molly Przeworski
Journal:  PLoS Biol       Date:  2012-09-11       Impact factor: 8.029

View more
  2 in total

Review 1.  Genetic load: genomic estimates and applications in non-model animals.

Authors:  Giorgio Bertorelle; Francesca Raffini; Hernán E Morales; Cock van Oosterhout; Mirte Bosse; Chiara Bortoluzzi; Alessio Iannucci; Emiliano Trucchi
Journal:  Nat Rev Genet       Date:  2022-02-08       Impact factor: 59.581

2.  A Nearly Neutral Model of Molecular Signatures of Natural Selection after Change in Population Size.

Authors:  Rebekka Müller; Ingemar Kaj; Carina F Mugal
Journal:  Genome Biol Evol       Date:  2022-05-03       Impact factor: 4.065

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.