Literature DB >> 34188867

Demographic history and local adaptation of Myripnois dioica (Asteraceae) provide insight on plant evolution in northern China flora.

Nan Lin1,2,3, Jacob B Landis4, Yanxia Sun2,5, Xianhan Huang1, Xu Zhang2, Qun Liu6, Huajie Zhang2, Hang Sun1, Hengchang Wang2,5, Tao Deng1.   

Abstract

The flora of northern China forms the main part of the Sino-Japanese floristic region and is located in a south-north vegetative transect in East Asia. Phylogeographic studies have demonstrated that an arid belt in this region has promoted divergence of plants in East Asia. However, little is known about how plants that are restricted to the arid belt of flora in northern China respond to climatic oscillation and environmental change. Here, we used genomic-level data of Myripnois dioica across its distribution as a representative of northern China flora to reconstruct plant demographic history, examine local adaptation related to environmental disequilibrium, and investigate the factors related to effective population size change. Our results indicate M. dioica originated from the northern area and expanded to the southern area, with the Taihang Mountains serving as a physical barrier promoting population divergence. Genome-wide evidence found strong correlation between genomic variation and environmental factors, specifically signatures associated with local adaptation to drought stress in heterogeneous environments. Multiple linear regression analyses revealed joint effects of population age, mean temperature of coldest quarter, and precipitation of wettest month on effective population size (Ne). Our current study uses M. dioica as a case for providing new insights into the evolutionary history and local adaptation of northern China flora and provides qualitative strategies for plant conservation.
© 2021 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd.

Entities:  

Keywords:  Myripnois dioica; RAD‐seq; demographic history; effective population size; genomic variations; local adaption

Year:  2021        PMID: 34188867      PMCID: PMC8216978          DOI: 10.1002/ece3.7628

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


INTRODUCTION

The Sino‐Japanese floristic region (SJFR) is one of the oldest and richest area of temperate floral elements in the North Hemisphere, and the northern China flora composes the main part of the SJFR (Chen et al., 2018; Gao et al., 2003; Lu et al., 2018; Wu & Wu, 1996). Northern China has served as an important region composing the north–south vegetation transects from tropical to cold forests and taiga, and paleovegetation evidence suggests that the current components of the northern China flora are mainly derived from the Neogene–Quaternary period (Gao et al., 2003; Manchester et al., 2009; Wang, 1999). Covering a relatively wide geographic location, along with past climatic oscillations and geological changes, a high number of species are found in northern China (Wu et al., 2005; Wu & Wu, 1996; Yu et al., 2000), of which are an abundance of endemic species, such as Xanthoceras sorbifolium Bunge (Sapindaceae), Opisthopappus taihangensis (Ling) Shih (Compositae), and Taihangia rupestris Yu et Li (Rosaceae) (Wang, 1999). Genetic evidence has suggested that plants in the northern China region contracted to southern glacial refugia during the last glacial maxima (LGM) and then expanded northwards during postglacial migration. This hypothesis is supported in present‐day phylogroups with northern region populations originating from recent range expansions of a southern refugium, which may present founder and bottleneck effects with reduced genetic diversity within and between populations (Cao et al., 2015; Chen et al., 2008; Hao et al., 2018; Tang et al., 2018; Tian et al., 2009). Alternative hypotheses include the remnants of multiple geographically isolated refugia occurring in northern China causing population differentiation of many plant species in the region (Hou et al., 2020; Wang et al., 2017; Ye et al., 2019). Studies addressing both hypotheses have predominately been limited to using several loci and almost exclusively conifer species, which inadequately address the questions. Thus, genome‐level data and plants with shorter life cycles can help to better demonstrate the evolution of plants making up the northern China flora. Physical barriers containing multiple mountains and drainage systems have acted on species differentiation in northern China, such as the Taihang Mountains promoting intraspecies divergence (Hou et al., 2014; Ye et al., 2018). In addition, the uplift of the Qinghai–Tibetan Plateau (QTP) has aggravated monsoon intensity and enhanced aridity in the Asian interior, which profoundly influenced the climate of northern China (Guo et al., 2008; Li et al., 2001; Shi et al., 1999). With the formation of an east–west arid belt, this ecological barrier played a significant role in plant divergence and can be tested with phylogeographic studies (Bai et al., 2016; Guo et al., 2008). Thus, northern China is a model region for studying how these intricate factors together influence the evolution of plants. In response to selective pressures from environmental heterogeneity, species undergo local adaptation that can be identified using population genetic analyses based on genome–environment association data (Joost et al., 2007; Li et al., 2017). Many cases based on nonmodel plant species have illustrated environmental factors including soil, temperature, and precipitation have played important roles in driving differentiation in locally adaptive species (Chao et al., 2020; De Kort et al., 2014; Jones et al., 2013). The regional climate impacting the local flora of northern China is arid with small quantities of precipitation that may be the major environmental factors effecting adaptive evolution. Previous studies in Phyteuma hemisphaericum L. (Campanulaceae), Campanula barbata L. (Campanulaceae), and Carex sempervirens Vill. (Cyperaceae) (Basu et al., 2016; Jones et al., 2013; Manel et al., 2012) identified genetic variation that is highly related to precipitation factors, emphasizing local adaptation within species to cope with drier climatic conditions. However, few environmental–genetic studies have been reported for the flora of northern China, and the link to whether/how the evolution of plant populations is driven by adaptation to drought in this region has not fully been investigated. In this study, we use genome‐wide data to find correlations between loci and ecological factors based on a representative species to explore adaptive evolution pattern of regional vegetation in the northern China flora. Myripnois dioica Bunge is a temperate, deciduous shrub species ranging from 35°N to 45°N (Figure 1), geographically overlapping the arid belt. The species belongs to the daisy family and is wind‐dispersed with seeds having long pappi, but is distinguishable from other closely related species by its shrub life form (Gao & Nicholas, 2011). There is little known about this long‐neglected species except for its ornamental value (Xie & Zhang, 2012; Xu et al., 2007). The species is widely distributed along the Taihang Mountains in habitats of uneven altitude from 200 m a. s. l. to 1,500 m a. s. l. and can survive in dry, cold, and rocky conditions (Gao & Nicholas, 2011; Xie & Zhang, 2012). Since M. dioica is one of the dominate components of shrub community in northern China, the species has undergone complicated geographic and climatic changes together with extreme ecological adaption. Thus, this species is an ideal system to investigate the joint effects of Pleistocene climatic oscillations, biogeographic barriers, and patterns of ecological adaption in the flora of northern China.
FIGURE 1

(a) Geographical distribution of two genomic groups of Myripnois dioica based on ADMIXTURE; populations are color‐coded corresponding to different groups. (b) Plot of individuals of M. dioica along principal component analysis (PCA) scores of genetic variations based on the analysis of 22,868 SNPs; the triangles and dots are consistent with south and north groups. (b) Plots of posterior probabilities for individuals of M. dioica assigned to K genetic clusters from Admixture analyses for K = 2. Population names listed along the bottom of the plot and the south and north groups are delimited by yellow and blue color

(a) Geographical distribution of two genomic groups of Myripnois dioica based on ADMIXTURE; populations are color‐coded corresponding to different groups. (b) Plot of individuals of M. dioica along principal component analysis (PCA) scores of genetic variations based on the analysis of 22,868 SNPs; the triangles and dots are consistent with south and north groups. (b) Plots of posterior probabilities for individuals of M. dioica assigned to K genetic clusters from Admixture analyses for K = 2. Population names listed along the bottom of the plot and the south and north groups are delimited by yellow and blue color Given the complex factors contributing to the evolution of M. dioica in northern China, efforts in identifying and quantifying important factors will undoubtedly lead to better conservation strategies, providing an evolutionary diversity reference for the framework of conservation genetics (Jarzyna & Jetz, 2016; Moritz & Potter, 2013). Phylogeography has been shown to be useful in plant conservation by investigating historical and evolutionary questions utilizing genetic structure, spatial, and demographic perspectives (Médail & Baumel, 2018). Studies in plants have also suggested that both expansion history and abiotic climate directly impact effective population size variation (Braasch et al., 2019; Micheletti & Storfer, 2015). In this case, inherited mechanisms for how specific historical and climatic factors affect species evolution and population size changes could protect plant species from extinction under disturbances by human encroachment and global climate change (Pauls et al., 2013; Polechová & Storch, 2008; Pritchard et al., 2000). Genome‐wide data derived from restriction enzyme‐associated DNA sequencing (RAD‐seq) have been effective and well‐suited for phylogeographic studies and investigating adaptive evolution (Haselhorst et al., 2019; Warschefsky & von Wettberg, 2019). Large numbers of markers can be obtained from RAD‐seq to identify evolutionary patterns that are not easily visible in traditional analyses based limited loci. In the current study, we employed M. dioica as a representative member of the flora of northern China to identify phylogeographic patterns and processes impacting this species. Our goals are to (a) investigate demographic history of M. dioica; (b) explore local adaptation of M. dioica to dry environmental conditions in northern China; (c) address specific mechanism for potential historical and environmental factors related to changes in effective population size of M. dioica.

MATERIALS AND METHODS

Sampling, DNA extraction, and RAD sequencing

We sampled 77 individuals from 16 populations spanning the geographic range of M. dioica. Fresh leaves from each individual were dried in silica gel. Total genomic DNA was extracted using a modified cetyltrimethylammonium bromide (CTAB) method (Doyle et al., 1996). The quality of DNA was visualized on a 1% agarose gel and quantified on a Qubit® 2.0 fluorometer. Genomic DNA was digested with the restriction enzyme EcoRI, and restriction site‐associated DNA library construction and sequencing were performed by the Beijing Genomics Institute BGI (Wuhan, China) with paired‐end sequencing on an Illumina HiSeq 2000 platform. The raw data were deposited in the NCBI Short Read Archive (SRA) (Bioproject No. PRJNA607823).

Bioinformatics pipeline for SNP calling

All 77 individuals from 16 populations were used for de novo SNP calling (Table S1 and Figure 1). Only the forward reads of the paired ends were used for analyses due to low coverage of the reverse reads. Reads were filtered for quality and demultiplexed using process_radtags with default parameter in Stacks 2.0 (Catchen et al., 2013). We then used the denovo_map.pl wrapper to identify SNPs by clustering reads with a minimum number of four raw reads and allowing two mismatches between loci. Given the limited number of individuals within each population, the populations command was used to filter loci so that polymorphic loci in at least 50% of the individuals and twelve populations were retained. We further removed loci with minor allele frequencies <0.05 and only included the first SNP per locus in the final analysis to avoid linkage bias. Finally, a maximum observed heterozygosity was set to 0.5. PGDSPIDER 2.1.1.5 and VCFtools 0.1.15 were used to generate input files for downstream analyses (Danecek et al., 2011; Lischer & Excoffier, 2012).

Summary statistics and population structure

We used the populations command of Stacks to estimate nucleotide diversity (π), observed heterozygosity (H O), expected heterozygosity (H E), and fixation index (F IS) for each population. We performed pairwise population F ST values using an analysis of molecular variance (AMOVA) with 1,000 permutations as implemented in ARLEQUIN 3.0 (Excoffier et al., 2005). Population structure analyses were performed to infer the most likely number of ancestral populations using ADMIXTURE 1.23 (Alexander et al., 2009) by determining the optimal partitioning of the populations according to cross‐validation error value of clusters 1–10. A principal component analysis (PCA) using the genome‐wide complex trait analysis (GCTA) was used to explore the genetic structure and visualized in R (Pluess et al., 2016; RC, 2013). RAxML v8.2.10 (Stamatakis, 2014) was used to construct a maximum‐likelihood (ML) phylogeny of the 16 populations from unlinked SNPs using the ascertainment bias correction for the GTRGAMMA model (‐m ASC_GTRGAMMA, –asc‐corr = lewis) with 1,000 bootstrap replicates.

Demographic history and species distribution modeling

Effective population size (Ne) for each population was estimated using the molecular co‐ancestry method implemented in NEESTIMATOR 2.01 (Do et al., 2014). To explore the historical demography of M. dioica, we employed DIYABC v. 2.0 software based on an approximate Bayesian computation algorithm (ABC) (Cornuet et al., 2014). The 16 populations samples were divided into two groups (group N and group S) based on genetic structure, PCA, and phylogenetic relationships of M. dioica, and we only used a single SNP per locus and neutral SNPs and without missing data for the ABC simulation. Most populations are distributed in the northern Taihang Mountains, and we first tested four possible divergence scenarios to estimate whether M. dioica originated from the northern region (Figure S1). We employed a uniform prior probability and selected all summary statistics to generate a reference table based on 4 × 106 simulated datasets with 1% of the simulated datasets closest to the observed data to estimate the relative posterior probabilities for all scenarios. Based on our field observations and closely related species (Zhao & Gong, 2015), we set a conservative estimate for generation time to 5 years to estimate M. dioica demographic history. Additionally, we investigated demographic scenarios of changes in population size in two groups of M. dioica. Five models of population size changes were used with the same parameter settings strategy as above to choose the best fit demographic scenario and parameter estimation (Figure S1; Fan et al., 2018; Wang et al., 2016). Gene flow patterns among different populations were analyzed with setting the number of migration events from 1 to 3 in the TreeMix 1.11 software (Pickrell & Pritchard, 2012). To further test the effects of glaciation on M. dioica, species distribution modeling was generated with MAXENT 3.2 (Phillips et al., 2006) using occurrence data gathered from field collections and herbarium specimens. After eliminating highly correlated variables, six bioclimatic variables from WorldClim (http://www.worldclim.org) (Hijmans et al., 2005) were used as environmental predictors for species distribution modeling (SDM). The following two layers from the Model for the Community Climate System Model (CCSM, (Hasumi & Emori, 2004)) were employed for distribution modeling: the current layer and the last glacial maximum (LGM: c. 21 ka) layer. For evaluation, models were calibrated on 75% of the data and evaluated on the remaining 25% using area under the curve (AUC) and replicated 10 times.

Genomic variations and outliers related to climate‐driven local adaptation

To assess the correlation between geographic distance and genetic distance, we conducted isolation‐by‐distance (IBD) analyses using ARLEQUIN (Excoffier et al., 2005). We also tested the contribution of environmental differences to population differentiation (IBE) by comparing pairwise F ST values and environmental distances among populations using a Mantel test in GenAlEx 6.3 with 1,000 permutations (Peakall & Smouse, 2006). A total of 19 bioclimatic variables for all populations under current conditions were obtained from of WorldClim at 30‐arc seconds resolution (1960–1990, http://www.world clim.org). We first conducted a principal component analysis (PCA) to obtain the first two principal components of the 19 bioclimatic variables (Clim_PC1 and Clim_PC2), and these values were used as points in two dimensions to calculate a pairwise distance matrix (Pluess et al., 2016). To evaluate genomic variation and its association with geographic and environmental factors, redundancy analyses (RDA) were used to partition the variance of genomic variation explained by variables (Peres‐Neto et al., 2006) as implemented in the VEGAN 2.5.7 package in R (Oksanen, 2020). Our initial model was as follows: Y (individual genotype) ~ Latitude + Longitude + Clim_PC1 + Clim_PC2, and we also tested the significance of the estimated variance explained by all variables and separate variables with 999 permutations. Furthermore, a gradient forest (GF) analysis was performed using the R package gradientForest 0.1‐17 (Ellis et al., 2012), which investigated the relationships between genomic variables and the six irrelevant bioclimatic variables. To detect signatures of natural selection, we first used BayeScan 2.10 following the Bayesian‐likelihood method of reversible‐jump MCMC, which is based on a Dirichlet distribution decomposing locus‐population F coefficients into a population‐specific component (beta) (Foll & Gaggiotti, 2008). Moreover, a burn‐in of 50,000 iterations followed by 50,000 iterations was used for estimation using a thinning interval of 20. We then used BayPass 2.1 to identify climatic associations with all SNPs (Gautier, 2015) and to detect signature of adaptive selection associated with population‐specific covariates (Clim_PC1 and Clim_PC2). Consensus sequences of outlier loci were aligned to the available genome of sunflower, Helianthus annuus L. (Badouin et al., 2017) for annotation.

Optimization linear modeling of effective population size, expansion dynamics, and climatic factors

The current distribution of M. dioica in northern China may suggest the important role of climate variation or expansion history in demographic performance. We further tested for the influence of the demographic history and climate on effective population size (Ne) among populations. We divided the 19 climate factors into two types associated with energy and moisture factors and used the subsequent principal component (PC) axes to describe climatic variation. We estimated the date of colonization for each population (hereafter named population age). Bayesian reconstructions were performed for 16 M. dioica populations using MrBayes v3.2.6 under GTR+I+G model based on four makers (trnL‐F, matK, ndhF, and psbA‐trnH; accession number: MW380873–MW380901, MW464868–MW464895) (Ronquist & Huelsenbeck, 2003), with Pertya rigidula (Miq.) Makino selected as an outgroup based on Funk et al. (2014). In order to estimate divergence time among populations, molecular dating was performed in BEAST 1.6 using uncorrelated log‐normal (UCLN) relaxed‐clock model (Drummond & Rambaut, 2007). The root node was constrained using a normal prior distribution using secondary calibration based on the results of Funk et al. (2014). Sister populations sharing one node were identified as having a consistent colonization history. To quantify the contributions of both population age and climatic environment to variation in N for populations, we used a general linear model using Akaike information criterion in the R package STATS 3.0.2, with N as the dependent variable of population age, climatic factors, and their interactions as explanatory variables (RC, 2013).

RESULTS

RAD processing

After demultiplexing, and filtering low‐quality reads of 16 populations (Table S1 and Figure 1), we obtained an average of ~1.6 million reads per sample with detailed information presented in Table S2. Our Stacks pipeline with subsequent filtering generated 22,868 SNPs within M. dioica.

Population structure

Based on the 22,868 SNPs, the average within‐population nucleotide diversity (π) was 0.0347 ranging from 0.0303 to 0.0384 in populations JL and JW, respectively (Table 1). Our Pearson's product–moment correlation revealed that there was no correlation between the number of individuals and nucleotide diversity (r = −0.21, p = 0.44). Meanwhile, average expected heterozygosity (H E) and observed heterozygosity (H O) across all populations were 0.1433 and 0.1915 (Table 1). The average level of differentiation between populations as reflected by pairwise F ST values was 0.081 (FZ vs. HY) to 0.339 (HH vs. HX) indicating uneven genetic differentiation (Figure 2, Table S3). Analysis of all SNPs with admixture, ML phylogeny, and PCA revealed a well‐supported split at K = 2 corresponding to geography, and the populations included in the southern group (BA, GSJ, and HX; hereafter group S) and northern group (the rest populations; hereafter group N) were separated by the Taihang Mountains (Figures 1 and S2).
TABLE 1

Detailed summary statistics per population based on 22,868 restriction site‐associated DNA sequencing SNPs in Myripnois dioica

Population N Pi H O H E F IS N e (95% CI)Population age (Ma)
BA50.03460.140.10−0.033.3 (3.2–3.5)2.71
FZ70.03480.210.17−0.058.0 (7.6–8.5)1.38
GSJ60.03460.180.13−0.071.9 (1.8–1.9)1.02
HH30.03800.220.15−0.05Infinite0.41
HX50.03510.180.10−0.114.7 (3.9–4.3)1.02
HY40.03690.200.16−0.0315.6 (14.1–17.1)0.60
JC50.03780.190.15−0.057.0 (6.6–7.3)1.17
JH50.03420.210.16−0.063.3 (3.2–3.4)1.16
JL50.03030.220.17−0.064.6 (4.5–4.7)1.16
JW30.03840.220.15−0.0410.6 (9.7–11.5)0.41
MF50.03290.220.18−0.056.6 (6.3–6.9)3.63
QHD30.03410.200.16−0.01Infinite0.76
TL40.03560.210.16−0.0512.7 (11.7–13.8)1.49
XS50.03100.240.17−0.094.5 (4.4–4.7)2.38
YJP50.03200.220.17−0.045.6 (5.4–5.8)0.60
YX70.03440.210.17−0.063.4 (3.3–3.5)0.97

N, the number of individuals analyzed; Pi, nucleoid diversity; H O, observed heterozygosity; H E, expected heterozygosity; F IS, fixation index; N E (95% CI), effective population size estimates with 95% confidence intervals; population age, divergence time of each population.

FIGURE 2

Heatmap of pairwise values among 16 populations of Myripnois dioica based on 22,868 SNPs. Populations from the south and north groups are labeled by yellow and blue color

Detailed summary statistics per population based on 22,868 restriction site‐associated DNA sequencing SNPs in Myripnois dioica N, the number of individuals analyzed; Pi, nucleoid diversity; H O, observed heterozygosity; H E, expected heterozygosity; F IS, fixation index; N E (95% CI), effective population size estimates with 95% confidence intervals; population age, divergence time of each population. Heatmap of pairwise values among 16 populations of Myripnois dioica based on 22,868 SNPs. Populations from the south and north groups are labeled by yellow and blue color Among populations, N estimates ranged from 1.9 to 15.6 with the lowest and highest N detected in population GSJ and HY, respectively (Table 1). In southern populations, N estimates ranged from 1.9 to 4.7 (populations GSJ and HX), while in northern populations the values ranged from 3.3 to 15.6 (populations JH and HY). DIYABC estimations of the divergence history of M. dioica indicated that the scenario 3 had the higher posterior probability (posterior probabilities = 0.805, 95% CI: 0.579–1.000) (Figure S1 and Table S4). For demographic history, the best fit scenario for both group N and group S was Scenario 4 of M. dioica (Figures 3, S1 and Table S4). Group N was found to be the ancestral population and started to expand its distribution at c. 0.62 Ma (95% CI: 0.14–4.54 Ma), followed by a bottleneck at c. 0.033 Ma (95% CI: 0.003–0.094 Ma). Group S diverged from the ancestral population at c. 2.37 Ma (95% CI: 2.25–4.11 Ma), followed by expansion at 1.97 Ma (95% CI: 1.41–2.94 Ma) and a bottleneck at 0.046 Ma (95% CI: 0.029–0.097 Ma). Using TreeMix to infer migration patterns among different populations, we did not find significant gene flow between populations (not shown). Species distribution modeling revealed the optimal habitat for M. dioica has changed minimally during the LGM (Figure S3), and there are two main habitable areas located at 40°N and 37.5°N, respectively.
FIGURE 3

(a) The best ABC divergence model for Myripnois dioica based on diy‐abc analyses; (b) Demographic history of the two lineages under the best‐fitting ABC models. Times of population size changes are indicated by horizontal dashed lines

(a) The best ABC divergence model for Myripnois dioica based on diy‐abc analyses; (b) Demographic history of the two lineages under the best‐fitting ABC models. Times of population size changes are indicated by horizontal dashed lines

Genomic variation and outliers related to local adaptation

The IBD analyses indicated a significant correlation between geographic distance and genetic distance across populations (r = 0.33, p < 0.001) (Figure 4a). The first two principal components (Clim_PC1 and Clim_PC2) for all populations summarized 59.61% and 28.27% of the variation in the 19 climatic variables used in this study. A Mantel test among populations also determined significant correlation between genetic distance and environmental distance (r = 0.46, p < 0.001, Figure 4b). Our full RDA model indicated a significant role of geographic and environmental conditions in shaping the distribution of genotypes (p = 0.001; R = 0.16). The first two RDA axes were significant and explained 12.20% of the constrained variance (Table S5). When single variable effects were conditioned by other variables effect in partial RDA analysis, our results revealed a significant effect of Clim_PC1 and Latitude on genomic variations (Table S6). Gradient forest analyses indicated that the most two important predictors were temperature seasonality and precipitation seasonality (Figure 5). Our results indicated a shift temperature seasonality (standard deviation × 100) was 101 and precipitation of driest month was 5.5 mm (Figure 5).
FIGURE 4

Correlations between geography, environment, and genetic data. The correlation of mean pairwise geographic distance versus mean pairwise F ST (a), and correlation of mean pairwise environmental distance versus mean pairwise F ST (b)

FIGURE 5

(a–e) Cumulative importance of genotype change along the five environmental gradients; (f) R 2‐weighted importance of environmental variables that explain genetic gradients

Correlations between geography, environment, and genetic data. The correlation of mean pairwise geographic distance versus mean pairwise F ST (a), and correlation of mean pairwise environmental distance versus mean pairwise F ST (b) (a–e) Cumulative importance of genotype change along the five environmental gradients; (f) R 2‐weighted importance of environmental variables that explain genetic gradients A total of eight and 21 F ST outliers were identified in M. dioica using BayeScan and BayPass, respectively, with no overlapping outlier loci found between the two methods (Figures S4–S5). The BayPass analysis indicated a total of one and four loci significantly associated with Clim‐PC1 and Clim‐PC2, respectively (Figure S5). Based on the H. annuus genome, 22 loci associated with environment variables were successfully aligned to reference genome. A majority of the genes closest to the outlier loci were related to environmental stress response under drought tolerance, such as DREB and EAR motif protein 3, lipid transfer protein 3, and F‐box family protein (Table S6).

Joint effects of expansion dynamics and climatic factors on Ne

No association between the number of individuals sampled and N was observed (r = −0.50, p = 0.07). We independently employed climate factors and population age for predicting N in M. dioica populations. Our results showed no significant correlation between N and population age (r = −0.24, p = 0.41, Figure S6). Although principal component analysis revealed that the first two cumulative contribution rates of variance for bioclimatic factors explained the majority of the variance, no significant correlation was found between PC1 of all climate factors and N (r = −0.24, p = 0.42, Figure S6). In addition, there was no significant correlation between PC1 of energy/moisture and PC2 of energy/moisture to predict N (Figure S7). The above results suggest that a single factor within either population history or climatic factors could not predict N using a linear model. Thus, we further tested the multiple linear regression model of all the climate factors and population age. Due to the unevenness of the data, all variables were first normalized. Our stepwise regression showed the best fit linear model (r = 0.51, p = 0.05, AIC = 35.46, Table 2) included population age, mean temperature of coldest quarter, and precipitation of wettest month.
TABLE 2

The optimal general linear models of species factors predicting effective population size Ne

Model Ne ~ Bio11* + Bio13 + Population age
Multiple R‐squared0.51
Akaike information criterion35.46
p‐value0.05

“*” is corresponding to significant to Ne. Bio11 = mean temperature of coldest quarter; Bio13 = precipitation of wettest month.

The optimal general linear models of species factors predicting effective population size Ne “*” is corresponding to significant to Ne. Bio11 = mean temperature of coldest quarter; Bio13 = precipitation of wettest month.

DISCUSSION

Phylogeographic structure and demographic history

Our genomic data of M. dioica indicated a strong genetic population structure consisting of the northern and southern groups representing two distinct lineages geographically (Figure 1). Our best‐fitting ABC model identified the northern lineage as being ancestral and the southern group having been derived from it, which suggests that M. dioica originated north of the Taihang Mountains and then expanded south of the Taihang Mountains (Figure 3). The estimated divergence times for the southern and northern lineage occurring during the Pliocene to Pleistocene coincides with the intense uplift of the Taihang Mountains during the Late Pliocene to Pleistocene (Gong, 2010; Wu et al., 1999). With no detected migration events between the two groups, the division is inferred to be associated with restricted gene flow caused by long‐term isolation of geographic barrier. Visible mountains as barriers driving population subdivision and divergence have been reported in the Qinghai–Xizang Tibet Plateau and the Hengduan Mountains (Gao et al., 2003; Wu & Wu, 1996). The uplift of the Taihang Mountains during the Pliocene to Pleistocene, along with heterogeneous terrain and environmental change, undoubtedly promoted population divergence of plants in northern China flora, which has been indicated in previous results (Chai et al., 2020; Hou et al., 2014; Zhao et al., 2011). As phylogeographic information is useful in characterizing potential phylogeoregions, current genetic pattern of M. dioica and phylogeographical result in Opisthopappus suggests Taihang Mountains may be a potential phytogeographical boundary in Northern China (Chai et al., 2020). It is worth mentioning that we should be careful to estimate expansion and divergence time and population size changes from ABC model, as our data could not provide reliable distributions parameters and only provide a reference. The expansion–contraction model and in situ survival model are two alternative biogeographical models that have been applied to explain plant dynamics of the Quaternary period (Ni et al., 2010). When focusing on temperate forests in China, many studies have suggested the in situ survival model or the existence of multiple refugia in the northern regions during the Quaternary, such as in Ostryopsis davidiana Decne. (Tian et al., 2009) and Juglans mandshurica Maxim. (Bai et al., 2010). Our species distribution modeling revealed that no obvious southward contractions were found for M. dioica during the LGM, supporting the hypothesis of stable distribution by in situ survival. Analyses of the demographic history of the northern and southern group populations showed that both have experienced ancient expansions during the Pleistocene followed by bottlenecks during the recent warmer period (Figure 3). As affected by glaciation during the Pleistocene, temperature of the northern regions in China was significantly lower than those during the Pliocene and also became drier (Shi et al., 1999). This altered climate condition could be conducive to M. dioica, a cold and dry adapted species. Consequently, both geological events and climate oscillations contributed to phylogeographic pattern of M. dioica.

Genomic variations and climate‐driven adaptation

Our results based on IBE analyses indicated a significant effect of climatic factors on genetic differentiation of M. dioica, highlighting the contributions of environmental variables to the landscape genomics. Meanwhile, our RDA results indicated that the environment explained more genomic variations than geography, which is consistent with previous results (Orsini et al., 2013; Sexton et al., 2014). This also suggests that adaptation to the environment plays a key role in shaping plant divergence in the flora of northern China, and we used a landscape genomics approach to detect candidate genes for local adaption. A total of 22 SNPs were annotated to associate with climatic variables and a high proportion of located genes (over 50%) are involved in abiotic stress response of drought. Similar results have been presented in previous physiological studies in which M. dioica showed superior drought resistance of the root system (Dai et al., 2014; Xu et al., 2007). Among the outlier loci, we found genes involved in regulation of reduced lateral root formation. Plants decrease the metabolic cost of soil exploration, improving water acquisition and plant growth, which has been demonstrated to improve drought tolerance in Zea mays L. (Poaceae) and Oryza glaberrima Steud. (Poaceae) (Inukai et al., 2005; Lynch, 2013). Furthermore, we also detected loci in the super family of F‐box proteins and DREB, for which many studies have tested significant associations with drought in varied plant species (Ren et al., 2019; Zhang et al., 2008; Zhou et al., 2014). Consequently, current results based on M. dioica verified that drought stress is important and a dominant driver for local adaptation of plants in the flora of northern China, and M. dioica has undergone adaptive changes to better cope with an arid environment. Meanwhile, temperature seasonality and precipitation seasonality served as the most two crucial factors for the genomic variation of M. dioica among all the temperature and precipitation factors tested, and genetic composition shifts occurred when temperature seasonality was 101 and precipitation of driest month was 5.5 mm, which likely restricted the distribution margin of M. dioica.

Factors shaping limited effective population size and implications for conservation

To find evidence of the effects of both population history and climatic environment on populations evolution, we established relationships between effective population size across 16 populations and investigated factors being important for shaping evolutionary outcomes. Our results failed to indicate a significant positive correlation between a single factor (i.e., population age and expansion or climatic factors) and N, which suggests that population history or climatic factors alone cannot predict the evolution of populations (Figure 5). Nevertheless, our multiple regression analysis showed that historical and environmental factors have complementary roles in explaining the evolutionary patterns of M. dioica (Table 2). First, we recognized population age of M. dioica to be one of the direct contributing factors for estimating population change (Braasch et al., 2019; Excoffier & Ray, 2008). Meanwhile, mean temperature of coldest quarter and precipitation of wettest month were shown to be important factors for population change in M. dioica inferring tolerance to coldness and drought allowing the species distribution in northern China (Xie & Zhang, 2012). Our current genome‐based linear model only offers a qualitative reference for predicting the population change of M. dioica and this specialized species evolved due to both population evolutionary history and climatic variables. As our genome‐based result offers a qualitative guideline for predicting population change, it is easy to screen factors directly related to population evolution. This will be highly effective to providing guidance for endemic plant of narrow area. Although M. dioica are distributed throughout northern China, it is particularly troubling that many specimen locations from previous records cannot be found, and only a very few individuals are left in some places during our filed investigations. In this case, M. dioica may be currently under threat and it is necessary to take measures in advance to prevent further disappearance. Myripnois dioica is usually distributed in disturbed, open habitats, occurring on slopes of limestone, and even along the roadside. Human activities have caused species diversity loss by habitat destruction, suggesting that M. dioica habitats are under constant threat with some populations already having been extirpated (Chittibabu & Parthasarathy, 2000; Fang et al., 2018). Meanwhile, the clear result in the current study is the distinct isolation between southern and northern lineages which are almost certainly not interconnected by gene flow. Given the distinct two lineages among M. dioica populations which have adapted to the local ecology, overpromoting gene flow between populations may lead to outbreeding depression and decreased fitness. A recent study suggested that intraspecific gene flow between edge populations potentially facilitates population conservation (Hannah et al., 2014). In addition to placing attention to populations around the genetic hot spots, we suggest a background survey of wild populations of M. dioica including the total number of populations, genetic differentiation between populations, intermediate/edge populations should be implemented across its entire range. Combined with our predicted evolution factors, these will help to adequately address accurate conservation strategies of M. dioica with significant reference to other species responding to climatic change and conservation management.

AUTHOR CONTRIBUTIONS

Nan Lin: Data curation (lead); Formal analysis (lead); Investigation (lead); Methodology (lead); Software (lead); Writing‐original draft (lead). Jacob B. Landis: Formal analysis (supporting); Methodology (supporting); Software (supporting); Writing‐review & editing (supporting). Yanxia Sun: Methodology (supporting); Software (supporting). Xianhan Huang: Methodology (supporting); Software (supporting). Xu Zhang: Investigation (supporting); Resources (supporting). Qun Liu: Investigation (supporting); Resources (supporting). Huajie Zhang: Investigation (supporting); Resources (supporting). Hang Sun: Conceptualization (equal); Funding acquisition (equal); Supervision (equal); Writing‐review & editing (equal). Hengchang Wang: Conceptualization (equal); Supervision (equal); Writing‐review & editing (equal). Tao Deng: Conceptualization (equal); Funding acquisition (equal); Supervision (equal); Writing‐review & editing (equal). Supplementary Material Click here for additional data file.
  55 in total

1.  Gradient forests: calculating importance gradients on physical predictors.

Authors:  Nick Ellis; Stephen J Smith; C Roland Pitcher
Journal:  Ecology       Date:  2012-01       Impact factor: 5.499

2.  Genome-Wide Scan for Adaptive Divergence and Association with Population-Specific Covariates.

Authors:  Mathieu Gautier
Journal:  Genetics       Date:  2015-10-19       Impact factor: 4.562

3.  Variation partitioning of species data matrices: estimation and comparison of fractions.

Authors:  Pedro R Peres-Neto; Pierre Legendre; Stéphane Dray; Daniel Borcard
Journal:  Ecology       Date:  2006-10       Impact factor: 5.499

4.  Genetic isolation by environment or distance: which pattern of gene flow is most common?

Authors:  Jason P Sexton; Sandra B Hangartner; Ary A Hoffmann
Journal:  Evolution       Date:  2013-09-23       Impact factor: 3.694

5.  Integrating landscape genomics and spatially explicit approaches to detect loci under selection in clinal populations.

Authors:  Matthew R Jones; Brenna R Forester; Ashley I Teufel; Rachael V Adams; Daniel N Anstett; Betsy A Goodrich; Erin L Landguth; Stéphane Joost; Stéphanie Manel
Journal:  Evolution       Date:  2013-09-04       Impact factor: 3.694

6.  The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution.

Authors:  Hélène Badouin; Jérôme Gouzy; Christopher J Grassa; Florent Murat; S Evan Staton; Ludovic Cottret; Christine Lelandais-Brière; Gregory L Owens; Sébastien Carrère; Baptiste Mayjonade; Ludovic Legrand; Navdeep Gill; Nolan C Kane; John E Bowers; Sariel Hubner; Arnaud Bellec; Aurélie Bérard; Hélène Bergès; Nicolas Blanchet; Marie-Claude Boniface; Dominique Brunel; Olivier Catrice; Nadia Chaidir; Clotilde Claudel; Cécile Donnadieu; Thomas Faraut; Ghislain Fievet; Nicolas Helmstetter; Matthew King; Steven J Knapp; Zhao Lai; Marie-Christine Le Paslier; Yannick Lippi; Lolita Lorenzon; Jennifer R Mandel; Gwenola Marage; Gwenaëlle Marchand; Elodie Marquand; Emmanuelle Bret-Mestries; Evan Morien; Savithri Nambeesan; Thuy Nguyen; Prune Pegot-Espagnet; Nicolas Pouilly; Frances Raftis; Erika Sallet; Thomas Schiex; Justine Thomas; Céline Vandecasteele; Didier Varès; Felicity Vear; Sonia Vautrin; Martin Crespi; Brigitte Mangin; John M Burke; Jérôme Salse; Stéphane Muños; Patrick Vincourt; Loren H Rieseberg; Nicolas B Langlade
Journal:  Nature       Date:  2017-05-22       Impact factor: 49.962

7.  BEAST: Bayesian evolutionary analysis by sampling trees.

Authors:  Alexei J Drummond; Andrew Rambaut
Journal:  BMC Evol Biol       Date:  2007-11-08       Impact factor: 3.260

8.  RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

9.  Genetic Divergence and Relationship Among Opisthopappus Species Identified by Development of EST-SSR Markers.

Authors:  Min Chai; Hang Ye; Zhi Wang; Yuancheng Zhou; Jiahui Wu; Yue Gao; Wei Han; En Zang; Hao Zhang; Wenming Ru; Genlou Sun; Yling Wang
Journal:  Front Genet       Date:  2020-02-28       Impact factor: 4.599

10.  Demographic history and genetic differentiation of an endemic and endangered Ulmus lamellosa (Ulmus).

Authors:  Huimin Hou; Hang Ye; Zhi Wang; Jiahui Wu; Yue Gao; Wei Han; Dongchen Na; Genlou Sun; Yiling Wang
Journal:  BMC Plant Biol       Date:  2020-11-17       Impact factor: 4.215

View more
  1 in total

1.  Genomic Data Reveals Profound Genetic Structure and Multiple Glacial Refugia in Lonicera oblata (Caprifoliaceae), a Threatened Montane Shrub Endemic to North China.

Authors:  Xian-Yun Mu; Yuan-Mi Wu; Xue-Li Shen; Ling Tong; Feng-Wei Lei; Xiao-Fei Xia; Yu Ning
Journal:  Front Plant Sci       Date:  2022-05-09       Impact factor: 6.627

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.