Literature DB >> 30210523

Effects of Geological and Environmental Events on the Diversity and Genetic Divergence of Four Closely Related Pines: Pinus koraiensis, P. armandii, P. griffithii, and P. pumila.

Yun Jia1, Juan Zhu1, Ying Wu1, Wei-Bing Fan1, Gui-Fang Zhao1, Zhong-Hu Li1.   

Abstract

The effects of mountain uplift and environmental oscillations on nucleotide variability and species divergence remain largely unknown in East Asia. In this study, based on multiple nuclear DNA markers, we investigated the levels and patterns of nucleotide diversity and interspecific divergence in four closely related pines in China, i.e., Pinus koraiensis, P. armandii, P. griffithii, and P. pumila. The four pine taxa shared low levels of nucleotide polymorphisms at the species level. P. pumila had the highest silent nucleotide diversity (πsil = 0.00661) whereas P. griffithii had the lowest (πsil = 0.00175), while the levels of genetic polymorphism in P. armandii (πsil = 0.00508) and P. koraiensis (πsil = 0.00652) were intermediate between the other two species. Population genetic structure analysis showed that variations primarily existed within populations of the four pine species, presumably due to habitat fragmentation or the island-like distributions of Pinus species. Population divergence (FST) analysis showed that the genetic divergence between P. griffithii and P. koraiensis was much greater than that between P. koraiensis and the other two pines species. Isolation-with-migration analysis suggested that asymmetric gene flow had occurred between any two pairs of pine species. Phylogenetic analyses indicated that the four allied species split into two groups about 1.37 million years ago, where P. armandii and P. pumila were closer and clustered as sister species, whereas P. koraiensis and P. griffithii were clustered on another branch. Our results and those obtained in previous studies suggest that mountain uplift and geological climate oscillations may have led to the patterns of genetic divergence and nucleotide variations in these four pine species.

Entities:  

Keywords:  Pinus armandii; Pinus griffithii; Pinus koraiensis; Pinus pumila; genetic divergence; nucleotide polymorphism

Year:  2018        PMID: 30210523      PMCID: PMC6121107          DOI: 10.3389/fpls.2018.01264

Source DB:  PubMed          Journal:  Front Plant Sci        ISSN: 1664-462X            Impact factor:   5.753


Introduction

Nucleotide diversity levels within populations and spatial patterns, as well as species divergence are of great importance in the field of evolutionary biology (Coyne and Orr, 2004; Hao et al., 2015; Ortego et al., 2015). Mountain uplift and past environmental oscillations may have been largely responsible for shaping the spatial patterns of diversity and genetic divergence among species (Coyne and Orr, 2004; Wachowiak et al., 2009). In general, the level and distribution of nucleotide diversity are historical products of the long-term evolution of a species, and they are largely associated with the evolutionary potential or future fate of a species (Wright and Gaut, 2005; Wachowiak et al., 2009; Zhou et al., 2014; Tsuda et al., 2017). In addition, ecological or proximal causes (e.g., mating systems) and various barriers (e.g., geographic and spatio-temporal isolation) due to geological history can cause fragmented of species distributions, which may lead to reduced gene flow between isolated populations and adaptability. This process initiates allopatric divergence, and local adaptation can ultimately drive populations toward speciation and change evolutionary processes (Cutter and Gray, 2016; Ren et al., 2017). Conifers are anemophilous and outcrossing (Fu et al., 1999). They are mainly characterized by long life cycles, large effective population sizes, incomplete lineage sorting, and extensive introgression/hybridization among populations, which makes their genetic structure and spatial patterns of diversity very different from those found in traditional model plants (Neale and Kremer, 2011; Gao et al., 2012; Li et al., 2012, 2013; Hao et al., 2015). For instance, conifers tend to share haplotypes/genotypes among species, with no distinct genetic divergence across species ranges, and most of the genetic variations are found within populations (Willyard et al., 2009; Chen et al., 2010; Ren et al., 2012; Liu et al., 2014; Zhou et al., 2014). In recent years, many studies have determined nucleotide polymorphisms and speciation history patterns using multiple nuclear loci (Ma et al., 2006; Li et al., 2010; Gao et al., 2012; Wachowiak et al., 2013; Zhou et al., 2014; Zou et al., 2016). These biparentally inherited nuclear genes are functional genes that encode proteins and they are characterized by their orthology, moderate to high rates of evolution, and the presence of many phylogenetically informative sites (Zhou et al., 2014). Therefore, large numbers of nuclear markers can be used to detect the deep evolutionary relationships among closely related species, especially recently diverged taxa (Chen et al., 2010; Zou et al., 2016). In this study, we employed nucleotide polymorphisms as well as the population structure and speciation history to explore the relationships among four Pinus species. Four related Pinus species in subsection Strobus occur in East Asia: P. armandii Franch., P. koraiensis Sieb. et Zucc., P. griffithii McClelland, and P. pumila (Pall.) Regel. These species share some common features, such as possessing five needle leaves in a bundle. There are obvious differences among these species in terms of their ecological niche, natural geographic distribution, morphology, wood anatomy, and cytology (Wu and Feng, 1995). P. armandii, P. pumila, and P. koraiensis occur according to the changes in the hydrothermal conditions, and P. griffithii is distributed on the China–Nepal and China–Bhutan borders (Supplementary Figure ; Wu and Feng, 1995). The distributions of these pines also increase successively from low to high altitudes, where P. koraiensis occurs at altitudes of 150–1,800 m and P. pumila always forms copses with other coniferous trees on mountain tops at altitudes of 1,000–2,300 m. P. armandii usually occurs in pure forest or mixed forest at altitudes of 1,000–3,300 m, and P. griffithii is distributed in the same manner at altitudes of 1,600–3,300 m on the Qinghai–Tibet Plateau and Mount Everest. According to the classic categorization of Pinus sect. Strobus, P. pumila is categorized into the P. koraiensis taxon and P. armandii into the P. griffithii group (Fu et al., 1999). A study of the divergence of the resin ducts in Pinus sect. Strobus suggested that P. armandii is the most primitive species and its southward spread gave rise to P. griffithii, whereas its northward spread gave rise to P. pumila and P. griffithii (Peng, 1999). In recent years, several studies based on plastid molecular markers have shown that P. koraiensis and P. pumila are most closely related to each other (Peng, 1999; Wang et al., 2016). However, the accurate phylogenetic relationships and interspecific divergence among these related pine species is still controversial due to the limited availability of morphological and molecular biological evidence (Peng, 1999; Liu et al., 2014; Hao et al., 2015). In addition, some studies found low variations in DNA barcodes, such as rbcL and matK, among related species due to low levels of cpDNA diversity and genetic divergence (Syring et al., 2007; Li et al., 2015). Moreover, frequent interspecific introgression and hybridization among species have important effects on their genetic diversity levels, especially in parapatric or allopatric of species in China. Thus, in the current study, we used multiple nuclear genes to investigate the genetic diversity and divergence in four closely related pine species comprising P. koraiensis, P. armandii, P. griffithii, and P. pumila. We specifically addressed the following two questions. (1) How is the level and pattern of population divergence among the four pines species? (2) How is the pattern of gene flow and interspecific introgression between these closely related species in East Asia?

Materials and Methods

Population Sampling

To accurately determine the nucleotide diversity and interspecific relationships among pines, we sampled 216 individuals from 16 allopatric populations of the four pine species (Figure ). The distance between any two trees of the same species was at least 50 m (Supplementary Table ). We isolated the haploid megagametophyte from each of the sampled trees. (A) Geographical distribution of the sampled populations of the four closely related species: Pinus pumila (red), P. griffithii (green), P. koraiensis (yellow), and P. armandii (blue). Color scales indicate different altitudes. (B) Bayesian clustering analysis to determine the population structure of the four pine species. Red, green, and blue represent the dominant clusters (K = 3) identified by STRUCTURE in each population.

DNA Extraction, PCR Amplification, and Sequencing

Total genomic DNA was extracted from the megagametophyte samples for each individual using the modified CTAB method (Doyle and Doyle, 1987). In preliminary studies, about 40 nuclear gene loci were screened for cross-amplification in the four pine species (Ma et al., 2006; Eckert et al., 2013). Finally, six polymorphic loci (1_1609_01, CL1694, PTIFG2009, 0_12929_02, 0_14221_01, and 0_1688_02) associated with protein kinase family protein, serine-tRNA ligase, and leucine-rich repeat family protein were selected for subsequent sequence amplification and analysis (Supplementary Table ). PCR amplification was conducted in a volume of 25 μL with a DNA concentration of 10–40 ng/μL, 50 mM of Tris-HCl, 0.5 mM of each dNTP, l.5 mM of MgCl2, 2 μM of each primer, and 0.75 U of Ex Taq DNA polymerase (Runde, Xi’an, China). The PCR program comprised initial denaturation at 94°C for 5 min, followed by 35 cycles for 1 min at 94°C, at a specific annealing temperature (53–60°C, see Supplementary Table for details) for 1.5 min and extension for 1 min at 72°C, and a final extension for 10 min at 72°C. Primer synthesis and sequencing of the PCR products were performed by Shanghai Biological Engineering Co. Ltd. (Shanghai, China). Sequencing was conducted using both forward and reverse primers for each gene (Supplementary Table ) on an ABI Prism 3730xl sequencer (Applied Biosystems, Foster City, CA, United States).

Data Analysis

Data Reconciliation

Sequences were aligned and manually adjusted with Chromas and MEGA5.0 (Librado and Rozas, 2009; Tamura et al., 2011) to correct random errors generated by sequencing.

Nucleotide Diversity and Neutral Tests

The genetic diversity parameters for the four pine species were calculated using DnaSP v. 5.10 (Librado and Rozas, 2009), including the number of segregation sites S, Watterson parameters θw (Watterson, 1975), total nucleotide polymorphisms πt (Li and Nei, 1975), nucleotide diversities of non-synonymous sites and silent loci (synonymous sites and non-coding positions), πa and πsil, number of haplotypes Nh and haplotype diversity Hd (Nei and Tajima, 1981; Fu, 1997; Depaulis and Veuille, 1998; Depaulis et al., 2001), and intragenic minimum recombination events (RM) (Hudson and Kaplan, 1985). In addition, in order to accurately detect departure from the neutral model of molecular evolution at each locus, the neutral equilibrium was tested for various parameters using Tajima’s D (Tajima, 1989), Fu and Li’s D and F (Fu and Li, 1993), and the standardized Fay and Wu’s H (Fay and Wu, 2000). Tajima’s D measures the standardized difference between π and θW, whereas Fay and Wu’s H measures the difference between π and θH. The former is more sensitive to an excess of rare variants whereas the latter is more sensitive to an excess of high-frequency-derived variants. Both D and H are expected to be zero under the standard neutral model (Zou et al., 2013). We also conducted maximum frequency of derived mutations (MFDM) tests to examine the likelihood of natural selection acting on individual loci at species levels. The MFDM tests exclude the confounding effects of demography completely when detecting recent positive selection (Li, 2011). In practice, a single DNA fragment (i.e., a locus) may have a short length and only contain a few RM. The MFDM v. 1.1 test always depends on the estimate of RM (Li, 2011).

Genetic Divergence and Population Structure

The sources of genetic variation among the four species (group), populations and individuals were analyzed by analysis of molecular variance (AMOVA) with ARLEQUIN v.3.11 (Excoffier et al., 2005). We estimated F statistics hierarchically, both among species (FCT) and among populations within species (FST). FST (Wright, 1949) is a coancestry statistic that provides the variance within populations relative to the total population. We used NETWORK v. 4.6.1.3 (Bandelt et al., 1999) to construct phylogenetic relationships based on the haplotypes of each species at the six loci (gaps were excluded). In addition, genetic clustering based on individuals was estimated by Bayesian clustering using the STRUCTURE V.2.3 program (Hubisz et al., 2009). To estimate the number of clusters (K) in the data, K values from 1 to 16 were explored using 10 independent runs per K and an admixture model. As described in previous studies, in order to generate a reliable estimate of the optimal K, the burn-in was set to at least 200,000 and the run length was at least 500,000 (e.g., Zou et al., 2013; Zhou et al., 2014; Tsuda et al., 2015). We also utilized the STRUCTURE HARVESTER program to estimate the most likely number (K) of genetic clusters (Evanno et al., 2005; Earl and vonHoldt, 2012).

Reconstruction of Historical Dynamics and Species Relationship

The migration rates, effective population sizes, and population split times were calculated based on the isolation-with-migration (IM) model using the IMa2 program to infer the population history dynamics of the four pine species (Nielsen and Wakeley, 2001; Hey, 2006, 2010; Kuhner, 2009). We analyzed sibling species in a pairwise manner using a basic two-population model. We extracted the largest region with no recombination for each of the six nuclear loci. Functions of the model parameters were estimated in the M-mode based on 1 × 106 Monte Carlo Markov chain (MCMC) steps following 5 × 105 burn-in periods in order to obtain reliable estimates (i.e., similar posterior distributions for the parameter), and the effective sample size for each parameter was at least 200. The divergence time between species was estimated based on a mean mutation rate of μ = 4.875 × 10−9 (per site per generation), and the generation time for pines was assumed to be 25 years (Ma et al., 2006). In addition, we constructed the phylogenetic relationships among the four pine species using ∗BEASTv1.8.0 (Heled and Drummond, 2010). The species tree was computed using the six nuclear genes sequenced for the sampled species. We selected a Yule model as the species tree prior, a constant population size, and relaxed lognormal clock models for all nuclear loci (Heled, 2012). Pinus bungeana was used as outgroup. We ran the MCMC analysis for one billion generations with sampling every 50,000 generations. Two independent runs were conducted. Tracer[1] v1.5 (Rambaut and Drummond, 2009) was used to assess the convergence of chains to the stationary distribution (effective sample size >200). After discarding the first 2,500 trees as a burn-in, the remaining trees were summarized in a maximum clade credibility tree with the TreeAnnotator v1.8.0 program (Drummond and Rambaut, 2007). Joint Bayesian species delimitation and species tree estimation were also analyzed using the BPP v3.4 program based on the multispecies coalescent model (Yang, 2015). In addition, phylogenetic relationships based on nuclear haplotypes were reconstructed with the maximum likelihood (ML) model using PAUP 4.0 (Swofford, 2002). The ML analysis employed the HKY substitution model, where support values for the nodes were estimated based on 1,000 bootstrap replicates.

Results

Nucleotide Polymorphisms

For all nuclear loci, P. griffithii had the lowest estimates for the total average segregating sites and average values of the segregating sites in silent sites compared with the other three species (Table ). P. griffithii had two singleton mutation sites in PTIFG2009. The numbers of shared polymorphisms (SS) were similar among the four closely related species and the numbers of fixed differences (Sf) were low. The differences in the polymorphisms between P. pumila and P. griffithii were mainly due to the 0_14221_01 gene locus, with 18 polymorphic sites in P. pumila but only three in P. griffithii (Supplementary Table ). Similarly, the differences between P. koraiensis and P. pumila were mainly due to the CL1694 locus, with 20 polymorphic sites in the former but only five in the latter. The 0_14221_01 and 0_12929_02 loci accounted for the observed differences between P. armandii and P. pumila. Nucleotide variations in four Pinus species: Pinus pumila, P. griffithii, P. koraiensis, and P. armandii.

Neutrality Tests

Positive Fu and Li’s D∗ and Fu and Li’s F∗ values were estimated for most loci, although most of these values were not significant in each species (Table ). The mean Tajima’s D (D) values were negative for P. pumila (−0.236) and P. griffithii (−0.030), but positive for P. koraiensis (0.274) and P. armandii (0.254) (Table ). In addition, the mean Fay and Wu’s H (H) values were negative for P. koraiensis and P. griffithii but positive for P. pumila and P. armandii (Table ). However, with the exception of locus PtIFG2009 (P = 0.04674) in P. pumila, no significant deviation from neutrality were detected for the six loci using the MFDM test (Supplementary Table ). The MFDM test detected slight deviation from the standard neutral model at the PtIFG2009 locus in the four pine species by considering genetic recombination (P < 0.05). Haplotype diversity and neutrality tests for Pinus pumila, P. griffithii, P. koraiensis, and P. armandii: number of haplotypes (Nh), haplotype diversity (Hd), Tajima’s D (D), Fu and Li’s D∗ (D∗), Fu and Li’s F∗ (F∗), and Fay and Wu’s H (H).

Population Genetic Structure

Within the four species, the 0_12929_02 locus had the highest genetic divergence among populations (FST = 0.714, P < 0.001), whereas the 1_1609_01 locus had the lowest genetic divergence among populations (FST = 0.085, P < 0.001) (Supplementary Table ). The population genetic divergence was also significant across all loci (FST = 0.624, P < 0.001). It should be noted that FST was much higher for the overall loci than interspecific genetic differentiation (FCT) except for the 0_12929_02 locus (Supplementary Table ). Between pairs of species, FST varied from 0.008 to 1.000 (Supplementary Table ). The divergence among the four pine species at the six nuclear loci was also supported by Bayesian clustering analysis (Figure ). The most likely number of clusters for the entire dataset was K = 3 (Supplementary Figure ). P. pumila and P armandii individuals were separated into two groups that corresponded to their respective species, whereas the majority of P. koraiensis and P. griffithii individuals were assigned to another cluster. Remarkable levels of gene flow and gene introgression were apparent between P. koraiensis and P. armandii (Figure and Supplementary Figure ).

Genealogy of Each Locus

The average number of haplotypes (Nh) and haplotype diversity (Hd) were much higher in P. pumila (Nh = 12.167, Hd = 0.822) and P. armandii (Nh = 10, Hd = 0.781) than P. koraiensis (Nh = 7.167, Hd = 0.655) and P. griffithii (Nh = 4.167, Hd = 0.420) (Table ). The haplotypes in the center of the network were shared (Figure ), but most haplotypes were exclusive to specific species at the five loci (1_1609_01, 0_1688_02, PTIFG2009, 0_14221_01, and CL1694). In addition, there was no shared haplotype at the 0_12929_02 locus. According to the ΦST values for all loci among the species in Supplementary Table , each two Pinus species exhibited significant genetic divergence, where the highest ΦST value (0.62006) was found between P. koraiensis and P. griffithii, whereas the divergence between P. pumila and P. griffithii was lowest (ΦST = 0.18941). Networks obtained for the six nuclear genes in the four species comprising Pinus pumila (red), P. griffithii (green), P. koraiensis (yellow), and P. armandii (blue). Each sector of a circle corresponds to the frequency of the haplotype for each species.

Evolutionary Relationships Among the Four Species

The mean divergence time between P. pumila and P. armandii was estimated at 1.13 million years ago (Mya). A younger divergence time (0.319 Mya) was estimated between P. griffithii and P. koraiensis (Table ). In addition, we found asymmetric historical gene flow between pairs of species. In particular, the migration rate from P. pumila to P. griffithii was 2.0450, with 0.0005 in the reverse direction (Table ). Pinus griffithii and P. koraiensis had smaller population sizes (0.0918–0.1846 and 0.3264–0.5369, respectively; Table ) than P. pumila and P. armandii (0.3738–0.8326 and 0.4087–0.7019, respectively; Table ). The species tree analyses demonstrated that the relationship was closer between P. armandii and P. pumila where they clustered as sister groups, whereas P. koraiensis and P. griffithii were located in another clade. The four species split into two groups about 1.37 Mya (Figure ). Moreover, we obtained the best species-tree model using BPP v3.4 software, where the posterior probability of the species tree was one and the acceptance proportion was near to zero (0.025) based on multiple runs (Supplementary Figure ). The topology of the tree was consistent with the results obtained by ∗BEAST (Figure and Supplementary Figure ). Maximum-likelihood estimates and 90% highest posterior density (HPD) intervals for demographic parameters obtained from pairwise IM multilocus analyses. Divergence times and phylogenetic relationships among four pine species. The tree was constructed based on six nuclear genes using ∗BEAST. The tree was rooted with Pinus bungeana. The numbers on the branches indicate the corresponding posterior probabilities values, mean divergence dates, and 95% credibility interval.

Discussion

Nucleotide Diversity

The nucleotide diversity at silent sites basically agreed with the neutral model of molecular evolution (Ma et al., 2006; Wachowiak et al., 2016). We detected low levels of silent polymorphisms in the four closely related pines species, because the average values were much lower than the average polymorphism for most conifers at multiple nuclear genes (πsil = 0.0029–0.0122) (Ma et al., 2006; Li et al., 2012). Among the four species, P. pumila had the highest silent nucleotide diversity (πsil = 0.00661), whereas P. griffithii had the lowest (πsil = 0.00175). However, these diversity values were much lower than those found in other Pinus species, such as P. densata and P. yunnanensis (Ma et al., 2006). Factors such as the nuclear gene loci selected in the study, sample size variations, mutation rates within species, demographic effects, and natural selection can influence the nucleotide diversity levels and patterns in species or populations (Loveless and Hamrick, 1984; Lande, 1988; Li et al., 2012; Zou et al., 2013). The four related species and six nuclear loci investigated in our research have also been studied previously, and thus these nuclear loci were not the main cause of the low levels of nucleotide polymorphisms. However, the unequal sample sizes for different genes and species may have caused differences in the nucleotide variability among species. To verify this bias, we detected the nucleotide diversity parameters based on the same sample sizes for each gene from each Pinus species. The results showed that there were significant differences in the levels of diversity among different species compared with the previous estimates (Table and Supplementary Table ). We concluded that the levels of nucleotide variability among species were significantly associated with the samples sizes of the pine species. Similar differences in the patterns of diversity have also been detected in some other gymnosperm species (Ma et al., 2006; Du et al., 2009; Zou et al., 2013; Zhou et al., 2014). In addition, the long life cycles and low mutation rates in conifers may explain the low levels of nucleotide polymorphisms in P. pumila, P. griffithii, P. koraiensis, and P. armandii. Moreover, high levels of linkage disequilibrium were detected and some species deviated from neutrality according to the tests conducted in our study. This was particularly evident at one locus, i.e., PtIFG2009, which suggests that this locus might have undergone selection or population shrinkage according to the results obtained from the Tajima’s D and MFDM tests. In particular, for the populations of P. griffithii and P. koraiensis, Tajima’s D was positive, and Fay and Wu’s H was negative, which is a pattern that is consistent with a recent bottleneck. In addition, P. griffithii and P. pumila descendant populations had a somewhat smaller size than the ancestral population (Table ), and thus it is possible that the populations have experienced from genetic bottlenecks. The population dynamics due to geological isolation and climatic oscillations probably contributed to their relatively lower diversity (Chen et al., 2017). In addition, the mean Tajima’s D (D) and mean Fay and Wu’s H (H) values were negative but close to zero for P. armandii at the PtIFG2009 locus, which may indicate a neutral equilibrium (Holliday et al., 2010). The numbers of nucleotide polymorphism were higher in P. pumila, P. koraiensis, and P. armandii than P. griffithii. These results can partly be explained by the fact that their seeds are food for nutcrackers and rodents such as squirrels. These animals may screen the seeds and transport them over long or short distances for secondary storage and dispersal, thereby also enhancing the spread of the seeds, and this may affect their genetic differences (Li et al., 2007; Fan and Jin, 2011). The low level of nucleotide polymorphism in P. griffithii may be explained by its small geographic distribution compared with more common and widespread species, because of drift, founder events, and other stochastic processes (Cole, 2003; Chen et al., 2017). In addition, P. pumila is well known because of its larger island-like distribution and it rarely develops into pure forest, thereby accounting for its higher diversity compared with the other three species (Qiu et al., 2007; Chen et al., 2017).

Interspecific Gene Flow and Species Divergence

Analysis of molecular variance detected remarkable divergence in the four Pinus species where the variations were mainly within populations, which agreed with the small differences among populations of wind-pollinated gymnosperms (Supplementary Table ). However, we also found a high level of genetic divergence within groups, possibly due to habitat fragmentation or the island-like distribution of the four species, particularly when considering that the habitats of P. pumila and P. griffithii are harsher than those of the other two species. The former often grows in barren soil on bare rocky peaks. This type of habitat is vulnerable to fragmentation but mountains and ravines may partly hinder the gene flow between populations, thereby leading to the isolation of groups. Genetic divergence was found among the four species, although some degree of gene flow and introgression was detected. In particular, populations of P. koraiensis had a mosaic-like pattern and they were further subdivided into independent sub-clusters when K = 4, which suggests that a high level of introgression in this species. The pairwise migration rate between P. koraiensis and P. pumila was relatively high compared with that between P. koraiensis and P. armandii. In addition, P. pumila and P. koraiensis had a relatively limited distribution in the northwest and northeast of North China, and there was no clear phylogenetic resolution among P. pumila, P. koraiensis, and P. armandii based on DNA fragments from the chloroplast, mitochondrial, and nuclear genomes according to previous phylogenetic studies (Liu et al., 2014; Wang and Wang, 2014; Hao et al., 2015). These results suggest that migration may have led to a sympatric distribution in addition to the existing incomplete reproductive isolation. The phylogenetic relationships determined based on ML and NETWORK analysis also showed that the shared haplotypes were located in the center of the topological structure (Figure and Supplementary Figure ), and thus incomplete genealogical screening based on a large effective population of Pinus may have led to the sharing of ancestral polymorphisms. However, there were no shared haplotypes based on the 0_12929_02 locus, and the interspecies variation was similar to the genetic differentiation among populations (FCT = 0.718, FST = 0.714; P < 0.001; Supplementary Table ). In general, different nuclear loci have different evolutionary rates and molecular functions (Li et al., 2012; Eckert et al., 2013; Zhou et al., 2014). Previous studies have shown that the 0_12929_02 locus is associated with the protein kinase family and that it has been under selection (Eckert et al., 2013). The rapid fixation of genetic variation in this locus may have led to greater species divergence (Nei et al., 1975; Nei and Tajima, 1981; Ellstrand and Elam, 1993). In addition, nuclear DNA introgression in ancestral populations among species may have also affected the topology of the phylogenetic trees. The significant topological incongruence among the nuclear gene trees (Supplementary Figure ) indicates a complex evolutionary history, thereby providing novel insights into the evolution of Pinus. The four species split into two clades about 1.37 Mya (Figure ). However, we should be cautious when inferring divergence times based on the assumption that the generation time is 25 years in the four Pinus species because of the longevity of Pinus, the long overlaps between generations, the variable age of maturity and the replacement speed of forests. In addition, our multilocus analysis determined that the genetic divergence among the four pine species, was consistent with geological events and climatic oscillations in the mid- to late Tertiary period about 5 Mya. The uplift of the Tibetan Plateau caused by Himalayan orogeny had a great impact on the climate in China, with decreases of in temperature in some areas, but increases of 4–8°C in the region east of 100°E (across the Inner Mongolia, Gansu, Qinghai, Sichuan, and Yunnan regions of China) and of 1–4°C to the west (Jiang, 2009). These climatic conditions may have changed the geographic distributions of plants, and thus we suggest that P. armandii and its ancestral population spread eastward to the northeast of China and westward to Tibet. However, the intensities of the winter and summer monsoons were reduced greatly during the middle-late Pliocene, and the dispersal of Pinaceae pollen by the wind might have been affected (Zhou et al., 2014). Moreover, after gradual changes in the microhabitats and variations in the directions and amounts of gene flow, as well as the accumulation of mutations, new relatives may have emerged by gradual divergence. Effective migration, hybridization, and introgression among species can increase genetic diversity (Wachowiak et al., 2016), and other factors such as selection, isolation, and genetic drift among different microhabitats can promote divergence and speciation. Indeed, significant and asymmetric gene flow and introgression were detected in the four closely related Pinus species. Gene flow and genetic introgression among different pines could have led to changes in genetic variability (Hao et al., 2015; Wachowiak et al., 2016).

Data Availability

Sequence data obtained in this study were deposited in GenBank (KF286539 – KF286739).

Author Contributions

Z-HL conceived the study. YJ and JZ performed the experiments. Z-HL, YJ, YW, W-BF, and G-FZ contributed materials and analysis tools. Z-HL, YJ, and JZ wrote the manuscript. YJ and ZL revised the manuscript. All authors approved the final version of the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Table 1

Nucleotide variations in four Pinus species: Pinus pumila, P. griffithii, P. koraiensis, and P. armandii.

SpeciesLocusTotalNonsynonymous sitesSilent SitesRm



NLS (Singl.)θπtLSθWπaLSθWπSil
P. pumila1_1609_018038714 (0)0.008500.0082710130.007000.00916282110.009160.008071
PTIFG20098850513 (1)0.005920.0064819440.004740.0060031190.006660.006787
0_1688_02806136 (0)0.002330.0025821700.000000.0000038760.003640.004041
CL1694563055 (2)0.004210.003305900.000000.0000024550.005240.004100
0_12929_0210073514 (1)0.004250.0025528820.001550.00078443120.006040.003722
0_14221_016858518 (1)0.008010.0077928940.003910.00359285150.013150.012974
Average7811.670.005540.00516191.32.20.002870.00326325.59.70.007320.0066115
P. griffithii1_1609_01563873 (0)0.002000.0016210120.005090.0048328010.000920.000490
PTIFG2009525059 (2)0.004670.0056819430.004050.0054131160.005060.005854
0_1688_025261310 (0)0.004320.0029121760.007260.0040938740.002710.002260
CL1694163050 (0)0.000000.000005900.000000.0000024500.000000.00000
0_12929_02407350 (0)0.000000.0000029000.000000.0000044100.000000.00000
0_14221_01605853 (0)0.001400.0018028910.001010.0017428520.001780.001881
Average464.170.002070.00200191.720.002900.00268324.82.20.001750.001755
P. koraiensis1_1609_01243877 (0)0.005990.0077510120.006590.0101128250.005860.007020
PTIFG20095250511 (2)0.005710.0068419420.002710.0039431190.007580.008651
0_1688_02206135 (0)0.002920.0037421700.000000.0000038750.004560.005851
CL169417230519 (0)0.013050.011445920.006760.00156245180.014610.013865
0_12929_02207350 (0)0.000000.0000028700.000000.0000044400.000000.00000
0_14221_01245396 (3)0.003720.0028128920.002640.0018728540.004770.003711
Average528.000.005230.00543191.21.30.003120.00291325.76.80.006230.006528
P. armandii1_1609_01803879 (1)0.005470.0051910120.004670.0062228270.005830.004900
PTIFG20095650512 (4)0.006110.0048519430.003970.0032631190.007440.005844
0_1688_029661310 (0)0.003720.0066121700.000000.00000387100.005820.010342
CL1694363056 (0)0.005720.005145910.004940.0035524550.005930.005550
0_12929_02527354 (1)0.001430.0021228840.003640.0054044000.000000.000000
0_14221_0110853914 (1)0.005250.0037829370.005240.0038028770.005360.003824
Average718.000.004620.004621922.80.003740.00371325.36.30.005060.005088
Table 2

Haplotype diversity and neutrality tests for Pinus pumila, P. griffithii, P. koraiensis, and P. armandii: number of haplotypes (Nh), haplotype diversity (Hd), Tajima’s D (D), Fu and Li’s D∗ (D∗), Fu and Li’s F∗ (F∗), and Fay and Wu’s H (H).

Haplotype diversity
Neutrality tests
SpecieslocusNhHdHDDF
P. pumila1_1609_01110.7302.503−0.0881.5351.183
PTIFG2009200.9361.8010.2921.0150.916
0_1688_0280.796−0.3050.2931.1921.069
CL169460.7350.825−0.604−0.576−0.679
0_12929_02120.8091.584−1.2161.0590.346
0_14221_01160.9252.624−0.0951.2880.988
Average12.1670.8221.505−0.2360.9190.637
P. griffithii1_1609_0140.5500.508−0.4560.9590.642
PTIFG2009100.778−0.3450.6940.2530.449
0_1688_0240.551−0.074−1.0711.4070.772
CL169410.000
0_12929_0210.000
0_14221_0150.641−0.2020.6540.9501.001
Average4.1670.420−0.019−0.0300.4320.477
P. koraiensis1_1609_0160.8181.6361.1511.3511.474
PTIFG200970.689−7.4090.6600.4790.623
0_1688_0260.8440.5331.1331.3001.411
CL1694170.8201.378−0.3631.730∗∗1.141
0_12929_0210.000
0_14221_0160.758−4.030−0.934−0.504−0.696
Average7.1670.655−1.3150.2740.7260.659
P. armandii1_1609_0190.7621.554−0.1490.7330.532
PTIFG2009130.902−0.138−0.681−0.368−0.543
0_1688_02110.8610.0512.2391.3991.972∗∗
CL169460.8370.078−0.3321.2590.943
0_12929_0230.5200.0191.2890.0890.501
0_14221_01180.8020.122−0.8421.0510.482
Average100.7810.2810.2540.6940.648
Table 3

Maximum-likelihood estimates and 90% highest posterior density (HPD) intervals for demographic parameters obtained from pairwise IM multilocus analyses.

Comparisonq1q2qam1m2tT (Mya)
P. pumila and P. griffithii0.83260.09182.86500.00052.04500.90500.356
HPD90Lo0.54810.04070.75250.00050.78500.48500.191
HPD90Hi1.19160.189812.93040.46055.14509.11503.582
P. pumila and P. koraiensis0.81420.32644.85910.22751.88651.91900.754
HPD90Lo0.55710.18001.36480.00250.00351.04500.411
HPD90Hi1.15050.563713.17951.63753.846533.915013.327
P. pumila and P. armandii0.37380.40870.54560.45503.19502.62001.030
HPD90Lo0.18400.21110.00320.00500.56501.34000.527
HPD90Hi0.65280.74575.80133.97506.975039.980015.711
P. griffithii and P. koraiensis0.14700.53691.50491.07500.29500.81250.319
HPD90Lo0.06240.29890.69030.03500.00500.31750.125
HPD90Hi0.30570.90195.28693.54501.73504.56751.795
P. griffithii and P. armandii0.18460.70191.47301.15250.30505.68752.235
HPD90Lo0.07260.40460.03460.05750.00501.78750.702
HPD90Hi0.40871.144512.45503.51751.865064.837525.479
P. koraiensis and P. armandii0.43640.62720.26500.43050.01005.79002.275
HPD90Lo0.22090.33270.00590.00350.01001.87000.735
HPD90Hi0.82511.074810.57145.05054.410018.73007.360
  54 in total

1.  Distinguishing migration from isolation: a Markov chain Monte Carlo approach.

Authors:  R Nielsen; J Wakeley
Journal:  Genetics       Date:  2001-06       Impact factor: 4.562

2.  On the number of segregating sites in genetical models without recombination.

Authors:  G A Watterson
Journal:  Theor Popul Biol       Date:  1975-04       Impact factor: 1.570

3.  Population genetic evidence for complex evolutionary histories of four high altitude juniper species in the Qinghai-Tibetan Plateau.

Authors:  Zhonghu Li; Jiabin Zou; Kangshan Mao; Kao Lin; Haipeng Li; Jianquan Liu; Thomas Källman; Martin Lascoux
Journal:  Evolution       Date:  2011-12-16       Impact factor: 3.694

4.  Neutrality tests based on the distribution of haplotypes under an infinite-site model.

Authors:  F Depaulis; M Veuille
Journal:  Mol Biol Evol       Date:  1998-12       Impact factor: 16.240

5.  Statistical tests of neutrality of mutations against population growth, hitchhiking and background selection.

Authors:  Y X Fu
Journal:  Genetics       Date:  1997-10       Impact factor: 4.562

6.  Climatic adaptation and ecological divergence between two closely related pine species in Southeast China.

Authors:  Yongfeng Zhou; Lirui Zhang; Jianquan Liu; Guili Wu; Outi Savolainen
Journal:  Mol Ecol       Date:  2014-07       Impact factor: 6.185

7.  Ephemeral ecological speciation and the latitudinal biodiversity gradient.

Authors:  Asher D Cutter; Jeremy C Gray
Journal:  Evolution       Date:  2016-08-23       Impact factor: 3.694

8.  Statistical tests of neutrality of mutations.

Authors:  Y X Fu; W H Li
Journal:  Genetics       Date:  1993-03       Impact factor: 4.562

9.  Demography and speciation history of the homoploid hybrid pine Pinus densata on the Tibetan Plateau.

Authors:  Jie Gao; Baosheng Wang; Jian-Feng Mao; Pär Ingvarsson; Qing-Yin Zeng; Xiao-Ru Wang
Journal:  Mol Ecol       Date:  2012-07-31       Impact factor: 6.185

10.  Natural Selection and Recombination Rate Variation Shape Nucleotide Polymorphism Across the Genomes of Three Related Populus Species.

Authors:  Jing Wang; Nathaniel R Street; Douglas G Scofield; Pär K Ingvarsson
Journal:  Genetics       Date:  2015-12-31       Impact factor: 4.562

View more
  1 in total

1.  Phenotypic Variability and Genetic Diversity in a Pinus koraiensis Clonal Trial in Northeastern China.

Authors:  David Kombi Kaviriri; Qinhui Zhang; Xinxin Zhang; Luping Jiang; Jinpeng Zhang; Jingyuan Wang; Damase P Khasa; Xiangling You; Xiyang Zhao
Journal:  Genes (Basel)       Date:  2020-06-19       Impact factor: 4.096

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.