| Literature DB >> 31745089 |
Jia-Hui Chen1, Yuan Huang2, Benjamin Brachi3, Quan-Zheng Yun4, Wei Zhang5,6, Wei Lu6, Hong-Na Li4, Wen-Qing Li7, Xu-Dong Sun8,9, Guang-Yan Wang8,9, Jun He9, Zhuo Zhou7, Kai-Yun Chen7, Yun-Heng Ji7, Ming-Ming Shi7, Wen-Guang Sun7, Yong-Ping Yang10,11, Ren-Gang Zhang4, Richard J Abbott12, Hang Sun13.
Abstract
The Hengduan Mountains (HDM) biodiversity hotspot exhibits exceptional alpine plant diversity. Here, we investigate factors driving intraspecific divergence within a HDM alpine species Salix brachista (Cushion willow), a common component of subnival assemblages. We produce a high-quality genome assembly for this species and characterize its genetic diversity, population structure and pattern of evolution by resequencing individuals collected across its distribution. We detect population divergence that has been shaped by a landscape of isolated sky island-like habitats displaying strong environmental heterogeneity across elevational gradients, combined with population size fluctuations that have occurred since approximately the late Miocene. These factors are likely important drivers of intraspecific divergence within Cushion willow and possibly other alpine plants with a similar distribution. Since intraspecific divergence is often the first step toward speciation, the same factors can be important contributors to the high alpine species diversity in the HDM.Entities:
Mesh:
Year: 2019 PMID: 31745089 PMCID: PMC6864086 DOI: 10.1038/s41467-019-13128-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Cushion willow distribution pattern and locations of population samples. a Female individual with flowers. b Male individual with flowers. c Low elevation Cushion willow population located on open river bank at an elevation of 2950 m above sea level. d High elevation Cushion willow population on a scree slope at an elevation of 4000 m. e Sample locations of the 14 Cushion willow populations in HDM and nearby area of the eastern Himalaya. Pie charts at sampling locations indicate the distribution of genetic groups identified by a Structure analysis of the population. f Scheme of sky island distribution pattern
Fig. 2Cushion willow evolution. a Circos plot showing the genomic features of Salix brachista. The features highlighted from innermost to outermost circle are (i) intra-genome collinear blocks connected by curved lines, (ii) GC content (30%–50%), (iii) SNP density, (iv) gene density (0–50 per window), (v) distribution of Class II TE (scale: 0%–5%), (vi) distribution of Class I TE (scale: 0%–5%). All statistics are computed for windows of 200 kb. b Gene family evolution of Cushion willow. The phylogenetic tree was constructed from sequences of 3040 single-copy genes shared by ingroup Sal-SP species. The number of gene families that expanded (turquoise), contracted (magenta) and were rapidly evolving (blue) in each lineage after speciation are indicated above the corresponding branch. c Gene collinearity between S. brachista and P. trichocarpa. The x-axis and y-axis correspond to the P. trichocarpa and S. brachista chromosomes, respectively. d Collinearity between chromosome 1 and 16 of S. brachista and P. trichocarpa
Statistics of the Cushion willow genome assembly
| Total assembly size (Mb) | 339.588 |
| Total number of contigs | 78 |
| Total anchored size (Mb) | 337.27 |
| Maximum contig length (Mb) | 21.273 |
| Minimum contig length (Kb) | 57.08 |
| Contig N50 length (Mb) | 9.522 |
| Contig L50 count | 13 |
| Contig N90 length (Mb) | 2.775 |
| Contig L90 count | 38 |
| Total number of scaffolds | 30 |
| Maximum scaffold length (Mb) | 39.689 |
| Minimum scaffold length (Kb) | 57.08 |
| Scaffold N50 length (Mb) | 17.922 |
| Scaffold L50 count | 8 |
| Scaffold N90 length (Mb) | 13.388 |
| Scaffold L90 count | 17 |
| Gap number | 48 |
| Gap length (bp) | 3408 |
| GC content (%) | 34.15 |
| Gene number | 30,209 |
| Repeat content (%) | 41.65 |
Fig. 3Population genetic strucure and demographic history. a Genetic structure of Cushion willow. The length of each colored segment represents the proportion of the individual’s genome from K = 7 ancestral genetic groups. The populations grouped by county individual identifiers are indicated along the x-axis. b Neighbor-joining phylogenetic tree of Cushion willow based on SNPs from whole-genome resequencing. Branch lengths are scaled to genetic similarity (p distance, see scale bar). c PCA plot of Cushion willow genetic variation. The fractions of the variance explained by eigenvector 1 and 2 are 11.0% and 9.7%, respectively. Each shaded area groups populations from the same region: pink for populations from northwest Yunnan at the center of the area sampled, blue for eastern peripheral populations, and green for the west most peripheral populations. d Violin plot of Tajima’s D in whole genome of Salix brachista. The width depicts a 900-rotated kernel density trace and its reflection. Vertical black boxes denote the interquartile range between 25 and 75 percentiles, and the white point inside denotes the median. n = 168,546. e Detection of gene flow between Cushion willow populations. Lines represent gene flow, with arrows indicating the direction of gene flow. The horizontal scale bar at the bottom (drift parameter) shows a tenfold average standard error of the entries in the sample covariance matrix. The color scale shows the migration weight: red denotes strong gene flow, while yellow denotes weak gene flow. f Stairway plot showing historical changes in effective population size (y-axis) for LJ population with a generation time of 10 years. Red, dark gray, and light gray lines denote the medians, 12.5 and 87.5 percentiles, 2.5 and 97.5 percentiles of population sizes, respectively. The source data underlying Fig. 3d are provided as a Source Data file
Fig. 4Signatures of selection in Cushion willow sampled from high and lower elevation sites. a Pairwise FST between populations (y-axis) as a function of the pairwise spatial distance between populations (x-axis). Each point represents one population pair. The solid blue line represents a significant linear relationship; parameter estimates for the linear regression, as well as Pearson’s correlation coefficient and associated p-value are given at the top-right corner of panel b. Heatmap of pairwise FST estimates between populations. c Patterns of linkage disequilibrium (LD). LD (y-axis) decays as a function of genomic distance between polymorphisms (x-axis) in high elevation (solid red line) and low elevation (solid blue line) Cushion willows populations. LD was measured by r2. d Manhattan plots of CLR among the high and low elevation Cushion willow plants of LJ populations. Dashed line indicates CLR value threshold. Genes located within the significant CLR peaks and corresponding annotations are denoted