| Literature DB >> 34292587 |
Yong-Peng Ma1, Hafiz Muhammad Wariss1, Rong-Li Liao1,2, Ren-Gang Zhang3, Quan-Zheng Yun3, Richard G Olmstead4, John H Chau5, Richard I Milne6, Yves Van de Peer7,8,9,10, Wei-Bang Sun1.
Abstract
Understanding processes that generate and maintain large disjunctions within plant species can provide valuable insights into plant diversity and speciation. The butterfly bush Buddleja alternifolia has an unusual disjunct distribution, occurring in the Himalaya, Hengduan Mountains (HDM) and the Loess Plateau (LP) in China. We generated a high-quality, chromosome-level genome assembly of B. alternifolia, the first within the family Scrophulariaceae. Whole-genome re-sequencing data from 48 populations plus morphological and petal colour reflectance data covering its full distribution range were collected. Three distinct genetic lineages of B. alternifolia were uncovered, corresponding to Himalayan, HDM and LP populations, with the last also differentiated morphologically and phenologically, indicating occurrence of allopatric speciation likely to be facilitated by geographic isolation and divergent adaptation to distinct ecological niches. Moreover, speciation with gene flow between populations from either side of a mountain barrier could be under way within LP. The current disjunctions within B. alternifolia might result from vicariance of a once widespread distribution, followed by several past contraction and expansion events, possibly linked to climate fluctuations promoted by the Kunlun-Yellow river tectonic movement. Several adaptive genes are likely to be either uniformly or diversely selected among regions, providing a footprint of local adaptations. These findings provide new insights into plant biogeography, adaptation and different processes of allopatric speciation.Entities:
Keywords: Kunlun-Yellow river tectonic movement; Loess Plateau; Scrophulariaceae; allopatric speciation; demographic history; whole-genome sequencing
Mesh:
Year: 2021 PMID: 34292587 PMCID: PMC9291457 DOI: 10.1111/nph.17637
Source DB: PubMed Journal: New Phytol ISSN: 0028-646X Impact factor: 10.323
Fig. 1Flower characteristics and contrasting habitats of Buddleja alternifolia in the Loess Plateau (LP) (a, b) and Himalaya (c, d). The red arrow is pointing to individuals of B. alternifolia growing some distance apart in LP in (b), whereas many plants are commonly growing close together alongside the river in the Himalaya (d). (e) PCA result of morphological characteristics in Himalaya, Hengduan Mountains (HDM) and LP, as well as eastern (eLP) and western LP (wLP). (f) Petal colour reflection spectrum result of B. alternifolia in Himalaya, HDM and LP, as well as eastern (eLP) and western LP (wLP).
Fig. 2Genome evolution analysis of Buddleja alternifolia. (a) The genome features across 19 chromosomes of B. alternifolia. From the outermost to innermost circles are Class I transposable element (TE) (long and short interspersed nuclear elements) density; Class II TE (DNA and Heliron) density; coding gene (messenger RNA) density, heterozygous (single‐nucleotide polymorphisms, insertions, and deletions) density; GC content and genome colinear blocks. (b) Phylogenetic tree reconstructed using the maximum‐likelihood (ML) method and 1227 single‐copy genes with divergence time estimated with r8s v.1.81 on the basis of three calibration points (red circles). * Indicates conflicting positions between A. paniculata and S. indicum inferred by Astral‐based and ML‐based approaches (see Supporting Information Fig. S1 for details).
Fig. 3Population genomics of Buddleja alternifolia. (a) Sample collection of 48 populations of B. alternifolia, with colours in the pie charts referring to genetic groups identified by faststructure results, and size to the heterozygosity rate in different areas. Dotted line indicates the location of the Liupanshan mountain (LPS). (b) Population genetic structure of B. alternifolia by faststructure. (c) Significant differences of heterozygosity rates (mean ± SE) among three distribution regions of B. alternifolia. (d, e) PCA plots of genetic variation in B. alternifolia, with the proportion of the variance explained as (d) 44.4% for PC1 and 25.1% for PC2, and (e) 44.4% for PC1 and 5.8% for PC3. (f) A neighbour‐joining (NJ) phylogenetic tree of B. alternifolia based on SNPs from whole‐genome re‐sequencing data.
Statistics of Buddleja alternifolia genome assembly and annotation.
| Characteristics | Size and number |
|---|---|
| Total assembly size | 853 754 563 |
| Total number of contigs | 1096 |
| Maximum contig length | 10 270 997 bp |
| Minimum contig length | 5000 bp |
| Contig N50 length | 1942 762 bp |
| Contig L50 count | 129 |
| Contig N90 length | 514 978 bp |
| Contig L90 count | 431 |
| Total number of scaffolds | 439 |
| Maximum scaffold length | 62 290 481 bp |
| Minimum scaffold length | 5000 bp |
| Scaffold N50 length | 43 104 400 bp |
| Scaffold L50 count | 9 |
| Scaffold N90 length | 33 029 929 bp |
| Scaffold L90 count | 18 |
| Gap number | 657 |
| GC content | 36.10% |
| Gene number | 31 116 |
| Protein‐coding genes | 29 434 |
| Repeat content | 71.70% |
Sample size and genetic diversity of three lineages of Buddleja alternifolia.
| Region | No. populations/plants | No. of SNPs | π (SD) (10−3) |
|
|---|---|---|---|---|
| Himalaya | 15/30 | 4022 382 | 13.84 (0.79) | 13.25 (1.28) |
| LP | 29/59 | 2539 728 | 6.51 (0.59) | 7.11 (0.57) |
| HDM | 4/8 | 2073 747 | 8.45 (0.72) | 7.13 (0.56) |
| Total | 48/97 | 6373 626 | 9.60 (3.20) | 9.16 (3.04) |
π average number of pairwise nucleotide differences per site, θ W Watterson's estimator of θ per base pair.
The individual for de novo whole‐genome sequencing was included in LP.
Fig. 4Demography histories and ancestral area reconstruction of Buddleja alternifolia. (a) Schematic of demographic scenarios among the three lineages and (b) eastern and western sublineages within the Loess Plateau (eLP and wLP), modelled using fastsimcoal2, with column width representing the relative effective population size. The numbers on the vertical axis indicate the estimated time of population divergence. (c) Historic changes in N for Himalaya, LP and Hengduan Mountains (HDM), and (d) for the two sublineages within LP using stairway plotting, with lighter and darker colours indicating 95% and 75% confident intervals of shaded areas. (e) Maximum clade credibility (MCC) chronogram of ancestral area reconstruction of B. alternifolia inferred from Bayesian analysis in Beast. Pie charts indicate relative probability of ancestral distributions at nodes from BioGeoBears analysis, using the best fitting Divalike model.