| Literature DB >> 35137004 |
Hai-Lin Liu1,2,3,4, A J Harris1,5, Zheng-Feng Wang1,6,7,8, Hong-Feng Chen1,5, Zhi-An Li1,6,7,8, Xiao Wei9.
Abstract
Among relic species, genomic information may provide the key to inferring their long-term survival. Therefore, in this study, we investigated the genome of the Paleogene relic tree species, Bretschneidera sinensis, which is a rare endemic species within southeastern Asia. Specifically, we assembled a high-quality genome for B. sinensis using PacBio high-fidelity and high-throughput chromosome conformation capture reads and annotated it with long and short RNA sequencing reads. Using the genome, we then detected a trade-off between active and passive disease defences among the gene families. Gene families involved in salicylic acid and MAPK signalling pathways expanded as active defence mechanisms against disease, but families involved in terpene synthase activity as passive defences contracted. When inferring the long evolutionary history of B. sinensis, we detected population declines corresponding to historical climate change around the Eocene-Oligocene transition and to climatic fluctuations in the Quaternary. Additionally, based on this genome, we identified 388 single nucleotide polymorphisms (SNPs) that were likely under selection, and showed diverse functions in growth and stress responses. Among them, we further found 41 climate-associated SNPs. The genome of B. sinensis and the SNP dataset will be important resources for understanding extinction/diversification processes using comparative genomics in different lineages.Entities:
Keywords: SNP; gene family; genome assembly; population genetics; resequencing
Mesh:
Year: 2022 PMID: 35137004 PMCID: PMC8825261 DOI: 10.1093/dnares/dsac003
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.477
Figure 1Representative photographs of Bretschneidera sinensis, including the whole tree, flowers, and fruits.
Sampling and climate information for 13 B. sinensis is populations in China
| Population | Abbreviation | Longitude | Latitude | Sample size | AMT ( | RH (%) | AP (mm) | E (mm) | AS (mm) | LWR (W/m2) | SWR (W/m2) |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Enshi | ES | 109 | 30 | 6 | 12.066 | 86.004 | 1,413.520 | 1,022.056 | 45.793 | 315.045 | 185.184 |
| Chongyi | CY | 114 | 25 | 15 | 16.835 | 85.914 | 1,617.485 | 1,202.128 | 28.841 | 346.931 | 195.432 |
| Jingning | JN | 119 | 27 | 15 | 15.289 | 88.085 | 2,152.491 | 1,226.724 | 38.463 | 340.070 | 193.905 |
| Longquan | LQ | 119 | 28 | 6 | 17.425 | 86.121 | 1,821.218 | 1,253.450 | 12.913 | 351.562 | 192.497 |
| Nanping | NP | 118 | 26 | 15 | 18.612 | 82.797 | 1,809.063 | 1,255.184 | 2.377 | 360.926 | 188.629 |
| Huizhou | HZ | 113 | 23 | 13 | 17.508 | 81.551 | 2,249.696 | 1,266.683 | 6.929 | 347.709 | 199.541 |
| Longsheng | LS | 109 | 25 | 15 | 14.477 | 85.583 | 1,896.454 | 1,169.204 | 50.396 | 332.110 | 198.686 |
| Jinxiu | JX | 110 | 24 | 15 | 14.594 | 82.132 | 1,694.328 | 1,138.587 | 33.598 | 329.956 | 201.576 |
| Liping | LP | 109 | 26 | 15 | 16.124 | 86.940 | 1,425.229 | 1,152.270 | 26.413 | 343.832 | 192.702 |
| Pingbian | PB | 103 | 22 | 8 | 16.874 | 78.295 | 1,589.989 | 1,195.990 | 1.225 | 338.621 | 223.474 |
| Zhangjiajie | ZJJ | 109 | 29 | 15 | 11.956 | 86.213 | 1,472.285 | 1,027.856 | 64.263 | 315.403 | 190.303 |
| Taibei | TB | 121 | 25 | 10 | 22.234 | 83.709 | 2,404.350 | 1,474.141 | 0.000 | 384.059 | 180.040 |
| Wuzhishan | WZS | 109 | 18 | 11 | 20.126 | 81.715 | 1,874.228 | 1,155.587 | 0.000 | 364.364 | 214.244 |
| Total | 154 |
Climate variables are averaged from year 1982 to 2020 as downloaded from https://disc.gsfc.nasa.gov (last accessed 20 December 2021).
AMT: annual mean temperature; RH: relative humidity; AP: annual precipitation; E: Evapotranspiration; AS: annual snowfall; LWR: long-wave radiation; SWR: short-wave radiation.
Figure 2Sampling locations and population structures in B. sinensis. (A) Geographic distribution of 13 B. sinensis populations in China. (B) First two components of a PCA of SNPs. (C)ADMIXTURE results for K = 2.
Statistics of the B. sinensis genome assembly
| Statistics of HiFi assembly | Statistics of the Hi-C scaffolded assembly | |||
|---|---|---|---|---|
| Sequence length (bp) | Order of sequence length | Sequence length (bp) | Order of sequence length | Sequence length in the genome (%) |
| N10 = 64,553,695 | L10 = 2 | N10 = 152,689,475 | L10 = 1 | 13.04 |
| N20 = 54,398,357 | L20 = 4 | N20 = 151,999,468 | L20 = 2 | 12.98 |
| N30 = 48,448,289 | L30 = 6 | N30 = 144,368,824 | L30 = 3 | 12.33 |
| N40 = 29,147,647 | L40 = 10 | N40 = 140,467,444 | L40 = 4 | 12.00 |
| N50 = 24,630,967 | L50 = 14 | N50 = 137,464,959 | L50 = 5 | 11.74 |
| N60 = 19,150,349 | L60 = 20 | N60 = 103,046,003 | L60 = 6 | 8.80 |
| N70 = 14,033,915 | L70 = 28 | N70 = 102,104,410 | L70 = 7 | 8.72 |
| N80 = 9,569,185 | L80 = 38 | N80 = 101,254,378 | L80 = 8 | 8.65 |
| N90 = 3,151,252 | L90 = 58 | N90 = 87,593,448 | L90 = 9 | 7.48 |
| N100 = 10,839 | L100 = 1,416 | N100 = 1,000 | L100 = 1,621 | |
| Total length | 1,222,797,721 bp | 1,170,884,685 bp | ||
| Average | 863,557.71 bp | 722,322.45 bp | ||
| Largest | 89,979,610 bp | 152,689,475 bp | ||
| Shortest | 10,839 bp | 1,000 bp | ||
Figure 3Features of the genome assembly (A) Hi-C interaction heat map (bin length 210,000 bp) for the B. sinensis genome. (B) Genome features across nine inferred chromosomes of B. sinensis. The grey-shaded tracks represent nine assembled chromosomes (scale: Mb); green represents repeat density; red represents gene density, and dark blue represents SNP density; and the cyan lines inside the circle represent syntenic blocks. The densities of repeats, genes, and SNPs were estimated based on a 106 bp window.
Figure 4A phylogenetic tree with mapped gene family contractions (–) and expansions (+) in B. sinensis and other species. The number at the root denotes the total gene families within the most recent common ancestor (MRCA). Please note that not all gene family contractions or expansions presented in the tree are significant. A time scale (in millions of years) is shown beneath the tree. Numbers of identified TPS genes with their Pfam ID PF01397 and PF03936 in malvids species are shown on the left of the tree.
Figure 5GO enrichment analysis of the genes in significantly expanded gene families.
Figure 6Demographic history of B. sinensis shows the historical changes in effective population size. Arrows indicate large declines in population size that are described in the text.
Figure 7RDA shows the association of climate variables with (A) SNPs and (B) individuals. The vectors indicate the climate variables of the first two RDA components. IBD shows the association between genetic differentiation () and geographic distance using (C) non-adaptive and (D) adaptive SNPs in B. sinensis. ‘**’ means P < 0.01.