| Literature DB >> 32118265 |
Xupo Ding1, Wenli Mei1, Qiang Lin2, Hao Wang1, Jun Wang1, Shiqing Peng3, Huiliang Li3, Jiahong Zhu3, Wei Li1, Pei Wang1, Huiqin Chen1, Wenhua Dong1, Dong Guo3, Caihong Cai1, Shengzhuo Huang1, Peng Cui2, Haofu Dai1.
Abstract
BACKGROUD: Aquilaria sinensis (Lour.) Spreng is one of the important plant resources involved in the production of agarwood in China. The agarwood resin collected from wounded Aquilaria trees has been used in Asia for aromatic or medicinal purposes from ancient times, although the mechanism underlying the formation of agarwood still remains poorly understood owing to a lack of accurate and high-quality genetic information.Entities:
Keywords: zzm321990 Aquilaria sinensiszzm321990 ; Hi-C sequencing; agarwood; annotation; chromosome-level genome assembly
Year: 2020 PMID: 32118265 PMCID: PMC7050300 DOI: 10.1093/gigascience/giaa013
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Morphological characteristics of Aquilaria sinensis. (a) mature tree; (b) flower; (c) fruit; (d) seed; (e) cracked seed; (f) agarwood generation; (g) agarwood. The images b–e were captured using a stereoscopic fluorescence microscope (Olympus SZX16, Pittsburgh, PA) under the dark field. All the photos were taken by Dr. Jun Wang and processed by Dr. Xupo Ding.
Figure 2:Hi-C interaction matrix for A. sinensis genome assembly using 8 clusters.
Statistics of the final genome assembly for Aquilaria sinensis
| Statistic | Contig length (bp) | Contig No. | Scaffold length (bp) | Scaffold No. |
|---|---|---|---|---|
| N50 | 1,058,652 | 164 | 88,784,932 | 4 |
| N60 | 726,407 | 246 | 86,380,100 | 5 |
| N70 | 495,861 | 366 | 84,956,755 | 6 |
| Longest | 11,913,571 | 1 | 109,870,270 | 1 |
| Total | 720,187,708 | 2,015 | 726,587,161 | 9 |
| Length ≥1 kb | 720,187,482 | 2,013 | 726,587,161 | 9 |
| Length ≥2 kb | 720,179,880 | 2,008 | 726,587,161 | 9 |
| Length ≥5 kb | 720,112,854 | 1,991 | 726,587,161 | 9 |
Statistics of transposable elements in Aquilaria sinensis genome sequences
| Type | Repbase TEs | Mips-REdat TEs | TE proteins | RepeatModeler | Combined TEs | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Length (Mb) | % in genome | Length (Mb) | % in genome | Length (Mb) | % in genome | Length (Mb) | % in genome | Length (Mb) | % in genome | |
| DNA | 13,223,408 | 1.84 | 1,392,136 | 0.19 | 10,456,270 | 1.45 | 28,698 131 | 3.98 | 38,895,471 | 5.4 |
| LINE | 2,916,904 | 0.41 | 253,492 | 0.04 | 7,680,548 | 1.07 | 6,394,899 | 0.89 | 12,239,695 | 1.7 |
| LTR | 73,748,923 | 10.24 | 22,973,865 | 3.19 | 75,336,839 | 10.46 | 138,348,032 | 19.21 | 192,609,862 | 26.74 |
| SINE | 2,232 | 0 | 1,145 | 0 | 0 | 0 | 0 | 0 | 4,539 | 0 |
| Other | 6,189,190 | 0.86 | 380,555 | 0.05 | 1,369,337 | 0.19 | 0 | 0 | 87,659,087 | 12.17 |
| Unknown | 35,443 | 0 | 0 | 0 | 0 | 0 | 124,331,790 | 17.26 | 94,460,416 | 13.12 |
| Total | 96,116,100 | 13.35 | 25,001,193 | 3.47 | 94,842,994 | 13.17 | 296,679,047 | 41.19 | 425,869,070 | 59.13 |
Figure 3:Comparative genomic analysis of Aquilaria sinensis and other plant species. (a) Distribution of genes and gene families of 13 plant species we investigated. (b) Venn diagram showing the distribution of shared gene families among the Malvales plants Aquilaria sinensis (agarwood), Theobroma cacao (cocoa), Gossypium hirsutum (cotton), Corchorus olitorius (jute), and the model plant Arabidopsis thaliana (Arabidopsis). (c) Divergence time estimation and gene family changes among 13 plant species. The black number at each node denotes estimated divergence time from present (million years ago). The blue number at the root (11,885) denotes the total number of gene families predicted in the most recent common ancestor (MRCA), and the green/red numbers around each branch denote gene family gain/loss number. The red nodes indicate the known divergence time of Asterids and Rosids. (d) Transversion substitutions at 4-fold degenerate sites (4dTv) distribution in selected assemblies of A. sinensis, A. thaliana, O. sativa, M. truncatula, and V. vinifera.