| Literature DB >> 32808664 |
Zhou Hong1, Jiang Li2, Xiaojin Liu1, Jinmin Lian2, Ningnan Zhang1, Zengjiang Yang1, Yongchao Niu2, Zhiyi Cui1, Daping Xu1.
Abstract
BACKGROUND: Dalbergia odorifera T. Chen (Fabaceae) is an International Union for Conservation of Nature red-listed tree. This tree is of high medicinal and commercial value owing to its officinal, insect-proof, durable heartwood. However, there is a lack of genome reference, which has hindered development of studies on the heartwood formation.Entities:
Keywords: zzm321990 Dalbergia odorifera T. Chen; zzm321990 de novo sequencing; annotation; chromosome-level genome assembly; phylogeny
Year: 2020 PMID: 32808664 PMCID: PMC7433187 DOI: 10.1093/gigascience/giaa084
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Sequencing data used for the D. odorifera genome assembly
| Libraries | Insert size (bp) | Raw data (Gb) | Clean data (Gb) | Read length (bp) | Sequence coverage (×) |
|---|---|---|---|---|---|
| Illumina reads | 350 | 73.88 | 73.80 | 150 | 113.06 |
| PacBio reads | 20,000 | 67.80 | 67.74 | 11,201 | 103.76 |
| 10x Genomics | 500 | 118.27 | 116.06 | 150 | 180.99 |
| Hi-C | 350 | 156.84 | 155.86 | 150 | 240.02 |
| Total | 416.79 | 413.46 | 637.83 |
The coverage was calculated using an estimated genome size of 653.45 Mb.
Figure 1:Circos plot shows the characterization of the D.odorifera genome. I: Syntenic regions within D. odorifera assembly based on homology searches were found with MCscan [51] requiring ≥30 genes per block (links); II: GC content in non-overlapping 1-Mb windows (histograms); III: Percent coverage of TEs in non-overlapping 1-Mb windows (heat maps); IV: Gene density calculated on the basis of the number of genes in non-overlapping 1 Mb windows (heat maps); V: Length of super-scaffolds in megabase pairs.
Statistics for the D. odorifera genome
| Assembly feature | Value |
|---|---|
| Estimated genome size (by | 653.45 Mb |
| No. of scaffolds | 384 |
| Contig N50 | 5.92 Mb |
| Scaffold N50 | 56.16 Mb |
| Longest scaffold | 79.61 Mb |
| Assembly length | 638.26 Mb |
| Assembly % of genome | 97.68 |
| Repeat region % of assembly | 54.17 |
| Predicted gene models | 30,310 |
| Mean coding sequence length | 1121.36 bp |
| Mean exons per gene | 4.93 |
Classifications of transposable elements predicted by each method
| Type | Repbase + Denovo | TE proteins | Combined TEs | |||
|---|---|---|---|---|---|---|
| Length (bp) | % in genome | Length (bp) | % in genome | Length (bp) | % in genome | |
| DNA | 55,376,910 | 8.68 | 10,310,575 | 1.62 | 58,455,913 | 9.16 |
| LINE | 6,620,833 | 1.04 | 4,004,118 | 0.63 | 9,241,306 | 1.45 |
| SINE | 292,685 | 0.05 | 0 | 0 | 292,685 | 0.05 |
| LTR | 236,380,844 | 37.03 | 56,821,684 | 8.9 | 240,620,255 | 37.7 |
| Other | 0 | 0 | 0 | 0 | 0 | 0 |
| Satellite | 116,222 | 0.02 | 0 | 0 | 116,222 | 0.02 |
| Simple repeat | 1,238,189 | 0.19 | 0 | 0 | 1,238,189 | 0.19 |
| Unknown | 48,638,508 | 7.62 | 0 | 0 | 48,638,508 | 7.62 |
| Total | 332,705,455 | 52.13 | 71,048,270 | 11.13 | 340,525,618 | 53.35 |
Note: Repbase + Denovo: RepeatMasker results based on Repbase, RepeatModeler, RepeatScout, Piler, and LTR_FINDER; TE proteins: RepeatProteinMask results based on Repbase; Combined TEs: combined results of Denovo + Repbase and TE proteins. LINE: long interspersed nuclear element; SINE: short interspersed nuclear element; TE: transposable element.
Figure 2:Evolution of the D. odorifera genome. (A) Venn diagram of shared and unique orthologous gene families in D. odorifera and 4 other legumes. (B) Predicted orthologous protein compositions for the 10 genomes. (C) Expansion and contraction of gene families. The numbers in green indicate the number of gene families that expanded in the species during evolution, and the numbers in red indicate the number of gene families that contracted. (D) 4DTV (4-fold degenerate transversion rate) plot. A.tha:Arabidopsis thaliana; A. dur:Arachis duranensis; P.tri:Populus trichocarpa; D.odo:Dalbergia odorifera; C.caj:Cajanus cajan; G.max:Glycine max; M.tru:Medicago truncatula; M.dom:Malus domestica; E.gra:Eucalyptus grandis; V.vin:Vitis vinifera.