| Literature DB >> 30698701 |
Dongyan Zhao1, John P Hamilton1, Wajid Waheed Bhat2,3, Sean R Johnson2, Grant T Godden4, Taliesin J Kinser4,5, Benoît Boachon6, Natalia Dudareva6, Douglas E Soltis4,5, Pamela S Soltis4, Bjoern Hamberger2, C Robin Buell1,7,8.
Abstract
BACKGROUND: Teak, a member of the Lamiaceae family, produces one of the most expensive hardwoods in the world. High demand coupled with deforestation have caused a decrease in natural teak forests, and future supplies will be reliant on teak plantations. Hence, selection of teak tree varieties for clonal propagation with superior growth performance is of great importance, and access to high-quality genetic and genomic resources can accelerate the selection process by identifying genes underlying desired traits.Entities:
Keywords: chromosomal-scale assembly; tandem-duplicated genes; teak; terpene synthases
Mesh:
Substances:
Year: 2019 PMID: 30698701 PMCID: PMC6394206 DOI: 10.1093/gigascience/giz005
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:A young teak tree. Photo taken by Phong Ek [CC BY 2.0 (https://creativecommons.org/licenses/by/2.0)], via Wikimedia Commons
Metrics of contigs and scaffolds of the current assembly
| Initial assembly using PacBio reads (contigs) | Assembly after Hi-C scaffolding (scaffolds) | |
|---|---|---|
| Total sequences | 1,474 | 936 |
| Total size (bp) | 338,318,549 | 338,300,341 |
| Maximum sequence size (bp) | 21,267,566 | 20,661,910 |
| Minimum sequence size (bp) | 1,168 | 1,168 |
| N50 sequence size (bp) | 3,749,470 | 16,483,567 |
| N90 sequence size (bp) | 52,675 | 463,203 |
| Average sequence size (bp) | 229,524 | 361,432 |
Cumulative size of contigs and scaffolds of the current assembly
| Initial assembly using PacBio reads (contigs) | |||
|---|---|---|---|
| Contig size | Total size (bp) | %Total assembly | Number of Contigs |
| ≥1 Mbp | 248,187,558 | 73.37 | 64 |
| ≥0.5 Mbp | 267,412,682 | 79.06 | 91 |
| ≥0.1 Mbp | 291,028,790 | 86.04 | 198 |
| ≥0.05 Mbp | 305,851,391 | 90.42 | 420 |
| Assembly after Hi-C scaffolding (scaffolds) | |||
| Scaffold size | Total size (bp) | %Total assembly | Number of Scaffolds |
| ≥1 Mbp | 304,435,280 | 89.99 | 19 |
| ≥0.5 Mbp | 304,435,280 | 89.99 | 19 |
| ≥0.1 Mbp | 308,724,809 | 91.26 | 41 |
| ≥0.05 Mbp | 314,467,503 | 92.96 | 134 |
Whole genome shotgun reads
| Sample name | NCBI SRA run ID | QC-passed reads | Mapped | Properly paired out of total reads |
|---|---|---|---|---|
| Teak_TruSeq_01 | SRR7984127 | 168,566,966 | 165,783,328 (98.35%) | 163,390,358 (97.40%) |
| Teak_TruSeq_02 | SRR7984127 | 188,504,116 | 185,541,771 (98.43%) | 182,934,854 (97.15%) |
| TEC_AA_01 | SRR7984129 | 371,978,214 | 364,473,434 (97.98%) | 357,722,188 (96.65%) |
| TEC_AA_02 | SRR7984129 | 394,477,964 | 386,545,305 (97.99%) | 379,620,884 (96.72%) |
| TEC_AB_01 | SRR7984130 | 89,116,777 | 87,087,277 (97.72%) | 84,001,838 (94.93%) |
| TEC_AB_02 | SRR7984130 | 81,436,054 | 79,540,000 (97.67%) | 76,733,986 (94.89%) |
Figure 2:Gene and repeat density across the 19 pseudomolecules in the assembly. Green asterisks denote telomere tracks.
Figure 3:Differential expression of tandem copies of genes in lignin biosynthetic pathway. stem12yr: stem secondary xylem of a 12-year-old teak tree; stem60yr: stem secondary xylem of a 60-year-old teak tree; branch12yr: branch secondary xylem of a 12-year-old teak tree; branch60yr: branch secondary xylem of a 60-year-old teak tree.
Figure 4:Maximum likelihood tree of peptide sequences of terpene synthase (TPS) family genes from the Tectona grandis (red branches), Arabidopsis thaliana (green branches), and Eucalyptus grandis (blue branches). Red dots denote teak TPSs expressed in stems.
Figure 5:Proposed diterpene pathway based on functional validation.
Figure 6:Expression of terpene synthases in various tissues of teak. Six monoterpene synthases (clade I and II as denoted on the nodes) and three putative sesquiterpene synthases (clade III) exhibited high expression in branches and stems of 12- and 60-year-old teak trees.
Figure 7:A physical cluster of TPS/CYP genes on pseudomolecule 5 and their expression in different tissues of teak. Horizontal arrows denote genes with their gene classification listed above and gene IDs below, where unfilled arrows denote partial genes and black arrows denote genes that are not TPS/CYP.