| Literature DB >> 35171284 |
Liang Tang1, Xuezhu Liao2, Luke R Tembrock3, Song Ge4, Zhiqiang Wu2.
Abstract
Vatica mangachapoi is a tropical tree species native to Southeast Asia. It has long been valued as a timber species because the wood resists decay, but it is now considered vulnerable to extinction due to habitat loss and overexploitation. Here, we present the first chromosome-level genome assembly of V. mangachapoi that we created by combining data from PacBio long read sequencing with Hi-C proximity ligation and Illumina short-read sequencing. The assembled genome was 456.21 Mb, containing 11 chromosome and a BUSCO score of 93.4%. From the newly assembled genome, 46,811 protein-coding genes were predicted. Repetitive DNA accounted for 53% of the genome. Phylogenomic and gene family analyses showed that V. mangachapoi diverged from a common ancestor of Gossypium raimondii 70 million years ago. Transcriptome analyses found 227 genes that were differentially expressed in the leaves of plants grown in normal soil relative to plants grown in dry, coastal, sandy soil. For these genes, we identified three significantly enriched with GO terms: responses to organonitrogen compounds, chitin-triggered immunity, and wound response. This genome provides an important comparative benchmark not only for future conservation work on V. mangachapoi but also for phylogenomics work on Dipterocarpaceae.Entities:
Keywords: conservation biology; dipterocarp forests; genome assembly; tree genomics; whole-genome duplication
Mesh:
Year: 2022 PMID: 35171284 PMCID: PMC8882376 DOI: 10.1093/dnares/dsac005
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 2Comparison of protein-coding genes and phylogenetic analysis. (A) Comparison of classifications of protein-coding genes in the nine species used for comparative genomic analysis. (B) Copy number distribution of gene families in the nine species. (C) Phylogenetic tree inferred from the orthologous gene sets of V. mangachapoi and eight other species. The numbers after ‘+’ and ‘−’ represent the numbers of expanded or contracted gene families, respectively. Blue numbers indicate the estimated divergence times from MYA with 95% confidence intervals (CIs).
Figure 1Characterization of the V. mangachapoi genome. (A) Frequency distribution of 21-mers derived from 57 Gb of cleaned Illumina sequencing reads. (B) Intensity signal heatmap of the Hi-C chromosome.
Figure 3(A) Collinearity within the genome of V. mangachapoi. (B) Density distribution of Ks values between homologous pairs. Vm: V. mangachapoi, Grai: G. raimondii. (C) Separate and average (in red) historical population sizes of three V. mangachapoi populations. g, generation time; μ, substitution rate.
Figure 4Transcriptome analysis of V. mangachapoi. (A) Log-fold difference in differential gene expression between the control group and the drought-stressed group. (B) KEGG enrichment of DEGs between the control and the drought condition in leaves. (C) Phylogenetic analysis of the TPS gene family in V. mangachapoi compared with similar genes in Arabidopsis thaliana.