| Literature DB >> 26427005 |
Su-Fang Chen1, Ming-Wan Li1, Hui-Juan Jing1, Ren-Chao Zhou1, Gui-Li Yang2, Wei Wu3, Qiang Fan1, Wen-Bo Liao1.
Abstract
Many Firmiana species are locally endemic, providing an interesting system for studying adaptation and speciation. Among these species, F. danxiaensis is a tree species endemic to Mount Danxia in Guangdong, China, which is an area known for presenting the Danxia landform. How F. danxiaensis could have adapted to the stressful environment of rocky cliffs covered with barren soils in the Danxia landform is still unknown. In this study, we performed de novo assembly of the transcriptome of F. danxiaensis, obtaining 47,221 unigenes with an N50 value of 987 bp. Homology analysis showed that 32,318 of the unigenes presented hits in the NCBI non-redundant database, and 31,857 exhibited significant matches with the protein database of Theobroma cacao. Gene Ontology (GO) annotation showed that hundreds of unigenes participated in responses to various stresses or nutritional starvation, which may help us to understand the adaptation of F. danxiaensis to Danxia landform. Additionally, we found 263 genes related to responses to Cd, partially explaining the high accumulation of Cd observed in Firmiana species. The EuKaryotic Orthologous Groups (KOG) and Kyoto Encyclopedia of Genes and Genomes (KEGG) annotations revealed many genes playing roles in the biosynthesis of secondary metabolites and environmental adaptation, which may also contribute to the survivor and success of Firmiana species in extreme environments. Based on the obtained transcriptome, we further identified a Firmiana-specific whole-genome duplication event that occurred approximately 20 Mya, which may have provided raw materials for the diversification of Firmiana species.Entities:
Mesh:
Year: 2015 PMID: 26427005 PMCID: PMC4591120 DOI: 10.1371/journal.pone.0139373
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1The length distribution of the assembled transcripts or unigenes.
Fig 2Similarity and number of matches to A. thaliana, T. cacao and G. raimondii.
Fig 3Length distribution of those unigenes unmatched to NR database.
Fig 4Length coverage distribution of F. danxiaensis’s transcriptome to known proteins of the NR database.
Fig 5GO classification of the assembled unigenes of F. danxiaensis and the coding sequences of T. cacao.
Fig 6KOG functional classification of the assembled unigenes of F. danxiaensis and the protein sequences of T. cacao.
Fig 7KEGG pathway assignment of the assembled unigenes of F. danxiaensis and the protein sequences of T. cacao.
Fig 8K frequency plots for paralogous gene pairs from F. danxiaensis.
K distribution components estimated using EMMIX are superimposed on the histogram. These components are hypothesized to represent background gene duplications (green) and gene duplication associated with polyploidy events (red).
Fig 9Estimated AIC and BIC values under various number of distinct components with EMMIX program.
Distribution and divergence estimates.
| No. of distributions | Natural log (Ln) of the mean | Median of distribution | Divergnce estimate (Mya) |
|---|---|---|---|
| 2 | -1.383 (0.074) | 0.251 | 20.6 |
| -1.2264 (0.828) | 0.293 | ||
| 3 | -1.518 (0.402) | 0.219 | |
| -1.366 (0.068) | 0.255 | 20.9 | |
| 0.287 (0.044) | 1.332 | 109.0 |
a The median was determined from back-transformation of the natural log (Ln) of the mean (standard deviation in parentheses)
b Divergence estimates were calculated from the median of the distribution (Mya, million years ago)