| Literature DB >> 31426352 |
Aboozar Soorni1, Tahereh Borna2, Ali Alemardan2, Manohar Chakrabarti3, Arthur G Hunt3, Aureliano Bombarely4,5.
Abstract
Among the Lamiaceae family, the genus Thymus is an economically important genera due to its medicinal and aromatic properties. Most Thymus molecular research has focused on the determining the phylogenetic relationships between different species, but no published work has focused on the evolution of the transcriptome across the genus to elucidate genes involved in terpenoid biosynthesis. Hence, in this study, the transcriptomes of five different Thymus species were generated and analyzed to mine putative genes involved in thymol and carvacrol biosynthesis. High-throughput sequencing produced ~43 million high-quality reads per sample, which were assembled de novo using several tools, then further subjected to a quality evaluation. The best assembly for each species was used as queries to search within the UniProt, KEGG (Kyoto Encyclopedia of Genes and Genomes), COG (Clusters of Orthologous Groups) and TF (Transcription Factors) databases. Mining the transcriptomes resulted in the identification of 592 single-copy orthogroups used for phylogenetic analysis. The data showed strongly support a close genetic relationship between Thymus vulgaris and Thymus daenensis. Additionally, this study dates the speciation events between 1.5-2.1 and 9-10.2 MYA according to different methodologies. Our study provides a global overview of genes related to the terpenoid pathway in Thymus, and can help establish an understanding of the relationship that exists among Thymus species.Entities:
Keywords: Thymus; phylotranscriptome; terpenoid biosynthesis; transcriptome
Mesh:
Substances:
Year: 2019 PMID: 31426352 PMCID: PMC6723042 DOI: 10.3390/genes10080620
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Mean (three replicate measurements per plant sample) 2C-value and genome size of Thymus accessions.
| Species | Location (Province) | 1C-Value ± SD (pg) | Haploid Size (Mbp) | Estimated Polyploidy |
|---|---|---|---|---|
|
| East Azerbaijan | 0.63 ± 0.02 | 616.14 | 2n = 2x |
|
| West Azerbaijan | 1.11 ± 0.01 | 1085.84 | 2n = 4x |
|
| West Azerbaijan | 0.61 ± 0.01 | 596.72 | 2n = 2x |
|
| Lorestan | 0.56 ± 0.01 | 547.81 | 2n = 2x |
|
| Fereydunshahr | 0.55 ± 0.27 | 538.03 | 2n = 2x |
|
| Arak | 0.54 ± 0.28 | 528.25 | 2n = 2x |
|
| West Azerbaijan | 1.08 ± 0.01 | 1056.49 | 2n = 2x |
|
| Qazvin | 1.09 ± 0.05 | 1066.28 | 2n = 2x |
|
| Qazvin | 1.09 ± 0.04 | 1066.28 | 2n = 2x |
|
| Isfahan | 0.56 ± 0.01 | 547.81 | 2n = 2x |
|
| Kordestan | 1.11 ± 0.04 | 1085.84 | 2n = 4x |
|
| Cultivated | 0.82 ± 0.02 | 802.15 | 2n = 2n |
|
| Semnan | 0.61 ± 0.01 | 596.72 | 2n = 2x |
|
| Yazd | 1.17 ± 0.02 | 1144.54 | 2n = 4x |
|
| Khorasan | 1.09 ± 0.02 | 1066.28 | 2n = 2x |
|
| Yazd | 1.07 ± 0.02 | 1046.71 | 2n = 2x |
|
| Yazd | 1.14 ± 0.02 | 1115.19 | 2n = 2x |
|
| Kerman | 1.08 ± 0.02 | 1056.49 | 2n = 2x |
Figure 1Venn diagram showing the number of overlapping proteins between the five Thymus transcriptome assemblies.
Figure 2Gene ontology (GO) classification map derived from Thymus transcriptomes. The results are summarized in three GO categories: biological process, molecular function, and cellular component. The x-axis indicates the percentage of sequences in each category.
Figure 3COG classification map. The abscissa represents 24 COG group names, while the vertical axis represents the number of unigenes annotated into each group.
Figure 4Distribution of transcripts in different transcription factor (TF) families showing over-representation in bHLH.
Figure 5Maximum likelihood (ML) phylogeny of putative TPS genes in Thymus species compared to other species of TPS subfamilies.
Figure 6Phylogenetic relationships among five Thymus species inferred from 592 single-copy orthogroups. (A) Maximum likelihood tree. (B) Bayesian tree.
Figure 7Ks distribution plots for T. daenensis and T. persicus. The components are (1) background gene duplications (green), (2) putative whole-genome duplications (purple), (3) alleles or sequencing errors resulting in high identity transcripts (blue), and (4) potential whole-genome duplications with low support (black).