| Literature DB >> 35822822 |
Emily Amor Stander1, Wesley Williams1,2, Yamkela Mgwatyu1, Peter van Heusden1, Fanie Rautenbach3, Jeanine Marnewick3, Marilize Le Roes-Hill3, Uljana Hesse1,2,4.
Abstract
Rooibos (Aspalathus linearis), widely known as a herbal tea, is endemic to the Cape Floristic Region of South Africa (SA). It produces a wide range of phenolic compounds that have been associated with diverse health promoting properties of the plant. The species comprises several growth forms that differ in their morphology and biochemical composition, only one of which is cultivated and used commercially. Here, we established methodologies for non-invasive transcriptome research of wild-growing South African plant species, including (1) harvesting and transport of plant material suitable for RNA sequencing; (2) inexpensive, high-throughput biochemical sample screening; (3) extraction of high-quality RNA from recalcitrant, polysaccharide- and polyphenol rich plant material; and (4) biocomputational analysis of Illumina sequencing data, together with the evaluation of programs for transcriptome assembly (Trinity, IDBA-Trans, SOAPdenovo-Trans, CLC), protein prediction, as well as functional and taxonomic transcript annotation. In the process, we established a biochemically characterized sample pool from 44 distinct rooibos ecotypes (1-5 harvests) and generated four in-depth annotated transcriptomes (each comprising on average ≈86,000 transcripts) from rooibos plants that represent distinct growth forms and differ in their biochemical profiles. These resources will serve future rooibos research and plant breeding endeavours.Entities:
Keywords: Aspalathus linearis; NGS; RNA-Seq; biochemical screening; bioinformatics; medicinal plants; method evaluation; non-model organism; rooibos; transcriptomics
Year: 2020 PMID: 35822822 PMCID: PMC9258316 DOI: 10.3390/biotech9040019
Source DB: PubMed Journal: BioTech (Basel) ISSN: 2673-6284
Figure 1Diversity of thin-layer chromatography (TLC) banding patterns in the rooibos samples. Bands 1–6 were commonly observed in samples from commercial and wild rooibos plants; bands A and B were only found in selected plants. S: solvent front.
Botanical and growth characteristics of rooibos plants selected for transcriptome analyses.
| Plant | Growth Type | Seeder/Sprouter | Growth Form | Location |
|---|---|---|---|---|
|
| Red type (commercial) | seeder | upright, densely branched bush | S 031° 43′ 18′′ |
|
| Nieuwoudtville sprouter | sprouter | low-growing, densely branched bush | S 031° 45′ 48′′ |
|
| Black type | seeder | tall, slender shrub | S 031° 59′ 21′′ |
|
| Grey sprouter | sprouter | upright, sparsely branched bush | S 032° 37′ 17′′ |
Biochemical characterization of rooibos plants selected for transcriptome analyses (October 2016 samples).
| Plant | Aspalathin | Orientin | Iso-Orientin | Iso-Vitexin | Vitexin | Hyperoside | Luteolin | Quercetin |
|---|---|---|---|---|---|---|---|---|
|
| 66.52 ± 0.68 a | 4.70 ±0.19 a | 6.94 ± 0.05 a | 2.18 ± 0.07 a | 1.98 ± 0.10 a | 2.06 ± 0.09 a | 0.01 ± 0.00 a | 0.01 ± 0.00 a |
|
| 9.24 ± 0.17 b | 1.56 ± 0.04 c | 1.50 ± 0.01 c | 0.17 ± 0.00 c | 0.49 ± 0.01 b | 0.41 ± 0.01 b | 0.21 ± 0.00 b | 0.01 ± 0.00 a |
|
| 0.00 ± 0.00 c | 1.82 ± 0.08 c | 1.71 ± 0.03 c | 0.17 ± 0.00 c | 0.23 ± 0.01 c | 0.32 ± 0.01 b,c | 0.59 ± 0.01 c | 0.01 ± 0.00 a |
|
| 0.00 ± 0.00 c | 2.29 ± 0.06 b | 2.85 ± 0.08 b | 0.69 ± 0.01 b | 0.25 ± 0.00 c | 0.17 ± 0.00 c | 0.58 ± 0.01 c | 0.05 ± 0.00 b |
* Concentrations of major flavonoids are provided in mg/100 g dry weight ± STD (n = 3). Biochemical analyses were conducted using HPLC-DAD. Different letters indicate significant differences as verified using the Tukey test.
Summary of the sequencing data generated from the October 2017 samples of four rooibos ecotypes.
| Plant | A | B | C | D |
|---|---|---|---|---|
|
| 53.0 | 59.0 | 58.0 | 46.0 |
|
| 8.3 | 7.1 | 7.9 | 8.1 |
|
| 437 | 448 | 479 | 452 |
|
| 54.7 | 66.5 | 26.7 | 44.1 |
|
| 54.6 | 65.8 | 26.5 | 43.8 |
|
| 99.9 | 99.1 | 99.2 | 99.2 |
|
| 83.6 | 83.7 | 83.8 | 83.7 |
|
| 30–127 | 30–127 | 30–127 | 30–127 |
Comparison of four different de novo transcriptome assemblies, assembled from the processed reads from sample C.
| Trinity_all | Trinity_longest | IDBA_Trans | SOAPdenovo_Trans | CLC | |
|---|---|---|---|---|---|
|
| 22 | 22 | 4 | 1 | 4 |
|
| 100,778 | 53,363 | 76,784 | 50,503 | 59,716 |
|
| 28,701 | 22,049 | 15,941 | 11,203 | 27,416 |
|
| 27,747 | 14,995 | 19,885 | 13,139 | 18,131 |
|
| 43,922 | 16,145 | 40,701 | 25,795 | 14,046 |
|
| 408 | 174 | 257 | 364 | 123 |
|
| 0 | 0 | 0 | 2 | 0 |
|
| 97.0 | 82.9 | 89.1 | 76.8 | 78.7 |
|
| 82.0 | 66.0 | 75.9 | 58.1 | 59.8 |
|
| 1258 | 1092 | 1230 | 1019 | 923 |
|
| 721 | 1061 | 374 | 825 | 870 |
|
| 537 | 31 | 856 | 194 | 53 |
|
| 73 | 152 | 59 | 229 | 244 |
|
| 109 | 196 | 151 | 192 | 273 |
|
| 2065 | 1278 | 2561 | 1477 | 1223 |
|
| 2.0 | 2.4 | 3.3 | 2.9 | 2.0 |
Comparison of gene-finding algorithms GenemarkS-T, TransDecoder, and Angel.
| Transcriptome (76,784) | Angel | GenemarkS-T | TransDecoder | |
|---|---|---|---|---|
|
| - | 74,767 | 58,284 | 54,205 |
|
| - | 71,791 | 54,754 | 54,205 |
|
| - | 1.04 ± 0.21 | 1.06 ± 0.26 | 1.00 ± 0.00 |
|
| 1230 | 1211 | 1202 | 1200 |
|
| 374 | 396 | 394 | 390 |
|
| 856 | 815 | 808 | 810 |
|
| 59 | 71 | 68 | 74 |
|
| 151 | 158 | 170 | 166 |
Rooibos transcriptomes.
| A | B | C | D | |
|---|---|---|---|---|
|
| 121.87 | 122.11 | 98.02 | 102.83 |
|
| 91,171 | 96,865 | 76,784 | 80,456 |
|
| 20,986 | 24,955 | 15,941 | 18,460 |
|
| 22,161 | 24,463 | 19,885 | 20,767 |
|
| 47,231 | 46,736 | 40,701 | 40,674 |
|
| 793 | 707 | 257 | 547 |
|
| 0 | 4 | 0 | 8 |
|
| 85,234 | 91,301 | 75,426 | 79,234 |
|
| 77.11 | 66.75 | 89.05 | 81.11 |
|
| 60.63 | 51.94 | 75.86 | 63.66 |
|
| 7.63 | 5.60 | 8.74 | 8.57 |
|
| 8.3 | 7.1 | 7.9 | 8.1 |
|
| 1291 | 1221 | 1230 | 1242 |
|
| 383 | 335 | 374 | 391 |
|
| 908 | 886 | 856 | 851 |
|
| 48 | 65 | 59 | 62 |
|
| 101 | 154 | 151 | 136 |
Figure 2Distribution of rooibos-specific and general plant orthologous groups (OGs) across the transcriptomes.
Taxonomic transcript annotations using DIAMOND-NCBI(NR) and Kraken2.
| A (91,171) | B (96,865) | C (76,784) | D (80,456) | |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
| 62,368 | 68.41 | 63,097 | 65.14 | 57,558 | 74.96 | 57,878 | 71.94 |
|
| 2905 | 3.19 | 3036 | 3.13 | 2324 | 3.03 | 2566 | 3.19 |
|
| 4 | 0 | 3140 | 3.24 | 661 | 0.86 | 664 | 0.83 |
|
| 28 | 0.03 | 38 | 0.04 | 24 | 0.03 | 82 | 0.1 |
|
| 15 | 0.02 | 14 | 0.01 | 4 | 0.01 | 8 | 0.01 |
|
| 16 | 0.02 | 333 | 0.34 | 199 | 0.26 | 0 | 0 |
|
| 65,336 | 71.66 | 69,658 | 71.91 | 60,770 | 79.14 | 61,198 | 76.06 |
|
|
|
|
|
|
|
|
|
|
|
| 72,016 | 78.99 | 72,771 | 75.13 | 62,359 | 81.21 | 63,673 | 79.14 |
|
| 387 | 0.42 | 493 | 0.51 | 340 | 0.44 | 344 | 0.43 |
|
| 78 | 0.09 | 1467 | 1.51 | 345 | 0.45 | 293 | 0.36 |
|
| 72,481 | 79.5 | 74,731 | 77.15 | 63,044 | 82.11 | 64,310 | 79.93 |
Number of transcript and protein sequences annotated in each rooibos transcriptome.
| A | B | C | D | |
|---|---|---|---|---|
|
|
|
|
|
|
|
| 65,336 | 69,658 | 60,770 | 61,198 |
|
|
|
|
|
|
|
| 21,040 | 21,126 | 19,210 | 18,646 |
|
|
|
|
|
|
|
| 52,688 | 56,090 | 50,413 | 50,338 |
|
| 26,011 | 28,187 | 25,244 | 24,894 |
|
| 44,390 | 46,868 | 42,244 | 42,258 |
Figure 3KAAS vs. eggNOG-mapper in terms of annotated complete plant-specific KEGG modules covered. The number of KEGG database plant-specific modules that were annotated to completion by rooibos transcripts using KAAS (blue) or rooibos proteins using eggNOG (orange) are compared. The total number of each plant specific module in the KEGG database is depicted in brackets.
Figure 4Established biocomputational pipeline for high-throughput comprehensive plant transcriptome analysis.