| Literature DB >> 26633323 |
Yue-Mei Zhao1,2, Tao Zhou3, Zhong-Hu Li4, Gui-Fang Zhao5.
Abstract
Gynostemma pentaphyllum is an important medicinal herb of the Cucurbitaceae family, but limited genomic data have hindered genetic studies. In this study, transcriptomes of two closely-related Gynostemma species, Gynostemma cardiospermum and G. pentaphyllum, were sequenced using Illumina paired-end sequencing technology. A total of 71,607 nonredundant unigenes were assembled. Of these unigenes, 60.45% (43,288) were annotated based on sequence similarity search with known proteins. A total of 11,059 unigenes were identified in the Kyoto Encyclopedia of Genes and Genomes Pathway (KEGG) database. A total of 3891 simple sequence repeats (SSRs) were detected in 3526 nonredundant unigenes, 2596 primer pairs were designed and 360 of them were randomly selected for validation. Of these, 268 primer pairs yielded clear products among six G. pentaphyllum samples. Thirty polymorphic SSR markers were used to test polymorphism and transferability in Gynostemma. Finally, 15 SSR makers that amplified in all 12 Gynostemma species were used to assess genetic diversity. Our results generated a comprehensive sequence resource for Gynostemma research.Entities:
Keywords: EST-SSR; Gynostemma; de novo assembly; illumina paired-end sequencing; transcriptome
Mesh:
Substances:
Year: 2015 PMID: 26633323 PMCID: PMC6332360 DOI: 10.3390/molecules201219758
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Transcriptome reads and assembled contigs information for two Gynostemma species.
| Species | Total Reads | Total Clean Nucleotides (Nt) | Q30 Percentage | GC Percentage | Total Number of Contigs | Total Length of Contigs (Nt) | N50 of Contigs | Mean (Nt) | |
|---|---|---|---|---|---|---|---|---|---|
| 43,175,448 | 4,360,277,191 | 80.16% | 43.55% | 1,488,035 | 110,745,998 | 71 | 74 | ||
| 52,782,146 | 5,330,256,148 | 82.69% | 44.00% | 1,911,378 | 136,726,651 | 66 | 71 | ||
Figure 1Frequency distribution of the contig sizes from two Gynostemma species. The frequency distribution of contig sizes resulting from Illumina HiSeq™ 2000 sequencing, as assembled using Trinity.
Summary of the unigenes from two Gynostemma species.
| Unigene Source | Total Number of Unigenes | Total Length of Unigenes (Nt) | Mean Length of Unigenes (Nt) | N50 of Unigenes |
|---|---|---|---|---|
| 40,257 | 35,161,843 | 873.43 | 1516 | |
| 44,000 | 37,234,004 | 846.23 | 1504 | |
| All | 71,607 | 61,367,129 | 857.00 | 1535 |
Figure 2Comparison of unigene length between hit and not-hit unigenes. Longer unigenes were more likely to have BLAST matches in protein databases.
Figure 3Gene Ontology classification of assembled unigenes. The results are summarized in three main categories: Biological Process, Cellular Component and Molecular Function. In total, 35,968 unigenes with BLAST matches to known proteins were assigned to gene ontology terms.
Figure 4Histogram presentation of clusters of orthologous groups (COG). In total, 20,585 sequences were assigned to 25 COG classifications.
Figure 5Kyoto Encyclopedia of Genes and Genomes (KEGG) classification of unigenes. 5,443 unigenes were assigned to the top 20 pathways in KEGG.
List of triterpene saponin biosynthesis-related genes in the Gynostemma transcriptome.
| Gene ID | Length | KO ID | Annotation |
|---|---|---|---|
| T3_Unigene_BMK.23540 | 2373 | K01662 | 1-deoxyxylulose-5-phosphate synthase, (EC:2.2.1.7) |
| T3_Unigene_BMK.25990 | 2720 | K01662 | 1-deoxyxylulose-5-phosphate synthase, (EC:2.2.1.7) |
| T3_Unigene_BMK.37882 | 301 | K01662 | 1-deoxyxylulose-5-phosphate synthase, (EC:2.2.1.7) |
| T4_Unigene_BMK.23719 | 2489 | K01662 | 1-deoxyxylulose-5-phosphate synthase, (EC:2.2.1.7) |
| T4_Unigene_BMK.30793 | 3057 | K01662 | 1-deoxyxylulose-5-phosphate synthase, (EC:2.2.1.7) |
| T4_Unigene_BMK.33182 | 2483 | K01662 | 1-deoxyxylulose-5-phosphate synthase, (EC:2.2.1.7) |
| CL10430Contig1 | 2754 | K01662 | 1-deoxyxylulose-5-phosphate synthase, (EC:2.2.1.7) |
| T3_Unigene_BMK.15665 | 2671 | K01662 | 1-deoxyxylulose-5-phosphate synthase, (EC:2.2.1.7) |
| CL9143Contig1 | 1586 | K00919 | 4-diphosphocytidyl-2- |
| T3_Unigene_BMK.22099 | 359 | K03526 | 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase, (EC:1.17.7.1) |
| T3_Unigene_BMK.28355 | 906 | K03527 | 4-hydroxy-3-methylbut-2-enyl diphosphate reductase, (EC:1.17.1.2) |
| CL12440Contig1 | 616 | K03527 | 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate reductase, (EC:1.17.1.2) |
| T3_Unigene_BMK.9388 | 411 | K03527 | 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate reductase, (EC:1.17.1.2) |
| CL12699Contig1 | 1973 | K01823 | isopentenyl-diphosphate delta-isomerase, (EC:5.3.3.2) |
| T4_Unigene_BMK.25472 | 575 | K01823 | isopentenyl-diphosphate delta-isomerase, (EC:5.3.3.2) |
| CL7433Contig1 | 750 | K01823 | isopentenyl-diphosphate delta-isomerase, (EC:5.3.3.2) |
| T3_Unigene_BMK.21000 | 1177 | K13789 | geranyl geranyl pyrophosphate synthase, (EC:2.5.1.29) |
| T4_Unigene_BMK.20525 | 1569 | K14066 | geranylgeranyl pyrophosphate synthase, (EC:2.5.1.30) |
| T3_Unigene_BMK.18996 | 1754 | K14066 | geranylgeranyl pyrophosphate synthase, (EC:2.5.1.30) |
| T3_Unigene_BMK.35842 | 305 | K00626 | acetyl-CoA acetyltransferase, mitochondrial, (EC:2.3.1.9) |
| T3_Unigene_BMK.35857 | 278 | K00626 | acetyl-CoA acetyltransferase, mitochondrial, (EC:2.3.1.9) |
| T4_Unigene_BMK.23349 | 1785 | K00626 | acetyl-CoA C-acetyltransferase, (EC:2.3.1.9) |
| T3_Unigene_BMK.18569 | 1657 | K00626 | acetyl-CoA C-acetyltransferase, (EC:2.3.1.9) |
| CL11048Contig1 | 2014 | K01641 | hydroxymethylglutaryl-CoA synthase, (EC:2.3.3.10) |
| T3_Unigene_BMK.2712 | 475 | K00021 | hmg-CoA reductase, (EC:1.1.1.34) |
| CL14352Contig1 | 2368 | K00021 | hmg-CoA reductase, (EC:1.1.1.34) |
| T4_Unigene_BMK.28606 | 1468 | K00021 | hmg-CoA reductase, (EC:1.1.1.34) |
| T4_Unigene_BMK.28692 | 322 | K00021 | hmg-CoA reductase, (EC:1.1.1.34) |
| T3_Unigene_BMK.22340 | 296 | K00021 | hmg-CoA reductase, (EC:1.1.1.34) |
| T3_Unigene_BMK.3271 | 467 | K00869 | mevalonate kinase, (EC:2.7.1.36) |
| CL13115Contig1 | 526 | K00869 | mevalonate kinase, (EC:2.7.1.36) |
| T4_Unigene_BMK.23148 | 1576 | K00869 | mevalonate kinase, (EC:2.7.1.36) |
| T4_Unigene_BMK.26974 | 522 | K00869 | mevalonate kinase, (EC:2.7.1.36) |
| T3_Unigene_BMK.14474 | 1691 | K00869 | mevalonate kinase, (EC:2.7.1.36) |
| T3_Unigene_BMK.22024 | 1903 | K00938 | Phosphomevalonate kinase, (EC:2.7.4.2) |
| T4_Unigene_BMK.29327 | 2282 | K01597 | diphosphomevalonate decarboxylase, (EC:4.1.1.33) |
| T3_Unigene_BMK.15864 | 1831 | K01597 | diphosphomevalonate decarboxylase, (EC:4.1.1.33) |
| T3_Unigene_BMK.5322 | 215 | K11778 | undecaprenyl pyrophosphate synthetase, (EC:2.5.1.31) |
| T4_Unigene_BMK.33183 | 1237 | K11778 | undecaprenyl pyrophosphate synthetase, (EC:2.5.1.31) |
| T4_Unigene_BMK.21428 | 1774 | K00801 | farnesyl-diphosphate farnesyltransferase, (EC:2.5.1.21) |
| T3_Unigene_BMK.3061 | 271 | K00511 | Squalene monooxygenase, (EC:1.14.99.7) |
| CL1336Contig1 | 2048 | K00511 | Squalene monooxygenase, (EC:1.14.99.7) |
KO: KEGG Orthology.
Figure 6Frequency distribution of the Gynostemma expressed sequence tag (EST)-SSRs of different sizes. (A) Unit size; (B) Number of repeats; (C) SSR locus length.
Frequency distribution of the di- and tri-nucleotide repeat motifs in the Gynostemma.
| Serial No. | Repeat Motif | Number of Reiterations of the Motif | Total | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | |||
| 1 | AG/CT | # | 456 | 285 | 217 | 189 | 192 | 81 | 3 | 1423 |
| 2 | AAG/CTT | 576 | 358 | 137 | 6 | 0 | 0 | 0 | 0 | 1077 |
| 3 | ATC/ATG | 159 | 74 | 25 | 0 | 0 | 0 | 0 | 0 | 258 |
| 4 | AT/AT | # | 95 | 54 | 36 | 18 | 14 | 2 | 1 | 220 |
| 5 | AAT/ATT | 97 | 50 | 19 | 2 | 0 | 0 | 0 | 0 | 168 |
| 6 | AGG/CCT | 90 | 41 | 12 | 1 | 0 | 0 | 0 | 0 | 144 |
| 7 | AGC/CTG | 83 | 24 | 9 | 1 | 0 | 0 | 0 | 0 | 117 |
| 8 | ACC/GGT | 52 | 32 | 6 | 1 | 0 | 0 | 0 | 0 | 91 |
| 9 | AAC/GTT | 49 | 24 | 14 | 2 | 0 | 0 | 0 | 0 | 89 |
| 13 | CCG/CGG | 49 | 17 | 3 | 0 | 0 | 0 | 0 | 0 | 69 |
| 10 | AC/GT | # | 28 | 17 | 10 | 4 | 3 | 0 | 1 | 63 |
| 11 | ACG/CGT | 24 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 34 |
| 12 | ACT/AGT | 15 | 5 | 2 | 2 | 0 | 0 | 0 | 0 | 24 |
| 14 | CG/CG | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| 15 | Other motifs * | 95 | 18 | 0 | 0 | 0 | 0 | 0 | 0 | 113 |
| Total | 1289 | 1233 | 583 | 278 | 211 | 209 | 83 | 5 | 3891 | |
Note: * indicated tetra-, penta-, and hexa-nucleotide motifs in our study; # means that this item was not considered when detecting EST-SSRs.
Details of 15 genic-SSR loci showing polymorphism among 12 Gynostemma species.
| Primer Name | Base on Sequence ID | SSRs | EPS (bp) | OPS (bp) | Alleles | PIC |
|---|---|---|---|---|---|---|
| G-EST-SSR19 | CL10682Contig1 | (AT)6 | 149 | 149–161 | 5 | 0.67 |
| G-EST-SSR20 | CL10765Contig1 | (GAA)5 | 151 | 148–163 | 8 | 0.82 |
| G-EST-SSR40 | CL11560Contig1 | (TCT)5 | 151 | 151–160 | 4 | 0.68 |
| G-EST-SSR44 | CL13255Contig1 | (AG)6 | 164 | 162–196 | 11 | 0.87 |
| G-EST-SSR47 | CL13343Contig1 | (TCT)6 | 148 | 151–172 | 8 | 0.79 |
| G-EST-SSR57 | CL14153Contig1 | (TGA)5 | 149 | 152–170 | 6 | 0.74 |
| G-EST-SSR75 | CL400Contig1 | (TCA)6 | 172 | 172–202 | 7 | 0.80 |
| G-EST-SSR76 | CL435Contig1 | (TCT)5 | 131 | 122–137 | 6 | 0.76 |
| G-EST-SSR89 | CL1983Contig1 | (GAAA)5 | 146 | 146–166 | 6 | 0.79 |
| G-EST-SSR100 | CL2782Contig1 | (GAA)6 | 145 | 136–163 | 9 | 0.79 |
| G-EST-SSR118 | CL8580Contig1 | (CGG)5 | 141 | 138–153 | 6 | 0.62 |
| G-EST-SSR131 | CL9957Contig1 | (TCT)5 | 149 | 149–158 | 5 | 0.55 |
| G-EST-SSR140 | CL10320Contig1 | (CTA)6 | 158 | 158–170 | 6 | 0.73 |
| G-EST-SSR306 | CL11525Contig1 | (CT)7 | 169 | 169–189 | 6 | 0.73 |
| G-EST-SSR316 | CL12699Contig1 | (GAA)5 | 135 | 135–159 | 8 | 0.66 |
| Mean | - | - | - | 6.73 | 0.73 |
EPS: expected product size; OPS: observed product size.
The 26 individual plants (belonging to 12 species) used for validation and the genetic diversity study.
| No. | Species | Locality, Province | Latitude (N), longitude (E) | Characteristics |
|---|---|---|---|---|
| 1 | Ankang, Shaanxi | 32°25′N,109°04′E | wild | |
| 2 | Ankang, Shaanxi | 32°25′N,109°04′E | Cultivar | |
| 3 | Kunming, Yunnan | 25°14′N,102°49′E | Wild | |
| 4 | Kunming, Yunnan | 25°14′N,102°49′E | Cultivar | |
| 5 | Panzhihua, Sichuan | 26°36′N,101°43′E | Wild | |
| 6 | Panzhihua, Sichuan | 26°36′N,101°43′E | Cultivar | |
| 7 | Dehong, Yunnan | 24°48′N, 98°17′E | Wild | |
| 8 | Menghai, Yunnan | 22°02′N, 100°22′E | Wild | |
| 9 | Dehong, Yunnan | 24°36′N, 97°39′E | Wild | |
| 10 | Mengla, Yunnan | 22°14′N, 101°15′E | Wild | |
| 11 | Jinghong, Yunnan, | 22°01′N, 100°45′E | Wild | |
| 12 | Xuancheng, Anhui | 30°55′N,118°46′E | Wild | |
| 13 | Ankang, Shaanxi | 32°25′N,109°22′E | Wild | |
| 14 | Enshi, Hubei | 30°03′N, 109°49′E | Wild | |
| 15 | Zhaotong, Yunnan | 27°43′N,103°54′E | Wild | |
| 16 | Menghai, Yunnan | 21°56′N, 100°36′E | Wild | |
| 17 | Menglun, Yunnan | 21°56′N, 101°14′E | Wild | |
| 18 | Yingjiang, Yunnan | 24°42′N, 97°55′E | Wild | |
| 19 | Enshi, Hubei | 30°18′N, 109°31′E | Wild | |
| 20 | Jiujiang, Jiangxi, | 29°17′N, 115°07′E | Wild | |
| 21 | Pu’er, Yunnan | 23°07′N, 100°22′E | Wild | |
| 22 | Xuancheng, Anhui | 30°41′N,118°39′E | Wild | |
| 23 | Tongling, Anhui | 30°57′N,117°48′E | Wild | |
| 24 | Renhuai, Guizhou | 27°48′N, 106°26′E | Wild | |
| 25 | Ankang, Shaanxi | 32°13′N,109°01′E | Wild | |
| 26 | Ankang, Shaanxi | 32°13′N,109°01′E | Wild |
Figure 7Genetic relationships among Gynostemma species based on EST-SSR markers. Genetic relationships among 26 individual plants. The scale at the bottom of the dendrogram indicates the level of similarity between the genotypes. Bootstrap values (>50) were labeled on the branches from 1000 re-samplings.