| Literature DB >> 36028808 |
Shujie Dong1,2, Manjia Zhou2, Jinxing Zhu3, Qirui Wang2, Yuqing Ge4, Rubin Cheng5,6.
Abstract
BACKGROUND: Tetrastigma hemsleyanum is a valuable traditional Chinese medicinal plant widely distributed in the subtropical areas of China. It belongs to the Cayratieae tribe, family Vitaceae, and exhibited significant anti-tumor and anti-inflammatory activities. However, obvious differences were observed on the quality of T. hemsleyanum root from different regions, requiring the discrimination strategy for the geographical origins. RESULT: This study characterized five complete chloroplast (cp) genomes of T. hemsleynum samples from different regions, and conducted a comparative analysis with other representing species from family Vitaceae to reveal the structural variations, informative markers and phylogenetic relationships. The sequenced cp genomes of T. hemsleyanum exhibited a conserved quadripartite structure with full length ranging from 160,124 bp of Jiangxi Province to 160,618 bp of Zhejiang Province. We identified 112 unique genes (80 protein-coding, 28 tRNA and 4 rRNA genes) in the cp genomes of T. hemsleyanum with highly similar gene order, content and structure. The IR contraction/expansion events occurred on the junctions of ycf1, rps19 and rpl2 genes with different degrees, causing the differences of genome sizes in T. hemsleyanum and Vitaceae plants. The number of SSR markers discovered in T. hemsleyanum was 56-57, exhibiting multiple differences among the five geographic groups. Phylogenetic analysis based on conserved cp genome proteins strongly grouped the five T. hemsleyanum species into one clade, showing a sister relationship with T. planicaule. Comparative analysis of the cp genomes from T. hemsleyanum and Vitaceae revealed five highly variable spacers, including 4 intergenic regions and one protein-coding gene (ycf1). Furthermore, five mutational hotspots were observed among T. hemsleyanum cp genomes from different regions, providing data for designing DNA barcodes trnL and trnN. The combination of molecular markers of trnL and trnN clustered the T. hemsleyanum samples from different regions into four groups, thus successfully separating specimens of Sichuan and Zhejiang from other areas.Entities:
Keywords: Chloroplast genome; DNA barcoing markers; Geographical origins; Nucleotide diversity; Phylogenetic relationships; Tetrastigma hemsleyanum
Mesh:
Year: 2022 PMID: 36028808 PMCID: PMC9412808 DOI: 10.1186/s12864-022-08755-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Fig. 1The geographical distributions and morphological characteristics of Tetrastigma hemsleyanum from different regions. A Regional distribution map of Tetrastigma hemsleyanum sample collection (B) The microscopic characters of the dried roots from Tetrastigma hemsleyanum: 1. brown patches; 2. marginal orifice catheter; 3. cluster crystals of calcium oxalate; 4. needle crystals of calcium oxalate; 5. starch granules; 6. cork cell. C Comparison of macroscopic characters of the root tubers of Tetrastigma hemsleyanum from five different regions
Fig. 2The chloroplast genome map of Tetrastigma hemsleyanum from five different regions. Genes labeled inside the circle are transcribed clockwise, while those outside the circle are transcribed anti-clockwise. The tick lines exhibited the extent of the Inverted Repeats (IRA and IRB) separating the Large Single-Copy (LSC) and the Small Single-Copy (SSC) regions. The darker gray and the lighter gray in the inner circle corresponds to GC and AT content, respectively. Genes with different functions are represented in different colors
List of genes annotated in the chloroplast genomes of Tetrastigma hemsleyanum
| Classification of genes | Gene Names | Number |
|---|---|---|
| Photosystem I | 5 | |
| Photosystem II | 15 | |
| Cytochrome b/f complex | 6 | |
| ATP synthase | 6 | |
| NADH dehydrogenase | 11 | |
| RubisCO large subunit | 1 | |
| RNA polymerase | 4 | |
| Ribosomal proteins (SSU) | 12 | |
| Ribosomal proteins (LSU) | 9 | |
| Ribosomal RNAs | 4 | |
| Protein of unknown function | 5 | |
| Transfer RNAs | 28 | |
| Other genes | 6 | |
| Total | 112 |
*indicate a duplicated gene
Statistics on the basic feature of the cp genomes of five T. hemsleyanum plants and three Vitaceae species
| Characteristics | ||||||||
|---|---|---|---|---|---|---|---|---|
| Zhejiang | Fujian | Guangxi | Sichuan | Jiangxi | ||||
| Genbank accession No. | MW375707 | MW375708 | MW375709 | MW375710 | MW375711 | MW401672 | NC_042235 | NC_007957 |
| Total length (bp) | 160,518 | 160,152 | 160,153 | 160,127 | 160,124 | 160,323 | 161,430 | 160,928 |
| LSC length (bp) | 89,298 | 88,183 | 88,798 | 88,131 | 88,142 | 88,181 | 89,626 | 89,147 |
| SSC length (bp) | 18,968 | 18,965 | 18,965 | 18,962 | 18,962 | 19,096 | 18,977 | 19,065 |
| IR length (bp) | 26,126 | 26,502 | 26,195 | 26,517 | 26,510 | 26,523 | 26,413 | 26,358 |
| Gene number (bp) | 112 | 112 | 112 | 112 | 112 | 113 | 114 | 113 |
| Gene number in IR regions | 19 | 19 | 19 | 19 | 19 | 18 | 19 | 18 |
| Protein-coding gene number | 80 | 80 | 80 | 80 | 80 | 79 | 80 | 79 |
| rRNA gene number | 4 | 4 | 4 | 4 | 4 | 4 | 4 | 4 |
| tRNA gene number | 28 | 28 | 28 | 28 | 28 | 30 | 30 | 30 |
| Total GC content (%) | 37.50 | 37.50 | 37.50 | 37.52 | 37.52 | 37.49 | 37.32 | 37.40 |
Fig. 3Comparison of the LSC, IR, and SSC junction regions among five Tetrastigma hemsleyanum samples with different geographical origins, Tetrastigma planicaule, Ampelopsis japonica and Vitis vinifera cp genomes
The number and types of SSR in five T. hemsleyanum plants and three Vitaceae species
| SSR type | Repeat unit | Amount | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Mono | A/T | 42 | 41 | 41 | 42 | 42 | 39 | 45 | 34 |
| C/G | / | 1 | 1 | 1 | / | / | / | 1 | |
| Di | AG/CT | 1 | 1 | 1 | 1 | 1 | 1 | / | / |
| AT/AT | 10 | 10 | 10 | 10 | 10 | 11 | 13 | 8 | |
| Tri | AAT/ATT | / | / | / | / | / | / | 2 | 4 |
| AGC/CTG | / | / | / | / | / | / | 1 | 1 | |
| AAG/CTT | / | / | / | / | / | / | 1 | / | |
| Tetra | AAAT/ATTT | 2 | 2 | 2 | 2 | 2 | 2 | 2 | 1 |
| AATC/ATTG | / | / | / | / | / | / | 1 | 1 | |
| AATT/AATT | 1 | 1 | 1 | 1 | 1 | 2 | / | / | |
| ACAT/ATGT | / | / | / | / | / | / | / | 1 | |
| AGAT/ATCT | / | / | / | / | / | / | 1 | 2 | |
| Penta | AGGAT/ATCCT | / | / | / | / | / | / | 1 | |
| AAAAT/ATTTT | / | / | / | / | / | 1 | / | ||
| AATAT/ATATT | / | / | / | / | / | 2 | / | ||
| Total | 56 | 56 | 56 | 57 | 56 | 55 | 69 | 54 | |
Fig. 4Analysis of simple sequence repeats (SSRs) in Vitaceae plants and Tetrastigma hemsleyanum species with different geographical origins. A The number of different types of SSRs in five samples of Tetrastigma hemsleyanum from different regions. B The number of different types of SSRs in the cp genomes of Tetrastigma hemsleyanum sample from Jiangxi Province, Tetrastigma planicaule, Ampelopsis japonica and Vitis vinifera
Number of the RNA editing sites in the cp genome of T. hemsleyanum and three Vitaceae species
| Gene | Number of RNA editing sites | |||
|---|---|---|---|---|
| 1 | 1 | 2 | 1 | |
| 3 | 3 | 3 | 3 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | |
| 4 | 4 | 4 | 4 | |
| 4 | 4 | 4 | 4 | |
| 11 | 11 | 12 | 11 | |
| 8 | 8 | 8 | 8 | |
| 7 | 7 | 6 | 6 | |
| 3 | 3 | 3 | 3 | |
| 2 | 2 | 2 | 2 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | |
| 1 | 1 | 1 | 1 | |
| 2 | 2 | 2 | 2 | |
| 5 | 5 | 5 | 5 | |
| 2 | 2 | 2 | 2 | |
| 4 | 4 | 4 | 4 | |
| 2 | 2 | 2 | 2 | |
| 2 | 2 | 2 | 2 | |
| 3 | 3 | 3 | 3 | |
| Total | 71 | 71 | 72 | 70 |
The value of Ka/Ks in 25 protein coding genes with RNA editing sites in T. hemsleyanum (Jiangxi)
| Gene | Number of RNA editing sites | non-synonymous substitutions (Ka) | synonymous substitutions (Ks) | Ka/Ks |
|---|---|---|---|---|
| 11 | 0.0113 | 0.0243 | 0.4650 | |
| 8 | 0.0516 | 0.3257 | 0.1584 | |
| 7 | 0.1087 | 0.3022 | 0.3597 | |
| 5 | 0.0240 | 0.2243 | 0.1070 | |
| 4 | 0.0383 | 0.2928 | 0.1308 | |
| 4 | 0.1296 | 0.2342 | 0.5534 | |
| 4 | 0.0704 | 0.2663 | 0.2644 | |
| 3 | 0.0491 | 0.2718 | 0.1807 | |
| 3 | 0.0334 | 0.2751 | 0.1214 | |
| 3 | 0.1109 | 0.1950 | 0.5687 | |
| 2 | 0.0700 | 0.1879 | 0.3725 | |
| 2 | 0.0267 | 0.2567 | 0.1040 | |
| 2 | 0.0124 | 0.1742 | 0.0712 | |
| 2 | 0.0111 | 0.2074 | 0.0535 | |
| 2 | 0.0263 | 0.1387 | 0.1896 | |
| 1 | 0.1093 | 0.2714 | 0.4027 | |
| 1 | 0.0469 | 0.1480 | 0.3169 | |
| 1 | 0.0263 | 0.1612 | 0.1631 | |
| 1 | 0.0844 | 0.2810 | 0.3004 | |
| 1 | 0 | 0 | 0 | |
| 1 | 0.0053 | 0.2050 | 0.0259 | |
| 1 | 0 | 0.0934 | 0 | |
| 1 | 0 | 0 | 0 | |
| 1 | 0.0709 | 0.1643 | 0.4315 |
Fig. 5Phylogenetic relationships based on the conserved chloroplast protein coding genes from five samples of Tetrastigma hemsleyanum and other representative Vitaceae species. A The tree was constructed using maximum likelihood (ML) method. B The tree was constructed using maximum parsimony (MP) method. The number above each node referred to the bootstrap value from 500 replicates. The areas corresponding to blue, lavender and yellow represented the tribes of Viteae, Ampelopsideae and Cayratieae, respectively. Melaleuca alternifolia and Melaleuca cajuputi were set as the outgroups for phylogenetic analysis
Fig. 6Comparison of potential mutational hotspots in the complete chloroplast genomes among Vitaceae plants and Tetrastigma hemsleyanum samples from different regions. A Nucleotide diversity (Pi) analysis among four Vitaceae chloroplast genomes. B Nucleotide diversity (Pi) analysis in the cp genomes of Tetrastigma hemsleyanum from five different regions. Sliding window length was 800 bp and step size was selected as 200 bp. The Pi value of each window is shown on the Y-axis, and their positions of the midpoint represented on the X-axis
Multiple analysis of the mutational hotspots in four Vitaceae plants
| mutational hotspots | Specises | Length | GC content | Number of SNP sites | Total length of Gaps |
|---|---|---|---|---|---|
| 1139 bp | 23.09% | / | / | ||
| 1141 bp | 23.14% | 5 | 4 | ||
| 1208 bp | 20.86% | 149 | 167 | ||
| 1076 bp | 21.84% | 98 | 138 | ||
| 895 bp | 35.08% | / | / | ||
| 868 bp | 35.37% | 79 | 39 | ||
| 860 bp | 33.37% | 60 | 77 | ||
| 843 bp | 34.28% | 41 | 100 | ||
| 892 bp | 24.33% | / | / | ||
| 883 bp | 24.46% | 8 | 19 | ||
| 907 bp | 24.33% | 104 | 95 | ||
| 911 bp | 22.39% | 115 | 69 | ||
| 1031 bp | 33.56% | / | / | ||
| 1029 bp | 33.92% | 3 | 2 | ||
| 1113 bp | 33.33% | 97 | 128 | ||
| 1123 bp | 33.93% | 97 | 146 | ||
| 977 bp | 30.60% | / | / | ||
| 977 bp | 30.40% | 6 | 0 | ||
| 965 bp | 29.95% | 77 | 12 | ||
| 980 bp | 30.61% | 67 | 15 |
Fig. 7Comparative analysis of trnL and trnN sequences of T. hemsleyanum samples (A) Agarose gel electrophoresis of PCR products of five DNA barcodes from T. hemleyanum in Zhejiang Province. B Genetic distance analysis between the samples of T. hemsleyanum in Jiangxi Province and other regions, three representative Vitaceae species as well as two Melaleuca species based on trnL sequence, (C) trnN sequence and (D) combination of trnL + trnN sequences. The Neighbor-Joining (NJ) trees of 21 samples of T. hemsleyanum from different regions and 3 representative Vitaceae species, based on trnL sequence (E) and trnN sequence (F)
Sequence analysis of T. hemsleyanum samples and other Vitaceae species basing on two DNA barcodes
| Species | Sample number | Sample source | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Length | GC content | Genbank accession No. | Haplotype | Length | GC content | Genbank accession No. | Haplotype | |||
| JT-01 | Lushan District, Jiujiang, Jiangxi 25°08′N, 117°02′E | 1143 bp | 38.85% | MZ995437 | A1 | 469 bp | 43.50% | MZ995468 | B1 | |
| JT-02 | MZ995438 | A1 | MZ995469 | B1 | ||||||
| JT-03 | MZ995439 | A1 | MZ995470 | B1 | ||||||
| JT-04 | ON561859 | A1 | ON561826 | B1 | ||||||
| JT-05 | ON561860 | A1 | ON561827 | B1 | ||||||
| JT-06 | ON561861 | A1 | ON561828 | B1 | ||||||
| JT-07 | ON561862 | A1 | ON561829 | B1 | ||||||
| JT-08 | ON561863 | A1 | ON561830 | B1 | ||||||
| ZT-01 | Linhai, Taizhou, Zhejiang 28°85′N, 121°13′E | OK058531 | A1 | MZ995452 | B1 | |||||
| ZT-02 | OK058532 | A1 | MZ995453 | B1 | ||||||
| ZT-03 | MZ995433 | A1 | MZ995454 | B1 | ||||||
| ZT-07 | ON561864 | A1 | ON561831 | B1 | ||||||
| ZT-08 | ON561865 | A1 | ON561832 | B1 | ||||||
| ZT-09 | ON561866 | A1 | ON561833 | B1 | ||||||
| ZT-10 | ON561867 | A1 | ON561834 | B1 | ||||||
| ZT-04 | Suichang County, Lishui, Zhejiang 28°61′N, 119°05′E | MZ995434 | A2 | MZ995455 | B1 | |||||
| ZT-05 | MZ995435 | A2 | MZ995456 | B1 | ||||||
| ZT-06 | MZ995436 | A2 | MZ995457 | B1 | ||||||
| ZT-11 | 38.93% | ON561868 | A3 | ON561835 | B1 | |||||
| ZT-12 | ON561869 | A3 | ON561836 | B1 | ||||||
| ZT-13 | ON561870 | A3 | ON561837 | B1 | ||||||
| ZT-14 | ON561871 | A3 | ON561838 | B1 | ||||||
| ZT-15 | ON561872 | A3 | ON561839 | B1 | ||||||
| MT-01 | Longyan, Fujian 25°08′N, 117°02′E | 38.85% | MZ995440 | A1 | MZ995458 | B1 | ||||
| MT-02 | MZ995441 | A1 | MZ995459 | B1 | ||||||
| MT-03 | MZ995442 | A1 | MZ995460 | B1 | ||||||
| MT-04 | MZ995443 | A1 | MZ995461 | B1 | ||||||
| MT-05 | ON561873 | A1 | ON561840 | B1 | ||||||
| MT-06 | ON561874 | A1 | ON561841 | B1 | ||||||
| MT-07 | ON561875 | A1 | ON561842 | B1 | ||||||
| MT-08 | ON561876 | A1 | ON561843 | B1 | ||||||
| YT-01 | Baise, Guangxi 23°90′N, 106°62′E | MZ995444 | A1 | MZ995462 | B1 | |||||
| YT-02 | MZ995445 | A1 | MZ995463 | B1 | ||||||
| YT-03 | MZ995446 | A1 | MZ995464 | B1 | ||||||
| YT-04 | ON561877 | A1 | ON561844 | B1 | ||||||
| YT-05 | ON561878 | A1 | ON561845 | B1 | ||||||
| YT-06 | ON624114 | A1 | ON561846 | B1 | ||||||
| YT-07 | ON561879 | A1 | ON561847 | B1 | ||||||
| YT-08 | ON561880 | A1 | ON561848 | B1 | ||||||
| CT-01 | Wanyuan, Dazhou, Sichuan 32°08′N, 108°03′E | 38.76% | MZ995447 | A4 | MZ995465 | B2 | ||||
| CT-02 | MZ995448 | A4 | MZ995466 | B2 | ||||||
| CT-03 | MZ995449 | A4 | MZ995467 | B2 | ||||||
| CT-04 | ON561881 | A1 | ON561849 | B2 | ||||||
| CT-05 | ON561882 | A1 | ON561850 | B2 | ||||||
| CT-06 | ON561883 | A1 | ON561851 | B2 | ||||||
| CT-07 | ON561884 | A1 | ON561852 | B2 | ||||||
| DT-01 | Shaoguan, Guangdong 24°80′N, 113°59′E | 38.85% | MZ995450 | A1 | MZ995471 | B1 | ||||
| DT-02 | MZ995451 | A1 | MZ995472 | B1 | ||||||
| DT-03 | ON561885 | A1 | ON561853 | B1 | ||||||
| DT-04 | ON561886 | A1 | ON561854 | B1 | ||||||
| DT-05 | ON561887 | A1 | ON561855 | B1 | ||||||
| DT-06 | ON561888 | A1 | ON561856 | B1 | ||||||
| DT-07 | ON561889 | A1 | ON561857 | B1 | ||||||
| DT-08 | ON561890 | A1 | ON561858 | B1 | ||||||
| NCBI | Genbank | MT827073 | A1 | MT827073 | B1 | |||||
| TP-01 | 38.76% | MN401672 | A5 | 43.28% | MN401672 | B3 | ||||
| AJ-01 | 1136 bp | 39.00% | NC_042235 | A6 | NC_042235 | B4 | ||||
| VV-01 | NC_007957 | A7 | 43.07% | NC_007957 | B4 | |||||
| MA-01 | 1090 bp | 38.53% | MN310606 | A8 | 491 bp | 41.75% | MN310606 | B5 | ||
| MC-01 | 1077 bp | NC_052729 | A9 | 41.55% | NC_052729 | B6 | ||||
Fig. 8The Neighbor-Joining (NJ) tree analysis of 21 T. hemsleyanum samples from different regions and 3 representative Vitaceae species, based on the combination of trnL + trnN sequence. The six colors represent samples of T. hemsleyanum from six different provinces