| Literature DB >> 35241112 |
Gengyu Lu1, Juanjuan Qiao1, Long Wang2, Hui Liu3, Gang Wu2, Yan Zhu1, Yucheng Zhao1, Guoyong Xie1, Minjian Qin4,5.
Abstract
BACKGROUND: Viola philippica Cav. is the only original plant for Violae Herba, as described in the Chinese Pharmacopoeia. The quality of this crude drug is affected by several adulterants from congeneric Viola species, and the authentic plant origin of Violae Herba is still controversial. Genome-based identification offers abundant genetic information and potential molecular markers that can be used for the authentication of closely related species. This study aims to investigate the certified origin of Violae Herba and to develop more effective markers for these easily confused species at the genetic level.Entities:
Keywords: Chloroplast genome; Herbgenomics; Integrated study; Pharmacopoeia revision; Superbarcodes; Viola; Violae Herba
Year: 2022 PMID: 35241112 PMCID: PMC8892722 DOI: 10.1186/s13020-022-00585-9
Source DB: PubMed Journal: Chin Med ISSN: 1749-8546 Impact factor: 5.455
Fig. 1Comparison of morphological and TLC traits. A Leaves of the commercial dried samples were expanded in the water to show the morphological traits. Each cell indicates one typical blade shape. B TLC traits of 18 batches of Violae Herba with the standard drug and esculetin as references. M1-M18 indicate the 18 batches of samples from the TCM markets. C TLC traits of the six Viola species with the standard drug and esculetin as references. V.c V. collina, V.j V. japonica, V.b V. betonicifolia, V.i V. inconspicua, V.p V. philippica, V.pr V. prionantha, p1 and p2 are two specific identification spots
Fig. 2Fresh leaves variation and original plants of the six Viola species. A V. inconspicua; B V. betonicifolia; C V. japonica; D V. collina; E V. philippica; F V. prionantha
Fig. 3Comparative studies of the index components and universal DNA barcoding authentication. A HPLC chromatograms of commercial VH, wild-collected samples, and reference drug. B Total ion current chromatogram of V. prionantha indicating the 4 main index components in positive ion mode. C NJ tree constructed by a two-locus barcode rbcLa and trnH-psbA
Fig. 4Chloroplast genome map of the six Viola species. Only one map was drawn here, because the number of functional genes and linear arrangement are almost the same. Genes drawn inside the circle are transcribed clockwise, genes outside are transcribed counterclockwise. Genes possess different functions are color coded in the legend. The area in darker gray and lighter gray in the inner circle indicates GC content and AT content, respectively
The basic cp genome features of six Viola species
| Characteristics | ||||||
|---|---|---|---|---|---|---|
| Total size (bp) | 157,391 | 156,395 | 157,419 | 158,067 | 157,414 | 157,215 |
| Overall GC content (%) | 36.25 | 36.35 | 36.25 | 36.25 | 36.24 | 36.25 |
| LSC | ||||||
| Length (bp) | 85,828 | 85,779 | 85,908 | 86,509 | 85,907 | 85,692 |
| GC content | 33.83 | 33.87 | 33.82 | 33.82 | 33.81 | 33.83 |
| Length (%) | 54.53 | 54.85 | 54.57 | 54.73 | 54.57 | 54.51 |
| IR | ||||||
| Length (bp) | 27,161 | 27,139 | 27,116 | 27,114 | 27,118 | 27,105 |
| GC content (%) | 42.12 | 42.14 | 42.14 | 42.16 | 42.14 | 42.15 |
| Length (%) | 17.26 | 17.35 | 17.23 | 17.15 | 17.23 | 17.24 |
| SSC | ||||||
| Length (bp) | 17,241 | 16,338 | 17,279 | 17,330 | 17,271 | 17,313 |
| GC content (%) | 29.79 | 30.16 | 29.83 | 29.91 | 29.76 | 29.76 |
| Length (%) | 10.95 | 10.45 | 10.98 | 10.96 | 10.97 | 11.01 |
| Genes | ||||||
| Total genes | 129 | 129 | 129 | 129 | 129 | 129 |
| Duplicate genes | 18 | 18 | 18 | 18 | 18 | 18 |
| Protein coding genes | 77 | 77 | 77 | 77 | 77 | 77 |
| rRNA genes | 4 | 4 | 4 | 4 | 4 | 4 |
| tRNA genes | 30 | 30 | 30 | 30 | 30 | 30 |
GC guanine and cytosine, LSC large single-copy region, SSC small single-copy region, IRs inverted repeats region
Gene contents in the cp genomes of six Viola species
| Group of genes | Name of genes | Amount |
|---|---|---|
| Ribosomal RNAs | rrn16(× 2), rrn23(× 2), rrn4.5(× 2), rrn5(× 2) | 8 |
| Transfer RNAs | 37 | |
| Small ribosomal subunit (SSU) | 13 | |
| Large ribosomal subunit (LSU) | 10 | |
| DNA-dependent RNA polymerase | 4 | |
| Subunits of ATP synthase | 6 | |
| Subunits of NADH dehydrogenase | 12 | |
| Cytochrome b/f complex | 6 | |
| Subunits of photosystem I | 5 | |
| Assembly/Stability of photosystem I | 2 | |
| Subunits of photosystem II | 15 | |
| Large subunit of Rubisco | 1 | |
| Acetyl-CoA carboxylase | 1 | |
| c-type cytochrome synthesis gene | 1 | |
| Envelop membrane protein | 1 | |
| ATP-dependent protease | 1 | |
| Maturase | 1 | |
| Conserved open reading frames | 5 | |
| Pseudogenes | ||
| Total | 129 |
One and two asterisks (*) indicate one- and two-intron containing genes. Genes with two copies indicate by the (× 2) symbol. (Ψ) symbol indicates the pseudogene. The rps12 gene is a trans-spliced gene
Fig. 5Comparative analyses of codon usage and repeat sequences. A Codon contents for the CDS sequences in the six cp genomes. The ordinate shows the RSCU values, and the abscissa represents 20 amino acids and termination codons. The histogram represents codon usage for each amino acid of the six species (From left to right: V. inconspicua, V. betonicifolia, V. japonica, V. collina, V. philippica, V. prionantha). B Types and numbers of SSRs C Numbers of tandem repeats. D Types and numbers of dispersed repeats. F, P, R, and C indicate the repeat types F(forward), P(palindrome), R(reverse), and C(complement)
Fig. 6Analyses of sequences divergence. A Comparison of the LSC, IR, and SSC junctions. Colored boxes for genes represent the gene position. JLB denotes the LSC/IRb junction, JSB denotes the SSC/IRb junction, JSA denotes the SSC/IRa junction, and JLA denotes the LSC/IRa junction. B Global alignment of six Viola cp genomes using the annotation of V. inconspicua as a reference. Dark blue blocks indicate exons, and red blocks indicate conserved non-coding sequences (CNS). tRNA and rRNA genes are denoted by cyan blocks. White peaks represent regions with sequence variation among cp genomes. Arrows are drawn above the graph pointing the direction of genes. 70% cut-off was used for the plot, and the vertical axis indicates sequence identity between 50 and 100%. The red rectangle frames indicate the divergent hotspot regions. C Sliding window analysis of the complete cp genomes of six Viola species (window length: 800 bp, step size: 200 bp). X-axis indicates the midpoint of windows. Y-axis indicates nucleotide diversity (Pi) within each window. And the red line indicates the threshold for potential genetic markers (Pi threshold = 0.015). Regions with Pi values meet or exceed 0.015 were marked out on the graph. The multicolored legend above shows the gene order
Fig. 7Phylogenetic tree based on the cp genomes of 17 Viola species using maximum likelihood (ML) and Bayesian inferences (BI) methods. Passiflora edulis was used as outgroup. Numbers besides the nodes represent ML bootstrap support (BS) values and BI posterior probabilities (PP) values. The legend on the right shows the Section-Subsection taxonomic synopsis and the corresponding evolution of stigma
The variability and discrimination rates of cp genomes, specific and universal barcodes
| No | Markers | Aligned length | Polymorphic sites | Parsimony informative sites (PICs) | Mean K2P distance | Discrimination rates by tree-based method | ||
|---|---|---|---|---|---|---|---|---|
| Numbers | % | Numbers | % | |||||
| Complete cp genomes | 136,352 | 4664 | 3.42% | 2175 | 1.60% | 0.00824 | 100% | |
| Specific barcodes | ||||||||
| 1 | 506 | 65 | 12.85% | 30 | 5.93% | 0.03880 | 100% | |
| 2 | 1155 | 118 | 10.22% | 58 | 5.02% | 0.02759 | 35.29% | |
| 3 | 1189 | 138 | 11.61% | 55 | 4.63% | 0.03028 | 64.71% | |
| 4 | 977 | 157 | 16.07% | 108 | 11.05% | 0.04316 | 70.59% | |
| 1580 | 129 | 8.16% | 69 | 4.37% | 0.03071 | 64.71% | ||
| 6 | 300 | 46 | 15.33% | 24 | 8.00% | 0.04475 | 41.18% | |
| 7 | 1923 | 222 | 11.54% | 93 | 4.84% | 0.03428 | 47.06% | |
| Combination (1–2-3) | 2850 | 321 | 11.26% | 143 | 5.02% | 0.03034 | 100% | |
| Universal barcodes | ||||||||
| 1 | 424 | 25 | 5.90% | 17 | 4.01% | 0.02121 | 41.18% | |
| 2 | 625 | 46 | 7.36% | 15 | 2.40% | 0.01793 | 47.06% | |
| Combination (1–2) | 1049 | 71 | 6.77% | 32 | 3.05% | 0.01929 | 47.06% | |
Discrimination rates (%) = [the number of species with bootstrap values more than 50% / the total number of species] × 100%
Fig. 8Neighbor-joining tree for 17 Viola species using complete cp genomes as super-barcodes