| Literature DB >> 28902130 |
Dan Jiang1,2, Zhenyu Zhao3, Teng Zhang4, Wenhao Zhong5, Chunsheng Liu6, Qingjun Yuan7, Luqi Huang8.
Abstract
Scutellaria baicalensis Georgi (Lamiaceae) is the source of the well-known traditional Chinese medicine "HuangQin" (Radix Scutellariae). Natural sources of S. baicalensis are rapidly declining due to high market demand and overexploitation. Moreover, the commercial products of Radix Scutellariae have often been found to contain adulterants in recent years, which may give rise to issues regarding drug efficacy and safety. In this study, we developed valuable chloroplast molecular resources by comparing intraspecific and interspecific chloroplast genome. The S. baicalensis chloroplast genome is a circular molecule consisting of two single-copy regions separated by a pair of inverted repeats. Comparative analyses of three Scutellaria chloroplast genomes revealed six variable regions (trnH-psbA, trnK-rps16, petN-psbM, trnT-trnL, petA-psbJ, and ycf1) that could be used as DNA barcodes. There were 25 single nucleotide polymorphisms(SNPs) and 29 indels between the two S. baicalensis genotypes. All of the indels occurred within non-coding regions. Phylogenetic analysis suggested that Scutellarioideae is a sister taxon to Lamioideae. These resources could be used to explore the variation present in Scutellaria populations and for further evolutionary, phylogenetic, barcoding and genetic engineering studies, in addition to effective exploration and conservation of S. baicalensis.Entities:
Keywords: Scutellaria baicalensis; chloroplast genome; genomic resource; indels; microsatellites; single nucleotide polymorphisms
Year: 2017 PMID: 28902130 PMCID: PMC5615360 DOI: 10.3390/genes8090227
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Summary of the characteristics of Scutellaria baicalensis chloroplast genome.
| Genotype | BOP028265 | BOP025266 |
|---|---|---|
| Total cpDNA size (bp) | 151,824 | 151,817 |
| LSC size (bp) | 83,976 | 83,960 |
| IR size (bp) | 25,255 | 25,263 |
| SSC size (bp) | 17,338 | 17,331 |
| Number of genes | 114 | 114 |
| Number of different protein-coding genes | 80 | 80 |
| Number of different tRNA genes | 30 | 30 |
| Number of different rRNA genes | 4 | 4 |
| Number of different duplicated genes | 17 | 17 |
| GC content | 38.3% | 38.3% |
| GC content of LSC | 36.3% | 36.3% |
| GC content of SSC | 32.7% | 32.7% |
| GC content of IR | 43.6% | 43.6% |
cpDNA: chloroplast DNA; LSC: large single-copy; IR: inverted repeat; SSC: small single-copy.
Figure 1Chloroplast genome map of Scutellaria baicalensis. The genes drawn outside of the circle are transcribed clockwise, while those inside the circle are transcribed counterclockwise. Small single copy (SSC), large single copy (LSC), and inverted repeats (IRa, IRb) are indicated.
Figure 2The distribution, type and presence of microsatellites (SSRs) in the chloroplast genome of Scutellaria. baicalensis. (A) Number of different SSR types; (B) Proportion of SSRs in LSC, SSC, and IR regions; (C) Number of identified SSR motifs in different repeat class types.
Figure 3Visualization of the alignment of chloroplast genome sequences of Scutellaria. VISTA-based similarity graphical information illustrating the sequence identity of Scutellaria with reference S. baicalensis chloroplast genomes. Grey arrows above the alignment indicate the orientation of genes. Purple bars represent exons; blue bars represent introns; and pink bars represent non-coding sequences (CNS). A cut-off of 50% identity was used for the plots. The Y-scale axis represents the percent identity within 50–100%. Genome regions are color-coded as protein-coding exons, rRNAs, tRNAs, or conserved CNS.
Figure 4Sliding window analysis of the whole chloroplast genomes of three Scutellaria species. Window length: 800 bp; step size: 200 bp. X-axis: position of the midpoint of a window. Y-axis: nucleotide diversity of each window.
Figure 5Phylogenetic tree reconstruction of 36 taxa using maximum likelihood and Bayesian inference based on the concatenated sequences of 84 genes. Maximum likelihood (ML) topology, with the ML bootstrap support value/Bayesian posterior probability given at each node.