| Literature DB >> 35385539 |
Conglong Lian1, Hao Yang1, Jinxu Lan1, Xueyu Zhang1, Fei Zhang1, Jingfan Yang1, Suiqing Chen1.
Abstract
Isodon rubescens (Hemsley) H. Hara (Lamiaceae) is a traditional Chinese medicine plant that has been used to treat various human diseases and conditions such as inflammation, respiratory and gastrointestinal bacterial infections, and malignant tumors. However, the contents of the main active components of I. rubescens from different origins differ significantly, which greatly affected its quality. Therefore, a molecular method to identify and classify I. rubescens is needed. Here, we report the DNA sequence of the chloroplast genome of I. rubescens collected from Lushan, Henan province. The genome is 152,642 bp in length and has a conserved structure that includes a pair of IR regions (25,726 bp), a LSC region (83,527 bp) and a SSC region (17,663 bp). The chloroplast genome contains 113 unique genes, four rRNA genes, 30 tRNA genes, and 79 protein-coding genes, 23 of which contain introns. The protein-coding genes account for a total of 24,412 codons, and most of them are A/T biased usage. We identified 32 simple sequence repeats (SSRs) and 48 long repeats. Furthermore, we developed valuable chloroplast molecular resources by comparing chloroplast genomes from three Isodon species, and both mVISTA and DnaSP analyses showed that rps16-trnQ, trnS-trnG, and ndhC-trnM are candidate regions that will allow the identification of intraspecific differences within I. rubescens. Also 14 candidate fragments can be used to identify interspecific differences between species in Isodon. A phylogenetic analysis of the complete chloroplast genomes of 24 species in subfamily Nepetoideae was performed using the maximum likelihood method, and shows that I. rubescens clustered closer to I. serra than I. lophanthoides. Interestingly, our analysis showed that I. rubescens (MW018469.1) from Xianyang, Shaanxi Province (IR-X), is closer to I. serra than to the other two I. rubescens accessions. These results strongly indicate that intraspecific diversity is present in I. rubescens. Therefore, our results provide further insight into the phylogenetic relationships and interspecific diversity of species in the genus Isodon.Entities:
Mesh:
Year: 2022 PMID: 35385539 PMCID: PMC8985940 DOI: 10.1371/journal.pone.0266546
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Circular gene map of the I. rubescens chloroplast genome.
Genes inside the circle are transcribed clockwise, and those outside are transcribed counter clockwise. Groups of genes with different functions are color-coded. The darker gray in the inner circle shows the GC content, while the lighter gray shows the AT content.
List of genes in the chloroplast genome of I. rubescens.
| Category for genes | Group of genes | Name of genes |
|---|---|---|
|
| Subunits of photosystem I | |
| Subunits of photosystem II | ||
| Subunits of cytochrome b/f complex | ||
| Large subunit of Rubisco |
| |
| Subunits of ATP synthase | ||
| Subunits of NADH-dehydrogenase | ||
|
| Ribosomal RNA genes | |
| Transfer RNA genes | ||
| Small subunit of ribosome | ||
| Large subunit of ribosome | ||
| DNA-dependent RNA polymerase | ||
|
| Maturase |
|
| Envelope membrane protein |
| |
| Subunit of acetyl-CoA |
| |
| C-type cytochrome synthesis gene |
| |
| Protease |
| |
|
| Conserved Open reading frames |
a. Two gene copies.
b. Genes containing introns.
c. Genes divided into two independent transcription units.
Codon-anticodon recognition patterns and codon usage in the chloroplast genome of I. rubescens.
| Amino Acid | Codon | No. | RSCU | tRNA | Amino Acid | Codon | No. | RSCU | tRNA |
|---|---|---|---|---|---|---|---|---|---|
| Ala | GCU | 598 | 1.77 | Arg | CGU | 325 | 1.32 | trnR-ACG | |
| Ala | GCC | 219 | 0.65 | Arg | CGC | 112 | 0.46 | ||
| Ala | GCA | 382 | 1.13 | trnA-UGC | Arg | CGA | 330 | 1.34 | |
| Ala | GCG | 150 | 0.44 | Arg | CGG | 118 | 0.48 | ||
| Ile | AUU | 1055 | 1.5 | Arg | AGA | 454 | 1.85 | trnR-UCU | |
| Ile | AUC | 424 | 0.6 | trnI-GAU | Arg | AGG | 137 | 0.56 | |
| Ile | AUA | 627 | 0.89 | Asn | AAU | 894 | 1.54 | ||
| Leu | UUA | 844 | 1.97 | trnL-UAA | Asn | AAC | 265 | 0.46 | trnN-GUU |
| Leu | UUG | 502 | 1.17 | trnL-CAA | Asp | GAU | 754 | 1.62 | |
| Leu | CUU | 537 | 1.25 | Asp | GAC | 179 | 0.38 | trnD-GUC | |
| Leu | CUC | 160 | 0.37 | Cys | UGU | 207 | 1.54 | ||
| Leu | CUA | 369 | 0.86 | trnL-UAG | Cys | UGC | 61 | 0.46 | trnC-GCA |
| Leu | CUG | 159 | 0.37 | Gln | CAA | 653 | 1.53 | trnQ-UUG | |
| Met | AUG | 586 | 1 | trnI-CAU, trnM-CAU, trnfM-CAU | Gln | CAG | 199 | 0.47 | |
| Pro | CCU | 388 | 1.51 | Glu | GAA | 977 | 1.56 | trnE-UUC | |
| Pro | CCC | 212 | 0.82 | Glu | GAG | 278 | 0.44 | ||
| Pro | CCA | 284 | 1.1 | trnP-UGG | Gly | GGU | 528 | 1.25 | |
| Pro | CCG | 145 | 0.56 | Gly | GGC | 188 | 0.44 | trnG-GCC | |
| Ser | UCU | 537 | 1.73 | Gly | GGA | 690 | 1.63 | trnG-UCC | |
| Ser | UCC | 294 | 0.95 | trnS-GGA | Gly | GGG | 290 | 0.68 | |
| Ser | UCA | 364 | 1.18 | trnS-UGA | His | CAU | 437 | 1.53 | |
| Ser | UCG | 166 | 0.54 | His | CAC | 134 | 0.47 | trnH-GUG | |
| Ser | AGU | 390 | 1.26 | Lys | AAA | 1005 | 1.51 | trnK-UUU | |
| Ser | AGC | 107 | 0.35 | trnS-GCU | Lys | AAG | 326 | 0.49 | |
| Thr | ACU | 503 | 1.61 | Phe | UUU | 945 | 1.38 | ||
| Thr | ACC | 243 | 0.78 | trnT-GGU | Phe | UUC | 428 | 0.62 | trnF-GAA |
| Thr | ACA | 375 | 1.2 | trnT-UGU | TER | UAA | 46 | 1.6 | |
| Thr | ACG | 129 | 0.41 | TER | UAG | 24 | 0.84 | ||
| Val | GUU | 504 | 1.48 | TER | UGA | 16 | 0.56 | ||
| Val | GUC | 157 | 0.46 | trnV-GAC | Trp | UGG | 421 | 1 | trnW-CCA |
| Val | GUA | 543 | 1.59 | trnV-UAC | Tyr | UAU | 714 | 1.63 | |
| Val | GUG | 161 | 0.47 | Tyr | UAC | 163 | 0.37 | trnY-GUA |
SSRs in the chloroplast genome of I. rubescens.
| SSR nr. | SSR type | SSR | size | start | end | Region | Position |
|---|---|---|---|---|---|---|---|
|
| p1 | (A)11 | 11 | 4444 | 4454 | LSC | IGS |
|
| p1 | (A)10 | 10 | 6668 | 6677 | LSC | IGS |
|
| p1 | (A)10 | 10 | 8147 | 8156 | LSC | IGS |
|
| c | (T)12gaaaagaaaaaaaataatgccttttttttaag(T)10 | 54 | 11734 | 11787 | LSC | IGS |
|
| c | (T)10c(A)12 | 23 | 16305 | 16327 | LSC | IGS |
|
| p1 | (T)11 | 11 | 18536 | 18546 | LSC | rpoC2 |
|
| p1 | (T)11 | 11 | 28760 | 28770 | LSC | IGS |
|
| p1 | (T)12 | 12 | 35002 | 35013 | LSC | IGS |
|
| p1 | (A)14 | 14 | 36248 | 36261 | LSC | trnH-AUG(intron) |
|
| p1 | (T)12 | 12 | 41884 | 41895 | LSC | IGS |
|
| p1 | (A)10 | 10 | 44220 | 44229 | LSC | IGS |
|
| p1 | (T)13 | 13 | 46005 | 46017 | LSC | IGS |
|
| p2 | (TA)7 | 14 | 46164 | 46177 | LSC | IGS |
|
| p1 | (T)10 | 10 | 46693 | 46702 | LSC | IGS |
|
| p1 | (A)11 | 11 | 46869 | 46879 | LSC | IGS |
|
| p1 | (T)10 | 10 | 54295 | 54304 | LSC | atpB |
|
| p1 | (T)10 | 10 | 58922 | 58931 | LSC | IGS |
|
| p1 | (T)13 | 13 | 59595 | 59607 | LSC | IGS |
|
| p1 | (T)10 | 10 | 60808 | 60817 | LSC | IGS |
|
| p1 | (T)10 | 10 | 63527 | 63536 | LSC | IGS |
|
| p1 | (T)11 | 11 | 67056 | 67066 | LSC | IGS |
|
| p1 | (T)13 | 13 | 69185 | 69197 | LSC | IGS |
|
| p1 | (T)10 | 10 | 70138 | 70147 | LSC | clpP1(intron) |
|
| p1 | (A)11 | 11 | 70426 | 70436 | LSC | clpP1(intron) |
|
| p1 | (T)11 | 11 | 71303 | 71313 | LSC | clpP1(intron) |
|
| p1 | (A)10 | 10 | 73740 | 73749 | LSC | IGS |
|
| p1 | (T)12 | 12 | 79583 | 79594 | LSC | IGS |
|
| p1 | (T)10 | 10 | 98083 | 98092 | IRB | IGS |
|
| p1 | (T)10 | 10 | 112297 | 112306 | SSC | IGS |
|
| p1 | (T)10 | 10 | 117749 | 117758 | SSC | IGS |
|
| p1 | (T)11 | 11 | 125193 | 125203 | SSC | ycf1 |
|
| p1 | (A)10 | 10 | 138158 | 138167 | IRA | IGS |
Long repeat sequences identified in the chloroplast genome of I. rubescens.
| No. | Repeat size (bp) | Type | Repeat 1 start | Repeat 2 start | Mismatch(bp) | E-Value | Region |
|---|---|---|---|---|---|---|---|
|
| 29 | P | 8217 | 45035 | 0 | 2.27×10−08 | LSC |
|
| 26 | P | 87192 | 87192 | 0 | 1.46×10−06 | IRB |
|
| 26 | F | 87192 | 149031 | 0 | 1.46×10−06 | IRB;IRA |
|
| 26 | F | 125658 | 125676 | 0 | 1.46×10−06 | SSC |
|
| 26 | P | 149031 | 149031 | 0 | 1.46×10−06 | IRA |
|
| 24 | P | 74746 | 74746 | 0 | 2.33×10−05 | LSC |
|
| 24 | P | 114085 | 114132 | 0 | 2.33×10−05 | SSC |
|
| 23 | P | 29238 | 29263 | 0 | 9.31×10−05 | LSC |
|
| 22 | F | 9802 | 36047 | 0 | 3.72×10−04 | LSC |
|
| 22 | P | 30490 | 30490 | 0 | 3.72×10−04 | LSC |
|
| 22 | F | 45998 | 59588 | 0 | 3.72×10−04 | LSC |
|
| 22 | F | 46185 | 46215 | 0 | 3.72×10−04 | LSC |
|
| 22 | P | 46808 | 46808 | 0 | 3.72×10−04 | LSC |
|
| 22 | F | 90824 | 90842 | 0 | 3.72×10−04 | IRB |
|
| 22 | P | 90824 | 145385 | 0 | 3.72×10−04 | IRB, IRA |
|
| 22 | P | 90842 | 145403 | 0 | 3.72×10−04 | IRB, IRA |
|
| 22 | F | 97710 | 118963 | 0 | 3.72×10−04 | IRB, SSC |
|
| 22 | P | 118963 | 138517 | 0 | 3.72×10−04 | IRA, SSC |
|
| 22 | F | 145385 | 145403 | 0 | 3.72×10−04 | IRA |
|
| 21 | F | 8222 | 35143 | 0 | 1.49×10−03 | LSC |
|
| 21 | C | 18527 | 46860 | 0 | 1.49×10−03 | LSC |
|
| 21 | R | 31283 | 31283 | 0 | 1.49×10−03 | LSC |
|
| 21 | P | 35143 | 45038 | 0 | 1.49×10−03 | LSC |
|
| 21 | F | 36272 | 66485 | 0 | 1.49×10−03 | LSC |
|
| 20 | P | 8649 | 8649 | 0 | 5.96×10−03 | LSC |
|
| 20 | R | 8853 | 8853 | 0 | 5.96×10−03 | LSC |
|
| 20 | R | 11769 | 117746 | 0 | 5.96×10−03 | LSC |
|
| 20 | F | 38272 | 40496 | 0 | 5.96×10−03 | LSC |
|
| 20 | R | 50182 | 50182 | 0 | 5.96×10−03 | LSC |
|
| 20 | P | 51977 | 102498 | 0 | 5.96×10−03 | LSC, IRB |
|
| 20 | F | 51977 | 133731 | 0 | 5.96×10−03 | IRB, IRA |
|
| 20 | P | 73915 | 73939 | 0 | 5.96×10−03 | LSC |
|
| 19 | R | 1790 | 14187 | 0 | 2.38×10−02 | LSC |
|
| 19 | C | 7309 | 47088 | 0 | 2.38×10−02 | LSC |
|
| 19 | R | 15030 | 15030 | 0 | 2.38×10−02 | LSC |
|
| 19 | P | 30326 | 115836 | 0 | 2.38×10−02 | LSC, SSC |
|
| 19 | R | 46864 | 46864 | 0 | 2.38×10−02 | LSC |
|
| 19 | P | 73723 | 122022 | 0 | 2.38×10−02 | LSC, SSC |
|
| 19 | F | 90786 | 90804 | 0 | 2.38×10−02 | IRB |
|
| 19 | P | 90786 | 145426 | 0 | 2.38×10−02 | IRB, IRA |
|
| 19 | P | 90804 | 145444 | 0 | 2.38×10−02 | IRB, IRA |
|
| 19 | R | 118508 | 118508 | 0 | 2.38×10−02 | SSC |
|
| 19 | F | 145426 | 145444 | 0 | 2.38×10−02 | IRA |
|
| 18 | R | 148 | 69617 | 0 | 9.54×10−02 | LSC |
|
| 18 | R | 4515 | 112976 | 0 | 9.54×10−02 | LSC, SSC |
|
| 18 | F | 6222 | 27249 | 0 | 9.54×10−02 | LSC |
|
| 18 | R | 6251 | 6251 | 0 | 9.54×10−02 | LSC |
|
| 18 | P | 7904 | 112977 | 0 | 9.54×10−02 | LSC, SSC |
F: forward repeats, P: palindromic repeats, R: reverse repeats, C: complementary repeats
Fig 2Sequence alignment of five Isodon chloroplast genomes using mVISTA.
The annotation of I. rubescens (IR-L) was used as the reference. Gray arrows above the alignment indicate the position and direction of transcription of each gene. The scale on the vertical axes represents the percent sequence identity between 50% and 100%. IR-J, IR-X, and IR-L are the three accessions of I. rubescens.
Fig 3Sliding window analysis of five complete Isodon chloroplast genomes.
Window length: 800 bp; step size: 200 bp. X-axis: nucleotide positions in the chloroplast genomes. Y-axis: K values of nucleotide diversity. A: I. rubescens from Lushan, Henan province (IR-L), B: I. rubescens (MW018469.1) from Xianyang, Shaanxi Province (IR-X), C: I. rubescens (MW376483.1) from Jiyuan, Henan Province (IR-J), D: I. serra (MT317099.1), E: I. lophanthoides (MT317098.1). Total: the total nucleotide diversity values for the A vs. B, A vs. C, A vs. D, and A vs. E comparisons.
Fig 4Comparison of the LSC, SSC, and IR border regions among the five Isodon chloroplast genomes of examined in this study.
IR-J, IR-X, and IR-L are the three accessions of I. rubescens.
Fig 5Phylogenetic tree based on the complete chloroplast genomes from 25 species constructed using the maximum likelihood method.
The Scutellaria baicalensis cpDNA sequence was used as the outgroup. Bootstrap values are shown at the nodes.