| Literature DB >> 27391235 |
Lianghong Ni1, Zhili Zhao1, Gaawe Dorje2, Mi Ma2.
Abstract
Scrophularia dentata is an important Tibetan medicinal plant and traditionally used for the treatment of exanthema and fever in Traditional Tibetan Medicine (TTM). However, there is little sequence and genomic information available for S. dentata. In this paper, we report the complete chloroplast genome sequence of S. dentata and it is the first sequenced member of the Sect. Tomiophyllum within Scrophularia (Scrophulariaceae). The gene order and organization of the chloroplast genome of S. dentata are similar to other Lamiales chloroplast genomes. The plastome is 152,553 bp in length and includes a pair of inverted repeats (IRs) of 25,523 bp that separate a large single copy (LSC) region of 84,058 bp and a small single copy (SSC) region of 17,449 bp. It has 38.0% GC content and includes 114 unique genes, of which 80 are protein-coding, 30 are transfer RNA, and 4 are ribosomal RNA. Also, it contains 21 forward repeats, 19 palindrome repeats and 41 simple sequence repeats (SSRs). The repeats and SSRs within S. dentata were compared with those of S. takesimensis and present certain discrepancies. The chloroplast genome of S. dentata was compared with other five publicly available Lamiales genomes from different families. All the coding regions and non-coding regions (introns and intergenic spacers) within the six chloroplast genomes have been extracted and analysed. Furthermore, the genome divergent hotspot regions were identified. Our studies could provide basic data for the alpine medicinal species conservation and molecular phylogenetic researches of Scrophulariaceae and Lamiales.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27391235 PMCID: PMC4938499 DOI: 10.1371/journal.pone.0158488
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Chloroplast genome map of Scrophularia dentata.
Genes drawn inside the circle are transcribed clockwise, and those outside are counterclockwise. Genes belonging to different functional groups are shown in different colors. The asterisks indicate genes which contain intron(s).
Genes present within Scrophularia dentata chloroplast genome.
| Group of genes | Gene names | |
|---|---|---|
| 1 | Photosystem I | |
| 2 | Photosystem II | |
| 3 | Cytochrome b/f complex | |
| 4 | ATP synthase | |
| 5 | NADH dehydrogenase | |
| 6 | RubisCO large subunit | |
| 7 | RNA polymerase | |
| 8 | Ribosomal proteins (SSU) | |
| 9 | Ribosomal proteins (LSU) | |
| 10 | Other genes | |
| 11 | Proteins of unknown function | |
| 12 | Ribosomal RNAs | |
| 13 | Transfer RNAs |
*Gene containing one intron.
**Gene containing two introns.
(×2) Two gene copies in the IRs.
Genes with introns within Scrophularia dentata chloroplast genome and the length of exons and introns.
| Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) |
|---|---|---|---|---|---|---|
| LSC | 6 | 720 | 642 | |||
| LSC | 8 | 735 | 475 | |||
| LSC | 144 | 723 | 411 | |||
| SSC | 552 | 1067 | 531 | |||
| IR | 777 | 679 | 756 | |||
| LSC | 435 | 756 | 1626 | |||
| LSC | 114 | - | 232 | 537 | 26 | |
| LSC | 42 | 848 | 225 | |||
| IR | 393 | 664 | 435 | |||
| LSC | 9 | 886 | 399 | |||
| LSC | 69 | 727 | 291 | 626 | 228 | |
| LSC | 129 | 702 | 228 | 720 | 153 | |
| IR | 38 | 813 | 35 | |||
| LSC | 23 | 683 | 47 | |||
| IR | 42 | 943 | 35 | |||
| LSC | 37 | 2517 | 35 | |||
| LSC | 37 | 488 | 50 | |||
| LSC | 38 | 585 | 37 |
* The rps12 is a trans-spliced gene with the 5' end located in the LSC region and the duplicated 3' end in the IR regions.
Comparison of the base composition of six Lamiales chloroplast genomes.
| LSC | 84,058 bp | 84,692 bp | 86,616 bp | 82,695 bp | 82,459 bp | 85,170 bp |
| SSC | 17,449 bp | 17,901 bp | 17,780 bp | 17,555 bp | 17,190 bp | 17,872 bp |
| IR | 25,523 bp | 25,450 bp | 25,712 bp | 25,539 bp | 25,300 bp | 25,141 bp |
| Total | 152,553 bp | 153,493 bp | 155,820 | 151,328 bp | 150,249 bp | 153,324 bp |
| CG content | 38.0% | 37.6% | 37.8% | 36.2% | 38.3% | 38.2% |
| Coding gene | 80 | 86 | 80 | 80 | 80 | 80 |
| tRNA | 30 | 29 | 30 | 30 | 30 | 30 |
| rRNA | 4 | 4 | 4 | 4 | 4 | 4 |
| References | this study | [ | [ | [ | NC_022451 | [ |
Fig 2Sequence alignment of six chloroplast genomes in the Lamiales order using mVISTA program with S. dentata as reference.
The top line shows genes in order (transcriptional direction indicated with arrow). A cut-off of 70% identity was used for the plots, and the Y-scale represents the percent identity between 50–100%. Genome regions are color-coded as exon, intron, and conserved non-coding sequences (CNS).
Fig 3Percentages of variable sites in homologous regions across the six Lamiales chloroplast genomes.
(A) coding regions; (B) introns and intergenic spacers (non-coding regions).
Fig 4Comparison of the borders of LSC, SSC and IR regions in six Lamiales chloroplast genomes.
Ψ indicates a pseudogene. This figure is not to scale.
Repeat sequences and their distribution within Scrophularia dentata chloroplast genome.
| No. | Size(bp) | Type | Repeat 1 start (location) | Repeat 2 start (location) | Region |
|---|---|---|---|---|---|
| 1 | 30 | F | LSC | ||
| 2 | 30 | F | LSC | ||
| 3 | 30 | F | LSC; SSC | ||
| 4 | 30 | F | IRb | ||
| 5 | 30 | F | IRb | ||
| 6 | 30 | F | IRb | ||
| 7 | 30 | F | IRa | ||
| 8 | 30 | F | IRa | ||
| 9 | 30 | F | IRa | ||
| 10 | 30 | F | LSC; IRb | ||
| 11 | 31 | F | 12909( | 69566( | LSC |
| 12 | 32 | F | LSC | ||
| 13 | 34 | F | IRb | ||
| 14 | 34 | F | IRa | ||
| 15 | 36 | F | IRb | ||
| 16 | 36 | F | IRa | ||
| 17 | 39 | F | LSC; IRb | ||
| 18 | 39 | F | LSC; SSC | ||
| 19 | 40 | F | IRb | ||
| 20 | 40 | F | IRa | ||
| 21 | 41 | F | 98291( | 119047( | IRb; SSC |
| 22 | 30 | P | LSC | ||
| 23 | 30 | P | LSC; IRa | ||
| 24 | 30 | P | 75205( | 75205( | LSC |
| 25 | 30 | P | LSC | ||
| 26 | 30 | P | IRb; IRa | ||
| 27 | 30 | P | IRb; IRa | ||
| 28 | 30 | P | IRb; IRa | ||
| 29 | 30 | P | IRb; IRa | ||
| 30 | 30 | P | IRb; IRa | ||
| 31 | 30 | P | IRb; IRa | ||
| 32 | 34 | P | IRb; IRa | ||
| 33 | 34 | P | IRb; IRa | ||
| 34 | 36 | P | IRb; IRa | ||
| 35 | 36 | P | IRb; IRa | ||
| 36 | 39 | P | LSC; IRa | ||
| 37 | 40 | P | IRb; IRa | ||
| 38 | 40 | P | IRb; IRa | ||
| 39 | 41 | P | SSC; IRa | ||
| 40 | 44 | P | LSC |
* part in the gene.
Repeat sequences conserved in the S. takesimensis chloroplast genome are highlighted in bold.
F = forward, P = palindrome, IGS = intergenic spacer.
Fig 5Repeat sequences and simple sequence repeats (SSRs) within Scrophularia dentata and Scrophularia takesimensis chloroplast genomes.
(A) number of different types of repeats. F = forward repeats; P = palindrome repeats; C = complement repeats; R = reverse repeats. (B) distribution of repeats. (C) number of different types of SSRs. (D) distribution of SSRs. CDS = coding sequence; IGS = intergenic spacer; CDS-IGS = part in CDS and part in IGS.
Simple sequence repeats within Scrophularia dentata chloroplast genome.
| Unit | Length | No. | SSR start | SSR-containing region | Region |
|---|---|---|---|---|---|
| A | 16 | 1 | 44297 | LSC | |
| 13 | 1 | 12158 | LSC | ||
| 12 | 2 | 8024 | IGS ( | LSC | |
| 16175 | IGS ( | LSC | |||
| 11 | 1 | 75204 | LSC | ||
| 10 | 8 | 22732 | LSC | ||
| 44557 | IGS ( | LSC | |||
| 59913 | IGS ( | LSC | |||
| LSC | |||||
| 111914 | IGS ( | SSC | |||
| 112458 | IGS ( | SSC | |||
| 137924 | IGS ( | IRa | |||
| 152482 | IGS ( | IRa | |||
| T | 15 | 1 | 12921 | IGS ( | LSC |
| 14 | 1 | 60099 | IGS ( | LSC | |
| 13 | 1 | 69580 | IGS ( | LSC | |
| 12 | 1 | 124137 | SSC | ||
| 11 | 8 | 8632 | IGS ( | LSC | |
| 14323 | IGS ( | LSC | |||
| LSC | |||||
| 57534 | IGS ( | LSC | |||
| LSC | |||||
| 80606 | IGS ( | LSC | |||
| 112115 | IGS ( | SSC | |||
| SSC | |||||
| 10 | 8 | 52007 | LSC | ||
| LSC | |||||
| 54959 | IGS ( | LSC | |||
| 78182 | LSC | ||||
| 84121 | IGS ( | IRb | |||
| 98679 | IGS ( | IRb | |||
| 124603 | SSC | ||||
| SSC | |||||
| AT | 12 | 1 | LSC | ||
| 10 | 2 | LSC | |||
| 13822 | IGS ( | LSC | |||
| TA | 18 | 1 | 46524 | IGS ( | LSC |
| 12 | 3 | LSC | |||
| 31456 | IGS ( | LSC | |||
| 42433 | IGS ( | LSC | |||
| ATA | 12 | 1 | LSC | ||
| GTCT | 12 | 1 | LSC | ||
| TTTA | 12 | 1 | 49645 | IGS ( | LSC |
| AATA | 12 | 1 | SSC |
* part in the gene.
SSRs that are identical in the S. takesimensis chloroplast genome are highlighted in bold.