| Literature DB >> 28575004 |
Xiang-Qin Yu1,2, Bryan T Drew3, Jun-Bo Yang1, Lian-Ming Gao2, De-Zhu Li1.
Abstract
Schima is an ecologically and economically important woody genus in tea family (Theaceae). Unresolved species delimitations and phylogenetic relationships within Schima limit our understanding of the genus and hinder utilization of the genus for economic purposes. In the present study, we conducted comparative analysis among the complete chloroplast (cp) genomes of 11 Schima species. Our results indicate that Schima cp genomes possess a typical quadripartite structure, with conserved genomic structure and gene order. The size of the Schima cp genome is about 157 kilo base pairs (kb). They consistently encode 114 unique genes, including 80 protein-coding genes, 30 tRNAs, and 4 rRNAs, with 17 duplicated in the inverted repeat (IR). These cp genomes are highly conserved and do not show obvious expansion or contraction of the IR region. The percent variability of the 68 coding and 93 noncoding (>150 bp) fragments is consistently less than 3%. The seven most widely touted DNA barcode regions as well as one promising barcode candidate showed low sequence divergence. Eight mutational hotspots were identified from the 11 cp genomes. These hotspots may potentially be useful as specific DNA barcodes for species identification of Schima. The 58 cpSSR loci reported here are complementary to the microsatellite markers identified from the nuclear genome, and will be leveraged for further population-level studies. Phylogenetic relationships among the 11 Schima species were resolved with strong support based on the cp genome data set, which corresponds well with the species distribution pattern. The data presented here will serve as a foundation to facilitate species identification, DNA barcoding and phylogenetic reconstructions for future exploration of Schima.Entities:
Mesh:
Year: 2017 PMID: 28575004 PMCID: PMC5456055 DOI: 10.1371/journal.pone.0178026
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
List of taxa sampled in this study, with the voucher, chloroplast genome size, Illumina reads and coverage depth information.
| Taxon | Voucher specimen | Sources | Genome size | LSC length (bp) | SSC length (bp) | IR length (bp) | GC content (%) | No. reads (trimmed) | Mean coverage | GenBank No. |
|---|---|---|---|---|---|---|---|---|---|---|
| YXQ041 | Yunnan, China | 157,245 | 87,222 | 18,091 | 25,966 | 37.43 | 7,448,533 | 620.1 | KY406780 | |
| YXQ069 | Yunnan, China | 157,227 | 87,202 | 18,089 | 25,968 | 37.44 | 687,249 | 1538.6 | KY406758 | |
| YXQ103 | Hainan, China | 157,288 | 87,232 | 18,104 | 25,976 | 37.44 | 701,233 | 1679.4 | KY406755 | |
| YXQ070 | Yunnan, China | 157,252 | 87,208 | 18,112 | 25,966 | 37.43 | 459,809 | 1106.5 | KY406794 | |
| YXQ146 | Guangxi, China | 157,278 | 87,233 | 18,103 | 25,971 | 37.44 | 737,292 | 1882.5 | KY406763 | |
| YXQ034 | Yunnan, China | 157,278 | 87,217 | 18,091 | 25,985 | 37.43 | 527,901 | 1284.3 | KY406787 | |
| YXQ186 | Hunan, China | 157,284 | 87,229 | 18,103 | 25,976 | 37.43 | 562,801 | 1389.5 | KY406749 | |
| YXQ053 | Yunnan, China | 157,302 | 87,272 | 18,122 | 25,954 | 37.45 | 748,129 | 1735.2 | KY406779 | |
| YXQ2902 | Sichuan, China | 157,297 | 87,243 | 18,102 | 25,976 | 37.45 | 10,152,425 | 716.8 | KY406762 | |
| YXQ142 | Guangxi, China | 157,254 | 87,202 | 18,100 | 25,976 | 37.44 | 457,215 | 1221.9 | KY406788 | |
| YXQ001 | Yunnan, China | 157,240 | 87,204 | 18,104 | 25,966 | 37.44 | 28,866 | 71.7 | KY406795 |
Voucher specimens were deposited in the Herbarium of Kunming Institute of Botany (KUN), Chinese Academy of Sciences.
Fig 1Gene map of Schima superba chloroplast genome.
Fig 2Comparisons of the border regions among the chloroplast genomes of 11 Schima species.
ycf1* (ycf1_like) and rps19* (rps19_like) represent the incomplete duplication of the gene within the IR region.
Fig 3mVISTA percent identity plot comparison among the chloroplast genomes with S. sinensis as a reference.
Sequence divergence of 49 variable coding regions (>150 bp) from 11 chloroplast genomes of Schima, with one of the Inverted Repeat regions removed.
| Fragments | Length (bp) | Aligned length (bp) | Variable positions | Nucleotide substitutions | Number of indels | Total length of indels | Percent variability (%) |
|---|---|---|---|---|---|---|---|
| 1527 | 1527 | 5 | 5 | 0 | 0 | 0.33 | |
| 186 | 186 | 1 | 1 | 0 | 0 | 0.54 | |
| 153–156 | 156 | 3 | 0 | 1 | 3 | 0.64 | |
| 1524 | 1524 | 3 | 3 | 0 | 0 | 0.20 | |
| 567 | 567 | 2 | 2 | 0 | 0 | 0.35 | |
| 744 | 744 | 1 | 1 | 0 | 0 | 0.13 | |
| 711 | 711 | 1 | 1 | 0 | 0 | 0.14 | |
| 4137 | 4137 | 11 | 11 | 0 | 0 | 0.27 | |
| 2061 | 2061 | 3 | 3 | 0 | 0 | 0.15 | |
| 3213 | 3213 | 11 | 11 | 0 | 0 | 0.34 | |
| 1422 | 1422 | 3 | 3 | 0 | 0 | 0.21 | |
| 2205 | 2205 | 2 | 2 | 0 | 0 | 0.09 | |
| 2253 | 2253 | 6 | 6 | 0 | 0 | 0.27 | |
| 606 | 606 | 2 | 2 | 0 | 0 | 0.33 | |
| 678 | 678 | 1 | 1 | 0 | 0 | 0.15 | |
| 363 | 363 | 2 | 2 | 0 | 0 | 0.55 | |
| 402 | 402 | 1 | 1 | 0 | 0 | 0.25 | |
| 1497 | 1497 | 1 | 1 | 0 | 0 | 0.07 | |
| 1428 | 1428 | 3 | 3 | 0 | 0 | 0.21 | |
| 1542 | 1542 | 4 | 4 | 0 | 0 | 0.26 | |
| 555 | 555 | 3 | 3 | 0 | 0 | 0.54 | |
| 690 | 690 | 2 | 2 | 0 | 0 | 0.29 | |
| 963 | 963 | 2 | 2 | 0 | 0 | 0.21 | |
| 354 | 354 | 1 | 1 | 0 | 0 | 0.28 | |
| 372 | 372 | 1 | 1 | 0 | 0 | 0.27 | |
| 645 | 645 | 3 | 3 | 0 | 0 | 0.47 | |
| 1527 | 1527 | 2 | 2 | 0 | 0 | 0.13 | |
| 663 | 663 | 4 | 4 | 0 | 0 | 0.60 | |
| 1014 | 1014 | 2 | 2 | 0 | 0 | 0.20 | |
| 417 | 417 | 3 | 3 | 0 | 0 | 0.72 | |
| 234 | 234 | 1 | 1 | 0 | 0 | 0.43 | |
| 408 | 408 | 1 | 1 | 0 | 0 | 0.25 | |
| 369 | 369 | 3 | 3 | 0 | 0 | 0.81 | |
| 411 | 411 | 2 | 2 | 0 | 0 | 0.49 | |
| 657 | 657 | 2 | 2 | 0 | 0 | 0.30 | |
| 474 | 474 | 1 | 1 | 0 | 0 | 0.21 | |
| 6867–6873 | 6873 | 8 | 2 | 1 | 6 | 0.04 | |
| 468 | 468 | 1 | 1 | 0 | 0 | 0.21 | |
| 2247–2253 | 2253 | 26 | 14 | 2 | 12 | 0.71 | |
| 162 | 162 | 1 | 1 | 0 | 0 | 0.62 | |
| 963 | 963 | 10 | 10 | 0 | 0 | 1.04 | |
| 1530 | 1530 | 3 | 3 | 0 | 0 | 0.20 | |
| 246 | 246 | 1 | 1 | 0 | 0 | 0.41 | |
| 306 | 306 | 1 | 1 | 0 | 0 | 0.33 | |
| 531 | 531 | 3 | 3 | 0 | 0 | 0.56 | |
| 1092 | 1092 | 2 | 2 | 0 | 0 | 0.18 | |
| 1182 | 1182 | 2 | 2 | 0 | 0 | 0.17 | |
| 273 | 273 | 3 | 3 | 0 | 0 | 1.10 | |
| 5652–5658 | 5658 | 43 | 37 | 1 | 6 | 0.67 |
Fig 4Percentage of variation in 79 variable noncoding regions of the 11 Schima chloroplast genomes.
These regions are oriented according to their locations in the genome.
Sequence divergence of 79 variable noncoding loci (>150 bp) from 11 chloroplast genomes of Schima, with one of the invert repeat regions removed.
| Fragments | Length (bp) | Aligned length (bp) | Variable positions | Nucleotide substitutions | Number of indels | Total length of indels | Percent variability (%) |
|---|---|---|---|---|---|---|---|
| 395–426 | 428 | 37 | 4 | 1 | 33 | 1.17 | |
| 220 | 220 | 1 | 1 | 0 | 0 | 0.45 | |
| 270 | 270 | 2 | 2 | 0 | 0 | 0.74 | |
| 712–713 | 713 | 2 | 1 | 1 | 1 | 0.28 | |
| 809–818 | 819 | 22 | 10 | 4 | 12 | 1.71 | |
| 838–844 | 845 | 10 | 3 | 2 | 7 | 0.59 | |
| 1689–1698 | 1700 | 36 | 18 | 4 | 18 | 1.29 | |
| 333 | 333 | 2 | 2 | 0 | 0 | 0.60 | |
| 345 | 345 | 3 | 3 | 0 | 0 | 0.87 | |
| 681–682 | 682 | 3 | 3 | 0 | 0 | 0.44 | |
| 690–696 | 696 | 8 | 2 | 1 | 6 | 0.43 | |
| 276–311 | 311 | 37 | 2 | 1 | 35 | 0.96 | |
| 701 | 701 | 2 | 2 | 0 | 0 | 0.29 | |
| 387–389 | 389 | 7 | 5 | 1 | 2 | 1.54 | |
| 1143–1153 | 1153 | 15 | 4 | 4 | 11 | 0.69 | |
| 254–255 | 255 | 1 | 0 | 1 | 1 | 0.39 | |
| 732–734 | 734 | 2 | 0 | 1 | 2 | 0.14 | |
| 1211–1222 | 1222 | 26 | 9 | 4 | 17 | 1.06 | |
| 722–727 | 727 | 8 | 3 | 1 | 5 | 0.55 | |
| 1123–1125 | 1125 | 15 | 13 | 1 | 2 | 1.24 | |
| 1136–1153 | 1153 | 27 | 8 | 5 | 19 | 1.13 | |
| 473 | 473 | 5 | 5 | 0 | 0 | 1.06 | |
| 1513–1517 | 1519 | 19 | 11 | 3 | 8 | 0.92 | |
| 234–239 | 239 | 6 | 1 | 1 | 5 | 0.84 | |
| 283 | 283 | 1 | 1 | 0 | 0 | 0.35 | |
| 157–159 | 160 | 5 | 2 | 2 | 3 | 2.50 | |
| 154 | 154 | 2 | 2 | 0 | 0 | 1.30 | |
| 747–755 | 755 | 17 | 3 | 3 | 14 | 0.79 | |
| 721 | 721 | 2 | 2 | 0 | 0 | 0.28 | |
| 726 | 726 | 2 | 2 | 0 | 0 | 0.28 | |
| 839–842 | 842 | 8 | 4 | 3 | 4 | 0.83 | |
| 293 | 293 | 1 | 1 | 0 | 0 | 0.34 | |
| 982–999 | 999 | 46 | 19 | 6 | 27 | 2.50 | |
| 522–529 | 529 | 7 | 0 | 1 | 7 | 0.19 | |
| 704–714 | 715 | 21 | 3 | 4 | 18 | 0.98 | |
| 400–414 | 417 | 36 | 3 | 6 | 33 | 2.16 | |
| 585 | 585 | 2 | 2 | 0 | 0 | 0.34 | |
| 166 | 166 | 1 | 1 | 0 | 0 | 0.60 | |
| 229–238 | 240 | 13 | 0 | 2 | 13 | 0.83 | |
| 765–768 | 768 | 5 | 2 | 1 | 3 | 0.39 | |
| 525–526 | 526 | 4 | 3 | 1 | 1 | 0.76 | |
| 681–683 | 683 | 4 | 2 | 1 | 2 | 0.44 | |
| 423–425 | 425 | 3 | 1 | 1 | 2 | 0.47 | |
| 909–915 | 915 | 13 | 7 | 1 | 6 | 0.87 | |
| 220–228 | 228 | 8 | 0 | 2 | 8 | 0.88 | |
| 1035–1042 | 1043 | 12 | 4 | 3 | 8 | 0.67 | |
| 1277–1287 | 1287 | 22 | 12 | 2 | 10 | 1.09 | |
| 185–186 | 186 | 1 | 0 | 1 | 1 | 0.54 | |
| 170–175 | 175 | 9 | 4 | 1 | 5 | 2.86 | |
| 391–393 | 393 | 2 | 0 | 1 | 2 | 0.25 | |
| 455–457 | 457 | 5 | 3 | 1 | 2 | 0.88 | |
| 175–176 | 176 | 2 | 1 | 1 | 1 | 1.14 | |
| 284 | 284 | 1 | 1 | 0 | 0 | 0.35 | |
| 786 | 786 | 4 | 4 | 0 | 0 | 0.51 | |
| 598–605 | 607 | 13 | 3 | 5 | 10 | 1.32 | |
| 797–798 | 798 | 3 | 2 | 1 | 1 | 0.38 | |
| 473–479 | 479 | 8 | 2 | 1 | 6 | 0.63 | |
| 172–174 | 174 | 5 | 3 | 1 | 2 | 2.30 | |
| 787 | 787 | 3 | 3 | 0 | 0 | 0.38 | |
| 711 | 711 | 7 | 7 | 0 | 0 | 0.98 | |
| 200–213 | 215 | 15 | 0 | 5 | 15 | 2.33 | |
| 196 | 196 | 1 | 1 | 0 | 0 | 0.51 | |
| 996–998 | 998 | 13 | 11 | 1 | 2 | 1.20 | |
| 150–200 | 200 | 52 | 2 | 1 | 50 | 1.50 | |
| 536 | 536 | 2 | 2 | 0 | 0 | 0.37 | |
| 1602–1619 | 1619 | 29 | 2 | 5 | 27 | 0.43 | |
| 938 | 938 | 1 | 1 | 0 | 0 | 0.11 | |
| 152 | 152 | 1 | 1 | 0 | 0 | 0.66 | |
| 256–275 | 275 | 19 | 0 | 1 | 19 | 0.36 | |
| 248–249 | 249 | 2 | 1 | 1 | 1 | 0.80 | |
| 595 | 595 | 7 | 7 | 0 | 0 | 1.18 | |
| 825–847 | 851 | 32 | 6 | 4 | 26 | 1.18 | |
| 908–926 | 928 | 36 | 13 | 6 | 23 | 2.05 | |
| 240–244 | 245 | 6 | 1 | 2 | 5 | 1.22 | |
| 251–254 | 254 | 4 | 1 | 1 | 3 | 0.79 | |
| 230 | 230 | 5 | 5 | 0 | 0 | 2.17 | |
| 359–360 | 360 | 6 | 5 | 1 | 1 | 1.67 | |
| 1106–1111 | 1112 | 16 | 8 | 3 | 8 | 0.99 | |
| 379 | 379 | 5 | 5 | 0 | 0 | 1.32 |
Location of SSR loci within the 11 Schima genomes.
| No. | Motif | Location | Region | Repeat length |
|---|---|---|---|---|
| 1 | A | LSC | 12–43 | |
| 2 | A | LSC | 10–11 | |
| 3 | A | LSC | 9–10 | |
| 4 | A | LSC | 11–14 | |
| 5 | A | LSC | 8–10 | |
| 6 | T | LSC | 9–14 | |
| 7 | A | LSC | 8–10 | |
| 8 | T | LSC | 8–11 | |
| 9 | T | LSC | 10, 13 | |
| 10 | A | LSC | 14,15 | |
| 11 | T | LSC | 9–12 | |
| 12 | AT | LSC | 12,14 | |
| 13 | A | LSC | 13–20 | |
| 14 | T | LSC | 12,13 | |
| 15 | A | LSC | 10,11 | |
| 16 | A | LSC | 9,10 | |
| 17 | T | LSC | 10,11 | |
| 18 | T | LSC | 9,10 | |
| 19 | A | LSC | 9–14 | |
| 20 | T | LSC | 11–16 | |
| 21 | C | LSC | 8, 10 | |
| 22 | A | LSC | 9,10 | |
| 23 | A | LSC | 10–14 | |
| 24 | A | LSC | 11,12 | |
| 25 | A | LSC | 11,12 | |
| 26 | A | LSC | 13,14 | |
| 27 | T | LSC | 9, 10 | |
| 28 | T | LSC | 8–16 | |
| 29 | T | LSC | 9–12 | |
| 30 | TTA | LSC | 3,12 | |
| 31 | T | LSC | 9,10 | |
| 32 | T | LSC | 9–12 | |
| 33 | T | LSC | 12–15 | |
| 34 | T | LSC | 11,12 | |
| 35 | T | LSC | 13–15 | |
| 36 | T | LSC | 10–12 | |
| 37 | T | LSC | 10–13 | |
| 38 | T | LSC | 10,11 | |
| 39 | T | LSC | 10,11 | |
| 40 | T | LSC | 10–12 | |
| 41 | T | LSC | 6,10 | |
| 42 | T | LSC | 10,11 | |
| 43 | A | LSC | 9–11 | |
| 44 | A | LSC | 9,10 | |
| 45 | ATT | LSC | 6,12 | |
| 46 | A | LSC | 8–10 | |
| 47 | T | LSC | 9,10 | |
| 48 | A | LSC | 9–11 | |
| 49 | A | LSC | 9,10 | |
| 50 | A | IR | 9,10 | |
| 51 | T | SSC | 9–12 | |
| 52 | T | SSC | 10,11 | |
| 53 | A | SSC | 10,11 | |
| 54 | T | SSC | 10–13 | |
| 55 | T | SSC | 10–13 | |
| 56 | T | SSC | 9–12 | |
| 57 | T | SSC | 10,11 | |
| 58 | A | SSC | 9,10 |
Fig 5Phylogenetic relationships among the 11 Schima species.
The phylogenetic tree was reconstructed using the whole chloroplast genome data set minus a copy of the IR region. Numbers above the branches show bootstrap support values that are above 80%.