| Literature DB >> 28611359 |
Yun Song1,2, Shaojun Wang3, Yuanming Ding3, Jin Xu1,2, Ming Fu Li1,2, Shuifang Zhu1, Naizhong Chen4,5.
Abstract
Paris is famous in China for its medicinal value and has been included in the Chinese Pharmacopoeia. Inaccurate identification of these species could confound their effective exploration, conservation, and domestication. Due to the plasticity of the morphological characteristics, correct identification among Paris species remains problematic. In this regard, we report the complete chloroplast genome of P. thibetica and P. rugosa to develop highly variable molecular markers. Comparing three chloroplast genomes, we sought out the most variable regions to develop the best cpDNA barcodes for Paris. The size of Paris chloroplast genome ranged from 162,708 to 163,200 bp. A total of 134 genes comprising 81 protein coding genes, 45 tRNA genes and 8 rRNA genes were observed in all three chloroplast genomes. Eight rapidly evolving regions were detected, as well as the difference of simple sequence repeats (SSR) and repeat sequence. Two regions of the coding gene ycf1, ycf1a and ycf1b, evolved the quickest and were proposed as core barcodes for Paris. The complete chloroplast genome sequences provide more integrated and adequate information for better understanding the phylogenetic pattern and improving efficient discrimination during species identification.Entities:
Mesh:
Year: 2017 PMID: 28611359 PMCID: PMC5469780 DOI: 10.1038/s41598-017-02083-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gene map of Paris Chloroplast genome. The genes inside and outside of the circle are transcribed in the clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are shown in different colors. The thick lines indicate the extent of the inverted repeats (IRa and IRb) that separate the genomes into small single-copy (SSC) and large single-copy (LSC) regions.
Comparison of feature of Paris rugose, Paris thibetica, Paris polyphylla var. yunnanensis.
| Species |
|
|
|
|---|---|---|---|
| Accession number | KY247142 | KY247143 | KT805945 |
| Genome size (bp) | 163200 | 162708 | 162990 |
| LSC (bp) | 84058 | 84010 | 84108 |
| SSC (bp) | 12854 | 12930 | 12984 |
| IRs (bp) | 33144 | 32884 | 32949 |
| Number of protein-coding genes1 | 81(9) | 81(9) | 81(9) |
| Number of tRNAs genes1 | 45(8) | 45(8) | 45(8) |
| Nubmer of rRNAs genes1 | 8(4) | 8(4) | 8(4) |
| GC content (%) | 37.1 | 37.2 | 37.1 |
| Raw data read number | 10380007 | 26745248 | / |
| Mapped read number | 401240 | 297202 | / |
| Chloroplast coverage(X) | 368 | 373 | / |
1The numbers in parenthesis indicate the genes duplicated in the IR regions.
Figure 2Analysis of repeated sequences in the three Paris chloroplast genomes. (A) Frequency of simple sequence repeats (SSRs) by MISA. (B) Frequency of repeat sequences determined by REPuter.
Figure 3Sliding window analysis of the whole chloroplast genome of three Paris species. (window length: 600 bp, step size: 200 bp). X-axis: position of the midpoint of a window, Y-axis: nucleotide diversity of each window.
Variability of the seven new markers and universal chloroplast DNA barcode in Paris.
| Markers | Region | Length | Variable sites | Information sites | Nucleotide diversity | Discrimination success (%) based on Distance method | ||
|---|---|---|---|---|---|---|---|---|
| Numbers | % | Numbers | % | |||||
|
| LSC | 1279 | 80 | 6.25% | 38 | 2.97% | 0.00602 | 47.37% |
|
| LSC | 999 | 24 | 2.40% | 17 | 1.70% | 0.00589 | 15.79% |
|
| LSC | 1035 | 12 | 1.16% | 8 | 0.77% | 0.00226 | 42.11% |
|
| IR | 940 | 29 | 3.09% | 21 | 2.23% | 0.00846 | 36.84% |
|
| IR | 1140 | 88 | 7.72% | 57 | 5.00% | 0.01826 | 52.63% |
|
| IR | 556 | 36 | 6.47% | 23 | 4.14% | 0.01442 | 31.58% |
|
| SSC | 1010 | 53 | 5.25% | 31 | 3.07% | 0.00226 | 47.37% |
|
| IR | 1696 | 124 | 7.31% | 80 | 4.72% | 0.01737 | 89.47% |
|
| LSC | 734 | 7 | 0.95% | 4 | 0.54% | 0.00197 | 21.05% |
|
| LSC | 602 | 10 | 1.66% | 8 | 1.33% | 0.00456 | 26.32% |
|
| LSC | 876 | 6 | 0.68% | 5 | 0.57% | 0.00172 | 5.26% |
|
| LSC | 2212 | 23 | 1.04% | 17 | 0.77% | 0.00256 | 42.11% |
Figure 4NJ tree for Paris using the rbcL + matK + trnH-psbA (A) and ycf1a + ycf1b (B) DNA barcode combination. NJ topology shown with NJ/MP/ML bootstrap support values were listed at each node.