| Literature DB >> 29462921 |
Jianguo Zhou1, Yingxian Cui2, Xinlian Chen3, Ying Li4, Zhichao Xu5, Baozhong Duan6, Yonghua Li7, Jingyuan Song8, Hui Yao9.
Abstract
Papaver rhoeas L. and P. orientale L., which belong to the family Papaveraceae, are used as ornamental and medicinal plants. The chloroplast genome has been used for molecular markers, evolutionary biology, and barcoding identification. In this study, the complete chloroplast genome sequences of P. rhoeas and P. orientale are reported. Results show that the complete chloroplast genomes of P. rhoeas and P. orientale have typical quadripartite structures, which are comprised of circular 152,905 and 152,799-bp-long molecules, respectively. A total of 130 genes were identified in each genome, including 85 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. Sequence divergence analysis of four species from Papaveraceae indicated that the most divergent regions are found in the non-coding spacers with minimal differences among three Papaver species. These differences include the ycf1 gene and intergenic regions, such as rpoB-trnC, trnD-trnT, petA-psbJ, psbE-petL, and ccsA-ndhD. These regions are hypervariable regions, which can be used as specific DNA barcodes. This finding suggested that the chloroplast genome could be used as a powerful tool to resolve the phylogenetic positions and relationships of Papaveraceae. These results offer valuable information for future research in the identification of Papaver species and will benefit further investigations of these species.Entities:
Keywords: Papaver orientale; Papaver rhoeas; chloroplast genome; comparative analysis; molecular structure; phylogenetic analysis
Mesh:
Substances:
Year: 2018 PMID: 29462921 PMCID: PMC6017017 DOI: 10.3390/molecules23020437
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Gene map of the complete chloroplast genomes of P. rhoeas and P. orientale. Genes inside the circle are transcribed clockwise, whereas those on the outside are transcribed counter-clockwise. The darker gray area in the inner circle corresponds to GC content, whereas the lighter gray area corresponds to AT content.
Base composition in the chloroplast genomes of P. rhoeas and P. orientale.
| Species | Regions | Positions | T(U) (%) | C (%) | A (%) | G (%) | Length (bp) |
|---|---|---|---|---|---|---|---|
| LSC | 31.9 | 19.2 | 30.8 | 18.1 | 83,172 | ||
| SSC | 33.3 | 17.8 | 33.3 | 15.6 | 17,971 | ||
| IRa | 28.6 | 22.2 | 28.3 | 21.0 | 25,881 | ||
| IRb | 28.3 | 21.0 | 28.6 | 22.2 | 25,881 | ||
| Total | 30.9 | 19.8 | 30.3 | 19.0 | 152,905 | ||
| CDS 1 | 31.0 | 18.0 | 30.4 | 20.6 | 78,285 | ||
| 1st position 2 | 23.5 | 18.9 | 30.4 | 27.2 | 26,095 | ||
| 2nd position 3 | 32.1 | 20.6 | 29.2 | 18.1 | 26,095 | ||
| 3rd position 4 | 37.4 | 14.6 | 31.5 | 16.5 | 26,095 | ||
| LSC | 32.0 | 19.1 | 30.9 | 18.1 | 83,151 | ||
| SSC | 33.4 | 17.7 | 33.5 | 15.4 | 17,934 | ||
| IRa | 28.6 | 22.2 | 28.3 | 20.9 | 25,857 | ||
| IRb | 28.3 | 20.9 | 28.6 | 22.2 | 25,857 | ||
| Total | 31.0 | 19.7 | 30.4 | 18.9 | 152,799 | ||
| CDS | 31.1 | 18.0 | 30.4 | 20.6 | 78,117 | ||
| 1st position | 23.5 | 18.9 | 30.4 | 27.2 | 26,039 | ||
| 2nd position | 32.2 | 20.5 | 29.2 | 18.1 | 26,039 | ||
| 3rd position | 37.5 | 14.6 | 31.5 | 16.4 | 26,039 |
1 CDS: protein-coding regions; 2 1st position: 1st base of codons; 3 2nd position: 2nd base of codons; 4 3rd position: 3rd base of codons.
Gene contents in the chloroplast genomes of P. rhoeas and P. orientale.
| Classificaion of Genes | Gene Names | Number of Genes |
|---|---|---|
| Photosystem I | 5 | |
| Photosystem II | 15 | |
| Cytochrome b/f complex | 6 | |
| ATP synthase | 6 | |
| NADH dehydrogenase | 12 | |
| RubisCO large subunit | 1 | |
| RNA polymerase | 4 | |
| Ribosomal proteins (SSU) | 14 | |
| Ribosomal proteins (LSU) | 11 | |
| Ribosomal RNAs | 8 | |
| Proteins of unknown function | 6 | |
| Transfer RNAs | 37 tRNAs (6 contain an intron, 7 in the inverted repeats (IRs)) | 37 |
| Other genes | 6 |
* Gene contains one intron; ** gene contains two introns; (×2) indicates the number of the repeat unit is 2.
Figure 2Codon content of 20 amino acid and stop codons in all protein-coding genes of the chloroplast genomes of two Papaver species. The histogram on the left hand side of each amino acid shows codon usage within the P. rhoeas chloroplast genome, whereas the right hand side illustrates the genome of P. orientale.
Types and amounts of SSRs in the chloroplast genomes of P. rhoeas and P. orientale.
| SSR Type | Repeat Unit | Amount | Ratio(%) | ||
|---|---|---|---|---|---|
| Mono | A/T | 72 | 83 | 92.3 | 92.2 |
| C/G | 6 | 7 | 7.7 | 7.8 | |
| Di | AG/CT | 20 | 18 | 52.6 | 51.4 |
| AT/AT | 16 | 15 | 42.1 | 42.9 | |
| AC/GT | 2 | 2 | 5.3 | 5.7 | |
| Tri | AAG/CTT | 25 | 25 | 41.7 | 43.9 |
| AAT/ATT | 12 | 12 | 20.0 | 21.1 | |
| AAC/GTT | 8 | 8 | 13.3 | 14.0 | |
| ACC/GGT | 3 | 1 | 5.0 | 1.7 | |
| ACT/AGT | 1 | 1 | 1.7 | 1.7 | |
| AGC/CTG | 5 | 5 | 8.3 | 8.8 | |
| AGG/CCT | 3 | 2 | 5.0 | 3.5 | |
| ATC/ATG | 3 | 3 | 5.0 | 5.3 | |
| Tetra | AAAC/GTTT | 1 | 1 | 25.0 | 25.0 |
| AAAT/ATTT | 1 | 1 | 25.0 | 25.0 | |
| AACC/GGTT | 1 | 1 | 25.0 | 25.0 | |
| AGAT/ATCT | 1 | 1 | 25.0 | 25.0 | |
| Hexa | AAGAAT/ATTCTT | 2 | 0 | 100.0 | 0.0 |
Figure 3Distribution of simple sequence repeats (SSRs) in the chloroplast genomes of two Papaver species. (A) SSR type distribution in the chloroplast genomes of two species. (B) Proportion of SSRs in different genomic regions of the chloroplast genomes of two species.
Figure 4Repeat sequences in four chloroplast genomes. REPuter was used to identify repeat sequences with length ≥30 bp and sequence identified ≥90% in the chloroplast genomes. F, P, R, and C indicate the repeat types F (forward), P (palindrome), R (reverse), and C (complement), respectively. Repeats with different lengths are indicated in different colors.
Figure 5The borders of LSC, SSC and IR regions among six chloroplast genomes. The number above the gene features means the distance between the ends of genes and the borders sites. The IRb/SSC border extended into the ycf1 genes to create various lengths of ycf1 pseudogenes among the six chloroplast genomes. The IRb/LSC border extended into the rps19 genes to create various lengths of rps19 pseudogenes among the five chloroplast genomes. The arrows indicated the orientation (5’ → 3’) of the ycf1, rps19, and ndhF genes. These features are not to scale.
Figure 6Sequence identity plot comparing the four chloroplast genomes with P. rhoeas as a reference by using mVISTA. Gray arrows and thick black lines above the alignment indicate genes with their orientation and the position of the inverted repeats (IRs), respectively. A cut-off of 70% identity was used for the plots, and the Y-scale represents the percent identity ranging from 50 to 100%.
Figure 7Sliding window analysis of the whole chloroplast genomes. Window length: 800 bp; step size: 200 bp. X-axis: position of the midpoint of a window. Y-axis: nucleotide diversity of each window. (A) Pi among three Papaver species. (B) Pi among three Papaver species and C. hylomeconoides. ①–⑧ indicate mutational hotspots and highly divergent loci.
Figure 8Phylogenetic trees constructed with 54 protein-coding genes of 32 species using maximum likelihood (ML) and maximum parsimony (MP) methods. Numbers at nodes are values for bootstrap support. These trees are unrooted cladograms. (A) ML tree; (B) MP tree.