| Literature DB >> 28178222 |
Liu He1, Jun Qian2, Xiwen Li3,4, Zhiying Sun5, Xiaolan Xu6, Shilin Chen7,8.
Abstract
The complete chloroplast (cp) genome of Lonicera japonica, a common ornamental and medicinal plant in North America and East Asia, was sequenced and analyzed. The length of the L. japonica cp genome is 155,078 bp, contains a pair of inverted repeat regions (IRa and IRb), of 23,774 bp each, as well as large (LSC, 88,858 bp) and small (SSC, 18,672 bp) single-copy regions. A total of 129 genes were identified in the cp genome, 16 of which were duplicated within the IR regions. Relative to other plant cp genomes, the L. japonica cp genome had a unique rearrangement between trnI-CAU and trnN-GUU. In L. japonica cpDNA, rps19, rpl2, and rpl23 move to the LSC region, from the IR region. The ycf1 pesudogene in the IR region is lost, and only one copy locates in the SSC region. Comparative cp DNA sequence analyses of L. japonica with other cp genomes reveal that the gene order, and the gene and intron contents, are slightly different. The introns in ycf2 and rps18 genes are found for the first time. Four genes (clpP, petB, petD, and rpl16) lost introns. However, its genome structure, GC content, and codon usage were similar to those of typical angiosperm cp genomes. All preferred synonymous codons were found to use codons ending with A/T. The AT-rich sequences were less abundant in the coding regions than in the non-coding ones. A phylogenetic analysis based on 71 protein-coding genes supported the idea that L. japonica is a sister of the Araliaceae species. This study identified unique characteristics of the L. japonica cp genome that contribute to our understanding of the cpDNA evolution. It offers valuable information for the phylogenetic and specific barcoding of this medicinal plant.Entities:
Keywords: Lonicera japonica; chloroplast genome; contraction
Mesh:
Substances:
Year: 2017 PMID: 28178222 PMCID: PMC6155926 DOI: 10.3390/molecules22020249
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Base composition in the L. japonica chloroplast genome.
| T(U) (%) | C (%) | A (%) | G (%) | Length (bp) | ||
|---|---|---|---|---|---|---|
| LSC | 32.1 | 19.0 | 30.8 | 18.1 | 88,858 | |
| SSC | 34.3 | 16.9 | 32.3 | 16.5 | 18,672 | |
| IRa | 28.6 | 23.1 | 27.9 | 20.4 | 23,774 | |
| IRb | 27.9 | 20.4 | 28.6 | 23.1 | 23,774 | |
| Total | 31.2 | 19.6 | 30.2 | 19.0 | 155,078 | |
| CDS | 31.3 | 18.1 | 30.0 | 20.6 | 74,724 | |
| 1st position | 23.7 | 19.2 | 30.0 | 27.1 | 24,908 | |
| 2nd position | 32.8 | 20.5 | 28.8 | 17.9 | 24,908 | |
| 3rd position | 37.5 | 14.4 | 31.3 | 16.8 | 24,908 |
CDS: protein-coding regions.
Figure 1Map of the L. japonica chloroplast genome. Genes drawn inside the circle are transcribed in a clockwise direction, whereas those outside the circle are transcribed in a counterclockwise direction. Genes belonging to the same functional groups have the same colors.
Codon–anticodon recognition patterns and codon usage of the L. japonica chloroplast genome.
| Amino Acid | Codon | No. | RSCU | tRNA | Amino Acid | Codon | No. | RSCU | tRNA |
|---|---|---|---|---|---|---|---|---|---|
| Phe | UUU | 911 | 1.27 | Tyr | UAU | 720 | 1.6 | ||
| Phe | UUC | 527 | 0.73 | Tyr | UAC | 182 | 0.4 | ||
| Leu | UUA | 785 | 1.75 | Stop | UAA | 44 | 1.63 | ||
| Leu | UUG | 570 | 1.27 | Stop | UAG | 21 | 0.78 | ||
| Leu | CUU | 589 | 1.31 | His | CAU | 448 | 1.54 | ||
| Leu | CUC | 202 | 0.45 | His | CAC | 135 | 0.46 | ||
| Leu | CUA | 377 | 0.84 | Gln | CAA | 666 | 1.5 | ||
| Leu | CUG | 169 | 0.38 | Gln | CAG | 220 | 0.5 | ||
| Ile | AUU | 972 | 1.42 | Asn | AAU | 857 | 1.5 | ||
| Ile | AUC | 418 | 0.61 | Asn | AAC | 284 | 0.5 | ||
| Ile | AUA | 660 | 0.97 | Lys | AAA | 942 | 1.43 | ||
| Met | AUG | 582 | 1 | Lys | AAG | 371 | 0.57 | ||
| Val | GUU | 521 | 1.5 | Asp | GAU | 827 | 1.6 | ||
| Val | GUC | 172 | 0.5 | Asp | GAC | 205 | 0.4 | ||
| Val | GUA | 490 | 1.41 | Glu | GAA | 902 | 1.44 | ||
| Val | GUG | 204 | 0.59 | Glu | GAG | 354 | 0.56 | ||
| Ser | UCU | 554 | 1.75 | Cys | UGU | 191 | 1.4 | ||
| Ser | UCC | 312 | 0.99 | Cys | UGC | 82 | 0.6 | ||
| Ser | UCA | 366 | 1.16 | Stop | UGA | 16 | 0.59 | ||
| Ser | UCG | 171 | 0.54 | Trp | UGG | 454 | 1 | ||
| Pro | CCU | 402 | 1.47 | Arg | CGU | 318 | 1.26 | ||
| Pro | CCC | 209 | 0.77 | Arg | CGC | 95 | 0.38 | ||
| Pro | CCA | 330 | 1.21 | Arg | CGA | 367 | 1.45 | ||
| Pro | CCG | 150 | 0.55 | Arg | CGG | 113 | 0.45 | ||
| Thr | ACU | 515 | 1.65 | Arg | AGA | 377 | 1.19 | ||
| Thr | ACC | 226 | 0.72 | Arg | AGG | 117 | 0.37 | ||
| Thr | ACA | 369 | 1.18 | Ser | AGU | 456 | 1.8 | ||
| Thr | ACG | 140 | 0.45 | Ser | AGC | 168 | 0.66 | ||
| Ala | GCU | 603 | 1.76 | Gly | GGU | 544 | 1.27 | ||
| Ala | GCC | 227 | 0.66 | Gly | GGC | 189 | 0.44 | ||
| Ala | GCA | 388 | 1.13 | Gly | GGA | 646 | 1.51 | ||
| Ala | GCG | 155 | 0.45 | Gly | GGG | 331 | 0.77 |
RSCU: Relative Synonymous Codon Usage.
Genes with introns in the L. japonica chloroplast genome, including the exon and intron length.
| Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) |
|---|---|---|---|---|---|---|
| LSC | 411 | 735 | 144 | |||
| SSC | 539 | 1093 | 553 | |||
| IR | 756 | 679 | 777 | |||
| IR | 434 | 661 | 394 | |||
| IR | 498 | 249 | 6105 | |||
| LSC | 430 | 775 | 1619 | |||
| LSC | 114 | - | 232 | 536 | 26 | |
| LSC | 230 | 869 | 40 | |||
| LSC | 39 | 63 | 202 | 309 | 47 | |
| IR | 35 | 809 | 38 | |||
| LSC | 47 | 717 | 23 | |||
| IR | 35 | 947 | 37 | |||
| LSC | 35 | 2535 | 37 | |||
| LSC | 49 | 515 | 36 | |||
| LSC | 36 | 565 | 40 | |||
| LSC | 155 | 758 | 226 | 732 | 126 |
* The rps12 gene is divided into 5′-rps12 in the LSC region and 3′-rps12 in the IR region.
Figure 2Comparison of four chloroplast genomes using mVISTA program. Grey arrows and thick black lines above the alignment indicate the orientation of genes and the position of the IR regions, respectively. Purple bars represent exons, blue ones represent UTRs, and pink ones represent non-coding sequences (CNS). A cut-off of 70% identity was used for the plots. The Y-scale axis represents the percent identity within 50%–100%. Genome regions are color-coded as either protein-coding exons, rRNAs, tRNAs, or conserved noncoding sequences (CNS).
Figure 3Comparison of the borders of the LSC, SSC, and IR regions among five chloroplast genomes. Ψ: pseudogenes, *: the distance from the edge.
Figure 4MP phylogenetic tree of the Apiales clade based on 71 protein-coding genes. The MP tree has a length of 17,973, with a consistency index of 0.8080 and a retention index of 0.8285. Numbers above each node are the bootstrap support values. Solanum lycopersicum and Nicotiana tabacum were set as the outgroups.