| Literature DB >> 28948105 |
Caihui Chen1,2, Yongjie Zheng2, Sian Liu1, Yongda Zhong3, Yanfang Wu2, Jiang Li2, Li-An Xu1, Meng Xu1.
Abstract
Cinnamomum camphora, a member of the Lauraceae family, is a valuable aromatic and timber tree that is indigenous to the south of China and Japan. All parts of Cinnamomum camphora have secretory cells containing different volatile chemical compounds that are utilized as herbal medicines and essential oils. Here, we reported the complete sequencing of the chloroplast genome of Cinnamomum camphora using illumina technology. The chloroplast genome of Cinnamomum camphora is 152,570 bp in length and characterized by a relatively conserved quadripartite structure containing a large single copy region of 93,705 bp, a small single copy region of 19,093 bp and two inverted repeat (IR) regions of 19,886 bp. Overall, the genome contained 123 coding regions, of which 15 were repeated in the IR regions. An analysis of chloroplast sequence divergence revealed that the small single copy region was highly variable among the different genera in the Lauraceae family. A total of 40 repeat structures and 83 simple sequence repeats were detected in both the coding and non-coding regions. A phylogenetic analysis indicated that Calycanthus is most closely related to Lauraceae, both being members of Laurales, which forms a sister group to Magnoliids. The complete sequence of the chloroplast of Cinnamomum camphora will aid in in-depth taxonomical studies of the Lauraceae family in the future. The genetic sequence information will also have valuable applications for chloroplast genetic engineering.Entities:
Keywords: Chloroplast genome; Cinnamomum camphora; Illumina sequencing; Lauraceae; Phylogeny
Year: 2017 PMID: 28948105 PMCID: PMC5609524 DOI: 10.7717/peerj.3820
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Gene map of the Cinnamomum camphora chloroplast genome.
Genes lying inside of the molecule are transcribed in the clockwise direction, where as those outside are transcribed in the counterclockwise direction. Genes are color-coded based on their functional category. The innermost circle denotes the GC content across the genome. The dark gray inner circle corresponds to the GC content, and the light gray corresponds to the AT content of the genome. Large single copy (LSC), small single copy (SSC) and inverted repeat (IRA and IRB) regions are indicated.
List of genes in the chloroplast genome of Cinnamomum camphora.
| Groups of genes | Names of genes | |
|---|---|---|
| Protein synthesis and DNA replication | Transfer RNAs | |
| Ribosomal RNAs | ||
| Ribosomal protein small subunit | ||
| Ribosomal protein large subunit | ||
| Subunits of RNA polymerase | ||
| Photosynthesis | Photosystem I | |
| Photosystem II | ||
| Cytochrome b/f complex | ||
| ATP synthase | ||
| NADH-dehydrogenase | ||
| Large subunit Rubisco | ||
| Miscellaneous group | Translation initiation factor IF-1 | |
| Acetyl-CoA carboxylase | ||
| Cytochrome c biogenesis | ||
| Maturase | ||
| ATP-dependent protease | ||
| Inner membrane protein | ||
| Pseudogenes of unknown function | Conserved hypothetical chloroplast open reading frame |
Relative synonymous codon usage (RSCU) for protein coding in the chloroplast genome of Cinnamomum camphora.
| AA | Codon | ObsFreq | RSCU | AA | Codon | ObsFreq | RSCU |
|---|---|---|---|---|---|---|---|
| 31 | 1.26 | Trp | UGG | 370 | 1 | ||
| UAG | 20 | 0.81 | Ala | 576 | 1.81 | ||
| UGA | 23 | 0.93 | GCC | 207 | 0.65 | ||
| Leu | 658 | 1.79 | 352 | 1.1 | |||
| 463 | 1.26 | GCG | 140 | 0.44 | |||
| 445 | 1.21 | Tyr | 613 | 1.56 | |||
| CUC | 162 | 0.44 | UAC | 171 | 0.44 | ||
| CUA | 315 | 0.86 | His | 420 | 1.51 | ||
| CUG | 160 | 0.44 | CAC | 137 | 0.49 | ||
| Ile | 867 | 1.44 | Gln | 546 | 1.44 | ||
| AUC | 397 | 0.66 | CAG | 210 | 0.56 | ||
| AUA | 540 | 0.9 | Asn | 717 | 1.54 | ||
| 461 | 1.46 | AAC | 213 | 0.46 | |||
| GUC | 165 | 0.52 | Lys | 704 | 1.46 | ||
| 442 | 1.4 | AAG | 261 | 0.54 | |||
| GUG | 193 | 0.61 | Asp | 658 | 1.56 | ||
| Ser | 444 | 1.58 | GAC | 184 | 0.44 | ||
| 288 | 1.02 | Glu | 789 | 1.46 | |||
| 341 | 1.21 | GAG | 189 | 0.54 | |||
| UCG | 176 | 0.63 | Cys | 190 | 1.49 | ||
| 344 | 1.22 | UGC | 65 | 0.51 | |||
| AGC | 96 | 0.34 | Arg | 302 | 1.39 | ||
| Pro | 346 | 1.47 | CGC | 75 | 0.35 | ||
| CCC | 221 | 0.94 | 297 | 1.37 | |||
| 250 | 1.06 | CGG | 99 | 0.46 | |||
| CCG | 122 | 0.52 | 379 | 1.75 | |||
| Thr | 430 | 1.55 | AGG | 18 | 0.68 | ||
| ACC | 227 | 0.82 | Gly | 499 | 1.26 | ||
| 324 | 1.17 | GGC | 189 | 0.48 | |||
| ACG | 126 | 0.46 | 602 | 1.53 | |||
| Met | AUG | 513 | 1 | GGG | 288 | 0.73 |
Notes:
The preferred codons are in bold (RSCU > 1.0).
AA: amino acids.
Figure 2Comparison of the nucleotide variability (Pi) values of the whole plastomes in the (A) Cinnamomum and (B) Lauraceae (window length: 600 bp, step size: 200 bp).
X-axis: positions of the midpoints of a window, Y-axis: nucleotide diversity in each window.
Figure 3Number and type of repeated sequences in five plastomes of Lauraceae.
(A) Frequencies of the palindromic repeats by length; (B) frequencies of the tandem repeats by length; (C) frequencies of the forward repeats by length; (D) total of the three repeat types.
Figure 4Number and type of simple sequence repeats (SSRs) in five plastomes of Lauraceae.
(A) Numbers of different SSR types identified in the five Lauraceae chloroplast genomes and (B) frequencies of identified SSR motifs in different repeat class types. SSRs were classified as the number of repeat unit lengths.
Figure 5Molecular phylogenetic tree of 26 species based on a neighbor joining analysis.
Numbers above and below nodes are bootstrap support values ≥50%.