| Literature DB >> 31886246 |
Qiuyue Ma1, Yanan Wang2, Lu Zhu1, Changwei Bi3, Shuxian Li2, Shushun Li1, Jing Wen1, Kunyuan Yan1, Qianzhong Li1.
Abstract
Acer truncatum, which is a new woody oil tree species, is an important ornamental and medicinal plant in China. To assess the genetic diversity and relationships of A. truncatum, we analyzed its complete chloroplast (cp) genome sequence. The A. truncatum cp genome comprises 156,492 bp, with the large single-copy, small single-copy, and inverted repeat (IR) regions consisting of 86,010, 18,050, and 26,216 bp, respectively. The A. truncatum cp genome contains 112 unique functional genes (i.e., 4 rRNA, 30 tRNA, and 78 protein-coding genes) as well as 78 simple sequence repeats, 9 forward repeats, 1 reverse repeat, 5 palindromic repeats, and 7 tandem repeats. We analyzed the expansion/contraction of the IR regions in the cp genomes of six Acer species. A comparison of these cp genomes indicated the noncoding regions were more diverse than the coding regions. A phylogenetic analysis revealed that A. truncatum is closely related to A. miaotaiense. Moreover, a novel ycf4-cemA indel marker was developed for distinguishing several Acer species (i.e., A. buergerianum, A. truncatum, A. henryi, A. negundo, A. ginnala, and A. tonkinense). The results of the current study provide valuable information for future evolutionary studies and the molecular barcoding of Acer species.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31886246 PMCID: PMC6925723 DOI: 10.1155/2019/7417239
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Map of the complete A. truncatum chloroplast genome. Genes marked inside the circle are transcribed in a clockwise direction, and those on the outside of the circle are transcribed in a counter-clockwise direction. The dark and light gray areas correspond to the GC and AT contents, respectively.
Details regarding the A. truncatum chloroplast genome.
| Characteristics | Chloroplast features | ||
|---|---|---|---|
| Total cp DNA size (bp) | 156, 492 | ||
| LSC size (bp) | 86, 010 | ||
| SSC size (bp) | 18, 050 | ||
| IR size (bp) | 26, 216 | ||
| Total GC content (%) | 37.9 | ||
| GC content of LSC (%) | 36.1 | ||
| GC content of SSC (%) | 32.2 | ||
| GC content of IR (%) | 42.8 | ||
| Total CDS length (bp) | 77, 796 | ||
| Protein-coding genes | 78 | ||
| tRNAs | 30 | ||
| rRNAs | 4 | ||
| Genes duplicated | 20 | ||
| Genes with a single intron(s) | 15 | ||
| Gene with two introns | 3 | ||
| Pseudogenes | 2 |
Gene composition of the A. truncatum chloroplast genome.
| Classification of genes | Gene name | Number | |
|---|---|---|---|
| 1 | Photosystem I | psaA, B, C, I, J | 5 |
| 2 | ATP synthase | atpA, B, E, Fa, H, I | 6 |
| 3 | Photosystem II | psbA, B, C, D, E, F, H, I, J, K, M, N, T, Z | 15 |
| 4 | Rubisco | rbcL | 1 |
| 5 | Cytochrome | petA, Ba, Da, G, L, N | 6 |
| 6 | NADH oxidoreductase | ndhAa, Ba,c(×2), C, D, E, F, G, H, I, J, K | 12 |
| 7 | Ribosomal proteins (SSU) | rps2, 3, 4, 7c(×2), 8, 11, 12(×2)a,c,d, 14, 15, 16a, 18, 19c | 14 |
| 8 | Ribosomal proteins (LSU) | rpl2a,c(×2), 14, 16a, 20, 22, 23c(×2), 32, 33, 36 | 11 |
| 9 | RNA polymerase | rpoA, rpoB, rpoC1a, rpoC2 | 4 |
| 10 | Ribosomal RNAs | rrn4.5c(×2), 5c(×2), 16c(×2), 23c(×2) | 8 |
| 11 | Other proteins | accD, ccsA, matK, clpPb, cemA | 5 |
| 12 | Transfer RNAs | trnA-UGC(×2)a,c, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnfM-CAU, trnG-UCCa, trnG-GCC, trnH-GUG(×2), trnI-GAU (×2)a,c, trnL-CAAc(×2), trnL-UAAa, trnL-UAG, trnM-CAU, trnN-GUU(×2)c, trnP-UGG, trnQ-UUG, trnK-UUUa trnR-UCU, trnS-GCU, trnS-GGA, trnS-UGA, trnT-GGU, trnT-UGU, trnV-GAC(×2)c, trnR-ACG (×2)c, trnV-UACa, trnW-CCA, trnY-GUA, trnI-CAU(×2) c | 38 |
| 13 | Hypothetical proteins | ycf1, 2c(×2), 3b, 4, 15c(×2), | 7 |
| 14 | Pseudogenes | infA, ycf1 | 2 |
| Total | 134 |
aA single intron gene.
bTwo introns gene.
cTwo gene copies in IRs regions.
dGene divided into two independent transcription units.
ePseudogene.
Codon usage in the A. truncatum chloroplast genome.
| Amino acid | Codon | No. | RSCUa | Amino acid | Codon | No. | RSCU |
|---|---|---|---|---|---|---|---|
| Phe | UUC | 555 | 0.74 | Tyr | UAC | 187 | 0.4 |
| Phe | UUU | 955 | 1.26 | Tyr | UAU | 759 | 1.6 |
| Leu | CUC | 227 | 0.5 | His | CAC | 161 | 0.5 |
| Leu | UUG | 555 | 1.21 | Stop | UAG∗ | 24 | 0.73 |
| Leu | UUA | 822 | 1.80 | Stop | UAA∗ | 53 | 1.62 |
| Leu | CUU | 560 | 1.22 | His | CAU | 479 | 1.5 |
| Leu | CUG | 207 | 0.45 | Gln | CAG | 224 | 0.5 |
| Leu | CUA | 376 | 0.82 | Gln | CAA | 667 | 1.5 |
| Ile | AUC | 445 | 0.62 | Asn | AAC | 293 | 0.48 |
| Ile | AUU | 1054 | 1.46 | Asn | AAU | 921 | 1.52 |
| Ile | AUA | 668 | 0.92 | Lys | AAA | 944 | 1.46 |
| Met | AUG | 615 | 1 | Lys | AAG | 345 | 0.54 |
| Val | GUU | 522 | 1.45 | Asp | GAU | 809 | 1.56 |
| Val | GUC | 182 | 0.51 | Asp | GAC | 229 | 0.44 |
| Val | GUG | 205 | 0.57 | Glu | GAG | 362 | 0.55 |
| Val | GUA | 532 | 1.48 | Glu | GAA | 966 | 1.45 |
| Ser | UCC | 352 | 1.03 | Cys | UGC | 81 | 0.53 |
| Ser | UCU | 531 | 1.56 | Cys | UGU | 222 | 1.47 |
| Ser | UCG | 202 | 0.59 | Trp | UGG | 440 | 1 |
| Ser | UCA | 433 | 1.27 | Stop | UGA∗ | 21 | 0.64 |
| Pro | CCC | 221 | 0.8 | Arg | CGC | 126 | 0.49 |
| Pro | CCU | 417 | 1.52 | Arg | CGU | 308 | 1.19 |
| Pro | CCG | 153 | 0.56 | Arg | CGG | 144 | 0.56 |
| Pro | CCA | 309 | 1.12 | Arg | CGA | 352 | 1.36 |
| Thr | ACC | 256 | 0.78 | Ser | AGC | 132 | 0.39 |
| Thr | ACU | 501 | 1.53 | Ser | AGU | 397 | 1.16 |
| Thr | ACG | 160 | 0.49 | Arg | AGG | 182 | 0.7 |
| Thr | ACA | 390 | 1.19 | Arg | AGA | 441 | 1.7 |
| Ala | GCC | 236 | 0.66 | Gly | GGC | 188 | 0.41 |
| Ala | GCU | 617 | 1.74 | Gly | GGU | 591 | 1.29 |
| Ala | GCG | 194 | 0.55 | Gly | GGG | 338 | 0.74 |
| Ala | GCA | 373 | 1.05 | Gly | GGA | 721 | 1.57 |
a Relative synonymous codon usage.
∗ Stop codon.
Figure 2Type and length of repeats in the A. truncatum chloroplast genome.
Figure 3Analysis of SSRs in the A. truncatum chloroplast genome. (a) Number of detected SSR types. (b) Frequency of SSRs in the tRNA genes, protein-coding genes, intergenic spacers, and introns.
Figure 4Comparison of the IR boundaries among the chloroplast genomes of six Acer species. Boxes above or below the main line indicate the adjacent border genes. ø, pseudogene.
Figure 5Comparison of the chloroplast genomes of six Acer species with mVISTA. The 50% and 100% refer to the similarity among sequences. Gray arrows above the aligned sequences represent genes and their orientation.
Figure 6K /K ratios for 78 protein-coding genes in A. truncatum, C. platymamma, D. longan, and S. mombin.
Figure 7Maximum-likelihood tree based on 64 single-copy orthologous genes shared among 23 species. Numbers at the nodes are bootstrap support values. The position of A. truncatum is indicated in bold. Euonymus hamiltonianus served as the outgroup.
Figure 8Divergence times of 22 species of Sapindales based on complete chloroplast sequences. Values at nodes indicate divergence dates in millions of years. Euonymus hamiltonianus served as the outgroup.
Details regarding the primers used to develop the ycf4-cemA marker.
| Primer name | Primer sequence (5′ > 3′) | Position |
|---|---|---|
|
| GCTGGGCGTTTATCCTTTTT |
|
|
| GGATTGTTTCTTTGTGGAGC |
Figure 9Schematic diagram of the development of the ycf4-cemA indel marker in six Acer species. (a) Results of the PCR amplification of the ycf4-cemA indel marker in the following Acer species: (1) A. tonkinense, (2) A. ginnala, (3) A. negundo, (4) A. henryi, (5) A. truncatum, and (6) A. buergerianum; M, 2,000-bp ladder. (b) Alignment of the ycf4-cemA marker sequence with MEGA 6.0. The arrow indicates the 91-bp deletion in A. buergerianum.