| Literature DB >> 35271574 |
Xiaonong Guo1,2,3, Zhuanxia Wang1,2,3, Deyu Cai1,2,3, Lei Song1, Jialin Bai1,2,3.
Abstract
Apocynum venetum L. (Apocynaceae) is valuable for its medicinal compounds and fiber content. Native A. venetum populations are threatened and require protection. Wild A. venetum resources are limited relative to market demand and a poor understanding of the composition of A. venetum at the molecular level. The chloroplast genome contains genetic markers for phylogenetic analysis, genetic diversity evaluation, and molecular identification. In this study, the entire genome of the A. venetum chloroplast was sequenced and analyzed. The A. venetum cp genome is 150,878 bp, with a pair of inverted repeat regions (IRA and IRB). Each inverted repeat region is 25,810 bp, which consist of large (LSC, 81,951 bp) and small (SSC, 17,307 bp) single copy areas. The genome-wide GC content was 38.35%, LSC made up 36.49%, SSC made up 32.41%, and IR made up 43.3%. The A. venetum chloroplast genome encodes 131 genes, including 86 protein-coding genes, eight ribosomal RNA genes, and 37 transfer RNA genes. This study identified the unique characteristics of the A. venetum chloroplast genome, which will help formulate effective conservation and management strategies as well as molecular identification approaches for this important medicinal plant.Entities:
Mesh:
Year: 2022 PMID: 35271574 PMCID: PMC8912179 DOI: 10.1371/journal.pone.0261710
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Map of A. venetum cpgenome.
Thick lines indicate the extent of the inverted repeat regions (Ira and Irb), which separate the genome into small (SSC) and large (LSC) single copy regions. Genes drawn inside the circle are transcribed clockwise, and those outside are transcribed counterclockwise. Different colors represent different functional groups of genes.
Base composition in the A. venetum chloroplast genome.
| Region | Length | A (%) | T (%) | C (%) | G (%) | AT (%) | GC (%) |
|---|---|---|---|---|---|---|---|
| Total_genome | 150878 | 30.43 | 31.21 | 19.52 | 18.83 | 61.64 | 38.35 |
| LSC | 81951 | 31.02 | 32.49 | 18.69 | 17.8 | 63.51 | 36.49 |
| IRA | 25810 | 28.59 | 28.11 | 20.84 | 22.46 | 56.7 | 43.3 |
| SSC | 17307 | 33.85 | 33.65 | 17.09 | 15.32 | 67.49 | 32.41 |
| IRB | 25810 | 28.11 | 28.59 | 22.46 | 20.84 | 56.7 | 43.3 |
Genes present in the A. venetum chloroplast genome.
| Category for genes | Group of genes | Name of genes |
|---|---|---|
| Transcription and translation-related genes | transfer RNAs | trnM-CAU, trnR-ACG, trnY-GUA, trnG-UCC, trnL-UAG, trnI-GAU, trnW-CCA, trnR-UCU, trnQ-UUG, trnL-UAA, trnS-GGA, trnH-GUG, trnT-GGU, trnT-UGU, trnP-UGG, trnK-UUU, trnN-GUU, trnG-GCC, trnI-CAU, trnD-GUC, trnF-GAA, trnS-GCU, trnS-UGA, trnfM-CAU, trnE-UUC, trnV-GAC, trnA-UGC, trnV-UAC, trnL-CAA, trnC-GCA |
| RNA polymerase | rpoB, rpoA, rpoC1, rpoC2 | |
| ribosomal proteins(SSU) | rps8, rps4, rps16, rps14, rps7, rps12, rps2, rps11, rps19-fragment, rps19, rps18, rps3, rps15 | |
| ribosomal proteins(LSU) | rpl2, rpl23, rpl32, rpl33, rpl36, rpl14, rpl16, rpl22, rpl20 | |
| Translational initiation factor | infA | |
| ribosomal RNAs | rrn4.5, rrn5, rrn23, rrn16 | |
| Photosynthesis-related genes | NADH dehydrogenase | ndhA, ndhH, ndhF, ndhJ, ndhE, ndhI, ndhG, ndhK, ndhC, ndhD, ndhB |
| photosystem I | psaI, psaJ, psaC, psaB, psaA | |
| photosystem II | psbA, psbL, psbF, psbB, psbK, psbJ, psbM, psbT, psbE, psbD, psbC, psbH, psbI, psbN, psbZ | |
| cytochrome b/f complex | petL, petN, petB, petG, petA, petD | |
| RubisCO | rbcL | |
| ATP synthase | atpA, atpE, atpH, atpI, atpB, atpF | |
| hypothetical chloroplast reading frames(ycf) | ycf2, ycf4, ycf1, ycf3, ycf1-fragment | |
| Other genes | Maturase | matK |
| Protease | clpP | |
| Envelope membrane protein | cemA | |
| Subunit of Acetyl-CoA carboxylase | accD | |
| C-type cytochrome synthesis gene | ccsA |
Codon-anticodon recognition patterns and codon usage of the A. venetum chloroplast genome.
| Amino Acid | Codon | Number | RSCU* | tRNA | Amino Acid | Codon | Number | RSCU* | tRNA |
|---|---|---|---|---|---|---|---|---|---|
| Stop | UAA | 23 | 1.53 | Met | AUG | 401 | 1 | trnM-CAU | |
| Stop | UAG | 9 | 0.6 | Asn | AAU | 627 | 1.56 | ||
| Stop | UGA | 13 | 0.87 | Asn | AAC | 178 | 0.44 | ||
| Ala | GCU | 459 | 1.78 | Pro | CCU | 270 | 1.49 | ||
| Ala | GCC | 174 | 0.67 | Pro | CCC | 153 | 0.84 | ||
| Ala | GCA | 281 | 1.09 | Pro | CCA | 191 | 1.05 | trnP-UGG | |
| Ala | GCG | 118 | 0.46 | Pro | CCG | 111 | 0.61 | ||
| Cys | UGU | 128 | 1.44 | Gln | CAA | 495 | 1.54 | trnQ-UUG | |
| Cys | UGC | 50 | 0.56 | trnC-GCA | Gln | CAG | 148 | 0.46 | |
| Asp | GAU | 578 | 1.59 | Arg | CGU | 227 | 1.32 | trnR-ACG | |
| Asp | GAC | 147 | 0.41 | trnD-GUC | Arg | CGC | 78 | 0.45 | |
| Glu | GAA | 682 | 1.51 | trnE-UUC | Arg | CGA | 240 | 1.39 | |
| Glu | GAG | 220 | 0.49 | Arg | CGG | 90 | 0.52 | ||
| Phe | UUU | 611 | 1.28 | Arg | AGA | 285 | 1.65 | trnR-UCU | |
| Phe | UUC | 343 | 0.72 | trnF-GAA | Arg | AGG | 115 | 0.67 | |
| Gly | GGU | 412 | 1.28 | Ser | UCU | 366 | 1.67 | ||
| Gly | GGC | 144 | 0.45 | trnG-GCC | Ser | UCC | 223 | 1.02 | trnS-GGA |
| Gly | GGA | 474 | 1.48 | Ser | UCA | 246 | 1.12 | trnS-UGA | |
| Gly | GGG | 253 | 0.79 | Ser | UCG | 131 | 0.6 | ||
| His | CAU | 315 | 1.45 | Ser | AGU | 269 | 1.23 | ||
| His | CAC | 120 | 0.55 | trnH-GUG | Ser | AGC | 81 | 0.37 | trnS-GCU |
| Ile | AUU | 722 | 1.5 | Thr | ACU | 353 | 1.63 | ||
| Ile | AUC | 294 | 0.61 | trnI-CAU | Thr | ACC | 176 | 0.81 | trnT-GGU |
| Ile | AUA | 431 | 0.89 | Thr | ACA | 238 | 1.1 | trnT-UGU | |
| Lys | AAA | 594 | 1.49 | Thr | ACG | 98 | 0.45 | ||
| Lys | AAG | 204 | 0.51 | Val | GUU | 361 | 1.49 | ||
| Leu | UUA | 597 | 1.97 | Val | GUC | 114 | 0.47 | trnV-GAC | |
| Leu | UUG | 365 | 1.21 | trnL-CAA | Val | GUA | 360 | 1.48 | |
| Leu | CUU | 388 | 1.28 | Val | GUG | 135 | 0.56 | ||
| Leu | CUC | 107 | 0.35 | Trp | UGG | 319 | 1 | trnW-CCA | |
| Leu | CUA | 225 | 0.74 | Tyr | UAU | 512 | 1.64 | ||
| Leu | CUG | 132 | 0.44 | Tyr | UAC | 114 | 0.36 | trnY-GUA |
RSCU *: relative synonymous codon usage.
Length of exons and introns in genes with introns in the A. venetum chloroplast genome.
| Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | Intron II (bp) | Exon III (bp) |
|---|---|---|---|---|---|---|
| trnK-UUU | LSC | 35 | 2474 | 37 | ||
| rps16 | LSC | 226 | 837 | 41 | ||
| trnG-UCC | LSC | 23 | 672 | 48 | ||
| atpF | LSC | 411 | 706 | 144 | ||
| rpoC1 | LSC | 1599 | 748 | 451 | ||
| ycf3 | SSC | 155 | 794 | 226 | 717 | 126 |
| trnV-UAC | LSC | 37 | 588 | 36 | ||
| rps12 | LSC | 114 | 536 | 234 | ||
| clpP | LSC | 228 | 642 | 291 | 763 | 69 |
| rpl2 | IR | 434 | 649 | 391 | ||
| ndhB | IR | 756 | 685 | 777 | ||
| trnI-GAU | IR | 37 | 952 | 35 | ||
| trnA-UGC | IR | 38 | 817 | 35 | ||
| ndhA | SSC | 545 | 1039 | 553 |
Fig 2Comparison of the cpgenome sequences of four plants.
Comparison of the cp genome sequences of N attenuata, G. hirsutum, A, thaliana, and A. venetum generated with mVISTA. Gray arrows indicate the position and direction of each gene. Red and blue areas indicate the intergenic and genic regions, respectively. The vertical scale indicates the percentage of identity, ranging from 50% to 100%.
Fig 3Phylogenetic tree analysis of whole chloroplast genome.
Maximum likelihood (ML) phylogenetic tree reconstruction including 21 species based on all chloroplast genomes. The bootstrap value, based on 1,000 replicates, is shown on each node. V. vinfera was used as the outgroup. The GenBank accession numbers are listed following the species name.