| Literature DB >> 34901281 |
Caixiang Wang1, Juanjuan Liu1, Yue Su1, Meili Li1, Xiaoyu Xie1, Junji Su1.
Abstract
Sonchus brachyotus DC. possesses both edible and medicinal properties and is widely distributed throughout China. In this study, the complete cp genome of S. brachyotus was sequenced and assembled. The total length of the complete S. brachyotus cp genome was 151,977 bp, including an LSC region of 84,553 bp, SSC region of 18,138 bp, and IR region of 24,643 bp. Sequence analyses revealed that the cp genome encoded 132 genes, including 87 protein-coding genes, 37 tRNA genes, and 8 rRNA genes. The GC content was 37.6%. One hundred mononucleotide microsatellites, 4 dinucleotide microsatellites, 67 trinucleotide microsatellites, 4 tetranucleotide microsatellites, and 1 long repeat were identified. The SSR frequency of the LSC region was significantly greater than that of the IR and SSC regions. In total, 175 SSRs and highly variable regions were recognized as potential cp markers. By analyzing the IR/LSC and IR/SSC boundaries, structural differences between S. brachyotus and 6 other species were detected. According to phylogenetic analyses, S. brachyotus was most closely related to S. arvensis and S. oleraceus. Overall, this study provides complete cp genome resources for S. brachyotus that will be beneficial for identifying potential molecular markers and evolutionary patterns of S. brachyotus and its closely related species.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34901281 PMCID: PMC8654571 DOI: 10.1155/2021/9410496
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Summary of the features of 7 Sonchus chloroplast genomes.
| Taxon | Length (bp) | Number of genes | GC content (%) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Genome | LSC | SSC | IR | Total | Protein coding | tRNA | rRNA | ||
|
| 151,977 | 84,553 | 18,138 | 24,643 | 132 | 87 | 37 | 8 | 37.6 |
|
| 151,967 | 84,251 | 18,184 | 24,766 | 130 | 87 | 37 | 6 | 37.6 |
|
| 151,808 | 84,142 | 18,217 | 24,739 | 130 | 87 | 37 | 6 | 37.6 |
|
| 152,016 | 83,988 | 18,566 | 24,731 | 130 | 88 | 36 | 6 | 37.6 |
|
| 152,017 | 84,355 | 18,244 | 24,746 | 131 | 88 | 37 | 6 | 37.6 |
|
| 152,075 | 84,338 | 18,245 | 24,746 | 131 | 88 | 37 | 6 | 37.6 |
|
| 152,194 | 84,269 | 18,409 | 24,758 | 131 | 88 | 37 | 6 | 37.6 |
LSC: large single copy; SSC: small single copy; IR: inverted repeat; tRNA: transfer RNA; rRNA: ribosomal RNA.
Figure 1Gene map of the S. brachyotus chloroplast genome. The genes inside and outside of the circle are transcribed in the clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are indicated in different colors. The thick lines indicate the extent of the inverted repeats (IRa and IRb) that separate the genomes into small single-copy (SSC) and large single-copy (LSC) regions.
List of genes found in the chloroplast genome of S. brachyotus.
| Category of genes | Group of genes | Names of genes |
|---|---|---|
| Self-replication | Large subunit of ribosome (LSU) | rpl33, rpl20, rpl36, rpl14, rpl16∗, rpl22, rpl2(2)∗, rpl23(2), rpl32 |
| Small subunit of ribosome (SSU) | rps16∗, rps2, rps14, rps4, rps18, rps12(2)∗, rps11, rps8, rps3, rps19, rps7(2), rps15 | |
| RNA polymerase subunits | rpoB, rpoC1∗, rpoC2, rpoA | |
| Ribosomal RNA genes | rrn16(2), rrn23(2), rrn4.5(2), rrn5(2) | |
| Transfer RNAs (tRNAs) | trnH-GUG, trnK-UUU∗, trnQ-UUG, trnS-GCU, trnC-GCA, trnD-GUC, trnY-GUA, trnE-UUC, trnR-UCU, trnG-UCC∗, trnT-GGU, trnS-UGA, trnG-GCC, trnfM-CAU, trnS-GGA, trnT-UGU, trnL-UAA∗, trnF-GAA, trnV-UAC∗, trnM-CAU, trnW-CCA, trnP-UGG, trnI-CAU(2), trnL-CAA(2), trnV-GAC(2), trnI-GAU(2)∗, trnA-UGC(2)∗, trnR-ACG(2), trnN-GUU(2), trnL-UAG | |
|
| ||
| Photosynthesis | Photosystem I | psaB, psaA, ycf3∗∗, psaI, ycf4, psaJ, psaC |
| Photosystem II | psbA, psbK, psbI, psbM, psbD, psbC, psbZ, psbJ, psbF, psbE, psbB, psbT, psbN, psbH | |
| Subunits of NADH dehydrogenase | ndhJ, ndhK, ndhC, ndhB(2)∗, ndhF, ndhD, ndhE, ndhG, ndhI, ndhA∗, ndhH | |
| Cytochrome b/f complex | petN, petA, petL, petG, petB∗, petD∗ | |
| ATP synthase | atpI, atpH, atpF∗, atpA, atpE, atpB | |
| Large chain of rubisco | rbcL | |
|
| ||
| Other genes | Translation initiation factor | infA |
| Maturase | matK | |
| Protease | clpP∗∗ | |
| Envelope membrane protein | cemA | |
| Subunit of acetyl-CoA-carboxylase | accD | |
| Cytochrome c biogenesis protein | ccsA | |
| Hypothetical chloroplast reading frames | ycf2(2), ycf15(2), ycf1(2) | |
∗Genes containing a single intron. ∗∗Genes containing 2 introns.
Figure 2SSR numbers according to their distribution and repeat type and repeat numbers according to repeat type and repeat length in S. brachyotus. (a) Number of SSR motifs in S. brachyotus. (b) Variation in the distribution of forward (F), reverse (R), complementary (C), and palindromic (P) repeats and the number of different repeats in the chloroplast genome of S. brachyotus.
Figure 3Comparison of the border positions of LSC, SSC, and IR regions among 7 chloroplast genomes from Sonchus species. Gene names are indicated in boxes.
Figure 4Mauve alignment of the 7 Sonchus chloroplast genomes. The rectangles represent the similarity between genomes, and the lines between rectangles represent a type of collinearity. The small square indicates the gene location in each genome. White represents CDSs, green represents tRNAs, and red represents rRNAs.
Figure 5Nucleotide diversity (Pi) values among the 7 Sonchus species.
Figure 6Phylogenetic analysis of the chloroplast genomes of 43 Compositae plants using maximum parsimony and Bayesian inference. MP bootstrap value/Bayesian posterior probability given at each node. The pentastar in the phylogenetic tree indicated that the support rate of the branch is 100/1.0. The yellow area is the first branch, the green area is the second branch, and the purple area is the third branch.