| Literature DB >> 35930571 |
Xingyu Shi1, Wenfen Xu1, Mingxiang Wan2, Qingwen Sun1, Qiyu Chen1, Chao Zhao1, Kaifen Sun1, Yanxia Shu1.
Abstract
Carpesium (Asteraceae) is a genus that contains many plant species with important medicinal values. However, the lack of chloroplast genome research of this genus has greatly hindered the study of its molecular evolution and phylogenetic relationship. This study used the Illumina sequencing platform to sequence three medicinal plants of the Carpesium genus: Carpesium abrotanoides, Carpesium cernuum, and Carpesium faberi, obtaining three complete chloroplast genome sequences after assembly and annotation. It was revealed that the three chloroplast genomes were typical quadripartite structures with lengths of 151,389 bp (C. abrotanoides), 151,278 bp (C. cernuum), and 151,250 bp (C. faberi), respectively. A total of 114 different genes were annotated, including 80 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Abundant SSR loci were detected in all three chloroplast genomes, with most composed of A/T. The expansion and contraction of the IR region indicate that the boundary regions of IR/SC are relatively conserved for the three species. Using C. abrotanoides as a reference, most of the non-coding regions of the chloroplast genomes were significantly different among the three species. Five different mutation hot spots (trnC-GCA-petN, psaI, petA-psbJ, ndhF, ycf1) with high nucleotide variability (Pi) can serve as potential DNA barcodes of Carpesium species. Additionally, phylogenetic evolution analysis of the three species suggests that C. cernuum has a closer genetic relationship to C. faberi than C. abrotanoides. Simultaneously, Carpesium is a monophyletic group closely related to the genus Inula. Complete chloroplast genomes of Carpesium species can help study the evolutionary and phylogenetic relationships and are expected to provide genetic marker assistance to identify Carpesium species.Entities:
Mesh:
Year: 2022 PMID: 35930571 PMCID: PMC9355210 DOI: 10.1371/journal.pone.0272563
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Cp genome mapping of three Carpesium species.
The inner genes are transcribed clockwise, and the outer genes are transcribed counterclockwise. Different colors indicate genes with different functions. The light black of the inner circle indicates GC content, and dark gray indicates AT content.
Basic characteristics of the cp genomes of three Carpesium species.
| Genome size (bp) | 151,389 | 151,278 | 151,250 | |
| LSC length (bp) | 83,059 | 82,927 | 82,915 | |
| SSC length (bp) | 18,426 | 18,447 | 18,431 | |
| IR length (bp) | 49,904 | 49,904 | 49,904 | |
| Number of genes (total /unique) | 132/114 | 132/114 | 132/114 | |
| protein-coding genes (CDS) (total/in IR) | 87/7 | 87/7 | 87/7 | |
| rRNA (total/unique) | 8/4 | 8/4 | 8/4 | |
| tRNA (total/unique) | 37/30 | 37/30 | 37/30 | |
| genes duplicated in IR | 18 | 18 | 18 | |
| GC content (%) | Total (%) | 37.6 | 37.7 | 37.7 |
| LSC (%) | 35.7 | 35.9 | 35.9 | |
| SSC (%) | 31.2 | 31.3 | 31.3 | |
| IR (%) | 43 | 43 | 43 | |
| protein-coding genes (CDS) (% bp) | 51.74 | 51.97 | 51.97 | |
| rRNA genes (% bp) | 5.98 | 5.98 | 5.98 | |
| tRNA genes (% bp) | 1.84 | 1.84 | 1.84 | |
| GenBank accession number | OM302256 | OM302256 | OM302256 |
Genetic composition of the cp genomes of three Carpesium species.
| Category of genes | Group of gene | Gene name | Number |
|---|---|---|---|
| Genes for photosynthesis | Subunits of photosystem I | 5 | |
| Subunits of photosystem II | 15 | ||
| Subunits of ATP synthase | 6 | ||
| Subunits of NADH-dehydrogenase | 12 | ||
| Subunits of cytochrome b/f complex | 6 | ||
| Subunit of rubisco |
| 1 | |
| Self-repilcation | Large subunit of ribosome | 11 | |
| Small subunit of ribosome | 14 | ||
| DNA-dependent RNA polymerase | 4 | ||
| Ribosomal RNAs | 8 | ||
| tRNA genes | 37 | ||
| Other genes | Translation initiation factor |
| 1 |
| Subunit of acetyl-CoA-carboxylase |
| 1 | |
| c-type cytochrome synthesis gene |
| 1 | |
| Envelop membrane protein |
| 1 | |
| Protease |
| 1 | |
| Maturase |
| 1 | |
| Genes with unknown function | Conserved open reading frames | 7 |
(×2): Two gene copies in IRs;
*: gene containing one intron;
**: gene containing two introns;
a: pseudogene.
Fig 2Comparison of amino acid frequencies in the cp genomes of three Carpesium species.
Fig 3Number and proportion of SSRs in the cp genomes of three Carpesium species.
(a) The number of different SSR types. (b-d) The frequencies of different SSR types of three Carpesium species. (e) The proportion of different SSR types.
Fig 4Repeat sequences in the cp genomes of three Carpesium species.
(a) Repeat sequence types and number of repeats. (b) Number of repeat sequences of different lengths.
Fig 5Comparative analysis of the LSC, IRs, and SSC boundary regions of the cp genomes of three Carpesium species.
Fig 6Global alignment of three cp genomes of Carpesium generated using Mvista with C. abrotanoides as a reference.
The y-axis represents the range of identity (50%–100%). The x-axis indicates the coordinate in the cp genome. The gray arrows above the comparison represent the gene orientation and location, and different colors indicate the various regions of the genome.
Fig 7Sliding window analysis of the cp genomes of three Carpesium species.
Fig 8Maximum likelihood (ML) method phylogenetic tree constructed based on cp genome sequences of 38 species.
The data next to each column represent bootstrap test scores. Taraxacum mongolicum, Taraxacum officinale, and Lactuca sativa were set as outgroups.