| Literature DB >> 35385520 |
Meng Li1, Yan-Feng Song1, Steven P Sylvester1, Steven P Sylvester1, Xian-Rong Wang1.
Abstract
Prunus subgenus Cerasus (cherry) is an economically important group that distributed in temperate regions of the northern hemisphere. However, shared interspecific morphological traits and variability across taxa of Cerasus are among the impediments to taxonomic efforts to correctly delimit taxa. This is further complicated by a lack of genetic information on these taxa, with no focused genomic or phylogenetic studies being done on Cerasus. In this study, we conducted comparative analysis on the complete plastid genomes (plastomes) of 20 Cerasus species to gain a greater understanding of the attributes of the plastome of these taxa while helping resolve their phylogenetic placement in Prunus sensu lato and interspecific relationships within the subgenus. Our results displayed that (1) the plastomes of the 20 Cerasus species studied exhibited a typical quadripartite structure with conversed genome arrangement, structure, and moderate divergence. (2) The average size of complete plastomes for the Cerasus taxa studied was 157,861 bp, ranging from 157,458 to 158,024 bp. A total of 134 genes were annotated, including 86 protein-coding genes, 40 tRNAs, and 8 rRNAs across all species. In simple sequence repeat analysis, we found Cerasus had a comparable number of dispersed and tandem repeats to those identified in other angiosperm taxa, with only P. pseudocerasus found to contain trinucleotide repeats. Nucleotide diversity analysis revealed that the trnG-GCC gene and rpl32-trnL region had the highest Pi value showing potential as phylogenetic markers. (3) Two phylogenetic trees of the plastomes verified the monophyletic relationship of Cerasus and provided a more resolved species-level phylogeny. Our study provides detailed plastome information for exploring the phylogeny of subg. Cerasus taxa. We identified various types of repeats and nucleotide diversity hotspots, which can be a reference for species identification and reconstruction of phylogenetic relationships.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35385520 PMCID: PMC8985974 DOI: 10.1371/journal.pone.0266535
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sample information and summary of plastome characteristics for the 20 subg. Cerasus species studied.
| Species | Reference Sequence | Total Length (bp) | LSC length (bp) (GC content) | SSC length (bp) (GC content) | IR length (bp) (GC content) | GC content | Total number of genes |
|---|---|---|---|---|---|---|---|
|
| NC044701 | 157886 | 85990(34.54%) | 19080(30.24%) | 26408(42.52%) | 36.69% | 131 |
| NC044123 | 157938 | 85947(34.59%) | 19119(30.21%) | 26436(42.52%) | 36.72% | 128 | |
| NC035891 | 157685 | 85792(34.59%) | 19061(30.26%) | 26416(42.51%) | 36.71% | 129 | |
| MT374065 | 158019 | 85910(34.60%) | 19247(30.06%) | 26431(42.53%) | 36.70% | 130 | |
| MN158647 | 158024 | 85953(34.59%) | 19133(30.20%) | 26469(42.50%) | 36.70% | 130 | |
| MN389436 | 157458 | 85567(34.54%) | 19121(30.12%) | 26385(42.53%) | 36.68% | 129 | |
| MN695296 | 157813 | 85931(34.62%) | 19120(30.19%) | 26381(42.56%) | 36.73% | 129 | |
| MN652612 | 157905 | 85910(34.59%) | 19123(30.19%) | 26436(42.52%) | 36.71% | 129 | |
| MN245147 | 157898 | 85926(34.59%) | 19070(30.26%) | 26451(42.52%) | 36.73% | 129 | |
| MN913372 | 157935 | 85940(34.58%) | 19121(30.22%) | 26437(42.52%) | 36.71% | 129 | |
| NC045230 | 157928 | 85904(34.59%) | 19104(30.23%) | 26460(42.50%) | 36.71% | 129 | |
| NC026981 | 157852 | 85847(34.62%) | 19133(30.12%) | 26436(42.56%) | 36.72% | 129 | |
| MN427872 | 157953 | 86030(34.55%) | 19135(30.15%) | 26394(42.52%) | 36.68% | 129 | |
| KX255667 | 157834 | 85954(34.64%) | 19084(30.36%) | 26398(42.46%) | 36.70% | 131 | |
| NC048528 | 157723 | 85860(34.60%) | 19081(30.23%) | 26391(42.55%) | 36.73% | 129 | |
| KP760073 | 157882 | 85968(34.60%) | 19120(30.22%) | 26397(42.53%) | 36.72% | 129 | |
| MH998233 | 157916 | 85927(34.59%) | 19123(30.19%) | 26433(42.53%) | 36.71% | 129 | |
| KP760075 | 157833 | 85951(34.59%) | 19120(30.20%) | 26381(42.56%) | 36.73% | 129 | |
| NC039379 | 157948 | 85959(34.59%) | 19117(30.12%) | 26436(42.53%) | 36.71% | 129 | |
| NC026980 | 157792 | 85914(34.64%) | 19120(30.26%) | 26379(42.49%) | 36.74% | 130 |
Fig 1Plastid map of 20 subg. Cerasus species.
The colored boxes represent conserved plastid genes. Genes shown inside the circle are transcribed clockwise, whereas genes outside the circle are transcribed counter-clockwise. Genes are color-coded to indicate functional groups. The dark gray area in the inner circle corresponds to GC content while the light gray corresponds to the AT content of the genome. The small (SSC) and large (LSC) single copy regions and inverted repeat (IRa and IRb) regions are noted in the inner circle.
List of genes within plastomes of the 20 subg. Cerasus species studied.
| Category | Group of gene | Name of gene |
|---|---|---|
| Photosynthetic | Photosystem Ⅰ | |
| Photosystem Ⅱ | ||
| NADH dehydrogenase | ||
| Cytochrome b6/f complex | ||
| ATP synthase | ||
| Rubisco large subunit |
| |
| Transcription & Translation | Ribosomal protein, LSU | |
| Ribosomal protein, SSU | ||
| RNA polymerase | ||
| Ribosomal RNAs | ||
| Transfer RNAs | ||
| Biosynthesis | Maturase |
|
| Protease |
| |
| Envelope membrane protein |
| |
| Acetyl-CoA carboxylase |
| |
| c-type cytochrome synthesis gene |
| |
| Unknown function | Conserved hypothetical plastid Reading Frames |
×2 refers to genes duplicated in the IR regions; Genes marked with asterisks are with single (a) or double (b) introns.
Information on 10 intron-containing genes in the plastome of the 20 subg. Cerasus species studied.
| Gene | Location | Start | End | ExonⅠ(bp) | IntronⅠ(bp) | ExonⅡ(bp) | IntronⅡ(bp) | ExonⅢ(bp) |
|---|---|---|---|---|---|---|---|---|
|
| IRa | 99918 | 100711 | 26 | 536 | 233 | ||
|
| LSC | 5217 | 6349 | 30 | 863 | 40 | ||
|
| LSC | 11606 | 12907 | 410 | 747 | 145 | ||
|
| LSC | 20998 | 23809 | 1617 | 760 | 435 | ||
|
| LSC | 43927 | 45919 | 153 | 777 | 228 | 709 | 126 |
|
| IRb | 143240 | 144033 | 232 | 536 | 26 | ||
|
| LSC | 71714 | 73781 | 228 | 668 | 289 | 812 | 71 |
|
| LSC | 76703 | 78109 | 6 | 759 | 642 | ||
|
| LSC | 78297 | 79522 | 8 | 743 | 475 | ||
|
| LSC | 83036 | 84454 | 399 | 1011 | 9 | ||
|
| IRa | 86241 | 87742 | 434 | 683 | 385 | ||
|
| IRa | 96877 | 99089 | 756 | 680 | 777 | ||
|
| SSC | 122795 | 125028 | 539 | 1142 | 553 | ||
|
| IRb | 144889 | 147101 | 777 | 680 | 756 | ||
|
| IRb | 156236 | 157737 | 385 | 683 | 434 |
Fig 2Comparisons of LSC, SSC and IR region boundaries among the plastomes of the 20 subg. Cerasus species studied.
Fig 3Sequence alignment of 20 subg. Cerasus plastomes with P. avium as the reference.
The y-axis represents the percent similarity between 50% and 100%. Different colors represent different genetic regions.
Fig 4Investigation of repeated sequences in subg. Cerasus plastomes.
The 20 subg. Cerasus plastomes studied have four repeat types, which are forward (F), reverse (R), palindrome (P) and complement (C).
Fig 5Combined ML and BI phylogenetic trees of 20 subg. Cerasus species based on either (a) complete plastome data or (b) coding region (CDS) data. The support value is displayed above the branch in the order of Maximum Likelihood bootstrap support and Bayesian Inference posterior probability. “‐” indicates the branch collapse in the Bayesian tree. Clades, subclades or lineages are indicated by gradual colors of boxes covered on taxa names, taxa with changing molecular placement are connected by dotted lines.