| Literature DB >> 35812969 |
Jun-Yi Zhang1,2, Min Liao1,3, Yue-Hong Cheng4, Yu Feng1, Wen-Bing Ju1,3, Heng-Ning Deng1, Xiong Li1,3, Andelka Plenković-Moraj5, Bo Xu1,3.
Abstract
The species in the genus Cypripedium (Orchidaceae) are considered endangered, mainly distributed in the temperate regions of the Northern Hemisphere, with high ornamental and economic value. Despite previous extensive studies based on both morphology and molecular data, species and sections relationships within Cypripedium remain controversial. Here, we employed two newly generated Cypripedium chloroplast genomes with five other published genomes to elucidate their genomic characteristics. The two genomes were 162,773-207,142 bp in length and contained 128-130 genes, including 82-84 protein-coding genes, 38 tRNA genes, and 8 rRNA genes. We identified 2,192 simple sequence repeats, 786 large repeat sequences, and 7,929 variable loci. The increase of repeat sequences (simple sequence repeats and large repeat sequences) causes a significant amplification in the chloroplast genome size of Cypripedium. The expansion of the IR region led to the pseudogenization or loss of genes in the SSC region. In addition, we identified 12 highly polymorphic loci (Pi > 0.09) suitable for inferring the phylogeny of Cypripedium species. Based on data sets of whole chloroplast genomes (IRa excluded) and protein-coding sequences, a well-supported phylogenetic tree was reconstructed, strongly supporting the five subfamilies of Orchidaceae and the genus Cypripedium as monophyletic taxa. Our findings also supported that C. palangshanense belonged to sect. Palangshanensia rather than sect. Retinervia. This study also enriched the genomic resources of Cypripedium, which may help to promote the conservation efforts of these endangered species.Entities:
Keywords: Cypripedium; IR expansion; Orchidaceae; chloroplast genome; molecular markers; phylogenomics
Year: 2022 PMID: 35812969 PMCID: PMC9257239 DOI: 10.3389/fpls.2022.911702
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
FIGURE 1Chloroplast genome maps of C. palangshanense (A) and C. debile (B). Genes of different functional groups are shown in colored bars. The inner circle (dashed gray area) indicates the proportional GC content of the corresponding genes. Regions of the large single-copy (LSC), small single-copy (SSC) and inverted repeats (IRA and IRB) are indicated.
General characteristics of the chloroplast genomes of the seven Cypripedium species.
| Species |
|
|
|
|
|
|
|
| GenBank No. | MW924110 | MW924111 | MT937100 | MT937101 | NC_027227 | NC_026772 | NC_045400 |
| Total length (bp) | 207,142 | 162,773 | 212,668 | 197,815 | 174,417 | 178,131 | 175,122 |
| Length of LSC (bp) | 128,862 | 89,446 | 129,998 | 117,193 | 97,322 | 100,973 | 97,486 |
| Length of SSC (bp) | 9,450 | 10,049 | 27,414 | 25,094 | 21,911 | 21,921 | 22,260 |
| Length of IR (bp) | 34,415 | 31,639 | 27,628 | 27,764 | 27,592 | 27,579 | 27,688 |
| Total GC content (%) | 29.5 | 35.4 | 28.2 | 30.5 | 34.5 | 33.9 | 34.4 |
| LSC GC content (%) | 24.7 | 32.7 | 23.7 | 26.5 | 31.7 | 30.7 | 31.6 |
| IR GC content (%) | 38.6 | 40.1 | 42.6 | 42.5 | 42.7 | 42.7 | 42.6 |
| SSC GC content (%) | 29.0 | 29.0 | 20.6 | 22.4 | 26.4 | 26.4 | 26.1 |
| Number of genes | 128 (20) | 130(20) | 131 (20) | 132(20) | 131(20) | 132(20) | 132(20) |
| Number of CDs genes | 82(8) | 84(8) | 85 (8) | 86 (8) | 85 (8) | 86 (8) | 86 (8) |
| Number of tRNA genes | 38(8) | 38(8) | 38 (8) | 38 (8) | 38 (8) | 38 (8) | 38 (8) |
| Number of rRNA genes | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) | 8 (4) |
*Sequences downloaded from GenBank. Numbers in brackets indicate genes duplicated in the IR regions.
FIGURE 2Analyses of expansion and contraction of inverted repeats in the seven Cypripedium chloroplast genomes.
FIGURE 3Nucleotide variability values compared between the seven chloroplast genomes of Cypripedium using the window sliding analysis. (X-axis indicates the position of the midpoint of the window, while Y-axis indicates the nucleotide diversity of each window).
FIGURE 4Analyses of repeat sequences in seven chloroplast genomes. (A) Number of different SSRs types. (B) Number of SSRs, tandem repeats and large LSRs. (C) Number of different LSRs types in the LSC regions. (D) Number of different LSRs types in the SSC region. (E) Number of different LSRs types in the IR region.
FIGURE 5Phylogram depicting the relationships of the plastid CDSs of 47 different Orchidaceae species using ML method. Numbers associated with nodes indicated Bayesian posterior probabilities (PP), maximum likelihood (BSML) and maximum parsimony (BSMP) bootstrap values, respectively. The PP on the left, BSML in the middle, BSMP on the right.