| Literature DB >> 35937412 |
Jing-Lu Feng1, Li-Wei Wu1,2, Qing Wang1,2, Yun-Jia Pan1, Bao-Li Li1, Yu-Lin Lin1, Hui Yao1,2.
Abstract
Iris species, commonly known as rainbow flowers because of their attractive flowers, are extensively grown in landscape gardens. A few species, including Belamcanda chinensis, the synonym of I. domestica and I. tectorum, are known for their medicinal properties. However, research on the genomes and evolutionary relationships of Iris species is scarce. In the current study, the complete chloroplast (CP) genomes of I. tectorum, I. dichotoma, I. japonica, and I. domestica were sequenced and compared for their identification and relationship. The CP genomes of the four Iris species were circular quadripartite with similar lengths, GC contents, and codon usages. A total of 113 specific genes were annotated, including the ycf1 pseudogene in all species and rps19 in I. japonica alone. All the species had mononucleotide (A/T) simple sequence repeats (SSRs) and long forward and palindromic repeats in their genomes. A comparison of the CP genomes based on mVISTA and nucleotide diversity (Pi) identified three highly variable regions (ndhF-rpl32, rps15-ycf1, and rpl16). Phylogenetic analysis based on the complete CP genomes concluded that I. tectorum is a sister of I. japonica, and the subgenus Pardanthopsis with several I. domestica clustered into one branch is a sister of I. dichotoma. These findings confirm the feasibility of superbarcodes (complete CP genomes) for Iris species authentication and could serve as a resource for further research on Iris phylogeny.Entities:
Mesh:
Year: 2022 PMID: 35937412 PMCID: PMC9348943 DOI: 10.1155/2022/2194021
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.246
Length and composition of the CP genomes of I. tectorum, I. japonica, I. dichotoma, and I. domestica.
| Types/species |
|
|
|
|
|---|---|---|---|---|
| Accession number | MW201731 | OK448493 | OK448492 | OK448491 |
| Total length (bp) | 153,253 | 152,443 | 153,658 | 153,736 |
| SSC (bp) | 18,562 | 18,490 | 18,150 | 18,168 |
| LSC (bp) | 82,833 | 83,237 | 83,116 | 83,140 |
| IRs (bp) | 51,858 | 50,716 | 52,392 | 52,428 |
| CDS (bp) | 78,957 | 78,507 | 79,050 | 79,059 |
| Total GC (%) | 37.89 | 37.85 | 37.87 | 37.85 |
| GC of SSC (%) | 31.42 | 31.40 | 31.49 | 31.46 |
| GC of LSC (%) | 36.16 | 36.13 | 36.00 | 35.97 |
| GC of IRa (%) | 42.97 | 43.03 | 43.04 | 43.05 |
| GC of IRb (%) | 42.97 | 43.03 | 43.04 | 43.05 |
| GC of CDS (%) | 38.15 | 38.08 | 38.03 | 38.02 |
| AT at the 1st position (%) | 54.42 | 54.48 | 54.43 | 54.44 |
| AT at the 2nd position (%) | 61.77 | 61.81 | 61.75 | 61.77 |
| AT at the 3rd position (%) | 69.36 | 69.46 | 69.73 | 69.72 |
Figure 1Chloroplast genome map of Iris tectorum. Arrows represent the transcription direction of genes. The dark (GC) and light (AT) gray areas are nucleotide contents.
Genes in the CP genomes of I. tectorum, I. japonica, I. dichotoma, and I. domestica.
| Functional group | Genes | Number of genes |
|---|---|---|
| Photosystem I |
| 5 |
| Photosystem II |
| 15 |
| Cytochrome b/f complex |
| 6 |
| ATP synthase |
| 6 |
| NADH dehydrogenase |
| 12 |
| RubisCO large subunit |
| 1 |
| RNA polymerase |
| 4 |
| Ribosomal proteins (SSU) |
| 15 |
| Ribosomal proteins (LSU) |
| 11 |
| Other genes |
| 6 |
| Proteins of unknown function |
| 6 |
| Transfer RNAs | 38 tRNAs (8 in the IRs (×2), 6 contain one intron) | 38 |
| Ribosomal RNAs |
| 8 |
×2 indicates two gene copies. ∗ and ∗∗ indicate genes that contain 1 and 2 introns, respectively. Ψ indicates a pseudogene.
Figure 2Codon usage of 20 amino acids and stop codons of the CDS in the CP genomes of Iris species. The four histograms from left to right in each amino acid represent I. tectorum, I. japonica, I. dichotoma, and I. domestica.
SSRs in the CP genomes of four Iris species.
| SSR types | Repeat units | Number | Proportion (%) | ||||||
|---|---|---|---|---|---|---|---|---|---|
| ① | ② | ③ | ④ | ① | ② | ③ | ④ | ||
| Mono | A/T | 38 | 22 | 34 | 32 | 100.0 | 100.0 | 97.1 | 97.0 |
| C/G | — | — | 1 | 1 | — | — | 2.9 | 3.0 | |
| Di | AT/AT | 9 | 8 | 11 | 10 | 81.8 | 80.0 | 84.6 | 83.3 |
| AG/CT | 2 | 2 | 2 | 2 | 18.2 | 20.0 | 15.4 | 16.7 | |
| Tri | AAG/CTT | 2 | 2 | 2 | 2 | 50.0 | 50.0 | 66.7 | 66.7 |
| AAT/ATT | 2 | 2 | 1 | 1 | 50.0 | 50.0 | 33.3 | 33.3 | |
| Tetra | AAAT/ATTT | 2 | 3 | 3 | 3 | 66.7 | 75.0 | 75.0 | 75.0 |
| AATG/ATTC | 1 | 1 | 1 | 1 | 33.3 | 25.0 | 25.0 | 25.0 | |
| Penta | AACTT/AAGTT | 1 | 1 | 1 | 1 | 33.3 | 100.0 | 50.0 | 33.3 |
| AAAAT/ATTTT | 1 | — | — | 1 | 33.3 | — | — | 33.3 | |
| AAAAC/GTTTT | 1 | — | — | — | 33.3 | — | — | — | |
| AATAT/ATATT | — | — | — | 1 | — | — | — | 33.3 | |
| ACTAT/AGTAT | — | — | 1 | — | — | — | 50.0 | — | |
| Hexa | AACAAG/CTTGTT | — | 1 | 1 | 1 | — | 100.0 | 100.0 | 100.0 |
①: I. tectorum; ②: I. japonica; ③: I. dichotoma; ④ I. domestica; —: the absence of a particular type.
Figure 3Distribution of the six types of SSRs in the CP genomes of four Iris species.
Figure 4Number of long repeat sequences in the CP genomes of four Iris species. F, P, R, and C indicate the forward, palindromic, reverse, and complement repeats. Histograms in different colors represent repeats with different lengths.
Figure 5Junctions of the LSC, IR, and SSC regions in the CP genomes of the four Iris species. Numbers above genes indicate the distance between the gene ends and border sites. These features are not to scale. Ψ indicates a pseudogene.
Figure 6Alignment of the complete CP genomes of nine Iris species with the reference I. tectorum (MW201731) using the mVISTA program. White plots show various regions among species. The genomic regions are color coded. The vertical scale represents the percent identity ranging from 50% to 100%.
Figure 7Nucleotide diversity of the gene (a) and intergenic spacer regions (b) on the whole CP genomes of four Iris species (Pi > 0, length > 100 bp).
Figure 8ML tree constructed based on the complete CP genomes of 26 Iris species and S. angustifolium (outgroup). Bootstrap support value is shown at each node.