| Literature DB >> 36247567 |
Zi-Xun Wang1,2, Ding-Jie Wang1,3, Ting-Shuang Yi1,3.
Abstract
Plastids are one of the main distinguishing characteristics of the plant cell. The plastid genome (plastome) of most autotrophic seed plants possesses a highly conserved quadripartite structure containing a large single-copy (LSC) and a small single-copy (SSC) region separated by two copies of the inverted repeat (termed as IRA and IRB). The IRs have been inferred to stabilize the plastid genome via homologous recombination-induced repair mechanisms. IR loss has been documented in seven autotrophic flowering plant lineages and two autotrophic gymnosperm lineages, and the plastomes of these species (with a few exceptions) are rearranged to a great extent. However, some plastomes containing normal IRs also show high structural variation. Therefore, the role of IRs in maintaining plastome stability is still controversial. In this study, we first integrated and compared genome structure and sequence evolution of representative plastomes of all nine reported IR-lacking lineages and those of their closest relative(s) with canonical inverted repeats (CRCIRs for short) to explore the role of the IR in maintaining plastome structural stability and sequence evolution. We found the plastomes of most IR-lacking lineages have experienced significant structural rearrangement, gene loss and duplication, accumulation of novel small repeats, and acceleration of synonymous substitution compared with those of their CRCIRs. However, the IR-lacking plastomes show similar structural variation and sequence evolution rate, and even less rearrangement distance, dispersed repeat number, tandem repeat number, indels frequency and GC3 content than those of IR-present plastomes with variation in Geraniaceae. We argue that IR loss is not a driver of these changes but is instead itself a consequence of other processes that more broadly shape both structural and sequence-level plastome evolution.Entities:
Keywords: comparative genomics; inverted repeat region loss; plastid genome evolution; structural variation; substitution rate
Year: 2022 PMID: 36247567 PMCID: PMC9560873 DOI: 10.3389/fpls.2022.888049
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 6.627
Statistics of dispersed repeats and their corresponding inversions.
| Species | Inversion Site | Inversion Length | Repeat Length |
|---|---|---|---|
|
| 30710–36608 | 5898 | 445 |
|
| 5647–30221 | 24574 | 352 |
|
| 23425–46586 | 23161 | 3359 |
|
| 5605–89001 | 83396 | 1260 |
|
| 5474–89321 | 83847 | 1191 |
|
| 5529–90486 | 84957 | 1047 |
|
| 6031–90468 | 84437 | 1484 |
|
| 5716–89014 | 83298 | 1221 |
|
| 5967–89759 | 83792 | 1357 |
|
| 5917–90107 | 84190 | 1398 |
|
| 80769–112976 | 32207 | 791 |
|
| 77218–114228 | 37010 | 1070 |
| 77218–84722 | 7504 | 280 | |
|
| 7164–41842 | 34678 | 271 |
|
| 46836–123831 | 76995 | 274 |
|
| 7855–47024 | 39169 | 157 |
|
| 32987–110212 | 77225 | 152 |
|
| 8507–32712 | 24205 | 395 |
|
| 7812–53059 | 45247 | 1176 |
|
| 9126–51283 | 42157 | 411,44 |
|
| 8159–50365 | 42206 | 610,66,15 |
|
| 49465–50435 | 970 | 232 |
Figure 1Violin plots for comparisons of eight variables between IR-lacking plastomes and those of their CRCIRs with phylogenetic t-test. (A) dispersed repeat number; (B) tandem repeat number; (C) rearrangement distance; (D) indels frequency; (E) SNVs frequency; (F) whole plastome GC content; (G) protein-coding genes GC content; (H) GC3 content.
Figure 3PGLS analysis of (A) dispersed repeat length and inversion length; (B) dispersed repeat number and rearrangement distance; (C) rearrangement distance and ω.
Figure 2Boxplots for comparisons of synonymous substitution rates of whole protein-coding genes between IR-lacking plastomes and those of their CRCIRs with phylogenetic t-test. (A) Cupressophyta; (B) Pinaceae; (C) IRLC; (D) the putranjivoid clade; (E) Cactaceae; (F) Geraniaceae; (G) Passiflora; (H) Arecaceae; (I) Camoensia. Thick lines within boxes are medians, and outliers are shown as circles.
Selectional strength and direction. ω0 indicates the result of the one-ratio model (all branches in the phylogenetic tree of this model are equal); ω1 indicates the result of the two-ratio model.
| Lineage | ω0 | ω1 |
| Selection | Change | |
|---|---|---|---|---|---|---|
| Arecaceae | 0.4106 | 0.3777 | 0.6097 | not significant | Negative | intensification |
| Cactaceae | 0.1520 | 0.7628 | 0.0000 | significant | Negative | relaxation |
| Geraniaceae | 0.1799 | 0.2870 | 0.0000 | significant | Negative | relaxation |
| Leguminosae | 0.1612 | 0.2858 | 0.0000 | significant | Negative | relaxation |
|
| 0.2028 | 0.3950 | 0.0000 | significant | Negative | relaxation |
| the putranjivoid clade | 0.2042 | 0.4302 | 0.0000 | significant | Negative | relaxation |
|
| 0.1998 | 0.3226 | 0.0000 | significant | Negative | relaxation |
| Cupressophyta | 0.1905 | 0.3499 | 0.0000 | significant | Negative | relaxation |
| Pinaceae | 0.2157 | 0.4162 | 0.0000 | significant | Negative | relaxation |
P-value indicates the significance of LRTs of the one-ratio model against the two-ratio model.
Figure 4Boxplots for phylogenetic ANOVA statistical with post-hoc tests of eight variables between IR-lacking plastomes IR-present plastomes with variation and their CRCIRs. (A) dispersed repeat number; (B) tandem repeat number; (C) rearrangement distance; (D) indels frequency; (E) SNVs frequency; (F) whole plastome GC content; (G) GC3 content; (H) synonymous substitution rates.