| Literature DB >> 35383845 |
Joel Sharbrough1,2, Justin L Conover3, Matheus Fernandes Gyorfy1, Corrinne E Grover3, Emma R Miller3, Jonathan F Wendel3, Daniel B Sloan1.
Abstract
Whole-genome duplications (WGDs) are a prominent process of diversification in eukaryotes. The genetic and evolutionary forces that WGD imposes on cytoplasmic genomes are not well understood, despite the central role that cytonuclear interactions play in eukaryotic function and fitness. Cellular respiration and photosynthesis depend on successful interaction between the 3,000+ nuclear-encoded proteins destined for the mitochondria or plastids and the gene products of cytoplasmic genomes in multi-subunit complexes such as OXPHOS, organellar ribosomes, Photosystems I and II, and Rubisco. Allopolyploids are thus faced with the critical task of coordinating interactions between the nuclear and cytoplasmic genes that were inherited from different species. Because the cytoplasmic genomes share a more recent history of common descent with the maternal nuclear subgenome than the paternal subgenome, evolutionary "mismatches" between the paternal subgenome and the cytoplasmic genomes in allopolyploids might lead to the accelerated rates of evolution in the paternal homoeologs of allopolyploids, either through relaxed purifying selection or strong directional selection to rectify these mismatches. We report evidence from six independently formed allotetraploids that the subgenomes exhibit unequal rates of protein-sequence evolution, but we found no evidence that cytonuclear incompatibilities result in altered evolutionary trajectories of the paternal homoeologs of organelle-targeted genes. The analyses of gene content revealed mixed evidence for whether the organelle-targeted genes are lost more rapidly than the non-organelle-targeted genes. Together, these global analyses provide insights into the complex evolutionary dynamics of allopolyploids, showing that the allopolyploid subgenomes have separate evolutionary trajectories despite sharing the same nucleus, generation time, and ecological context.Entities:
Keywords: zzm321990 Brachypodium hybridumzzm321990 ; zzm321990 Chenopodium quinoazzm321990 ; zzm321990 Coffea arabicazzm321990 ; zzm321990 Gossypium hirsutumzzm321990 ; zzm321990 Nicotiana tabacumzzm321990 ; zzm321990 Triticum dicoccoideszzm321990 ; CyMIRA; allopolyploidy; chloroplast; cytonuclear incompatibility; mitochondrion
Mesh:
Substances:
Year: 2022 PMID: 35383845 PMCID: PMC9040051 DOI: 10.1093/molbev/msac074
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 8.800
Fig. 1.Evolutionary relationships and origins of six allotetraploid angiosperms. (a) Cladogram depicting evolutionary relationships among six independently derived allotetraploid angiosperms. (b) The scatter plot depicts the synonymous substitutions per synonymous site (d) between the polyploid subgenome-diploid pair with the lowest amount of divergence on the x-axis as a proxy for the amount of time since allopolyploidization. Amino acid sequence divergence between subgenomes, measured as nonsynonymous substitutions per nonsynonymous site (d) between the two diploid relatives, is shown on the y-axis. Higher levels of amino acid sequence divergence between the subgenomes increase the probability of a genetic incompatibility in the polyploid, whereas long periods since allopolyploidization increase the probability that evolutionary responses to incompatibilities are detectable in the polyploid.
Summary of Allopolyploid Lineages Used in This Study.
| Allopolyploid Species Complex | Time Since Polyploidization (Ma) | Subgenome Divergence ( | Genome Type | Maternal Diploid (Genome Type) | Paternal Diploid (Genome Type) | Outgroup Species |
|---|---|---|---|---|---|---|
|
| 0.10–0.15 | 0.103 | SSTT |
|
|
|
|
| 4–6 | 0.105 | AABB |
|
|
|
|
| 0.01–0.05 | 0.026 | EECC |
|
|
|
|
| 1–2 | 0.041 | AADD |
|
|
|
|
| 0.1–0.20 | 0.096 | SSTT |
|
|
|
|
| 0.5–1 | 0.076 | BBAA |
|
|
|
Synonymous substitutions per synonymous site inferred from the concatenated estimates of d from the non-organelle-targeted genes, see fig. 1.
Fig. 2.Synonymous and nonsynonymous rates of evolution in the genomes (and subgenomes) of focal allopolyploid systems. Substitution rates per site for synonymous (d—left) and nonsynonymous (d—right) sites from the concatenated analyses of non-organelle-targeted genes are represented by branch lengths for each genome (and subgenome). Allopolyploid systems are arranged from the oldest (top) to the youngest (bottom) from fig. 1. The paternal subgenomes of allotetraploids are bolded in green (top subgenome) and the maternal subgenomes are bolded in purple (bottom subgenome).
Orthologous Gene Groups in Six Allotetraploid Angiosperms.
| Species | Phylogenetic Orthologous Groups | Syntenic Orthologous Groups | Merged, Single-copy Quintets[ | Filtered, Merged, Single-copy Quintets (Phylo/Intersect/Synteny) |
|---|---|---|---|---|
| Quinoa | 10,511 | 17,896 | (1,873/3,284/3,931) | (3,679/3,121/1,401) |
| Wheat | 25,454 | 24,212 | (3,070/3,602/3,508) | (1,754/1,759/691) |
| Cotton | 29,504 | 31,841 | (2,392/10,222/6,101) | (2,327/10,023/4,783) |
| Coffee | 19,399 | 20,926 | (2,005/3,869/789) | (1,225/2,379/428) |
| Tobacco | 24,797 | 32,088 | (8,837/166/56) | (8,536/163/52) |
|
| 24,854 | 34,440 | (5,076/8,084/1,289) | (4,948/7,912/1,140) |
Single-copy quintets include orthologous gene groups with one and only one sequence from an outgroup, two closely related diploids, and two sequences from the allopolyploid.
Functional Gene Partitioning in Six Allotetraploid Angiosperms.
| Species | Mitochondria-targeted | Mitochondria-targeted Interacting[ | Mitochondria Enzyme Complexes[ | Plastid-targeted | Plastid-targeted Interacting[ | Plastid Enzyme Complexes[ |
|---|---|---|---|---|---|---|
| Quinoa | 2,830 | 894 | 279 | 3,528 | 686 | 215 |
| Wheat | 4,077 | 1,048 | 378 | 4,419 | 693 | 245 |
| Cotton | 4,728 | 1,232 | 458 | 5,670 | 800 | 307 |
| Coffee | 3,221 | 921 | 285 | 3,889 | 621 | 193 |
| Tobacco | 3,851 | 1,092 | 402 | 4,567 | 740 | 297 |
|
| 4,540 | 981 | 339 | 4,684 | 674 | 238 |
| Mean (SD) | 3,880 (730) | 1,031 (121) | 358 (68) | 4,464 (731) | 704 (61) | 250 (45) |
|
| 1,773 | 617 | 180 | 2,931 | 375 | 128 |
Mitochondria- and plastid-targeted interacting genes are a subset of the total number of mitochondria- and plastid-targeted genes.
Mitochondria and plastid enzyme complex genes are a subset of the total number of mitochondria- and plastid-targeted interacting genes.
Biased Gene Content of the Non-organelle-targeted Genes Across the Subgenomes of Five Allotetraploid Angiosperms.
| Species[ | Diploid Paternal Model | Paternal Subgenome | Maternal Subgenome | Diploid Maternal Model |
| Binomial test |
|---|---|---|---|---|---|---|
| Quinoa | 19,525 | 9,786 | 11,053 | 19,336 | −0.061 (−0.074 to −0.047) | <0.0001 |
| Wheat | 32,734 | 48,786 | 52,571 | 46,164 | −0.037 (−0.044 to −0.031) | <0.0001 |
| Cotton | 34,004 | 29,762 | 28,871 | 34,201 | 0.015 (0.007–0.023) | 0.00024 |
| Coffee | 22,897 | 19,008 | 19,773 | 26,017 | −0.020 (−0.030 to −0.010) | 0.00011 |
|
| 31,446 | 34,860 | 29,605 | 27,039 | 0.082 (0.074–0.089) | <0.0001 |
Tobacco was excluded from this analysis because its subgenomes cannot be easily disentangled based on the chromosome number.
r refers to the ratio of genes found in the paternal subgenome relative to the paternal diploid model, and r refers to the ratio of genes found in the maternal subgenome relative to the maternal diploid model.
95% CIs were inferred from the Exact binomial test in the R stats package, following Clopper and Pearson (1934).
Fig. 3.Gene content bias across allotetraploid subgenomes. The proportion of genes present in the paternal (r) versus maternal (r) subgenomes is depicted for each of five allotetraploid species arranged vertically from the oldest (top) to youngest (bottom). Tobacco was excluded from this analysis because the massive rearrangement it has experienced makes subgenomic identification based on chromosomal position intractable. The left panel includes only the non-organelle-targeted genes, the middle panel includes only the mitochondria-targeted genes, and the right panel includes only the plastid-targeted genes. In the left panel, the red-dashed line represents equal content across the subgenomes. In the right two panels, the r and r are normalized by the overall genome-wide gene number changes, excluding those genes targeted to organelles. Proportion deltas that depart significantly from the red line are filled in solid according to the direction of subgenomic bias (i.e., green: r > r; purple: r < r; no fill: r ≈ r). The intimacy of interactions is depicted on the y-axis for each of the right two panels from low or no interaction with organelle gene products (top), to interacting genes (middle), to genes involved in mitochondrial or plastid enzyme complexes (bottom).
Single-copy Orthologous Quintets Partitioned by Functional Category in Six Allotetraploid Species.
| Species | Functional Category | Number of Quintets |
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| Quinoa | Non-organelle-targeted | 6,885 | 0.499 | 0.096 | 0.193 | 0.290 (0.28–0.30) | 0.332 (0.32–0.34) |
| Mitochondria-targeted noninteracting | 615 | 0.444 | 0.079 | 0.179 | 0.278 (0.25–0.31) | 0.319 (0.29–0.35) | |
| Mitochondria-targeted interacting[ | 213 | 0.477 | 0.105 | 0.220 | 0.352 (0.29–0.42) | 0.369 (0.31–0.43) | |
| Mitochondria enzyme complexes | 69 | 0.465 | 0.084 | 0.180 | 0.279 (0.19–0.40) | 0.350 (0.26–0.45) | |
| Plastid-targeted noninteracting | 900 | 0.449 | 0.081 | 0.180 | 0.279 (0.25–0.30) | 0.322 (0.30–0.35) | |
| Plastid-targeted interacting | 212 | 0.463 | 0.091 | 0.197 | 0.290 (0.24–0.34) | 0.356 (0.29–0.43) | |
| Plastid enzyme complexes | 74 | 0.483 | 0.081 | 0.168 | 0.305 (0.15–0.40) | 0.361 (0.21–0.47) | |
| Wheat | Non-organelle-targeted | 3,507 | 0.1882 | 0.035 | 0.187 | 0.444 (0.41–0.48) | 0.201 (0.19–0.21) |
| Mitochondria-targeted noninteracting | 476 | 0.179 | 0.030 | 0.169 | 0.466 (0.38–0.55) | 0.241 (0.20–0.28) | |
| Mitochondria-targeted interacting | 67 | 0.162 | 0.033 | 0.206 | 0.651 (0.30–0.82) | 0.215 (0.12–0.31) | |
| Mitochondria enzyme complexes[ | 38 | 0.191 | 0.039 | 0.206 | 0.661 (0.27–0.81) | 0.077 (0.05–0.13) | |
| Plastid-targeted noninteracting | 561 | 0.179 | 0.031 | 0.171 | 0.469 (0.39–0.54) | 0.237 (0.20–0.28) | |
| Plastid-targeted interacting | 86 | 0.171 | 0.030 | 0.175 | 0.353 (0.19–0.49) | 0.278 ( 0.17–0.36) | |
| Plastid enzyme complexes | 38 | 0.228 | 0.030 | 0.131 | 0.310 (0.09–0.53) | 0.159 (0.06–0.27) | |
| Cotton | Non-organelle-targeted | 14,957 | 0.108 | 0.038 | 0.348 | 0.422 (0.41–0.43) | 0.535 (0.52–0.55) |
| Mitochondria-targeted noninteracting | 1,076 | 0.106 | 0.033 | 0.309 | 0.391 (0.35–0.43) | 0.480 (0.44–0.52) | |
| Mitochondria-targeted interacting | 375 | 0.103 | 0.034 | 0.332 | 0.392 (0.35–0.44) | 0.564 (0.48–0.65) | |
| Mitochondria enzyme complexes | 100 | 0.119 | 0.037 | 0.310 | 0.376 (0.28–0.53) | 0.568 (0.42–0.72) | |
| Plastid-targeted noninteracting | 1,502 | 0.106 | 0.033 | 0.309 | 0.392 (0.36–0.43) | 0.514 (0.48–0.56) | |
| Plastid-targeted interacting | 270 | 0.102 | 0.031 | 0.303 | 0.352 (0.31–0.40) | 0.525 (0.45–0.60) | |
| Plastid enzyme complexes | 94 | 0.100 | 0.029 | 0.289 | 0.322 (0.23–0.46) | 0.516 (0.38–0.68) | |
| Coffee | Non-organelle-targeted | 3,397 | 0.181 | 0.055 | 0.306 | 0.486 (0.45–0.52) | 0.429 (0.40–0.46) |
| Mitochondria-targeted noninteracting | 306 | 0.181 | 0.051 | 0.281 | 0.533 (0.44–0.61) | 0.548 (0.40–0.66) | |
| Mitochondria-targeted interacting | 121 | 0.170 | 0.052 | 0.306 | 0.513 (0.39–0.65) | 0.426 (0.33–0.55) | |
| Mitochondria enzyme complexes | 31 | 0.187 | 0.057 | 0.307 | 0.597 (0.25–0.92) | 0.482 (0.39–0.96) | |
| Plastid-targeted noninteracting | 420 | 0.180 | 0.051 | 0.285 | 0.510 (0.42–0.57) | 0.514 (0.39–0.61) | |
| Plastid-targeted interacting | 88 | 0.163 | 0.049 | 0.300 | 0.438 (0.29–0.66) | 0.537 (0.36–0.76) | |
| Plastid enzyme complexes | 25 | 0.159 | 0.043 | 0.273 | 1.182 (0.23–5.17) | 0.363 (0.08–1.44) | |
| Tobacco | Non-organelle-targeted | 7,323 | 0.438 | 0.090 | 0.205 | 0.522 (0.38–0.54) | 0.631 (0.40–0.65) |
| Mitochondria-targeted noninteracting | 675 | 0.375 | 0.071 | 0.190 | 0.466 (0.40–0.53) | 0.654 (0.58–0.73) | |
| Mitochondria-targeted interacting | 209 | 0.374 | 0.082 | 0.220 | 0.490 (0.42–0.57) | 0.628 (0.53–0.74) | |
| Mitochondria enzyme complexes | 59 | 0.392 | 0.070 | 0.178 | 0.490 (0.37–0.61) | 0.770 (0.50–1.37) | |
| Plastid-targeted noninteracting | 952 | 0.380 | 0.072 | 0.191 | 0.470 (0.42–0.52) | 0.628 (0.57–0.69) | |
| Plastid-targeted interacting | 183 | 0.370 | 0.074 | 0.200 | 0.541 (0.41–0.71) | 0.591 (0.47–0.73) | |
| Plastid enzyme complexes | 72 | 0.406 | 0.070 | 0.173 | 0.736 (0.35–1.21) | 0.604 (0.37–0.99) | |
|
| Non-organelle-targeted | 11,886 | 0.449 | 0.105 | 0.234 | 0.328 (0.31–0.45) | 0.347 (0.33–0.47) |
| Mitochondria-targeted noninteracting | 1,310 | 0.388 | 0.0759 | 0.196 | 0.318 (0.30–0.47) | 0.327 (0.27–0.40) | |
| Mitochondria-targeted interacting | 367 | 0.398 | 0.086 | 0.216 | 0.386 (0.27–0.38) | 0.405 (0.30–0.49) | |
| Mitochondria enzyme complexes | 116 | 0.399 | 0.0645 | 0.162 | 0.222 (0.14–0.34) | 0.126 (0.36–0.24) | |
| Plastid-targeted noninteracting | 1,497 | 0.389 | 0.0763 | 0.196 | 0.305 (0.26–0.37) | 0.312 (0.26–0.38) | |
| Plastid-targeted interacting | 256 | 0.396 | 0.0829 | 0.209 | 0.385 (0.24–0.49) | 0.284 (0.20–0.39) | |
| Plastid enzyme complexes | 83 | 0.485 | 0.0626 | 0.129 | 0.123 (0.06–0.22) | 0.276 (0.08–0.50) |
d here reflects the ML estimates of the total synonymous branch length of the entire tree from 1,000 gene-level bootstrap replicates (5 replicate runs per bootstrap replicate).
d here reflects the ML estimates of the total nonsynonymous branch length of the entire tree from 1,000 gene-level bootstrap replicates (5 replicate runs per bootstrap replicate).
ω here reflects the ML estimates of quintet-wide d relative to quintet-wide d.
ω refers to the ML estimate of the ω value for the paternal subgenome branch, with 95% CIs obtained from 1,000 gene-level bootstrap replicates (5 replicate runs per bootstrap replicate).
ω refers to the ML estimate of the ω value for the maternal subgenome branch, with 95% CIs obtained from 1,000 gene-level bootstrap replicates (5 replicate runs per bootstrap replicate).
Interacting genes are defined as those nuclear-encoded genes whose products interact with the mitochondrial/plastid genomes or gene products according to the CyMIRA classifications scheme (Forsythe et al. 2019).
Significant result is likely due to poor alignment. See supplementary fig. S4, Supplementary Material online for more details.
Fig. 4.Genome-wide bias in ω (d) across the maternal and paternal subgenomes. (a) Log-transformed ratios of ω values in the paternal (ω) versus maternal (ω) subgenomes from concatenations (circles), and the underlying bootstrap distributions (density curves) of genes encoding proteins that are not targeted to either the plastids or mitochondria. Species panels are arranged vertically from the oldest (top) to the youngest (bottom). The red-dashed line indicates equal ω values across the subgenomes, the left side of each plot indicates higher ω values in the maternal subgenomes, and the right side of each plot indicates higher ω values in the paternal subgenome. The bootstrap distributions of ω ratios that depart significantly (P < 0.05) from the red line are filled in solid according to the direction of subgenomic bias (i.e., green: ω/ω > 1.0; purple: ω/ω < 1.0; no fill: ω/ω ≈ 1.0). (b) Estimates of ω − ω for each individual gene is depicted on the bottom half of each species’ panel and the proportion of genes with higher ω values in the paternal subgenome (p) minus the proportion of genes with higher ω values in the maternal subgenome (p) is depicted on the top half of each species’ panel for all genes not targeted to either the mitochondria or the plastids. The red-dashed line represents equal proportions of genes with higher ω values across subgenomes, and bars are filled in when proportion deltas are significantly different from zero (i.e., green: p > p; purple: p < p; no fill: p ≈ p).
Fig. 5.Ratios of maternal versus paternal subgenome ω values in the organelle-targeted genes. Log-transformed ratios of maternal versus paternal ω values from the terminal polyploid subgenome branches for concatenations (black circles) and underlying bootstrap distributions (density curves) of the mitochondria- (left) and plastid-targeted (right) genes in six focal allotetraploid species. Species panels are arranged vertically from the oldest (top) to the youngest (bottom). The red-dashed line indicates the ω/ω ratio for a concatenation of genes not targeted to the organelles (fig. 4). Ratios left of the red line indicate higher ω values in the maternal subgenome, and ratios right of the red line indicate higher ω values in the paternal subgenome, after accounting for genome-wide patterns. Bootstrap distributions of ω ratios that depart significantly (P < 0.05) from the red line are filled in solid according to the direction of subgenomic bias (i.e., green: normalized ω/ω > 1.0; purple: normalized ω/ω < 1.0; no fill: normalized ω/ω ≈ 1.0). The intimacy of interactions is indicated on the y-axis from low or no interaction with organelle gene products (top), to interacting genes (middle), to genes involved in mitochondrial or plastid enzyme complexes (bottom).
Fig. 6.Comparison of branch-specific ω values in the polyploid subgenomes compared with their respective diploid models. Left: Maximum likelihood estimates (circles) and block-bootstrap distributions (density curves) of branch-specific ω values in the maternal (purple) and paternal (green) polyploid subgenomes compared with their diploid relatives (gray), with species arranged from the bottom to the top by increasing time since polyploidization. Trees to the left of the plots depict diploid polyploid relationships, with branch lengths representing synonymous substitution rates (i.e., d), as shown in the left side of fig. 2. Statistical comparisons were made by calculating ω ratios for each bootstrap replicate and testing for overlap of 95% CIs with 1.0. Cases in which polyploid subgenomes significantly differ from their diploid relative are denoted by an asterisk and by a dark fill of the polyploid density curve. Right: Maximum likelihood estimates (circles) and 95% CIs inferred by the block-bootstrapping (error bars) of branch-specific ω ratios between the polyploid subgenomes and diploid relatives. The dashed red line represents equal ω values across comparisons.
Genomic Resources for Six Allotetraploid Species Complexes.
| Species complex | Species | Ploidy | Version/Accession | Reference |
|---|---|---|---|---|
|
|
| 2x | GCA_901482405.1 |
|
|
| 2x | GCA_000005505.4 |
| |
|
| 2x | B_stacei_v1_1 |
| |
|
| 4x | B_hybridum_v1_1 |
| |
| Coffee |
| 2x | GCA_013103745.1 |
|
|
| 2x | GCA_900059795.1 |
| |
|
| 2x | GCA_003713205.1 |
| |
|
| 4x | GCA_003713225.1 |
| |
| Cotton |
| 2x | Gossypioides_kirkii_ISU-v3.0 |
|
|
| 2x | G.raimondii_JGI_221_v2.0 |
| |
|
| 2x | G.arboreum_CRI-A2_assembly_v1.0 |
| |
|
| 4x | Ghirsutum_458_v1.0 |
| |
| Quinoa |
| 2x | GCA_002007265.1 |
|
|
| 2x | Csuecicum_DT_PBjellyM2 | ||
|
| 2x | PGA_assembly_final_assembly_Cpallidicaule |
| |
|
| 4x | quinoa_pb_chicago-2-final_PBJELLY_pilon |
| |
| Tobacco |
| 2x | ITAG4.0 |
|
|
| 2x | GCA_000390325.2 |
| |
|
| 2x | GCA_000393655.1 |
| |
|
| 4x | GCA_002210045.1 |
| |
| Wheat |
| 2x | GCA_901482405.1 |
|
|
| 2x | GCA_003073215.1 |
| |
|
| 2x | SRR949822 | ||
|
| 4x | GCA_002162155.2 |
|
Species used as outgroup sequence.
Closest extant relative to maternal progenitor inferred from the plastid genome data (Gordon et al. 2020).
Closest extant relative to maternal progenitor inferred from the plastid genome data (Cros et al. 1998).
Closest extant relative to maternal progenitor inferred from the mitochondrial and plastid genome data (Wendel 1989; Chen et al. 2017b).
Closest extant relative to maternal progenitor inferred from the mitochondrial and plastid genome data (Kolano et al. 2016).
Closest extant relative to maternal progenitor inferred from the mitochondrial and plastid genome data (Bland et al. 1985; Sasaki et al. 2003).
Closest extant relative to maternal progenitor inferred from the plastid genome data (Gornicki et al. 2014).