| Literature DB >> 34410426 |
Karel Janko1,2, Oldřich Bartoš1,3, Jan Kočí1,2, Jan Roslein1,2, Edita Janková Drdová4, Jan Kotusz5, Jan Eisner6, Martin Mokrejš1,7, Eva Štefková-Kašparová1,8.
Abstract
Hybridization and genome duplication have played crucial roles in the evolution of many animal and plant taxa. The subgenomes of parental species undergo considerable changes in hybrids and polyploids, which often selectively eliminate segments of one subgenome. However, the mechanisms underlying these changes are not well understood, particularly when the hybridization is linked with asexual reproduction that opens up unexpected evolutionary pathways. To elucidate this problem, we compared published cytogenetic and RNAseq data with exome sequences of asexual diploid and polyploid hybrids between three fish species; Cobitis elongatoides, C. taenia, and C. tanaitica. Clonal genomes remained generally static at chromosome-scale levels but their heterozygosity gradually deteriorated at the level of individual genes owing to allelic deletions and conversions. Interestingly, the impact of both processes varies among animals and genomic regions depending on ploidy level and the properties of affected genes. Namely, polyploids were more tolerant to deletions than diploid asexuals where conversions prevailed, and genomic restructuring events accumulated preferentially in genes characterized by high transcription levels and GC-content, strong purifying selection and specific functions like interacting with intracellular membranes. Although hybrids were phenotypically more similar to C. taenia, we found that they preferentially retained C. elongatoides alleles. This demonstrates that favored subgenome is not necessarily the transcriptionally dominant one. This study demonstrated that subgenomes in asexual hybrids and polyploids evolve under a complex interplay of selection and several molecular mechanisms whose efficiency depends on the organism's ploidy level, as well as functional properties and parental ancestry of the genomic region.Entities:
Keywords: asexual reproduction; gene conversions; hemizygous deletions; hybridization; loss of heterozygosity; polyploidy
Mesh:
Year: 2021 PMID: 34410426 PMCID: PMC8662595 DOI: 10.1093/molbev/msab249
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.The Cobitis taenia hybrid complex. (a) Example of gametogenetic pathways (adapted from Lutes et al. [2010]) involving endoreplication and followed by meiotic pairing of either sister chromosomes (upper pathway) or orthologous chromosomes (lower pathway). Note that the latter case may cause loss of heterozygosity among progeny via crossovers or gene conversions. Inset on the right demonstrates empirical evidence for the presence of proper bivalents in hybrid’s oocytes as from Dedukh et al. (2020). (b) Reproduction scheme of Cobitis; letters correspond to haploid genomes: E = C. elongatoides, N = C. tanaitica, T = C. taenia. (c) Map of species distribution and samples’ origin; sites’ numeric code corresponds to supplementary table S1, Supplementary Material online.
Fig. 2.(a) Multidimensional scaling (MDS; SVD algorithm) of individual samples based on filtered SNPs; clustering visualization of first two coordinates (Plink v1.9b) validates samples’ genetic origin; letters in the legend correspond to haplotype genomes as follows: E = C. elongatoides, N = C. tanaitica, T = C. taenia. Hybrid individuals are denoted by numerals, which indicate the ID of determined clonal lineages (MLLs), as in supplementary table S2, Supplementary Material online (note that clone-mates tend to be clustered in the MDS plot). (b) Heterozygosity of sexual and asexual (split to diploids and triploids) samples estimated across all filtered variant sites; missing sites are not included. Expected heterozygosity levels for diploid asexuals are depicted on last boxplot as obtained by simulation from parental individuals. (c) Boxplots indicate the proportions of shared genotypes at LOH positions among all pairs of individuals belonging to the same (right) and different (left) clonal lineages. Note that we used the set of E–TN diagnostic sites to maintain compatibility of elongatoides–taenia and elongatoides–tanaitica hybrids.
Fig. 3.(a) Correlation between proportion of private hybrid SNPs as the proxy for asexual individual’s age (x-axis), and proportion of LOH loci (y-axis) in each individual. For plotting points and their linear regression (solid line) we used E–T diagnostic sites for elongatoides–taenia hybrids and E–N diagnostic sites for elongatoides–tanaitica hybrids. Dashed line represents the linear regression fit for data calculated on E–TN diagnostic sites used for all individuals (points not shown). Letters correspond to haplotype genomes as follows: E = C. elongatoides, N = C. tanaitica, T = C. taenia. (b) Barplot showing proportions of LOH events according to their genomic origin. Height of each bar represents absolute number of unique SNPs that appeared as LOH in a given biotype (note that barplots are not corrected for number of individuals pooled in respective biotypes).
Fig. 4.(a–e) Histograms of relative coverages at LOH loci in ET, EET, ETT, EN, and EEN hybrids pooled into respective biotypes. Arrows depict biologically meaningful values (for given ploidy); blue lines represent the fit of single Gamma distribution with mean centered at value 1, red lines represent the fitted mixture of two Gamma distribution with the coefficients A and B indicating proportions of both Gamma distributions in the combined model; A relates to the distribution assuming the mean relative coverage approximately 1, B to the distribution with the mean approximately 0.5 or 0.66 (for diploid or triploid biotype, respectively); (f) Orange represents empirical cumulative distribution function (ECDF) of relative coverages at LOH sites of ET biotype, black represents ECDF of relative coverages at the same sites, but taken from parental species, where no deletions are expected. ECDF curves for all biotypes and results of KS test are provided in supplementary figure S1, Supplementary Material online.
Fig. 5.(a) G/C bias at asexuals’ LOH sites as demonstrated for one representative of every biotype. Each individual is represented by two columns with E-like (left) and T-like (right) LOH events. Bar widths scale with absolute numbers of observed LOH events in each individual, whereas heights of color fields demonstrate proportions of LOH causing weak to strong, strong to weak, and no GC change. The last bar (E vs. TN) represents overall differences between C. elongatoides and both other parental species at all E–TN diagnostic sites. Note the consistent increase in weak to strong substitution rates in E-like LOH events of all hybrids as compared with interparental divergence. The application of contingency tables indicated significance of this trend in all cases, except the F1 hybrids and one ET specimen (csc052). By contrast, no such shift has been detected at TN-like LOH sites. (b) TPM normalized expression characteristics of LOH positive (blue) and LOH negative (red) genes in all Cobitis biotypes analyzed by Bartoš et al. (2019) (individuals are pooled into biotypes). For better orientation, the oocyte data (left part) are depicted in lighter color tones, whereas liver data (right part) are in darker tones.
Top 20 Enriched GO Terms in Cellular Compartment GO Category Ranked by Their P-value in Each Hybrid Biotype.
| GO ID | GO Description |
|
|---|---|---|
| ET (292 LOH positive genes, 4,582 annotated with diagnostic SNP) | ||
| GO:0005576 | Extracellular region | pV 0.0028351 (22 vs. 184) |
| GO:0005923 | Bicellular tight junction | pV 0.0161340 (3 vs. 9) |
| GO:0070160 | Tight junction | pV 0.0219788 (3 vs. 10) |
| GO:0000347 | THO complex | pV 0.0222870 (2 vs. 4) |
| GO:0000445 | THO complex part of transcription export complex | pV 0.0222870 (2 vs. 4) |
| GO:0016021 | Integral component of membrane | pV 0.0338391 (76 vs. 987) |
| GO:0031224 | Intrinsic component of membrane | pV 0.0353113 (76 vs. 989) |
| GO:0031680 | G-protein beta/gamma-subunit complex | pV 0.0355939 (2 vs. 5) |
| GO:0097431 | Mitotic spindle pole | pV 0.0355939 (2 vs. 5) |
| GO:0005834 | Heterotrimeric G-protein complex | pV 0.0366592 (3 vs. 12) |
| GO:0043296 | Apical junction complex | pV 0.0366592 (3 vs. 12) |
| GO:1905360 | GTPase complex | pV 0.0366592 (3 vs. 12) |
| GO:0030687 | Preribosome, large subunit precursor | pV 0.0552198 (3 vs. 14) |
| GO:0000346 | Transcription export complex | pV 0.0686844 (2 vs. 7) |
| GO:0000152 | Nuclear ubiquitin ligase complex | pV 0.0878187 (2 vs. 8) |
| GO:0005911 | Cell–cell junction | pV 0.0891895 (4 vs. 27) |
| GO:0000922 | Spindle pole | pV 0.0897273 (3 vs. 17) |
| GO:0031234 | Extrinsic component of cytoplasmic side of plasma membrane | pV 0.0897273 (3 vs. 17) |
| GO:0043230 | Extracellular organelle | pV 0.1083001 (2 vs. 9) |
| GO:0070062 | Extracellular exosome | pV 0.1083001 (2 vs. 9) |
| ETT (226 LOH positive genes, 4,582 annotated with diagnostic SNP) | ||
| GO:0005844 | Polysome | pV 0.0287870 (3 vs. 14) |
| GO:0031514 | Motile cilium | pV 0.0318521 (2 vs. 6) |
| GO:0005730 | Nucleolus | pV 0.0390230 (11 vs. 123) |
| GO:0005732 | Small nucleolar ribonucleoprotein complex | pV 0.0431628 (2 vs. 7) |
| GO:0071007 | U2-type catalytic step 2 spliceosome | pV 0.0431628 (2 vs. 7) |
| GO:0030684 | Preribosome | pV 0.0446155 (6 vs. 53) |
| GO:0044798 | Nuclear transcription factor complex | pV 0.0577484 (4 vs. 30) |
| GO:0090575 | RNA polymerase II transcription factor complex | pV 0.0577484 (4 vs. 30) |
| GO:0031974 | Membrane-enclosed lumen | pV 0.0808439 (31 vs. 488) |
| GO:0043233 | Organelle lumen | pV 0.0808439 (31 vs. 488) |
| GO:0070013 | Intracellular organelle lumen | pV 0.0808439 (31 vs. 488) |
| GO:0005657 | Replication fork | pV 0.0839463 (2 vs. 10) |
| GO:0034708 | Methyltransferase complex | pV 0.0990699 (4 vs. 36) |
| GO:0071013 | Catalytic step 2 spliceosome | pV 0.1015022 (3 vs. 23) |
| GO:0005793 | Endoplasmic reticulum–Golgi intermediate compartment | pV 0.1228584 (3 vs. 25) |
| GO:0031461 | Cullin-RING ubiquitin ligase complex | pV 0.1456830 (3 vs. 27) |
| GO:0032040 | Small-subunit processome | pV 0.1456830 (3 vs. 27) |
| GO:0071944 | Cell periphery | pV 0.1530158 (23 vs. 373) |
| GO:0035097 | Histone methyltransferase complex | pV 0.1697646 (3 vs. 29) |
| GO:0031981 | Nuclear lumen | pV 0.1815560 (23 vs. 382) |
| EET (662 LOH positive genes, 4,582 annotated with diagnostic SNP) | ||
| GO:0030117 |
| pV 0.0007544 (12 vs. 31) |
| GO:0048475 |
| pV 0.0007544 (12 vs. 31) |
| GO:0030120 | Vesicle coat | pV 0.0014970 (9 vs. 21) |
| GO:0030662 | Coated vesicle membrane | pV 0.0137591 (9 vs. 28) |
| GO:0031224 | Intrinsic component of membrane | pV 0.0145280 (165 vs. 989) |
| GO:0016021 | Integral component of membrane | pV 0.0172777 (164 vs. 987) |
| GO:0030126 | COPI vesicle coat | pV 0.0186441 (4 vs. 8) |
| GO:0031314 | Extrinsic component of mitochondrial inner membrane | pV 0.0186441 (4 vs. 8) |
| GO:0016020 | Membrane | pV 0.0189391 (234 vs. 1456) |
| GO:0005921 | Gap junction | pV 0.0208470 (2 vs. 2) |
| GO:0030057 | Desmosome | pV 0.0208470 (2 vs. 2) |
| GO:0042627 | Chylomicron | pV 0.0208470 (2 vs. 2) |
| GO:0071782 | Endoplasmic reticulum tubular network | pV 0.0208470 (2 vs. 2) |
| GO:0098827 | Endoplasmic reticulum subcompartment | pV 0.0208470 (2 vs. 2) |
| GO:0030663 | COPI-coated vesicle membrane | pV 0.0297850 (4 vs. 9) |
| GO:0005911 | Cell–cell junction | pV 0.0322002 (8 vs. 27) |
| GO:0012506 | Vesicle membrane | pV 0.0322119 (11 vs. 42) |
| GO:0030659 | Cytoplasmic vesicle membrane | pV 0.0322119 (11 vs. 42) |
| GO:0030660 | Golgi-associated vesicle membrane | pV 0.0395767 (8 vs. 28) |
| GO:0030687 | Preribosome, large subunit precursor | pV 0.0402965 (5 vs. 14) |
| EN (755 LOH positive genes, 3,197 annotated with diagnostic SNP) | ||
| GO:0005789 |
| pV 0.0006197 (46 vs. 126) |
| GO:0072546 | ER membrane protein complex | pV 0.0007271 (5 vs. 5) |
| GO:0042175 | Nuclear outer membrane–endoplasmic reticulum membrane network | pV 0.0009193 (46 vs. 128) |
| GO:0031090 | Organelle membrane | pV 0.0041040 (107 vs. 364) |
| GO:0012505 | Endomembrane system | pV 0.0071884 (116 vs. 405) |
| GO:0005783 | Endoplasmic reticulum | pV 0.0075373 (53 vs. 166) |
| GO:0005924 | Cell–substrate adherens junction | pV 0.0098916 (5 vs. 7) |
| GO:0005925 | Focal adhesion | pV 0.0098916 (5 vs. 7) |
| GO:0030055 | Cell–substrate junction | pV 0.0098916 (5 vs. 7) |
| GO:0099081 | Supramolecular polymer | pV 0.0121988 (19 vs. 49) |
| GO:0099512 | Supramolecular fiber | pV 0.0121988 (19 vs. 49) |
| GO:0016528 | Sarcoplasm | pV 0.0131308 (3 vs. 3) |
| GO:0016529 | Sarcoplasmic reticulum | pV 0.0131308 (3 vs. 3) |
| GO:0017087 | Mitochondrial processing peptidase complex | pV 0.0131308 (3 vs. 3) |
| GO:0042383 | Sarcolemma | pV 0.0131308 (3 vs. 3) |
| GO:0030176 | Integral component of endoplasmic reticulum membrane | pV 0.0134881 (12 vs. 27) |
| GO:0031227 | Intrinsic component of endoplasmic reticulum membrane | pV 0.0134881 (12 vs. 27) |
| GO:0030016 | Myofibril | pV 0.0147223 (6 vs. 10) |
| GO:0030017 | Sarcomere | pV 0.0147223 (6 vs. 10) |
| GO:0005813 | Centrosome | pV 0.0251815 (12 vs. 29) |
| EEN (1,235 LOH positive genes, 3,197 annotated with diagnostic SNP) | ||
| GO:0012505 |
| pV 0.00017207 (190 vs. 405) |
| GO:0005783 |
| pV 0.0008575 (84 vs. 166) |
| GO:0042175 |
| pV 0.0009040 (67 vs. 128) |
| GO:0005789 |
| pV 0.0009619 (66 vs. 126) |
| GO:0072546 | ER membrane protein complex | pV 0.0007271 (5 vs. 5) |
| GO:0030176 | Integral component of endoplasmic reticulum membrane | pV 0.0087823 (17 vs. 27) |
| GO:0031227 | Intrinsic component of endoplasmic reticulum membrane | pV 0.0087823 (17 vs. 27) |
| GO:0030496 | Midbody | pV 0.0095585 (8 vs. 10) |
| GO:0005794 | Golgi apparatus | pV 0.0153109 (69 vs. 145) |
| GO:0005924 | Cell–substrate adherens junction | pV 0.0154665 (6 vs. 7) |
| GO:0005925 | Focal adhesion | pV 0.0154665 (6 vs. 7) |
| GO:0030055 | Cell–substrate junction | pV 0.0154665 (6 vs. 7) |
| GO:0045121 | Membrane raft | pV 0.0201926 (7 vs. 9) |
| GO:0098589 | Membrane region | pV 0.0201926 (7 vs. 9) |
| GO:0098857 | Membrane microdomain | pV 0.0201926 (7 vs. 9) |
| GO:0097525 | Spliceosomal snRNP complex | pV 0.0259453 (10 vs. 15) |
| GO:0031090 | Organelle membrane | pV 0.0351521 (157 vs. 364) |
| GO:0030532 | Small nuclear ribonucleoprotein complex | pV 0.0456027 (10 vs. 16) |
| GO:0031300 | Intrinsic component of organelle membrane | pV 0.0534485 (28 vs. 56) |
| GO:0031301 | Integral component of organelle membrane | pV 0.0534485 (28 vs. 56) |
Note.—For each hybrid biotype, we indicate number of genes affected by LOH event and total number of annotated genes with diagnostic SNP relevant for given combination of parental species. For each GO term, we indicate its ID, description and uncorrected P-value, as well as numbers of LOH-positive genes and total number of genes with given GO in parentheses. Underlined GO terms are significant after correction for multiple tests at alpha level = 0.1. Background colors are used to highlight GO terms shared between distinct biotypes so that the same color across biotypes indicates GO terms that are identical or nested. In particular, orange indicates genes related to intrinsic component of membrane; gray the genes related to cell–cell junction; dark yellow the genes related to endoplasmic reticulum; dark green the genes related to endomembrane system; and light green the genes related to organelle membrane.