| Literature DB >> 31010824 |
Matthew A Campbell1, Matthew C Hale2, Garrett J McKinney3, Krista M Nichols4, Devon E Pearse5.
Abstract
Whole-genome duplications (WGDs) have occurred repeatedly and broadly throughout the evolutionary history of eukaryotes. However, the effects of WGD on genome function and evolution remain unclear. The salmonid WGD that occurred approximately 88 million years ago presents an excellent opportunity for studying the effects of WGD as ∼10-15% of each salmonid genome still exhibits tetrasomic inheritance. Herein, we utilized the rainbow trout (Oncorhynchus mykiss) genome assembly and brain transcriptome data to examine the fate of gene pairs (ohnologs) following the salmonid whole-genome duplication. We find higher sequence identity between ohnologs located within known tetrasomic regions than between ohnologs found in disomic regions, and that tetrasomically inherited ohnologs showed greater similarity in patterns of gene expression and per ohnolog were lower expressed, than disomically inherited ohnologs. Enrichment testing for Gene Ontology terms identified 49 over-represented terms in tetrasomically inherited ohnologs compared to disomic ohnologs. However, why these ohnologs are retained as tetrasomic is difficult to answer. It could be that we have identified salmonid specific "dangerous duplicates", that is, genes that cannot take on new roles following WGD. Alternatively, there may be adaptive advantages for retaining genes as functional duplicates in tetrasomic regions, as presumably, movement of these genes into disomic regions would affect both their sequence identity and their gene expression patterns.Entities:
Keywords: Differential Gene Expression; Homeologs; Ohnologs; Salmonidae; Smoltification
Mesh:
Year: 2019 PMID: 31010824 PMCID: PMC6553544 DOI: 10.1534/g3.119.400070
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Summary of ohnologs split by identity and location in genome. Ohnolog pairs are divided into three categories of overall similarity in terms of DNA sequence similarity for both tetrasomic and disomic pairs, and the percentage of each category is presented
| Similarity Category | Tetrasomic Ohnolog Pairs | Category as Percentage | Disomic Ohnolog Pairs | Category as Percentage |
|---|---|---|---|---|
| >95% | 1,410 | 62% | 1,591 | 28% |
| 95–90% | 528 | 23% | 1,950 | 34% |
| <90% | 340 | 15% | 2,160 | 38% |
| Total | 2,278 | 5,701 |
Figure 2Sequence similarity between ohnologs in different regions of the rainbow trout genome. Disomic inherited regions of the genome are represented in blue and tetrasomic regions are represented in green. The x - axis is the position on each chromosome with percent identity in terms of DNA sequence similarity between ohnolog pairs plotted on the y - axis.
Figure 3Histograms of the distribution of DNA sequence percent similarity (PID) of ohnolog pairs within disomic (blue) and tetrasomic (green) regions. Mean disomic PID = 91.28, n = 11,402. Mean tetrasomic PID = 94.98, n = 4,556. The means are significantly different (Welch Two Sample t-Test, t = -43.989, df = 9133.3, p-value < 2.2e-16).
Figure 4Box plot showing variance in DNA sequence percentage identity between ohnologs located in tetrasomic regions of the genome. Ohnologs are split into eight groups based on their locations in known tetrasomic regions of the rainbow trout genome. Lines represent the median value for, boxes the first and third quartiles and whiskers represent the minimum and maximum value.
For each cross split by sex the mean and variance for the absolute value of log2 fold change for both disomically and tetrasomically inherited ohnologs is presented. The results of Welch t-Tests testing for a difference in means with the 95% confidence interval of the test are presented
| Cross Type | Sex | Mean Absolute Value of Log2 Fold Change between ohnologs | Variance Absolute Value of Log2 Fold Change between ohnologs | Welch t-Test Results | 0.95 | ||
|---|---|---|---|---|---|---|---|
| Disomic | Tetrasomic | Disomic | Tetrasomic | ||||
| A x A | F | 2.11 | 1.78 | 5.22 | 4.14 | t = 8.8826, df = 9147.3, p-value < 2.2e-16 | 0.26 - 0.40 |
| A x A | M | 2.11 | 1.76 | 5.02 | 3.94 | t = 9.4028, df = 9107.8, p-value < 2.2e-16 | 0.28 - 0.42 |
| A x A | All | 2.08 | 1.74 | 5.04 | 3.98 | t = 9.2441, df = 9178.2, p-value < 2.2e-16 | 0.27 - 0.41 |
| R x R | F | 2.08 | 1.72 | 5.38 | 4.21 | t = 9.5302, df = 9186.5, p-value < 2.2e-16 | 0.29 - 0.43 |
| R x R | M | 2.08 | 1.73 | 5.36 | 4.16 | t = 9.2808, df = 9197.5, p-value < 2.2e-16 | 0.28 - 0.42 |
| R x R | All | 2.05 | 1.69 | 5.24 | 4.03 | t = 9.6825, df = 9294.2, p-value < 2.2e-16 | 0.29 - 0.43 |
| Both | All | 2.04 | 1.69 | 5.08 | 3.96 | t = 9.5941, df = 9255.6, p-value < 2.2e-16 | 0.28 - 0.42 |
Figure 5Two dimensional kernel density estimates of the absolute value of log2 fold change of ohnolog pairs located in disomic and tetrasomic genomic regions for (A) the female progeny of the A x A cross; (B) the male progeny of the A x A cross; (C) the female progeny of the R x R cross; and, (D) the male progeny of the R x R cross. The plot is color-coded by the density of (x,y) observations, where the x - axis is relative position along chromosomes, and the y - axis is the absolute value of log2 fold change between ohnologs. Red indicates a high density of observations, and blue indicates a low density. Tetrasomically inherited ohnologs are overall more similar in expression level than disomically inherited ohnologs and are clustered near the telomeric ends of chromosomes.
Enriched GO terms in tetrasomic high similarity (>95% protein similarity). GO categories are abbreviated to BP (biological process), MF (molecular function), and CC (cellular component. P-values following a Fisher’s exact test are reported
| GO ID | GO Term | P-value | |
|---|---|---|---|
| GO:0051295 | establishment of meiotic spindle localization | BP | 4.38E-05 |
| GO:0010728 | regulation of hydrogen peroxide biosynthetic process | BP | 9.27E-05 |
| GO:0010248 | establishment or maintenance of transmembrane electrochemical gradient | BP | 1.29E-04 |
| GO:0050665 | hydrogen peroxide biosynthetic process | BP | 3.80E-04 |
| GO:0090662 | ATP hydrolysis coupled transmembrane transport | BP | 6.41E-04 |
| GO:1903919 | negative regulation of actin filament severing | BP | 9.22E-04 |
| GO:0010729 | positive regulation of hydrogen peroxide biosynthetic process | BP | 9.22E-04 |
| GO:0032796 | uropod organization | BP | 1.10E-03 |
| GO:1903918 | regulation of actin filament severing | BP | 1.10E-03 |
| GO:0021678 | third ventricle development | BP | 1.13E-03 |
| GO:0040023 | establishment of nucleus localization | BP | 1.22E-03 |
| GO:0006958 | complement activation, classical pathway | BP | 1.89E-03 |
| GO:0070602 | regulation of centromeric sister chromatid cohesion | BP | 1.89E-03 |
| GO:0007097 | nuclear migration | BP | 2.07E-03 |
| GO:0001778 | plasma membrane repair | BP | 2.41E-03 |
| GO:0030241 | skeletal muscle myosin thick filament assembly | BP | 2.75E-03 |
| GO:0071688 | striated muscle myosin thick filament assembly | BP | 2.75E-03 |
| GO:0002455 | humoral immune response mediated by circulating immunoglobulin | BP | 2.85E-03 |
| GO:0010726 | positive regulation of hydrogen peroxide metabolic process | BP | 3.02E-03 |
| GO:0021592 | fourth ventricle development | BP | 3.02E-03 |
| GO:0060004 | reflex | BP | 5.58E-03 |
| GO:0031034 | myosin filament assembly | BP | 5.75E-03 |
| GO:0030220 | platelet formation | BP | 6.29E-03 |
| GO:0036344 | platelet morphogenesis | BP | 6.29E-03 |
| GO:0070162 | adiponectin secretion | BP | 6.63E-03 |
| GO:0070163 | regulation of adiponectin secretion | BP | 6.63E-03 |
| GO:1903921 | regulation of protein processing in phagocytic vesicle | BP | 6.80E-03 |
| GO:1903923 | positive regulation of protein processing in phagocytic vesicle | BP | 6.80E-03 |
| GO:1900756 | protein processing in phagocytic vesicle | BP | 6.80E-03 |
| GO:0048172 | regulation of short-term neuronal synaptic plasticity | BP | 7.58E-03 |
| GO:1904707 | positive regulation of vascular smooth muscle cell proliferation | BP | 8.64E-03 |
| GO:0060283 | negative regulation of oocyte development | BP | 8.64E-03 |
| GO:0070989 | oxidative demethylation | BP | 8.65E-03 |
| GO:0031449 | regulation of slow-twitch skeletal muscle fiber contraction | BP | 8.65E-03 |
| GO:0031444 | slow-twitch skeletal muscle fiber contraction | BP | 8.65E-03 |
| GO:0008635 | activation of cysteine-type endopeptidase activity involved in apoptotic process cytochrome c | BP | 8.65E-03 |
| GO:0005826 | actomyosin contractile ring | CC | 1.54E-04 |
| GO:0005833 | hemoglobin complex | CC | 2.85E-03 |
| GO:0016705 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of oxygen | MF | 5.14E-05 |
| GO:0005506 | iron ion binding | MF | 1.95E-04 |
| GO:0019825 | oxygen binding | MF | 3.16E-04 |
| GO:0043495 | protein anchor | MF | 2.75E-03 |
| GO:0030250 | guanylate cyclase activator activity | MF | 6.63E-03 |
| GO:0030249 | guanylate cyclase regulator activity | MF | 6.63E-03 |
| GO:0019785 | ISG15-specific protease activity | MF | 6.63E-03 |
| GO:0010853 | cyclase activator activity | MF | 6.63E-03 |
| GO:0005344 | oxygen transporter activity | MF | 6.80E-03 |
| GO:0004030 | aldehyde dehydrogenase [NAD(P)+] activity | MF | 8.65E-03 |
| GO:0016840 | carbon-nitrogen lyase activity | MF | 8.65E-03 |
Figure 1Circos plot showing homeologous blocks in the rainbow trout genome dating from the last whole-genome duplication in salmonids. Chromosomes (Omy01-Omy29) are plotted, arms (e.g., 1p, and 1q) are shaded, and the length of the chromosomes are indicated. Links show pairing between genomic regions originating from duplication at the Ss4R. Track A, overall similarity between homeologous regions is indicated by shading, yellow - low, red - high, and height of the plot. Track B. The eight pairs of homeologous blocks still exhibiting tetrasomic inheritance are indicated by green bars in the outer ring. Circos plot adapted from Pearse et al. (2018).