| Literature DB >> 28359300 |
Bryan T Barney1, Christiane Munkholm2, David R Walt2, Stephen R Palumbi3.
Abstract
BACKGROUND: Atlantic cod (Gadus morhua), is known to vary genetically across the North Atlantic, Greenland, and Newfoundland. This genetic variation occurs both spatially and temporally through decades of heavy fishing, and is concentrated in three linkage disequilibrium blocks, previously defined by pedigreed linkage mapping analysis. Variation within these genomic regions is correlated with both seawater temperature and behavioral ecotype. The full extent and nature of these linkage groups is important information for interpreting cod genetic structure as a tool for future fisheries management.Entities:
Keywords: Atlantic cod; Gadus morhua; Genomic divergence; Gulf of Maine; Linkage disequilibrium; Linkage map; Supergene
Mesh:
Substances:
Year: 2017 PMID: 28359300 PMCID: PMC5374575 DOI: 10.1186/s12864-017-3660-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Sampling locations used in this study. Twenty one adults in spawning condition were sampled in northwestern Massachusetts Bay within the Gulf of Maine. These were subdivided into 11 spring spawning cod (site S) and 10 winter spawning cod (site W). Ten adults in spawning condition were sampled from Georges Bank (site G). Figure was generated using the marmap package in R
Cod sampling summary
| Stock | Date | Samples (n) | Latitude | Longitude |
|---|---|---|---|---|
| Gulf of Maine winter | Dec 2013 | 10 | 42.3813 | -70.6016 |
| Gulf of Maine spring | June 2014 | 11 | 42.5177 | -70.6921 |
| Georges Bank | Mar 2014 | 10 | 42.1192 | -67.3575 |
Locations, dates, and number of samples collected at each sampling site
Fig. 2Linkage and population differentiation for chromosomes 2, 7 and 12 inside and outside linkage blocks. Mean pairwise LD (as r 2) and FST are shown on the y-axis for every 250th SNP, with SNP position within its linkage group along the x-axis. The right hand figures show a regression of LD vs FST for each SNP
Linkage disequilibrium block extent, and approximate length of each block
| Starting base | Ending base | # genes | length | |
|---|---|---|---|---|
| LG02 | 18,442,392 | 24,053,701 | 294 | 5.6 Mb |
| LG07 | 13,743,341 | 23,039,143 | 306 | 9.3 Mb |
| LG21 | 2,039,256 | 13,611,225 | 437 | 11.6 Mb |
GO enrichment analysis of linkage disequilibrium block on LG02
| GO.ID | Term | Annotated | Significant | Fisher pval | padj |
|---|---|---|---|---|---|
| GO:0006325 | chromatin organization | 142 | 18 | 1.70E-14 |
|
| GO:0065004 | protein-DNA complex assembly | 105 | 16 | 2.80E-14 |
|
| GO:0071824 | protein-DNA complex subunit organization | 105 | 16 | 2.80E-14 |
|
| GO:0051276 | chromosome organization | 178 | 19 | 7.10E-14 |
|
| GO:0006334 | nucleosome assembly | 100 | 15 | 2.50E-13 |
|
| GO:0031497 | chromatin assembly | 100 | 15 | 2.50E-13 |
|
| GO:0034728 | nucleosome organization | 100 | 15 | 2.50E-13 |
|
| GO:0006333 | chromatin assembly or disassembly | 101 | 15 | 2.90E-13 |
|
| GO:0006323 | DNA packaging | 106 | 15 | 6.00E-13 |
|
| GO:0071103 | DNA conformation change | 112 | 15 | 1.40E-12 |
|
| GO:0034622 | cellular macromolecular complex assembly | 180 | 16 | 1.30E-10 |
|
| GO:0043933 | macromolecular complex subunit organization | 324 | 20 | 3.90E-10 |
|
| GO:0006461 | protein complex assembly | 244 | 17 | 1.50E-09 |
|
| GO:0070271 | protein complex biogenesis | 244 | 17 | 1.50E-09 |
|
| GO:0065003 | macromolecular complex assembly | 258 | 17 | 3.60E-09 |
|
| GO:0071822 | protein complex subunit organization | 271 | 17 | 7.50E-09 |
|
| GO:0022607 | cellular component assembly | 308 | 18 | 7.90E-09 |
|
| GO:0006996 | organelle organization | 432 | 21 | 1.00E-08 |
|
| GO:0044085 | cellular component biogenesis | 339 | 18 | 3.50E-08 |
|
| GO:0016043 | cellular component organization | 636 | 22 | 1.70E-06 |
|
| GO:0071840 | cellular component organization or biogenesis | 666 | 22 | 3.70E-06 |
|
| GO:0042157 | lipoprotein metabolic process | 32 | 5 | 2.70E-05 |
|
| GO:0044711 | single-organism biosynthetic-process | 295 | 9 | 0.0061 | 0.7060 |
| GO:0006897 | endocytosis | 35 | 3 | 0.0071 | 0.7895 |
| GO:0043543 | protein acylation | 12 | 2 | 0.0078 | 0.8318 |
| GO:0016568 | chromatin modification | 38 | 3 | 0.0089 | 0.8645 |
Using topGO, (in R), enrichment analysis shows multiple GO categories that may be enriched in the LD block from LG02, mostly centered on DNA/chromatin structuring. Total number of annotated genes in each GO category is shown, as well as those within the LD block that are significant (Fisher test p < 0.01). Multiple-test correction (calculated as false-discovery rate, ‘fdr’) is shown in p-adj column, p-values < 0.05 shown in bold
Genes in significant GO terms within LD block on LG02
| Gene | Starting base | Gene description |
|---|---|---|
| GAMO_00059458 | 18475327 | zgc:112234: Histone H2B 1/2 (Danio rerio) |
| GAMO_00059459 | 18476591 | Histone H2A (Oncorhynchus mykiss) |
| GAMO_00059461 | 18489582 | TGas006m08.1: Histone H4 (Xenopus tropicalis) |
| GAMO_00059462 | 18491420 | Histone H2A (Oncorhynchus mykiss) |
| GAMO_00059465 | 18506398 | Histone H2A (Oncorhynchus mykiss) |
| GAMO_00059556 | 19188916 | STRA13: Centromere protein X (Homo sapiens) |
| GAMO_00059593 | 19396062 | Crebbp: CREB-binding protein (Rattus norvegicus) |
| GAMO_00059696 | 20010135 | NMT1: Glycylpeptide N-tetradecanoyltransferase 1 (Homo sapiens) |
| GAMO_00059920 | 21260662 | Coro1b: Coronin-1B (Mus musculus) |
| GAMO_00059924 | 21287842 | suv420h2: Histone-lysine N-methyltransferase SUV420H2 (Xenopus laevis) |
| GAMO_00059998 | 21707418 | KCNA1: Potassium voltage-gated channel subfamily A member 1 (Homo sapiens) |
| GAMO_00060305 | 23130440 | tmem11: Transmembrane protein 11%2C mitochondrial (Danio rerio) |
| GAMO_00060314 | 23170545 | Histone H2A (Oncorhynchus mykiss) |
| GAMO_00060318 | 23177224 | Histone H2A (Oncorhynchus mykiss) |
| GAMO_00060319 | 23198635 | Histone H3 (Urechis caupo) |
| GAMO_00060320 | 23199855 | Histone H2A (Oncorhynchus mykiss) |
| GAMO_00060321 | 23201514 | TGas006m08.1: Histone H4 (Xenopus tropicalis) |
| GAMO_00060322 | 23202645 | Histone H3 (Urechis caupo) |
| GAMO_00060324 | 23204017 | Histone H2B (Salmo trutta) |
| GAMO_00060364 | 23359510 | Pigq: Phosphatidylinositol N-acetylglucosaminyltransferase subunit Q (Mus musculus) |
| GAMO_00060373 | 23393752 | Histone H2A (Oncorhynchus mykiss) |
| GAMO_00060375 | 23400842 | TGas006m08.1: Histone H4 (Xenopus tropicalis) |
| GAMO_00060376 | 23401993 | Histone H3 (Urechis caupo) |
| GAMO_00060390 | 23480761 | HDAC9: Histone deacetylase 9 (Homo sapiens) |
| GAMO_00060462 | 23855114 | APOL3: Apolipoprotein L3 (Homo sapiens) |
| GAMO_00060465 | 23866729 | APOL3: Apolipoprotein L3 (Homo sapiens) |
| GAMO_00060467 | 23891523 | APOL3: Apolipoprotein L3 (Homo sapiens) |
While several GO terms were significantly enriched, many of them collapsed to the same set of DNA structural genes, including several genes encoding all four major histone proteins that make up the nucleosome
GO enrichment analysis of linkage disequilibrium block on LG07
| GO.ID | Term | Annotated | Significant | Fisher pval | p-adj (fdr) |
|---|---|---|---|---|---|
| GO:0006855 | drug transmembrane transport | 5 | 2 | 0.0014 | 1 |
| GO:0006066 | alcohol metabolic process | 24 | 3 | 0.0028 | 1 |
| GO:0015893 | drug transport | 8 | 2 | 0.0038 | 1 |
| GO:0042493 | response to drug | 8 | 2 | 0.0038 | 1 |
| GO:0044765 | single-organism transport | 1098 | 23 | 0.0044 | 1 |
| GO:0023051 | regulation of signalling | 275 | 9 | 0.0054 | 1 |
| GO:1902578 | single-organism localization | 1118 | 23 | 0.0055 | 1 |
| GO:1901615 | organic hydroxy compound metabolic process | 31 | 3 | 0.0058 | 1 |
| GO:0006071 | glycerol metabolic process | 10 | 2 | 0.0059 | 1 |
| GO:0019400 | alditol metabolic process | 10 | 2 | 0.0059 | 1 |
Using topGO, (in R), enrichment analysis shows multiple GO categories that may be enriched in the LD block from LG07, including genes involved with signal transport and metabolic processes. Total number of annotated genes in each GO category is shown, as well as those within the LD block that are significant (Fisher test p < 0.01). Multiple-test correction (calculated as false-discovery rate, ‘fdr’) is shown in p-adj column
GO enrichment analysis of linkage disequilibrium block on LG12
| GO.ID | Term | Annotated | Significant | Fisher pval | p-adj (fdr) |
|---|---|---|---|---|---|
| GO:0051648 | vesicle localization | 4 | 2 | 0.002 | 1 |
| GO:0051650 | establishment of vesicle localization | 4 | 2 | 0.002 | 1 |
| GO:0009057 | macromolecule catabolic process | 144 | 8 | 0.0051 | 1 |
| GO:0051656 | establishment of organelle localization | 7 | 2 | 0.0067 | 1 |
| GO:0051640 | organelle localization | 8 | 2 | 0.0088 | 1 |
Using topGO, (in R), enrichment analysis shows multiple GO categories that may be enriched in the LD block from LG12. Total number of annotated genes in each GO category is shown, as well as those within the LD block that are significant (Fisher test p < 0.01). Multiple-test correction (calculated as false-discovery rate ‘fdr’) is shown in p-adj column
Comparison of genome contents and population divergence of cod chromosome linkage groups 2, 7 and 12
| Group | # SNPs | Block length (Mb) | SNP linked | # genes | Genes in LD | SNP density | proportion exonic SNPs | Fst in LD block | FST vs LD | FST outside LD block | Notes |
|---|---|---|---|---|---|---|---|---|---|---|---|
| LG02 | 43,341 | 5.6 | 33 | 1095 | 294 | 7.72E-03 | 0.089 | 0.098 | negative | 0.0025 | discontinuous block |
| LG07 | 77,751 | 9.3 | 54 | 1166 | 306 | 8.36E-03 | 0.064 | 0.089 | positive | 0.0004 | |
| LG12 | 61,251 | 11.6 | 17 | 975 | 437 | 5.29E-03 | 0.071 | 0.173 | higly positive | 0.0022 | marked lack of SNPs adjacent to linkage block |
| Not 2, 7, or 12 | 2,792,388 | 493.2 |
Mean pairwise FST comparisons between sampled populations
| Spring vs. Winter | Spring vs. GB | Winter vs. GB | |
|---|---|---|---|
| LG02 in LD |
| -0.0189 |
|
| LG02 out | 0.0025 | 0.0021 | 0.0016 |
| LG07 in LD | -0.0062 | 0.0121 |
|
| LG07 out | 0.0004 | -7.43E-05 | 0.0001 |
| LG12 in LD | 0.0231 | 0.0766 |
|
| LG12 out | 0.0022 | 0.0021 | 0.0036 |
| Genome | 0.0001 | -0.0003 | 0.0003 |
Mean pairwise FST calculations for each sampled population show some patterns of divergence between sampled populations, with high mean FST values between winter spawners and Georges Bank cod in all three LD blocks (shown in bold). LG02 is also highly divergent between spring and winter spawners
Fig. 3Distributions of pairwise FST within LD blocks between sampled populations. Pairwise FST was determined within LD blocks (blue), on the same LG but not in the LD block (red), as well as the remainder of the genome (black) for LD blocks on LG02 (top row) LG07 (middle row), and LG12 (bottom row). Column 1 is the pairwise FST comparison between spring and winter spawners, column 2 is spring vs. Georges Bank, and column 3 is winter vs. Georges Bank
Fig. 4Principal component analyses. PCA was performed on exonic SNPs from LG02 (top row), LG07 (middle row), and LG12 (bottom row). SNPs within the LD block on that LG are shown on the left, SNPs outside the LD block on the right. In all cases, as expected, SNPs within each LD block separate into three distinct vertical clusters along PC axis 1, indicating the three genotypes that individuals might possess for each LD block. This separation is lost, however, for PCA on SNPs outside the LD block, but in the same linkage group. Note that SNPs outside of the LD block on LG07 (D, above) may signify the presence of another, undiscovered linkage disequilibrium cluster, as the pattern appears diffusely similar to that in C