| Literature DB >> 31406984 |
Nadezhda V Terekhanova1,2, Anna E Barmintseva3, Alexey S Kondrashov4,5, Georgii A Bazykin1,2, Nikolai S Mugue3,6.
Abstract
Adaptation of threespine stickleback to freshwater involves parallel recruitment of freshwater alleles in clusters of closely linked sites, or divergence islands (DIs). However, it remains unclear to what extent the DIs and the alleles that constitute them coincide between populations that underwent adaptation to freshwater independently. We examine threespine sticklebacks from ten freshwater lakes that emerged 500-1500 years ago in the White Sea basin, with the emphasis on repeatability of genomic patterns of adaptation among the lake populations and the role of local recombination rate in the distribution and structure of DIs. The 65 detected DIs are clustered in the genome, forming 12 aggregations, and this clustering cannot be explained by the variation of the recombination rate. Only 21 of the DIs are present in all the freshwater populations, likely being indispensable for successful colonization of freshwater environment by the ancestral marine population. Within most DIs, the same set of single nucleotide polymorphisms (SNPs) distinguish marine and freshwater haplotypes in all the lake populations; however, in some DIs, freshwater alleles differ between populations, suggesting that they could have been established by recruitment of different haplotypes in different populations.Entities:
Keywords: adaptation; divergence islands; molecular evolution
Mesh:
Year: 2019 PMID: 31406984 PMCID: PMC6761963 DOI: 10.1093/gbe/evz175
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Description of the Locations of Populations Studied
| Sample | ID | Description | Geographic Location | Number of Individuals | Number of Reads | Number of Reads Properly Paired | Mean Coverage |
|---|---|---|---|---|---|---|---|
| White Sea, WSBS | MAR | Marine | 66°57.04′N, 33°10.40′E | 12 | 415,169,461 | 365,058,887 | 90 |
| Nilmaa | Marine | 66°30.45′N, 33°7.68′E | 16 | ||||
| Ershovskoyea (anadromous) | Marine | 66°32.21′N, 33°3.62′E | 10 | ||||
| Lobaneshskoyea | LOB | Freshwater, older | 66°33.64′N, 33°13.45′E | 8 | 120,452,652 | 81,595,168 | 20 |
| Mashinnoyea | MAS | Freshwater, older | 66°17.74′N, 33°21.82′E | 10 | 92,891,151 | 64,737,645 | 16 |
| Canon | CAN | Freshwater, older | 66°16.69′N, 34°13.33′E | 24 | 202,051,636 | 190,057,977 | 45 |
| Lake Nilma | LN | Freshwater, older | 66°49.15′N, 33.09.75′E | 16 | 185,364,807 | 171,676,383 | 41 |
| Son | SON | Freshwater, older | 66°17.64′N, 34°14.73′E | 24 | 76,682,927 | 73,382,541 | 25 |
| Mashinnoye-3 | MAS3 | Freshwater, older | 66°29.85′N, 33°34.41′E | 12 | 44,514,740 | 41,667,387 | 15 |
| Kumyazh'i | KUM | Freshwater, older | 66°56.24′N, 33°32.63′E | 24 | 156,410,799 | 147,101,391 | 35 |
| Ogorodnoye | OG | Freshwater, older | 66°56.80′N, 33°21.07′E | 20 | 89,328,329 | 79,583,953 | 27 |
| Belaya Guba | BG | Freshwater, older | 66°91.08′N, 32°45.83′E | 20 | 69,020,533 | 64,178,878 | 22 |
| Voron'ye | VOR | Freshwater, older | 66°95.04′N, 32°41.90′E | 20 | 73,677,728 | 63,567,981 | 21 |
| Malysha | MAL | Freshwater, 34 | 66°18.27′N, 33°25.27′E | 20 | 308,787,163 | 270,452,403 | 68 |
| Goluboya | GOL | Freshwater, 34 | 66°17.20′N, 33°23.29′E | 20 | 247,299,313 | 165,873,385 | 41 |
| Ershovskoyea (residential) | ER | Freshwater, ∼30 | 66°32.21′N, 33°3.62′E | 12 | 255,561,219 | 204,988,298 | 51 |
| Martsya | MART | Freshwater, ∼250 | 66°35.95′N, 33°15.35′E | 10 | 126,624,880 | 77,327,734 | 19 |
Note.—The sequencing reads from the three marine populations were pooled together. The populations marked with superscript letter (a) have been analyzed previously (Terekhanova et al. 2014).
. 1.—Genomic positions of DIs. The colors from red to yellow correspond to the five bins of recombination rate from low to high (average values for 1 Mb genomic windows). For each identified DI, the pie chart shows DI pervasiveness, that is, the fraction of the populations (out of ten) in which this DI has “responded,” so that the marker SNPs carry the freshwater alleles at mean frequency of >0.5. Above the bars: gray dots, marker SNP densities in each 5 Kb window; black lines, their smoothing by loess function with the span = 0.0005.
. 2.—Divergence in the freshwater populations. (a) Map showing the locations of populations studied. (b) Dxy and Fst calculated between one marine and ten older freshwater populations; π calculated for one marine and ten older freshwater populations. (c) Marker SNP density plotted against recombination rate in 10 Kb genomic windows. Red color denotes windows located inside identified DI regions; dark gray, all other genomic segments. Black dashed line, the average recombination rate across the genome. (d–e) Distribution of distances between all pairs of DIs located on the same chromosome. Each cross denotes the number of pairs of DIs falling into a particular 200 Kb distance bin (horizontal axis) from each other. Lines, loess smoothing (span = 0.5) for actual data (red) or for each of the 1,000 reshuffling trials (gray) without (d) and with (e) accounting for local recombination rate. (f–g) Distribution of the numbers of DIs per chromosome without (f) and with (g) accounting for the local recombination rate. Expected numbers are calculated from the 1,000 permutation trials.
. 3.—Genomic architecture of the DIs. (a) Freshwater allele frequencies within the 65 identified DIs. Columns, populations from left to right: one marine population, four young freshwater populations, and ten freshwater populations of older ages. Color codes for populations: blue—marine; coral—young freshwater; red—older freshwater. Rows, DIs. Gray cells correspond to values that are missing due to insufficient sequencing coverage. (b) DI marker SNP overlap. Columns, individual populations, or the values averaged over all populations (rightmost column); rows, DIs, gray cells, missing data. (c) The marker SNP overlap of a DI varies along its length. This panel is similar to panel (b), except that each DI is subdivided into four equal bins along its length, and marker SNP overlap is calculated independently for each bin. Only freshwater populations in which this DI had more than five marker SNPs were considered; the remaining cells are colored gray. (d–g) Allelic composition of individual DIs. Rows, populations. Columns, marker SNPs within the DI, with genes overlapping the DIs indicated with brackets (unannotated genes are marked with an asterisk); green arrows indicate nonsynonymous marker SNPs. Cell color, freshwater allele frequency. Gray cells correspond to values that are missing due to insufficient sequencing coverage. Scatterplots are PCA plots based on densities of marker SNPs in each population. (d) High marker SNP overlap (R = 0.99) DI IV-2; many of the marker SNPs are nonsynonymous, including 14 marker SNPs within a short exon of the LRRC32 gene. (e) Low marker SNP overlap (R = 0.17) DI III-2. (f) Low marker SNP overlap (R = 0.06) DI X-2. (g) Variable marker SNP overlap DI IX-4; most of the marker SNPs are shared between populations at the 5′ part of the DI, but are private to some of the populations at the central part of the DI.