| Literature DB >> 31649060 |
Christina M Rochus1, Fan Han1, Mats E Pettersson1, Junfeng Chen1, Jason Hill1, Ola Wallerman1, Guangyi Fan2,3, Xiaoning Hong2,4, Qiwu Xu2, He Zhang2, Shanshan Liu2, Xin Liu2,5,6, Leanne Haggerty7, Toby Hunt7, Fergal J Martin7, Paul Flicek7, Ignas Bunikis8, Arild Folkvord9,10, Leif Andersson1,11,12.
Abstract
The Atlantic herring is a model species for exploring the genetic basis for ecological adaptation, due to its huge population size and extremely low genetic differentiation at selectively neutral loci. However, such studies have so far been hampered because of a highly fragmented genome assembly. Here, we deliver a chromosome-level genome assembly based on a hybrid approach combining a de novo Pacific Biosciences (PacBio) assembly with Hi-C-supported scaffolding. The assembly comprises 26 autosomes with sizes ranging from 12.4 to 33.1 Mb and a total size, in chromosomes, of 726 Mb, which has been corroborated by a high-resolution linkage map. A comparison between the herring genome assembly with other high-quality assemblies from bony fishes revealed few inter-chromosomal but frequent intra-chromosomal rearrangements. The improved assembly facilitates analysis of previously intractable large-scale structural variation, allowing, for example, the detection of a 7.8-Mb inversion on Chromosome 12 underlying ecological adaptation. This supergene shows strong genetic differentiation between populations. The chromosome-based assembly also markedly improves the interpretation of previously detected signals of selection, allowing us to reveal hundreds of independent loci associated with ecological adaptation.Entities:
Mesh:
Year: 2019 PMID: 31649060 PMCID: PMC6836730 DOI: 10.1101/gr.253435.119
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Flow chart of the assembly compilation process (see Methods for further details).
Summary statistics for different assemblies of the herring genome
Physical and genetic sizes of the chromosomes
Figure 2.Chromosome size distribution and recombination rate. (A) Physical extent of the assembly for each chromosome, with average recombination rate, in 100-kb windows, shown on top of each chromosome and markers used in the linkage map indicated as black bars. (B) Linkage map data (black lines) and recombination-rate profile (colored segments) for Chromosome 8. Solid line: sex-average; dashed line: male; dotted line: female.
Figure 3.Conserved synteny between Atlantic herring and stickleback. (A) Whole-genome alignment between the Atlantic herring and threespine stickleback assemblies. (B) Detailed view of sequence homologies between herring Chromosome 11 and stickleback Chromosome XX, indicating many intra-chromosomal rearrangements.
Figure 4.Signals of selections associated with ecological adaptation. The panels show the association, measured as –log(P) from χ2 tests on read counts of previously published data (Martinez Barrio et al. 2016) replotted along the new assembly. (A) Genetic differentiation between seven populations of Atlantic herring from the Atlantic Ocean, North Sea, Skagerrak, and Kattegat (salinity in the range 20–35 psu) and 10 populations from the Baltic Sea (salinity in the range 3–12 psu). (B) Genetic differentiation between 10 populations of spring-spawning herring versus three populations of autumn-spawning herring. In both panels, the blue and red dots indicate identified, independent regions of selection at a P-value cut-off of either 10−20 (blue) or 10−15 (red). (C) Zoom in on Chromosome 15 for the contrast based on differences in spawning time, which contains the most significant peak, located around TSHR. (D) Improved gene model of TSHR. TSHR_1 to 5 are selected SNPs showing highly significant association (P < 10−95) to spawning time and/or being nonsynonymous coding, while covering the extent of the THSR gene model.
Figure 5.Identification of a 7.8-Mb inversion on herring Chromosome 12. (A) The spawning time contrast for Chromosome 12 highlights the block-like association pattern for the region from 17.9 to 25.6 Mb. The sketch shows the location of inverted repeats flanking the supergene. (B) LD patterns across the region in different groups of individuals sorted according to genotype for the putative inversion. (C) The distribution, as number of SNPs per 100 kb, of shared (black) and diagnostic (red) SNPs across the inversion region. Purple boxes (inset) are estimated locations of breakpoints in individuals that appear to carry a recombinant chromosome (see text). (D) Neighbor-joining tree based on genotypes for all SNPs in the inversion region called from individual whole-genome sequencing. The distances indicated across the tree are average nucleotide differences between haplotypes, either within groups (dashed) or between groups (solid). The letters designate the supergene type of the individual as follows: N = Northern homozygote; S = Southern homozygote; H = N/S heterozygote; R/N = individual carrying an N haplotype and a recombinant haplotype (see text); P = Pacific herring. (E) Heat map of the genotypes for diagnostic SNPs based on individual whole-genome sequencing. Supergene type of the samples is indicated as in D.
Figure 6.Genetic differentiation in the region encompassing the putative inversion on Chromosome 12. (A) Allele frequencies for all SNPs (n = 11,965) inside the inversion among individuals homozygous for the Southern (S) (y-axis) or Northern (N) haplotypes (x-axis). All frequencies are expressed in terms of the allele that is more common in the “N-”context than in the “S-”context. (B,C) Estimated frequencies for the Northern and Southern haplotypes in pooled population samples in the Baltic Sea and East Atlantic (B) and in the West Atlantic (C). The location and date of capture of the pooled samples are listed in Lamichhaney et al. (2017).