Literature DB >> 26384769

Linkage Mapping Reveals Strong Chiasma Interference in Sockeye Salmon: Implications for Interpreting Genomic Data.

Morten T Limborg1, Ryan K Waples2, Fred W Allendorf3, James E Seeb2.   

Abstract

Meiotic recombination is fundamental for generating new genetic variation and for securing proper disjunction. Further, recombination plays an essential role during the rediploidization process of polyploid-origin genomes because crossovers between pairs of homeologous chromosomes retain duplicated regions. A better understanding of how recombination affects genome evolution is crucial for interpreting genomic data; unfortunately, current knowledge mainly originates from a few model species. Salmonid fishes provide a valuable system for studying the effects of recombination in nonmodel species. Salmonid females generally produce thousands of embryos, providing large families for conducting inheritance studies. Further, salmonid genomes are currently rediploidizing after a whole genome duplication and can serve as models for studying the role of homeologous crossovers on genome evolution. Here, we present a detailed interrogation of recombination patterns in sockeye salmon (Oncorhynchus nerka). First, we use RAD sequencing of haploid and diploid gynogenetic families to construct a dense linkage map that includes paralogous loci and location of centromeres. We find a nonrandom distribution of paralogs that mainly cluster in extended regions distally located on 11 different chromosomes, consistent with ongoing homeologous recombination in these regions. We also estimate the strength of interference across each chromosome; results reveal strong interference and crossovers are mostly limited to one per arm. Interference was further shown to continue across centromeres, but metacentric chromosomes generally had at least one crossover on each arm. We discuss the relevance of these findings for both mapping and population genomic studies.
Copyright © 2015 Limborg et al.

Entities:  

Keywords:  genome data; gynogenesis; interference; meiosis; recombination

Mesh:

Year:  2015        PMID: 26384769      PMCID: PMC4632065          DOI: 10.1534/g3.115.020222

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Recombination plays a crucial role in creating novel genetic variation in sexually reproducing species (Barton and Charlesworth 1998; Otto and Lenormand 2002) and for securing proper disjunction of sister chromatids and chromosomes during meiosis. Chiasma interference (hereafter simply referred to as interference) is a fundamental process that influences crossover locations because the formation of a chiasma reduces the chance of a nearby recombination (Sturtevant 1915). Interference was thought to only occur within independent chromosome arms because the centromere acted as a barrier to interference (Mather 1938). Another important role of recombination includes crossing over between homeologs; these crossovers slow rediploidization of polyploid-origin genomes resulting from recent whole genome duplications (WGD). Pioneering studies considered allozyme loci in lake trout and brook trout hybrids to characterize segregation of duplicated loci in males and presented a meiotic model for explaining this residual tetrasomic inheritance (May ; Wright ). These meiotic models were recently synthesized to further promote awareness of these processes (Allendorf ; May and Delany 2015). Indeed, WGDs have played an important role in the evolution of many polyploid taxa that often serve as complex, but enlightening, models for understanding genome evolution (Comai 2005; Parisod ; Mable 2013). Knowledge about how meiotic recombination affects the evolution and structure of genomes is crucial to better understand and interpret genomic data. Historically, basic meiotic processes have mainly been described in model organisms, but today advances in sequencing technology have catalyzed dense genome mapping in nonmodel species. Linkage maps have informed population-based studies, including many of polyploid-origin taxa. However, detailed interrogations of recombination patterns in nonmodel species still lag behind surveys from a few select model organisms (Broman and Weber 2000; Basu-Roy ). Insights on recombination patterns from a broader range of taxa, including polyploid-origin species, are warranted. Understanding how recombination shapes genetic diversity across genomes is also crucial for interpreting signals of population divergence in genomic data. The number and distribution of so-called islands of elevated divergence have received immense attention in recent literature on ecological speciation (Ellegren ; Jones ; Tine ). Recombination rates have been invoked as an important factor explaining the size and genomic location of such islands (Renaut ; Cruickshank and Hahn 2014). A better understanding of how recombination mechanisms vary across a genome will help explain the relative contribution of neutral drift vs. divergent selection in creating local regions of elevated divergence among ecotypes or populations within a species (see also discussion in Roesti ). Salmonid fishes are an excellent system for studying the role of recombination in genome evolution for a number of reasons. Single pair matings produce thousands of embryos, enabling the examination of large numbers of meiosis from a single individual. Also, details of ploidy manipulation are well worked out; the use of haploid or gynogenetic diploid families greatly enhances genotyping and mapping capabilities (Komen and Thorgaard 2007). Early inheritance studies in salmonids, although often based on fewer than 50 allozyme loci, suggested strong crossover interference: in females, extended regions between telomeric loci and the centromeres invariably had a single crossover (Thorgaard ; Allendorf ; Lindner ). These early findings were restricted to only a subset of chromosomes and did not include genome-wide interrogations of interference. We know that homolog recognition and pairing initiates at the telomeres (reviewed in Calderon ) and interference occurs across centromeres in humans (Colombo and Jones 1997; Broman and Weber 2000) and zebrafish (Danio rerio) (Demarest ). However, detailed genome-wide descriptions of interference are scarce for nonmodel organisms, and the few that exist reveal significant interspecific differences (Segura ). Further, the salmonid ancestor went through a recent WGD (Ohno 1970; Allendorf and Thorgaard 1984), and the rediploidization process is not complete. Early inheritance studies in salmonids described a complicated pattern of both disomic and residual tetrasomic inheritance for a suite of isoloci duplicated genes that share alleles (May ; Wright ; Allendorf and Danzmann 1997). These isoloci were found to reside mainly in telomeric regions (Thorgaard ; Allendorf ; Seeb and Seeb 1986). More recently, studies in a few Pacific salmonid species have combined genotyping by sequencing and the use of haploid mapping to map isoloci (Oncorhynchus tshawytscha, Brieuc ; O. kisutch, Kodama ; O. keta, Waples ). Results suggest that eight syntenic pairs of homeologous chromosome arms remain duplicated across species. These findings are further supported by insights from the rainbow trout (O. mykiss) genome sequence that shows conserved sequence identity and gene order between paired paralogous regions (Berthelot ). Data from additional species will contribute to a more complete understanding of how the WGD has shaped genome evolution in salmonids and other polyploid-origin species. Our objective is to use genetic mapping to improve understanding of recombination and interference. We combine the use of genotyping by sequencing data from gynogenetic haploid and gynogenetic diploid progeny from a single female sockeye salmon (O. nerka). We then: (1) produce a dense genetic map and locate the centromeres and retained duplications; (2) test for the occurrence and strength of interference; and (3) test for the occurrence of interference across centromeres.

Materials and Methods

Gynogenetic mapping families

We produced two families of gynogenetic progeny as described in Chourrout (1980). One gravid female and one male sockeye salmon from the Lake Sammamish population were stripped for eggs and sperm at the Issaquah State Salmon Hatchery (Washington, USA) in December 2012. Fin clips were taken from both parents and stored in ethanol. All progeny were produced by fertilizing all of the eggs with sperm that had been genetically inactivated with 10 min exposure to UV light (Figure 1). A haploid family for linkage map construction was created by placing half of these haploid embryos into the incubator with no further treatment (Figure 1). We produced a second family of gynogenetic diploids (half-tetrads) by exposing the remaining embryos to 10 min heat shock at 28° to induce retention of the second polar body (Figure 1). The first embryo hatched after 86 d; remaining embryos from both families were immediately sampled and stored in ethanol.
Figure 1

Schematic overview of the construction of gynogenetic families used for linkage map construction. Gametes were collected from a single set of parents followed by UV irradiation of sperm ensuring that only maternal DNA was incorporated into developing embryos. After fertilization, half the gynogens were allowed to develop into gynogenetic haploids and the others were heat-shocked to produce gynogenetic diploids (Chourrout 1980). Both families were RAD sequenced for linkage map construction and recombination analyses.

Schematic overview of the construction of gynogenetic families used for linkage map construction. Gametes were collected from a single set of parents followed by UV irradiation of sperm ensuring that only maternal DNA was incorporated into developing embryos. After fertilization, half the gynogens were allowed to develop into gynogenetic haploids and the others were heat-shocked to produce gynogenetic diploids (Chourrout 1980). Both families were RAD sequenced for linkage map construction and recombination analyses. DNA was extracted from fin clips and embryos using DNEasy-96 kits (Qiagen, Valencia, CA, USA) following the manufacturer’s directions. Whole embryos were dissected from the yolk and chorion and added directly to the lysis buffer. Both parents as well as 142 putative haploid and 141 putative gynogenetic diploid progeny were genotyped for 96 EST-derived SNPs using 5′-nuclease assays (Elfstrom ; Storer ). Genotypes were called using the BioMark software v3.0.2 (Fluidigm, South San Francisco, CA). Embryos with paternal alleles signaled failure of the UV treatment and were excluded from further analysis. Success of the heat shock treatment was evaluated by screening the same 96 SNPs from the 5′-nuclease assays in putative gynogenetic diploids for the occurrence of completely homozygous embryos. Fully homozygous embryos, for loci segregating in the female parent, would signal failure of the heat shock to incorporate the second polar body; those individuals were also excluded. Genotypes for the 5′-nuclease assay SNPs that were segregating in the gynogenetic families were also used for mapping after filtering for Mendelian inheritance and segregation distortion (see below).

Sequencing

Genotyping was performed by sequencing restriction site–associated DNA (RAD-seq). All sequencing was done at the University of Oregon High Throughput Sequencing Facility using an Illumina HiSeq2000. We generated sequencing libraries using the restriction enzyme SbfI following methods previously described (Baird ; Everett ). We ligated unique barcodes (6 bp) to digested DNA following the work of Miller . The female parent was sequenced together with gynogenetic haploid and diploid progeny to 101 bp reads (Figure 1). Samples were sequenced on five lanes: two lanes each included the female parent and 47 gynogenetic haploids and three lanes each included 32 gynogenetic diploid progeny.

Detection and genotyping of polymorphisms

We used the Stacks software package v1.04 (Catchen ) to identify polymorphic loci and to assign genotypes. We used the process radtags program to remove reads characterized by low-quality, uncalled bases, or with ambiguous barcodes. Retained reads were demultiplexed and trimmed to 94 bp by removing the barcode and terminal base. We then identified duplicated loci based on the segregation patterns of alleles in the offspring following the approach of Waples . More details about this approach are given in Supporting Information, File S1. Duplicated loci contain sequences from at least two distinct genomic locations and have segregation patterns deviating from disomic Mendelian inheritance (i.e., confounded loci sensu; Waples ). RAD loci, including duplicated loci, were then genotyped for the haploid family and combined with genotypes for the 5′-nuclease SNPs for linkage map construction. Strict segregation distortion tests served to detect and discard any mis-specified loci. We excluded progeny with less than 0.5 × 106 reads because genotypes could not be reliably assigned with lower sequence coverage.

Linkage map construction

The haploid family was used to construct the linkage map and the gynogenetic diploid family was used to locate centromeres (Figure 1). The map was generated by using the minimum spanning tree method implemented in the program MSTMap (Wu ). Duplicated loci were mapped following the work of Waples . We compared our map with previously published linkage maps for sockeye salmon (Everett ) and rainbow trout (Miller ) to aid construction and annotation of linkage groups (LGs). We name LGs by considering the numbers given by Everett preceded by a ‘So’ prescript (see File S1 and Table S1 for more details). Half-tetrad analysis of the gynogenetic diploid offspring was used to place centromeres. Raw Illumina reads were processed in Stacks as described for haploids. Only haplotypes for nonduplicated loci were exported. The need to accurately call heterozygote genotypes in the diploid progeny necessitated a higher cut-off threshold of 1.5 × 106 reads compared to the haploid progeny. We exported haplotype alleles from Stacks to call genotypes and merged these with genotypes for SNPs detected with the 5′-nuclease assays. Loci with >10% missing genotypes were discarded. We estimated the frequency of second division segregation (y) for each locus following the work of Thorgaard to identify centromeric regions on each LG. Values of y are scored as observed heterozygosity in the gynogenetic diploid family and range between zero and one. A heterozygote genotype indicates that an odd number of recombination events have occurred between the locus and the centromere; a homozygous genotype indicates zero or an even number of recombination events. Centromeres are nonrecombining regions that occur as a single location on a linkage group; therefore, adjacent loci are expected to have low y values because recombinations will be rare over the short locus-centromere distances. For loci with low y values, it can be difficult to identify their orientation in relation to the centromere. Therefore, centromeric regions were conservatively defined as the shortest interval on an LG including all markers with y < 0.10. Finally, loci were assigned to specific chromosomal regions, arms or centromeres, to complete the linkage map and to enable downstream analyses of interference. We labeled loci with y < 0.10 as centromeric (c), and, for acrocentric chromosomes, loci with y > 0.10 were assigned to reside on arm a1. For metacentric chromosomes, loci with y > 0.10 were arbitrarily assigned to reside on arm a1 if located before the centromeric region [i.e., smaller centimorgan (cM) value on the map], or to reside on arm a2 if located after the centromeric region. Finally, it is important to note that even with a few thousand loci, our coverage of the genome will often be insufficient to detect the short p arm for some acrocentric chromosomes.

Interference

First, we identified recombination events in both haploid and diploid gynogenetic families along each linkage group. We infer phase changes within haploid offspring to be the result of recombination using the parental phase inferred during linkage map construction. We count offspring with zero, one, or two crossover events per arm. In gynogenetic diploids, homozygotes and heterozygotes reflect different phases of the recombinant chromatid resulting from meiosis I. Accordingly, homozygotes and heterozygotes represent distinct maternal phases. Duplicated loci were not considered because alleles could not be assigned to a unique map location. This framework was then used to count number and location of recombination events within each offspring and for each LG (Figure 2). Crossover events were detected as phase changes observed using a sliding window that recorded the mean phase over 11 consecutive marker locations. Up to two crossovers were placed along each chromosome arm. Because double crossover events were rare, we discarded putative third crossovers because these are likely to be the result of genotyping error at one or a few terminal loci (Brieuc ).
Figure 2

Examples of crossover patterns for four different gynogenetic haploids along LG So2. The two maternal allelic phases are shown on the y-axis. The gray area identifies the centromeric region (see text). Upper left plot shows a haploid offspring with no recombination events, an occurrence in approximately 50% of meiotic products when all four tetrads are sampled. The upper right shows an offspring with one crossover on each arm. The lower left plot depicts an offspring with one crossover on arm a1. The lower right plot shows an example of a haploid offspring with two crossovers on arm a2.

Examples of crossover patterns for four different gynogenetic haploids along LG So2. The two maternal allelic phases are shown on the y-axis. The gray area identifies the centromeric region (see text). Upper left plot shows a haploid offspring with no recombination events, an occurrence in approximately 50% of meiotic products when all four tetrads are sampled. The upper right shows an offspring with one crossover on each arm. The lower left plot depicts an offspring with one crossover on arm a1. The lower right plot shows an example of a haploid offspring with two crossovers on arm a2. We estimated the genome-wide distribution of y to assess the potential occurrence of interference. With no interference, and therefore fully independent crossover locations, the maximum expected value of y is 0.67 (Anderson 1925). In contrast, with complete interference, y is expected to reach 1.00 at distal locations because there will always be exactly one crossover between distal loci and the centromere. Next, we estimated the strength of interference by fitting a gamma model to the distribution of observed interchiasma distances using CODA (Gauthier ). The shape parameter (ν) of the fitted gamma distribution is referred to as the interference parameter; values between zero and one indicate negative interference (i.e., crossover locations are more tightly clustered than expected at random). A value of one signals no interference and that crossover locations are independent of each other (i.e., follow an exponential distribution), and values above one signal positive interference, with higher values indicating stronger interference. Interchiasma distances were only recorded for the diploid progeny to avoid any potential bias from merging observations with the haploid progeny that were used to generate the map distances. The distribution of interchiasma distances was then fit using the biologically realistic two parameter model that allows some fraction (p) of observed crossovers to follow a noninterfering pathway (Falque ; Gauthier ; Basu-Roy ). As recommended by Gauthier , we used the hill-climbing algorithm to search a parameter space with ν = [1:20] and p = [0:1] to find the best fit for each LG. For LGs where estimates of ν equaled 20, the analysis was repeated with increased boundaries. The CIs around estimates of ν were calculated using the Fisher information matrix (Gauthier ).

Interference across centromeres

We tested for the occurrence of interference across centromeres on metacentric chromosomes by considering all offspring showing at least one crossover on both arms. We followed the method described in Colombo and Jones (1997) by using the Spearman correlation function:Here, χa1 and χa2 are the genetic distance from the midpoint of defined centromeric region to the first crossover location on arms a1 and a2. We considered different interval sizes around the centromeric midpoint (d) ranging from 10 cM, minimum distance allowing enough observations, and sequentially increasing d by 1 cM. A negative correlation indicates interference across centromeres, because a crossover close to the centromere translates into a larger-than-expected distance to the first crossover on the opposite arm. Data were pooled across all metacentric LGs because our data have too few observations to consider individual LGs. We estimated ρ(d) for both haploid and diploid families; families were treated independently for the reasons given above.

Data availability

Raw sequence data are deposited in the Short Read Archive (SRA) with accession number SRP063568. Genotypes for both haploid and diploid progeny are deposited on Dryad (doi: 10.5061/dryad.q675s).

Results

Linkage map

An average number of 2.5 × 106 reads for 93 haploids and 3.0 × 106 reads for 77 diploids (Table S2) resulted after quality filtering, barcode recovery, demultiplexing, and discarding individuals with low coverage. A total of 3496 loci remained after excluding loci uninformative for mapping (monomorphic in the female parent or with >25% missing genotypes in the offspring). Of these, 868 loci were classified as duplicated. Adding the 31 polymorphic 5′-nuclease SNPs resulted in 3527 loci available for linkage map construction. Initial linkage group construction using the haploid family produced 30 LGs containing between 21 and 184 markers each (Table S3). Comparison to an existing but less dense map for sockeye salmon served to validate and name LGs (File S1). In two instances, two of our linkage groups matched the same LG in the map of Everett ; these were denoted by A and B after the LG number. Two LGs corresponded to LG9 in Everett and syntenic comparison with rainbow trout identified these LGs to be the two female sockeye salmon sex chromosomes X1 and X2 (File S1); these were named So9A_(X2) and So9B_(X1) following previous descriptions (Thorgaard 1978). For the LGs So18A and So18B pair, So18B does not have a centromere (Figure 3); here, we consider LGs So18A and So18B to be the two arms of a single metacentric chromosome (LG18 in Everett ). The total map length was 2839 cM and included 2640 nonduplicated and 605 duplicated loci (Table S3).
Figure 3

Estimated y values plotted along each linkage group. Each plot represents an LG with mapped loci presented by circles in the top with nonduplicated loci shown as gray circles and duplicated loci shown as red circles. Estimated y values are plotted on the y-axis for each nonduplicated marker and shown as black circles. The horizontal dotted line equals the maximum expected value of y (0.67) without interference. Loci with low y values are expected to locate proximal to centromeres, and higher values are expected for loci with increasing distance from centromeres. Overall, these plots serve to effectively distinguish acrocentric (e.g., LG So1) from metacentric (e.g., LG So2) chromosomes and define regions that include the centromere.

Estimated y values plotted along each linkage group. Each plot represents an LG with mapped loci presented by circles in the top with nonduplicated loci shown as gray circles and duplicated loci shown as red circles. Estimated y values are plotted on the y-axis for each nonduplicated marker and shown as black circles. The horizontal dotted line equals the maximum expected value of y (0.67) without interference. Loci with low y values are expected to locate proximal to centromeres, and higher values are expected for loci with increasing distance from centromeres. Overall, these plots serve to effectively distinguish acrocentric (e.g., LG So1) from metacentric (e.g., LG So2) chromosomes and define regions that include the centromere. The gynogenetic diploid family was used to place centromeres; 2562 of the 2640 nonduplicated loci on the linkage map were successfully genotyped for the diploid progeny. Centromeric regions were located by plotting y values along the linkage map (Figure 3). Acrocentric chromosomes were characterized by a linear pattern of y along linkage groups (e.g., So1), and clearly distinguishable from metacentric chromosomes characterized by a V-shaped y plot (e.g., So3) with values below 0.10 defining centromeric regions. After centromere placement, the 30 LGs (2n = 60) translated into a total of 102 chromosome arms (NF = 102; Table S3) which is within the range of previously reported karyotypes for sockeye salmon (NF = 100–104) (Thorgaard 1978; Phillips and Rab 2001).

Distribution of retained duplicated regions

Duplicated loci were not randomly distributed across the genome. Mapping of two paralogs from 40 duplicated loci enabled identification of six pairs of homeologous regions located on 11 different LGs (Table 1). The majority of duplicated loci (457, 76%) mapped to these 11 LGs, including both arms on So21. Within these 11 LGs, duplicated loci represented between 22 and 66% of all loci and concentrated towards telomeres within distinct arms (red circles in Figure 3; Table S3). Twelve pairs of paralogs colocated within 10 cM on the same LG and likely represent local duplication events that do not originate from the WGD (Waples ). Another extended, but unpaired, region dominated by duplicated loci occurred on arm a1 of So27; remaining duplicated loci, possibly segmental duplications, were dispersed across the genome (Figure 3).
Table 1

Pairing of homeologous chromosome arms in sockeye salmon

Homeolog 1Homeolog 2No. of Paired LociHomeology in Rainbow Trout
LGArm (Region)LGArm (Region)
So2a1 (2.9–3.2)So5a1 (0.0–6.5)4Omy07p-Omy18p
So3a1 (1.1–5.4)So14a1 (5.4–28.8)3Omy05p-Omy02p
So8a2 (92.6–100.1)So23a2 (105.3)4Omy21p-Omy15q
So11a1 (8.6)So18Aa1 (6.8)1NA
So15a2 (101.1–102.5)So21a1 (2.2–3.7)4Omy17q-Omy13pa
So21a2 (103.7)So26a1 (12.0–22.2)4NA

The linkage group region including the loci supporting a homeologous relationship is given as the start and end location in cM. The two right columns present syntenic comparisons for rainbow trout as inferred from alignment results (File S1; Table S1) and previous reports of homeologous pairs in rainbow trout (Phillips ; Kodama ).

The synteny between So21 and rainbow trout chromosome Omy13p were only supported by syntenic reports in rainbow trout (Table S1).

The linkage group region including the loci supporting a homeologous relationship is given as the start and end location in cM. The two right columns present syntenic comparisons for rainbow trout as inferred from alignment results (File S1; Table S1) and previous reports of homeologous pairs in rainbow trout (Phillips ; Kodama ). The synteny between So21 and rainbow trout chromosome Omy13p were only supported by syntenic reports in rainbow trout (Table S1).

Interference along chromosome arms

The number of observed recombination events for each arm reveals a pattern of strong, but incomplete, interference. Among haploid progeny, only 64 of 1902 (3.4%) observations showed two crossovers within a single arm (Table 2). Similarly, we observed 178 of 3777 (4.7%) arms with more than one crossover among the diploid progeny (Table 2). Further evidence for interference was revealed by a high fraction of distal loci with y values above 0.67 (Figure 4).
Table 2

Observed numbers of haploid and diploid gynogenetic offspring with zero, one, or two crossovers are given for each linkage group and for each arm

Number of Offspring with 0, 1, or 2 Crossovers
LGChromosome TypeLinkage Group Arm Length (cM)Gynogenetic HaploidsGynogenetic Diploids
Incl. Duplicated LociExcl. Duplicated LociEntire Linkage GroupArm 1Arm 2Entire Linkage GroupArm 1Arm 2
Arm 1Arm 2Arm 1Arm 2012012012012012012
So2M4966476622472053400493490265473006610
So3M585856582250204547151420216537222676
So4M457145712250196726045453126146851657
So5M55555055224723573605140221362146303740
So6M485348532547195340055371027027411723
So7M5061506120502257360553443758106433674
So8M465546442552166825072210086527306683
So10M546854681945245043048387366057018617
So11M5561426122472152410484141964146301703
So12M565556552543234944049422126446931685
So13M61426142244819563436231031248460916591
So14M705644562643246428167260266196802669
So15M544854462550165139356370256026569680
So19M56405640294618553806825012253374021551
So20M695469541951195932259331006207140648
So21M51534252323823623106726001958195800770
So22M7651765123422545471613202059261122712
So23M495749572055185439043500336937207700
So24M635363532147234844154381116717422695
So27M726855681948245041241502146657201705
So28M81581574190858082110373285522051260
So18AM (arm 1)5352573505720
So18BM (arm 2)8391
So1A6969404764636
So16A7272563706578
So17A42425538045815
So25A63633951375710
So26A7862464708661
So9A_(X2)A6565385235569
So9B_(X1)A5757523926642

Chromosome type denotes metacentric (M) or acrocentric (A) chromosomes according to Everett and Figure 3. Lengths of each chromosome arm are given for two linkage map versions; one including and one excluding duplicated loci. Cells with — indicate parameters that could not be estimated for the given LG for reasons given in the text. Linkage groups are ordered by chromosome type to facilitate comparison within and between metacentric and acrocentric chromosomes.

Figure 4

Genome-wide distribution of y values estimated for all nonduplicated loci in the gynogenetic diploid family. Loci with values near zero are expected to be proximal to centromeres, and loci with higher values are expected to be more distally located. Values of y above 0.67 are only expected with the occurrence of positive crossover interference.

Chromosome type denotes metacentric (M) or acrocentric (A) chromosomes according to Everett and Figure 3. Lengths of each chromosome arm are given for two linkage map versions; one including and one excluding duplicated loci. Cells with — indicate parameters that could not be estimated for the given LG for reasons given in the text. Linkage groups are ordered by chromosome type to facilitate comparison within and between metacentric and acrocentric chromosomes. Genome-wide distribution of y values estimated for all nonduplicated loci in the gynogenetic diploid family. Loci with values near zero are expected to be proximal to centromeres, and loci with higher values are expected to be more distally located. Values of y above 0.67 are only expected with the occurrence of positive crossover interference. We used the observed distribution of interchiasma distances to estimate the strength of interference across chromosomes. Estimates of ν equaled the upper limit of 20 for five LGs (So9A_(X2), So9B_(X1), So18A, So25, and So26). When the upper limit of ν in the searched parameter space was increased, the ν estimates continued to equal the maximum allowed value (i.e., ν → ∞). Common to these five LGs is that they represent one chromosome arm with few, if any, double crossovers; this likely impeded the fitting of a gamma distribution for the few observed interchiasma distances (Table 2). Therefore, we do not report estimates of ν for these LGs, but conclude that the data signal strong interference as well (Basu-Roy ; Table S4). The average point estimate among remaining LGs (ν = 7.3) was considerably larger than the value expected without interference (ν = 1). These observations were further supported by low estimates of p (p = 0.00–0.16) (Table S4) indicating that most crossovers are affected by interference (Falque ). Figure 5 shows estimates of ν for each LG. All LGs show positive interference, although CIs for So4, So16, and So28 include one, the expected value with no interference. Further, CIs around ν did not include the genome-wide average for seven LGs (So5, So6, So13, So15, So17, So27, So28), suggesting that strength of interference varies among chromosomes (Figure 5). No clear correlation was found when plotting ν as a function of genetic length of each LG (Figure S1).
Figure 5

Maximum-likelihood estimates of the interference parameter ν from the gamma model (black circles). Vertical gray lines show the 95% C.I. around each ν estimate. If interference does not affect crossover location, then ν will take a value of 1.0 (horizontal dotted line). The genome-wide average (ν = 7.3) is shown by the solid horizontal line. Estimates of ν were not plotted for six LGs (So9A_(X2), So9B_(X1), So18A, So18B, So25, and So26) because too few double crossovers on these linkage groups impeded reliable estimation of ν (see text).

Maximum-likelihood estimates of the interference parameter ν from the gamma model (black circles). Vertical gray lines show the 95% C.I. around each ν estimate. If interference does not affect crossover location, then ν will take a value of 1.0 (horizontal dotted line). The genome-wide average (ν = 7.3) is shown by the solid horizontal line. Estimates of ν were not plotted for six LGs (So9A_(X2), So9B_(X1), So18A, So18B, So25, and So26) because too few double crossovers on these linkage groups impeded reliable estimation of ν (see text). We detected striking drops of y values toward telomeric regions of some LGs; this pattern was particularly pronounced on So2 (arm a2) and So22 (arm a1; Figure 3). These observations result from an increased number of gynogenetic diploid progeny that show two recombinations along these arms (Table 2) because loci located distal to a second chiasma will reduce estimates of y. Arm a2 of So2 showed an increased number of double crossovers in the haploids as well, whereas this was not the case for So22 arm a1 (Table 2). In general, occurrence of double crossovers was more common on longer chromosome arms, but absence of double crossovers in some of the longer arms suggest that genetic length is not the only factor determining the frequency of multiple crossovers (Figure 6).
Figure 6

Number of observed double crossovers plotted against arm length. Symbols are shown in transparent gray to highlight over plotting. Results are given for haploid and diploid families separately because observations for diploid offspring do not include duplicated regions.

Number of observed double crossovers plotted against arm length. Symbols are shown in transparent gray to highlight over plotting. Results are given for haploid and diploid families separately because observations for diploid offspring do not include duplicated regions. We estimated the Spearman correlation coefficient (ρ) between distances from centromeres to the nearest chiasma location on both arms for the 20 metacentric LGs. For the haploid data, no values of ρ were significantly different from 0, whereas estimates for diploids were significantly negative for d < 15 cM (Figure 7). Nevertheless, the haploid data showed a similar pattern of negative correlations for d < 15 cM. Lack of significant correlations in the haploid family may be due to reduced statistical power from the limited number of observations; the similar patterns observed between the haploid and diploid families lead us to conclude that interference does affect crossover patterns across centromeres.
Figure 7

Interference across centromeres. Estimates of the correlation function ρ(d) = corr(χa1, χa2|max([χa1, χa2] ≤ d) for different windows of genetic distance (d) in both directions from the midpoint of defined centromeric regions. Estimates of ρ(d) were obtained for each integer value of d ranging from 10 to 75 cM. The gray area denotes the 95% C.I. around ρ(d). Observations are for all diploid offspring and pooled across all 20 metacentric LGs.

Interference across centromeres. Estimates of the correlation function ρ(d) = corr(χa1, χa2|max([χa1, χa2] ≤ d) for different windows of genetic distance (d) in both directions from the midpoint of defined centromeric regions. Estimates of ρ(d) were obtained for each integer value of d ranging from 10 to 75 cM. The gray area denotes the 95% C.I. around ρ(d). Observations are for all diploid offspring and pooled across all 20 metacentric LGs.

Discussion

We demonstrate the power of using both haploid and diploid gynogenetic offspring from a single-pair mating to describe detailed patterns of recombination in a duplicated salmonid genome. Results reveal extensive distal regions dominated by duplicated loci as well as strong, but varying, levels of interference for most LGs. Our use of a single female parent comes with some limitations for drawing general conclusions: our data only reflect information from heterozygous loci in the single female parent. Likewise, variability among populations, individuals, and sexes are not captured (c.f., Johnston ).

Extensive subtelomeric regions remain duplicated in sockeye salmon

Linkage mapping revealed the occurrence of numerous regions with high levels of retained sequence identity between putative homeologs in sockeye salmon. These observations are consistent with an ongoing rediploidization of the sockeye salmon genome where some homeologous chromosomes still undergo residual tetrasomic inheritance (May ), a pattern that is shared among many extant species within the salmonid family (Allendorf ). We detected six pairs of homeologous regions dominated by duplicated loci as well as an unpaired region on the So27 arm a1. A total of eight conserved pairs of duplicated homeologs have been described for a range of Oncorhynchus (Kodama ) and Salmo (Lien ) species. Although we cannot rule out the possibility that some of these regions have fully diploidized in sockeye salmon based on these data, a more parsimonious explanation is that we have failed to identify another two pairs because we considered too few meiosis (progeny). Many genomic studies of polyploids, including salmonids, exclude duplicates to avoid inclusion of loci that deviate from standard assumptions such as Hardy-Weinberg proportions (Poland ; Gagnaire ; Hyma and Fay 2013; Limborg ). Consequently, extended regions that remain duplicated are not included in most population genomic studies. The filtering of duplicated loci is especially pertinent to polyploid species where extended telomeric regions are excluded in downstream scans for selection. Important genes located in these duplicated regions will remain undetected, leading to an incomplete understanding of the genetic basis of local adaptation (see discussion in Allendorf ). Finally, our study demonstrates how comparative mapping among species represents a powerful tool for validating de novo linkage maps in nonmodel species. Indeed, conserved restriction enzyme cut sites among genomes in related species make RAD sequencing, and related techniques, particularly useful for identifying orthologous chromosomal regions through comparative mapping (File S1); a feature that is expected to enrich mapping studies among related species.

Recombination patterns reveal strong interference

Thorgaard first reported strong interference across the salmonid genome based on data from only 10 allozyme loci, three of which had y values at or close to 1.0. We expand the evidence for strong interference by mapping 3245 loci, finding loci with y values approaching 1.0 on nearly all linkage groups. It is important to note that we only used nonduplicated loci for estimating y. Thus, the proportion of loci with y values above 0.67 is likely to be an underrepresentation of the true number (c.f., Lindner ). Interestingly, similar to our finding, Lindner also observed an overrepresentation of loci with low values of y (see y ≤ 0.1 in Figure 4). This peak can be explained by stronger crossover suppression near centromeres, which has been observed in a number of model species (Koehler ; Lamb ; Rockmill ). One explanation involves selection against crossovers proximal to the centromere that has been shown to destabilize meiotic segregation and increase occurrence of nondisjunction (Talbert and Henikoff 2010). Assuming an even genome coverage of loci, our observation of an increased number of loci with low y values supports a similar model of selection against crossovers proximal to centromeres in sockeye salmon. Our study adds new insights about genome-wide interference in a salmonid genome by presenting the first quantitative estimates of the strength of interference across each chromosome. Positive interference for most, if not all, chromosomes described for sockeye salmon in this study is common to a range of other eukaryotes including plants (Lhuissier ; Giraut ) and mammals (Broman ; Segura ). Nevertheless, most existing data come from model species, and the strength of interference varies greatly among taxa (Giraut ), illustrating the need for obtaining species-specific estimates. Importantly, we demonstrate that data from mapping studies that use genotyping by sequencing in nonmodel species can be used to obtain quantitative estimates of interference. Because no additional data are needed, we advocate a routine execution of interference analyses in future mapping studies. This will lead to a deeper understanding of variation across more taxa and of how recombination interference is affecting genome evolution in general. Observations of more than one crossover, coupled with subtelomeric regions showing declining values of y, illustrate that interference is not complete, a pattern supported by observations in both haploid and diploid families. The occurrence of double crossovers was generally more common on longer chromosome arms (Figure 6). Our measurement of recombination length, without knowledge of the corresponding physical length, complicates interpretation. It is tempting to posit that interference simply erodes with chromosome arm length, increasing the chance of observing multiple crossovers for longer chromosomes (Broman ; Giraut ; Mary ). However, significant outlier LGs belie this interpretation (see Table 2): LG So17 is a short acrocentric (42 cM), yet it has 15 double crossovers; So14 and So27 have two of the longest arms (≥70 cM), yet they have no double crossovers. Clearly the recombination length of chromosome arms is not the only factor affecting interference, and recombination patterns are likely dictated by a complex blend of different meiotic mechanisms. Varied strength of interference among chromosomes would have strong implications for interpreting genome data; unfortunately, there are little, if any, data on this issue in nonmodel taxa. Although no clear correlation between strength of interference and genetic distance is observed here, the two shortest LGs, So17 and So28, also have the lowest estimates of ν (Table S4). Evidence from yeast and humans also shows that interference is weaker on small chromosomes (Kaback ; Kaback 1996). However, our observation of interchromosomal differences has to be interpreted with care. All LGs are shorter than 150 cM and contain few double crossovers that inevitably translate into uncertainties because ν estimates are conditioned on the observation of interchiasma distances (Broman and Weber 2000). Nevertheless, we do detect a genome-wide pattern of strong, but incomplete, interference with some observed variation in the strength of interference among chromosomes; these results have important implications for the analyses of genomic data (see below).

Interference occurs across centromeres

We found evidence that interference occurs across a region spanning ∼15 cM on either side of the centromere; this implies that crossovers occurring near the centromere affect recombination events on the opposite arm. Transcentromere interference has been reported in other species groups, including insects (Colombo and Jones 1997), mammals (Broman and Weber 2000; Mary ), and fish (Demarest ). In our data, the effect declines rapidly with increasing distance from the centromere and disappears at distances greater than 15 cM. This rapid decline of transcentromere interference contrasts with patterns observed in humans (Broman and Weber 2000) and pigs (Mary ).

Implications for analyzing genomic data

Recombination is an important evolutionary process that generates novel genetic variation. Compelling theoretical constructs predict an evolutionary advantage of increased recombination rates (Felsenstein 1974; Barton and Charlesworth 1998), and empirical evidence for this theory has been found in, for example, Escherichia coli (Cooper 2007) and Drosophila melanogaster (Presgraves 2005; Campos ). Yet, it has proven nontrivial to broadly demonstrate such an effect because studies have been restricted to a few model species and because recombination rates vary among chromosomes and the few taxonomic groups studied (Cutter and Payseur 2013). In this study, we demonstrate how data generated for linkage mapping in nonmodel species can also be used to estimate interference. These methods can be used to estimate species-specific rates of interference whenever a linkage map is created, leading to a more complete understanding of how this fundamental meiotic process shapes genomic data across species. Our results will also aid interpretation of new analyses facilitated by genomic data. One such method involves examining length distributions of genomic runs of homozygosity (ROH) to infer migration or inbreeding history (Pool and Nielsen 2009; Kirin ; Harris and Nielsen 2013). The length of the ROH is affected by recombination patterns, because shorter runs are expected in regions with higher recombination rates. Interference is also expected to affect the length distribution of ROHs, especially if the strength of interference varies among the individuals and populations being compared (Broman and Weber 2000). The concrete effect of strong interference on these analyses remains unclear but deserves attention in future developments of these methods. A first step could include estimates of interference within each of the populations being compared; if the strength and pattern of interference are similar among populations, then it will be easier to justify conclusions based on ROH comparisons. Assumptions of interference strength are crucial to genetic mapping functions. Models assuming different levels of interference have been shown to produce different results based on the same data (Zhao and Speed 1996). Our genome-wide estimate of the interference parameter (ν = 7.3) in sockeye salmon is closer to that assumed by the Carter-Falconer mapping model (ν = 7.6) than the more often used Haldane (ν = 1.0) and Kosambi (ν = 2.6) functions (Zhao and Speed 1996; Broman and Weber 2000). With the accumulation of more accurate estimates of species-specific interference levels it will be possible to improve species-specific mapping efforts; such improvements will have implications when mapping genes that control important phenotypic traits.
  63 in total

Review 1.  Chromosome evolution in the Salmonidae (Pisces): an update.

Authors:  R Phillips; P Ráb
Journal:  Biol Rev Camb Philos Soc       Date:  2001-02

2.  Trans-centromere effects on meiotic recombination in the zebrafish.

Authors:  Bradley L Demarest; Wyatt H Horsley; Erin E Locke; Kenneth Boucher; David J Grunwald; Nikolaus S Trede
Journal:  Genetics       Date:  2010-10-26       Impact factor: 4.562

Review 3.  The advantages and disadvantages of being polyploid.

Authors:  Luca Comai
Journal:  Nat Rev Genet       Date:  2005-11       Impact factor: 53.242

4.  Hot regions of noninterfering crossovers coexist with a nonuniformly interfering pathway in Arabidopsis thaliana.

Authors:  Sayantani Basu-Roy; Franck Gauthier; Laurène Giraut; Christine Mézard; Matthieu Falque; Olivier C Martin
Journal:  Genetics       Date:  2013-09-11       Impact factor: 4.562

Review 5.  Reanalysis suggests that genomic islands of speciation are due to reduced diversity, not reduced gene flow.

Authors:  Tami E Cruickshank; Matthew W Hahn
Journal:  Mol Ecol       Date:  2014-06-17       Impact factor: 6.185

6.  Linkage mapping with paralogs exposes regions of residual tetrasomic inheritance in chum salmon (Oncorhynchus keta).

Authors:  R K Waples; L W Seeb; J E Seeb
Journal:  Mol Ecol Resour       Date:  2015-03-11       Impact factor: 7.090

7.  Spontaneous X chromosome MI and MII nondisjunction events in Drosophila melanogaster oocytes have different recombinational histories.

Authors:  K E Koehler; C L Boulton; H E Collins; R L French; K C Herman; S M Lacefield; L D Madden; C D Schuetz; R S Hawley
Journal:  Nat Genet       Date:  1996-12       Impact factor: 38.330

8.  Two types of meiotic crossovers coexist in maize.

Authors:  Matthieu Falque; Lorinda K Anderson; Stephen M Stack; Franck Gauthier; Olivier C Martin
Journal:  Plant Cell       Date:  2009-12-29       Impact factor: 11.277

9.  The subtelomeric region is important for chromosome recognition and pairing during meiosis.

Authors:  María del Carmen Calderón; María-Dolores Rey; Adoración Cabrera; Pilar Prieto
Journal:  Sci Rep       Date:  2014-10-01       Impact factor: 4.379

10.  Recombination speeds adaptation by reducing competition between beneficial mutations in populations of Escherichia coli.

Authors:  Tim F Cooper
Journal:  PLoS Biol       Date:  2007-09       Impact factor: 8.029

View more
  8 in total

1.  Identification of Multiple QTL Hotspots in Sockeye Salmon (Oncorhynchus nerka) Using Genotyping-by-Sequencing and a Dense Linkage Map.

Authors:  Wesley A Larson; Garrett J McKinney; Morten T Limborg; Meredith V Everett; Lisa W Seeb; James E Seeb
Journal:  J Hered       Date:  2015-12-28       Impact factor: 2.645

2.  Genomic signatures among Oncorhynchus nerka ecotypes to inform conservation and management of endangered Sockeye Salmon.

Authors:  Krista M Nichols; Christine C Kozfkay; Shawn R Narum
Journal:  Evol Appl       Date:  2016-10-21       Impact factor: 5.183

3.  A Dense Brown Trout (Salmo trutta) Linkage Map Reveals Recent Chromosomal Rearrangements in the Salmo Genus and the Impact of Selection on Linked Neutral Diversity.

Authors:  Maeva Leitwein; Bruno Guinand; Juliette Pouzadoux; Erick Desmarais; Patrick Berrebi; Pierre-Alexandre Gagnaire
Journal:  G3 (Bethesda)       Date:  2017-04-03       Impact factor: 3.154

4.  Salmonid Chromosome Evolution as Revealed by a Novel Method for Comparing RADseq Linkage Maps.

Authors:  Ben J G Sutherland; Thierry Gosselin; Eric Normandeau; Manuel Lamothe; Nathalie Isabel; Céline Audet; Louis Bernatchez
Journal:  Genome Biol Evol       Date:  2016-12-01       Impact factor: 3.416

5.  Genomic consequences of colonisation, migration and genetic drift in barn owl insular populations of the eastern Mediterranean.

Authors:  Ana Paula Machado; Alexandros Topaloudis; Tristan Cumer; Eléonore Lavanchy; Vasileios Bontzorlos; Renato Ceccherelli; Motti Charter; Nicolaos Kassinis; Petros Lymberakis; Francesca Manzia; Anne-Lyse Ducrest; Mélanie Dupasquier; Nicolas Guex; Alexandre Roulin; Jérôme Goudet
Journal:  Mol Ecol       Date:  2021-12-23       Impact factor: 6.622

Review 6.  Crossover Interference: Shedding Light on the Evolution of Recombination.

Authors:  Sarah P Otto; Bret A Payseur
Journal:  Annu Rev Genet       Date:  2019-08-20       Impact factor: 16.830

7.  The sockeye salmon genome, transcriptome, and analyses identifying population defining regions of the genome.

Authors:  Kris A Christensen; Eric B Rondeau; David R Minkley; Dionne Sakhrani; Carlo A Biagi; Anne-Marie Flores; Ruth E Withler; Scott A Pavey; Terry D Beacham; Theresa Godin; Eric B Taylor; Michael A Russello; Robert H Devlin; Ben F Koop
Journal:  PLoS One       Date:  2020-10-29       Impact factor: 3.240

8.  Mapping of Adaptive Traits Enabled by a High-Density Linkage Map for Lake Trout.

Authors:  Seth R Smith; Stephen J Amish; Louis Bernatchez; Jeremy Le Luyer; Chris C Wilson; Olivia Boeberitz; Gordon Luikart; Kim T Scribner
Journal:  G3 (Bethesda)       Date:  2020-06-01       Impact factor: 3.154

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.