Literature DB >> 27986793

A SNP Based Linkage Map of the Arctic Charr (Salvelinus alpinus) Genome Provides Insights into the Diploidization Process After Whole Genome Duplication.

Cameron M Nugent1, Anne A Easton2, Joseph D Norman2, Moira M Ferguson2, Roy G Danzmann1.   

Abstract

Diploidization, which follows whole genome duplication events, does not occur evenly across the genome. In salmonid fishes, certain pairs of homeologous chromosomes preserve tetraploid loci in higher frequencies toward the telomeres due to residual tetrasomic inheritance. Research suggests this occurs only in homeologous pairs where one chromosome arm has undergone a fusion event. We present a linkage map for Arctic charr (Salvelinus alpinus), a salmonid species with relatively fewer chromosome fusions. Genotype by sequencing identified 19,418 SNPs, and a linkage map consisting of 4508 markers was constructed from a subset of high quality SNPs and microsatellite markers that were used to anchor the new map to previous versions. Both male- and female-specific linkage maps contained the expected number of 39 linkage groups. The chromosome type associated with each linkage group was determined, and 10 stable metacentric chromosomes were identified, along with a chromosome polymorphism involving the sex chromosome AC04. Two instances of a weak form of pseudolinkage were detected in the telomeric regions of homeologous chromosome arms in both female and male linkage maps. Chromosome arm homologies within the Atlantic salmon (Salmo salar) and rainbow trout (Oncorhynchus mykiss) genomes were determined. Paralogous sequence variants (PSVs) were identified, and their comparative BLASTn hit locations showed that duplicate markers exist in higher numbers on seven pairs of homeologous arms, previously identified as preserving tetrasomy in salmonid species. Homeologous arm pairs where neither arm has been part of a fusion event in Arctic charr had fewer PSVs, suggesting faster diploidization rates in these regions.
Copyright © 2017 Nugent et al.

Entities:  

Keywords:  diploidization; duplicated genes; epigenetic modification; linkage map; salmonid fishes; transmission genetics; transposition

Mesh:

Year:  2017        PMID: 27986793      PMCID: PMC5295600          DOI: 10.1534/g3.116.038026

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Whole genome duplications (WGDs) are rare evolutionary events that drastically alter genomic architecture by producing duplicate copies of every chromosome. The doubling of all loci can be a major driving force of evolution, as there is the potential to produce loci with novel functions through the accumulation of mutations that were formerly selected against (Ohno 1970). WGDs provide a surfeit of genetic information that can be associated with adaptive innovation and evolutionary change (Taylor and Raes 2004; Moghadam ; Nakatani ; Berthelot ). WGDs have been important events in the evolutionary history of vertebrate animals (Smith ). Two WGDs occurred in the common ancestor of all vertebrates, and these (referred to as the 1R and 2R WGDs) yielded a karyotype of between 40 and 52 chromosomes from the protovertebrate karyotype of 10–13 chromosomes (Nakatani ). A third (3R) WGD occurred ∼400 million yr ago (MYA) in the ancestor of teleost fish, while a fourth salmonid specific WGD (Ss4R) took place ∼96 MYA in the ancestor of salmonid fishes (Allendorf and Thorgaard 1984; Berthelot ; Macqueen and Johnston 2014; Lien ). Subsequent chromosome fusions and fissions caused changes in chromosome numbers and genomic architecture in specific vertebrate lineages (Kasahara ; Nakatani ). Following WGD, the genome undergoes the process of diploidization, where it reverts from a tetraploid (4n) to a diploid (2n) state. Specifically, chromosome pairs that shared a common genetic ancestor prior to WGD (termed homeologs), diverge from one another due to genomic rearrangements, gene deletion, pseudogenization, and mutation (Comai 2005; Bergthorsson ). Salmonids are in the midst of the diploidization process, in that some regions of the genome have diverged into two pairs of diploid loci, while in other regions, residual tetrasomy occurs as the result of multivalent formation and recombination among homeologues during meiosis (Berthelot ; Allendorf ; May and Delany 2015; Lien ). For example, 52% of genes in the rainbow trout (Oncorhynchus mykiss) genome have diploidized since the Ss4R WGD, while the other 48% of genes have retained both copies, and have yet to revert to a diploid state (Berthelot ). Similarly, work on the Atlantic salmon (Salmo salar) genome has found that 55% of genes have been retained as two functional copies since the Ss4R WGD (Lien ). Chromosomal architecture appears to play a role in determining which genomic regions undergo homeologous recombination during meiosis in these fishes (Brieuc ; Lien ; Kodama ; Waples ). All recombining pairs of homeologs appear to include one large chromosome, either a metacentric produced through the Robertsonian fusion of two chromosome arms, or a fused acrocentric resulting from the tandem fusion of two smaller acrocentric chromosomes. This suggests that the large size of these chromosomes may provide the stability necessary for homeologous recombination (Brieuc ; Kodama ; Lien ). Furthermore, the telomeric regions of homeologs show the slowest rates of diploidization within several salmonid species, as illustrated by a relatively high number of paralogous loci (Brieuc ; Waples ; Lien ). The development of detailed linkage maps has shown that lineage specific changes in chromosomal architecture have taken place in salmonids following WGD. Maps are available for species of Salmo and Oncorhynchus (Rexroad ; Gonen ; Limborg ; McKinney ; Waples ; Tsai ). In the karyotypes of certain species such as Atlantic salmon, a large number of chromosomes are the product of fusions (e.g., 42/58) (Phillips and Ráb 2001). Variation in genomic architecture may cause the rate of diploidization to vary across the genomic landscape of the different taxa, and which regions of the genome undergo residual tetrasomy (Lien ; Brieuc ; Kodama ; Lien ). Similar studies of taxa with more basal karyotypes, such as the Salvelinus species, would improve our understanding of the role that genomic rearrangements have in the diploidization process. For instance, karyotype data suggests that only 20 of the 78 chromosomes (2n = 78) in Arctic charr (Salvelinus alpinus) are metacentric, suggesting that fewer Robertsonian fusion events have occurred in their evolutionary history (Hartley 1989; Phillips ). The study of genomic evolution in Arctic charr is currently limited by the low resolution of available genetic linkage maps, and the paucity of known molecular markers (Woram ; Norman ; Timusk ). Expanding the genomic resources of Arctic charr through the addition of several thousand SNPs will make it possible to track the genomic rearrangements that have shaped the modern Arctic charr karyotype. The initial aims of this study were to (1) increase the number of known genetic markers in the Arctic charr genome using genotype by sequencing (GBS); (2) create a second generation genetic linkage map of the Arctic charr genome using newly identified SNP markers, and integrate the revised map to previous versions primarily based on microsatellite loci; and (3) identify the chromosome type associated with each linkage group, and use this information to test for the existence of acrocentric-acrocentric homeologous pairs in the genome. These are of interest because a lack of homeologous recombination could cause diploidization to occur faster in homeologous pairs of this type. Using the data produced to meet the above goals, we also aimed to (4) compare genomic data from Arctic charr to the genomes of rainbow trout and Atlantic salmon to identify chromosome arm homologies across these three species, allowing the characterization of conserved genomic rearrangements and fusion/fission events unique to the Arctic charr lineage; and, also, (5) identify putative duplicate loci, and assess their distribution across the Arctic charr genome. As we analyzed sequence data, it became apparent that there were signatures of significant transposon activity within the Arctic charr genome. We therefore used this opportunity to characterize transposable element (TE) activity in the Arctic charr genome, and see how TE distribution is affected by residual tetrasomy, chromosome architecture, and the uneven distribution of duplicate loci throughout the genome. Transposon activity is associated with important evolutionary transitions, adaptation to novel environments, and extensive changes in genome evolution (de Boer ; Schrader ; Staton and Burke 2015). In fact, the estimated time of radiation of Salmoninae into Salmo, Oncorhynchus, and Salvelinus (14–23 MYA) (Macqueen and Johnston 2014) coincides with a known spike in TE activity (de Boer ). TEs cause sequence deletion or duplication due to unequal homologous recombination and segmental duplication, and they facilitate genomic rearrangements (Kazazian 2004). Genomic regions with an accumulation of TEs appear to evolve faster than the rest of the genome (Schrader ). Therefore, TE activity might be influenced by the rate of residual tetrasomy in particular chromosomal regions, or vice versa. Reduced presence of TEs has been observed in the duplicated regions of Atlantic salmon, chum salmon (O. keta), and chinook salmon (O. tshawytscha) that lag behind in the diploidization process (Larson ; McKinney ; Waples ; Lien ). Therefore, our final goal was to characterize TE activity in the Arctic charr genome, and determine if there is a relationship between residual tetrasomy and TE activity in the telomeric regions of homeologs that undergo homeologous recombination.

Materials and Methods

Source mapping panel

The analysis utilized 85 full-siblings, and their parents, from a single family of Fraser strain Arctic charr obtained from the Coastal Zones Research Institute (CZRI), Shippagan, NB Canada. The Fraser strain originated from collections of fish from the Fraser River, Labrador, Canada, in the 1980s. The family was produced on November 6, 2012, and reared communally until March 11, 2014, at which time each fish was PIT tagged, weighed (to the nearest gram), measured (fork length), and samples of adipose fin were removed for DNA analysis. DNA was extracted using a commercial kit (Qiagen DNeasy Blood & Tissue), as per manufacturer’s instructions, and treated with RNase A to remove any RNA. The samples were then quantified using a Qubit Fluorometer to ensure that all DNA concentrations exceeded 50 ng/µl.

Sequencing analysis

The DNA samples were submitted for GBS (Elshire ) at the Cornell Institute of Biotechnology. DNA from each progeny was added to a single well of a 96-well plate, while the parents were analyzed in triplicate, to increase sequencing depth and provide the information necessary for linkage mapping based on the inheritance of SNP alleles. The four grandparents of the family were also added to a single well each. Samples were digested with the restriction enzyme EcoT22I, and unique barcode sequence adapters (4–8 bp in length) were ligated to each of the DNA samples (three for each parent) such that the DNA sequence data could be assigned to a specific individual or parental subsample. After the barcodes were added, sequencing primers, and the samples from all 95 wells containing DNA samples (and a single blank control well) were pooled. Paired end sequencing primers with oligonucleotides that allow binding to the sequencing flowcell were then added to the pooled samples. Polymerase chain reaction (PCR) was then used to amplify the DNA fragment pool, and the resulting DNA products were analyzed for fragment size. The DNA samples were then sequenced on an Illumina Hisequation 2000 high-throughput sequencing instrument, and replicated across two flowcells. The sequencing process produced 100 bp single-end reads. GBS sequence reads are available in the NCBI sequence read archive (www.ncbi.nlm.nih.gov/sra) under the BioProject accession number #SRP026259 and BioSample accession numbers #SAMN06165956 and #SAMN06165957.

SNP identification from raw sequence data

Raw sequence reads were analyzed using the UNEAK pipeline, part of the Tassel 3.0 software package produced by the maize genetics laboratory at Cornell University (Glaubitz ). The UNEAK pipeline allows for the identification of SNPs in species where a reference genome is not available. The 5′ sequence barcodes were used to define individual-specific reads, but were trimmed prior to sequence analysis. Alignment of all the sequence data was conducted to produce a master “tag” list for the dataset. Tags are unique sequences of up to 64 bp in length observed across multiple reads. Further alignment of the master tag list identified tag pairs with a single base pair mismatch, and these were considered SNPs (with a sequencing error tolerance rate parameter of 0.03) (for more information, see Glaubitz ). Note the program only considered tag pairs with a 1 bp mismatch to be SNPs, so any 64 bp read with 2+ SNPs would be excluded. The number of times each tag is observed in the sequencing data from each individual is used to determine the individual’s genotype for a particular SNP.

SNP filtering

The inventory list of SNP genotypes identified within the UNEAK pipeline was analyzed manually to remove markers where: (1) data were available for <75/85 progeny; (2) the genotype for one or both of the parents was missing. Markers where SNP inheritance displayed significant segregation distortion (i.e., SNPs with G values >6.693, P < 0.01) were identified and removed using LINKMFEX (Danzmann 2016).

Microsatellite genotyping

To anchor the newly identified SNP markers to previous Arctic charr linkage maps, genotypes at 102 microsatellite loci (Supplemental Material, File S1) from known locations across the Arctic charr genome were determined for all progeny and parents using established genotyping methods (Moghadam ; Timusk ; Norman ).

Linkage mapping

The high quality SNPs selected for mapping, and the microsatellite marker anchors, were assessed for genetic linkage using LINKMFEX (Danzmann 2016). The SNPs were split into three categories: heterozygous male, heterozygous female, and double heterozygote (DH) SNPs where both parents were heterozygotes. The SNPs where only a single parent was heterozygous had high information content, given that all progeny genotypes are informative for linkage mapping. Double heterozygous SNP markers are informative in only about half of the progeny, given that linkage phases cannot be assigned in heterozygous progeny. Markers that are heterozygous in only one parent are problematic in that SNP mapping locations cannot be compared between parents. Data from additional mapping panels will be required to compare map orders between the sexes for these markers. Double heterozygous SNP markers were assigned to specific linkage groups, but were not added to specific map locations. Linkage groups were identified using a logarithm of odds (LOD) threshold of 10 for linkage group assessment. We first created male-parent- and female-parent-specific linkage maps using the SNPs that were heterozygous in only one parent using LINKMFEX. In order to integrate the male and female linkage maps with one another, SNPs where both parents were heterozygous were added to each of the two datasets, and analyzed in LINKMFEX for genetic linkage with a LOD=10 threshold. It was then possible to identify overlapping sets of male- and female-specific linkage groups. To determine the location of unlinked markers, repeated analysis was performed using descending LOD scores (LOD = 6 down to LOD = 3) using LINKMFEX, and LOD = 3 additions were accepted if they created joinings of two or more linkage groups that are known to be homeologous to one another. For some markers, a pseudolinkage (see below) of known homeologous chromosome arms was detected within the range of LOD = 3.0 to LOD = 5.0, and were therefore accepted. Linkage groups were named according to historical designations based on previous microsatellite marker assignments (Woram ; Norman ; Timusk ). For each linkage group, marker order was determined in OneMap using the record algorithm (Margarido ). The record algorithm was selected for ordering because it consistently gave the shortest map distances [i.e., smallest number of adjacent double cross-over (DCO) alignment points] out of the three possible OneMap ordering functions (ug, rcd, and record). This option also gave shorter map lengths than those produced by LINKMFEX. Male and female marker orderings were determined separately for each linkage group. As mentioned above, markers heterozygous in both parents were not ordered. OneMap marker ordering was further refined using the LINKMFEX program Adjacent-DCO-Count_Ripple-Check. This program was used to identify and reorder marker placements causing adjacent DCOs in the ordering. Adjacent double cross-overs are biologically unlikely in salmonids, given the high levels of chromatid interference detected during meiosis (Sakamoto ; Danzmann and Gharbi 2001; Allendorf ) in these species. The revised marker ordering minimized the number of adjacent DCOs in the dataset. Final map distances were calculated using the MAPDIS-V program in LINKMFEX, and selecting the option to ignore adjacent DCO events. We chose this option, as we considered that remaining adjacent DCOs may be due to errors in genotyping calls.

sdY marker

The progeny were genotyped for the sexually dimorphic gene (sdY) located on the Y-chromosome, using the PCR and agarose gel visualization methods described in Yano with two modifications. First, we substituted insulin-like growth factor binding protein 5 (IGFBP5) as a positive control, using primers we developed: DQ206713-F3 (CCACCAGCTAATTACTGCAA) and DQ206713-R3 (GTAGAATTTGGCTGGCCCTA). Second, the following PCR temperature cycling conditions was used: denaturation for 5 min at 95°, followed by five cycles of 95° for 1 min, 58° for 30 sec, and 72° for 30 sec, then 30 cycles of 95° for 30 sec, 58° for 30 sec, and 72° for 30 sec, followed by a final 10 min at 72°. The sdY marker was validated based on conformity of the sdY genotypes to phenotypic assessment of the individual’s sex in this population.

Comparison with rainbow trout and Atlantic salmon genomes

SNP sequences were compared to the rainbow trout draft genome, and the Atlantic salmon genome (Berthelot ; Lien ), using the following BLASTn parameter settings (-word_size 11 -gapopen 5 -evalue 0.00001 -gapextend 2 -reward 2 -penalty -3) (Altschul ). Blast hits were filtered, and the hits with the lowest e-value were used for subsequent homology identification. In the case of equivalent e-values, all Blast hits were retained, and considered equal “top hits” for the given SNP.

Moveable genetic elements and homeologies

Following Blast comparison of the Arctic charr markers to the rainbow trout and Atlantic salmon genomes, SNPs that aligned to adjacent regions of a single rainbow trout, and Atlantic salmon chromosome arm, allowed for the identification of arm homologies across the species. Within the SNP clusters that displayed consistent homology to a given Atlantic salmon, or rainbow trout, chromosome arm, individual markers sometimes showed homologies to disparate regions of the genome. Two hypotheses about the cause of these disparate SNPs were tested: (1) the SNPs lie within moveable, and/or highly repetitive DNA sequences; (2) the SNPs may be aligning to a homeologous chromosome arm with highly similar sequences. To test for the existence of repetitive sequences and moveable DNA elements, BLASTn (same parameters as above) was used to compare the linkage map SNPs to Repbase Update’s database of known vertebrate moveable and repetitive DNA elements (Jurka ). The distribution of TE Blast hits in the linkage map was used to assess TE activity in the Arctic charr genome. TE activity in each linkage group was determined by assessing the proportion of markers in each linkage group that displayed significant Blast alignments to TEs.

Genomic architecture and residual tetrasomy

Paralogous sequence variants (PSVs) were identified to assess the distribution of duplicate loci through the Arctic charr genome. To do this, fixed heterozygote SNPs were used as PSV markers. These SNPs were heterozygous in both parents, and 100% of the progeny. The lack of homozygous progeny suggests that these are duplicate, monomorphic, loci with a single base-pair difference, causing them to appear heterozygous in all individuals. Since these PSVs lack recombinant progeny, their location relative to the linkage map SNPs cannot be determined. However, their linkage group affinities can be inferred based upon comparative homologies. We tentatively assigned these PSVs to Arctic charr linkage groups by aligning the PSVs and linkage map SNPs to the Atlantic salmon genome using BLASTn, and comparing their top hit locations. Two characteristics were assessed: (1) the distribution of PSVs between, and within, chromosomes of different genomic architectures, including Acrocentric Homeolog Pairs (AHPs), and High Residual Tetrasomy Arms (HRTAs); (2) the distribution of PSVs along chromosome arms relative to the centromere or telomere (see Figure 1). AHPs were defined as homeologous pairs of Arctic charr chromosome arms where neither arm in the pair is fused with another chromosome arm. HRTAs are defined as pairs of homeologous chromosomes that have higher levels of duplicate loci than the rest of the genome in multiple salmonid species (Danzmann ; Lien , 2016; Brieuc ), and likely form multivalents during meiosis due to crossing-over between their homeologous arms (Sakamoto )
Figure 1

Visual representation of how Atlantic salmon chromosome arms were divided into quarters to assess Arctic charr PSV and map marker BLASTn hit distributions. Circles represent centromeres.

Visual representation of how Atlantic salmon chromosome arms were divided into quarters to assess Arctic charr PSV and map marker BLASTn hit distributions. Circles represent centromeres. The distribution of duplicate loci in the genome was assessed using the Blast TopHit dataset, which consisted of markers in the Arctic charr linkage map and PSVs. The linkage map markers represent diploid loci, and the PSVs represent tetraploid loci. These markers were assigned to Atlantic salmon chromosome arm based on their top BLASTn hit locations (determined by lowest observed e-value). For markers with BLASTn hits of equal e-values to a single chromosome arm, only a single hit per chromosome arm was retained in the dataset. In the case of markers (both PSVs and linkage map markers) with equal BLASTn hit locations on two Atlantic salmon chromosome arms, the top Blast hit to each chromosome arm was retained in the dataset. Markers with equal top Blast hits to three or more chromosome arms, and markers with no Blast hits in the Atlantic salmon genome, were removed from the dataset. To further assess whether any of the apparent single copy SNP markers may be duplicate copies of one another, we performed a BLASTn analysis of all SNPs against all SNPs (non-PSVs) in the database (see parameter settings above). Duplicate pairs exceeding 95% identity, and a 95% overlap in length, and occurring on separate chromosome arms, were considered potential homeologs of one another. Duplicates mapping to the same linkage groups were considered regional marker duplicates, unless they mapped adjacent to one another, indicating some type of tandem duplication. We tested if duplicate loci are preserved in a higher frequency on HRTAs. Using Atlantic salmon as a reference, the HRTA homeolog pairs are represented by: Ssa02p/Ssa05q, Ssa11qa/Ssa26, Ssa16qa/Ssa17qa, Ssa03q/Ssa06p, Ssa12qa/Ssa02q, Ssa07q/Ssa17qb, and Ssa04p/Ssa08q. Using a contingency chi-square test, the number of linkage map and PSV top BLASTn hits on the HRTAs was compared to the number of hits on all other chromosome arms. An additional contingency chi-square test was performed to test if Arctic charr AHPs preserve duplicate loci in the same manner as the rest of the genome. Putative AHPs were identified based on homologies in the Arctic charr genome, and their respective Atlantic salmon (Lien ), and rainbow trout (Danzmann ; Berthelot ), homeologies. Based on this information, the following AHPs were identified in Arctic charr: AC02/AC36, AC05/AC29, AC19/AC32, and AC30/AC31. A contingency chi-square test compared the number of linkage map and PSV top BLASTn hits on acrocentric homeolog chromosome arms to the number of hits on all other chromosome arms. The base pair hit location (s.start) for each marker in the Blast TopHit dataset was used to assess the PSV and linkage map markers’ distributions along Atlantic salmon chromosome arms. Each Atlantic salmon chromosome arm was divided equally into four quarters based on the known base-pair start and end locations (Lien ). Quarter 1 was closest to the centromere, and quarter 4 was telomeric (Figure 1). All marker hits were then assigned to a chromosome arm quarter, and data from all the chromosome arms were merged. A chi-square test (goodness-of-fit) was performed to see if the frequency of linkage map SNPs and PSVs varied across different chromosome arm quarters. In addition to the above tests, BLASTn was used to compare the PSVs to Repbase update’s list of vertebrate TEs. The number of TE hits in PSVs was then compared to the number of TE hits in the linkage map SNPs using a contingency chi-square test. We tested if PSVs in Arctic charr aligned more toward the telomeres of Atlantic salmon chromosome arms compared to linkage map SNPs. Tetrasomy is more readily preserved near the telomeres of chromosomes and most cross-overs in multivalent chromosome formations occur toward the telomeres (Sakamoto ). Therefore, residual tetrasomic inheritance would likely persist in these regions for a longer period of time following the Ss4R WGD, and highly similar duplicate loci are expected to be found in high frequency closer to the telomeres of chromosome arms (Brieuc ; Allendorf ).

Data availability

Details on data used in this study can be found in Materials and Methods under the Sequencing Analysis section, and in the supplementary files.

Results and Discussion

SNP identification and linkage mapping

GBS of the Arctic charr family produced 4 × 108 sequence reads (roughly 4.7 million reads per progeny). Using the UNEAK pipeline (Tassel 3.0), 19,418 SNPs were then identified. Following the SNP filtering process, and the addition of microsatellite markers, the mapping dataset consisted of 4536 markers (see File S1 for the DNA sequence corresponding to each SNP marker). The linkage mapping process produced 39 linkage groups containing 4508 markers (4405 SNPs, 1 sdY, and 102 SSR), while 28 markers (24 SNP and 4 SSR) remained unlinked. Separate male and female maps were produced (File S2). A total of 1538 markers was ordered in the male map, spanning a distance of 2808.5 cM, while 1709 markers were ordered in the female map, covering 4302.7 cM (Figure 2 and Table 1). Markers that were heterozygous in both parents were not ordered, as mentioned previously, but 1283 of these were assigned to linkage groups (linkage group assignments are found in File S2).
Figure 2

Visual representation of Arctic charr linkage groups. Female linkage groups are shown in red, and male linkage groups are shown in blue. Each point along the length of the line represents a single marker, or zero recombination cluster of several markers.

Table 1

List of Arctic charr linkage groups; the number of markers in each linkage group and the map distance covered (centiMorgans) by each male and female linkage group

Linkage GroupMale Marker NumberMale Distance (cM)Female Marker NumberFemale Distance (cM)Unordered MarkersChromosome Type
AC01/21116161.194268.192AC01 is M/AC21 is A
AC025275.23992.947A
AC0355108.22475.234M
AC04p3956.441102.322A/split M
AC04q55101.1132321.143Fused A/split M
AC051242.3228085A
AC0674150.5102224.733M
AC072334.12445.829A
AC083865.846162.324M
AC09410.5918.876A
AC10501071556.413A
AC11528.2841.110A
AC121222.32494.110A
AC1373137.655172.918M
AC147410087172.947M
AC154989.446152.976M
AC1651102.365118.827A
AC1746121.14068.219Fused A
AC186810062181.134M
AC19957.61748.278A
AC20a3754.137108.27A
AC20b4776.4128272.938M
AC223175.23138.820A
AC234574.13884.725A
AC243550.533876A
AC253151.72451.718A
AC265088.243102.357A
AC27348761201.134M
AC281623.548142.39A
AC292841.11755.237A
AC302736.42635.23A
AC313572.94090.533A
AC3237105.867135.230A
AC334874.14464.762A
AC342230.51881.18A
AC353651.72475.229A
AC364777.6328030A
AC3727674698.820A
Totals15382808.517094302.71283aMetacentric: 10b
Acrocentric: 27–29
Split meta: 1c

The chromosome type [metacentric (M) or acrocentric (A)] is also shown for each linkage group.

Note that there are 4508 markers in the linkage map, but the total marker numbers here sum to 4530. This is because a small number of markers were successfully ordered in both the male and female maps, and are therefore counted twice in this row.

10 metacentric assumes AC20b is metacentric in structure, and in karyotypes where AC04p/q are joined, 11 metacentrics would be observed.

27 acrocentrics would be observed if AC20b is metacentric, and AC04 was metacentric in the karyotype, while 29 acrocentrics would be present in the configuration where AC04p and AC04q form separate arms. “Fused A” designations indicate acrocentric arms that appear to be composed to two ancestral teleost chromosome arms.

Visual representation of Arctic charr linkage groups. Female linkage groups are shown in red, and male linkage groups are shown in blue. Each point along the length of the line represents a single marker, or zero recombination cluster of several markers. The chromosome type [metacentric (M) or acrocentric (A)] is also shown for each linkage group. Note that there are 4508 markers in the linkage map, but the total marker numbers here sum to 4530. This is because a small number of markers were successfully ordered in both the male and female maps, and are therefore counted twice in this row. 10 metacentric assumes AC20b is metacentric in structure, and in karyotypes where AC04p/q are joined, 11 metacentrics would be observed. 27 acrocentrics would be observed if AC20b is metacentric, and AC04 was metacentric in the karyotype, while 29 acrocentrics would be present in the configuration where AC04p and AC04q form separate arms. “Fused A” designations indicate acrocentric arms that appear to be composed to two ancestral teleost chromosome arms.

Microsatellite anchors

Of the 102 microsatellite markers genotyped, 98 were successfully added to the linkage map (four remained unlinked). Five markers were duplicated and mapped to two linkage groups. Of 39 linkage groups, 36 contained one or more microsatellite markers, allowing the new SNP-based linkage groups to be aligned with the microsatellite-based linkage maps (Woram ; Timusk ; Norman ). The identity of three linkage groups without microsatellite markers (AC25, AC30, and AC31) was determined based on BLASTn search results, and previously identified arm homologies among the salmonid species compared.

Chromosome type and salmonid arm homologies

A comparison of the Arctic charr linkage map to the Atlantic salmon (Lien ) and rainbow trout genomes (Berthelot ) based on the BLASTn analysis is presented in Table 2. The top Blast hit data for each marker are available in File S2 and File S3. Nine linkage groups (AC01, -03, -06, -08, -13, -14, -15, -18, and -27) were identified as metacentric chromosomes based on homologies to two salmonid chromosome arms. Two potential split metacentric groupings were identified (AC04 and AC20). One set involves the sex-linkage group AC04, and has been identified as possessing a fusion polymorphism in Arctic charr (Moghadam ). Our data also suggest that a two acrocentric vs. one metacentric polymorphism exists for AC04 (see below). The q-arm in this chromosome set involves a fusion of two ancestral salmonid chromosome arms that are homologous to Omy02q and Omy25, while the p-arm is homologous to Omy24. None of these arms are homeologous to one another, and are therefore unlikely to show pseudolinkage affinities. The second set of chromosomes (AC20 group) involves a small (AC20a) and a large (AC20b) acrocentric arm, where the large arm appears to be composed of two fused ancestral salmonid arms that are homologous to Omy12p/q and Ots09/q. Therefore, it is more likely that AC20b represents an entire metacentric chromosome, and AC20a represents a separate acrocentric arm, rather than AC20a/20b representing a large metacentric chromosome comprised of joined homeologous arms (Woram ). This would support the suggestion that there are 10 stable metacentrics in North American Arctic charr (Hartley 1989; Phillips ), with the AC04 polymorphism generating an additional metacentric in some individuals. The homologous chromosome arm of AC20a in rainbow trout is Omy13q, and, since Omy12q/Omy13q are homeologs, they may show pseudolinkage affinities. Twenty-seven linkage groups appear to be acrocentric, including two (AC04 and AC17) that result from a tandem fusion of qa and qb chromosome arms. AC17 appears to include segments homologous to both Omy16q and 20q. The total chromosome arm number (NF) observed was 100, containing 52 haploid ancestral arm segments.
Table 2

Salmonid chromosome arm homologies referenced to Arctic charr linkage group homologies

Ssa Homeolog PairaAtlantic SalmonRainbow TroutbArctic CharrChinook SalmoncSsa Homeolog PairAtlantic SalmonRainbow TroutArctic CharrChinook Salmon
10qa/bSsa16qaOmy01p15AC26Ots06p27Ssa14qbOmy14pAC30Ots31
01qa/bSsa18qa14Omy01q14AC25Ots06q1409qbSsa05pOmy14qAC06qOts21
02p1Ssa05q1Omy02p1AC06pOts23119qaSsa298Omy15pAC27qOts298
16qa&23Ssa10qbOmy02q15AC04qbOts1907q7Ssa17qb7Omy15q7AC24Ots177
05q1Ssa02p1Omy03p1AC35Ots03p128&29SSa19qb8Omy16p16AC18pOts248
21Ssa25Omy03q9AC02Ots03q15qbSsa13qa1Omy16q1AC17qaOts221
10qa/bSsa23Omy04p10AC13pOts01p22Ssa12qbOmy17p11AC01pOts02p
15qaSsa06qOmy04qAC14qOts1812qa6Ssa02q6Omy17q6AC01qOts02q6
11qb&13qbSsa01qbOmy05pAC29Ots2017qa3Ssa16qb3Omy18p3AC27pOts14p3
23Ssa10qaOmy05q10AC16Ots05q14qbSsa27Omy18qAC31Ots13p
20qaSsa24Omy06pAC15pOts04p08q/04p4Ssa04p/08q4Omy19p4AC34Ots11p4
11qa2Ssa262Omy06q2AC15qOts04q209qaSsa01pOmy19qAC09Ots11q
16qb3Ssa17qa3Omy07p3AC12Ots07p305p/19qa/b&01qaSsa09qb/28Omy20p16AC08qOts25
12qbSsa22Omy07q11AC11Ots07q29Ssa19qaOmy20qAC17qbOts25
06qSsa15qaOmy08pAC28Ots05p18qbSsa07p7Omy21p7AC03pOts15p7
03pSsa14qaOmy08q12AC32Ots10q17qb7Ssa07qOmy21q13AC03qOts15q
07pSsa18qbOmy09p13AC37Ots10p25Ssa21Omy22p9AC36Ots26
13qaSsa15qb1Omy09q1AC07Ots16q1Omy22q
13qbSsa04qOmy10pAC23Ots3028&29Ssa01qa14Omy2314AC18qOts01q14
04p/08q4Ssa08q/04p4Omy10q4AC13qOts34420qbSsa09qcOmy24AC04pdOts14q
24/2920qa/19qaOmy11pAC14pOts16p/12p01pSsa09qaOmy25AC04qaOts08p
24Ssa20qaOmy11qAC33Ots12p262Ssa11qa2Omy262AC10Ots12q2
04qSsa13qbOmy12pAC20b-2Ots09p09qcSsa20qbOmy27AC22Ots13q
06q/01qb&04q503q/13qb5Omy12q5AC20b-1Ots09q514qaSsa03pOmy2812AC19Ots28
02q6Ssa12qa6Omy13p6AC21Ots32605p/19qa/b&01qaSsa09qb/28Omy29Ac08pOts08q
03q5Ssa06p5Omy13q5AC20aOts27501qb&4pSsa11qbSexAC05Ots33

These were determined based on the most common BLASTn hit locations of a linkage group’s markers when compared to the Atlantic salmon genome and the rainbow trout draft genome. Additionally, homologies with Chinook salmon chromosome arms are presented based on known homologies in Atlantic salmon and rainbow trout, though direct BLASTn comparison of the Arctic charr linkage map and Chinook salmon genome was not performed. The column “Ssa Homeolog pair” shows each chromosome arm’s homeolog partner derived from a common pre-Ss4R ancestor.

Cells in columns 1 and 6 with matching superscript numbers represent HRTA identified in Atlantic Salmon [data from Lien ].

Homeolog pairs identified in rainbow trout based on high numbers of duplicate markers have matching superscript numbers in columns 3 and 8 [data from Danzmann and Berthelot ].

Homeolog pairs identified in Chinook salmon based on high numbers of duplicate markers have matching superscript numbers in columns 5 and 10 [data from Brieuc ].

Indicates sex the linkage group of Arctic charr.

These were determined based on the most common BLASTn hit locations of a linkage group’s markers when compared to the Atlantic salmon genome and the rainbow trout draft genome. Additionally, homologies with Chinook salmon chromosome arms are presented based on known homologies in Atlantic salmon and rainbow trout, though direct BLASTn comparison of the Arctic charr linkage map and Chinook salmon genome was not performed. The column “Ssa Homeolog pair” shows each chromosome arm’s homeolog partner derived from a common pre-Ss4R ancestor. Cells in columns 1 and 6 with matching superscript numbers represent HRTA identified in Atlantic Salmon [data from Lien ]. Homeolog pairs identified in rainbow trout based on high numbers of duplicate markers have matching superscript numbers in columns 3 and 8 [data from Danzmann and Berthelot ]. Homeolog pairs identified in Chinook salmon based on high numbers of duplicate markers have matching superscript numbers in columns 5 and 10 [data from Brieuc ]. Indicates sex the linkage group of Arctic charr.

Sex-linkage group polymorphism in AC04

Previous work has shown AC04 to be polymorphic, and taking the form of either a single linkage group (type 1) or two unlinked linkage groups (type 2) (Moghadam ). The mapping parents in this study are both type 2 individuals; microsatellite markers associated with AC04 are found on both AC04p and AC04q (Woram ; Moghadam ; Timusk ). Previous research identified three salmonid chromosome arms homologous with AC04 (Timusk ). Two of these arms are homologous with AC04q, and the third is homologous with AC04p (Table 2). AC04p also contains the sex-determining gene, sdY (Yano ). Although two separate, type 2 AC04 linkage groups are observed in the current linkage map, the polymorphic nature of this linkage group, and the evidence presented here, is indicative of a fusion/fission polymorphism for the sex-determining chromosome of Arctic charr. Affinity of the sdY marker to the Slml-family of TEs supports the suggestion that the reported sex-linkage difference between North American (AC04) and European (AC01/21) Arctic charr may be the result of a translocation through TE movements (Woram ; Küttner ).

Pseudolinkage

Two pairs of homeologous linkage groups (AC01q/21 and AC13q/34) were detected as possessing a weak pseudolinkage to one another (LOD ≥ 3–5). Interestingly, pseudolinkage was detected between both homeologous pairs of linkage groups in both the male and female parents (Table 3). Previously, pseudolinkage was thought to occur only within male meiosis, with rare reports of female pseudolinkage (Ostberg ; Allendorf ). Our identification of pseudolinkage in a female confirms that multivalents are also likely formed during female meioses.
Table 3

Allele counts of the two homeolog pairs displaying pseudolinkage within females

Marker PairGenotypesLinkage GroupParental Phases (Marker A/Marker B)Recombinant Phases (Marker A/Marker B)Chi-SquaredP-value
Marker AMarker BMarker AMarker BMarker AMarker BAllelesCountAllelesCount
TP47181TP21253G,CA,GAC13PseudolinkG/A32C/A2318.73.20E−04
C/G25G/G5
TP21253TP30908A,GT,CPseudolinkAC34A/T43A/C12367.50E−08
G/C23G/T7
TP47181TP30908G,CT,CAC13AC34G/T23C/T274.20.243
C/C21G/C14
TP10591TP15996T,GG,AAC21PseudolinkT/G14G/G415.80.0013
G/A13T/A1
TP15996TP32826G,AA,TPseudolinkAC01G/A18A/A0333.20E−07
A/T14G/T0
TP10591TP32826T,GA,TAC21AC01T/A22G/A237.90.049
G/T27T/T10

These instances appear to result from an excess of parental phase genotypes. Flanking markers from both linkage groups with the most complete genotypes, along with the principle marker causing pseudolinkage, are displayed. Note, for AC21/AC01, the marker causing pseudolinkage (TP15996) was heterozygous in both parents (ab X ab cross). Therefore, the phases for half of the progeny could not be ascertained. Chi-squared goodness of fit tests were performed for each pair of alleles, comparing the observed genotype frequencies to a null hypothesis of a 1:1:1:1 genotype distribution.

These instances appear to result from an excess of parental phase genotypes. Flanking markers from both linkage groups with the most complete genotypes, along with the principle marker causing pseudolinkage, are displayed. Note, for AC21/AC01, the marker causing pseudolinkage (TP15996) was heterozygous in both parents (ab X ab cross). Therefore, the phases for half of the progeny could not be ascertained. Chi-squared goodness of fit tests were performed for each pair of alleles, comparing the observed genotype frequencies to a null hypothesis of a 1:1:1:1 genotype distribution. Pseudolinkage is a phenomenon arising due to segregation of gametes following preferential pairing of homeologous chromosome arms during meiosis I (Allendorf ; May and Delany 2015). Preferential as opposed to random pairing typically occurs in cases of hybridization, where the homeolog pairs provided by one parent may be more closely related to one another due to their species-specific ancestry. Therefore, within the hybrid, the homeologs from one parent may therefore be more likely to pair with one another and recombine. Crossing-over tends not to occur between homeologous markers located very close to the centromere. If preferential pairing occurs at meiosis I, this leads to a significant excess of nonparental genotypes being produced following meiosis II, because alternate disjunction occurs as the multivalents separate at meiosis I [see Allendorf and May and Delany (2015) for a more detailed explanation of these models]. This produces a statistical linkage between the two homeologous chromosome arms characterized by a significant excess of nonparental gametes, and is a characteristic feature of hybrid salmonids. Random pairing of homeologous pairs may, however, still occur as the genome undergoes the process of diploidization within species. This may involve the random formation of both bivalents, and multivalents, between the homeologous pairs, such that gametic expectation models cannot be precisely defined. This may result in varying levels of exchange among alleles for loci that are located proximal and distal to chiasmata junctions along the length of randomly paired homeologous chromosomes [see Sakamoto , for an explanation]. We have developed models to explain this weaker form of pseudolinkage, and will present these in a subsequent publication.

Homeologies

Arm homologies identified in Atlantic salmon, rainbow trout, and chinook salmon (Table 2) with Arctic charr, were used to identify homeologous arm pairings within Arctic charr (Table 4). Lower levels of duplicate marker regions were detected for certain arms, but can nonetheless be inferred based upon the recent extensive survey of duplicate gene copies in Atlantic salmon (Hermansen ) (Table 4). All seven homeologous pairs in Arctic charr identified as HRTAs in other species contain one chromosome arm that has undergone a chromosome fusion. One pair of HRTA identified in Atlantic salmon (Ssa09qc/20qb) (Lien ) did not show a high number of duplicates in either rainbow trout or chinook salmon. Similarly a HRTA region in both rainbow trout and chinook salmon (Omy01q and Ots06q/Omy23 and Ots01q), was not identified as an HRTA in Atlantic salmon (Table 2). Four AHPs, where neither homeolog has undergone a fusion event since the Ss4R, were identified in Arctic charr (Table 5). AHP likely undergo minimal residual tetrasomic inheritance, given that AHP multivalent pairings are unlikely due to structural instability (May and Delany 2015). We found support for this prediction in that the number of PSVs aligning to AHPs was significantly lower compared to the number aligning to the other chromosome arms (P < 0.0001). Of the linkage map markers that successfully aligned to the Atlantic salmon genome, 20% (861/4290) aligned to HRTA chromosome arms, while only 14.0% of PSVs (158/1130) had top hit locations on AHPs. These results suggest that fewer duplicate loci exist on AHPs, which might be indicative of more rapid diploidization relative to the rest of the genome (Figure 3).
Table 4

Reference table of homeologous chromosome pairs in Arctic charr

Homeolog 1Homeolog 2
AC01pAC11
AC01qAC21
AC02AC36
AC03pAC24
AC03qAC37
AC04pAC14p/AC33a
AC04pAC22
AC04qaAC09
AC04qbAC26
AC06AC08p/AC08q
AC06pAC35
AC07AC17qa
AC08pAC18p?a
AC10AC15q
AC12AC27p
AC13pAC16
AC13qAC34
AC14qAC28a
AC15pAC14p/AC14q/AC33a
AC17qbAC27q
AC18pAC27q
AC18qAC25
AC19AC32
AC20b-1AC20a
AC23AC04qb/AC05/AC20b-1/AC-20b-2a
AC29AC05/AC20b-2a
AC30AC31

Identified through their homologies in Atlantic salmon, and homeologies identified in Hermansen .

Table 5

Reference table for the analysis of Arctic charr duplicate loci distribution

Atlantic Salmon Chromosome ArmArctic Charr Linkage GroupHigh Residual Tetrasomy Arms (HRTA)Acrocentric Homeolog Pairs (AHP)Atlantic Salmon Chromosome ArmArctic Charr Linkage GroupHigh Residual Tetrasomy Arms (HRTA)Acrocentric Homeolog Pairs (AHP)
Ssa01pAC09Ssa13qaAC17qa
Ssa01qaAC18qSsa13qbAC20b-2
Ssa01qbAC29AHPSsa14qaAC32AHP
Ssa02pAC35HRTASsa14qbAC30AHP
Ssa02qAC01qHRTASsa15qaAC28
Ssa03pAC19AHPSsa15qbAC07
Ssa03qAC20b-1/AC20b-2HRTASsa16qaAC26
Ssa04pAC13q/AC-34HRTASsa16qbAC27pHRTA
Ssa04qAC23Ssa17qaAC12HRTA
Ssa05pAC06qSsa17qbAC24HRTA
Ssa05qAC06pHRTASsa18qaAC25
Ssa06pAC20aHRTASsa18qbAC37
Ssa06qAC14qSsa19qaAC17qb/AC14p
Ssa07pAC03pSsa19qbAC18p
Ssa07qAC03qHRTASsa20qaAC14p/AC33
Ssa08qAC13q/AC34HRTASsa20qbAC22
Ssa09qaAC04qaSsa21AC36AHP
Ssa09qbAC08p/AC08qSsa22AC11
Ssa09qcAC04pSsa23AC13p
Ssa10qaAC16Ssa24AC15p
Ssa10qbAC04qbSsa25AC02AHP
Ssa11qaAC10HRTASsa26AC15qHRTA
Ssa11qbAC05AHPSsa27AC31AHP
Ssa12qaAC21HRTASsa28AC08p/AC08q
Ssa12qbAC01pSsa29AC27q

The Atlantic salmon chromosome arms are listed alongside their Arctic charr homologs. The table lists all chromosome arms classified as belonging to the HRTA or AHP categories.

Figure 3

BLASTn hit locations in the Atlantic salmon genome for Arctic charr PSVs, and linkage map markers; 40.8% (462/1130) of PSVs had their top hit locations on HRTAs, while only 22.1% (950/4290) of the linkage map SNPs had their top Blast hit locations on HRTAs.

Identified through their homologies in Atlantic salmon, and homeologies identified in Hermansen . The Atlantic salmon chromosome arms are listed alongside their Arctic charr homologs. The table lists all chromosome arms classified as belonging to the HRTA or AHP categories. BLASTn hit locations in the Atlantic salmon genome for Arctic charr PSVs, and linkage map markers; 40.8% (462/1130) of PSVs had their top hit locations on HRTAs, while only 22.1% (950/4290) of the linkage map SNPs had their top Blast hit locations on HRTAs.

Duplicate loci

The distribution of PSVs throughout the Arctic charr genome appeared to be nonrandom, with certain regions preserving higher numbers of duplicate loci. BLASTn alignment of PSVs, and linkage map SNPs to the Atlantic salmon genome, was performed to compare the locations of linkage map markers and PSVs. The top BLASTn hit locations (based on lowest e-values) were used to assess the distribution of SNPs throughout the genome. Direct comparison of the markers through linkage mapping was not possible, given that PSVs lack any type of segregation pattern, and therefore cannot be mapped. For markers with BLASTn hits of equal e-values on a single chromosome arm, only a single hit was used per chromosome arm in the dataset. In the case of markers (both linkage map markers and PSVs) with BLASTn hit locations of equal e-values on two Atlantic salmon chromosomes, a BLASTn hit to each chromosome arm was retained in the dataset, as these markers represented potential duplicate loci. Markers with equal BLASTn hits to three or more locations were excluded from the dataset. The analysis identified 1130 PSV Blast hits in the Atlantic salmon genome (Figure 4), in addition to the 4429 SNPs assigned to the linkage map. If the PSV distribution throughout the genome is random, and chromosome arm size was the only factor influencing their distribution, then the proportion of PSVs aligning to each chromosome would be similar to the proportion of linkage map SNPs aligning to each chromosome arm. To assess the effect of genomic architecture on the preservation of duplicate loci, chromosome arms were binned into two categories: HRTAs (listed in Table 5) and non-HRTA (other) salmonid chromosome arms. A contingency test showed higher numbers of PSVs aligned to the HRTAs than expected (P < 0.0001) (Figure 3). Of the 14 chromosome arms in the HRTA category, 12 had higher proportions of PSV Blast hits than linkage map marker Blast hits (Figure 4). The two HRTA arms with lower numbers of PSV BLASTn hits (Ssa04p and Ssa17qb) had homeologs with high numbers of PSV hits (Ssa08q and Ssa07q, respectively). This suggests that the lower number of PSV hits on the HRTA arms may be due to more PSVs aligning to their homeologs because of small sequence differences. High numbers of PSVs are observed in regions that preserved residual tetrasomy longer after the Ss4R than other parts of the genome, which may be indicative of historically reduced diploidization rates (Brieuc ).
Figure 4

The top BLASTn hit locations of Arctic charr linkage map SNPs, and Arctic charr PSVs, across Atlantic salmon chromosome arms. The data are shown in the proportion of hits from a category (Map SNPs or PSVs) in order to account for the bias of Atlantic salmon chromosome size. Arms with a * are HRTAs identified in Lien .

The top BLASTn hit locations of Arctic charr linkage map SNPs, and Arctic charr PSVs, across Atlantic salmon chromosome arms. The data are shown in the proportion of hits from a category (Map SNPs or PSVs) in order to account for the bias of Atlantic salmon chromosome size. Arms with a * are HRTAs identified in Lien . Several other Atlantic salmon arms displayed high numbers of PSV BLASTn hits, notably Ssa09qb, Ssa09qc, Ssa10qb, and Ssa22. The Arctic charr homologs of these arms are AC08p/q, AC04p, AC04qb, and AC11, respectively. All of these arms, or their homeologs in Arctic charr, have undergone a fusion. Interestingly, these arms also share homology to rainbow trout chromosome arms with high numbers of duplicates (Table 2). Previously, it had been thought that one metacentric chromosome must be present in a homeolog pair to provide the stability necessary for homeologous pairing, and multivalent formation (Kodama ). However, Lien have recently shown that metacentric structures are not a requirement for homeologous pairing, given that two HRTA pairs in Atlantic salmon (Ssa11qa/Ssa26 and Ssa16qb/Ssa17qa) preserve high sequence similarity without the presence of a metacentric, but, in both cases, one of the arms has undergone a fusion event. This suggests that fused acrocentrics, as well as metacentric chromosomes, provide the structural stability necessary for homeologous recombination. The conservation of duplicate loci on HRTAs does not appear to be due to current chromosome structure, because the HRTAs do not display homologous chromosome arm fusions across species (with the exception of AC03/Ssa07/Omy21/Ots15). Therefore, the slower diploidization in these regions may be attributed to some aspect of their evolutionary past. For instance, certain fusions may have arisen in the common ancestor of all these salmonid lineages following the Ss4R. This could have provided the HRTA homeolog pairs with the ability to form multivalents during meiosis, and undergo residual tetrasomy, thereby slowing diploidization rates. A large number of species-specific fusion/fission events since the more recent divergence of salmonids (Macqueen and Johnston 2014) could disjoin previous chromosome structures, and also explain why the seven HRTA homeolog pairs preserve duplicate loci in multiple species, despite the large variation in karyotypic structures across current salmonid species. To search for possible duplicate SNP positions that could have arisen from the Ss4R event, we reciprocally BLASTn aligned all linkage map SNP markers against each other, and retained those duplicate pairs that shared ≥95% identity to one another, as well as retaining ≥95% of an overlap in their length distributions. Surprisingly, of the 362 duplicate pairs identified, only 18 pairs were interchromosomal duplicate pairs. Of these 18 pairs, only three were considered to be possible WGD paralogs, as the other 15 pairs involved either one or both SNPs with TE-specific sequence (see File S4). One of the potential paralogs that was not associated with TE, involved markers from identified homeolog pair of AC03q/37. Two of the potential paralogs not associated with TEs involved markers from the linkage groups AC06p and AC32, which is not an identified homeolog region in the Arctic charr map. However, these two paralog sets show homology to the Ssa02p/05q paralogs, and therefore may indicate a small region of previously unidentified homeology. The vast majority of duplicate pair markers appear to be intrachromosomal duplicates (95.1%) (File S4). To assess whether chromosome structure may have had an influence on the distribution of these duplicates, we compared the proportion of duplicated SNP markers between metacentric vs. acrocentric type linkage groups. Duplicates within AC04p and AC04q were included in the metacentric grouping, as well as duplicates within AC17, given that this acrocentric appears to be composed of fused chromosome arms. No differences were detected in the distribution of duplicates between the chromosome types (P > 0.05) (Table 6).
Table 6

Arctic charr linkage map location of duplicate pair SNPs that shared ≥95% identity with one another

Linkage Group (LG)Inter-LG Pair MembersaIntra-LG DuplicatesProportion Marker DuplicatesbLinkage Group (LG)Inter-LG Pair MembersIntra-LG DuplicatesProportion Marker Duplicates
AC014300.192AC20a2120.15
AC020200.149AC20b0280.136
AC033160.149AC210180.125
AC04p1220.224AC220100.125
AC04q3300.133AC231220.206
AC050160.136AC24080.114
AC064260.129AC252160.219
AC07080.108AC263320.219
AC081120.115AC270240.192
AC090120.136AC28060.086
AC10060.08AC291160.2
AC11000AC30180.143
AC12020.044AC310220.204
AC130300.211AC323160.121
AC140200.099AC331300.2
AC150200.12AC34020.043
AC163260.183AC350220.256
AC170160.16AC36080.075
AC180280.175AC373300.133
AC190240.233

Number of SNPs where the duplicate member maps to a different linkage group.

Number of duplicates/total number of SNPs mapped to the LG.

Number of SNPs where the duplicate member maps to a different linkage group. Number of duplicates/total number of SNPs mapped to the LG. Duplicate marker pairs do not map next to one another within linkage groups, suggesting that they are not tandem repeats. These marker duplicates may be part of larger segmental duplicate blocks that can vary in size from 1 to 400 kb (Mendivil Ramos and Ferrier 2012), but alignment of these regions to more complete scaffold contigs would be needed to test this idea. The reason so few apparent WGD paralog regions were detected is likely due to the high stringency in which duplicated regions were identified. Interchromosomal 4R paralogs have average identity levels ranging from 86 to 90% (Moghadam ; Berthelot ), with average levels up to 96% for protein-coding duplicate regions (Berthelot ). Hence, an analysis using lowered stringency cut-offs will likely reveal higher frequencies of WGD paralog regions than those reported here. Many of the duplicate markers appear to align to identical locations within both the rainbow trout and Atlantic salmon genomes, but in opposite strand orientations (see File S4). Upon closer inspection of these duplicates, we observed that several reads aligning to each contig cluster were longer in either the 5′ or 3′ direction, and these reads spanned an EcoT22I cut-site on either side. Both the 3′ side of the forward strand, and 5′ side of the reverse strand were bordered by the 5 bp signature of a DNA strand cut by EcoT22I. This suggests that an internal cut-site may have been skipped due to an epigenetic modification, and that an additional EcoT22I cut-site, 20–30 bp away, flanked the uncut site. We queried all 425,391,431 reads obtained from the fish used in this study, and determined that the presence of uncut EcoT22I sites averaged 2.8%, while those found in the paired duplicates having extended reads ranged from 4 to 6% supporting the suggestion that these regions may be more epigenetically modified. However, many of these reads also appeared as chimeric religations, highlighting that enzymes susceptible to epigenetic modification may lead to genotyping errors (Jiang ).

Distribution of Arctic charr duplicated markers in relation to the Atlantic salmon genome

BLASTn alignments of the Arctic charr duplicated markers were not uniformly distributed along the length of Atlantic salmon chromosomes from the Lien assembly. Significantly more Arctic charr PSVs aligned to the 4th quarter (telomeric ends) of chromosome arms (P < 0.0001) (Figure 5). This suggests that PSVs are preserved near the telomeres in the regions of chromosomes that undergo residual tetrasomy (Wright ; Allendorf ; May and Delany 2015). Previous studies in salmonids have also observed that duplicate loci are present in higher frequencies near the telomeres (Sakamoto ; Brieuc ; Kodama ; Larson ; McKinney ; Waples ). There was also a significant reduction of linkage map markers with top Blast hits in the 4th quarter of chromosomes, which suggests that residual tetrasomy may generate high numbers of duplicate loci that cause telomeric regions to be under-represented with markers (Allendorf ).
Figure 5

Distribution of the top Blast hits of Arctic charr linkage map SNPs and PSVs across Atlantic salmon chromosome arms. q1 is the quarter closest to the centromere, q4 is the quarter closest to the telomere.

Distribution of the top Blast hits of Arctic charr linkage map SNPs and PSVs across Atlantic salmon chromosome arms. q1 is the quarter closest to the centromere, q4 is the quarter closest to the telomere.

Transposon activity

Recent studies of salmonid genomes have observed lower instances of sequence homology to TEs within genomic regions characterized as preserving tetrasomy (McKinney ; Waples ; Lien ). We assessed TE distributions through BLASTn alignments to the Atlantic salmon genome to determine if chromosomes homologous to HRTA had lower levels of TE alignments. Of the 4405 linkage map SNPs, 1608 (36.5%) had homology with transposable elements in Repbase Update’s list of known vertebrate TEs (Jurka ). This was a lower proportion of TE hits than expected as repetitive elements comprise 58–60% of the Atlantic salmon genome, and suggests that our dataset underrepresents the proportion of repetitive elements in Arctic charr. We detected a small but significant reduction in TE activity between HRTAs (30.9% of SNPs on these arms had significant TE hits), and non-HRTA chromosome arms (35.8%) (P = 0.0009) (Table 7). Using the Atlantic salmon genome as a scaffold, we detected no significant difference in the frequency of TE hits between the telomeric and centromeric regions of chromosomes (P = 0.1322) (Figure 6).
Table 7

TE distribution of the SNPs aligning to the Atlantic salmon chromosome arms, and PSVs markers not found in the linkage map

No TE HitsTE Hits
Linkage groups containing HRTAs976436
Linkage Groups with no HRTAs25721436
Total35481872

There was a slight but significant reduction in TE activity seen on HRTA (30.9% of SNPs with significant TE hits) relative to all other chromosome arms (35.8% with significant TE hits) (P = 0.0009).

Figure 6

Proportion of Arctic charr Map SNPs and PSVs with TE hits across the length of chromosome arms. SNPs were grouped based on their alignment to Atlantic salmon chromosome arms. q1 is the quarter closest to the centromere, q4 is the quarter closest to the telomere. Atlantic salmon chromosome arms were grouped according to whether they were HRTAs (A) or non-HRTAs (B).

There was a slight but significant reduction in TE activity seen on HRTA (30.9% of SNPs with significant TE hits) relative to all other chromosome arms (35.8% with significant TE hits) (P = 0.0009). Proportion of Arctic charr Map SNPs and PSVs with TE hits across the length of chromosome arms. SNPs were grouped based on their alignment to Atlantic salmon chromosome arms. q1 is the quarter closest to the centromere, q4 is the quarter closest to the telomere. Atlantic salmon chromosome arms were grouped according to whether they were HRTAs (A) or non-HRTAs (B).

Conclusion

We have presented a SNP-based linkage map of the Arctic charr genome, which is comprised of 4508 markers spanning 39 linkage groups. The map was used to identify the chromosome type of each linkage group, and the homologous chromosome arms in other salmonid species. Using data from the Atlantic salmon genome, we have identified putative homeologous arm pairs in Arctic charr. Based on the distribution of PSV, we suggest that genomic architecture is influencing diploidization rate in the Arctic charr genome, with higher levels of duplicate loci being preserved on HRTAs and lower numbers of duplicate loci preserved on AHPs. Transposon activity was also quantified, but we failed to detect a strong influence of genomic architecture on TE distribution. Pseudolinkage was also detected in both the male and female parents, and this involved two HRTA homeolog pairs (AC01q/21 and AC13q/34). This map also characterized the genome of a salmonid species with a more basal karyotype, and shows how these differences in genomic architecture have influenced diploidization.

Supplementary Material

Supplemental material is available online at www.g3journal.org/lookup/suppl/doi:10.1534/g3.116.038026/-/DC1. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  50 in total

Review 1.  Chromosome evolution in the Salmonidae (Pisces): an update.

Authors:  R Phillips; P Ráb
Journal:  Biol Rev Camb Philos Soc       Date:  2001-02

Review 2.  Mobile elements: drivers of genome evolution.

Authors:  Haig H Kazazian
Journal:  Science       Date:  2004-03-12       Impact factor: 47.728

Review 3.  Repbase Update, a database of eukaryotic repetitive elements.

Authors:  J Jurka; V V Kapitonov; A Pavlicek; P Klonowski; O Kohany; J Walichiewicz
Journal:  Cytogenet Genome Res       Date:  2005       Impact factor: 1.636

4.  Whole genome duplication: challenges and considerations associated with sequence orthology assignment in Salmoninae.

Authors:  H K Moghadam; M M Ferguson; R G Danzmann
Journal:  J Fish Biol       Date:  2011-09       Impact factor: 2.051

5.  A genetic linkage map for Arctic char (Salvelinus alpinus): evidence for higher recombination rates and segregation distortion in hybrid versus pure strain mapping parents.

Authors:  R A Woram; C McGowan; J A Stout; K Gharbi; M M Ferguson; B Hoyheim; E A Davidson; W S Davidson; C Rexroad; R G Danzmann
Journal:  Genome       Date:  2004-04       Impact factor: 2.166

6.  The Atlantic salmon genome provides insights into rediploidization.

Authors:  Sigbjørn Lien; Ben F Koop; Simen R Sandve; Jason R Miller; Matthew P Kent; Torfinn Nome; Torgeir R Hvidsten; Jong S Leong; David R Minkley; Aleksey Zimin; Fabian Grammes; Harald Grove; Arne Gjuvsland; Brian Walenz; Russell A Hermansen; Kris von Schalburg; Eric B Rondeau; Alex Di Genova; Jeevan K A Samy; Jon Olav Vik; Magnus D Vigeland; Lis Caler; Unni Grimholt; Sissel Jentoft; Dag Inge Våge; Pieter de Jong; Thomas Moen; Matthew Baranski; Yniv Palti; Douglas R Smith; James A Yorke; Alexander J Nederbragt; Ave Tooming-Klunderud; Kjetill S Jakobsen; Xuanting Jiang; Dingding Fan; Yan Hu; David A Liberles; Rodrigo Vidal; Patricia Iturra; Steven J M Jones; Inge Jonassen; Alejandro Maass; Stig W Omholt; William S Davidson
Journal:  Nature       Date:  2016-04-18       Impact factor: 49.962

7.  Genome evolution in the fish family salmonidae: generation of a brook charr genetic map and comparisons among charrs (Arctic charr and brook charr) with rainbow trout.

Authors:  Evan R Timusk; Moira M Ferguson; Hooman K Moghadam; Joseph D Norman; Chris C Wilson; Roy G Danzmann
Journal:  BMC Genet       Date:  2011-07-28       Impact factor: 2.797

8.  A dense linkage map for Chinook salmon (Oncorhynchus tshawytscha) reveals variable chromosomal divergence after an ancestral whole genome duplication event.

Authors:  Marine S O Brieuc; Charles D Waters; James E Seeb; Kerry A Naish
Journal:  G3 (Bethesda)       Date:  2014-03-20       Impact factor: 3.154

9.  Mechanisms of Gene Duplication and Translocation and Progress towards Understanding Their Relative Contributions to Animal Genome Evolution.

Authors:  Olivia Mendivil Ramos; David E K Ferrier
Journal:  Int J Evol Biol       Date:  2012-08-07

10.  Construction and Annotation of a High Density SNP Linkage Map of the Atlantic Salmon (Salmo salar) Genome.

Authors:  Hsin Y Tsai; Diego Robledo; Natalie R Lowe; Michael Bekaert; John B Taggart; James E Bron; Ross D Houston
Journal:  G3 (Bethesda)       Date:  2016-07-07       Impact factor: 3.154

View more
  13 in total

1.  Sex Chromosome Evolution, Heterochiasmy, and Physiological QTL in the Salmonid Brook Charr Salvelinus fontinalis.

Authors:  Ben J G Sutherland; Ciro Rico; Céline Audet; Louis Bernatchez
Journal:  G3 (Bethesda)       Date:  2017-08-07       Impact factor: 3.154

2.  A Dense Brown Trout (Salmo trutta) Linkage Map Reveals Recent Chromosomal Rearrangements in the Salmo Genus and the Impact of Selection on Linked Neutral Diversity.

Authors:  Maeva Leitwein; Bruno Guinand; Juliette Pouzadoux; Erick Desmarais; Patrick Berrebi; Pierre-Alexandre Gagnaire
Journal:  G3 (Bethesda)       Date:  2017-04-03       Impact factor: 3.154

3.  Differential gene expression during early development in recently evolved and sympatric Arctic charr morphs.

Authors:  Jóhannes Guðbrandsson; Sigríður Rut Franzdóttir; Bjarni Kristófer Kristjánsson; Ehsan Pashay Ahi; Valerie Helene Maier; Kalina Hristova Kapralova; Sigurður Sveinn Snorrason; Zophonías Oddur Jónsson; Arnar Pálsson
Journal:  PeerJ       Date:  2018-02-07       Impact factor: 2.984

4.  The Arctic charr (Salvelinus alpinus) genome and transcriptome assembly.

Authors:  Kris A Christensen; Eric B Rondeau; David R Minkley; Jong S Leong; Cameron M Nugent; Roy G Danzmann; Moira M Ferguson; Agnieszka Stadnik; Robert H Devlin; Robin Muzzerall; Michael Edwards; William S Davidson; Ben F Koop
Journal:  PLoS One       Date:  2018-09-13       Impact factor: 3.240

5.  Design and characterization of an 87k SNP genotyping array for Arctic charr (Salvelinus alpinus).

Authors:  Cameron M Nugent; Jong S Leong; Kris A Christensen; Eric B Rondeau; Matthew K Brachmann; Anne A Easton; Christine L Ouellet-Fagg; Michelle T T Crown; William S Davidson; Ben F Koop; Roy G Danzmann; Moira M Ferguson
Journal:  PLoS One       Date:  2019-04-05       Impact factor: 3.240

6.  Extensive genetic differentiation between recently evolved sympatric Arctic charr morphs.

Authors:  Jóhannes Guðbrandsson; Kalina H Kapralova; Sigríður R Franzdóttir; Þóra Margrét Bergsveinsdóttir; Völundur Hafstað; Zophonías O Jónsson; Sigurður S Snorrason; Arnar Pálsson
Journal:  Ecol Evol       Date:  2019-09-12       Impact factor: 2.912

7.  Exploring a Pool-seq-only approach for gaining population genomic insights in nonmodel species.

Authors:  Sara Kurland; Christopher W Wheat; Maria de la Paz Celorio Mancera; Verena E Kutschera; Jason Hill; Anastasia Andersson; Carl-Johan Rubin; Leif Andersson; Nils Ryman; Linda Laikre
Journal:  Ecol Evol       Date:  2019-09-26       Impact factor: 2.912

8.  Parallelism in eco-morphology and gene expression despite variable evolutionary and genomic backgrounds in a Holarctic fish.

Authors:  Arne Jacobs; Madeleine Carruthers; Andrey Yurchenko; Natalia V Gordeeva; Sergey S Alekseyev; Oliver Hooker; Jong S Leong; David R Minkley; Eric B Rondeau; Ben F Koop; Colin E Adams; Kathryn R Elmer
Journal:  PLoS Genet       Date:  2020-04-17       Impact factor: 5.917

9.  Using Linkage Maps as a Tool To Determine Patterns of Chromosome Synteny in the Genus Salvelinus.

Authors:  Matthew C Hale; Garrett J McKinney; Courtney L Bell; Krista M Nichols
Journal:  G3 (Bethesda)       Date:  2017-11-06       Impact factor: 3.154

10.  Mapping of Adaptive Traits Enabled by a High-Density Linkage Map for Lake Trout.

Authors:  Seth R Smith; Stephen J Amish; Louis Bernatchez; Jeremy Le Luyer; Chris C Wilson; Olivia Boeberitz; Gordon Luikart; Kim T Scribner
Journal:  G3 (Bethesda)       Date:  2020-06-01       Impact factor: 3.154

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.