Literature DB >> 25428894

Construction of a high-density, high-resolution genetic map and its integration with BAC-based physical map in channel catfish.

Yun Li1, Shikai Liu1, Zhenkui Qin1, Geoff Waldbieser2, Ruijia Wang1, Luyang Sun1, Lisui Bao1, Roy G Danzmann3, Rex Dunham1, Zhanjiang Liu4.   

Abstract

Construction of genetic linkage map is essential for genetic and genomic studies. Recent advances in sequencing and genotyping technologies made it possible to generate high-density and high-resolution genetic linkage maps, especially for the organisms lacking extensive genomic resources. In the present work, we constructed a high-density and high-resolution genetic map for channel catfish with three large resource families genotyped using the catfish 250K single-nucleotide polymorphism (SNP) array. A total of 54,342 SNPs were placed on the linkage map, which to our knowledge had the highest marker density among aquaculture species. The estimated genetic size was 3,505.4 cM with a resolution of 0.22 cM for sex-averaged genetic map. The sex-specific linkage maps spanned a total of 4,495.1 cM in females and 2,593.7 cM in males, presenting a ratio of 1.7 : 1 between female and male in recombination fraction. After integration with the previously established physical map, over 87% of physical map contigs were anchored to the linkage groups that covered a physical length of 867 Mb, accounting for ∼90% of the catfish genome. The integrated map provides a valuable tool for validating and improving the catfish whole-genome assembly and facilitates fine-scale QTL mapping and positional cloning of genes responsible for economically important traits.
© The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

Entities:  

Keywords:  SNP; catfish; genome; linkage map; physical map

Mesh:

Year:  2014        PMID: 25428894      PMCID: PMC4379976          DOI: 10.1093/dnares/dsu038

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


Introduction

Genetic linkage maps are essential for the understanding of genomic levels of organization of inheritance of traits.[1] Construction of high-density and high-resolution genetic maps is a key step for fine mapping of quantitative trait loci (QTL) and marker-assisted selection. In addition, genetic linkage maps are valuable resources for the generation of chromosome-level assembly of whole-genome sequences and for comparative genome analysis.[2] In most of the recent whole-genome sequencing cases, whole-genome sequences are generated using next-generation sequencing (NGS). Short sequence reads are assembled into contigs. Such contigs are generally still relatively short although they vary in sizes from several kilobases to tens of kilobases. Increases in genome sequencing coverage and sequencing libraries can increase the quality of the assembly, allowing the sizes of contigs to be increased. However, NGS methods alone cannot provide the resources to assemble complex genomes at the chromosomal level. Genome sequence assemblies at this level require the assembly of tens of thousands to hundreds of thousands of contigs. Highly segmented genome assemblies prohibit efficient genome analysis. Therefore, various genome resources have been created to reduce the segmentation of the genome assemblies. One of these resources is the large-insert-based physical maps. Historically, several types of large insert libraries have been used. These include yeast artificial chromosomes (YACs),[3] bacterial artificial chromosomes (BACs),[4] and cosmid-based libraries.[5] YACs have the largest capacity for cloning the large inserts, but they are relatively unstable; therefore, their use in genome studies has been limited. Cosmid libraries have the smallest capacity for cloning the large inserts; therefore, their use in large insert libraries has also been limited. BACs are the most popular large insert libraries as they are stable and can hold inserts of up to 200 kb. BAC-based physical maps can organize the entire genome into restriction fingerprint-based contigs. Such contigs are similar to the whole-genome sequencing contigs, but they are constructed using overlapping restriction enzyme fingerprints rather than overlapping sequences themselves for the whole-genome sequence assemblies. By analysis of restriction fingerprints of overlapping genomic clones of BAC inserts, the whole genome can be organized into a limited number of contigs, most often in thousands. For instance, the catfish physical maps had 3,307 contigs and 1,891 contigs.[6,7] The integrated catfish physical map included over 2,500 contigs (unpublished). Integration of physical maps and whole-genome sequence contigs allows the relationship to be established between the sequence-based contigs and the restriction fingerprint-based contigs, thereby reducing the levels of segmentation of the genome. One of the major applications of genetic linkage map is to integrate physical maps and whole-genome assemblies. Integration of genetic map with physical map is useful for understanding genomes from different dimensions and is essential for comparative genome analysis,[8] fine-scale QTL mapping and positional cloning of genes responsible for performance and production traits.[9,10] In aquaculture fish species, genetic maps have been constructed in a few species, such as Asian seabass,[11] Atlantic salmon,[12] half-smooth tongue sole,[13] rainbow trout,[14] common carp,[15] and catfish.[2,9,16,17] These maps harbour several hundred to a couple of thousands of markers, with which QTL for agriculturally important performance and production traits can only be mapped in large genomic regions. Integrated maps have been developed in several aquaculture fish species using low-density genetic maps. In Atlantic salmon, in addition to 579 BAC contigs that were integrated into the linkage map using microsatellite markers, identification and mapping of new BAC-anchored single-nucleotide polymorphism (SNP) markers from BAC-end sequences placed 73 additional BAC contigs to Atlantic salmon linkage groups.[18] The second generation of rainbow trout integrated map anchored up to 265 contigs to the genetic map, covering ∼11% of the genome.[10] In common carp, a total of 463 physical map contigs and 88 single BACs were integrated into the genetic linkage map, which covered 30% of the common carp genome.[15] In catfish, 2,030 BAC-end sequence (BES)-derived microsatellites from 1,481 physical map contigs were used for linkage map construction, which anchored 44.8% of the catfish BAC physical map contigs, covering ∼52.8% of the genome.[9] Apparently, the level of integration is dependent on the density and resolution of the genetic maps. One objective of this study was to construct a high-density and high-resolution genetic linkage map by using a large number of molecular markers, covering the entire genome and a large resource panel for linkage mapping analysis. In addition to this primary goal, a secondary goal was to increase integration of the linkage and physical maps, using markers derived from BECs for genetic linkage analysis. Our previous genetic linkage analysis used mostly microsatellite markers. In spite of their high polymorphism, and low cost of single marker genotyping, an analysis of tens of thousands of microsatellites with a large number of mapping fish is extremely labourious and cost-prohibitive. SNPs overcome these difficulties by providing high efficiency and low-cost, large-scale genotyping. SNPs have become markers of choice because of their abundance, even genomic distribution, and easy adaptation to automation.[19] Recent advances in NGS have allowed rapid discovery of genome-wide SNPs in any organism in a cost-effective manner.[20] With the availability of large numbers of SNPs, high-density SNP array platform can be developed for high-throughput and efficient genotyping. Alternatively, an NGS-based genotyping-by-sequencing (GBS)[19-21] is also highly efficient. SNP arrays and GBS have both been used to genotype large-scale SNPs for genetic mapping and association analyses.[12,22-28] Compared with GBS, SNP arrays are more cost-effective. In addition, they provide a greater level of genome coverage. For instance, in most cases, the total number of commonly analysed SNPs is limited to several thousands while millions of SNPs can be analysed by using very high-density SNP arrays. In addition, GBS of large resource panels for high-resolution maps is cost-prohibitive. Channel catfish, Ictalurus punctatus, is the primary aquaculture species in the United States. Since the initiation of genome research over two decades ago, several genetic maps have been constructed with different types of molecular markers and various resource families.[2,9,16,17] Up to date, the highest density map was developed to contain 2,030 microsatellites and 100 SNPs.[9] Although this genetic map has been very useful for genetic and genomic analysis,[29,30] marker density in this map was still fairly low and only facilitated integration to the physical map for ∼52% of catfish genome. Recently, following efforts to expand catfish genomic resources, we have identified millions of SNPs in channel catfish[31,32] and developed high-density SNP arrays,[33] which provided the opportunity to develop a high-density and high-resolution SNP-based genetic map. Here we report the construction of genetic linkage map with over 50,000 SNPs in channel catfish, which is, to the best of our knowledge, the highest density genetic map for any aquaculture species. Along with the high density of markers, the utilization of BAC-associated SNPs also allowed significant increase of the integration of the genetic linkage map with the BAC-based physical map in catfish, allowing over 90% of the catfish genome physical map contigs to be mapped to linkage groups. This genetic map should serve as a valuable framework for validating the reference whole-genome sequences, extensive comparative and functional genomic studies, and fine-scale QTL mapping and association studies in catfish.

Materials and methods

Resource families and DNA preparation

A total of 576 fish, with 192 full-sibling individuals from each of three full-sibling channel catfish families, were used for linkage mapping. The parent and grandparent samples were also obtained. All DNAs for these samples used in this study were provided by USDA-ARS Warmwater Aquaculture Research Unit, which were prepared following the procedures as previously described.[17]

SNP genotyping

DNA samples were arranged in 96-well microtitre plates and diluted to a final concentration of 50 ng/µl with the final volume of 10 µl. Genotyping was conducted by GeneSeek (Lincoln, NE, USA) using the catfish 250K SNP array.[33] Affymetrix CEL files were analysed using Affymetrix Genotyping Console software (version 4.0) for quality control analysis and SNP genotype calling using the Affymetrix AxiomGT1 algorithm. Samples passing the quality control (Dish value > 0.85) and SNP call rates threshold (>95%) were retained for analysis. Following genotyping, an R package, SNPolisher, was used to generate the genotyping outputs. The package produced SNP quality control metrics and divided SNPs into several classes based on the quality of genotype calls.[33] The polymorphic markers with high resolution of cluster separation (high genotyping quality) were remained for further analysis. The CHP files generated from the Affymetrix Genotyping Console were imported into SNP & Variation Suite (SVS version 7, Golden Helix, Inc.) for further filtering to remove SNPs with missing genotypes >10% and minor allele frequency <5%.

Linkage map construction

Only SNPs that had high quality of genotype calls and were heterozygous in at least one parent were retained for linkage analysis. Based on segregation patterns, SNPs were categorized into three types: 1 : 2 : 1 type (AB × AB, segregating in both parents), 1 : 1 type (AB × AA or AB × BB, segregating only in female), and 1 : 1 type (AA × AB or BB × AB, segregating only in male). The χ2 goodness-of-fit tests were performed to examine the segregation distortion. Markers with significant segregation distortion were excluded (P < 0.001). Linkage maps were initially developed independently for the three mapping families. Sex-specific maps were constructed for each parent using 1 : 1 segregation-type markers only (AB × AA or AB × BB for female map and AA × AB or BB × AB for male map). The markers of segregation type 1 : 2 : 1 (AB × AB) were used to build the sex-averaged map. The LINKMFEX software (version 2.4, http://www.uoguelph.ca/~rdanzman/software.htm) was used to perform linkage analysis for sex-specific markers, and OneMap (R package, version 2.0) was used for linkage analysis of sex-averaged markers. Linkage between markers was examined by estimating logarithm of the odds (LOD) scores for recombination fraction (θ). A LOD threshold of 8.0 was used, and a maximum θ of 0.35 was set to assign markers into linkage groups. For markers falling into ‘zero recombination clusters', only one marker was picked that had the most informative meiosis as the representative marker of each cluster for linkage map construction to reduce the power required for computation of linkage. Genetic maps were constructed using JoinMap software version 4.0 with the regression mapping algorithm.[34] The Kosambi mapping function was used to convert the recombination frequencies into map distances (centiMorgans). The positions of markers were determined according to the sequential build-up of the map.[35] Briefly, a pair of markers was firstly selected, followed by sequential addition of other markers. The ‘ripple’ was performed each time after adding one marker. The best fitting position of the added markers was searched based on the goodness-of-fit test for the resulting map. When a marker generated a negative map distance or a large ‘jump’ value in goodness-of-fit test, the marker was removed, and map calculation was continued to construct a first-round map. Thereafter, the removed markers were added to the first-round map and again subjected to the goodness-of-fit test to generate an optimum order of markers. For the sex-specific markers, the linkage phases were inferred automatically, while for the markers with 1 : 2 : 1 segregation type (AB × AB), the linkage phase was deduced based on the genotypes of grandparents to assist the map construction. The consensus maps were then established using the MergeMap[36] by integrating individual maps from three reference families through shared markers. Lastly, the markers falling into ‘zero recombination clusters’ that were excluded during map construction were anchored to the linkage map, based on the positions of their corresponding representative markers. All genetic linkage maps were drawn using MAPCHART 2.2.[37]

Differences in recombination rates between families and sexes

To assess differences in recombination rates among the three resource families, the sex-averaged map was used following the M-test according to Ott's method.[38] where represents the LOD scores of maximum-likelihood estimation (MLE) for the ith reference family for common marker pairs among the three families. The represents the total LOD scores of MLE for all three families. The recombination fractions for all mapped locus intervals from each family were obtained from JoinMap. To investigate sex-specific heterogeneity throughout each linkage group, common marker pairs were used to compare the locus intervals across the male-specific map and female-specific map using contingency G-test.[39]

Integration of linkage map with physical map

To integrate previously developed physical map[40] with the high-density linkage map constructed in the present work, all the mapped SNPs with 70-bp flanking sequences were aligned with BES[8,40] and BAC-based physical map contig-specific sequences (PMCSS)[41] using BLAST with the E-value cut-off 1E−10, minimal alignment length of 36, and minimal identity of 95%. The relationship of physical map and genetic map was built up based on the SNP-associated BES and PMCSS. The BAC physical map contigs that harboured BAC-derived BES and PMCSS were anchored onto the genetic map based on the SNP positions.

Results

Selection of SNP markers

As summarized in Table 1, SNP genotypes were obtained from 576 samples of three mapping families (192 samples per family). According to the assessment of genotyping quality and polymorphism in all samples from the three mapping families, genotypes of a total of 121,521 SNPs were used (Table 1). Owing to the low genotype calling rate (<95%), nine individuals were excluded for analysis, including one individual from family 1, three individuals from family 2, and five individuals from family 3. After further filtering to remove SNPs with low calling rate (<97%) and non-Mendelian inheritance (P < 0.001), a total of 18,866 SNPs from family 1, 33,224 SNPs from family 2, and 35,093 SNPs from family 3 were retained for linkage mapping. Taken together, a total of 64,889 SNPs that were informative in at least one of the three mapping families were used.
Table 1.

SNPs selected for linkage mapping

ItemNumber
SNPs on array250,113
SNPs polymorphic and with high genotyping quality121,521
SNPs used for linkage mapping after further filtering64,889
SNPs from family 1 used for linkage mapping18,866
SNPs from family 2 used for linkage mapping33,224
SNPs from family 3 used for linkage mapping35,093
SNPs selected for linkage mapping

Linkage mapping

Linkage maps were first constructed for each of the three families, separately. For all three mapping families, 29 linkage groups (LGs) were obtained, which was consistent with the number of chromosomes of the catfish haploid genome. The consensus genetic linkage maps were obtained by merging separate maps from three mapping families. A total of 31,387 markers with distinct genetic positions (hereafter referred to as unique markers) were placed on the consensus linkage maps, which included 8,644 SNPs from family 1, 13,477 SNPs from family 2, and 14,343 SNPs from family 3 (Table 2). After anchoring the previously excluded markers that fell into ‘zero recombination clusters’ based on the genetic positions of representative markers, a total of 54,342 SNP markers were placed onto the current linkage map (Table 2).
Table 2.

SNPs placed on the linkage maps

ItemNumber
Unique SNPs from family 1 mapped to linkage map8,644
Unique SNPs from family 2 mapped to linkage map13,477
Unique SNPs from family 3 mapped to linkage map14,343
Total unique SNPs mapped to linkage map31,387
Total SNPs mapped to linkage map54,342
SNPs placed on the linkage maps The sex-specific maps were constructed using markers that were heterozygous in only female or male parent. The female genetic map consisted of 18,444 SNPs including 9,746 unique markers, with a total genetic length of 4,495.1 cM (Table 3; Fig. 1). The marker intervals estimated based on the unique marker positions ranged from 0.37 cM/marker pair in LG12 to 0.7 cM/marker pair in LG18 with the average marker interval of 0.46 cM/marker pair in female genetic map. The male genetic map comprised 15,148 SNPs with 7,250 unique markers and a total genetic length of 2,593.7 cM, which was much shorter than female genetic map. The marker intervals of male genetic map ranged from 0.25 cM/marker pair (LG18) to 0.55 cM/marker pair (LG1), with an average marker interval of 0.36 cM (Table 3, Fig. 2). It should be noted that these distances refer to intervals where recombination was detected and of course it should be recognized that in both sexes, the minimum distance observed was 0 cM for completely linked markers. The female- and male-specific maps are illustrated in Figs 1 and 2, respectively. The detailed map information was provided in Supplementary data S1.
Table 3.

Summary of the sex-specific linkage maps of channel catfish

Linkage groupFemale-specific map
Male-specific map
F:M ratio
Mapped markersDistinct positionsGenetic length (cM)Marker interval (cM)Mapped markersDistinct positionsGenetic length (cM)Marker interval (cM)
1810397209.80.5337415383.90.552.50
2616370147.10.4051725190.60.361.62
3934481230.60.48665342126.40.371.82
4774380166.80.446993131100.351.52
5643353165.30.4744119979.10.402.09
6821470205.30.44606300132.10.441.55
7583313134.90.4337019663.10.322.14
8550326160.40.4961529792.70.311.73
9751399177.60.4554629197.10.331.83
10488238133.40.5651622875.70.331.76
11815434198.50.46616273111.60.411.78
12695361133.90.3772731182.20.261.63
13604311149.30.48557285123.60.431.21
14598340133.90.3935115079.20.531.69
15651332145.50.4452424873.80.301.97
16799385178.10.46635337111.20.331.60
17792420175.40.42506264103.60.391.69
18473230160.30.7064029073.50.252.18
19542302139.30.4649525695.90.371.45
20657344162.50.4755724488.20.361.84
21576318134.10.4243721776.40.351.76
2230815365.80.4338016967.90.400.97
23668319125.30.3947824573.90.301.70
24518296134.50.4540819572.60.371.85
25665348150.20.43535269109.30.411.37
26389181124.90.6951224276.90.321.62
27595327152.90.4752223370.40.302.17
28663334141.70.4248423780.50.341.76
29466284157.40.5543521572.50.342.17
Total18,4449,7464,495.10.4615,1487,2502,593.70.361.73
Figure 1.

Illustration of female-specific linkage map.

Figure 2.

Illustration of male-specific linkage map.

Summary of the sex-specific linkage maps of channel catfish Illustration of female-specific linkage map. Illustration of male-specific linkage map. The sex-averaged map was constructed using markers that were heterozygous in both parents. As summarized in Table 4, a total of 29,081 SNPs were mapped, which consisted of 15,598 unique markers. The sex-averaged map spanned 3,505.4 cM with an average marker interval of 0.22 cM, ranging from 0.17 cM/marker (LG12) to 0.30 cM/marker (LG18). The genetic map is illustrated in Fig. 3. The detailed map information was provided in Supplementary data S1.
Table 4.

Summary of the sex-averaged linkage map of channel catfish

Linkage groupSex-averaged map
Mapped markersDistinct positionsGenetic length (cM)Marker interval (cM)
11,039556139.50.25
21,014557116.20.21
31,212673169.70.25
41,220653128.10.20
51,002491110.20.22
61,183680163.30.24
7861422102.10.24
81,073557117.30.21
91,100604146.90.24
10913475105.80.22
11999572152.70.27
121,216625104.80.17
131,176630131.10.21
14868472113.80.24
151,047562115.70.21
161,219695145.30.21
171,178623138.20.22
18782388114.80.30
19914523118.90.23
201,0295501260.23
21968484110.30.23
2277936468.40.19
2392948693.30.19
24854492106.90.22
251,097584127.60.22
26811427101.20.24
27925516113.70.22
28969516114.10.22
29704421109.50.26
Total29,08115,5983,505.40.22
Figure 3.

Illustration of sex-averaged linkage map.

Summary of the sex-averaged linkage map of channel catfish Illustration of sex-averaged linkage map.

Analysis of recombination rates

Within each linkage group, mild to strong localized specific recombination patterns were observed, whereby recombination rates were usually elevated towards the end and suppressed in the middle of the map (Fig. 4). Clustered markers that fell into ‘zero recombination rate’ were observed in every linkage group, especially in positions close to the centromeres in contrast to the telomeres (Supplementary data S1).
Figure 4.

The patterns of localized regional recombination rates along each linkage group.

The patterns of localized regional recombination rates along each linkage group. There is no significant difference in recombination rate among three mapping families based on the examination with common marker pairs among the three families (see Methods section). In contrast, significant higher recombination rates were observed in a majority of the linkage groups of the female genetic map than that of male genetic map (P < 0.01). Overall, the female genetic map was 1,901.4 cM longer than male genetic map, with an average female-to-male ratio of 1.7 : 1 (Table 3). The ratio varied by linkage groups, ranging from 0.97 : 1 in LG22 to 2.50 : 1 in LG1 (Table 3). The largest differences in recombination rate between the female and male maps were observed in LG1, LG5, LG7, LG18, LG27, and LG29. Across all these linkage groups, the female : male recombination ratios exceeded 2.0. LG22 was unusual in that recombination rates were very similar between the sexes (i.e. 0.97 : 1). Within unique map positions, 504 SNPs were commonly shared between the two sexes, while significantly higher recombination rates were observed in majority of the common locus intervals (P < 0.01) (Fig. 5).
Figure 5.

Comparison of the recombination rate between female and male. The inter-marker distances (cM) for all pairs of adjacent markers from both female- and male-specific maps were used. The diagonal line represents sex-equal recombination rates.

Comparison of the recombination rate between female and male. The inter-marker distances (cM) for all pairs of adjacent markers from both female- and male-specific maps were used. The diagonal line represents sex-equal recombination rates.

Integration and validation with physical map

The integration of the genetic map with the BAC-based physical map anchored 2,728 (83%) of the 3,307 physical map contigs (Table 5). Together with the 1,481 (44.8%) physical map contigs that were anchored previously,[6] we were able to anchor a total of 2,880 (87.1%) physical map contigs consisting of 28,416 BAC clones (92.9% of total BAC clones). The sizes of anchored physical map contigs varied from 66.0 kb to 2,005.8 kb (Supplementary data S2), with an average size of 301 kb. Together, a total of 867.4 Mb of the physical map was integrated by genetic linkage map that accounted for ∼90% of the channel catfish genome (Table 5). Detailed information regarding integration of linkage map and physical map is provided in Supplementary data S2.
Table 5.

Integration of channel catfish linkage map with physical map

ItemNumberPercentage
Number of physical map contigsa3,307100
Number of physical map contigs with SNP markers3,24398.1
Number of physical map contigs anchored to linkage map in this work2,72882.5
Number of physical map contigs anchored in previous studyb1,48144.8
Total physical map contigs mapped to linkage map2,88087.1
Total BAC clones in the catfish physical mapa30,582100
Total BAC clones mapped on the linkage map28,41692.9
Total size of physical map contigsa965,279100
Total size of anchored physical map contigs867,36489.9

Data obtained from Xu et al.[6]

Data obtained from Ninwichian et al.[9]

Integration of channel catfish linkage map with physical map Data obtained from Xu et al.[6] Data obtained from Ninwichian et al.[9] The integration of linkage map and physical map enabled the cross-validation of the quality of the physical map and the genetic map (Table 6). A total of 1,467 physical map contigs containing 2,961 BES-associated SNPs were placed onto linkage maps. With 615 physical map contigs, two or more SNP markers were mapped that allowed the determination of orientation of the physical map contigs along the chromosome. With such contigs, the quality of physical map contigs was assessed based on the mapped markers. An illustration with linkage group 12 is provided in Fig. 6. The vast majority of physical contigs were validated, but a small fraction of physical map contigs was found incorrectly assembled, or the ordering of the SNP markers onto the map was in error.
Table 6.

Cross-validation between linkage map and physical map

ItemNumber
Number of physical map contigs with BES-associated SNPs2,790
Number of physical map contigs anchored through BES-associated SNPs1,467
Number of physical map contigs with at least two BES-associated SNPs615
Number of physical map contigs mapped onto different LGs47
Number of BAC clones with BES-associated SNPs2,707
Number of BAC clones with at least two BES-associated SNPs238
Number of BAC clones mapped onto different LGs7
Figure 6.

Illustration of integration of the linkage map (left) with the physical map (right). The linkage group 12 (LG12) was used for the illustration. Vertical bars represent physical map contigs containing at least two SNPs mapped onto the linkage map, and the dot spots indicate physical map contigs with only one SNP.

Cross-validation between linkage map and physical map Illustration of integration of the linkage map (left) with the physical map (right). The linkage group 12 (LG12) was used for the illustration. Vertical bars represent physical map contigs containing at least two SNPs mapped onto the linkage map, and the dot spots indicate physical map contigs with only one SNP. A total of 47 physical map contigs had markers that were mapped in different linkage groups, indicating the potential errors of the physical map assembly or the mapping errors of incorrectly assigned SNPs. These errors could be caused by SNPs falling into duplicated regions of genome. Moreover, of the 2,707 BAC clones with BES-associated SNPs, 238 BAC clones contained at least two BES-associated SNPs. Within this group, 7 BAC clones were mapped onto different LGs, suggesting the misplacement of those SNP markers onto the linkage groups. As an illustration, the integration of sex-averaged linkage map with physical map Contig72 and Contig534 is shown in Fig. 7, presenting the correct and incorrect physical map assembly, respectively. The manual check of the potential assembly or mapping error warrants the correctness of both physical map and linkage map in the next step.
Figure 7.

Example of validation of physical map using the linkage map. (A) shows the correct physical map contig using Contig72 as an example; (B) shows the incorrect physical map contig using Contig534 as an example.

Example of validation of physical map using the linkage map. (A) shows the correct physical map contig using Contig72 as an example; (B) shows the incorrect physical map contig using Contig534 as an example. Physical distances were estimated based on the position of SNP markers within each physical map contig, which allowed for estimation of the ratios of physical distance to genetic distance (Supplementary data S3). On the other hand, recombination frequencies can be calculated for physical map contigs with multiple SNP markers mapped onto the linkage map. With the sex-averaged linkage map, the genetic size of 3,505.4 cM and the physical size of 965,279 kb based on the physical map, we estimated the average genetic size as 3.6 cM per Mb. The ratios observed from the 250 physical map contigs that contained multiple SNPs mapped onto the linkage map ranged from 0 to 2.26 cM per Mb. The markers with higher ratios might indicate the potential loci of recombination hot spots.

Discussion

In this work, we constructed a high-density genetic map for channel catfish. This map possesses the highest marker density among all the genetic linkage maps constructed for any aquaculture species. Taking advantage of the catfish 250K SNP array, we were able to efficiently and cost-effectively genotype 250,000 SNPs in the 576 fish from three mapping families (192 fish per family). Such a high-density linkage map should be a valuable resource for analysis and fine-scale QTL mapping in catfish. Although the efficient SNP genotyping technology allowed the construction of high-density linkage map, the high-resolution was yet to be achieved. A large number of markers fell into ‘zero recombination clusters’ where no recombination events occur during meiosis between the markers among the fish used for mapping analysis. These represent closely arrayed adjacent SNP positions, and their ordering is likely best achieved by assessing their order within the aligned physical map contigs. Conversely, it was also observed that genetic recombination rates could be artificially inflated when genotyping errors occur or when markers with large amounts of missing data are included in the analysis.[42,43] Similar situations were observed previously in zebrafish during the construction of high-density genetic map. In zebrafish, the genetic sizes of initial map were over 1,000 cM per chromosome. After removal of genotyping errors, the genetic sizes were reduced to around 100 cM.[44] To reduce the effect of clustered markers on the map construction and reduce the computing load, we picked one representative anchor marker that had the most informative meiosis from each of these clusters in the linkage map construction. Afterwards, we rejoined the markers within the ‘zero recombination clusters’ that were excluded during the initial mapping steps. This procedure relocated the informative markers onto the linkage maps based on the positions of their representative anchor marker. Such clustered markers are still valuable for the chromosome-scale scaffolding. With the availability of such high-density mapped SNPs, the patterns of marker distribution across chromosomes can be examined. It appears that clustered markers were commonly found in regions around centromeres while less frequently found around the telomeres. This was consistent with observations in various genetic maps previously developed in fish species, such as tilapia,[45] medaka,[46] rainbow trout,[47] Atlantic salmon,[12] and catfish.[9,16] One potential explanation is that the centromeres contain abundant tandemly repeated, heterochromatic DNA sequences.[48] As shown in Fig. 4, the distribution of mapped SNP markers across chromosomes in channel catfish is consistent with the observation of telomere and centromere effects,[49] which would result in a higher recombination rate near the telomeres while a lower recombination rate near the centre of the chromosomes. The recombination rates might be positively correlated with GC content, as in human,[50] pig,[51] chicken,[52] rodent,[53] and yeast,[54] or correlated with some other genomic features such as gene density and the presence of genes determining recombination hotspots. The PRDM9 gene was recently found to determine the recombination hotspots in the mammalian genomes.[55-57] Future investigations on recombination landscape warrant the characterization of genomic features, affecting the meiotic recombination in catfish upon the availability of fully assembled genome sequences. The suppression of recombination in males relative to the females observed in channel catfish (averaged female-to-male ratio is 1.7 : 1) was consistent with our previous observation[2] as well as many other studies. Recombination rates differ between the two sexes in many organisms where recombination occurs more frequently in the homogametic sex than in the heterogametic sex.[50,58] The similar phenomenon was reported in many other fish species such as Atlantic salmon (1.38 : 1), rainbow trout (1.68 : 1), European sea bass (1.60 : 1), Arctic char (1.69 : 1), and silver carp (1.52 : 1).[12,14,58-61] Striking recombination differences between the sexes were observed in zebrafish (2.74 : 1)[58] and grass carp (2.0 : 1).[62] These observations indicated that even closely related fish species differ in genetic sizes and the extent of sexual dimorphism for recombination fraction,[50] similarly as in mammals. Although the genetic basis for recombination bias between sexes remains unknown, several theories were proposed to explain this observation. One explanation from selection perspective suggests that selection pressure is stronger in male gametes than in female gametes during the haploid life stage. This difference could lead to male-specific selection to maintain beneficial haplotype to decrease the male recombination rate.[58,63] An alternative explanation is that the female recombination rate is higher to compensate for the apparently less stringent checkpoint for achiasmatic chromosomes compared with males.[50] In human, cytological studies suggested that sex-specific differences in recombination may derive from chromatin differences established prior to the onset of the recombination pathway.[64] The high-density and the relatively high-resolution genetic map provides valuable resource for integrating the physical map and whole-genome assemblies. In this study, we anchored the catfish physical map to genetic linkage map through BES and PMCSS that contain SNP markers. With this linkage map of channel catfish, we were able to anchor 87% of the catfish BAC physical map contigs, covering ∼92% of the catfish genome. This is a great improvement on map integration compared with our previous efforts that only anchored 44.8% of the catfish BAC physical map contigs accounting for 52.8% of the catfish genome. To our knowledge, this is also the highest percentage of physical maps integration with genetic maps that were obtained in any aquaculture species. The well-integrated map is useful for comprehensive comparative genomic analyses, fine-scale mapping of QTL, and positional cloning for candidate genes.[10] Moreover, the integrated map can be used as an important resource for the validation of the catfish whole-genome assembly.

Supplementary data

Supplementary data are available at www.dnaresearch.oxfordjournals.org.

Funding

This project was supported by Agriculture and Food Research Initiative Competitive Grant (No. 2012-67015-19410) from the US Department of Agriculture (USDA) National Institute of Food and Agriculture (NIFA) and partially supported by USDA Aquaculture Research Program Competitive Grant (No. 2014-70007-22395) from USDA NIFA. Funding to pay the Open Access publication charges for this article was provided by a grant from USDA Aquaculture Research Program Competitive Grant no. 2014-70007-22395.
  58 in total

Review 1.  Genome-wide genetic marker discovery and genotyping using next-generation sequencing.

Authors:  John W Davey; Paul A Hohenlohe; Paul D Etter; Jason Q Boone; Julian M Catchen; Mark L Blaxter
Journal:  Nat Rev Genet       Date:  2011-06-17       Impact factor: 53.242

2.  A physical map of the human genome.

Authors:  J D McPherson; M Marra; L Hillier; R H Waterston; A Chinwalla; J Wallis; M Sekhon; K Wylie; E R Mardis; R K Wilson; R Fulton; T A Kucaba; C Wagner-McPherson; W B Barbazuk; S G Gregory; S J Humphray; L French; R S Evans; G Bethel; A Whittaker; J L Holden; O T McCann; A Dunham; C Soderlund; C E Scott; D R Bentley; G Schuler; H C Chen; W Jang; E D Green; J R Idol; V V Maduro; K T Montgomery; E Lee; A Miller; S Emerling; R Gibbs; S Scherer; J H Gorrell; E Sodergren; K Clerc-Blankenburg; P Tabor; S Naylor; D Garcia; P J de Jong; J J Catanese; N Nowak; K Osoegawa; S Qin; L Rowen; A Madan; M Dors; L Hood; B Trask; C Friedman; H Massa; V G Cheung; I R Kirsch; T Reid; R Yonescu; J Weissenbach; T Bruls; R Heilig; E Branscomb; A Olsen; N Doggett; J F Cheng; T Hawkins; R M Myers; J Shang; L Ramirez; J Schmutz; O Velasquez; K Dixon; N E Stone; D R Cox; D Haussler; W J Kent; T Furey; S Rogic; S Kennedy; S Jones; A Rosenthal; G Wen; M Schilhabel; G Gloeckner; G Nyakatura; R Siebert; B Schlegelberger; J Korenberg; X N Chen; A Fujiyama; M Hattori; A Toyoda; T Yada; H S Park; Y Sakaki; N Shimizu; S Asakawa; K Kawasaki; T Sasaki; A Shintani; A Shimizu; K Shibuya; J Kudoh; S Minoshima; J Ramser; P Seranski; C Hoff; A Poustka; R Reinhardt; H Lehrach
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

3.  A detailed linkage map of medaka, Oryzias latipes: comparative genomics and genome evolution.

Authors:  K Naruse; S Fukamachi; H Mitani; M Kondo; T Matsuoka; S Kondo; N Hanamura; Y Morita; K Hasegawa; R Nishigaki; A Shimada; H Wada; T Kusakabe; N Suzuki; M Kinoshita; A Kanamori; T Terado; H Kimura; M Nonaka; A Shima
Journal:  Genetics       Date:  2000-04       Impact factor: 4.562

4.  Heterogeneity in rates of recombination across the mouse genome.

Authors:  M W Nachman; G A Churchill
Journal:  Genetics       Date:  1996-02       Impact factor: 4.562

5.  Use of an ordered cosmid library to deduce the genomic organization of Mycobacterium leprae.

Authors:  K Eiglmeier; N Honoré; S A Woods; B Caudron; S T Cole
Journal:  Mol Microbiol       Date:  1993-01       Impact factor: 3.501

6.  Development of the catfish 250K SNP array for genome-wide association studies.

Authors:  Shikai Liu; Luyang Sun; Yun Li; Fanyue Sun; Yanliang Jiang; Yu Zhang; Jiaren Zhang; Jianbin Feng; Ludmilla Kaltenboeck; Huseyin Kucuktas; Zhanjiang Liu
Journal:  BMC Res Notes       Date:  2014-03-10

7.  Generation of physical map contig-specific sequences useful for whole genome sequence scaffolding.

Authors:  Yanliang Jiang; Parichart Ninwichian; Shikai Liu; Jiaren Zhang; Huseyin Kucuktas; Fanyue Sun; Ludmilla Kaltenboeck; Luyang Sun; Lisui Bao; Zhanjiang Liu
Journal:  PLoS One       Date:  2013-10-24       Impact factor: 3.240

8.  Identification and analysis of genome-wide SNPs provide insight into signatures of selection and domestication in channel catfish (Ictalurus punctatus).

Authors:  Luyang Sun; Shikai Liu; Ruijia Wang; Yanliang Jiang; Yu Zhang; Jiaren Zhang; Lisui Bao; Ludmilla Kaltenboeck; Rex Dunham; Geoff Waldbieser; Zhanjiang Liu
Journal:  PLoS One       Date:  2014-10-14       Impact factor: 3.240

9.  A dense genetic linkage map for common carp and its integration with a BAC-based physical map.

Authors:  Lan Zhao; Yan Zhang; Peifeng Ji; Xiaofeng Zhang; Zixia Zhao; Guangyuan Hou; Linhe Huo; Guiming Liu; Chao Li; Peng Xu; Xiaowen Sun
Journal:  PLoS One       Date:  2013-05-21       Impact factor: 3.240

10.  Mapping and validation of the major sex-determining region in Nile tilapia (Oreochromis niloticus L.) Using RAD sequencing.

Authors:  Christos Palaiokostas; Michaël Bekaert; Mohd G Q Khan; John B Taggart; Karim Gharbi; Brendan J McAndrew; David J Penman
Journal:  PLoS One       Date:  2013-07-11       Impact factor: 3.240

View more
  33 in total

1.  Multiple across-strain and within-strain QTLs suggest highly complex genetic architecture for hypoxia tolerance in channel catfish.

Authors:  Xiaozhu Wang; Shikai Liu; Chen Jiang; Xin Geng; Tao Zhou; Ning Li; Lisui Bao; Yun Li; Jun Yao; Yujia Yang; Xiaoxiao Zhong; Yulin Jin; Rex Dunham; Zhanjiang Liu
Journal:  Mol Genet Genomics       Date:  2016-10-12       Impact factor: 3.291

2.  Genome-wide association analysis of intra-specific QTL associated with the resistance for enteric septicemia of catfish.

Authors:  Huitong Shi; Tao Zhou; Xiaozhu Wang; Yujia Yang; Chenglong Wu; Shikai Liu; Lisui Bao; Ning Li; Zihao Yuan; Yulin Jin; Suxu Tan; Wenwen Wang; Xiaoxiao Zhong; Guyu Qin; Xin Geng; Dongya Gao; Rex Dunham; Zhanjiang Liu
Journal:  Mol Genet Genomics       Date:  2018-07-02       Impact factor: 3.291

3.  First High-Density Linkage Map and QTL Fine Mapping for Growth-Related Traits of Spotted Sea bass (Lateolabrax maculatus).

Authors:  Yang Liu; Haolong Wang; Haishen Wen; Yue Shi; Meizhao Zhang; Xin Qi; Kaiqiang Zhang; Qingli Gong; Jifang Li; Feng He; Yanbo Hu; Yun Li
Journal:  Mar Biotechnol (NY)       Date:  2020-05-19       Impact factor: 3.619

4.  GWAS analysis of QTL for enteric septicemia of catfish and their involved genes suggest evolutionary conservation of a molecular mechanism of disease resistance.

Authors:  Tao Zhou; Shikai Liu; Xin Geng; Yulin Jin; Chen Jiang; Lisui Bao; Jun Yao; Yu Zhang; Jiaren Zhang; Luyang Sun; Xiaozhu Wang; Ning Li; Suxu Tan; Zhanjiang Liu
Journal:  Mol Genet Genomics       Date:  2016-11-08       Impact factor: 3.291

5.  Identification of novel genes significantly affecting growth in catfish through GWAS analysis.

Authors:  Ning Li; Tao Zhou; Xin Geng; Yulin Jin; Xiaozhu Wang; Shikai Liu; Xiaoyan Xu; Dongya Gao; Qi Li; Zhanjiang Liu
Journal:  Mol Genet Genomics       Date:  2017-12-12       Impact factor: 3.291

6.  Allelically and Differentially Expressed Genes After Infection of Edwardsiella ictaluri in Channel Catfish as Determined by Bulk Segregant RNA-Seq.

Authors:  Yulin Jin; Tao Zhou; Wansheng Jiang; Ning Li; Xiaoyan Xu; Suxu Tan; Huitong Shi; Yujia Yang; Zihao Yuan; Wenwen Wang; Guyu Qin; Shikai Liu; Dongya Gao; Rex Dunham; Zhanjiang Liu
Journal:  Mar Biotechnol (NY)       Date:  2022-02-15       Impact factor: 3.619

7.  Increased Alternative Splicing as a Host Response to Edwardsiella ictaluri Infection in Catfish.

Authors:  Suxu Tan; Wenwen Wang; Xiaoxiao Zhong; Changxu Tian; Donghong Niu; Lisui Bao; Tao Zhou; Yulin Jin; Yujia Yang; Zihao Yuan; Dongya Gao; Rex Dunham; Zhanjiang Liu
Journal:  Mar Biotechnol (NY)       Date:  2018-07-16       Impact factor: 3.619

8.  A Genome-Wide Association Study Reveals That Genes with Functions for Bone Development Are Associated with Body Conformation in Catfish.

Authors:  Xin Geng; Shikai Liu; Zihao Yuan; Yanliang Jiang; Degui Zhi; Zhanjiang Liu
Journal:  Mar Biotechnol (NY)       Date:  2017-10-02       Impact factor: 3.619

9.  Construction of a High-Density Genetic Linkage Map and QTL Mapping for Growth-Related Traits in Takifugu bimaculatus.

Authors:  Yue Shi; Zhixiong Zhou; Bo Liu; Shengnan Kong; Baohua Chen; Huaqiang Bai; Leibin Li; Fei Pu; Peng Xu
Journal:  Mar Biotechnol (NY)       Date:  2020-01-03       Impact factor: 3.619

10.  Construction of High-Density Genetic Map and Mapping of Sex-Related Loci in the Yellow Catfish (Pelteobagrus fulvidraco).

Authors:  Dong Gao; Min Zheng; Genmei Lin; Wenyu Fang; Jing Huang; Jianguo Lu; Xiaowen Sun
Journal:  Mar Biotechnol (NY)       Date:  2020-01-02       Impact factor: 3.619

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.