| Literature DB >> 29495372 |
Xiang Zhang1, Misaki Mizukoshi2, Hong Zhang3, Engkong Tan4, Yoji Igarashi5, Yutaka Suzuki6, Susumu Mitsuyama7, Shigeharu Kinoshita8, Kazuyoshi Saito9, Shugo Watabe10, Shuichi Asakawa11.
Abstract
Next-generation sequencing enables genome-wide genotyping of a large population and further facilitates the construction of a genetic linkage map. Low-coverage whole-genome sequencing has been employed for genetic linkage map construction in several species. However, this strategy generally requires available high-quality reference genomes and/or designed inbred pedigree lines, which restrict the scope of application for non-model and unsequenced species. Here, using torafugu (Takifugu rubripes) as a test model, we propose a new strategy for ultrahigh-density genetic linkage map construction using low-coverage whole-genome sequencing of a haploid/doubled haploid (H/DH) population without above requirements. Low-coverage (≈1×) whole-genome sequencing data of 165 DH individuals were used for de novo assembly and further performed single nucleotide polymorphisms (SNPs) calling, resulting in the identification of 1,070,601 SNPs. Based on SNP genotypes and de novo assembly, genotypes were associated with short DNA segments and an ultrahigh-density linkage map was constructed containing information of 802,277 SNPs in 3090 unique positions. Comparative analyses showed near-perfect concordance between the present linkage map and the latest published torafugu genome (FUGU5). This strategy would facilitate ultrahigh-density linkage map construction in various sexually reproducing organisms for which H/DH populations can be generated.Entities:
Keywords: Takifugu rubripes; doubled haploid population; genetic linkage map; linkage phase unknown; low-coverage whole-genome sequencing
Year: 2018 PMID: 29495372 PMCID: PMC5867841 DOI: 10.3390/genes9030120
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Flowchart of high-call-rate markers genotyping. (a) Low-coverage whole-genome sequencing of the doubled-haploid (DH) mapping population was performed to generate a library of short gun reads (in blue) for each sample (S1‒Sn). (b) De novo assembly was performed using the total whole-genome sequencing data to generate scaffolds (in red) containing a series of segments (in green). (c) Single nucleotide polymorphisms (SNP) calling and genotyping of each sample was carried out to construct a low-call-rate SNPs array (SNPs × Samples) with unknown phase. The SNPs (SNP1‒5) located on one short segment (Seg1) were phased at the segment level and merged together to assign the short segment genotype (SSG1). The high-call-rate SSGs array (SSGs × Samples) was then constructed with unknown phase at the chromatid level.
Figure 2Schematic illustration of the conversion of SSGs under different circumstances. The maternal chromatids possess linkage phase score A (blue) and B (red). In Sample 1, recombination occurs on Seg2 but outside of Seg1. In Sample 2, recombination occurs outside of both Seg1 and Seg2, which is the most common case.
Figure 3Ideograms of the genetic linkage map of torafugu. The genetic position of each SSG is illustrated in each linkage group. SSGs are indicated as a blue line if the sequences could be aligned to the published genome FUGU5 using BLASTN (version 2.2.29), and are otherwise shown as a pink line. LG: linkage group; the y-axis represents the genetic position (cM).
Summary of the genetic linkage map of torafugu.
| Linkage Group | Genetic Distance (cM) | SSGs Numbers | SNPs Numbers | Unique Positions | Marker Interval |
|---|---|---|---|---|---|
| LG1 | 198.25 | 2997 | 64,119 | 252 | 0.79 |
| LG2 | 96.95 | 1588 | 34,369 | 128 | 0.76 |
| LG3 | 112.13 | 1876 | 39,280 | 147 | 0.76 |
| LG4 | 95.15 | 1722 | 35,535 | 132 | 0.72 |
| LG5 | 96.42 | 1454 | 31,043 | 136 | 0.71 |
| LG6 | 81.72 | 1431 | 31,616 | 110 | 0.74 |
| LG7 | 128.42 | 1819 | 37,225 | 166 | 0.77 |
| LG8 | 132.44 | 2055 | 44,302 | 174 | 0.76 |
| LG9 | 104.88 | 1670 | 34,975 | 143 | 0.73 |
| LG10 | 62.75 | 1396 | 30,320 | 82 | 0.77 |
| LG11 | 104.43 | 1543 | 33,186 | 143 | 0.73 |
| LG12 | 79.40 | 1408 | 30,636 | 110 | 0.72 |
| LG13 | 122.28 | 1797 | 38,739 | 157 | 0.78 |
| LG14 | 97.47 | 1770 | 37,750 | 128 | 0.76 |
| LG15 | 92.43 | 1605 | 35,558 | 119 | 0.78 |
| LG16 | 64.71 | 1417 | 30,875 | 89 | 0.73 |
| LG17 | 110.91 | 1727 | 38,620 | 150 | 0.74 |
| LG18 | 67.20 | 1157 | 26,774 | 89 | 0.76 |
| LG19 | 112.95 | 1714 | 35,312 | 154 | 0.73 |
| LG20 | 120.46 | 1830 | 38,989 | 152 | 0.79 |
| LG21 | 133.31 | 1893 | 40,975 | 180 | 0.74 |
| LG22 | 104.98 | 1474 | 32,079 | 149 | 0.70 |
| Total | 2319.65 | 37,343 | 802,277 | 3090 | 0.75 1 |
1 Mean value of marker intervals.
Figure 4Heat map of recombination fractions (upper triangle) and logarithm of odds (LOD) scores (lower triangle) for all pairs of markers (SSGs) in the 22 linkage groups. LG represents linkage group.
Figure 5Comparison between the present genetic linkage map and FUGU5. Links connect the genetic and physical locations of each mapped SSG. Grey blocks on the left represent the 22 linkage groups of the present linkage map, whereas colorful blocks on the right represent the 22 chromosomes of FUGU5. LG: linkage group; chr: chromosome.
Figure 6Concordance of SNP positions on the present genetic linkage map with those on FUGU5. The x-axis represents the physical position (Mb) of SNPs on FUGU5, whereas the y-axis represents the genetic position (cM) of SNPs on the linkage map. Red arrows indicate potentially misplaced or segmentally polymorphic regions, whereas green arrows indicate potentially mis-oriented or segmentally polymorphic regions in FUGU5.