| Literature DB >> 28220139 |
Che-Wei Chang1, Yu-Hua Wang2, Chih-Wei Tung1.
Abstract
Although genotyping-by-sequencing (GBS) enables the efficient and low-cost generation of large numbers of markers, the utility of resultant genotypes are limited, because they are enormously error-prone and contain high proportions of missing data. In this study, we generated single nucleotide polymorphism (SNP) markers for 109 recombinant inbred lines of melon (Cucumis melo L.) using the GBS approach and ordered them according to their physical position on the draft double haploid line DHL92 genome. Next, by investigating associations between these SNPs, we discovered that some segments on the physical map conflict with linkage relationships. Therefore, to filter out error-prone loci, 4,110 SNPs in which we have a high degree of confidence were selected as anchors to test independence with respect to unselected markers, and the resultant dataset was then analyzed using the Full-Sib Family Haplotype (FSFHap) algorithm in the software TASSEL 5.2. On the basis of this analysis, 22,933 loci that have an average rate of missing data of 0.281% were used to construct a genetic map, which spans 1,088.3 cM across 12 chromosomes and has a maximum spacing of 6.0 cM. Use of this high-quality linkage map enabled the identification of several quantitative trait loci (QTL) known to control traits in fruit and validated our approach. This study highlights the utility of GBS markers for the identification of trait-associated QTLs in melon and facilitates further investigation of genome structure.Entities:
Keywords: genetic map; genotyping-by-sequencing; imputation; melon; quantitative trait loci mapping; single nucleotide polymorphism
Year: 2017 PMID: 28220139 PMCID: PMC5292975 DOI: 10.3389/fpls.2017.00125
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Imputation accuracy using the FSFHap method.
| Accuracy | Proportion of non-imputed genotype in masked sites | |
|---|---|---|
| Rep1 | 0.9927 | 0.0402 |
| Rep2 | 0.9931 | 0.042 |
| Rep3 | 0.993 | 0.0467 |
| Rep4 | 0.9926 | 0.1172 |
| Rep5 | 0.9931 | 0.0536 |
| Rep6 | 0.9931 | 0.0582 |
| Rep7 | 0.9928 | 0.0649 |
| Rep8 | 0.9931 | 0.0765 |
| Rep9 | 0.9925 | 0.0842 |
| Rep10 | 0.9927 | 0.091 |
| Average ± | 0.9929 ± 0.002 | 0.0675 ± 0.0247 |
Summary data of 22,933 single nucleotide polymorphisms (SNPs) imputed across 12 chromosomes.
| Chr | Heterozygous rate | Missing data rate | Proportion of non-imputed data | Proportion of imputed data |
|---|---|---|---|---|
| 1 | 0.005509976 | 0.002536071 | 0.002512495 | 0.4655728 |
| 2 | 0.009325329 | 0.002777643 | 0.002664380 | 0.4184533 |
| 3 | 0.007932834 | 0.004225234 | 0.004208400 | 0.4573773 |
| 4 | 0.005278082 | 0.003570840 | 0.003511678 | 0.4162332 |
| 5 | 0.016516580 | 0.001686420 | 0.001639444 | 0.4452289 |
| 6 | 0.017157728 | 0.001795166 | 0.001764291 | 0.4001191 |
| 7 | 0.008913678 | 0.002074646 | 0.002017306 | 0.4422383 |
| 8 | 0.008539611 | 0.002205918 | 0.002152657 | 0.4491951 |
| 9 | 0.003367869 | 0.003242645 | 0.003209691 | 0.4142676 |
| 10 | 0.019466502 | 0.002576260 | 0.002467042 | 0.4601933 |
| 11 | 0.013954779 | 0.004302875 | 0.004239196 | 0.4321751 |
| 12 | 0.008225462 | 0.002576348 | 0.002532361 | 0.4225085 |
| Total | 0.010184034 | 0.002811141 | 0.002759534 | 0.4365637 |
Genetic map and recombination rate summary.
| Number of SNPs | Number of bins | Genetic distance (cM) | Maximum gap | Physical distance (bp) | Recombination rate (cM/Mb) | |
|---|---|---|---|---|---|---|
| Chr 1 | 2,724 | 260 (263∗) | 106.7 (120.0∗) | 4.3 (8.8∗) | 35,191,472 | 3.03 (3.41∗) |
| Chr 2 | 1,701 | 169 | 79.5 | 3.8 | 26,170,228 | 3.04 |
| Chr 3 | 2,180 | 196 | 78.2 | 2.9 | 29,150,566 | 2.68 |
| Chr 4 | 2,171 | 253 | 114.9 | 6.0 | 32,910,366 | 3.49 |
| Chr 5 | 1,953 | 198 | 86.8 | 1.8 | 28,148,921 | 3.08 |
| Chr 6 | 2,080 | 253 | 100.4 | 3.5 | 35,697,279 | 2.81 |
| Chr 7 | 1,760 | 202 | 96.3 | 2.9 | 26,603,112 | 3.62 |
| Chr 8 | 2,067 | 204 | 86.6 | 4.6 | 32,490,372 | 2.66 |
| Chr 9 | 1,392 | 199 | 89.2 | 2.9 | 24,104,665 | 3.70 |
| Chr 10 | 1,428 | 142 | 66.2 | 5.7 | 24,868,054 | 2.66 |
| Chr 11 | 2,017 | 221 | 105.3 | 5.8 | 31,233,764 | 3.37 |
| Chr 12 | 1,460 | 196 | 78.3 | 2.7 | 26,355,078 | 2.97 |
| Total | 22,933 | 2,493 (2,496∗) | 1,088.3 (1101.6∗) |
Quantitative trait loci (QTL) mapping results using composite interval mapping (CIM) and 2,493 bins.
| Trait | Chr | Logarithm of odds (LOD) | Peak (cM) | 95% credible interval (cM) | Nearest bina | Flanking binsa | Additive effectb | PVE (%)c | ||
|---|---|---|---|---|---|---|---|---|---|---|
| EC | 3 | 4.57 | 70.71 | 66.43 | 71.63 | S3_27813199 | S3_27435285 | S3_28191793 | 0.6302 | 14.85 |
| 4 | 11.27 | 4.59 | 0.00 | 6.00 | S4_1583322 | S4_205086 | S4_1655078 | -1.0580 | 39.8 | |
| FL | 7 | 6.06 | 52 | 49 | 54.82 | S7_22247752 | S7_21673362 | S7_22751211 | 1.0372 | 19.82 |
| FT | 10 | 6.47 | 7.54 | 5 | 9.59 | S10_1291780 | S10_771413 | S10_1545821 | -0.2770 | 15.29 |
| FC | 4 | 5.11 | 9.03 | 8 | 12 | S4_1736057 | S4_1664870 | S4_1966752 | -0.7697 | 19.75 |
| 8 | 10.20 | 76.73 | 73.63 | 77.71 | S8_29946807 | S8_29681675 | S8_30003393 | -1.0594 | 36.57 | |
| ND | 2 | 9.40 | 40.99 | 39 | 41.04 | S2_17480256 | S2_16415727 | S2_17972200 | -0.6432 | 29.32 |
| NT | 2 | 6.49 | 33.66 | 32.26 | 34.60 | S2_11852937 | S2_7481584 | S2_14567314 | -0.4776 | 24.34 |
| PSW | 2 | 6.19 | 35.00 | 34 | 38 | S2_14567314 | S2_11852937 | S2_16194372 | -0.5565 | 21.93 |
| RLD | 7 | 6.73 | 52.28 | 51 | 54.40 | S7_22398054 | S7_21928298 | S7_22713003 | 0.0605 | 17.18 |