| Literature DB >> 28744181 |
Satoshi Watanabe1, Chikaharu Tsukamoto1, Tatsuki Oshita1, Tetsuya Yamada2, Toyoaki Anai1, Akito Kaga3.
Abstract
Soybean (Glycine max) has a paleopolyploid genome, and many re-sequencing experiments to characterize soybean genotypes have been conducted using next-generation sequencing platforms. The accumulation of information about single nucleotide polymorphisms (SNPs) throughout the soybean genome has accelerated identification of genomic regions related to agronomically important traits through association studies. However, although many efficient mapping techniques that use next-generation sequencing are available, the number of practical approaches to identify genes/loci is still limited. In this study, we used a combination of restriction site-associated DNA sequencing (RAD-seq) and bulk segregant analysis (BSA) to identify quantitative trait locus (QTLs) for flowering time in a segregating population derived from a cross between Japanese soybean cultivars. Despite the homogeneous genetic background of the parents, over 7000 SNPs were identified and can be used to detect QTLs by RAD-seq BSA analysis. By comparing genotype frequency between early and late-flowering bulks from the F3 segregating population, we identified a QTL on Gm10, which corresponds to the previously identified E2 locus, and a QTL on Gm04, which is close to the E8 locus. Out of these SNPs, more than 2000 were easily converted to conventional DNA markers. Our approach would improve the efficiency of genetic mapping.Entities:
Keywords: QTL analysis; SNPs; flowering time; linkage map; next-generation sequencing; restriction site–associated DNA sequencing; soybean
Year: 2017 PMID: 28744181 PMCID: PMC5515319 DOI: 10.1270/jsbbs.17013
Source DB: PubMed Journal: Breed Sci ISSN: 1344-7610 Impact factor: 2.086
Summary of NGS data obtained from each file separated with an indexed sequence
| ID | Bulk or cultivar | Indexed sequence | Total reads (M) | Mapped pairs (M) | Ratio of mapped sequences | Average insert size (bp) | Standard deviation of insert size | Number of loci (K) | Average depth | Median of depth | Mode of depth | Estimated genome coverage |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| A01 | Early | TGACGCCA | 9.2 | 6.5 | 70.5% | 336.2 | 44.8 | 113.1 | 57.1 | 1 | 6 | 2.3% |
| A02 | Early | GGCTTA | 11.8 | 8.1 | 68.4% | 337.1 | 44.9 | 123.3 | 65.7 | 1 | 7 | 2.5% |
| A03 | Late | CTAAGCA | 9.7 | 6.9 | 71.3% | 336.6 | 44.8 | 120.1 | 57.7 | 1 | 6 | 2.4% |
| A04 | Fukuyutaka | GCCTACCT | 9.7 | 6.6 | 68.5% | 336.5 | 44.8 | 103.5 | 64.2 | 1 | 6 | 2.1% |
| B01 | Early | CAGATA | 8.6 | 6.0 | 69.4% | 335.4 | 44.9 | 120.8 | 49.7 | 1 | 6 | 2.4% |
| B02 | Early | AACGCACATT | 10.9 | 7.7 | 70.6% | 334.3 | 44.8 | 121.4 | 63.6 | 1 | 6 | 2.4% |
| B03 | Late | ATTAT | 12.3 | 8.4 | 68.2% | 333.4 | 45.1 | 125.5 | 66.6 | 1 | 7 | 2.5% |
| B04 | Fukuyutaka | CACCA | 9.5 | 6.6 | 68.9% | 338.4 | 44.7 | 108.5 | 60.5 | 1 | 6 | 2.2% |
| C01 | Early | GAAGTG | 8.9 | 6.5 | 72.5% | 337.2 | 44.8 | 124.9 | 51.6 | 1 | 6 | 2.5% |
| C02 | Early | GAGCGACAT | 11.7 | 8.2 | 69.7% | 335.7 | 44.7 | 121.4 | 67.3 | 1 | 6 | 2.4% |
| C03 | Late | GCGCTCA | 11.0 | 7.6 | 69.1% | 336.6 | 44.6 | 116.7 | 65.1 | 1 | 6 | 2.3% |
| C04 | Toyoshirome | AATTAG | 10.5 | 7.5 | 71.5% | 336.6 | 44.9 | 108.0 | 69.8 | 1 | 7 | 2.2% |
| D01 | Early | TAGCGGAT | 10.9 | 7.6 | 69.8% | 336.0 | 44.7 | 126.2 | 60.1 | 1 | 6 | 2.5% |
| D02 | Early | CCTTGCCATT | 8.2 | 5.8 | 70.5% | 334.5 | 44.8 | 119.3 | 48.2 | 1 | 6 | 2.4% |
| D03 | Late | ACTGCGAT | 12.8 | 8.4 | 65.4% | 335.7 | 44.8 | 124.6 | 67.1 | 1 | 7 | 2.5% |
| D04 | Toyoshirome | GGAACGA | 11.1 | 8.0 | 71.9% | 338.0 | 44.7 | 104.3 | 76.3 | 1 | 6 | 2.1% |
| E01 | Early | TATTCGCAT | 9.5 | 6.7 | 69.9% | 335.7 | 44.8 | 117.8 | 56.6 | 1 | 6 | 2.4% |
| E02 | Late | GGTATA | 9.4 | 6.6 | 69.9% | 334.9 | 45.0 | 113.0 | 58.4 | 1 | 6 | 2.3% |
| E03 | Late | TTCGTT | 10.3 | 6.6 | 64.4% | 336.3 | 44.8 | 117.9 | 56.3 | 1 | 6 | 2.4% |
| F01 | Early | ATAGAT | 8.5 | 6.0 | 70.3% | 334.2 | 45.0 | 121.2 | 49.5 | 1 | 6 | 2.4% |
| F02 | Late | TCTTGG | 10.0 | 7.1 | 71.0% | 338.3 | 44.7 | 119.5 | 59.2 | 1 | 6 | 2.4% |
| F03 | Late | ATATAA | 12.0 | 8.6 | 71.6% | 330.7 | 45.0 | 115.6 | 74.6 | 1 | 7 | 2.3% |
| G01 | Early | CCGAACA | 9.1 | 6.4 | 70.3% | 336.8 | 44.7 | 125.5 | 51.0 | 1 | 6 | 2.5% |
| G02 | Late | GGTGT | 16.1 | 10.9 | 67.4% | 338.5 | 44.8 | 121.5 | 89.5 | 1 | 6 | 2.4% |
| G03 | Late | TGGCAACAGA | 11.2 | 7.7 | 69.0% | 334.3 | 44.7 | 112.6 | 68.5 | 1 | 6 | 2.3% |
| H01 | Early | GGAAGACAT | 8.7 | 6.2 | 70.6% | 335.3 | 44.8 | 121.9 | 50.6 | 1 | 6 | 2.4% |
| H02 | Late | GGATA | 10.1 | 7.2 | 71.7% | 335.8 | 44.9 | 118.4 | 61.1 | 1 | 6 | 2.4% |
| H03 | Late | CTCGTCG | 4.5 | 3.0 | 66.5% | 337.1 | 44.7 | 89.7 | 33.1 | 1 | 4 | 1.8% |
|
| ||||||||||||
| ALL | 286.40 | 199.14 | 69.5% | N. C. | N. C. | 284.4 | 677.6 | 1 | 13 | 5.7% | ||
M, million; K, thousand.
Not calculated.
Fig. 1Distribution of flowering time in an F3 population derived from a cross between ‘Toyoshirome’ and ‘Fukuyutaka’. Arrows indicate the flowering time of parental lines. Black and white bars indicate plants included in the early and late bulks, respectively.
Fig. 2The physical distribution of SNPs (A) and the location of SNPs showing significant differences in genotype frequencies between the early and late bulks (B). (A) Physical locations of all 7077 SNPs used to evaluate regions with skewed genotype distribution by bulk segregant analysis. All SNPs were ordered according to their physical position. The number of SNPs on each chromosome is shown above the graph. (B) Positions of the SNPs with genotype frequency skewed between early and late bulks. The X-axis indicates physical positions of markers for each SNP/Indel locus. The Y-axis indicates the index score used to evaluate skewed genotype frequency. Black circles indicate loci whose index scores exceeded the threshold index score calculated with a permutation test with 1000 replications at a 1% significance level. The number of significant SNPs is shown above each chromosome. In (A) and (B), vertical lines separate chromosomes; gray boxes indicate pericentromeric regions.
Fig. 3DNA banding pattern of a dCAPS marker and QTL analysis using a linkage map constructed with dCAPS markers. (A) Segregation pattern of the dCAPS marker Gm10_50869394_HhaI in the TFF3 population analyzed by 12% polyacrylamide gel electrophoresis. Two samples on the right are parental lines: T, ‘Toyoshirome’; F, ‘Fukuyutaka’. (B) A genetic linkage map showing positions of dCAPS markers on Gm10. Physical position of each SNP on the reference soybean genome (Gmax189) is indicated. White circles indicate dCAPS markers listed in Supplemental Table 2; the black circle indicates the E2 locus. (C) LOD profiles for flowering phenotype analyzed by composite interval mapping.
Genetic effects estimated by linear regression analysis for candidate QTL loci
| Trait | AW277661(Gm04) | Gm05_36142849_TaqI | |||||
|---|---|---|---|---|---|---|---|
|
|
|
| |||||
| Additive effect | Dominance effect | Additive effect | Dominance effect | Additive effect | Dominance effect | PVE | |
| First flowering day from sowing (R1) | −2.1 | ns | 1.3 | ns | ns | ns | 41.0% |
| Flowering day of the top raceme (R2) | −3.2 | ns | 1.5 | ns | ns | ns | 54.3% |
| Harvest date (R7–R8) | −3.4 | ns | 2.1 | ns | ns | ns | 32.9% |
Genetic effects of the Toyoshirome allele compared to the Fukuyutaka allele.
Phenotypic variance explained by the QTLs.
Genetic effects significant at a level below 0.001 are shown; ns, not significant (p ≥ 0.001).