| Literature DB >> 31231761 |
Eri Ogiso-Tanaka1, Takehiko Shimizu1, Makita Hajika1, Akito Kaga1, Masao Ishimoto1.
Abstract
Whole-genome re-sequencing is a powerful approach to detect gene variants, but it is expensive to analyse only the target genes. To circumvent this problem, we attempted to detect novel variants of flowering time-related genes and their homologues in soybean mini-core collection by target re-sequencing using AmpliSeq technology. The average depth of 382 amplicons targeting 29 genes was 1,237 with 99.85% of the sequence data mapped to the reference genome. Totally, 461 variants were detected, of which 150 sites were novel and not registered in dbSNP. Known and novel variants were detected in the classical maturity loci-E1, E2, E3, and E4. Additionally, large indel alleles, E1-nl and E3-tr, were successfully identified. Novel loss-of-function and missense variants were found in FT2a, MADS-box, WDR61, phytochromes, and two-component response regulators. The multiple regression analysis showed that four genes-E2, E3, Dt1, and two-component response regulator-can explain 51.1-52.3% of the variation in flowering time of the mini-core collection. Among them, the two-component response regulator with a premature stop codon is a novel gene that has not been reported as a soybean flowering time-related gene. These data suggest that the AmpliSeq technology is a powerful tool to identify novel alleles.Entities:
Keywords: AmpliSeq; flowering time-related gene; genotyping; soybean; target re-sequencing
Mesh:
Year: 2019 PMID: 31231761 PMCID: PMC6589554 DOI: 10.1093/dnares/dsz005
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Genomic regions or SNP targeted by the AmpliSeq design
| Gmax275 (ver. 2.0) | Gmax189 (ver. 1.1) | Description | Gene name | Target | Total designed target length of amplicons | Target size | Number of amplicon | Coverage | Reference | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene ID | Chr. | Start | End | Gene ID | Chr. | Start | End | ||||||||
| Glyma.02G069500 | 2 | 6116379 | 6117379 | Glyma02g07650 | 2 | 6041770 | 6042409 | FLOWERING LOCUS T |
| Exon + UTR | 861 | 639 | 6 | 100.0 | 37, 38 |
| Glyma.03G194700 | 3 | 40522597 | 40525110 | Glyma03g35250 | 3 | 42534078 | 42535865 | TERMINAL FLOWER 1 |
| Exon + UTR | 1,577 | 1,003 | 9 | 100.0 | 37, 38 |
| Glyma.03G227300 | 3 | 42918771 | 42923401 | Glyma03g38620 | 3 | 44925730 | 44930360 | Phytochrome A |
| Exon + UTR | 3,688 | 2,876 | 20 | 98.6 | |
| Glyma.04G143300 | 4 | 26120011 | 26120532 | Glyma18g22670 | 18 | 25739929 | 25740831 | B3 domain-containing protein |
| Exon + UTR | 1,037 | 902 | 5 | 100.0 | 11, 54 |
| Glyma.04G156400 | 4 | 36758125 | 36758770 | Glyma04g24640 | 4 | 28293933 | 28294806 | B3 domain-containing protein |
| Exon + UTR | 967 | 873 | 5 | 100.0 | 11, 54 |
| Glyma.06G207800 | 6 | 20207077 | 20207940 | Glyma06g23026 | 6 | 20006928 | 20007814 | B3 domain-containing protein |
| Exon + UTR | 1,059 | 886 | 6 | 100.0 | 3, 4, 11 |
| Glyma.08G363100 | 8 | 47458142 | 47459829 | Glyma08g47810 | 8 | 46606934 | 46608654 | FLOWERING LOCUS T |
| Exon + UTR | 1,300 | 746 | 8 | 100.0 | 20 |
| Glyma.08G363200 | 8 | 47472881 | 47473362 | Glyma08g47823 | 8 | 46621704 | 46622185 | FLOWERING LOCUS T |
| Exon + UTR | 612 | 265 | 3 | 100.0 | 37, 38 |
| Glyma.09G035500 | 9 | 2960395 | 2967229 | Glyma09g03990 | 9 | 2919887 | 2926740 | Phytochrome B |
| Exon + UTR | 5,150 | 4,336 | 27 | 99.7 | 32 |
| Glyma.09G143500 | 9 | 35652219 | 35653967 | Glyma09g26550 | 9 | 33049107 | 33050904 | BROTHER OF FT AND TFL 1 |
| Gene | 1,813 | 1,122 | 11 | 98.7 | 37 |
| Glyma.10G141400 | 10 | 37489560 | 37495624 | Glyma10g28170 | 10 | 36962521 | 36968813 | Phytochrome A |
| Exon + UTR | 5,212 | 4,065 | 29 | 94.6 | 14 |
| Glyma.10G221500 | 10 | 45294735 | 45316121 | Glyma10g36600 | 10 | 44732730 | GIGANTEA |
| SNP | 222 | 1 | 1 | 100.0 | 3, 4, 12 | |
| Glyma.12G073900 | 12 | 5508365 | 5522772 | Glyma12g07861 | 12 | 5496565 | 5511828 | Two-component response regulator-like |
| Exon + UTR | 4,116 | 2,871 | 24 | 100.0 | |
| Glyma.15G140000 | 15 | 11435551 | 11442683 | Glyma15g14980 | 15 | 11415495 | 11422656 | Phytochrome B |
| Exon + UTR | 5,445 | 4,542 | 28 | 98.1 | 32 |
| Glyma.16G044100 | 16 | 4135885 | 4137742 | Glyma16g04830 | 16 | 4115033 | 4116923 | FLOWERING LOCUS T |
| Exon + UTR | 1,683 | 1,109 | 9 | 98.0 | 15, 19, 37 |
| Glyma.16G044200 | 16 | 4162525 | 4164824 | Glyma16g04840 | 16 | 4141774 | 4144073 | FLOWERING LOCUS T |
| Exon + UTR | 1,031 | 486 | 6 | 92.7 | 15, 37 |
| Glyma.16G150700 | 16 | 31109999 | 31114963 | Glyma16g26660 | 16 | 30741660 | 30746677 | FLOWERING LOCUS T |
| Exon + UTR | 1,515 | 935 | 9 | 99.4 | 9, 15, 18, 33, 37 |
| Glyma.16G151000 | 16 | 31148829 | 31151842 | Glyma16g26690 | 16 | 30780496 | 30783509 | FLOWERING LOCUS T |
| Exon + UTR | 860 | 464 | 5 | 88.0 | 15, 37 |
| Glyma.16G196300 | 16 | 35777815 | 35779317 | Glyma16g32080 | 16 | 35274147 | 35275762 | BROTHER OF FT AND TFL 1 |
| Exon + UTR | 1,554 | 1,038 | 6 | 99.0 | 37 |
| Glyma.16G200700 | 16 | 36179891 | 36187469 | Glyma16g32540 | 16 | 35676581 | 35684221 | MADS box protein |
| Exon + UTR | 2,534 | 1,095 | 15 | 98.8 | 48, 49 |
| Glyma.17G052100 | 17 | 3955518 | 3958432 | Glyma17g05990 | 17 | 4225839 | 4228888 | WD repeat-containing protein 61 |
| Exon + UTR | 1,743 | 1,318 | 9 | 100.0 | 50 |
| Glyma.17G090500 | 17 | 7052506 | 7053858 | Glyma17g09810 | 17 | 7317271 | 7318756 | Achaete-scute transcription factor-related |
| Exon + UTR | 1,594 | 1,166 | 9 | 100.0 | |
| Glyma.19G108100 | 19 | 36030632 | 36032867 | Glyma19g28390 | 19 | 35849981 | 35852216 | FLOWERING LOCUS T |
| Exon + UTR | 952 | 463 | 5 | 100.0 | 15, 37 |
| Glyma.19G108200 | 19 | 36049111 | 36051851 | Glyma19g28400 | 19 | 35868460 | 35871203 | FLOWERING LOCUS T |
| Exon + UTR | 1,113 | 560 | 6 | 90.0 | 15, 37 |
| Glyma.19G194300 | 19 | 45183357 | 45185175 | Glyma19g37890 | 19 | 44979743 | 44981657 | TERMINAL FLOWER 1 |
| Exon + UTR | 1,411 | 1,078 | 8 | 100.0 | 37, 53 |
| Glyma.19G224200 | 19 | 47633059 | 47641958 | Glyma19g41210 | 19 | 47511095 | 47520052 | Phytochrome A |
| Exon + UTR | 5,235 | 4,400 | 27 | 98.6 | 3 |
| Glyma.19G260400 | 19 | 50364718 | 50369677 | Glyma19g44970 | 19 | 50244046 | 50249070 | Pseudo-response regulator 5 |
| Gene | 5,673 | 4,941 | 31 | 98.3 | |
| Glyma.20G090000 | 20 | 33236018 | 33241692 | Glyma20g22160 | 20 | 32087412 | 32093306 | Phytochrome A |
| Exon + UTR | 4,871 | 4,076 | 25 | 99.2 | 3, 14 |
| Glyma.U034500 | Scaffold_32 | 197150 | 220019 | Glyma11g15580 | 11 | 11232271 | 11255186 | Two-component response regulator-like |
| Exon + UTR | 5,352 | 3,833 | 30 | 98.1 | |
Gene description from Phytozome 12.
Gene name refers to Kong et al., Wu et al., Fan et al., and Cao et al.
Total size of amplified region by designed amplicon primers.
Target size (bp) of amplicon based on soybean genome version 1.1 (Gmax189).
Total number of designed amplicons on target gene.
Percentage of target region covered by amplicon.
Glyma.04G143300 and Glyma.U034500on Gmax275 genome version were different chromosome positions on Gmax189.
The gene annotation manually curated.
‘Achaete-scute transcription factor related’ gene was included as control for variant detection.
Figure 1Evaluation of quality of DNA and AmpliSeq library prepared from the DNA using the Agilent 2200 TapeStation system. The AmpliSeq libraries were evaluated using the D1000 screen tape. (A) Quality of the DNA derived from the leaf and seeds. W and E indicate Williams 82 and Enrei, respectively. The numerical assessment of DNA quality ranged from 1 to 10 based on the DNA integrity number (DIN). A high DIN indicates highly intact DNA, whereas a low DIN indicates degraded DNA. (B) Distribution of amplicons in the AmpliSeq library shown as a gel image. (C) Electropherogram of the same AmpliSeq library as shown in (B). Lower (25 bp) and upper (1,500 bp) peak are the standard markers. The middle peak indicates the library.
Figure 2Comparison of sequence performance metrics classified by the plant materials used for DNA extraction. (A) The box plot of average read length, on-target rate, and uniformity. The on-target rate is on-target percent of the aligned reads. Uniformity is the percent of bases in all the amplicon-targeted regions covered by at least 0.2× the mean base read depth. (B) Average normalized reads (APKM) per sample across 382 amplicons generated from the 192 mini-core collection. The X and Y axes indicate 382 amplicons sorted in their read coverage and APKM shown as the mean on a log scale, respectively.
Figure 3Distribution of mapped sequence reads of two different alleles in the E3 gene region. (A) Top: Read coverage of Williams 82 with the default parameters of TMAP. Bottom: Read coverage of PGC010 with the default and optimized parameters of TMAP. The E3 gene of PGC010 has a large deletion in the fourth exon. (B) APKM of amplicons from variant (e3T) and wild type (WT) on the fourth exon of E3. Average APKM of four amplicons (left most) and APKM of each amplicon—(a) AMPL1040313, (b) AMPL1040314, (c) AMPL1037722, and (d) AMPL1036854.
Classification of detected variants of 28 flowering time-related genes by snpEff analysis
| Glyma ID | Description | Gene symbol | Number of variants in total | Number of novel variants | Number of known variants | Number of missense variants | Number of ‘high’ effect variants to gene function |
|---|---|---|---|---|---|---|---|
| Glyma.02G069500 | FLOWERING LOCUS T |
| 10 | 2 | 8 | 0 | 0 |
| Glyma.03G194700 | TERMINAL FLOWER 1 |
| 2 | 0 | 2 | 0 | 0 |
| Glyma.03G227300 | Phytochrome A |
| 70 | 5 | 65 | 35 | 5 |
| Glyma.04G143300 | B3 domain-containing protein |
| 3 | 1 | 2 | 0 | 0 |
| Glyma.04G156400 | B3 domain-containing protein |
| 4 | 0 | 4 | 0 | 0 |
| Glyma.06G207800 | B3 domain-containing protein |
| 5 | 2 | 3 | 3 | 2 |
| Glyma.08G363100 | FLOWERING LOCUS T |
| 9 | 2 | 7 | 0 | 0 |
| Glyma.08G363200 | FLOWERING LOCUS T |
| 6 | 1 | 5 | 0 | 0 |
| Glyma.09G035500 | Phytochrome B |
| 10 | 6 | 4 | 2 | 0 |
| Glyma.09G143500 | BROTHER OF FT AND TFL 1 |
| 1 | 1 | 0 | 0 | 0 |
| Glyma.10G141400 | Phytochrome A |
| 20 | 8 | 12 | 3 | 1 |
| Glyma.10G221500 | GIGANTEA |
| 2 | 1 | 1 | 1 | 1 |
| Glyma.12G073900 | Two-component response regulator-like |
| 18 | 3 | 15 | 5 | 1 |
| Glyma.15G140000 | Phytochrome B |
| 27 | 9 | 18 | 7 | 2 |
| Glyma.16G044100 | FLOWERING LOCUS T |
| 14 | 5 | 9 | 0 | 0 |
| Glyma.16G044200 | FLOWERING LOCUS T |
| 8 | 2 | 6 | 1 | 0 |
| Glyma.16G150700 | FLOWERING LOCUS T |
| 22 | 13 | 9 | 2 | 1 |
| Glyma.16G151000 | FLOWERING LOCUS T |
| 9 | 3 | 6 | 1 | 0 |
| Glyma.16G196300 | BROTHER OF FT AND TFL 1 |
| 20 | 5 | 15 | 1 | 0 |
| Glyma.16G200700 | MADS box protein |
| 45 | 15 | 30 | 7 | 0 |
| Glyma.17G052100 | WD repeat-containing protein 61 |
| 15 | 1 | 14 | 0 | 1 |
| Glyma.17G090500 | Achaete-scute transcription factor-related |
| 17 | 10 | 7 | 0 | 6 |
| Glyma.19G108100 | FLOWERING LOCUS T |
| 0 | 0 | 0 | 0 | 0 |
| Glyma.19G108200 | FLOWERING LOCUS T |
| 11 | 3 | 8 | 1 | 0 |
| Glyma.19G194300 | TERMINAL FLOWER 1 |
| 14 | 5 | 9 | 5 | 0 |
| Glyma.19G224200 | Phytochrome A |
| 16 | 8 | 8 | 5 | 3 |
| Glyma.19G260400 | Pseudo-response regulator 5 |
| 42 | 11 | 31 | 8 | 0 |
| Glyma.20G090000 | Phytochrome A |
| 16 | 2 | 14 | 1 | 0 |
| Glyma.U034500 | Two-component response regulator-like |
| 26 | 26 | 0 | 5 | 7 |
High-impact variant to gene function includes criteria of stop lost, stop gained, and frameshift.
DNA polymorphisms in nine genes significantly associated with flowering time variation in the mini-core collection as revealed by the single linear regression analysis
| Glyma ID | Gene description | Gene symbol | Chromosome | Position (bp) | Reference sequence | Alternative sequence | SNP name |
| Functional effect | Amino acid change | Functional class | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2011 | 2012 | 2013 | |||||||||||
| Glyma.06G207800 | B3 domain- containing protein |
| Chr06 | 20207322 | C | G | rs123612969 ( | 0.0226 | 0.0153 | 0.0333 | Missense_variant | p.Thr75Arg/c.224C>G | MISSENSE |
| Glyma.10G221500 | GIGANTEA |
| Chr10 | 45310798 | A | T | rs124971350 ( | 2E-16 | 2E-16 | 2E-16 | Stop-gained | p.Lys527*/c.1582A>T | HIGH |
|
| |||||||||||||
| Glyma.12G073900 | Two-component response regulator-like |
| Chr12 | 5508242 | T | G | rs125308101 | 0.00258 | 0.0082 | 0.0019 | Upstream_gene_ variant | ||
|
| Chr12 | 5508672 | CT | C | rs745192414 | 0.00155 | 0.00261 | 0.00503 | Intron_variant | c.-120 + 219delT | |||
|
| Chr12 | 5508702 | T | C | rs743387935 | 0.0211 | 0.0228 | 0.0109 | Intron_variant | c.-119-198T>C | |||
|
| Chr12 | 5509310 | G | A | rs388619068 | 0.0146 | 0.0119 | 0.00907 | Missense_variant | p.Asp98Asn/c.292G>A | MISSENSE | ||
|
| Chr12 | 5509317 | C | T | rs125308103 | 0.00107 | 0.00148 | 0.00319 | Missense_variant | p.Ser100Leu/c.299C>T | MISSENSE | ||
|
| Chr12 | 5519728 | A | C | rs125308115 | 0.00229 | 0.00749 | 0.00169 | Missense_variant | p.Lys378Gln/c.1132A>C | MISSENSE | ||
|
| Chr12 | 5520578 | A | G | rs389122657 | 0.014 | 0.0112 | 0.00865 | Synonymous_ variant | p.Ala504Ala/c.1512A>G | SILENT | ||
|
| Chr12 | 5520945 | T | C | rs125308117 | 8.96E-10 | 1.76E-08 | 7.74E-11 | Stop_lost | p.Ter627Glnext*?/c.1879T>C | MISSENSE | ||
| HIGH |
| Chr12 | 5521029 | G | T | rs743179814 | 0.0244 | 0.0203 | 0.0089 | 3_Prime_UTR_ variant | c.*82G>T | ||
| Glyma.15G140000 | Phytochrome B |
| Chr15 | 11436193 | G | A | rs388435741 | 5.17E-06 | 7.93E-06 | 8.47E-06 | 3_Prime_UTR_ variant | c.*218C>T | |
|
| Chr15 | 11441207 | C | T | rs126279495 | 5.17E-06 | 7.93E-06 | 8.47E-06 | Missense_variant | p.Val394Ile/c.1180G>A | MISSENSE | ||
|
| Chr15 | 11442400 | A | C | rs126279502 | 0.0753 | 0.0769 | 0.0431 | 5_Prime_UTR_ variant | c.-14T>G | |||
| Glyma.16G150700 | FLOWERING LOCUS T |
| Chr16 | 31110004 | TATAAGAAAGC | T | rs392064733_1 | 0.0397 | 0.117 | 0.0266 | 5_Prime_UTR_ variant | c.-50_-59delATAAGAAAGC | |
|
| Chr16 | 31110004 | TATAAGAAAGCA | TA | rs392064733_2 | 0.0397 | 0.117 | 0.0266 | 5_Prime_UTR_ variant | c.-49_-58delTAAGAAAGCA | |||
|
| Chr16 | 31110991 | A | AATAT | rs864598505_1 | 6.24E-05 | 0.000274 | 0.000111 | Intron_variant | c.202-56_202-55insATAT | |||
|
| Chr16 | 31111033 | T | A | rs126829817 | 6.24E-05 | 0.000274 | 0.000111 | Intron_variant | c.202-15T>A | |||
|
| Chr16 | 31111042 | T | C | rs126829818 | 0.0697 | 0.21 | 0.0419 | Splice_region_ variant | c.202T>C | |||
|
| Chr16 | 31111349 | A | G | rs126829819 | 6.24E-05 | 0.000274 | 0.000111 | Intron_variant | c.304 + 72A>G | |||
|
| Chr16 | 31114633 | G | A | Chr16_31114633 | 5.46E-06 | 4.14E-06 | 3.05E-07 | Missense_ variant | p.Gly169Asp/c.506G>A | MISSENSE | ||
|
| Chr16 | 31114658 | AGA | AA | Chr16_31114658_2 | 0.0765 | 0.0538 | 0.043 | 3_Prime_UTR_ variant | c.*1delG | |||
|
| Chr16 | 31114930 | G | T | rs126829846 | 6.24E-05 | 0.000274 | 0.000111 | 3_Prime_UTR_ variant | c.*272G>T | |||
| Glyma.16G200700 | MADS box protein |
| Chr16 | 36179909 | T | G | rs126888526 | 1.13E-07 | 3.18E-06 | 3.35E-07 | 3_Prime_UTR_ variant | c.*218A>C | |
|
| Chr16 | 36180002 | G | A | rs126888527 | 1.09E-07 | 1.63E-06 | 1.98E-07 | 3_Prime_UTR_ variant | c.*125C>T | |||
|
| Chr16 | 36180087 | T | TGA | Chr16_36180087 | 8.8E-08 | 2.16E-06 | 2.07E-07 | 3_Prime_UTR_ variant | c.*39_*40insTC | |||
|
| Chr16 | 36180122 | A | C | rs126888528 | 4.19E-07 | 9.01E-06 | 1.03E-06 | 3_Prime_UTR_ variant | c.*5T>G | |||
|
| Chr16 | 36180183 | T | A | rs389022394 | 4.96E-07 | 1.03E-05 | 1.32E-06 | Missense_variant | p.Thr219Ser/c.655A>T | MISSENSE | ||
|
| Chr16 | 36182854 | T | TA | Chr16_36182854 | 6.15E-06 | 4.11E-05 | 3.79E-06 | Intron_variant | c.511-49_511-50insT | |||
|
| Chr16 | 36182857 | T | A | rs126888574 | 8.41E-07 | 7.3E-06 | 5.2E-07 | Intron_variant | c.511-52A>T | |||
|
| Chr16 | 36183423 | T | G | rs126888580_2 | 6.9E-06 | 5.27E-06 | 2.61E-06 | Intron_variant | c.427-19A>C | |||
|
| Chr16 | 36183426 | G | C | rs126888581_1 | 0.00286 | 0.00159 | 0.0018 | Intron_variant | c.427-22C>G | |||
|
| Chr16 | 36183435 | G | C | rs126888582 | 6.8E-06 | 5.07E-06 | 2.66E-06 | Intron_variant | c.427-31C>G | |||
|
| Chr16 | 36183450 | G | A | rs126888583 | 0.0104 | 0.00721 | 0.00804 | Intron_variant | c.427-46C>T | |||
|
| Chr16 | 36183510 | C | T | rs126888584 | 0.000219 | 0.00275 | 0.000461 | Intron_variant | c.427-106G>A | |||
|
| Chr16 | 36183518 | A | G | rs126888585 | 3.22E-10 | 9.24E-09 | 3.46E-10 | Intron_variant | c.427-114T>C | |||
|
| Chr16 | 36183541 | C | A | rs126888586 | 2.98E-10 | 8.83E-09 | 3.27E-10 | Intron_variant | c.427-137G>T | |||
|
| Chr16 | 36183556 | T | C | Chr16_36183556 | 0.0507 | 0.0363 | 0.048 | Intron_variant | c.427-152A>G | |||
|
| Chr16 | 36183568 | T | A | rs126888588 | 0.00193 | 0.0172 | 0.00328 | Intron_variant | c.427-164A>T | |||
|
| Chr16 | 36183573 | T | A | rs126888589 | 0.00193 | 0.0172 | 0.00328 | Intron_variant | c.427-169A>T | |||
|
| Chr16 | 36184165 | T | C | rs126888605 | 4.71E-05 | 3.54E-05 | 0.000019 | Intron_variant | c.327-41A>G | |||
|
| Chr16 | 36184729 | T | C | rs126888609 | 2.18E-09 | 2.34E-08 | 6.84E-10 | Missense_variant | p.Thr79Ala/c.235A>G | MISSENSE | ||
|
| Chr16 | 36184733 | C | T | rs126888610 | 2.18E-09 | 2.34E-08 | 6.84E-10 | Synonymous_ variant | p.Ser77Ser/c.231G>A | SILENT | ||
|
| Chr16 | 36184819 | A | T | rs126888611 | 0.0341 | 0.0817 | 0.0244 | Intron_variant | c.183-38T>A | |||
|
| Chr16 | 36187211 | A | C | rs744233319 | 0.000287 | 0.000744 | 0.00118 | Synonymous_ variant | p.Ser36Ser/c.108T>G | SILENT | ||
|
| Chr16 | 36187552 | T | G | rs126888636 | 0.0383 | 0.0315 | 0.0302 | Upstream_gene_ variant | ||||
|
| Chr16 | 36187600 | A | G | rs126888637 | 0.0383 | 0.0315 | 0.0302 | Upstream_gene_ variant | ||||
| Glyma.17G052100 | WD repeat-containing protein 61 |
| Chr17 | 3955280 | A | AT | rs126942305_1 | 0.00214 | 0.00191 | 0.00197 | Downstream_gene_ variant | ||
|
| Chr17 | 3955411 | A | G | rs126942313 | 2.56E-10 | 3.12E-10 | 1.07E-10 | Downstream_gene_ variant | ||||
|
| Chr17 | 3955475 | TC | AA | rs126942314_1 | 0.00359 | 0.00645 | 0.00279 | Downstream_gene_ variant | ||||
|
| Chr17 | 3955476 | C | A | rs126942315 | 0.00359 | 0.00645 | 0.00279 | Downstream_gene_ variant | ||||
|
| Chr17 | 3955546 | T | A | rs126942316 | 0.00374 | 0.00157 | 0.00264 | 3_Prime_UTR_ variant | c.*125A>T | |||
|
| Chr17 | 3955554 | T | C | rs126942317 | 0.031 | 0.0299 | 0.0358 | 3_Prime_UTR_ variant | c.*117A>G | |||
|
| Chr17 | 3955716 | A | G | rs388258139 | 0.00418 | 0.00761 | 0.00338 | Synonymous_ variant | p.Ala307Ala/c.921T>C | SILENT | ||
|
| Chr17 | 3955763 | C | CG | Chr17_3955763 | 0.00444 | 0.00194 | 0.00318 | Frameshift_ variant | p.Val291_Ala292fs/c.873_874insC | HIGH | ||
|
| Chr17 | 3955764 | A | G | rs126942318 | 0.00374 | 0.00157 | 0.00264 | Synonymous_ variant | p.Val291Val/c.873T>C | SILENT | ||
|
| Chr17 | 3955884 | G | C | rs126942319 | 0.00374 | 0.00157 | 0.00264 | Synonymous_ variant | p.Val251Val/c.753C>G | SILENT | ||
|
| Chr17 | 3956142 | T | C | rs126942321 | 6.36E-08 | 3.72E-08 | 1.25E-08 | Synonymous_ variant | p.Ala165Ala/c.495A>G | SILENT | ||
|
| Chr17 | 3956163 | T | C | rs126942322 | 0.00644 | 0.00295 | 0.00466 | Synonymous_ variant | p.Lys158Lys/c.474A>G | SILENT | ||
|
| Chr17 | 3958319 | C | G | rs126942340 | 9.03E-10 | 4.86E-10 | 1.98E-10 | Synonymous_ variant | p.Ser16Ser/c.48G>C | SILENT | ||
|
| Chr17 | 3958374 | T | G | rs126942341 | 2.32E-10 | 9.96E-11 | 5.12E-11 | 5_Prime_UTR_ variant | c.-8A>C | |||
| Glyma.19G194300 | TERMINAL FLOWER 1 |
| Chr19 | 45183701 | T | A | rs127928573 ( | 0.00122 | 0.00587 | 0.000401 | Missense_variant | p.Arg166Trp/c.496A>T | MISSENSE |
|
| Chr19 | 45183808 | C | T | rs745009806 | 0.0319 | 0.0184 | 0.0153 | Missense_variant | p.Arg130Lys/c.389G>A | MISSENSE | ||
|
| Chr19 | 45183859 | G | A | rs127928574 | 4.29E-11 | 3.71E-11 | 6.02E-10 | Missense_variant | p.Pro113Leu/c.338C>T | MISSENSE | ||
|
| Chr19 | 45184581 | G | T | Chr19_45184581 | 0.014 | 0.153 | 0.0428 | Splice_region_ variant | c.202C>A | |||
|
| Chr19 | 45185131 | C | T | Chr19_45185131 | 0.00805 | 0.00351 | 0.00764 | 5_Prime_UTR_ variant | c.-142G>A | |||
| Glyma.19G224200 | Phytochrome A |
| Chr19 | 47633086 | T | TA | Chr19_47633086 | 0.000763 | 0.000353 | 0.00085 | 5_Prime_UTR_ variant | c.-693_-694insA | |
|
| Chr19 | 47634596 | A | G | rs127944661 | 0.0052 | 0.00108 | 0.00429 | Synonymous_ variant | p.Ser43Ser/c.129A>G | SILENT | ||
|
| Chr19 | 47635025 | C | T | rs388644281 | 3.18E-05 | 3.43E-05 | 3.69E-05 | Synonymous_ variant | p.Ile186Ile/c.558C>T | SILENT | ||
|
| Chr19 | 47635737 | C | A | rs389001110 | 0.0534 | 0.052 | 0.048 | Missense_variant | p.Leu424Ile/c.1270C>A | MISSENSE | ||
|
| Chr19 | 47636564 | G | A | rs127944664 | 0.0098 | 0.00232 | 0.00888 | Intron_variant | c.2074 + 23G>A | |||
|
| Chr19 | 47636607 | G | T | rs127944665 | 0.0127 | 0.0031 | 0.0113 | Intron_variant | c.2074 + 66G>T | |||
|
| Chr19 | 47637258 | A | G | rs393405985_1 | 0.044 | 0.0447 | 0.0427 | Missense_variant | p.Thr832Ala/c.2494A>G | MISSENSE | ||
|
| Chr19 | 47638302 | G | A | Chr19_47638302 ( | 0.0433 | 0.0306 | 0.0485 | Missense_variant | p.Gly1050Arg/c.3148G>A | MISSENSE | ||
|
| Chr19 | 47638344 | A | T | rs389636522 | 0.00508 | 0.00113 | 0.00433 | Splice_region_ variant | c.3183A>T | |||
|
| Chr19 | 47641562 | 15 kb deletion |
| 1.9E-13 | 1.61E-13 | 5.15E-14 | Loss of exon 4 | HIGH | ||||
| Glyma.U034500 | Two-component response regulator-like |
| Scaffold_32 | 197169 | C | T | Scaffold_32_197169 | 0.000318 | 0.000771 | 0.000612 | 3_Prime_UTR_ variant | c.*958G>A | |
|
| Scaffold_32 | 197421 | T | C | Scaffold_32_197421 | 0.000034 | 0.00007 | 8.54E-05 | 3_Prime_UTR_ variant | c.*706A>G | |||
|
| Scaffold_32 | 197459 | A | T | Scaffold_32_197459 | 1.23E-09 | 1.94E-08 | 2.71E-09 | 3_Prime_UTR_ variant | c.*668T>A | |||
|
| Scaffold_32 | 198053 | T | A | Scaffold_32_198053 | 2.16E-05 | 4.99E-05 | 0.000048 | Intron_variant | c.*133 + 29A>T | |||
|
| Scaffold_32 | 198773 | A | AT | Scaffold_32_198773 | 1.85E-06 | 3.75E-06 | 2.85E-06 | Frameshift_variant | p.Met737_Ala738fs/c.2209_2210insA | HIGH | ||
|
| Scaffold_32 | 199551 | A | C | Scaffold_32_199551 | 4.88E-09 | 6.54E-08 | 1.23E-08 | Synonymous_ variant | p.Ala529Ala/c.1587T>G | SILENT | ||
|
| Scaffold_32 | 199605 | T | G | Scaffold_32_199605 | 2.16E-05 | 4.99E-05 | 0.000048 | Intron_variant | c.1575-42A>C | |||
|
| Scaffold_32 | 199656 | TA | T | Scaffold_32_199656 | 0.0096 | 0.0104 | 0.011 | Intron_variant | c.1574 + 2delT | |||
|
| Scaffold_32 | 199722 | C | G | Scaffold_32_199722 | 0.000318 | 0.000771 | 0.000612 | Missense_variant | p.Gly504Ala/c.1511G>C | MISSENSE | ||
|
| Scaffold_32 | 202754 | G | A | Scaffold_32_202754 | 0.0142 | 0.01 | 0.0105 | Stop_gained | p.Arg308*/c.922C>T | |||
| HIGH |
| Scaffold_32 | 206717 | A | G | Scaffold_32_206717 | 1.23E-09 | 1.94E-08 | 2.71E-09 | Intron_variant | c.792 + 27T>C | ||
|
| Scaffold_32 | 216494 | C | T | Scaffold_32_216494 | 4.88E-09 | 6.54E-08 | 1.23E-08 | Synonymous_ variant | p.Val168Val/c.504G>A | SILENT | ||
|
| Scaffold_32 | 218380 | A | T | Scaffold_32_218380 | 3.29E-09 | 4.29E-08 | 8.4E-09 | Synonymous_ variant | p.Pro73Pro/c.219T>A | SILENT | ||
Gene names refer to those in Kong et al., Fan et al., and Cao et al.
Figure 4Detected variants in E1, E3, E4, FT5a, and FT2a from 192 mini-core collection. The grey and white boxes indicate UTRs and exons. The solid lines indicate 5′-uptream and intron regions. The black and grey circles indicate loss-of-function (described as ‘HIGH’ impact on the gene function in Supplementary Table S8) and missense variants. Braces indicate known large InDels. These InDels can be detected by read depth.
DNA polymorphisms responsible for flowering time variation in the mini-core collection as revealed by multiple regression analysis
| Data set | SNP No. | SNP name | Physical position | Glyma ID | Description | Alleles | Effect | MAF | Parameters estimated by linear regression | Contribution rate (%) | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chromosome | bp | β |
|
| |||||||||
| 2011 | |||||||||||||
| E2 | rs124971350 ( | Chr10 | 45310798 | Glyma.10G221500 | GIGANTEA (E2) | A/T | Stop_gained | 0.42 | −5.21 | 1.51 | 2.5E-03** | 21 | |
| SNP379 |
| Chr19 | 47641562 | Glyma.19G224200 | Phytochrome A (E3) | Large deletion | Loss of exon 4 | 0.17 | −9.88 | 3.05 | 3.9E-03** | 19 | |
| SNP156 | rs125308117 | Chr12 | 5520945 | Glyma.12G073900 | Two-component response regulator-like | T/C | Stop_lost | 0.31 | 11.24 | 3.53 | 4.5E-03** | 12 | |
| 2012 | |||||||||||||
| E2 | rs124971350 ( | Chr10 | 45310798 | Glyma.10G221500 | GIGANTEA (E2) | A/T | Stop_gained | 0.42 | −5.44 | 1.47 | 1.3E-03** | 22 | |
| SNP379 |
| Chr19 | 47641562 | Glyma.19G224200 | Phytochrome A (E3) | Large deletion | Loss of exon 4 | 0.17 | −8.50 | 2.95 | 9.0E-03** | 19 | |
| SNP156 | rs125308117 | Chr12 | 5520945 | Glyma.12G073900 | Two-component response regulator-like | T/C | Stop_lost | 0.31 | 9.65 | 3.42 | 1.0E-02* | 10 | |
| 2013 | |||||||||||||
| E2 | rs124971350 ( | Chr10 | 45310798 | Glyma.10G221500 | GIGANTEA (E2) | A/T | Stop_gained | 0.42 | −5.36 | 1.66 | 4.0E-03** | 20 | |
| SNP379 |
| Chr19 | 47641562 | Glyma.19G224200 | Phytochrome A (E3) | Large deletion | Loss of exon 4 | 0.17 | −12.18 | 3.34 | 1.5E-03** | 13 | |
| SNP156 | rs125308117 | Chr12 | 5520945 | Glyma.12G073900 | Two-component response regulator-like | T/C | Stop_lost | 0.31 | 12.69 | 3.86 | 3.5E-03** | 19 | |
| SNP349 | rs127928573 | Chr19 | 45183701 | Glyma.19G194300 | TERMINAL FLOWER 1 | T/A | Missense_variant | 0.09 | 11.23 | 5.11 | 3.9E-02* | 0.5 | |
MAF: minor allele frequency; s.e.: standard error.
Physical position on Gmax275.
Gene description was obtained from Phytozome 12.
Effect to gene function annotated by snpEff. Effect of AMPL1040314 was defined by manually.
Standardized regression coefficients.
Adjusted P-value was obtained from multivariate models days to flowering and genotype as covariates. Signification codes: ‘**’ 0.01 ‘*’ 0.05.
Large deletion on E3 estimated by coverage of four amplicon on 4th exon.