| Literature DB >> 34978122 |
Christina E Fliege1, Russell A Ward1,2, Pamela Vogel3,4, Hanh Nguyen5, Truyen Quach5, Ming Guo3, João Paulo Gomes Viana1, Lucas Borges Dos Santos1, James E Specht3, Tom E Clemente3, Matthew E Hudson1, Brian W Diers1.
Abstract
Entities:
Keywords: Glycine max (L.) Merr.; QTL; fine mapping; gene cloning; seed protein; soybean
Mesh:
Year: 2022 PMID: 34978122 PMCID: PMC9303569 DOI: 10.1111/tpj.15658
Source DB: PubMed Journal: Plant J ISSN: 0960-7412 Impact factor: 7.091
List of 19 soybean chromosome 20 markers (Bolon et al., 2010), ordered by their base pair (bp) positions (Williams 82 assemblies – see www.soybase.org), that were used to characterize 13 populations of BC5F7 plants and their descendent BC5F7:8 lines in the first round of fine mapping (Figure S1). Each of the 13 BC5F7 populations was developed from a separate BC5F6 plant and the genotype of each of these parent plants for the chromosome 20 markers are show below. If the BC5F6 plant thepopulation was developed from was homozygous for the donor parent (G. soja) high protein allele, genotypic code B was used, A was used when the plant was homozygous for the recurrent parent (G. max) allele, and H when the plant was heterozygous. Probabilities are given for whether the marker segregation in each population was significantly associated with protein based on field tests and an arrow is pointed in the direction, relative to crossovers, to denote where the QTL was located. The region where these tests show the QTL is located is shaded in grey
| BC5F7 Population No. | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Marker name |
GMax1.01 bp position |
GMax2.0 bp position | 2 | 3 | 10 | 14 | 22 | 4 | 6 | 13 | 17 | 20 | 11 | 15 | 19 |
| Satt239 | 24,129,682 | 25,275,083 | H | H | H | H | H | A | A | B | A | A | B | B | A |
| ↓ | |||||||||||||||
| ssrpqtl_4 | 24,812,334 | 25,971,714 | H | H | H | H | H | A | A | B | A | A | B | B | H |
| ssrpqtl_8 | 25,751,901 | 26,920,157 | H | H | H | H | H | A | A | B | A | A | B | B | H |
| ssrpqtl_11 | 26,270,814 | 27,439,056 | H | H | H | H | H | A | A | B | A | A | B | B | H |
| ssrpqtl_13 | 26,444,803 | 27,606,228 | H | H | H | H | H | A | A | B | A | A | B | B | H |
| ↓ | |||||||||||||||
| ssrpqtl_14 | 26,538,403 | 27,699,841 | H | H | H | H | H | A | A | B | A | A | B | H | H |
| ssrpqtl_15 | 26,542,454 | 27,703,952 | H | H | H | H | H | A | A | B | A | A | B | H | H |
| ↓ | |||||||||||||||
| ssrpqtl_16 | 26,609,299 | 27,770,740 | H | H | H | H | H | A | A | B | A | A | H | H | H |
| ssrpqtl_17 | 26,649,308 | 27,810,743 | H | H | H | H | H | A | A | B | B | A | H | H | H |
| ↓ | ↓ | ↓ | ↓ | ↓ | ↓ | ↓ | ↓ | ↓ | ↓ | ||||||
| ssrpqtl_18 | 26,958,336 | 28,124,804 | B | A | B | A | B | H | H | H | H | H | H | H | H |
| ssrpqtl_25 | 30,489,918 | 31,627,304 | B | A | B | A | B | H | H | H | H | H | H | H | H |
| ssrpqtl_29 | 31,787,239 | 32,934,791 | B | A | B | A | B | H | H | H | H | H | H | H | H |
| ssrpqtl_32 | 31,992,972 | 33,141,346 | B | A | B | A | B | H | H | H | H | H | H | H | H |
| ssrpqtl_33 | 32,022,042 | 33,170,565 | B | A | B | A | B | H | H | H | H | H | H | H | H |
|
|
|
|
|
|
| ||||||||||
| ssrpqtl_34 | 32,178,223 | 33,326,612 | A | A | A | A | B | A | A | A | A | H | A | A | H |
| ssrpqtl_35 | 32,216,450 | 33,359,151 | A | A | A | A | B | A | A | A | A | H | A | A | H |
| ssrpqtl_36 | 32,384,780 | 33,526,075 | A | A | A | A | B | A | A | A | A | H | A | A | H |
| ssrpqtl_37 | 32,717,564 | 33,858,592 | A | A | A | A | B | A | A | A | A | H | A | A | H |
|
|
| ||||||||||||||
| ssrpqtl_38 | 32,910,185 | 34,049,358 | A | A | A | A | A | A | A | A | A | A | A | A | A |
| 2008 Prob > F | NS | NS | NS | NS | NS |
|
|
|
|
|
|
|
| ||
| 2009 Prob > F | NS | NS | NS | NS | NS |
|
| ‐ | ‐ |
| ‐ | ‐ | ‐ | ||
Significance of the marker association test in each population based on field testing of each population of BC5F7 plants in the field 2008 and each population of BC5F7:8 lines in the field in 2009. NS not significant,
Significant at 0.05 probability,
Significant at 0.01 probability, and dash (–) not tested.
List of 17 soybean chromosome 20 markers, ordered by their bp positions (Williams 82 assemblies – see www.soybase.org), that were used to characterize 11 populations of BC5F9 plants grown in 2011 and their descendent BC5F9:10 and BC5F9:11 lines grown in 2012 and 2013, respectively, in the second round of fine mapping. Each of the 11 BC5F9 populations was developed from a separate BC5F8 plant and the genotype of each of these parent plants for the chromosome 20 markers are show below. If the BC5F9 plant the population was developed from was homozygous for the donor parent (G. soja) high protein allele, genotypic code B was used, A was used when the plant was homozygous for the recurrent parent (G. max) allele, and H when the plant was heterozygous. Probabilities are given for whether the marker segregation in each population was significantly associated with protein based on field tests and an arrow is pointed in the direction, relative to crossovers, that the QTL is located. The region where these tests show the QTL is located is shaded in grey. The markers are from Bolon et al. (2010) and Song et al. (2010) and for brevity, the prefix BARCSOYSSR was dropped from the names of markers from Song et al.
| BC5F9 Population No. | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Marker name |
Gmax1.01 bp Position |
Gmax2.0 bp Position | 3 | 5 | 7 | 9 | 10 | 15 | 26 | 27 | 28 | 31 | 34 |
| ssrpqtl_17 | 26,649,365 | Not avail. | H | H | H | H | H | H | A | A | B | A | A |
| ↓ | |||||||||||||
| 20_0599 | 26,829,294 | 27,991,409 | H | H | H | H | H | H | A | A | B | A | H |
| 20_0616 | 27,811,875 | 28,974,676 | H | H | H | H | H | H | A | A | B | A | H |
| ↓ | |||||||||||||
| 20_0617 | 27,877,620 | 29,040,539 | H | H | H | H | H | H | A | A | B | H | H |
| 20_0636 | 28,972,334 | 30,134,877 | H | H | H | H | H | H | A | A | B | H | H |
| ↓ | |||||||||||||
| 20_0647 | 29,643,301 | 30,793,572 | A | H | H | H | H | H | A | A | B | H | H |
| 20_0650 | 29,758,405 | 30,909,346 | A | H | H | H | H | H | A | A | B | H | H |
| ↓ | |||||||||||||
| 20_0655 | 30,052,089 | 31,198,164 | A | B | H | H | H | H | A | A | B | H | H |
| ↓ | |||||||||||||
| 20_0657 | 30,187,698 | 31,333,773 | A | B | A | H | H | H | A | A | B | H | H |
| 20_0667 | 30,489,863 | 31,627,414 | A | B | A | H | H | H | A | A | B | H | H |
| ↓ | |||||||||||||
| 20_0668 | 30,517,621 | 31,655,159 | A | B | A | H | H | H | A | A | H | H | H |
| 20_0670 | 30,606,609 | 31,744,150 | A | B | A | H | H | H | A | A | H | H | H |
| ↓ |
| ||||||||||||
| 20_0674 | 30,684,404 | 31,821,947 | A | B | A | B | H | H | A | H | H | H | H |
|
|
| ||||||||||||
| 20_0678 | 30,754,018 | 31,891,560 | A | B | A | B | A | H | H | H | H | H | H |
| 20_0715 | 32,030,122 | 33,178,717 | A | B | A | B | A | H | H | H | H | H | H |
|
| |||||||||||||
| 20_0718 | 32,178,222 | 33,326,648 | A | B | A | B | A | A | H | H | H | H | H |
| ssrpqtl_34 | 32,178,303 | Not avail. | A | B | A | B | A | A | H | H | H | H | H |
| 2011 Prob > F | NS | NS | NS | ‐ |
|
| ‐ | ‐ |
|
|
| ||
| 2012 Prob > F | NS | NS |
| NS | NS |
| ‐ | ‐ | |||||
| 2013 Prob > F |
| NS | ‐ | ‐ | ‐ | NS |
| ‐ | ‐ | ||||
Significance of the marker association test based on field testing of each population of BC5F9 plants in the field 2011 and each population of BC5F9:10 and BC5F9:11 lines in the field in 2012 and 2013, respectively. NS denotes not significant,
Significant at 0.05 probability,
Significant at 0.01 probability, and dash (–) not tested.
Figure 1Molecular characterization of the genetic interval of the cqSeed protein‐003 interval.
(a) Diagram showing the position of assembled scaffolds within the interval, and annotated genes. Region to the right of the interval contains a repetitive sequence that does not code for protein.
(b) Expanded view of Glyma.20G085100, showing the respective 321 bp insertion in Williams 82 compared with the PI 468916 sequence.
(c) Diagram showing the primer combination used to develop a codominant marker for this polymorphism.
(d) Agarose gel showing genotyping of part of the population of soybean accessions in Table 3 using the marker.
List of soybean germplasm accessions that have been reported to be homozygous for either the high (H) or low (L) protein allele for a QTL in the chromosome 20 cqSeed protein‐003 QTL region. This characterization was based on the use of each accession as a parent in a biparental population in a QTL mapping study (for details, see the cited report in the Source column). A polymerase chain reaction‐based marker diagnostic assay was used to characterize each listed accession as to whether it possessed the PI 468916 Glycine soja (+) allele or the Williams 82 Glycine max (−) allele at each the two Glyma. 20 genes that were inferred in the present study to be potential candidates for the QTL. Haplotypes were determined using the marker from Glyma.20G085100 and the two closest markers flanking each side of the gene. The haplotype designations are defined in the Experimental procedures section
| Accession name | Accession number | Source | Allele at cqSeed protein‐003 |
|
| Haplotype |
|---|---|---|---|---|---|---|
| − | PI 468916 | Diers et al. ( | H | + | + | 5 |
| − | PI 326582A | Chaky et al. ( | H | + | + | 6 |
| Kosodiguri Extra Early | FC 30687 | P5 | H | + | + | 5 |
| Akazu | PI 91725–4 | P34 | H | + | + | 5 |
| N‐34 | PI 153293 | P6 | H | + | + | 5 |
| V‐4 | PI 153296 | P1 | H | + | + | 5 |
| V‐6 | PI 153297 | P14 | H | + | − | 5 |
| V‐14 | PI 153301 | P12 | H | + | − | 5 |
| V‐16 | PI 153302 | P9 | H | + | + | 5 |
| No. 51 | PI 154196 | P23 | H | + | + | 5 |
| − | PI 159764 | P10 | H | + | + | 5 |
| No. 58 | PI 181571 | P20 | H | + | + | 2 |
| Bitterhof | PI 189880 | P13 | L | − | + | 1 |
| Geant Vert | PI 189963 | P2 | H | + | − | 5 |
| Kariho‐takiya | PI 243532 | P36 | H | + | + | 2 |
| No. 17 | PI 253666A | P40 | H | + | + | 2 |
| Wasedaizu No. 1 | PI 261469 | P19 | H | + | − | 2 |
| − | PI 340011 | P35 | H | + | − | 2 |
| Oshimashirome | PI 360843 | P39 | L | − | − | 1 |
| Ronset 4 | PI 372423 | P4 | H | + | − | 5 |
| KAERI‐GNT 310–1 | PI 398516 | P33 | L | − | − | 1 |
| KAS 330–9‐1 | PI 398704 | P44 | H | + | − | 2 |
| − | PI 398881 | Diers et al. (in prep) | L | − | − | 1 |
| KLS 630–1 | PI 398970 | P45 | L | − | − | 2 |
| Huaj an si er dian | PI 404188A | Diers et al. (in prep) | L | − | − | 1 |
| KAS 330–9‐2 | PI 407773B | P47 | H | + | + | 2 |
| ORD 8113 | PI 407788A | P41 | H | + | − | 2 |
| − | PI 407823 | P46 | L | − | − | 1 |
| KAERI 511–11 | PI 407877B | P43 | (L) | (+) | − | 2 |
| KAS 640–7 | PI 408138 C | P37 | H | + | − | |
| Saikai 1 | PI 423942 | P28 | H | + | − | 2 |
| Saikai 18 | PI 423948A | P29 | H | + | − | 2 |
| Saikai 20 | PI 423949 | P25 | H | + | − | 2 |
| Shirome | PI 423954 | P22 | (L) | (+) | − | 2 |
| Shirome | PI 424148 | P21 | (L) | (+) | − | 2 |
| KAS 239–4 | PI 424286 | P42 | L | − | − | 2 |
| Backchung No. 42 | PI 427136 | Diers et al. (in prep) | L | − | − | 1 |
| Choseng No. 1 | PI 427138 | P18 | H | + | + | 2 |
| Seuhae No. 20 | PI 427141 | P26 | H | + | + | 2 |
| DV‐147 | PI 437088A | P24 | H | + | − | 2 |
| VIR 249 | PI 437112A | P30 | L | − | − | 1 |
| VNIISC‐4 | PI 437169B | Diers et al. (in prep) | L | − | − | 1 |
| Sjuj‐dja‐pyn‐da‐do | PI 437716A | P27 | H | + | − | 2 |
| Ronest 4 | PI 438415 | P11 | H | + | + | 5 |
| Szu yueh pa | PI 445845 | P32 | H | + | − | 2 |
| KAS 578–1 | PI 458256 | P48 | L | − | + | 1 |
| NS‐20 | PI 518751 | Diers et al. (in prep) | L | − | − | 1 |
| Provar | PI 548608 | P31 | L | − | − | 1 |
| Fen dou 14 | PI 561370 | Diers et al. (in prep) | L | − | − | 1 |
| Williams 82 | PI 518671 | Kim et al. ( | L | − | − | 1 |
| A81‐355012 | A81‐355012 | Diers et al. ( | L | − | − | |
| Ina | PI 606749 | Unpublished | L | − | − | |
| Danbaekkong | PI 619083 | Warrington et al. ( | H | + | Unknown |
Study in which a significant QTL was mapped in the cqSeed protein‐003 interval and the genotype was a parent. Diers et al. (in prep) refers to unpublished results in a SoyNAM population (Diers et al. 2018), Chaky refers to Chaky et al. (2003), Warrington refers to Warrington et al. (2015), Kim refers to Kim et al. (2016), and P followed by a number refers to Phansak et al. (2016) with the number corresponding to the population number in that study.
Figure 2Population biology of the insertion/deletion (indel) polymorphism at Glyma.20G085100.
(a) Linkage disequilibrium (LD) around the Glyma.20G085100 locus. Left panel: using a population of diverse accessions (Tables 3 and S4) the R 2 LD values around the locus were calculated using data from the SoySNP50k array (Song et al. 2013); note there is a region of LD surrounding the Glyma.20G085100 gene indicated by the arrow (left panel). Right panel: indel genotype determined with a polymerase chain reaction‐based marker detecting the insertional polymorphism in the CCT‐domain gene (CCT marker) is also added. It is clear the indel locus is not in strong LD with surrounding markers. (b) Principal components analysis of the accessions studied here for protein content. First and third principal components were calculated from whole‐genome SoySNP50k array marker information for each accession and plotted, and the points representing accessions colored by the genotype at the four single nucleotide polymorphisms (SNPs) in (a) in LD with the Glyma.20G085100 locus plus the Glyma.20G085100 indel marker. Accessions with the PI 468916 sequence across the locus are denoted in red and Williams 82 sequence in blue. Lines were also detected that were homozygous or heterozygous for the PI 468916 version of the indel in Glyma.20G085100, but carried the flanking haplotype of markers identical to Williams 82, denoted in light blue or green. Broad distribution of green points indicates likely reversion by transposon excision.
Figure 3Sequence of the insertion/deletion polymorphism.
The 321‐bp indel polymorphism in the Glyma.20g085100 gene responsible for the cqSeed protein‐003 quantitative trait loci.
(a) A 321‐bp sequence with transposon similarity is present in the Williams 82 genome and not the genome of the high‐protein accession PI 468916.
(b) Predicted impact of the indel shown in A on the intron–exon structure of the Glyma.20g085100 transcript. Internal coding exons are shown as gray blocks (Exon 1 is identical and not shown), terminal exon as a blue block, and polyadenylation signal as a green diamond. Positions are in base pairs from the transcription start site, as in (A). The orange box shows the region of the insertion.
(c) Multiple alignment of related CCT domain proteins. Published protein sequence of Glyma.20g085100 is shown (†) along with our annotation of the sequence from PI 468916 (‡) and a re‐annotated version of the Williams 82 sequence using the same methods as used for the PI 468916 sequence (§) aligned to related proteins in GenBank. Note that the sequence of PI 468916 is closely conserved with related proteins.
. Associations between RNAi hairpin transgene and seed protein percentage on a dry weight basis in g kg−1 for three populations of greenhouse grown T2 plants
| Transgene present | Transgene not present | ||||
|---|---|---|---|---|---|
| Population |
| Mean |
| Mean | Pr > F |
| 1157‐1‐T1‐3 | 18 | 417 | 5 | 399 | 0.05 |
| 1146‐5‐T1‐4 | 25 | 444 | 5 | 415 | 0.04 |
| 1146‐5‐T1‐5 | 20 | 441 | 5 | 409 | 0.02 |
Number of plants tested that did or did not have the transgene present.
Significance level of the test to detect differences between the present and not present groups.