Dan Zhang, Haiyan Lü, Shanshan Chu, Huairen Zhang, Hengyou Zhang, Yuming Yang, Hongyan Li, Deyue Yu.
Abstract
Water-soluble protein content (WSPC) is a critical factor in both soybean protein quality and functionality. However, the underlying genetic determinants are unclear. Here, we used 219 soybean accessions and 152 recombinant inbred lines, genotyped with high-density markers and phenotyped in multiple environments, to dissect the genetic architectures of WSPC and protein content (PC) using single- and multi-locus genome-wide association studies. In total, 32 loci, including 10 novel loci, were significantly associated with WSPC and PC across multiple environments and were subsequently validated by linkage mapping. Among these loci, only four exhibited pleiotropic effects on PC and WSPC, which explains the low correlation coefficient between the two traits. The largest-effect WSPC-specific locus, GqWSPC8, was stably identified across all six environments and tagged to a linkage disequilibrium block comprising two promising candidate genes, AAP8 and 2S albumin, which might contribute to the high level of WSPC in some soybean varieties. In addition, two genes, Glyma.13G123500 and Glyma.13G194400, with relatively high expression levels at the seed development stage compared with other tissues, were regarded as promising candidates associated with PC and WSPC, respectively. Our results provide new insights into the genetic basis of WSPC affecting soybean protein quality and yield.
Year: 2017 PMID: 28698580 PMCID: PMC5506034 DOI: 10.1038/s41598-017-04685-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1. Phenotypic analysis of protein content (PC) and water-soluble protein content (WSPC) in the 219 soybean accessions. The histograms on the diagonal show the phenotypic distribution of each trait across six environments. The values above the diagonal are pairwise correlation coefficients between traits, and the plots below the diagonal are scatter plots of the compared traits. PCE1–E6 denote the protein content in the six environments; WSPCE1–E6 denote the water-soluble protein content in the corresponding environments.
Loci significantly associated with soybean protein content (PC) and water soluble protein content (WSPC) and the candidate genes.
| QTL | Chr.ᵃ | Rep. SNPᵇ | Pos. (bp)ᶜ | No. sig.ᵈ | | Traits-Environmentsᵉ | Related QTLᶠ | Candidate genesᵍ | Annotationsʰ |
|---|---|---|---|---|---|---|---|---|---|
| | 3 | AX-93995056 | 37,773,722 | 18 | 0.229 | E1, E2, E3, E4, E5, BLUP | Seed protein 21-9, Seed Leu 1-7 | | Glycinin A2B1a precursor |
| | 4 | AX-94274580 | 34,743,951 | 7 | 0.292 | E1, E2, E3, E4, E5, E6, BLUP | / | | Family not named |
| | 4 | AX-93920058 | 46,200,673 | 4 | 0.227 | E3, E4, E5, E6 | Seed protein 3-3, Seed protein 4-1 | | Leucine-rich repeat protein kinase family protein |
| | 5 | AX-94012201 | 5,637,601 | 4 | 0.174 | E1, E2, E3, BLUP | / | | Nitrate transporter |
| | 6 | AX-93731783 | 26,098,086 | 3 | 0.269 | E4, E5, E6 | / | | Phospholipid acyltransferase |
| | 9 | AX-93764734 | 11,138,977 | 6 | 0.258 | E2, E3, E4, E5, E6, BLUP | Seed acidic fraction 1-2, Seed Thr 1-4 | | Inositol transporter |
| | 10 | AX-93933852 | 40,504,375 | 33 | 0.254 | E1, E2, E3, E4, E5, E6, BLUP | Seed protein 27-5 | | RmlC-like cupins superfamily protein |
| | 11 | AX-94089116 | 24,782,059 | 6 | 0.258 | E3, E4, E5, E6, BLUP | Seed protein 25-1, 25-2 | | Family not named |
| | 11 | AX-93795201 | 37,461,551 | 12 | 0.259 | E1, E3, E4, E5, E6, BLUP | / | | Transmembrane amino acid transporter |
| | 12 | AX-93804391 | 34,037,114 | 5 | 0.260 | E1, E2, E3, E5, BLUP | Seed protein 5-2, 33-1, 21-10 | | Serine carboxypeptidase S28 family protein |
| | 13 | AX-94109235 | 23,091,289 | 3 | 0.220 | E1, E3, E4 | Seed protein 3-7 | | Glycinin A3B4 subunits |
| | 14 | AX-93822697 | 7,715,347 | 3 | 0.032 | E1, E2, E6 | Seed Protein 1-2 | | Asparaginase |
| | 15 | AX-93649790 | 7,681,400 | 7 | 0.292 | E1, E3, E4, E5, E6, BLUP | Seed protein 5-1, 3-6, 4-6 | | Beta-amylase |
| | 15 | AX-93837099 | 11,235,816 | 6 | 0.226 | E1, E3, E4, E5, E6, BLUP | / | | Rab5-interacting family protein |
| | 15 | AX-94136114 | 16,417,543 | 6 | 0.258 | E1, E3, E4, E5, E6, BLUP | Seed Lys 1-2, Seed Tyr 1-3 | | Zinc finger WD40 repeat protein |
| | 19 | AX-94196006 | 47,615,818 | 10 | 0.267 | E1, E3, E4, E5, E6, BLUP | Seed protein 2-2, 16-2 | | 7S globulin precursor |
| | 1 | AX-93961814 | 9,078,593 | 3 | 0.095 | E2, E4 | Seed protein 3-5 | | Family not named |
| | 7 | AX-94283226 | 3,484,506 | 4 | 0.049 | E1, E2, E3, BLUP | Seed protein 36-33 | | RNA polymerase III subunit RPC6 |
| | 8 | AX-94048210 | 8,643,359 | 469 | 0.193 | E1, E2, E3, E4, E5, E6, BLUP | Seed protein 26-1, 30-4 | | Seed storage 2S albumin protein; Amino acid permease |
| | 10 | AX-93933852 | 40,504,375 | 33 | 0.130 | E1, E2, E4, E5, E6, BLUP | Seed protein 27-5 | | RmlC-like cupins superfamily protein |
| | 11 | AX-93795201 | 37,461,551 | 19 | 0.089 | E1, E2, E3, E4, E5, E6 | / | | Transmembrane amino acid transporter |
| | 12 | AX-94268787 | 2,583,421 | 4 | 0.038 | E1, E2, E3, BLUP | / | | Auxin responsive protein |
| | 12 | AX-93805526 | 36,952,663 | 5 | 0.123 | E1, E2, BLUP | Seed protein 5-2, 33-1, 21-10 | | Concanavalin A-like/Legume lectin domain |
| | 13 | AX-94112235 | 30,988,071 | 19 | 0.110 | E1, E2, E3, E4, E5, E6, BLUP | Seed protein 21-6, 33-2 | | Albumin 1 gene |
| | 15 | AX-93649393 | 48,823,741 | 3 | 0.058 | E1, E2, E3 | / | | Lob domain-containing protein 4 |
| | 16 | AX-94150758 | 28,496,401 | 6 | 0.030 | E2, E3, E4, BLUP | / | | Proline dipeptidase |
| | 17 | AX-94156641 | 7,962,112 | 10 | 0.123 | E1, E2, E3, BLUP | Seed Leu 1-3 | | Aldehyde dehydrogenase |
| | 18 | AX-93870261 | 6,745,767 | 6 | 0.080 | E1, E2, E3, E4, E5, E6, BLUP | Seed protein 26-8, 26-14 | | Amino acid permease |
| | 18 | AX-94170844 | 17,076,547 | 4 | 0.083 | E2, E4, E5 | / | | GDSL-like Lipase |
| | 18 | AX-94182257 | 56,190,147 | 5 | 0.011 | E1, E3, BLUP | Seed protein 30-10 | | Amino acid transporter 6 |
| | 19 | AX-94196006 | 47,615,818 | 12 | 0.079 | E1, E2, E5, BLUP | Seed protein 2-2, 16-2 | | 7S globulin precursor |
| | 20 | AX-94198333 | 4,049,700 | 5 | 0.022 | E2, E3, E4 | Seed protein 11-1 | | DNA replication licensing factor |
ᵃ Chromosome; ᵇ the representative SNP with the minimum P value; ᶜ position of the representative SNP on the soybean genome assembly Glycine max Wm82.a1.v1.1 (www.phytozome.net); ᵈ the number of significant association signals detected in the region; ᵉ the traits and environments in which significant signals were detected; ᶠ previously reported protein-related QTL in SoyBase (http://www.soybase.org/); ᵍ,ʰ gene IDs and annotations from Glycine max Wm82.a2.v1 (www.phytozome.net); NCBI RefSeq gene models in SoyBase (www.soybase.org) were used as the source of candidate genes. QTNs in bold type were identified by both single- and multi-locus GWAS methods, and underlined QTNs were detected only by multi-locus GWAS methods.
Figure 2. Genetic architecture of soybean protein content (PC) and water-soluble protein content (WSPC). (a) and (b) Manhattan plots for the BLUP of soybean PC and WSPC across six environments by genome-wide association mapping. Red horizontal lines depict the Bonferroni-adjusted significance threshold (P < 4.95 × 10⁻⁶). The x axis shows the 20 soybean chromosomes, and the y axis shows the significance expressed as −log₁₀ P. (c) Associations between 25 loci aligned on the upper boundary and 14 phenotype values (two traits across six environments plus their BLUPs) aligned on the lower boundary. Positions of loci correspond to the panels above. Deep red, red, pink, and gray lines represent significant associations between SNPs and phenotype values at threshold levels of P < 1.0 × 10⁻¹¹, P < 1.0 × 10⁻⁹, P < 1.0 × 10⁻⁷, and P < 1.0 × 10⁻⁵, respectively.
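The tiered coloring described for panel (c) amounts to binning each association P value against a descending list of cutoffs. The following is a minimal Python sketch of that binning, using only the cutoffs and the significance threshold quoted in the caption. The names `TIERS`, `tier`, and `neg_log10`, and the example P values, are illustrative assumptions and are not taken from the study.

```python
import math

# Cutoffs and colors as listed in the Figure 2 caption, strictest first.
TIERS = [
    (1.0e-11, "deep red"),
    (1.0e-9, "red"),
    (1.0e-7, "pink"),
    (1.0e-5, "gray"),
]

# Significance threshold quoted in the caption.
THRESHOLD = 4.95e-6


def tier(p_value):
    """Return the color of the strictest tier that p_value falls under, or None."""
    for cutoff, color in TIERS:
        if p_value < cutoff:
            return color
    return None


def neg_log10(p_value):
    """Convert a P value to the -log10 scale used on a Manhattan plot y axis."""
    return -math.log10(p_value)


if __name__ == "__main__":
    # Hypothetical P values, for illustration only.
    for p in (3.2e-12, 4.0e-8, 2.0e-6, 0.01):
        print(p, tier(p), round(neg_log10(p), 2), p < THRESHOLD)
```

On the −log₁₀ scale, the quoted threshold of 4.95 × 10⁻⁶ corresponds to a horizontal line at about 5.31.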
Figure 3. Associations, genomic locations and the pattern of pairwise LD of SNPs associated with water-soluble protein content (WSPC) on chromosome 8. (a) A 2.5-Mb region of the major-effect quantitative trait locus (GqWSPC8) harboring the peak SNP, AX-94048210, on chromosome 8. The most significantly associated SNP is shown as a large blue dot. Red horizontal lines depict the Bonferroni-adjusted significance threshold (P < 4.95 × 10⁻⁶). The x axis shows the genomic position, and the y axis shows the significance expressed as −log₁₀ P. (b) Soybean genome region around the SNP marker AX-94048210 on chromosome 8, whose position is indicated by a vertical gray dashed line (0.25-Mb) on the top panel. (c) The extent of linkage disequilibrium (LD) in the region based on pairwise r values. The r values are indicated using the color intensity index. The heatmap shows LD between each pair of markers that passed the Bonferroni threshold in GWAS.
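Pairwise LD of the kind summarized in panel (c) is commonly expressed as the correlation between allele codes at two markers. The sketch below computes Pearson's r for two SNPs, assuming inbred (homozygous) accessions coded 0 or 1 per marker. The function name `ld_r` and the genotype vectors are made up for illustration; the software and settings used for the LD analysis in the study are not stated here.

```python
import math


def ld_r(snp_a, snp_b):
    """Pearson correlation between two equally long 0/1-coded genotype vectors."""
    if len(snp_a) != len(snp_b) or not snp_a:
        raise ValueError("genotype vectors must be non-empty and equally long")
    n = len(snp_a)
    mean_a = sum(snp_a) / n
    mean_b = sum(snp_b) / n
    # Sums of cross-products and squares around the means.
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(snp_a, snp_b))
    var_a = sum((a - mean_a) ** 2 for a in snp_a)
    var_b = sum((b - mean_b) ** 2 for b in snp_b)
    if var_a == 0 or var_b == 0:
        raise ValueError("r is undefined for a monomorphic marker")
    return cov / math.sqrt(var_a * var_b)


if __name__ == "__main__":
    # Hypothetical genotypes for eight accessions at two nearby markers.
    snp1 = [0, 0, 1, 1, 0, 1, 0, 1]
    snp2 = [0, 0, 1, 1, 0, 1, 1, 1]
    r = ld_r(snp1, snp2)
    print(round(r, 3), round(r * r, 3))
```

Applying `ld_r` to every pair of significant markers in a region would give the matrix that an LD heatmap displays.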
Figure 4. Epistatic interaction between AX-93822697_T_A and AX-93952504_G_T associated with PC, and candidate genes for each SNP locus. (a) Box plot of PC based on different genotypes in soybean accessions. (b) Phenotypic differences between genotype combinations of the two SNPs. (c) and (d) Candidate genes for the AX-93822697_T_A and AX-93952504_G_T loci, respectively. The proposed causal genes are indicated in red. The bottom panel depicts the extent of linkage disequilibrium in the regions based on pairwise r values. The r values are indicated using the color intensity index shown.
Nine QTLs associated with protein content (PC) and water-soluble protein content (WSPC) across four environments in the RIL population.
| Traits | Nameᵃ | Chr.ᵇ | Marker intervalᶜ | Positionᵈ | LODᵉ 2012 | LODᵉ 2013 | LODᵉ 2014 | LODᵉ 2015 | LODᵉ BLUP | R² (%)ᶠ | Addᵍ |
|---|---|---|---|---|---|---|---|---|---|---|---|
| PC | | 3 | M950668-M977935 | 38993729–39764723 | 5.54 | 3.82 | 5.8 | 2.6 | 7.28 | 11.67 | −0.46 |
| PC | | 5 | M1912544-M1913462 | 5517334–5608744 | ns | 3.47 | 2.7 | ns | 3.32 | 6.87 | 0.36 |
| PC | | 9 | M1614684-M1595567 | 45450837–45451129 | 2.75 | ns | 2.60 | ns | 2.63 | 7.24 | 0.36 |
| PC | | 10 | M764990-M697081 | 37059790–37175436 | ns | 3.09 | ns | 5.23 | 4.11 | 9.1 | 0.61 |
| PC | | 13 | M1793867-M1714063 | 34799842–35437945 | ns | 3.45 | ns | 2.76 | ns | 6.34 | 0.36 |
| PC | | 11 | M812042-M837694 | 7962054–8577956 | 2.76 | 2.81 | 3.16 | ns | 3.99 | 6.74 | −0.36 |
| PC | | 11 | M804544-M861597 | 7301888–7428451 | 6.27 | 3.70 | 6.45 | 4.04 | 6.95 | 17.47 | −0.55 |
| PC | | 19 | M1050238-M1066605 | 47835501–49307483 | 3.3 | ns | 3.35 | 2.83 | 2.86 | 6.58 | −0.35 |
| WSPC | | 1 | M430744-M433677 | 294462–294750 | ns | 3.93 | ns | ns | 4.15 | 9.53 | 0.72 |
| WSPC | | 3 | M878118-M941301 | 1452355–1990434 | ns | 3.27 | ns | 2.5 | ns | 6.53 | 0.7 |
| WSPC | | 8 | M2696788-M2699628 | 8709744–9076486 | 3 | 4.28 | 7.96 | 8.65 | 9.04 | 17.93 | 1.04 |
| WSPC | | 10 | M812042-M837694 | 38151603–38151909 | ns | ns | 2.62 | 2.98 | 2.87 | 6.94 | 0.48 |
ᵃ The name of the QTL is defined by the abbreviation of the trait and the chromosome number; ᵇ chromosome; ᶜ confidence interval of the QTL; ᵈ the interval of physical distance in the soybean genome; ᵉ the logarithm of odds score; ᶠ the mean phenotypic variance explained by the QTL; ᵍ the mean additive effect of the QTL. QTLs in bold were detected by both ICIM and GCIM, and underlined QTLs were detected by GCIM only.
Figure 5. QTLs for soybean protein content (PC) and water-soluble protein content (WSPC) on soybean chromosomes by linkage mapping in the RIL population. Linking lines denote epistatic associations between pairs of QTLs: blue lines connect two QTLs on different chromosomes, while red lines connect two QTLs on the same chromosome. The outer and inner wheat-colored circles show the LOD and PVE value curves, respectively, for the investigated traits across environments. The outermost circle shows the 20 soybean chromosomes, the QTLs for PC/WSPC, and the positions and linked markers of these QTLs on the chromosomes.
Figure 6. The genetic overlap between protein content (PC) and water-soluble protein content (WSPC) in the GWAS population. (a) Quantitative trait locus (QTL) categories and their numbers. (b) Association between genotypes of SNP AX-93995056 and PC. Box plot of PC in 14 A-type and 195 G-type soybean accessions. The vertical axis indicates the PC. The PC of A-type accessions was significantly higher than that of G-type accessions (t test, P = 3.17 × 10⁻⁸). (c) Association between genotypes of SNP AX-94048210 and WSPC. Box plot of WSPC in 50 A-type and 169 G-type soybean accessions. The vertical axis indicates the WSPC. The WSPC of G-type accessions was significantly higher than that of A-type accessions (t test, P = 1.01 × 10⁻²⁹).
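The genotype comparisons in panels (b) and (c) are two-sample t tests on a trait grouped by allele. The caption does not say which variant of the t test was used, so the sketch below assumes Welch's unequal-variance form, which suits groups of very different size such as 14 versus 195 accessions. The function name `welch_t` and the trait values are hypothetical. Converting the statistic to a P value requires a Student's t distribution, which the Python standard library does not provide, so that step is left out.

```python
import math
import statistics


def welch_t(group_a, group_b):
    """Return (t statistic, approximate degrees of freedom) for two samples."""
    mean_a = statistics.fmean(group_a)
    mean_b = statistics.fmean(group_b)
    # Sample variance divided by group size gives each squared standard error.
    se2_a = statistics.variance(group_a) / len(group_a)
    se2_b = statistics.variance(group_b) / len(group_b)
    t = (mean_a - mean_b) / math.sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (len(group_a) - 1) + se2_b ** 2 / (len(group_b) - 1)
    )
    return t, df


if __name__ == "__main__":
    # Hypothetical trait values for accessions carrying each allele.
    a_type = [44.1, 45.3, 46.0, 44.8, 45.5]
    g_type = [41.2, 42.8, 40.9, 43.1, 42.0, 41.7, 42.5]
    t, df = welch_t(a_type, g_type)
    print(round(t, 2), round(df, 1))
```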