| Literature DB >> 30519252 |
Kaixin Zhang1,2, Shulin Liu3, Wenbin Li1,2, Shiping Liu1,2, Xiyu Li1,2, Yanlong Fang1,2, Jun Zhang4, Yue Wang1,2, Shichao Xu1,2, Jianan Zhang1,2, Jie Song1,2, Zhongying Qi1,2, Xiaocui Tian1,2, Zhixi Tian3, Wen-Xia Li1,2, Hailong Ning1,2.
Abstract
Protein content (PC), an important trait in soybean (Glycine max) breeding, is controlled by multiple genes with relatively small effects. To identify the quantitative trait nucleotides (QTNs) controlling PC, we conducted a multi-locus genome-wide association study (GWAS) for PC in 144 four-way recombinant inbred lines (FW-RILs). All the FW-RILs were phenotyped for PC in 20 environments, including four locations over 4 years with different experimental treatments. Meanwhile, all the FW-RILs were genotyped using SoySNP660k BeadChip, producing genotype data for 109,676 non-redundant single-nucleotide polymorphisms. A total of 129 significant QTNs were identified by five multi-locus GWAS methods. Based on the 22 common QTNs detected by multiple GWAS methods or in multiple environments, pathway analysis identified 8 potential candidate genes that are likely to be involved in protein synthesis and metabolism in soybean seeds. Using superior allele information for 22 common QTNs in 22 elite and 7 inferior lines, we found higher superior allele percentages in the elite lines and lower percentages in the inferior lines. These findings will contribute to the discovery of the polygenic networks controlling PC in soybean, increase our understanding of the genetic foundation and regulation of PC, and be useful for molecular breeding of high-protein soybean varieties.Entities:
Keywords: QTNs; four-way recombinant inbred lines; multi-locus GWAS; protein content; soybean
Year: 2018 PMID: 30519252 PMCID: PMC6258895 DOI: 10.3389/fpls.2018.01690
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
FIGURE 1Frequency distribution of seed protein content under 20 environments.
Joint ANOVA of PC of parent lines in multiple environments.
| Source | DF | SS | MS | Pr > F | |
|---|---|---|---|---|---|
| Replication | 2 | 5.55 | 2.77 | 6.59 | 0.0018 |
| Environment | 19 | 155.86 | 8.20 | 19.49 | <0.0001 |
| Genotype | 3 | 271.22 | 90.41 | 214.80 | <0.0001 |
| Genotype ∗ Environment | 57 | 50.02 | 0.877 | 2.08 | 0.0002 |
| Error | 158 | 66.50 | 0.42 |
Joint ANOVA of PC of FW-RILs in multiple environments and heritability.
| Source | DF | SS | MS | Pr > F | Variance component | |
|---|---|---|---|---|---|---|
| Replication | 2 | 1,304.62 | 652.31 | 11,639.60 | <0.0001 | 0.25 |
| Environment | 19 | 1,893.98 | 99.681 | 1,778.72 | <0.0001 | 0.23 |
| Genotype | 143 | 4,565.61 | 31.93 | 569.70 | <0.0001 | 0.50 |
| Genotype ∗ Environment | 2,436 | 12,408.82 | 5.09 | 90.89 | <0.0001 | 1.68 |
| Error | 5,196 | 291.19 | 0.06 | 0.06 | ||
| 0.85 |
FIGURE 2Population structure based on 5375 SNPs distributed across 20 chromosomes. (A) Plot of ΔK calculated for K = 1–10. (B) Population structure (K = 2); the areas of the two colors (green and red) illustrate the proportion of each subgroup.
FIGURE 3(A) The total numbers of significant QTNs detected in 20 environments across 5 methods. (B) The total numbers of significant QTNs detected using each of 5 multi-locus GWAS methods in 20 environments.
Stable expressed QTNs identified in multiple environments and by multiple methods.
| Methoda | Env | Marker | Chr | Marker position | QTN effect | LOD score | |
|---|---|---|---|---|---|---|---|
| 1,1 | E14,E20, | AX-157298785 | 18 | 6,620,851 | −0.39, −0.40, | 4.21, 4.02, | 10.78, 11.02, |
| 2,3 | E14,E14, | −0.29, −0.67, | 4.32, 4.43, | 5.86, 7.78, | |||
| 5,5 | E14,E16 | −0.34, −0.39 | 4.56, 5.33 | 8.40, 10.48 |
Common QTNs for seed protein content in soybean across different multi-methods.
| Methoda | Marker | Chr | Position (bp) | QTN effect | LOD score | |
|---|---|---|---|---|---|---|
| 2,3 | AX-157088197 | 1 | 2,142,538 | 0.24,0.51 | 3.06,3.15 | 5.41,5.50 |
| 2,5 | AX-157514742 | 1 | 5,244,469 | 0.41,0.44 | 4.76,4.76 | 8.58,9.99 |
| 2,4 | AX-157393800 | 1 | 36,630,129 | 0.53,0.61 | 4.17,5.49 | 6.39,8.57 |
| 2,3,4 | AX-157197609 | 2 | 19,981,350 | −0.34,−0.65,−0.38 | 3.38,3.36,3.81 | 8.06,6.14,9.58 |
| 3,5 | ||||||
| 2,4 | AX-157487767 | 3 | 28,963,194 | 0.45,0.47 | 4.72,3.90 | 4.74,5.30 |
| 2,3,4,5 | ||||||
| 1,2,4,5 | ||||||
| 1,2,4,5 | ||||||
| 1,4 | AX-157083233 | 6 | 49,396,770 | −0.78,−0.51 | 3.20,3.65 | 19.21,6.67 |
| 1,3,4,5 | ||||||
| 1,2,5 | ||||||
| 3,5 | ||||||
| 4,5 | ||||||
| 3,5 | ||||||
| 4,5 | ||||||
| 2,4,5 | ||||||
| 1,4 | AX-157512649 | 16 | 24,057,874 | 0.31,0.24 | 3.16,3.44 | 7.00,4.09 |
| 1,5 | ||||||
| 1,3,5 | AX-157333937 | 18 | 2,064,407 | 0.56,0.96,0.48 | 3.56,4.33,4.23 | 11.90,8.54,8.58 |
| 1,2,4 | AX-157443296 | 18 | 6,597,875 | −0.49,−0.36,−0.38 | 3.83,4.26,5.12 | 15.96,8.71,9.88 |
| 1,2,3,5 | AX-157298785 | 18 | 6,620,851 | −0.39(−0.40),−0.29, −0.67,−0.34 (−0.39) | 4.21 (4.02),4.32, 4.43,4.56,(5.33) | 10.78(11.02),5.86, 7.78,8.40 (10.48) |
FIGURE 4(A) The number of common QTNs detected by different methods and (B) different combinations of methods. Method numbers correspond to (1) mrMLM, (2) FASTmrMLM, (3) FASTmrEMMA, (4) pLARmEB, and (5) ISIS EM-BLASSO.
Phenotypic averages of seed protein content and proportion of superior alleles in 29 lines across 22 common QTNs.
| Line | PC (%) | PSA (%) | Line | PC (%) | PSA (%) |
|---|---|---|---|---|---|
| HN54 | 44.21 | 82 | HN67 | 44.19 | 55 |
| HN37 | 43.26 | 77 | HN98 | 43.01 | 55 |
| HN46 | 44.01 | 77 | HN41 | 43.10 | 50 |
| HN47 | 43.27 | 77 | HN48 | 43.55 | 50 |
| HN40 | 43.40 | 73 | HN58 | 43.13 | 50 |
| HN45 | 43.54 | 73 | HN93 | 43.03 | 45 |
| HN103 | 44.20 | 68 | HN20 | 43.34 | 36 |
| HN24 | 43.07 | 64 | |||
| HN74 | 43.42 | 64 | |||
| HN118 | 43.11 | 64 | |||
| HN142 | 43.14 | 64 | |||
| HN60 | 43.06 | 59 | |||
| HN69 | 43.43 | 59 | |||
| HN91 | 43.20 | 59 | |||
| HN35 | 43.43 | 55 |
FIGURE 5Distribution of superior allele percentages and the PC in the 29 high- and low-PC lines.
Superior alleles and their proportions of 22 common QTNs in 22 elite and 7 inferior lines.
| QTN | Superior allele | PSA (%)a | PSA (%)b | QTN | Superior allele | PSA (%)a | PSA (%)b |
|---|---|---|---|---|---|---|---|
| AX-157506141 | CC | 95 | 71 | AX-157074676 | GG | 64 | 71 |
| AX-157514742 | CC | 82 | 29 | AX-157489326 | TT | 59 | 57 |
| AX-157168337 | TT | 77 | 57 | AX-157594705 | GG | 59 | 29 |
| AX-157393800 | AA | 77 | 43 | AX-157487767 | AA | 50 | 57 |
| AX-157462104 | GG | 77 | 43 | AX-157570733 | TT | 50 | 29 |
| AX-157298785 | GG | 73 | 43 | AX-157512649 | GG | 45 | 71 |
| AX-157397239 | AA | 73 | 29 | AX-157197609 | AA | 45 | 14 |
| AX-157443296 | CC | 68 | 57 | AX-157069070 | TT | 41 | 57 |
| AX-157088197 | GG | 68 | 43 | AX-157333937 | CC | 41 | 14 |
| AX-157566978 | AA | 68 | 43 | AX-157083233 | TT | 36 | 29 |
| AX-157217990 | TT | 68 | 14 | AX-157357710 | AA | 36 | 0 |
FIGURE 6Heat map of the superior allele distribution for the 22 common QTNs in the 29 high- and low-PC lines. Blue and white colors represent superior and inferior alleles, respectively.
FIGURE 7Information on pathways and orthologous protein families of 17 genes. (A) shows the information on pathway. (B) shows the information on orthologous protein families.
Details of 17 genes annotated in the KEGG database.
| QTN name | Gene namea | Chromosome | Position | KO number | Annotation |
|---|---|---|---|---|---|
| AX-157088197 | Glyma.01G020900 | chr01 | 2104937..2109784 | K00083 | E1.1.1.195; cinnamyl-alcohol dehydrogenase [EC:1.1.1.195] |
| AX-157514742 | Glyma.01G046000 | chr01 | 5317206..5321170 | K21734 | SLD; sphingolipid 8-(E/Z)-desaturase [EC:1.14.19.29] |
| AX-157487767 | |||||
| AX-157397239 | Glyma.04G206300 | chr04 | 47880082..47890343 | K21594 | GUF1; translation factor GUF1, mitochondrial [EC:3.6.5.-] |
| AX-157570733 | Glyma.10G207100 | chr10 | 43860713..43863694 | K01373 | CTSF; cathepsin F [EC:3.4.22.41] |
| AX-157570733 | |||||
| AX-157566978 | Glyma.12G017200 | chr12 | 1212436..1217081 | K17790 | TIM22; mitochondrial import inner membrane translocase subunit TIM22 |
| AX-157566978 | Glyma.12G018000 | chr12 | 1252709..1256330 | K04683 | TFDP1; transcription factor Dp-1 |
| AX-157566978 | Glyma.12G018800 | chr12 | 1321415..1326705 | K1542 | PPP4R2; serine/threonine-protein phosphatase 4 regulatory subunit 2 |
| AX-157566978 | |||||
| AX-157357710 | |||||
| AX-157357710 | Glyma.12G113400 | chr12 | 11135703..11140450 | K19355 | MAN; mannan endo-1,4-beta-mannosidase [EC:3.2.1.78] |
| AX-157217990 | |||||
| AX-157333937 | Glyma.18G027100 | chr18 | 2033839..2038697 | K05857 | PLCD; phosphatidylinositol phospholipase C, delta [EC:3.1.4.11] |
| AX-157333937 | |||||
| AX-157443296 | |||||
| AX-157443296 |
Details of significant SNPs detected by MLM with screening critical P < 0.0001.
| Environment | Chr | Physical position (bp) | Physical distance (kb) | No. of QTNs (MLM)a | No. of QTNs (multi-locis GWAS)b | |
|---|---|---|---|---|---|---|
| E6 | 1 | 5,703,406 | 2.82E-05 | 1 | 1 | |
| E19 | 8 | 43,712,243 | 1.17E-04 | 1 | 1 | |
| E4 | 13 | 1,147,825 | 9.80E-05 | 1 | 0 | |
| E12 | 14 | 15,013,495 | 1.35E-04 | 1 | 0 | |
| E12 | 16 | 5,609,879 | 1.12E-04 | 1 | 0 | |
| E19 | 16 | 28,008,354 | 1.07E-04 | 1 | 2 | |
| E6 | 18 | 5,396,919 | 3.54E-05 | 1 | 0 | |
| E16 | ||||||
| Total |