| Literature DB >> 32354037 |
Janice Kofsky1, Hengyou Zhang1, Bao-Hua Song1.
Abstract
A worldwide food shortage has been projected as a result of the current increase in global population and climate change. In order to provide sufficient food to feed more people, we must develop crops that can produce higher yields. Plant early vigor traits, early growth rate (EGR), early plant height (EPH), inter-node length, and node count are important traits that are related to crop yield. Glycine soja, the wild counterpart to cultivated soybean, Glycine max, harbors much higher genetic diversity and can grow in diverse environments. It can also cross easily with cultivated soybean. Thus, it holds a great potential in developing soybean cultivars with beneficial agronomic traits. In this study, we used 225 wild soybean accessions originally from diverse environments across its geographic distribution in East Asia. We quantified the natural variation of several early vigor traits, investigated the relationships among them, and dissected the genetic basis of these traits by applying a Genome-Wide Association Study (GWAS) with genome-wide single nucleotide polymorphism (SNP) data. Our results showed positive correlation between all early vigor traits studied. A total of 12 SNPs significantly associated with EPH were identified with 4 shared with EGR. We also identified two candidate genes, Glyma.07G055800.1 and Glyma.07G055900.1, playing important roles in influencing trait variation in both EGR and EPH in G. soja.Entities:
Keywords: GWAS; Glycine soja; early growth; early vigor; genome-wide association study; wild soybean
Mesh:
Substances:
Year: 2020 PMID: 32354037 PMCID: PMC7247153 DOI: 10.3390/ijms21093105
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Phenotypic variation and correlations of studied early vigor traits in wild soybean: Mean, standard deviation (StDev), and coefficient of variance (CV). The units of early plant height (EPH) and inter-node length are mm. Positive significant correlation between traits: early growth rate (EGR), EPH, node count, and inter-node length by Spearman’s ρ correlation coefficient.
| Trait | Phenotypic Variation | Positive Significant Correlation between Traits | ||||
|---|---|---|---|---|---|---|
| Mean | StDev | CV | EPH | Node Count | Inter-Node Length | |
| EGR | 16.02 | 8.34 | 52.05 | 0.9861 | 0.7023 | 0.8674 |
| EPH | 244.68 | 127.2 | 51.99 | - | 0.6911 | 0.8838 |
| Node Count | 4.34 | 0.93 | 21.33 | - | - | 0.3449 |
| Inter-node Length | 54.55 | 24.97 | 45.76 | - | - | - |
Figure 1Results of principle component analysis (PCA) and genome-wide association analysis (GWAS). Manhattan plots illustrate the GWAS results of EGR (A) and EPH (B). X-axis represents single nucleotide polymorphism (SNP) positions across the entire genome by chromosome and the y-axis is the negative logarithm p-value: -log10 (p) of each SNP. Significant SNPs with chromosome-wide false discovery rate (FDR) adjustment are highlighted in green. The corresponding QQ plots for EGR and EPH are shown in (D) and (E), respectively. For Q-Q plots, x-axis represents expected −log10 (p) and y-axis is observed −log10 (p) of each SNPs. (C) Plot of PCA result with all 225 ecotypes. EGR: early growth rate; EPH: early plant height.
Significant SNP markers associated with early growth rate (EGR) and early plant height (EPH): The q-value given is the chromosome-wide FDR-adjusted p-value. Significance in CB (Chromosome-wide Bonferroni threshold), GB (Genome-wide Bonferroni threshold), and GFDR (Genome-wide FDR adjustment) with p < 0.05 are indicated with (*). The position is physical position on the chromosome (Chr.) in base pair.
| SNP | Chr | Position |
| Location | Associated Gene | Associated QTL | CB | GB | GFDR | |||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| ss715598271 | 7 | 4924020 | 0.1085 | Intron |
| 9.14E-07 | 0.001431 | * | * | * | |
| ss715614175 | 13 | 19487316 | 0.071 | Intergenic |
| Plant Height 26-11 [ | 6.50E-05 | 0.042055 | - | - | - | |
| ss715615103 | 13 | 31173270 | 0.0911 | Intergenic |
| 6.12E-06 | 0.006046 | * | - | - | ||
| ss715616082 | 13 | 39280839 | 0.0944 | 5UTR |
| 6.23E-06 | 0.006046 | * | - | - | ||
|
| ss715579500 | 1 | 45269059 | 0.0792 | Intergenic |
| 2.26E-05 | 0.028024 | * | - | - | |
| ss715598269 | 7 | 4915929 | 0.0682 | Intron |
| Plant Height 19-5 [ | 1.04E-04 | 0.027144 | - | - | - | |
| ss715598270 | 7 | 4918294 | 0.0834 | 3UTR |
| Plant Height 19-5 [ | 1.64E-05 | 0.010022 | * | - | - | |
| ss715598271 | 7 | 4924020 | 0.1242 | Intron |
| Plant Height 19-5 [ | 1.62E-07 | 0.000254 | * | * | * | |
| ss715598272 | 7 | 4928272 | 0.0817 | Intron |
| Plant Height 19-5 [ | 1.92E-05 | 0.010022 | * | - | - | |
| ss715598304 | 7 | 5214440 | 0.0816 | Intergenic |
| Plant Height 19-5 [ | 4.11E-05 | 0.016091 | - | - | - | |
| ss715598895 | 7 | 8788505 | 0.0668 | Intron |
| 1.04E-04 | 0.027144 | - | - | - | ||
| ss715598145 | 7 | 42926704 | 0.0628 | CDS |
| 1.86E-04 | 0.041611 | - | - | - | ||
| ss715614175 | 13 | 19487316 | 0.09 | Intergenic |
| Plant Height 26-11 [ | 7.24E-06 | 0.007026 | * | - | - | |
| ss715615103 | 13 | 31173270 | 0.0893 | Intergenic |
| 7.18E-06 | 0.007026 | * | - | - | ||
| ss715616082 | 13 | 39280839 | 0.0874 | 5UTR |
| 1.22E-05 | 0.007893 | * | - | - | ||
| ss715620138 | 14 | 9595999 | 0.0873 | Intergenic | - | 8.84E-06 | 0.013826 | * | - | - | ||
Figure 2Pairwise linkage disequilibrium (LD) between SNPs in the local genes of interest: Glyma.07G055800.1, Glyma.07G055900.1, and the known QTL, plant height 19-5 [43], in relation to SNPs ss715598269, ss715598270, ss715598272 (significantly associated with EPH), and ss715598271 (significantly associated with both EPH and EGR).
Figure 3Geographic distribution of known locations of 225 G. soja accession: Each point marks a geographic location where the samples were originally collected.