| Literature DB >> 32397526 |
Cesar Augusto Medina1, Charles Hawkins1,2, Xiang-Ping Liu1,3, Michael Peel4, Long-Xi Yu1.
Abstract
Soil salinity is a growing problem in world production agriculture. Continued improvement in cropEntities:
Keywords: GBS; abiotic stress; allele dosage; association mapping; genomic selection; polyploid
Mesh:
Substances:
Year: 2020 PMID: 32397526 PMCID: PMC7247575 DOI: 10.3390/ijms21093361
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
A summary of single nucleotide polymorphism (SNP) markers developed by genotype-by-sequencing (GBS) and their categories of gene annotations based on the Medicago truncatula reference genome (Mt.v5.0).
| SNPs | Count | |
|---|---|---|
| Coding | Synonymous variant | 2843 |
| Missense variant | 2014 | |
| Stop lost | 3 | |
| Stop gained | 22 | |
| Start lost | 0 | |
| Splice donor variant | 2 | |
| Splice acceptor variant | 4 | |
| Exonic splice region variant | 7 | |
| Splice region variant | 83 | |
| 5 prime UTR variant | 82 | |
| 3 prime UTR variant | 174 | |
| Non-coding | Upstream transcript variant | 61 |
| Downstream transcript variant | 32 | |
| Intron variant | 956 | |
| Intergenic variant | 579 | |
Figure 1Single nucleotide polymorphism variants (SNVs) identified in alfalfa (Medicago sativa) populations developed in Logan, Utah (A) Histogram of filtered variants called by Next Generation Sequencing Experience Platform (NGSEP) showing distribution by minor allele frequency and classified by function after annotation. (B) Distribution of GBS SNP markers across eight Medicago truncatula chromosomes using 1 Mb window. The colored lines represent the marker density as showing on the right color legends.
Figure 2Frequency of allele dosage in autotetraploid alfalfa (Medicago sativa) for 6862 high-quality biallelic SNVs obtained from NGSEP pipeline in the Logan dataset. A represents dosage of the major allele and B is for the minor allele dosage.
Figure 3Manhattan plots showing marker–trait association for vigor (V) in alfalfa populations at Othello Washington (WA) and Castle Dale Utah (UT). (A) Markers identified by general model in the UT dataset. (B) Markers identified by diplo-general model in the UT dataset. (C) Markers identified by general model in the WA dataset. (D) Markers identified by diplo-general model in the WA dataset. The threshold of 0.05 was used for significant markers according to the Bonferroni method.
SNP marker, trait, model, chromosome, position, allele, , locus tag, and putative gene function associated with alfalfa (Medicago sativa) yield (Y) under salt stress in Othello, WA, and vigor (V) in Othello, Washington (V_WA), and Castle Dale, Utah (V_UT) fields.
| M. | Trait | Model | Chr. | Position | SNP |
| Locus tag | Annotation |
|---|---|---|---|---|---|---|---|---|
| 283 | V_WA | 1 | 1 | 19123928 | A/G | 5.34 | MtrunA17_Chr1g0170381 | Hypothetical protein |
| 860 | V_UT | 1, 2 | 1 | 50528093 | C/T | 5.59, 6.08 | MtrunA17_Chr1g0205221 | Putative folate-biopterin transporter, major facilitator superfamily domain-containing protein |
| 861 | V_UT | 1, 2 | 1 | 50528125 | C/T | 5.7, 6.08 | ||
| 1561 | V_UT | 1 | 2 | 35034036 | A/G | 6.08 | MtrunA17_Chr2g0312131 | Hypothetical protein |
| 1644 | Y_Jun_19, Y_Jul_19 | 2, 3, 5 | 2 | 38865320 | A/G | 5.19, 6.02 | MtrunA17_Chr2g0316741 | Hypothetical protein |
| 1744 | V_WA | 2 | 2 | 44365722 | A/G | 5.54 | MtrunA17_Chr2g0324021 | Putative oxidoreductase |
| 1992 | V_WA | 2 | 3 | 2641319 | C/G | 5.55 | MtrunA17_Chr3R0014140 | RLX_singleton_family134 PWWP domain |
| 1993 | V_WA | 1, 2 | 3 | 2641320 | C/T | 5.28, 5.53 | ||
| 2033 | Y_All_18 | 4 | 3 | 5484686 | C/G | 4.95 | MtrunA17_Chr3g0083861 | Putative Serpin family protein |
| 2195 | Y_Jul_18 | 2 | 3 | 17906891 | C/T | 6.2 | MtrunA17_Chr3g0094791 | Putative tetratricopeptide-like helical domain, DYW domain-containing protein |
| 2711 | V_WA | 2 | 3 | 49957218 | A/T | 5.65 | NA | NA |
| 2712 | V_WA | 2 | 3 | 49957253 | C/T | 5.55 | ||
| 3515 | V_UT | 1 | 4 | 44369334 | C/T | 5.46 | MtrunA17_Chr4g0048811 | Putative aminoacyltransferase, E1 ubiquitin-activating enzyme |
| 3708 | Y_Sep_19 | 4 | 4 | 54035230 | A/G | 5.04 | MtrunA17_Chr4g0062111 | Putative protein CHAPERONE-LIKE PROTEIN OF POR1 |
| 4154 | V_WA | 2 | 5 | 12453276 | A/G | 5.55 | MtrunA17_Chr5g0410771 | Putative HSP20-like chaperone, P-loop containing nucleoside triphosphate hydrolase |
| 4155 | V_WA | 2 | 5 | 12453319 | G/T | 5.55 | ||
| 4156 | V_WA | 2 | 5 | 12453328 | C/G | 5.54 | ||
| 4463 | V_WA | 2 | 5 | 35355162 | G/T | 5.91 | MtrunA17_Chr5g0435221 | Putative 23S rRNA (adenine(2503)-C(2))-methyltransferase |
| 4633 | V_UT | 1, 2 | 5 | 41782228 | A/T | 5.53, 6.4 | MtrunA17_Chr5g0444321 | Putative leucine-rich repeat domain, L domain-containing protein |
| 4775 | Y_All_18, Y_Aug_18, Y_Sep_18 | 1 | 6 | 1909362 | C/T | 6.74, 5.7, 5.61 | MtrunA17_Chr6g0451341 | Putative transcription regulator IWS1 family |
| 4868 | V_WA | 1 | 6 | 7243498 | A/G | 5.48 | MtrunA17_Chr6g0457561 | Hypothetical protein |
| 5146 | V_WA | 2 | 6 | 35426314 | C/G | 5.86 | MtrunA17_Chr6R0226110 | Putative potassium channel, voltage-dependent, ERG |
| 5241 | V_WA | 1 | 6 | 40502777 | A/G | 5.34 | MtrunA17_Chr6g0486011 | Putative zinc finger, RanBP2-type |
| 5558 | V_UT | 1 | 7 | 26012100 | C/T | 5.45 | MtrunA17_Chr7g0235641 | Putative RIN4, pathogenic type III effector avirulence factor Avr cleavage |
| 5834 | V_WA | 2 | 7 | 43123906 | A/G | 5.71 | NA | NA |
| 5858 | V_WA | 1 | 7 | 44707092 | C/T | 5.6 | MtrunA17_Chr7g0259771 | Putative small GTPase superfamily, EF-hand domain pair |
| 6478 | Y_Jun_19 | 2 | 8 | 32682521 | A/T | 5.18 | MtrunA17_Chr8g0369441 | Putative brevis radix (BRX) domain, transcription factor BREVIS RADIX domain-containing protein |
M. = Marker consecutive. Chr. = chromosome; Y = BLUEs values for yield in the indicated harvest; HS = health score of plants under salt stress. Models: 1 = general, 2 = diplo-general, 3 = diplo-additive, 4 = 2-dominant-reference, 5 = 1-dominant-reference. Locus tag annotation based on [13]. Orange colored cells indicate the same marker in different traits. Grey colored cells indicate several markers associated to same loci.
Figure 4Manhattan plots showing marker–trait associations for yield datasets in alfalfa (Medicago sativa) at Othello, Washington over two years. (A) Markers identified by general model in All 2018. (B) Markers identified by 2-dominant reference model in All 2018. (C) Markers identified by diplo-general model in July 2018 dataset. (D) Markers identified by general model in August 2018. (E) Markers identified by general model in September 2018. (F) Markers identified by diplo-general model in June 2019. (G) Markers identified by diplo-additive model in June 2019. (H) Markers identified by 1-dominant reference model in June 2019. (I) Markers identified by diplo-general model in July 2019. (J) Markers identified by diplo-additive model in July 2019. (K) Markers identified by 1-dominant reference model in July 2019. (L) Markers identified by 2-dominant reference model in September 2019 dataset. Markers threshold was set using Bonferroni > 0.05.
Figure 5Linkage disequilibrium (LD) among markers associated for yield and vigor under salt stress. Haploview v4.2 [14] and pairwise LD values () were used for 27 SNPs associated with yield and vigor under salt stress (green color) and their surrounding SNPs in 10 kb (black color). Bright red coloring indicates ; blue coloring indicates ; white coloring indicates ; shades of pink/red coloring indicates .
Genomic selection (GS) metrics for alfalfa (Medicago sativa) plant vigor under salt stress at Castle Dale, Utah (HS_UT), and Othello, Washington (HS_WA). Eight GS models were tested using 10-fold cross-validation and the metrics of accuracies as Pearson’s correlation values (Pearson) and root mean squared error (RMSE) are shown by model.
| Dataset | Metric | rrBLUP | BayesA | BayesB | BayesC | BL | BRR | RF | SVM |
|---|---|---|---|---|---|---|---|---|---|
| V_UT | Pearson | 0.267 | 0.274 | 0.250 | 0.275 | 0.272 | 0.245 | 0.244 | 0.287 |
| RMSE | 0.894 | 0.885 | 0.896 | 0.890 | 0.887 | 0.894 | 0.890 | 0.880 | |
| V_WA | Pearson | 0.336 | 0.336 | 0.327 | 0.342 | 0.329 | 0.343 | 0.324 | 0.361 |
| RMSE | 0.696 | 0.693 | 0.698 | 0.692 | 0.696 | 0.696 | 0.708 | 0.691 |
Notes: BL, Bayesian LASSO; BRR, Bayesian ridge regression; RF, random forest; SVM, support vector machine.
Description of best linear unbiased estimates (BLUEs) yield values and genomic selection (GS) results for alfalfa (Medicago sativa) grown under salt stress. Broad sense heritability (), residual SD (Res_SD), , and coefficient of variation (Coef_Var) of phenotypic data were calculated using the package Mr.Bean [15] with genotype as random effect. Eight GS models were tested using 10-fold cross-validation and the metrics of accuracies as Pearson’s correlation values (Pearson) and root mean squared error (RMSE) are shown by model.
| Dataset |
| Res_SD |
| Coef_Var | Metric | rrBLUP | BayesA | BayesB | BayesC | BL | BRR | RF | SVM |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Jul_18 | 0.47 | 0.55 | 0.479 | 0.23 | Pearson | 0.305 | 0.305 | 0.303 | 0.307 | 0.303 | 0.299 | 0.343 | 0.324 |
| RMSE | 0.509 | 0.506 | 0.51 | 0.508 | 0.508 | 0.509 | 0.508 | 0.503 | |||||
| Aug_18 | 0.51 | 0.46 | 0.51 | 0.25 | Pearson | 0.27 | 0.259 | 0.275 | 0.272 | 0.253 | 0.265 | 0.268 | 0.24 |
| RMSE | 0.409 | 0.411 | 0.407 | 0.408 | 0.408 | 0.408 | 0.414 | 0.414 | |||||
| Sep_18 | 0.69 | 0.24 | 0.629 | 0.38 | Pearson | 0.444 | 0.445 | 0.448 | 0.447 | 0.454 | 0.45 | 0.464 | 0.509 |
| RMSE | 0.255 | 0.254 | 0.254 | 0.255 | 0.254 | 0.254 | 0.256 | 0.244 | |||||
| All_18 | 0.8 | 0.38 | 0.717 | 0.3 | Pearson | 0.234 | 0.216 | 0.227 | 0.226 | 0.209 | 0.236 | 0.302 | 0.268 |
| RMSE | 0.377 | 0.38 | 0.376 | 0.379 | 0.375 | 0.377 | 0.37 | 0.371 | |||||
| May_19 | 0.43 | 0.55 | 0.506 | 0.28 | Pearson | 0.116 | 0.108 | 0.107 | 0.121 | 0.119 | 0.115 | 0.182 | 0.113 |
| RMSE | 0.551 | 0.558 | 0.556 | 0.552 | 0.552 | 0.553 | 0.541 | 0.548 | |||||
| Jun_19 | 0.33 | 0.5 | 0.502 | 0.28 | Pearson | 0.173 | 0.147 | 0.155 | 0.146 | 0.184 | 0.154 | 0.219 | 0.201 |
| RMSE | 0.477 | 0.481 | 0.478 | 0.478 | 0.474 | 0.478 | 0.467 | 0.469 | |||||
| Jul_19 | 0.43 | 0.49 | 0.555 | 0.33 | Pearson | 0.258 | 0.242 | 0.238 | 0.266 | 0.231 | 0.235 | 0.287 | 0.281 |
| RMSE | 0.51 | 0.513 | 0.509 | 0.507 | 0.51 | 0.51 | 0.514 | 0.51 | |||||
| Sep_19 | 0.54 | 0.29 | 0.553 | 0.39 | Pearson | 0.249 | 0.231 | 0.257 | 0.24 | 0.247 | 0.236 | 0.276 | 0.301 |
| RMSE | 0.31 | 0.312 | 0.309 | 0.311 | 0.309 | 0.31 | 0.312 | 0.308 | |||||
| All_19 | 0.83 | 0.37 | 0.716 | 0.45 | Pearson | 0.072 | 0.065 | 0.083 | 0.064 | 0.06 | 0.083 | 0.137 | 0.138 |
| RMSE | 0.464 | 0.467 | 0.466 | 0.466 | 0.463 | 0.462 | 0.456 | 0.455 |
Notes: BL, Bayesian LASSO; BRR, Bayesian ridge regression; RF, random forest; SVM, support vector machine.
Comparison of genomic selection (GS) models in phenotypic data collected for alfalfa (Medicago sativa) yield under salt stress. Random forest (RF) and support vector machine (SVM) models were trained by 10-fold cross validation (RF_10CV or SVM_10%). Pearson’s correlation (Pearson) and root mean squared error (RMSE) values were calculated.
| Harvest | Metric | RF_10CV | RF_10% | SVM_10CV | SVM_10% |
|---|---|---|---|---|---|
| July_2018 | Pearson | 0.343 | 0.728 | 0.324 | 0.793 |
| RMSE | 0.508 | 0.389 | 0.503 | 0.353 | |
| August_2018 | Pearson | 0.268 | 0.225 | 0.240 | 0.279 |
| RMSE | 0.414 | 0.468 | 0.414 | 0.459 | |
| September_2018 | Pearson | 0.464 | 0.771 | 0.509 | 0.729 |
| RMSE | 0.256 | 0.222 | 0.244 | 0.205 | |
| All_2018 | Pearson | 0.302 | 0.259 | 0.268 | -0.073 |
| RMSE | 0.370 | 0.399 | 0.371 | 0.657 | |
| May_2019 | Pearson | 0.182 | 0.135 | 0.113 | 0.282 |
| RMSE | 0.541 | 0.511 | 0.548 | 0.491 | |
| June_2019 | Pearson | 0.219 | 0.226 | 0.201 | 0.353 |
| RMSE | 0.467 | 0.479 | 0.469 | 0.464 | |
| July_2019 | Pearson | 0.287 | 0.365 | 0.281 | 0.479 |
| RMSE | 0.514 | 0.471 | 0.510 | 0.450 | |
| September_2019 | Pearson | 0.276 | 0.410 | 0.301 | 0.627 |
| RMSE | 0.312 | 0.302 | 0.308 | 0.275 | |
| All_2019 | Pearson | 0.137 | 0.275 | 0.138 | 0.229 |
| RMSE | 0.456 | 0.469 | 0.455 | 0.472 |