Literature DB >> 33930039

Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits.

Mohsen Yoosefzadeh-Najafabadi1, Dan Tulpan2, Milad Eskandari1.   

Abstract

Improving genetic yield potential in major food grade crops such as soybean (Glycine max L.) is the most sustainable way to address the growing global food demand and its security concerns. Yield is a complex trait and reliant on various related variables called yield components. In this study, the five most important yield component traits in soybean were measured using a panel of 250 genotypes grown in four environments. These traits were the number of nodes per plant (NP), number of non-reproductive nodes per plant (NRNP), number of reproductive nodes per plant (RNP), number of pods per plant (PP), and the ratio of number of pods to number of nodes per plant (P/N). These data were used for predicting the total soybean seed yield using the Multilayer Perceptron (MLP), Radial Basis Function (RBF), and Random Forest (RF), machine learning (ML) algorithms, individually and collectively through an ensemble method based on bagging strategy (E-B). The RBF algorithm with highest Coefficient of Determination (R2) value of 0.81 and the lowest Mean Absolute Errors (MAE) and Root Mean Square Error (RMSE) values of 148.61 kg.ha-1, and 185.31 kg.ha-1, respectively, was the most accurate algorithm and, therefore, selected as the metaClassifier for the E-B algorithm. Using the E-B algorithm, we were able to increase the prediction accuracy by improving the values of R2, MAE, and RMSE by 0.1, 0.24 kg.ha-1, and 0.96 kg.ha-1, respectively. Furthermore, for the first time in this study, we allied the E-B with the genetic algorithm (GA) to model the optimum values of yield components in an ideotype genotype in which the yield is maximized. The results revealed a better understanding of the relationships between soybean yield and its components, which can be used for selecting parental lines and designing promising crosses for developing cultivars with improved genetic yield potential.

Entities:  

Year:  2021        PMID: 33930039     DOI: 10.1371/journal.pone.0250665

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  5 in total

Review 1.  Genome-Wide Association Study Statistical Models: A Review.

Authors:  Mohsen Yoosefzadeh-Najafabadi; Milad Eskandari; François Belzile; Davoud Torkamaneh
Journal:  Methods Mol Biol       Date:  2022

2.  Machine-Learning-Based Genome-Wide Association Studies for Uncovering QTL Underlying Soybean Yield and Its Components.

Authors:  Mohsen Yoosefzadeh-Najafabadi; Milad Eskandari; Sepideh Torabi; Davoud Torkamaneh; Dan Tulpan; Istvan Rajcan
Journal:  Int J Mol Sci       Date:  2022-05-16       Impact factor: 6.208

3.  YOLO-VOLO-LS: A Novel Method for Variety Identification of Early Lettuce Seedlings.

Authors:  Pan Zhang; Daoliang Li
Journal:  Front Plant Sci       Date:  2022-02-24       Impact factor: 5.753

4.  Application of artificial neural networks and genetic algorithm to predict and optimize greenhouse banana fruit yield through nitrogen, potassium and magnesium.

Authors:  Mahmoud Reza Ramezanpour; Mostafa Farajpour
Journal:  PLoS One       Date:  2022-02-14       Impact factor: 3.240

5.  Artificial neural network-based model to predict the effect of γ-aminobutyric acid on salinity and drought responsive morphological traits in pomegranate.

Authors:  Saeedeh Zarbakhsh; Ali Reza Shahsavar
Journal:  Sci Rep       Date:  2022-10-05       Impact factor: 4.996

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.