| Literature DB >> 30952873 |
Devin A Gredell1, Amelia R Schroeder2, Keith E Belk1, Corey D Broeckling3, Adam L Heuberger4, Soo-Young Kim5, D Andy King6, Steven D Shackelford6, Julia L Sharp5, Tommy L Wheeler6, Dale R Woerner1, Jessica E Prenni7.
Abstract
Ambient mass spectrometry is an analytical approach that enables ionization of molecules under open-air conditions with no sample preparation and very fast sampling times. Rapid evaporative ionization mass spectrometry (REIMS) is a relatively new type of ambient mass spectrometry that has demonstrated applications in both human health and food science. Here, we present an evaluation of REIMS as a tool to generate molecular scale information as an objective measure for the assessment of beef quality attributes. Eight different machine learning algorithms were compared to generate predictive models using REIMS data to classify beef quality attributes based on the United States Department of Agriculture (USDA) quality grade, production background, breed type and muscle tenderness. The results revealed that the optimal machine learning algorithm, as assessed by predictive accuracy, was different depending on the classification problem, suggesting that a "one size fits all" approach to developing predictive models from REIMS data is not appropriate. The highest performing models for each classification achieved prediction accuracies between 81.5-99%, indicating the potential of the approach to complement current methods for classifying quality attributes in beef.Entities:
Mesh:
Year: 2019 PMID: 30952873 PMCID: PMC6450883 DOI: 10.1038/s41598-019-40927-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Histogram of slice shear force values and tenderness classifications as tender (<20.0 kg) and tough (≥20.0 kg).
Summary of classification groupings and number of observations used for each of the four model sets.
| Classifications (# of observations) | Model Sets | |||
|---|---|---|---|---|
| Main | Specialized | Breed | Tenderness | |
| Dark Cutter (41) | × | × | ||
| Top Choice/Prime* (81) | × | |||
| Low Choice/Select* (84) | × | |||
| Wagyu (42) | × | × | ||
| Grass fed (42) | × | × | ||
| Tender (215) | × | |||
| Tough (74) | × | |||
| Angus (159) | × | |||
| Not Angus (46) | × | |||
R packages and functions used for the development of final predictive models.
| Machine Learning Algorithm | R Function | Package |
|---|---|---|
| LDA | lda() | MASS |
| LogitBoost | LogitBoost() |
|
| PDA | fda () | mda |
| SVM | svm() |
|
| XGBoost | xgboost() |
|
| PLSDA | plsDA() | DiscriMiner |
Number (percent) of predictors for each dimension reduction technique for each model set.
| Model Set | Original | PCA | FS | PCA-FS |
|---|---|---|---|---|
| Main | 1,700 (100%) | 289 (17%) | 229 (13.47%) | 24 (1.41%) |
| Specialized | 1,700 (100%) | 124 (7.29%) | 60 (3.53%) | 8 (0.47%) |
| Breed | 1,700 (100%) | 203 (11.94%) | 602 (35.41%) | 38 (2.24%) |
| Tenderness | 1,700 (100%) | 289 (17%) | 67 (3.94%) | 16 (0.94%) |
Figure 2Visualization of the PLSDA model for each of the model sets. Plots represent the first two PLS components of the PCA-FS reduced data.
Figure 3Prediction accuracies (based on 10-fold cross validation) for the top performing machine learning algorithm and data reduction approach combinations for each model set.
Summary of final prediction accuracies based on 100 fold cross validation for the top machine learning algorithm and data reduction approach combination for each model set after parameter tuning.
| Model Set | Dimension Reduction Approach | Number of predictors | Machine Learning Algorithm | Final Accuracy Rate |
|---|---|---|---|---|
| Main | PCA-FS | 24 PCs | LDA |
|
| Specialized | FS | 60 mass-bins | SVM - Linear |
|
| Breed | PCA-FS | 38 PCs | SVM - Radial |
|
| Tenderness | FS | 67 mass-bins | XGBoost |
|