| Literature DB >> 33158206 |
Didem Peren Aykas1,2, Christopher Ball3, Amanda Sia1, Kuanrong Zhu1, Mei-Ling Shotts1, Anna Schmenk1, Luis Rodriguez-Saona1.
Abstract
This study evaluates a novel handheld sensor technology coupled with pattern recognition to provide real-time screening of several soybean traits for breeders and farmers, namely protein and fat quality. We developed predictive regression models that can quantify soybean quality traits based on near-infrared (NIR) spectra acquired by a handheld instrument. This system has been utilized to measure crude protein, essential amino acids (lysine, threonine, methionine, tryptophan, and cysteine) composition, total fat, the profile of major fatty acids, and moisture content in soybeans (n = 107), and soy products including soy isolates, soy concentrates, and soy supplement drink powders (n = 15). Reference quantification of crude protein content used the Dumas combustion method (AOAC 992.23), and individual amino acids were determined using traditional protein hydrolysis (AOAC 982.30). Fat and moisture content were determined by Soxhlet (AOAC 945.16) and Karl Fischer methods, respectively, and fatty acid composition via gas chromatography-fatty acid methyl esterification. Predictive models were built and validated using ground soybean and soy products. Robust partial least square regression (PLSR) models predicted all measured quality parameters with high integrity of fit (RPre ≥ 0.92), low root mean square error of prediction (0.02-3.07%), and high predictive performance (RPD range 2.4-8.8, RER range 7.5-29.2). Our study demonstrated that a handheld NIR sensor can supplant expensive laboratory testing that can take weeks to produce results and provide soybean breeders and growers with a rapid, accurate, and non-destructive tool that can be used in the field for real-time analysis of soybeans to facilitate faster decision-making.Entities:
Keywords: SIMCA; essential amino acids; fat content; major fatty acids; near-infrared spectroscopy; partial least square regression; protein content; soybean
Mesh:
Substances:
Year: 2020 PMID: 33158206 PMCID: PMC7662469 DOI: 10.3390/s20216283
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Photograph of the handheld NIR prototype sensor used for soybean samples spectra collection (a) and an inner view of the spectrometer housing (b).
Figure A1Noise level information by the signal ratio and taking the standard deviation of a random sample’s spectral replications and their corresponding spectra.
Figure A2The reproducibility of the spectral measurements for a random sample that is tested with 3-h intervals. Spectra were collected at given times; A: 11 p.m. B: 2 p.m. C: 5 p.m.
Reference analysis results for essential amino acid (lysine, threonine, methionine, tryptophan, and cysteine), total protein, major fatty acid (palmitic, stearic, oleic, linoleic, and linolenic), fat, and moisture content in soybeans and soy products.
| Parameter (%) * | Minimum | Maximum | Mean | STDEV ** | CV% *** | |
|---|---|---|---|---|---|---|
| Threonine | Soybean | 1.34 | 1.56 | 1.45 | 0.06 | 4.17 |
| Soy Products | 1.75 | 3.20 | 2.64 | 0.54 | 20.46 | |
| Cysteine | Soybean | 0.45 | 0.60 | 0.55 | 0.04 | 7.20 |
| Soy Products | 0.63 | 1.06 | 0.92 | 0.14 | 15.52 | |
| Methionine | Soybean | 0.47 | 0.64 | 0.51 | 0.04 | 5.02 |
| Soy Products | 0.63 | 1.14 | 0.95 | 0.20 | 22.39 | |
| Lysine | Soybean | 2.34 | 2.57 | 2.43 | 0.07 | 2.88 |
| Soy Products | 2.91 | 5.54 | 4.50 | 1.08 | 24.37 | |
| Tryptophan | Soybean | 0.35 | 0.54 | 0.44 | 0.04 | 10.04 |
| Soy Products | 0.60 | 1.32 | 0.99 | 0.23 | 23.31 | |
| Total Protein | Soybean | 32.48 | 37.4 | 34.12 | 0.89 | 1.81 |
| Soy Products | 42.96 | 81.91 | 67.39 | 14.46 | 21.45 | |
| Palmitic Acid | Soybean | 6.22 | 13.4 | 9.19 | 2.57 | 22.93 |
| Stearic Acid | 3.53 | 5.21 | 4.40 | 0.54 | 11.79 | |
| Oleic Acid | 17.60 | 84.00 | 52.83 | 28.05 | 42.04 | |
| Linoleic Acid | 4.10 | 57.40 | 27.29 | 22.88 | 91.23 | |
| Linolenic Acid | 1.88 | 8.19 | 4.53 | 2.43 | 30.38 | |
| Fat | 16.07 | 16.97 | 16.35 | 0.18 | 1.11 | |
| Moisture | 5.30 | 5.68 | 5.49 | 0.09 | 1.59 |
* Individual amino acids (threonine, cysteine, methionine, lysine, and tryptophan), total protein, fat, and moisture are given in % wet basis; fatty acids (palmitic, stearic, oleic, linoleic, and linolenic) are given in % peak area from the GC analysis. ** Standard deviation. *** Coefficient of variation.
Total protein, fat content, and fatty acid composition comparison of high-oleic and conventional soybean samples.
| Parameter (%) * | High-Oleic | Conventional | |
|---|---|---|---|
| Total Protein | 34.17 ± 0.61 | 33.66 ± 0.60 | 0.000 |
| Fat | 16.42 ± 0.19 | 16.27 ± 0.10 | 0.000 |
| Palmitic Acid | 7.00 ± 0.52 | 11.95 ± 0.69 | 0.000 |
| Stearic Acid | 3.87 ± 0.33 | 4.91 ± 0.23 | 0.000 |
| Oleic Acid | 79.25 ± 2.00 | 23.35 ± 3.65 | 0.000 |
| Linoleic Acid | 5.99 ± 1.32 | 51.61 ± 3.13 | 0.000 |
| Linolenic Acid | 2.21 ± 0.32 | 7.09 ± 0.53 | 0.000 |
* Total protein and fat contents are given in % wet basis; fatty acids (palmitic, stearic, oleic, linoleic, and linolenic) are given in % peak area from the GC analysis. ** indicates the significant difference between high-oleic and conventional soybeans (p < 0.05).
Figure 2Raw NIR spectra of ground soybeans (high-oleic and conventional varieties), soy isolate, and soy concentrate.
Figure 3Soft independent modeling of class analogy (SIMCA) 3D projection plot for high-oleic and conventional soybean varieties (a) SIMCA discriminating plot based on the NIR spectra of high-oleic and conventional soybean samples using the handheld NIR sensor, showing bands and regions responsible for class separation (b).
Statistical performance of the prediction models developed using a handheld NIR sensor for estimating various constituents of soy samples.
| Parameter (%) * | Calibration Model | External Validation Model | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Range | N a | Factor | RMSECV b | Rcv c | Range | n d | RMSEP e | RPre f | RPD g | RER h | |
| Threonine | 1.34–3.20 | 26 | 6 | 0.05 | 1.00 | 1.44–3.19 | 6 | 0.08 | 1.00 | 8.7 | 22.3 |
| Cysteine | 0.45–1.06 | 26 | 4 | 0.03 | 0.99 | 0.54–1.02 | 6 | 0.02 | 0.99 | 8.8 | 19.9 |
| Methionine | 0.47–1.14 | 26 | 4 | 0.04 | 0.99 | 0.47–1.13 | 6 | 0.07 | 0.97 | 3.8 | 9.7 |
| Lysine | 2.34–5.54 | 26 | 5 | 0.17 | 0.99 | 2.38–5.48 | 6 | 0.15 | 1.00 | 8.6 | 21.2 |
| Tryptophan | 0.35–1.32 | 26 | 4 | 0.04 | 0.99 | 0.42–1.26 | 6 | 0.04 | 0.99 | 8.2 | 21.2 |
| Total Protein | 32.48–81.91 | 73 | 4 | 1.51 | 0.99 | 33.28–81.15 | 18 | 1.64 | 0.99 | 8.3 | 29.2 |
| Palmitic Acid | 6.50–13.00 | 77 | 5 | 0.49 | 0.97 | 6.40–12.50 | 19 | 0.40 | 0.98 | 4.8 | 15.1 |
| Stearic Acid | 3.43–5.36 | 70 | 6 | 0.21 | 0.91 | 3.44–5.15 | 17 | 0.21 | 0.93 | 2.4 | 8.3 |
| Oleic Acid | 17.60–84.00 | 76 | 5 | 3.04 | 0.99 | 17.20–79.90 | 19 | 3.07 | 0.99 | 8.1 | 20.5 |
| Linoleic Acid | 4.10–54.60 | 77 | 5 | 2.48 | 0.99 | 4.90–57.40 | 19 | 2.71 | 0.99 | 7.2 | 19.4 |
| Linolenic Acid | 1.90–8.50 | 76 | 5 | 0.55 | 0.94 | 3.50–7.80 | 19 | 0.56 | 0.95 | 2.8 | 7.6 |
| Fat | 16.07–16.97 | 46 | 6 | 0.05 | 0.95 | 16.07–16.84 | 12 | 0.07 | 0.96 | 2.6 | 13.3 |
| Moisture | 5.32–5.68 | 45 | 6 | 0.04 | 0.91 | 5.30–5.58 | 11 | 0.04 | 0.92 | 2.4 | 7.5 |
a Number of samples used in calibration models. b Root mean square error of cross-validation. c Correlation coefficient of cross-validation. d Number of samples used in external validation models. e Root mean square error of prediction. f Correlation coefficient of prediction for external validation. g Residual predictive deviation. h Range error ratio. * Individual amino acids (threonine, cysteine, methionine, lysine, tryptophan), total protein, fat, and moisture are given in % wet basis; fatty acids (palmitic, stearic, oleic, linoleic, linolenic) are given in % peak area from the GC analysis. RMSECV and RMSEP are in units of the predicted parameters.
Figure 4Partial least squares regression (PLSR) calibration and external validation plots for total protein (a), lysine (b), fat (c), and oleic acid (d) contents in soybean samples using the handheld NIR sensor. Grey diamonds represent samples in the calibration set; black diamonds represent samples in external validation set.
Figure A3Partial least squares regression (PLSR) calibration and external validation plots for threonine (a), cysteine (b), methionine (c), tryptophan (d), palmitic acid (e), stearic acid (f), linoleic acid (g), linolenic acid (h), and moisture (i) contents in soybean samples using the handheld NIR sensor. Grey diamonds represent samples in calibration set; black diamonds represent samples in external validation set.