Erick K Towett1, Lee B Drake2, Gifty E Acquah3, Stephan M Haefele3, Steve P McGrath3, Keith D Shepherd1. 1. World Agroforestry (ICRAF), Nairobi, Kenya. 2. Department of Anthropology, University of New Mexico, Albuquerque, NM, United States of America. 3. Department of Sustainable Agricultural Sciences, Rothamsted Research, Harpenden, United Kingdom.
Abstract
Portable X-ray fluorescence (pXRF) and Diffuse Reflectance Fourier Transformed Mid-Infrared (DRIFT-MIR) spectroscopy are rapid and cost-effective analytical tools for material characterization. Here, we provide an assessment of these methods for the analysis of total Carbon, Nitrogen and total elemental composition of multiple elements in organic amendments. We developed machine learning methods to rapidly quantify the concentrations of macro- and micronutrient elements present in the samples and propose a novel system for the quality assessment of organic amendments. Two types of machine learning methods, forest regression and extreme gradient boosting, were used with data from both pXRF and DRIFT-MIR spectroscopy. Cross-validation trials were run to evaluate generalizability of models produced on each instrument. Both methods demonstrated similar broad capabilities in estimating nutrients using machine learning, with pXRF being suitable for nutrients and contaminants. The results make portable spectrometry in combination with machine learning a scalable solution to provide comprehensive nutrient analysis for organic amendments.
Portable X-ray fluorescence (pXRF) and Diffuse Reflectance Fourier Transformed Mid-Infrared (DRIFT-MIR) spectroscopy are rapid and cost-effective analytical tools for material characterization. Here, we provide an assessment of these methods for the analysis of totalCarbon, Nitrogen and total elementalcomposition of multiple elements in organic amendments. We developed machine learning methods to rapidly quantify the concentrations of macro- and micronutrient elements present in the samples and propose a novel system for the quality assessment of organic amendments. Two types of machine learning methods, forest regression and extreme gradient boosting, were used with data from both pXRF and DRIFT-MIR spectroscopy. Cross-validation trials were run to evaluate generalizability of models produced on each instrument. Both methods demonstrated similar broad capabilities in estimating nutrients using machine learning, with pXRF being suitable for nutrients and contaminants. The results make portable spectrometry in combination with machine learning a scalable solution to provide comprehensive nutrient analysis for organic amendments.
Small-scale farmers produce 80% of the food supply in developing countries, and investments to improve their productivity are urgently needed [1]. Achieving higher rates of productivity will need to rely on improved technologies such as high-yielding crop varieties, better and more inorganic fertilizer, and more efficient use of available resources, for example manures and other organic fertilizers. Effective quality assurance mechanisms can help address three challenges facing scaling-up efforts in supply chains for foods in developing countries of Sub-Saharan Africa (SSA): sourcing, market size and consumer trust [2]. Clark and Hobbs [2] performed supply chain analysis to evaluate how stakeholder actions and relationships influence the dynamics of complementary food markets in SSA and argued that effective signalling of credence attributes via credible quality assurance can contribute to the sustainability of localcomplementary food supply chains and once established, may contribute to the long-term affordability, accessibility and availability of these foods in SSA. Establishing regional and/or country-level quality assurance mechanisms for agro-inputs (e.g. organic fertilizers) quality requires the coordination of stakeholder actions to address food insecurity. This is particularly important for subsistence farmers, for whom organic farming is often the only available option for at least part of their farm. Inorganic fertilizers are expensive for poor farmers [3], and their use is often restricted by cash flow problems. When small farmers do use inorganic fertilizer, it often is applied in pockets or provided to specific plants [4], or to cash crops such as cotton [5]. Hence, organic amendments (OA) such as cattle manure and mixed farmyard manure are necessary to replenish nutrients in the majority of smallholder plots. Consequently, organic amendments have been found to be essentialcomponent of strategies for integrated soil fertility management to maintain soil nutrients in both mixed crop and livestock agriculture, which are common across smallholder farmers [6-9].Organic amendments comprise a variety of plant-derived materials that range from dried plant materials, to animal manures and litters, and agricultural by-products and sewage sludges. The nutrient content of such organic amendments varies greatly among source materials but is usually substantially lower in organic fertilizers as compared with chemicalfertilizers. However, organic amendments contain macro- and micro-nutrients, and may provide added value compared with standard mineralfertilisers [10]. Organic amendments also improve soil structure, increase water holding capacity and promote biological activity [11], but what is less clear from published evidence is the relationship between an action to improve soil structure (for example addition of OA to soil) and the magnitude of change in the associated benefit (for example increase in soil carbon). But despite these potential benefits, OA may be unbalanced in terms of relative availability of nutrients [10].To achieve an efficient combination of organic and inorganic fertilizers as well as the adjustment to crops and soils, knowledge of the composition of the OA is essential. Traditional laboratory methods for analyzing major and trace elements in OA use a combination of homogenization, drying and preparation of individual samples followed by acid digestion and determination of elements by atomic absorption spectrophotometry (AAS), inductively coupled plasma optical emission spectrometry or mass spectroscopy (ICP-OES or ICP-MS). Totalcarbon and nitrogen need a separate analysis, today usually done by routine dynamic combustion methods on an elemental analyzer. While these methods are well established, the associated sample preparation procedures are time-consuming and expensive. As such, they are not scalable solutions to assist subsistence farmers in developing countries. New cheap and fast analytical methods are therefore needed to enable smart nutrient resource management.The commercial availability of portable Energy Dispersive X-ray Fluorescence (EDXRF) has enabled wider use of non-destructive total element analysis in multiple material types [12]. The range of elements which can be determined is limited by the sensitivity of the detector and its energy range; for portable EDXRF the range is from sodium (atomic number 11) to uranium (atomic number 92). The data obtained are counts of photons emitted at different energy levels and need to be calibrated to provide quantitative values [13]. This is often done using fundamental physical parameters, which employ any number of equations to estimate elementalconcentrations in analytes [14]. This approach functions well for metals [15] but runs into difficulty when applied to oxides, where different oxides and carbonates can frustrate quantification [16]. However, when appropriately calibrated, portable XRF (pXRF) systems give results at the μg/g level [17]. In some cases, portable equipment is comparable to laboratory systems [18] when calibrated empirically. The quality standard for laboratory and portable equipment is based on both validity and reliability (Fig 1) [19].
Fig 1
Core concepts in portable instrumentations, adopting the framework by Hughes (1998) [18].
Testing XRF to quantify a wide range of elementalcontents in a range of organic fertilizers and their effect on crop produce could accelerate the assessment of the suitability of new fertilizer products and a variety of OA. Recently, Sapkota et al. [20] indicated that elementalconcentration can accurately be measured in dried and moist manure samples using pXRF. Likewise, Roa-Espinosa et al. [21] found that XRF spectrometry can be used as a rapid and precise method for quantitative elemental analysis of macro- and micronutrients in dairy manure. Thomas et al. [22] also demonstrated that pXRF is a reliable, cost-effective tool for screening potential organic fertilizers and their effect on grains and crop residues.Diffuse Reflectance Spectroscopy (DRS) is another method emerging as a rapid and cost-effective alternative to routine laboratory analysis for multiple sample matrices such as soils, plants and manures based on the interaction of electromagnetic energy with matter. It is already well established that mid-infrared spectroscopy (range: 4000–400 cm-1) (MIR) is useful for estimating a number of chemical and physical soil properties (such as pH, cation exchange capacity (CEC), carbonatecontent, organic carboncontent, soil texture, mineralcomposition, organic matter and watercontent (hydration, hygroscopic and free pore water, etc.) from a single soil spectrum with minimal sample preparation [23]. Examples of applications of infrared spectroscopy in agricultural inputs (manures/compost/bio-wastes/litter/organic resource quality) include the analyses of the following properties: Moisture, pH, totalN, NH4-N, total dissolved N, suspended N, soluble reactive, K, Ca, Mg, total dissolved P, suspended C, P, Ca, Na, and Mg; various salts and metals; lignin, total soluble polyphenols, decomposition rate, in vitro dry matter digestibility, C and N mineralisation potential, compost maturity [23]. Spectral absorbances from the MIR range can be calibrated using regression models to predict multiple constituents in seconds in a wide range of materials and the accuracy of these regression models relies heavily on the calibration dataset used. Although MIR spectroscopy has so far had limited use in developing countries, it has potential to make a huge contribution in helping these countries accelerate agricultural development while safeguarding their environment in the drive towards achieving the sustainable development goals [23].A major challenge for operationalising MIR and pXRF is how to provide robust calibrations that hold up over a wide range of materials and across instruments. Solutions are needed to lay the foundations for a comprehensive nutrient measurement system that would require minimum sample preparation and meets the needs of validity and replicability [19]. This paper aimed to test machine learning methods for calibrating pXRF and MIR, both independently and combined, for analysis of macronutrient content and potentialcontaminants of a wide range of organic amendments. Specific sub-objective here were to examine the inter-instrument variability of six pXRF instruments and conduct an assessment of Diffuse Reflectance Fourier Transformed Mid-Infrared (DRIFT-MIR) spectroscopy for OA analysis by utilizing the MIR spectra of OA samples. Targeted characteristics were total C and N, as well as ash and other macro and micronutrients, in parallel with the pXRF analysis of the same parameters.
Materials and methods
Ninety-eight organic amendment samples were obtained from a wide range of sources in Western Kenya (64) and the United Kingdom (34). Samples from Kenya included manure from cattle, goats, poultry and pigs whereas samples from the UK included cattle manure, mixed farmyard manure, sewage sludge and sewage sludge compost. Conventional analysis for major- and micronutrients, as well as totalcarbon and ash content were conducted at Rothamsted Research (UK). Total C and totalN were determined using a modified Dumas method on a Leco TruMac combustion analyser. Major and trace elements were determined using ICP-OES or ICP-MS (inductively coupled plasma optical emission spectrometry/mass spectrometry) analysis after nitric/perchloric acid (85/15 v/v) or Aqua Regia (HCl:HNO3 4:1) digestion of test samples in open tubes at up to 175 oC [24, 25]. The ash content was determined using the loss-on-ignition method. Test samples that had been oven dried (105 oC for at least 5 hours) were ashed in a furnace at 550 oC for two hours and weighed to calculate the relative ash content (%).Six Bruker Tracer 5i XRF instruments (900F4352, 900F4473, 900F4504, 900F4166, 900F5118, 900F5163) with Rhodium tubes were used to collect data. The units had resolutions (full width height maximum, or FWHM) of 135 eV at the Manganese K-alpha line. Samples were analyzed with a voltage of 10 kV and a current of 70 μA for 90 seconds with no filter for the elemental range of Na, Mg, P, S, K, and Ca. In addition, the samples were analyzed with 35 kV and a current of 35 μA for 90 seconds with a filter (Cu 75 μm: Ti 25 μm: Al 200 μm) for the elements Mn, and Fe. Scans were collected from air-dried and milled (to pass a 75- microns mesh sieve) samples presented as loose powder in XRF cups lined with Prolene film.The MIR spectra of the OA were also acquired on air-dried and ground (to pass a 75- microns mesh sieve) samples using a Bruker Alpha KBr DRIFT-MIR spectrometer. On each sample holder, samples were loaded in a single replication. During scanning at each spot/sample, 32 co-added scans were collected at a resolution of 4 cm-1. Spectra were truncated to 4000–600 cm1 and regions showing atmospheric CO2features (2379.8–2350.8 cm-1) were removed. Raw MIR spectra (Fig 2A) were transformed using the Savitzky-Golay (SG) transformation [26] with a window size of 21 data points and a polynomial order 3 [27] and followed by the first derivative transformation (Fig 2B) before developing calibration models. All MIR spectra derivations and calculations were performed with R statistical language and open source software [28]. PrincipalComponent Analysis (PCA) was performed on the pre-processed MIR spectra of all the OA samples.
Fig 2
Illustrations of the diffuse reflectance mid infrared (DRIFT-MIR) spectra from the organic amendment (OA) samples.
(A) raw spectra, (B) pre-processed spectra and (C) Principal Component Analysis (PCA) scores plot (PC1 vs. PC2) for mid infrared (MIR) spectra from the OA samples (n = 98) based on sample types. The PCA was performed on the pre-processed MIR spectra.
Illustrations of the diffuse reflectance mid infrared (DRIFT-MIR) spectra from the organic amendment (OA) samples.
(A) raw spectra, (B) pre-processed spectra and (C) PrincipalComponent Analysis (PCA) scores plot (PC1 vs. PC2) for mid infrared (MIR) spectra from the OA samples (n = 98) based on sample types. The PCA was performed on the pre-processed MIR spectra.Machine learning has seen rapid advancement in the past decade owing to their implementation in open source languages such as R and python, as well as increasingly affordable high-powered processors on computers. Forest regressions are one of the simplest implementations of these techniques in which variables are randomly selected and iterated over a set number of sampling events and a set number of trees with a set number of iterations. From these, a weighted final model is produced which accounts for variable importance in a way which is generalizable (e.g. low risk of overfitting). A more advanced technique, extreme gradient boosting (XGBoost), was also used [29-31].Typically, variables are defined via general parameters (e.g. a range of energies in a spectrum) which correspond to known responses, such as the Kα1 or Lα1 emission line for an element in pXRF. In lieu of pre-defining pXRF lines for each element in a traditional manner [32], the entire spectrum was used for machine-learning models. While it is standard practice to use the full DRIFT-MIR spectra in calibrations and variables are also defined via general parameters (e.g. a range of wavenumbers in a spectrum) which correspond to known responses, such as a functional group, it is not common for pXRF spectra. This removes the need to pre-define variables in pXRF analysis (e.g. Ca Kα1) and instead uses the whole spectrum without human input to evaluate models. This approach also allows an independent variable, such as ash content, to be quantified if suitable data is provided for training. Further, it allows a visualization of the spectrum in terms of its predictive properties for a given independent variable. Using the whole spectrum enables full automation of the calibration process for pXRF.Forest models for pXRF and DRIFT-MIR spectra were run using 1500 trees with 200 resampling events using k-fold cross validation over 25 iterations based on the R package randomForest (4.6–14). The best forest models were selected using root mean square error (RMSE) in the caret package (6.0–84) using the R language (3.6.0). XGBoost models differ from forest in that trees can be weighted differently, have fixed depths, and different resembling. For example, trees can be built from randomly selected columns (Energies for pXRF spectra) and rows (standards). XGBoost (0.82.1) models were run using 400 rounds with a variable tree depth ranging from 5 to 25. pXRF energy channels (columns) were randomly selected with a range of 40–60%, with samples (rows) randomly selected with the same frequency. Learning rates (eta) were constrained to values between 0.1 and 0.3, with gamma regularization ranging from 0 to 0.1. The minimalchild weight (controls the model complexity) was limited to 1. Unique combinations of these variables were run over 32 iterations with k-fold using caret (6.0–82) and the best model was selected using root mean square error (RMSE). Calibrations were created using CloudCal (v3) [33].The models were evaluated using randomized cross-validation and a hold-out validation consisting of 67/33% split between calibration and validation data. This high number of standards randomly withheld from training (33%) was used to test the generalizability of models and ensure that machine learning wasn’t simply memorizing data sets. Both the R2 value and validation slope of the regression line between observed and predicted values for all cross-validation trials were used to evaluate the best model because (a) both metrics should approach 1 as models increase in accuracy and (b) while R2 provides information regarding model precision, the validation slope gives the clearest assessment of model accuracy; a validation slope of 1 would indicate a 1:1 ratio between predicted and known values.
Results
The wet chemistry analysis data of the nutrient values varied greatly among OA and showed the considerable diversity in the samples (Table 1). This was also confirmed by the PCA scores plot (PC1 vs. PC2) for DRIFT-MIR spectra from the OA samples (Fig 2C). The difference in the OA types is clearly shown; cattle manure seemed to group with goat and poultry manure as well as compost whereas pig manure grouped together with sewage sludge. Also, there was more variance in the cattle manure group than in the sewage sludge group. The means and ranges to in elementalcontents and major nutrients varied within the selected OA samples (Table 1). Nutrient concentrations ranged widely, for example 0.02–5.41% for N and 0.06–3.40% for P. Ash content as an indicator of the inorganic component ranged from 11.6–94.9%. The wide range of characteristics in the samples was also confirmed by the PCA scores plot (PC1 vs. PC2) for MIR spectra (Fig 2C).
Table 1
Descriptive statistics for total carbon (C), total N, ash content, major nutrients and potential contaminants of all organic amendment (OA) samples used for the calibration and validation of pXRF and DRIFT-MIR methods.
Percentile
Property
Units
Mean
Std dev.
Min
Max
2.5th
25th
50th
75th
97.5th
Ash
%
63.8
21.5
11.6
94.9
16.0
52.8
70.1
81.5
88.8
Total C
%
19.4
11.8
1.23
44.7
5.30
9.95
15.8
27.5
43.5
Total N
%
1.65
1.32
0.02
5.41
0.40
0.75
1.11
2.29
5.27
P
%
0.64
0.85
0.06
3.40
0.08
0.13
0.19
0.99
3.00
K
%
0.82
0.77
0.07
4.06
0.10
0.35
0.60
1.01
2.87
Ca
%
1.99
2.73
0.25
21.14
0.37
0.64
0.86
2.87
8.50
Al
%
1.95
1.36
0.01
8.97
0.02
0.87
1.84
2.71
4.28
Mg
%
0.39
0.34
0.13
2.79
0.14
0.24
0.31
0.43
1.29
Na
%
0.14
0.73
0.00
7.21
0.01
0.01
0.02
0.06
0.44
S
%
0.33
0.38
0.03
1.74
0.04
0.08
0.14
0.57
1.31
Mn
%
0.05
0.05
0.01
0.45
0.02
0.04
0.05
0.06
0.09
Fe
%
2.38
2.78
0.05
24.96
0.09
1.18
1.79
2.83
6.64
Zn
mg kg-1
571
1151
41
6553
43
63
92
432
3828
Cu
mg kg-1
278
762
10
5080
11
17
23
143
1413
Ni
mg kg-1
70
142
3.0
1015
3.4
21
29
46
474
Cd
mg kg-1
8.2
25
0.1
137
0.2
0.9
1.5
2.2
108
Pb
mg kg-1
129
310
0.2
1398
0.5
5.3
8.3
35
1263
To investigate the effect of the inter-instrumental variability on the pXRF regression models and on the prediction accuracy, calibration curves were generated for each instrument and different chemical properties. Photon data from each instrument varied, with instrument 900F4166 producing lower counts and instrument 900F4473 producing the highest (Fig 3A). For example, instrument 900F4473 had >2,500 counts per second for phosphorus K-alpha and a concentration of about 3.2% P, while instruments 900F4166 and 900F5188 had the same 2,500 counts for a much higher P concentration of >6.5%. But these differences between instruments can be corrected with instrument specific calibrations as shown in Fig 3B, where the predictions for phosphorusconcentration from the different instruments overlay each other and are all close to the values measured by ICP-OES.
Fig 3
Inter-instrumental variability of pXRF instruments.
Fig 3A shows the relation between phosphorus-specific counts per second for six different instruments. XRF4166 line is hidden behind the blue XRF 5118 line in the left figure. Fig 3B shows the estimated phosphorus content of the samples after instrument specific calibration. The dotted line on the right plot indicates the expected 1:1 ratio for XRF estimates and known values and the shaded areas around all lines are 95% confidence predictions based on the calibration.
Inter-instrumental variability of pXRF instruments.
Fig 3A shows the relation between phosphorus-specific counts per second for six different instruments. XRF4166 line is hidden behind the blue XRF 5118 line in the left figure. Fig 3B shows the estimated phosphoruscontent of the samples after instrument specific calibration. The dotted line on the right plot indicates the expected 1:1 ratio for XRF estimates and known values and the shaded areas around all lines are 95% confidence predictions based on the calibration.Next, spectroscopy (PXRF and DRIFT-MIR) in conjunction with both chemometric techniques was tested to predict various chemical properties of OA. The predictions of the calibration models compared to the measured values are shown in the Figs 4 and 5 (examples are ash and Ncontent, respectively). For both examples, the figures show generally good predictions with little error (R2 values above 0.98) and a close fit of actual and predicted concentrations (the regression line for all data points is close to the 1:1 line). For the calibrations, 67% of all OA samples were used (randomly selected). A summary of the regression coefficients for all calibrations conducted for both methods (pXRF and DRIFT-MIR), both models (Forest or XGBoost) and all OA characteristics determined is provided in Table 2. It shows that very good regressions were achieved for most characteristics, but they also indicate differences in the predictive value between methods and models used. Across both methods and models, good to very good regressions were achieved for ash, total C, totalN, P, K, Ca, S and Fe (R2 > 0.9). Less good, but still acceptable regressions were found for the elements Mg, Na and Mn (0.7 > R2 > 0.9). Against conventional knowledge, XRF performed well to predict total C, totalN and ash in OA, which cannot be predicted with XRF based on known element specific Kα1 or Lα1 emission lines. Differences in predictive power of the calibration functions between both methods (pXRF and DRIFT-MIR) and for the characteristics tested were small in most cases. Only in the case of Na at higher concentrations did both methods underestimate the actualconcentration (S2 Fig in S1 File).
Fig 4
Comparison of actual and predicted ash content of organic amendment (OA) samples based on whole-spectrum forest regressions for pXRF (instrument 900F4473) and DRIFT-MIR.
The dotted line indicates the expected 1:1 ratio for estimates and known values. The bottom graphs show the respective response variables in the spectra.
Fig 5
Comparison of actual and predicted total nitrogen (N) content of organic amendment (OA) samples based on whole-spectrum forest regressions for pXRF (instrument 900F4437) and DRIFT-MIR.
The dotted line indicates the expected 1:1 ratio for estimates and known values. The bottom graphs show the respective response variables in the spectra.
Table 2
Correlation (R2) values for pXRF and MIR calibrations for total C, total N ash content, and major nutrients.
pXRF Forest R2
pXRF XGBoost R2
MIR Forest R2
MIR XGBoost R2
Ash (%)
0.94
0.94
0.93
0.94
Total C (%)
0.97
0.92
0.95
0.95
Total N (%)
0.92
0.93
0.92
0.94
P (%)
0.94
0.89
0.92
0.87
K (%)
0.90
0.93
0.84
0.72
Ca (%)
0.98
0.95
0.83
0.81
Mg (%)
0.77
0.71
0.73
0.66
Na (%)
0.91
0.87
0.86
0.81
S (%)
0.98
0.91
0.93
0.83
Mn (%)
0.87
0.73
0.63
NA
Fe (%)
0.95
0.94
0.67
0.77
Zn (ppm)
0.95
0.94
0.00
0
Cu (ppm)
0.92
0.90
0.00
0
Ni (ppm)
0.92
0.97
0.00
0
Cd (ppm)
0.99
0.97
0.24
0
Pb (ppm)
0.95
0.98
0.00
0
For the calibrations, 67% of all organic amendment (OA) samples and Forest or XGBoost models were used.
Comparison of actual and predicted ash content of organic amendment (OA) samples based on whole-spectrum forest regressions for pXRF (instrument 900F4473) and DRIFT-MIR.
The dotted line indicates the expected 1:1 ratio for estimates and known vn class="Chemical">alues. The bottom graphs show the respective response variables in the spectra.
Comparison of actual and predicted total nitrogen (N) content of organic amendment (OA) samples based on whole-spectrum forest regressions for pXRF (instrument 900F4437) and DRIFT-MIR.
The dotted line indicates the expected 1:1 ratio for estimates and known vn class="Chemical">alues. The bottom graphs show the respective response variables in the spectra.
For the calibrations, 67% of all organic amendment (OA) samples and Forest or XGBoost models were used.Next, we used the calibration functions established with two thirds of the total sample number and tested their predictive power with the remaining one third of the samples (validation). Validation results are shown in Fig 6 for the six different pXRF instruments and four selected characteristics (ash, total C, totalN, P). The regression lines indicate relatively good predictions for these “unknown samples” with a decreasing precision in the order of P > C> N > ash. In addition, the regression line for the pXRF predictions were in all four cases close to the expected 1:1 ratio between observed and predicted values. The same validation was conducted for DRIFT-MIR (Fig 7) and precision of predictions decreased in the order of N < ash < C alues was close to the expected 1:1 ratio.
Fig 6
Randomized cross-validation of 6 pXRF instruments for ash, total carbon, total nitrogen and total phosphorus.
For each characteristic, 33% of standards were withheld and treated as unknowns. The dotted line indicates the expected 1:1 ratio for pXRF estimates and known values and the shaded areas around all lines are 95% confidence predictions based on the calibration.
Fig 7
Randomized cross-validation of DRIFT-MIR predictions for ash, total carbon, total nitrogen and total phosphorus.
For each element, 33% of standards were withheld and treated as unknowns. The dotted line indicates the expected 1:1 ratio for DRIFT-MIR estimates and known values. The shaded areas around all lines are 95% confidence predictions based on the calibration.
Randomized cross-validation of 6 pXRF instruments for ash, total carbon, total nitrogen and total phosphorus.
For each characteristic, 33% of standards were withheld and treated as unknowns. The dotted line indicates the expected 1:1 ratio for pXRF estimates and known values and the shaded areas around all lines are 95% confidence predictions based on the calibration.
Randomized cross-validation of DRIFT-MIR predictions for ash, total carbon, total nitrogen and total phosphorus.
For each element, 33% of standards were withheld and treated as unknowns. The dotted line indicates the expected 1:1 ratio for DRIFT-MIR estimates and known values. The shaded areas around all lines are 95% confidence predictions based on the calibration.A summary of all regression coefficients and slopes of the regression line between observed and predicted values for all cross-validation trials with pXRF and MIR and all sample characteristics is shown in Table 3. The results indicate, that the pXRF method allows generally better predictions than the DRIFT-MIR method for most OA characteristics, the exceptions are ash and carbon. Which model (Forest or XGBoost) gives the best result, based on a mixed indicator of regression coefficient and slope of the regression line between observed and predicted values, varies between the OA properties.
Table 3
Correlation coefficients (R2) and the slope of the regression line between observed and predicted values for all cross-validation trials with pXRF and MIR and all sample characteristics measured in the hold-out validation.
pXRF Forest R2
pXRF Forest Slope
pXRF XGBR2
pXRF XGB Slope
MIR Forest R2
MIR Forest Slope
MIR XGB R2
MIR XGB Slope
Best Method
Best Model
Ash (%)
0.83
1.01
0.83
1.01
0.89
1.10
0.86
1.03
MIR
XGBoost
Total C (%)
0.88
1.03
0.87
1.01
0.90
1.11
0.89
1.01
MIR
XGBoost
Total N (%)
0.86
1.00
0.86
1.01
0.87
1.15
0.83
1.02
pXRF
Forest
P (%)
0.66
0.96
0.64
1.06
0.77
1.18
0.60
0.89
pXRF
Forest
K (%)
0.78
0.83
0.79
0.79
0.47
1.06
0.30
0.66
pXRF
Forest
Ca (%)
0.66
0.98
0.63
0.81
0.69
1.52
0.46
0.68
pXRF
Forest
Mg (%)
0.42
0.78
0.43
0.76
0.27
1.24
0.38
0.52
pXRF
XGBoost
Na (%)
0.65
0.97
0.63
0.89
0.65
1.18
0.43
0.62
pXRF
Forest
S (%)
0.70
1.16
0.68
0.99
0.88
1.11
0.73
0.90
pXRF
XGBoost
Mn (%)
0.25
0.77
0.25
0.71
0.08
0.77
NA
NA
pXRF
Forest
Fe (%)
0.89
1.05
0.88
1.05
0.28
1.07
0.19
0.68
pXRF
Forest
Zn (ppm)
0.45
0.83
0.43
0.76
NA
NA
NA
NA
pXRF
Forest
Cu (ppm)
0.83
1.12
0.62
0.81
NA
NA
NA
NA
pXRF
Forest
Ni (ppm)
0.61
1.02
0.59
0.90
NA
NA
NA
NA
pXRF
Forest
Cd (ppm)
NA
NA
NA
NA
NA
NA
NA
NA
NA
NA
Pb (ppm)
0.84
1.01
0.81
0.99
NA
NA
NA
NA
pXRF
Forest
The cross-validation trials used 33% of all organic amendment (OA) samples and Forest or XGBoost models.
The cross-validation trials used 33% of all organic amendment (OA) samples and Forest or XGBoost models.
Discussion
The wide range of materials and element values for calibration and validation was helpful to i) test the potential of the methods and ii) provide robust calibrations. The principalcomponent analysis (Fig 2C) indicated a structural difference between solid OA (manure from cattle, goat, poultry but also compost) and more liquid OA like pig manure and sewage sludge. A wider variety in components of solid OA might be responsible for a larger variance in the cattle manure group than in the sewage sludge group. We also found that spectra of OA (S1 Fig in S1 File) has some resemblance to soil spectra because there is often some soil mixed in with the manure and some soil features were therefore evident (e.g., O-H stretching in clays at 3694 cm1). This similarity occurs even though most soil organic matter derives from the decomposition of plant material added to the soil whereas most organic matter in manure comes from partially digested plant material eaten by the animals. This finding was also in agreement with that of a previous study [34] that found that the quality of most of the manure resources derived from cattle, sheep, goat, chicken in selected household across four district in a semi-arid environment of the North West Province in South Africa had relatively high soil content (mean 22.7%).Using this diverse set of samples, we explored the possibility of a comprehensive analysis of OA, employing machine learning methods to evaluate the MIR and pXRF spectra, namely random forest regressions and extreme gradient boosting. The emergence of these MIR and pXRF systems presents new opportunities for rapid, low-cost analysis of OA samples, both as lab system and portable systems. We hypothesized that pXRF instruments could provide OA data of sufficient accuracy and would reduce the overall time and budget compared with the use of conventional techniques. However, their sensitivity and accuracy are dependent on the instruments’ settings, make and model [35]. We found that the photon data from the various Tracer 5i pXRF instruments varied considerably, with 900F4166 producing much lower counts than the 900F4473 instrument (Fig 3A). This variation results from small changes in the anode thickness of the X-ray tube and imperfections in tube-sample-detector geometry. Therefore, each individual pXRF instrument needs to be calibrated separately.Our assessment of pXRF and DRIFT-MIR spectroscopy for the analysis of total C and totalN as well as total elementalcomposition of multiple elements in OA samples confirmed the potential of these tools. Using forest regression and extreme gradient boosting machine learning models, excellent calibration functions could be established (Table 2; Figs 4 and 5) to rapidly quantify the concentrations of macro- and micronutrient elements present in the OA samples. Both MIR and pXRF had generally good agreement in calibrations for light elements (carbon, nitrogen) and holistic measures of OA quality (ash content), whereas pXRF tended to perform better for most heavier elements, though performance was nearly equal for P and S (Table 2). Forest regressions provided comparable results for MIR and XRF for Mg (Table 3), while Mn had weaker cross-validation results. The general pattern observed for MIR was that it generally performed well on elements that were not transition metals; with the exception being Fe, likely due to its higher abundance.These results are novel for two reasons. First, XRF is unable to measure elements such as carbon or nitrogen directly and yet this study obtained excellent results (R2 > 0.86) with XRF for both elements in both the cross-validation and hold-out sample sets (Tables 2 and 3). MIR is typically unable to measure elements with an atomic number > 11, such as P and K very well and yet we obtained reasonable results (R2 > 0.72) for P in both the calibrations and validation sample sets and acceptable results (R2 > 0.30) for K (Tables 2 and 3). We hypothesized that, because the matrix of the OA tested here is a mixture of organic components and silicates such as clays (S1 Fig in S1 File), there is a necessary inverse correlation between ash and carboncontent. For pXRF analysis, there is also a change in density, as ash may have three times the density of organic material. Scattering of X-rays in this material will lead to different count rates reaching the detector from non-diagnostic portions of the unfiltered spectrum (7–10 keV) (Fig 5). This, coupled with the K-alpha line for silicon, allows for the strong predictive power for ash and Ncontent because of density differences between them. As such, the success in identifying elements such as N (Figs 5 and 6) with portable XRF is understandable in this specific context, but likely not generalizable to plants and soils (the latter can also contain carbonates). However, the principle of narrowly defining a sample matrix to use machine learning techniques will likely produce future advances in the calibration of both x-ray and infrared data, as demonstrated here for OA.Randomized cross validation trials (using 33% of all organic amendment samples and Forest or XGBoost models) confirmed the good predictive value of the XRF and MIR calibrations for most elements/characteristics in the hold-out validation set (Table 3; Figs 6 and 7). However, the cross validations also show that calibration model performance should be evaluated critically. For example, MIR calibration with XGBoost for Ca provides an R2 of 0.81 (Table 2), while the average of 100 cross validation trials of the hold-out samples provides an R2 of only 0.46 (Table 3). This suggests one of two possibilities: either XGBoosting is resulting in an overfit of the data, or there is a critical threshold of the number of standards needed to provide an estimate of Ca that is met by the full dataset (n = 98) but not the partial (n = 65). Either way, the results show that randomized cross validation and hold-out validations are essential to interpret model accuracy and reliability. Our results indicate that MIR + machine learning is not yet a proven method to infer Mg and Mn concentrations in OA, contradicting the study of López-Núñez et al. [36] who claimed good calibrations for pXRF and a wide range of nutrients and contaminants in very similar OA (but the study did not include validations). MIR did not produce useable models for trace elements (Ni, Cu, Zn) or contaminants (Cd, Pb), while both machine learning model types produced useable models from XRF data; this is an unsurprising conclusion as these elements all have fluorescence peaks identifiable to low levels with XRF.Based on theoreticalconsiderations, MIR should be best for light elements like total C and totalN while XRF should be best for evaluating elements like S and P. However, our results indicate that MIR came close to the performance of XRF on elements like P and S, while XRF came close to MIR in estimating total C and N (Table 3, Figs 6 and 7). Both devices had difficulty measuring Mg but surprisingly did better for P (Table 3). Elements which tend to correlate with clay content Na and K are elements which tend to correlate with the ash content, so the higher performance for these elements may be related to the success in identifying ash content. With few exceptions, MIR and XRF were relatively interchangeable techniques for estimating all properties analyzed in this wide range of OA.
Conclusions
We conclude that combining MIR and XRF spectral methods with machine learning techniques enables rapid, portable, and nondestructive measurement of a full suite of nutrients in OA on both devices independently. The approach is also scalable, as the calibration process for XRF can be at least partially automated provided each new instrument is calibrated against common standards. These results are significant in that:XRF is capable of estimating properties like carbon and nitrogencontent of OA;MIR is capable of estimating P and S in OA as well;Portable non-destructive spectrometry paired with machine learning can provide a comprehensive nutrient profile with minimal sample preparation outside a traditional laboratory environment;XRF allows contaminants to be detected, e.g. the presence of trace amounts of potentially toxic metals like e.g. Zn, Cu and Ni–there were good calibration/validations in our study for these elements.If there is one key scope of this work, it is that smallholders need good returns on their investments, especially OA, mostly in terms of increased yields to achieve greater profit and/or food security. Therefore, portable MIR and XRF spectrometers in conjunction with machine learning are adequate solutions to support nutrient management with minimalcost for analysis per sample. For spectrometry at large, machine learning techniques can extract more actionable information than has been previously recognized. To our knowledge, this is the first study to compare the performance of pXRF and FTIR as rapid analytical methods for the determination and monitoring of major and trace nutrient elements in OA. Both methods performed well for most parameters analyzed but pXRF did slightly better for heavier elements whereas FTIR was superior for light elements and the ash content. The combination with machine learning helps to reduce uncertainties in assessing OA quality and, hence, enables better decision-making especially for comprehensive nutrient management for all types of farms. It also allows to identify and avoid contaminated OA fertilizers, thereby protecting soils and the environment from pollution. Future work will now evaluate the use of XRF for testing of conventional mineralfertilizers.(DOCX)Click here for additionn class="Chemical">al data file.
8 Sep 2020PONE-D-20-15361Comprehensive Nutrient Analysis in Agricultural Organic Amendments Through Non-Destructive Assays Using Machine LearningPLOS ONEDear Dr. Erick K Towett,Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.Please submit your revised manuscript by September 21. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.Please include the following iten class="Gene">ms when submitting your revised manusn class="Chemical">cript:
A rebuttn class="Chemical">al letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manusn class="Chemical">cript'.
If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocolsWe look forward to receiving your revised manusn class="Chemical">cript.
Kind regards,Bn class="Chemical">alasubramani Ravindran, Ph.D
Academic EditorPLOS ONEJournn class="Chemical">al Requirements:
When submitting your revision, we need you to address these additionn class="Chemical">al requirements.
1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found athttps://journn class="Chemical">als.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and
https://journn class="Chemical">als.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf
2. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.3. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.Reviewers' n class="Chemical">comments:
Reviewer's Responses to Questionsn class="Chemical">Comments to the Author
1. Is the manuscript technically sound, and do the data support the conclusions?The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********2. Has the statistical analysis been performed appropriately and rigorously?Reviewer #1: NoReviewer #2: YesReviewer #3: Yes**********3. Have the authors made n class="Chemical">all data underlying the findings in their manusn class="Chemical">cript fully available?
The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********4. Is the manusn class="Chemical">cript presented in an intelligible fashion and written in standard English?
PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********5. Review n class="Chemical">Comments to the Author
Please use the space provided to explain your answers to the questions above. You may also include additionalcomments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)Reviewer #1: THE STUDY VALIDATE THE APPLICATION OF XPS FOR THE REAL-TIME SOIL NUTRIENT ANALYSIS.IT IS WELL ACn class="Chemical">COMPLISHED WITH n class="Chemical">NECESSARY METHODS.
the n class="Chemical">conclusion is supported with the necessary results.
authors approach to use mechine learning for the data ann class="Chemical">alysis is to be appreciated.
Reviewer #2: Genern class="Chemical">al n class="Chemical">commends
The author did good piece of work on machine learning helps to find percentage of carbon and nitrogen presence in the soil. XRF is capable of estimating properties like carbon and nitrogen. the work on artificial neural networks was well explained.The author addresses following question1. Inconclusion part authors discussing reason it may be avoid and rewrite.2. From your study, How MIR + machine learning useful to derive finding of n class="Chemical">Mg and Mnn class="Chemical">concentrations?
3. Mentation accuracy difference (Percentage) Tracer 5i pXRF 900F4166 and 900F4473 instrument.4. Line number 211, calibrations, 67% was chosen randomly what could be the reason for choosing 67%. If more 80 % and above does the values change?Reviewer #3: n class="Chemical">Comments to the Authors
The authors have tried to find alternative sources for Comprehensive Nutrient Analysis in Agricultural Organic Amendments through Non-Destructive Assays Using Machine Learning. The work done by the authors is commendable and applaudable. It is considered commendable since this study provides an alternative solution for portable spectrometry in combination with machine learning a scalable solution to provide comprehensive nutrient analysis for organic amendments.AbstractAvoid abbreviations in the abstractIntroductionThe authors have arranged the literatures and context relating the significance of choosing the objective and scope of the study. The following comments in this section areLine 26 - Small-scale and family farmers – Define.Line 27 – Authors are to include some literature related to food supply in developing n class="Chemical">countries. Readers will be interested to understand about what is food supply in developing n class="Chemical">countries through some literatures in introduction section.
Line 109 – The authors are recommended to add certain background studies related to macro and micronutrients. Further, additional details for C and N, as well as ash in this study are also encouraged. This information will act as the state of the art for the readers and also compare with the current research findings of the authors.Materin class="Chemical">als and Methods
The authors have presented the methodology in a standard and technicn class="Chemical">al aspects. However there are certain facts that are to be improved and included to inn class="Chemical">crease the reader’s interest.
Line 176 – Though a detailed explanation on experimental model has been provided. Authors are encouraged to provide real time pictures of the experimental study carried out to increase the reader’s interest (If available). In addition, an image of the field used in this study can be included in the manuscript.ResultsThis section has been explained in appropriate manner.Disn class="Chemical">cussionThis section has been discussed in detail by the authors and the literatures stated in this section have beencovered well related to the objective and scope of this study.n class="Chemical">ConclusionInclude solid findings with quantifiable results. Add the scope and future directions in brief.Genern class="Chemical">al n class="Chemical">comments to authors
I encourage and recommend the authors to also provide site images, to increase the curiosity in readers. I would like to recommend minor revision of this study and accept this manuscript in its present form.**********6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). n class="Gene">If published, this will include your full peer review and any attached files.
n class="Gene">If you choose “no”, your identity will remain anonymous but your review may still be made public.
Do you want your identity to be public for this peer review? For information about this choice, including n class="Chemical">consent withdrawn class="Chemical">al, please see our Privacy Policy.
Reviewer #1: NoReviewer #2: NoReviewer #3: No[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.21 Sep 2020Dear Bn class="Chemical">alasubramani Ravindran,
Academic EditorPLOS ONEDear Academic Editor,Subject: Manusn class="Chemical">cript Revision SubmissionMs. Ref. No.: PONE-D-20-15361Title: Comprehensive Nutrient Analysis in Agricultural Organic Amendments Through Non-Destructive Assays Using Machine LearningJournn class="Chemical">al: PLOS On class="Chemical">NE
We hereby submit a revised version of our manuscript titled: " Comprehensive nutrient analysis in agricultural organic amendments through non-destructive assays using machine learning" by Towett K. Erick, Drake B. Lee, Acquah E. Gifty, Haefele M. Stephan McGrath P. Steve, and Shepherd D. Keith, to be considered for publication in PLOS ONE.Thank you very much for the opportunity to revise our manuscript to address the reviewers’ comments. We have carefully undertaken the revision of the manuscript in light of the peer reviewers 2 and 3 comments geared towards making the MS more informative. WE have also addressed the editors comments. Copies of the manuscript, Tables and Figures with tracks the changes are submitted separately as requested. The responses to the specific comments by the reviewers are given in a separate submitted file named “response to reviewers”.The responses to the specn class="Gene">ific n class="Chemical">comments by the reviewers are given as follows.
Response to Reviewers' n class="Chemical">Comments
Reviewer's Responses to Questionsn class="Chemical">Comments to the Author
1. Is the manuscript technically sound, and do the data support the conclusions?The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes2. Has the statistical analysis been performed appropriately and rigorously?Reviewer #1: NoReviewer #2: YesReviewer #3: Yes3. Have the authors made n class="Chemical">all data underlying the findings in their manusn class="Chemical">cript fully available?
The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes4. Is the manusn class="Chemical">cript presented in an intelligible fashion and written in standard English?
PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes5. Review n class="Chemical">Comments to the Author
Please use the space provided to explain your answers to the questions above. You may also include additionalcomments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)Reviewer #1:THE STUDY VALIDATE THE APPLICATION OF XPS FOR THE REAL-TIME SOIL NUTRIENT ANALYSIS.IT IS WELL ACn class="Chemical">COMPLISHED WITH n class="Chemical">NECESSARY METHODS.
the n class="Chemical">conclusion is supported with the necessary results.
authors approach to use mechine learning for the data ann class="Chemical">alysis is to be appreciated.
Response to Comment: We very much appreciate the reviewer’s recommendation and constructive suggestions on how this MS may be revised to be more informative.Reviewer #2: Genern class="Chemical">al n class="Chemical">commends
The author did good piece of work on machine learning helps to find percentage of carbon and nitrogen presence in the soil. XRF is capable of estimating properties like carbon and nitrogen. the work on artificial neural networks was well explained.The author addresses following question1. Inconclusion part authors discussing reason it may be avoid and rewrite.Response to comment no. 1: We agree that the conclusions of the manuscript need not to have reasons and we have reworded it from the initial wording “These results are significant for the following reasons:” and we have now rewritten it to read “ These results are significant in that:…..”.2. From your study, How MIR + machine learning useful to derive finding of n class="Chemical">Mg and Mnn class="Chemical">concentrations?
Response to comment no. 2: When we revised the manuscript, we considered the concern raised and have edited the discussion section to include a statement that “Forest regressions provided comparable results for MIR and XRF for Mg (Table 3), while Mn had weaker cross-validation results. That said, the general pattern observed for MIR was that it generally performed well on elements that were not transition metals; with the exception being Fe, likely due to its higher abundance” in lines 342-345.3. Mentation accuracy difference (Percentage) Tracer 5i pXRF 900F4166 and 900F4473 instrument.Response to comment no. 3. The difference between 900F4166 and 900F4473 isn’t accuracy, but rather photon flux - the XRF tube in 900F4473 generated more x-rays per unit time for the energy range of light elements (1 - 4 keV); this could be due to any number of manufacturing differences, such as the thickness of the Rh anode in the tube, the thickness of the Be window used on either the tube or the detector, or small differences in geometry that improve fluorescence in a given energy range. It is precisely these small manufacturing differences that produce the need for individual instrument calibrations. This is described at the end of the second paragraph on page 9, lines 327 to 333.4. Line number 211, calibrations, 67% was chosen randomly what could be the reason for choosing 67%. If more 80 % and above does the values change?Response to comment no. 4: Random sampling is the simplest way of selecting samples as it creates a subset that follows the statistical distribution of the original dataset. The actual value used for cross-validation is ultimately about tradeoffs - what is the minimal number of samples needed to train a model to generalize to a larger data set? A 20% validation split (e.g. 80% of samples used for training) would produce a higher R2 and slope closer to 1 for both the training and test data sets. The more restrictive 33% cross-validation sample (e.g. 67% used for training data) was used given the significance of the claims - accurately quantifying N or C using XRF (and P or S using MIR) is a notable expansion of each instrument’s capability. The cross-validation split was used to stress-test these claims to demonstrate the generalizability of the models, rather than get the highest possible R2 or slope closest to 1. It would be interesting to rerun these data at a 20% split - but it would take at least two weeks and thus extended beyond the review time (as well as be a costly allocation of computer resources). We have added text to clarify the rational behind this particular cross-validation split choice in lines 231-234.Randomized cross-validation in general is an unbiased method, it is also efficient as more samples are required to achieve the representativeness of the data. This method is commonly used as it is easy to carry out, and unbiased. Shepherd & Walsh (2002) and McCarty et al. (2002) are some exemplar studies where this sampling approach has been utilized in spectral modelling studies where each dataset is first randomly split into calibration and validation set (75% and 25% respectively). In general, although an increase in calibration set size could increase the performance of the model (at thus increasing the calibration set to 80% would increase the accuracy of the validation data), it has been found that in many datasets, calibration sample size >75% does not provide much improvement to model prediction and also means that only 25% of the samples need to be fully analysed to provide a good calibration set. Following this approach, for our study we chose randomly 67% of the samples for validation and 33% for calibration as the dataset was split based on the unique reference properties and based on sample types (Table 1 & Figure 2).Shepherd KD, Walsh MG. 2002. Development of reflectance spectral libraries for characterization of soil properties. Soil Science Society of America Journal 66:988_998. DOI 10.2136/sssaj2002.9880.McCarty GW, Reeves JB, Reeves VB, Follett RF, Kimble JM. 2002. Mid-infrared and near-infrared diffuse reflectance spectroscopy for soil carbon measurement. Soil Science Society of America Journal 66:640_646 DOI 10.2136/sssaj2002.6400.Reviewer #3: n class="Chemical">Comments to the Authors
1). The authors have tried to find alternative sources for Comprehensive Nutrient Analysis in Agricultural Organic Amendments through Non-Destructive Assays Using Machine Learning. The work done by the authors is commendable and applaudable. It is considered commendable since this study provides an alternative solution for portable spectrometry in combination with machine learning a scalable solution to provide comprehensive nutrient analysis for organic amendments.1). AbstractAvoid abbreviations in the abstractResponse to comment no. 1: Many thanks for this comment. The abbreviations have been corrected for the elements Carbon and Nitrogen.2). IntroductionThe authors have arranged the literatures and context relating the significance of choosing the objective and scope of the study. The following comments in this section areLine 26 - Small-scale and family farmers – Define.Response to comment no. 2: Thanks for the comments. This reference to the family farms has been deleted. Although it can be argued that limited areas under smallholder farming systems are closer to the homestead, family farms are those small plots nearing the farmhouse where most recycling of organic waste occurs and a diverse range of crops is produced.3). Line 27 – Authors are to include some literature related to food supply in developing n class="Chemical">countries. Readers will be interested to understand about what is food supply in developing n class="Chemical">countries through some literatures in introduction section.
Response to comment no. 3: Thank you for this comment. This would be too broad for the present scope of the manuscript and out of context with the study. However, a literature on the above mentioned food supply dynamics in Sub-Saharan Africa countries has been included in lines 32-43 with particular emphasis on the need for establishing effective quality assurance mechanisms.4). Line 109 – The authors are recommended to add certain background studies related to macro and micronutrients. Further, additional details for C and N, as well as ash in this study are also encouraged. This information will act as the state of the art for the readers and also compare with the current research findings of the authors.Response to comment no. 4: The reviewer refers here to the last line of objectives, and it would be rather unconventional to add references here. However, the requested descriptions and references are available in the text line 99 to 104 and 108 to 130.5). Materin class="Chemical">als and Methods
The authors have presented the methodology in a standard and technicn class="Chemical">al aspects. However there are certain facts that are to be improved and included to inn class="Chemical">crease the reader’s interest.
Response to comment no. 5: This is a very generalcomment which is difficult to address. We do believe that we introduced the method from a technical point of view but also in relation to its possible uses, so we thought this should create interest for the paper.6). Line 176 – Though a detailed explanation on experimental model has been provided. Authors are encouraged to provide real time pictures of the experimental study carried out to increase the reader’s interest (If available). In addition, an image of the field used in this study can be included in the manuscript.Response to comment no. 6: We could include images of the equipment in action, taken in the laboratory or to simulate field as we did not take measurements in the field for this study. We would ask the editor to decide if such images are something the journal would like to include - we have included S1 – S3 Photos after the S1 and S2 Figures in the supporting information for consideration.7). ResultsThis section has been explained in appropriate manner.Response to n class="Chemical">comment no. 7: We very much appreciate the reviewer n class="Chemical">comment.
8). Disn class="Chemical">cussionThis section has been discussed in detail by the authors and the literatures stated in this section have beencovered well related to the objective and scope of this study.Response to n class="Chemical">comment no. 8: Thanks for this n class="Chemical">comment.
9). n class="Chemical">ConclusionInclude solid findings with quantifiable results. Add the scope and future directions in brief.Response to comment no. 9: We also very much appreciate the reviewer’s constructive suggestions and recommendations to add the scope and future directions to the conclusion. This has been done at the end of the conclusion, lines 412-414 and 429-431.10). Genern class="Chemical">al n class="Chemical">comments to authors
I encourage and recommend the authors to also provide site images, to increase the curiosity in readers. I would like to recommend minor revision of this study and accept this manuscript in its present form.Response to comment no. 10: We also very much appreciate the reviewer’s constructive suggestions and recommendations on including images (S1 – S3 Photos) in the supporting information to manuscript to be more informative and we have undertaken to include the same as supplementary to the MS and let the editor decide on whether these can be included, as outlined in the response to reviewer question 6 above.Submitted filename: Response to Reviewers.docxClick here for additionn class="Chemical">al data file.
10 Nov 2020Comprehensive nutrient analysis in agricultural organic amendments through non-destructive assays using machine learningPONE-D-20-15361R1Dear Dr. Erick K Towett ,We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formn class="Chemical">al acceptance letter and your manusn class="Chemical">cript will be scheduled for publication.
An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.Kind regards,Bn class="Chemical">alasubramani Ravindran, Ph.D
Academic EditorPLOS ONEReviewers' n class="Chemical">comments:
Reviewer's Responses to Questionsn class="Chemical">Comments to the Author
1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.Reviewer #1: n class="Chemical">All n class="Chemical">comments have been addressed
Reviewer #2: n class="Chemical">All n class="Chemical">comments have been addressed
Reviewer #3: n class="Chemical">All n class="Chemical">comments have been addressed
**********2. Is the manuscript technically sound, and do the data support the conclusions?The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.Reviewer #1: YesReviewer #2: PartlyReviewer #3: Yes**********3. Has the statistical analysis been performed appropriately and rigorously?Reviewer #1: (n class="Chemical">No Response)
Reviewer #2: YesReviewer #3: Yes**********4. Have the authors made n class="Chemical">all data underlying the findings in their manusn class="Chemical">cript fully available?
The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********5. Is the manusn class="Chemical">cript presented in an intelligible fashion and written in standard English?
PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.Reviewer #1: YesReviewer #2: YesReviewer #3: Yes**********6. Review n class="Chemical">Comments to the Author
Please use the space provided to explain your answers to the questions above. You may also include additionalcomments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)Reviewer #1: (n class="Chemical">No Response)
Reviewer #2: The Authors answers n class="Chemical">all the question hence we can proceed for publication.
Reviewer #3: (n class="Chemical">No Response)
**********7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). n class="Gene">If published, this will include your full peer review and any attached files.
n class="Gene">If you choose “no”, your identity will remain anonymous but your review may still be made public.
Do you want your identity to be public for this peer review? For information about this choice, including n class="Chemical">consent withdrawn class="Chemical">al, please see our Privacy Policy.
Reviewer #1: NoReviewer #2: Yes: Dr G. RAMn class="Chemical">ALIn class="Chemical">NGAM
Reviewer #3: No1 Dec 2020PONE-D-20-15361R1Comprehensive nutrient analysis in agricultural organic amendments through non-destructive assays using machine learningDear Dr. Towett:I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.n class="Gene">If we can help with anything else, please email us at plosone@plos.org.
Thank you for submitting your work to PLOS On class="Chemical">NE and supporting open access.
Kind regards,PLOS On class="Chemical">NE Editorin class="Chemical">al Office Staff
on behn class="Chemical">alf of
Dr. Bn class="Chemical">alasubramani RavindranAcademic EditorPLOS ONE
Authors: Rafael López-Núñez; Fátima Ajmal-Poley; José A González-Pérez; Miguel Angel Bello-López; Pilar Burgos-Doménech Journal: Int J Environ Res Public Health Date: 2019-11-06 Impact factor: 3.390
Authors: Gifty E Acquah; Javier Hernandez-Allica; Cathy L Thomas; Sarah J Dunham; Erick K Towett; Lee B Drake; Keith D Shepherd; Steve P McGrath; Stephan M Haefele Journal: PLoS One Date: 2022-01-11 Impact factor: 3.240