Comprehensive nutrient analysis in agricultural organic amendments through non-destructive assays using machine learning.

Erick K Towett¹, Lee B Drake², Gifty E Acquah³, Stephan M Haefele³, Steve P McGrath³, Keith D Shepherd¹.

Abstract

Portable X-ray fluorescence (pXRF) and Diffuse Reflectance Fourier Transformed Mid-Infrared (DRIFT-MIR) spectroscopy are rapid and cost-effective analytical tools for material characterization. Here, we provide an assessment of these methods for the analysis of total Carbon, Nitrogen and total elemental composition of multiple elements in organic amendments. We developed machine learning methods to rapidly quantify the concentrations of macro- and micronutrient elements present in the samples and propose a novel system for the quality assessment of organic amendments. Two types of machine learning methods, forest regression and extreme gradient boosting, were used with data from both pXRF and DRIFT-MIR spectroscopy. Cross-validation trials were run to evaluate generalizability of models produced on each instrument. Both methods demonstrated similar broad capabilities in estimating nutrients using machine learning, with pXRF being suitable for nutrients and contaminants. The results make portable spectrometry in combination with machine learning a scalable solution to provide comprehensive nutrient analysis for organic amendments.

Substances:
Fertilizers
Soil

Year: 2020 PMID: 33301449 PMCID: PMC7728284 DOI: 10.1371/journal.pone.0242821

Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240

Introduction

Small-scale farmers produce 80% of the food supply in developing countries, and investments to improve their productivity are urgently needed [1]. Achieving higher rates of productivity will need to rely on improved technologies such as high-yielding crop varieties, better and more inorganic fertilizer, and more efficient use of available resources, for example manures and other organic fertilizers. Effective quality assurance mechanisms can help address three challenges facing scaling-up efforts in supply chains for foods in developing countries of Sub-Saharan Africa (SSA): sourcing, market size and consumer trust [2]. Clark and Hobbs [2] performed supply chain analysis to evaluate how stakeholder actions and relationships influence the dynamics of complementary food markets in SSA and argued that effective signalling of credence attributes via credible quality assurance can contribute to the sustainability of local complementary food supply chains and, once established, may contribute to the long-term affordability, accessibility and availability of these foods in SSA. Establishing regional and/or country-level quality assurance mechanisms for agro-inputs (e.g. organic fertilizers) requires the coordination of stakeholder actions to address food insecurity. This is particularly important for subsistence farmers, for whom organic farming is often the only available option for at least part of their farm. Inorganic fertilizers are expensive for poor farmers [3], and their use is often restricted by cash flow problems. When small farmers do use inorganic fertilizer, it is often applied in pockets or provided to specific plants [4], or to cash crops such as cotton [5]. Hence, organic amendments (OA) such as cattle manure and mixed farmyard manure are necessary to replenish nutrients in the majority of smallholder plots. 
Consequently, organic amendments have been found to be an essential component of strategies for integrated soil fertility management to maintain soil nutrients in both mixed crop and livestock agriculture, which are common across smallholder farmers [6-9]. Organic amendments comprise a variety of plant-derived materials that range from dried plant materials, to animal manures and litters, and agricultural by-products and sewage sludges. The nutrient content of such organic amendments varies greatly among source materials but is usually substantially lower in organic fertilizers as compared with chemical fertilizers. However, organic amendments contain macro- and micro-nutrients, and may provide added value compared with standard mineral fertilisers [10]. Organic amendments also improve soil structure, increase water holding capacity and promote biological activity [11], but what is less clear from published evidence is the relationship between an action to improve soil structure (for example addition of OA to soil) and the magnitude of change in the associated benefit (for example increase in soil carbon). Despite these potential benefits, OA may be unbalanced in terms of relative availability of nutrients [10]. To achieve an efficient combination of organic and inorganic fertilizers as well as the adjustment to crops and soils, knowledge of the composition of the OA is essential. Traditional laboratory methods for analyzing major and trace elements in OA use a combination of homogenization, drying and preparation of individual samples followed by acid digestion and determination of elements by atomic absorption spectrophotometry (AAS), inductively coupled plasma optical emission spectrometry or mass spectrometry (ICP-OES or ICP-MS). Total carbon and nitrogen need a separate analysis, today usually done by routine dynamic combustion methods on an elemental analyzer. 
While these methods are well established, the associated sample preparation procedures are time-consuming and expensive. As such, they are not scalable solutions to assist subsistence farmers in developing countries. New cheap and fast analytical methods are therefore needed to enable smart nutrient resource management. The commercial availability of portable Energy Dispersive X-ray Fluorescence (EDXRF) has enabled wider use of non-destructive total element analysis in multiple material types [12]. The range of elements which can be determined is limited by the sensitivity of the detector and its energy range; for portable EDXRF the range is from sodium (atomic number 11) to uranium (atomic number 92). The data obtained are counts of photons emitted at different energy levels and need to be calibrated to provide quantitative values [13]. This is often done using fundamental physical parameters, which employ any number of equations to estimate elemental concentrations in analytes [14]. This approach functions well for metals [15] but runs into difficulty when applied to oxides, where different oxides and carbonates can frustrate quantification [16]. However, when appropriately calibrated, portable XRF (pXRF) systems give results at the μg/g level [17]. In some cases, portable equipment is comparable to laboratory systems [18] when calibrated empirically. The quality standard for laboratory and portable equipment is based on both validity and reliability (Fig 1) [19].

Fig 1

Core concepts in portable instrumentations, adopting the framework by Hughes (1998) [18].

Testing XRF to quantify a wide range of elemental contents in a range of organic fertilizers and their effect on crop produce could accelerate the assessment of the suitability of new fertilizer products and a variety of OA. Recently, Sapkota et al. [20] indicated that elemental concentration can accurately be measured in dried and moist manure samples using pXRF. Likewise, Roa-Espinosa et al. [21] found that XRF spectrometry can be used as a rapid and precise method for quantitative elemental analysis of macro- and micronutrients in dairy manure. Thomas et al. [22] also demonstrated that pXRF is a reliable, cost-effective tool for screening potential organic fertilizers and their effect on grains and crop residues. Diffuse Reflectance Spectroscopy (DRS) is another method emerging as a rapid and cost-effective alternative to routine laboratory analysis for multiple sample matrices such as soils, plants and manures based on the interaction of electromagnetic energy with matter. It is already well established that mid-infrared (MIR) spectroscopy (range: 4000–400 cm⁻¹) is useful for estimating a number of chemical and physical soil properties (such as pH, cation exchange capacity (CEC), carbonate content, organic carbon content, soil texture, mineral composition, organic matter and water content (hydration, hygroscopic and free pore water, etc.)) from a single soil spectrum with minimal sample preparation [23]. Examples of applications of infrared spectroscopy in agricultural inputs (manures/compost/bio-wastes/litter/organic resource quality) include the analyses of the following properties: Moisture, pH, total N, NH4-N, total dissolved N, suspended N, soluble reactive, K, Ca, Mg, total dissolved P, suspended C, P, Ca, Na, and Mg; various salts and metals; lignin, total soluble polyphenols, decomposition rate, in vitro dry matter digestibility, C and N mineralisation potential, compost maturity [23]. 
Spectral absorbances from the MIR range can be calibrated using regression models to predict multiple constituents in seconds in a wide range of materials, and the accuracy of these regression models relies heavily on the calibration dataset used. Although MIR spectroscopy has so far had limited use in developing countries, it has potential to make a huge contribution in helping these countries accelerate agricultural development while safeguarding their environment in the drive towards achieving the sustainable development goals [23]. A major challenge for operationalising MIR and pXRF is how to provide robust calibrations that hold up over a wide range of materials and across instruments. Solutions are needed to lay the foundations for a comprehensive nutrient measurement system that would require minimum sample preparation and meet the needs of validity and replicability [19]. This paper aimed to test machine learning methods for calibrating pXRF and MIR, both independently and combined, for analysis of macronutrient content and potential contaminants of a wide range of organic amendments. Specific sub-objectives were to examine the inter-instrument variability of six pXRF instruments and to conduct an assessment of Diffuse Reflectance Fourier Transformed Mid-Infrared (DRIFT-MIR) spectroscopy for OA analysis by utilizing the MIR spectra of OA samples. Targeted characteristics were total C and N, as well as ash and other macro and micronutrients, in parallel with the pXRF analysis of the same parameters.

Materials and methods

Ninety-eight organic amendment samples were obtained from a wide range of sources in Western Kenya (64) and the United Kingdom (34). Samples from Kenya included manure from cattle, goats, poultry and pigs whereas samples from the UK included cattle manure, mixed farmyard manure, sewage sludge and sewage sludge compost. Conventional analyses for major and micronutrients, as well as total carbon and ash content, were conducted at Rothamsted Research (UK). Total C and total N were determined using a modified Dumas method on a Leco TruMac combustion analyser. Major and trace elements were determined using ICP-OES or ICP-MS (inductively coupled plasma optical emission spectrometry/mass spectrometry) analysis after nitric/perchloric acid (85/15 v/v) or Aqua Regia (HCl:HNO3 4:1) digestion of test samples in open tubes at up to 175 °C [24, 25]. The ash content was determined using the loss-on-ignition method. Test samples that had been oven dried (105 °C for at least 5 hours) were ashed in a furnace at 550 °C for two hours and weighed to calculate the relative ash content (%). Six Bruker Tracer 5i XRF instruments (900F4352, 900F4473, 900F4504, 900F4166, 900F5118, 900F5163) with Rhodium tubes were used to collect data. The units had resolutions (full width at half maximum, or FWHM) of 135 eV at the Manganese K-alpha line. Samples were analyzed with a voltage of 10 kV and a current of 70 μA for 90 seconds with no filter for the elemental range of Na, Mg, P, S, K, and Ca. In addition, the samples were analyzed with 35 kV and a current of 35 μA for 90 seconds with a filter (Cu 75 μm: Ti 25 μm: Al 200 μm) for the elements Mn and Fe. Scans were collected from air-dried and milled (to pass a 75-micron mesh sieve) samples presented as loose powder in XRF cups lined with Prolene film. 
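The loss-on-ignition step described above reduces to a mass ratio between the ashed residue and the oven-dried sample. The following is a minimal Python sketch of that arithmetic, assuming the sample is weighed in a tared crucible; the function and argument names are illustrative and do not come from the study.

```python
def ash_content_percent(crucible_g, dry_total_g, ashed_total_g):
    """Relative ash content (%) of an oven-dried sample.

    crucible_g    -- mass of the empty crucible (assumed weighing scheme)
    dry_total_g   -- crucible + sample after oven drying (105 °C)
    ashed_total_g -- crucible + residue after ashing (550 °C)
    """
    dry_mass = dry_total_g - crucible_g
    ash_mass = ashed_total_g - crucible_g
    if dry_mass <= 0:
        raise ValueError("dry sample mass must be positive")
    if not 0 <= ash_mass <= dry_mass:
        raise ValueError("ash mass must lie between 0 and the dry mass")
    return 100.0 * ash_mass / dry_mass
```

For example, a 2 g dry sample that leaves 1 g of residue has an ash content of 50%; the loss on ignition is the complement of that value.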
The MIR spectra of the OA were also acquired on air-dried and ground (to pass a 75-micron mesh sieve) samples using a Bruker Alpha KBr DRIFT-MIR spectrometer. On each sample holder, samples were loaded in a single replication. During scanning at each spot/sample, 32 co-added scans were collected at a resolution of 4 cm⁻¹. Spectra were truncated to 4000–600 cm⁻¹ and regions showing atmospheric CO2 features (2379.8–2350.8 cm⁻¹) were removed. Raw MIR spectra (Fig 2A) were transformed using the Savitzky-Golay (SG) transformation [26] with a window size of 21 data points and a polynomial order of 3 [27], followed by the first derivative transformation (Fig 2B), before developing calibration models. All MIR spectra derivations and calculations were performed with R statistical language and open source software [28]. Principal Component Analysis (PCA) was performed on the pre-processed MIR spectra of all the OA samples.
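The pre-processing above can be illustrated with a small Savitzky-Golay filter. The study used R; the sketch below is an independent, standard-library Python version that assumes smoothing and differentiation are combined in a single SG filter (window of 21 points, cubic polynomial) and that edge points without a full window are simply dropped. All function names are hypothetical.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def sg_first_derivative_weights(window=21, order=3):
    """Weights that return the fitted polynomial's slope at the window centre."""
    half = window // 2
    z = range(-half, half + 1)
    # Normal-equation matrix of the polynomial fit: ata[j][k] = sum_i z_i^(j+k)
    ata = [[float(sum(zi ** (j + k) for zi in z)) for k in range(order + 1)]
           for j in range(order + 1)]
    # Select the linear coefficient, i.e. the first derivative at z = 0
    unit = [1.0 if j == 1 else 0.0 for j in range(order + 1)]
    w = solve_linear(ata, unit)
    return [sum(w[j] * zi ** j for j in range(order + 1)) for zi in z]


def sg_first_derivative(y, window=21, order=3, spacing=1.0):
    """Smoothed first derivative of an evenly spaced spectrum y."""
    weights = sg_first_derivative_weights(window, order)
    half = window // 2
    return [sum(wt * y[i + k - half] for k, wt in enumerate(weights)) / spacing
            for i in range(half, len(y) - half)]
```

Here `spacing` stands for the wavenumber step between adjacent data points, so the result is expressed per cm⁻¹ rather than per data point. Because the filter fits a cubic, it reproduces the derivative of any polynomial up to order 3 exactly, which gives a convenient check.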

Fig 2

Illustrations of the diffuse reflectance mid infrared (DRIFT-MIR) spectra from the organic amendment (OA) samples.

(A) raw spectra, (B) pre-processed spectra and (C) Principal Component Analysis (PCA) scores plot (PC1 vs. PC2) for mid infrared (MIR) spectra from the OA samples (n = 98) based on sample types. The PCA was performed on the pre-processed MIR spectra.

Machine learning has seen rapid advancement in the past decade owing to its implementation in open source languages such as R and Python, as well as increasingly affordable high-powered processors on computers. Forest regressions are one of the simplest implementations of these techniques in which variables are randomly selected and iterated over a set number of sampling events and a set number of trees with a set number of iterations. From these, a weighted final model is produced which accounts for variable importance in a way which is generalizable (e.g. low risk of overfitting). A more advanced technique, extreme gradient boosting (XGBoost), was also used [29-31]. Typically, variables are defined via general parameters (e.g. a range of energies in a spectrum) which correspond to known responses, such as the Kα1 or Lα1 emission line for an element in pXRF. In lieu of pre-defining pXRF lines for each element in a traditional manner [32], the entire spectrum was used for machine-learning models. While it is standard practice to use the full DRIFT-MIR spectra in calibrations, where variables are also defined via general parameters (e.g. a range of wavenumbers in a spectrum) which correspond to known responses, such as a functional group, this is not common for pXRF spectra. This removes the need to pre-define variables in pXRF analysis (e.g. Ca Kα1) and instead uses the whole spectrum without human input to evaluate models. This approach also allows an independent variable, such as ash content, to be quantified if suitable data is provided for training. Further, it allows a visualization of the spectrum in terms of its predictive properties for a given independent variable. 
Using the whole spectrum enables full automation of the calibration process for pXRF. Forest models for pXRF and DRIFT-MIR spectra were run using 1500 trees with 200 resampling events using k-fold cross validation over 25 iterations based on the R package randomForest (4.6–14). The best forest models were selected using root mean square error (RMSE) in the caret package (6.0–84) using the R language (3.6.0). XGBoost models differ from forest in that trees can be weighted differently, have fixed depths, and use different resampling. For example, trees can be built from randomly selected columns (energies for pXRF spectra) and rows (standards). XGBoost (0.82.1) models were run using 400 rounds with a variable tree depth ranging from 5 to 25. pXRF energy channels (columns) were randomly selected with a range of 40–60%, with samples (rows) randomly selected with the same frequency. Learning rates (eta) were constrained to values between 0.1 and 0.3, with gamma regularization ranging from 0 to 0.1. The minimal child weight (controls the model complexity) was limited to 1. Unique combinations of these variables were run over 32 iterations with k-fold using caret (6.0–82) and the best model was selected using root mean square error (RMSE). Calibrations were created using CloudCal (v3) [33]. The models were evaluated using randomized cross-validation and a hold-out validation consisting of a 67/33% split between calibration and validation data. This high number of standards randomly withheld from training (33%) was used to test the generalizability of models and ensure that machine learning wasn’t simply memorizing data sets. 
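To make the search space and the hold-out design concrete, the sketch below draws 32 unique XGBoost configurations from the ranges quoted above and withholds roughly one third of the sample indices for validation. It is a standard-library Python illustration, not the CloudCal or caret code used in the study: uniform random sampling of the ranges is an assumption, the keys follow XGBoost's usual parameter names, and the helper functions are hypothetical. No model is fitted here.

```python
import random

# Ranges quoted in the text; keys follow XGBoost's parameter naming.
SEARCH_SPACE = {
    "max_depth": (5, 25),            # tree depth
    "eta": (0.1, 0.3),               # learning rate
    "gamma": (0.0, 0.1),             # regularization
    "subsample": (0.4, 0.6),         # fraction of samples (rows) per tree
    "colsample_bytree": (0.4, 0.6),  # fraction of energy channels (columns)
}
FIXED = {"min_child_weight": 1, "nrounds": 400}


def sample_configurations(n_iter=32, seed=0):
    """Draw n_iter unique configurations (random sampling is an assumption)."""
    rng = random.Random(seed)
    seen, configs = set(), []
    while len(configs) < n_iter:
        cfg = {
            "max_depth": rng.randint(*SEARCH_SPACE["max_depth"]),
            **{name: round(rng.uniform(*SEARCH_SPACE[name]), 3)
               for name in ("eta", "gamma", "subsample", "colsample_bytree")},
            **FIXED,
        }
        key = tuple(sorted(cfg.items()))
        if key not in seen:  # keep only unique combinations
            seen.add(key)
            configs.append(cfg)
    return configs


def holdout_split(n_samples, validation_fraction=0.33, seed=0):
    """Return (calibration, validation) index lists from a random split."""
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)
    n_val = round(n_samples * validation_fraction)
    return sorted(indices[n_val:]), sorted(indices[:n_val])
```

Each configuration would then be scored by cross-validated RMSE on the calibration indices only, so that the withheld samples remain unseen until the final validation.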
Both the R2 value and validation slope of the regression line between observed and predicted values for all cross-validation trials were used to evaluate the best model because (a) both metrics should approach 1 as models increase in accuracy and (b) while R2 provides information regarding model precision, the validation slope gives the clearest assessment of model accuracy; a validation slope of 1 would indicate a 1:1 ratio between predicted and known values.
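The two evaluation metrics can be written down in a few lines. The Python sketch below assumes that R² is the squared Pearson correlation between observed and predicted values and that the validation slope is the least-squares slope with observed values on the x-axis; the source does not state either convention explicitly, and the function names are illustrative.

```python
def _centered_sums(x, y):
    """Return (sxx, syy, sxy) for two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    syy = sum((yi - my) ** 2 for yi in y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    return sxx, syy, sxy


def r_squared(observed, predicted):
    """Squared Pearson correlation (assumed definition of R2)."""
    sxx, syy, sxy = _centered_sums(observed, predicted)
    return sxy ** 2 / (sxx * syy)


def validation_slope(observed, predicted):
    """Least-squares slope of predicted against observed values."""
    sxx, _, sxy = _centered_sums(observed, predicted)
    return sxy / sxx
```

A model that doubles every value illustrates why both metrics are needed: its R² is 1 because the predictions are perfectly correlated with the observations, yet its slope of 2 reveals the systematic bias.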

Results

The wet chemistry analysis data of the nutrient values varied greatly among OA and showed the considerable diversity in the samples (Table 1). This was also confirmed by the PCA scores plot (PC1 vs. PC2) for DRIFT-MIR spectra from the OA samples (Fig 2C). The difference in the OA types is clearly shown; cattle manure seemed to group with goat and poultry manure as well as compost whereas pig manure grouped together with sewage sludge. Also, there was more variance in the cattle manure group than in the sewage sludge group. The means and ranges of elemental contents and major nutrients varied within the selected OA samples (Table 1). Nutrient concentrations ranged widely, for example 0.02–5.41% for N and 0.06–3.40% for P. Ash content as an indicator of the inorganic component ranged from 11.6 to 94.9%. The wide range of characteristics in the samples was also confirmed by the PCA scores plot (PC1 vs. PC2) for MIR spectra (Fig 2C).

Table 1

Descriptive statistics for total carbon (C), total N, ash content, major nutrients and potential contaminants of all organic amendment (OA) samples used for the calibration and validation of pXRF and DRIFT-MIR methods.

						Percentile
Property	Units	Mean	Std dev.	Min	Max	2.5^th	25^th	50^th	75^th	97.5^th
Ash	%	63.8	21.5	11.6	94.9	16.0	52.8	70.1	81.5	88.8
Total C	%	19.4	11.8	1.23	44.7	5.30	9.95	15.8	27.5	43.5
Total N	%	1.65	1.32	0.02	5.41	0.40	0.75	1.11	2.29	5.27
P	%	0.64	0.85	0.06	3.40	0.08	0.13	0.19	0.99	3.00
K	%	0.82	0.77	0.07	4.06	0.10	0.35	0.60	1.01	2.87
Ca	%	1.99	2.73	0.25	21.14	0.37	0.64	0.86	2.87	8.50
Al	%	1.95	1.36	0.01	8.97	0.02	0.87	1.84	2.71	4.28
Mg	%	0.39	0.34	0.13	2.79	0.14	0.24	0.31	0.43	1.29
Na	%	0.14	0.73	0.00	7.21	0.01	0.01	0.02	0.06	0.44
S	%	0.33	0.38	0.03	1.74	0.04	0.08	0.14	0.57	1.31
Mn	%	0.05	0.05	0.01	0.45	0.02	0.04	0.05	0.06	0.09
Fe	%	2.38	2.78	0.05	24.96	0.09	1.18	1.79	2.83	6.64
Zn	mg kg^-1	571	1151	41	6553	43	63	92	432	3828
Cu	mg kg^-1	278	762	10	5080	11	17	23	143	1413
Ni	mg kg^-1	70	142	3.0	1015	3.4	21	29	46	474
Cd	mg kg^-1	8.2	25	0.1	137	0.2	0.9	1.5	2.2	108
Pb	mg kg^-1	129	310	0.2	1398	0.5	5.3	8.3	35	1263

To investigate the effect of the inter-instrumental variability on the pXRF regression models and on the prediction accuracy, calibration curves were generated for each instrument and different chemical properties. Photon data from each instrument varied, with instrument 900F4166 producing the lowest counts and instrument 900F4473 producing the highest (Fig 3A). For example, instrument 900F4473 had >2,500 counts per second for phosphorus K-alpha at a concentration of about 3.2% P, while instruments 900F4166 and 900F5118 had the same 2,500 counts for a much higher P concentration of >6.5%. These differences between instruments can be corrected with instrument-specific calibrations as shown in Fig 3B, where the predictions for phosphorus concentration from the different instruments overlay each other and are all close to the values measured by ICP-OES.
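The idea of instrument-specific calibration can be shown with a deliberately simplified example. The Python sketch below fits a separate straight line from counts per second to reference concentration for each instrument; the study itself used forest and XGBoost models on whole spectra, so the linear fit, the instrument labels and the synthetic readings are all assumptions made for illustration only.

```python
def fit_line(x, y):
    """Ordinary least-squares fit; returns (intercept, slope)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def calibrate_instruments(readings):
    """readings: {instrument: [(counts_per_second, reference_percent), ...]}"""
    return {
        inst: fit_line([c for c, _ in pairs], [r for _, r in pairs])
        for inst, pairs in readings.items()
    }


def predict(calibrations, instrument, counts):
    """Convert counts to concentration with that instrument's own calibration."""
    intercept, slope = calibrations[instrument]
    return intercept + slope * counts


# Synthetic data: instrument_a yields twice the counts of instrument_b
# for the same reference phosphorus concentration (% P).
reference = [0.5, 1.0, 2.0, 3.0]
readings = {
    "instrument_a": [(800.0 * p, p) for p in reference],
    "instrument_b": [(400.0 * p, p) for p in reference],
}
calibrations = calibrate_instruments(readings)
```

With these made-up readings, 1600 counts on `instrument_a` and 800 counts on `instrument_b` both map to about 2% P, which mirrors how separately calibrated instruments converge on the same estimate despite different raw count rates.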

Fig 3

Inter-instrumental variability of pXRF instruments.

Fig 3A shows the relation between phosphorus-specific counts per second for six different instruments. XRF4166 line is hidden behind the blue XRF 5118 line in the left figure. Fig 3B shows the estimated phosphorus content of the samples after instrument specific calibration. The dotted line on the right plot indicates the expected 1:1 ratio for XRF estimates and known values and the shaded areas around all lines are 95% confidence predictions based on the calibration.

Next, spectroscopy (pXRF and DRIFT-MIR) in conjunction with both chemometric techniques was tested to predict various chemical properties of OA. The predictions of the calibration models compared to the measured values are shown in Figs 4 and 5 (examples are ash and N content, respectively). For both examples, the figures show generally good predictions with little error (R² values above 0.98) and a close fit of actual and predicted concentrations (the regression line for all data points is close to the 1:1 line). For the calibrations, 67% of all OA samples were used (randomly selected). A summary of the regression coefficients for all calibrations conducted for both methods (pXRF and DRIFT-MIR), both models (Forest or XGBoost) and all OA characteristics determined is provided in Table 2. It shows that very good regressions were achieved for most characteristics, but it also indicates differences in the predictive value between the methods and models used. Across both methods and models, good to very good regressions were achieved for ash, total C, total N, P, K, Ca, S and Fe (R² > 0.9). Less good, but still acceptable regressions were found for the elements Mg, Na and Mn (0.7 < R² < 0.9). Against conventional knowledge, XRF performed well in predicting total C, total N and ash in OA, which cannot be predicted with XRF based on known element-specific Kα1 or Lα1 emission lines. 
Differences in predictive power of the calibration functions between both methods (pXRF and DRIFT-MIR) and for the characteristics tested were small in most cases. Only in the case of Na at higher concentrations did both methods underestimate the actual concentration (S2 Fig in S1 File).

Fig 4

Comparison of actual and predicted ash content of organic amendment (OA) samples based on whole-spectrum forest regressions for pXRF (instrument 900F4473) and DRIFT-MIR.

The dotted line indicates the expected 1:1 ratio for estimates and known values. The bottom graphs show the respective response variables in the spectra.

Fig 5

Comparison of actual and predicted total nitrogen (N) content of organic amendment (OA) samples based on whole-spectrum forest regressions for pXRF (instrument 900F4473) and DRIFT-MIR.

The dotted line indicates the expected 1:1 ratio for estimates and known values. The bottom graphs show the respective response variables in the spectra.

Table 2

Correlation (R²) values for pXRF and MIR calibrations for total C, total N, ash content, and major nutrients.

	pXRF Forest R²	pXRF XGBoost R²	MIR Forest R²	MIR XGBoost R²
Ash (%)	0.94	0.94	0.93	0.94
Total C (%)	0.97	0.92	0.95	0.95
Total N (%)	0.92	0.93	0.92	0.94
P (%)	0.94	0.89	0.92	0.87
K (%)	0.90	0.93	0.84	0.72
Ca (%)	0.98	0.95	0.83	0.81
Mg (%)	0.77	0.71	0.73	0.66
Na (%)	0.91	0.87	0.86	0.81
S (%)	0.98	0.91	0.93	0.83
Mn (%)	0.87	0.73	0.63	NA
Fe (%)	0.95	0.94	0.67	0.77
Zn (ppm)	0.95	0.94	0.00	0
Cu (ppm)	0.92	0.90	0.00	0
Ni (ppm)	0.92	0.97	0.00	0
Cd (ppm)	0.99	0.97	0.24	0
Pb (ppm)	0.95	0.98	0.00	0

For the calibrations, 67% of all organic amendment (OA) samples and Forest or XGBoost models were used.

Next, we used the calibration functions established with two thirds of the total sample number and tested their predictive power with the remaining one third of the samples (validation). Validation results are shown in Fig 6 for the six different pXRF instruments and four selected characteristics (ash, total C, total N, P). The regression lines indicate relatively good predictions for these “unknown samples” with a decreasing precision in the order of P > C > N > ash. In addition, the regression lines for the pXRF predictions were in all four cases close to the expected 1:1 ratio between observed and predicted values. The same validation was conducted for DRIFT-MIR (Fig 7) and precision of predictions decreased in the order of N < ash < C < P. Again, the regression line for the predicted versus observed values was close to the expected 1:1 ratio.

Fig 6

Randomized cross-validation of 6 pXRF instruments for ash, total carbon, total nitrogen and total phosphorus.

For each characteristic, 33% of standards were withheld and treated as unknowns. The dotted line indicates the expected 1:1 ratio for pXRF estimates and known values and the shaded areas around all lines are 95% confidence predictions based on the calibration.

Fig 7

Randomized cross-validation of DRIFT-MIR predictions for ash, total carbon, total nitrogen and total phosphorus.

For each element, 33% of standards were withheld and treated as unknowns. The dotted line indicates the expected 1:1 ratio for DRIFT-MIR estimates and known values. The shaded areas around all lines are 95% confidence predictions based on the calibration.

A summary of all regression coefficients and slopes of the regression line between observed and predicted values for all cross-validation trials with pXRF and MIR and all sample characteristics is shown in Table 3. The results indicate that the pXRF method generally allows better predictions than the DRIFT-MIR method for most OA characteristics; the exceptions are ash and carbon. Which model (Forest or XGBoost) gives the best result, based on a mixed indicator of regression coefficient and slope of the regression line between observed and predicted values, varies between the OA properties.

Table 3

Correlation coefficients (R²) and the slope of the regression line between observed and predicted values for all cross-validation trials with pXRF and MIR and all sample characteristics measured in the hold-out validation.

	pXRF Forest R²	pXRF Forest Slope	pXRF XGB R²	pXRF XGB Slope	MIR Forest R²	MIR Forest Slope	MIR XGB R²	MIR XGB Slope	Best Method	Best Model
Ash (%)	0.83	1.01	0.83	1.01	0.89	1.10	0.86	1.03	MIR	XGBoost
Total C (%)	0.88	1.03	0.87	1.01	0.90	1.11	0.89	1.01	MIR	XGBoost
Total N (%)	0.86	1.00	0.86	1.01	0.87	1.15	0.83	1.02	pXRF	Forest
P (%)	0.66	0.96	0.64	1.06	0.77	1.18	0.60	0.89	pXRF	Forest
K (%)	0.78	0.83	0.79	0.79	0.47	1.06	0.30	0.66	pXRF	Forest
Ca (%)	0.66	0.98	0.63	0.81	0.69	1.52	0.46	0.68	pXRF	Forest
Mg (%)	0.42	0.78	0.43	0.76	0.27	1.24	0.38	0.52	pXRF	XGBoost
Na (%)	0.65	0.97	0.63	0.89	0.65	1.18	0.43	0.62	pXRF	Forest
S (%)	0.70	1.16	0.68	0.99	0.88	1.11	0.73	0.90	pXRF	XGBoost
Mn (%)	0.25	0.77	0.25	0.71	0.08	0.77	NA	NA	pXRF	Forest
Fe (%)	0.89	1.05	0.88	1.05	0.28	1.07	0.19	0.68	pXRF	Forest
Zn (ppm)	0.45	0.83	0.43	0.76	NA	NA	NA	NA	pXRF	Forest
Cu (ppm)	0.83	1.12	0.62	0.81	NA	NA	NA	NA	pXRF	Forest
Ni (ppm)	0.61	1.02	0.59	0.90	NA	NA	NA	NA	pXRF	Forest
Cd (ppm)	NA	NA	NA	NA	NA	NA	NA	NA	NA	NA
Pb (ppm)	0.84	1.01	0.81	0.99	NA	NA	NA	NA	pXRF	Forest

The cross-validation trials used 33% of all organic amendment (OA) samples and Forest or XGBoost models.

Discussion

The wide range of materials and element values for calibration and validation was helpful to i) test the potential of the methods and ii) provide robust calibrations. The principal component analysis (Fig 2C) indicated a structural difference between solid OA (manure from cattle, goat, poultry but also compost) and more liquid OA like pig manure and sewage sludge. A wider variety in components of solid OA might be responsible for a larger variance in the cattle manure group than in the sewage sludge group. We also found that spectra of OA (S1 Fig in S1 File) have some resemblance to soil spectra because there is often some soil mixed in with the manure and some soil features were therefore evident (e.g., O-H stretching in clays at 3694 cm⁻¹). This similarity occurs even though most soil organic matter derives from the decomposition of plant material added to the soil whereas most organic matter in manure comes from partially digested plant material eaten by the animals. This finding was also in agreement with that of a previous study [34], which found that most of the manure resources derived from cattle, sheep, goat and chicken in selected households across four districts in a semi-arid environment of the North West Province in South Africa had relatively high soil content (mean 22.7%). Using this diverse set of samples, we explored the possibility of a comprehensive analysis of OA, employing machine learning methods to evaluate the MIR and pXRF spectra, namely random forest regressions and extreme gradient boosting. The emergence of these MIR and pXRF systems presents new opportunities for rapid, low-cost analysis of OA samples, both as lab systems and portable systems. We hypothesized that pXRF instruments could provide OA data of sufficient accuracy and would reduce the overall time and budget compared with the use of conventional techniques. 
However, their sensitivity and accuracy depend on the instrument's settings, make and model [35]. We found that the photon data from the various Tracer 5i pXRF instruments varied considerably, with 900F4166 producing much lower counts than the 900F4473 instrument (Fig 3A). This variation results from small changes in the anode thickness of the X-ray tube and imperfections in tube-sample-detector geometry. Therefore, each individual pXRF instrument needs to be calibrated separately. Our assessment of pXRF and DRIFT-MIR spectroscopy for the analysis of total C and total N, as well as the total elemental composition of multiple elements in OA samples, confirmed the potential of these tools. Using forest regression and extreme gradient boosting machine learning models, excellent calibration functions could be established (Table 2; Figs 4 and 5) to rapidly quantify the concentrations of macro- and micronutrient elements present in the OA samples. MIR and pXRF were in generally good agreement in calibrations for light elements (carbon, nitrogen) and holistic measures of OA quality (ash content), whereas pXRF tended to perform better for most heavier elements, though performance was nearly equal for P and S (Table 2). Forest regressions provided comparable results for MIR and XRF for Mg (Table 3), while Mn had weaker cross-validation results. The general pattern observed for MIR was that it performed well on elements that were not transition metals, the exception being Fe, likely due to its higher abundance. These results are novel for two reasons. First, XRF is unable to measure elements such as carbon or nitrogen directly, and yet this study obtained excellent results (R2 > 0.86) with XRF for both elements in both the cross-validation and hold-out sample sets (Tables 2 and 3). 
Second, MIR typically does not measure elements with an atomic number > 11, such as P and K, very well, and yet we obtained reasonable results (R2 > 0.72) for P in both the calibration and validation sample sets and acceptable results (R2 > 0.30) for K (Tables 2 and 3). We hypothesized that, because the matrix of the OA tested here is a mixture of organic components and silicates such as clays (S1 Fig in S1 File), there is a necessary inverse correlation between ash and carbon content. For pXRF analysis, there is also a change in density, as ash may have three times the density of organic material. Scattering of X-rays in this material will lead to different count rates reaching the detector from non-diagnostic portions of the unfiltered spectrum (7–10 keV) (Fig 5). This, coupled with the K-alpha line for silicon, gives strong predictive power for ash and N content because of the density differences between them. As such, the success in identifying elements such as N (Figs 5 and 6) with portable XRF is understandable in this specific context, but is likely not generalizable to plants and soils (the latter can also contain carbonates). However, the principle of narrowly defining a sample matrix in order to use machine learning techniques will likely produce future advances in the calibration of both X-ray and infrared data, as demonstrated here for OA. Randomized cross-validation trials (using 33% of all organic amendment samples and Forest or XGBoost models) confirmed the good predictive value of the XRF and MIR calibrations for most elements/characteristics in the hold-out validation set (Table 3; Figs 6 and 7). However, the cross-validations also show that calibration model performance should be evaluated critically. For example, MIR calibration with XGBoost for Ca provides an R2 of 0.81 (Table 2), while the average of 100 cross-validation trials of the hold-out samples provides an R2 of only 0.46 (Table 3). 
This suggests one of two possibilities: either XGBoost is overfitting the data, or there is a critical threshold for the number of standards needed to provide an estimate of Ca, which is met by the full dataset (n = 98) but not by the partial one (n = 65). Either way, the results show that randomized cross-validation and hold-out validations are essential to interpret model accuracy and reliability. Our results indicate that MIR + machine learning is not yet a proven method to infer Mg and Mn concentrations in OA, contradicting the study of López-Núñez et al. [36], who claimed good calibrations for pXRF and a wide range of nutrients and contaminants in very similar OA (but whose study did not include validations). MIR did not produce usable models for trace elements (Ni, Cu, Zn) or contaminants (Cd, Pb), while both machine learning model types produced usable models from XRF data; this is unsurprising, as these elements all have fluorescence peaks identifiable down to low levels with XRF. Based on theoretical considerations, MIR should be best for light elements like total C and total N, while XRF should be best for evaluating elements like S and P. However, our results indicate that MIR came close to the performance of XRF on elements like P and S, while XRF came close to MIR in estimating total C and N (Table 3, Figs 6 and 7). Both devices had difficulty measuring Mg but, surprisingly, did better for P (Table 3). Elements that tend to correlate with clay content (Na and K) also tend to correlate with the ash content, so the higher performance for these elements may be related to the success in identifying ash content. With few exceptions, MIR and XRF were relatively interchangeable techniques for estimating all properties analyzed in this wide range of OA.
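The gap between calibration and hold-out performance discussed above for Ca can be illustrated with a minimal sketch. It is not the authors' pipeline: the data are synthetic, a 1-nearest-neighbour regressor stands in for a flexible learner such as a forest or XGBoost model, and every name (`r_squared`, `compare_calibration_and_holdout`, the 98 simulated samples) is a hypothetical choice made only for illustration.

```python
import random
import statistics


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_obs = statistics.fmean(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


def nearest_neighbour_predict(train_x, train_y, query_x):
    """1-nearest-neighbour regression: a deliberately flexible stand-in model."""
    predictions = []
    for q in query_x:
        nearest = min(range(len(train_x)), key=lambda i: abs(train_x[i] - q))
        predictions.append(train_y[nearest])
    return predictions


def compare_calibration_and_holdout(x, y, holdout_fraction=0.33, trials=100, seed=0):
    """Return (calibration R2 on all samples, mean R2 over random hold-out trials)."""
    rng = random.Random(seed)
    n = len(x)
    n_holdout = round(n * holdout_fraction)

    # Calibration score: the model is fitted and scored on the same samples.
    calibration_r2 = r_squared(y, nearest_neighbour_predict(x, y, x))

    # Hold-out scores: refit on a random subset, score on the unseen remainder.
    holdout_scores = []
    for _ in range(trials):
        indices = list(range(n))
        rng.shuffle(indices)
        test_idx, train_idx = indices[:n_holdout], indices[n_holdout:]
        predicted = nearest_neighbour_predict(
            [x[i] for i in train_idx],
            [y[i] for i in train_idx],
            [x[i] for i in test_idx],
        )
        holdout_scores.append(r_squared([y[i] for i in test_idx], predicted))
    return calibration_r2, statistics.fmean(holdout_scores)


# Synthetic example: one noisy predictor and 98 simulated samples (assumed values).
data_rng = random.Random(42)
x = [data_rng.uniform(0.0, 10.0) for _ in range(98)]
y = [2.0 * xi + data_rng.gauss(0.0, 4.0) for xi in x]

calibration_r2, mean_holdout_r2 = compare_calibration_and_holdout(x, y)
print(f"calibration R2:   {calibration_r2:.2f}")
print(f"mean hold-out R2: {mean_holdout_r2:.2f}")
```

Because the stand-in model memorises its training samples, its calibration score is perfect while the averaged hold-out score is lower. This is the same qualitative pattern as the Ca example, and it is why a calibration R2 should be read alongside repeated hold-out results.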

Conclusions

We conclude that combining MIR and XRF spectral methods with machine learning techniques enables rapid, portable, and nondestructive measurement of a full suite of nutrients in OA on both devices independently. The approach is also scalable, as the calibration process for XRF can be at least partially automated provided each new instrument is calibrated against common standards. These results are significant in that: i) XRF is capable of estimating properties like the carbon and nitrogen content of OA; ii) MIR is capable of estimating P and S in OA as well; iii) portable non-destructive spectrometry paired with machine learning can provide a comprehensive nutrient profile with minimal sample preparation outside a traditional laboratory environment; and iv) XRF allows contaminants to be detected, e.g. the presence of trace amounts of potentially toxic metals such as Zn, Cu and Ni (there were good calibrations/validations in our study for these elements). If there is one key point to this work, it is that smallholders need good returns on their investments, especially in OA, mostly in terms of increased yields to achieve greater profit and/or food security. Therefore, portable MIR and XRF spectrometers in conjunction with machine learning are suitable solutions to support nutrient management at minimal analysis cost per sample. For spectrometry at large, machine learning techniques can extract more actionable information than has previously been recognized. To our knowledge, this is the first study to compare the performance of pXRF and FTIR as rapid analytical methods for the determination and monitoring of major and trace nutrient elements in OA. Both methods performed well for most parameters analyzed, but pXRF did slightly better for heavier elements, whereas FTIR was superior for light elements and the ash content. 
The combination with machine learning helps to reduce uncertainties in assessing OA quality and, hence, enables better decision-making, especially for comprehensive nutrient management for all types of farms. It also allows contaminated OA fertilizers to be identified and avoided, thereby protecting soils and the environment from pollution. Future work will evaluate the use of XRF for testing conventional mineral fertilizers.

8 Sep 2020

PONE-D-20-15361
Comprehensive Nutrient Analysis in Agricultural Organic Amendments Through Non-Destructive Assays Using Machine Learning
PLOS ONE

Dear Dr. Erick K Towett,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process. Please submit your revised manuscript by September 21. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file. Please include the following items when submitting your revised manuscript: A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'. A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'. An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'. 
21 Sep 2020

Dear Balasubramani Ravindran, Academic Editor, PLOS ONE

Dear Academic Editor,

Subject: Manuscript Revision Submission
Ms. Ref. No.: PONE-D-20-15361
Title: Comprehensive Nutrient Analysis in Agricultural Organic Amendments Through Non-Destructive Assays Using Machine Learning
Journal: PLOS ONE

We hereby submit a revised version of our manuscript titled "Comprehensive nutrient analysis in agricultural organic amendments through non-destructive assays using machine learning" by Towett K. Erick, Drake B. Lee, Acquah E. Gifty, Haefele M. Stephan, McGrath P. Steve, and Shepherd D. Keith, to be considered for publication in PLOS ONE. Thank you very much for the opportunity to revise our manuscript to address the reviewers’ comments. We have carefully revised the manuscript in light of the comments of peer reviewers 2 and 3, which were geared towards making the MS more informative. We have also addressed the editor's comments. Copies of the manuscript, Tables and Figures with tracked changes are submitted separately as requested. The responses to the specific comments by the reviewers are given in a separate submitted file named “response to reviewers”. 
The responses to the specific comments by the reviewers are given as follows.

Response to Reviewers' Comments

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes

2. Has the statistical analysis been performed appropriately and rigorously? Reviewer #1: No Reviewer #2: Yes Reviewer #3: Yes

3. Have the authors made all data underlying the findings in their manuscript fully available? The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data (e.g. participant privacy or use of data from a third party), those must be specified. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes

4. Is the manuscript presented in an intelligible fashion and written in standard English? PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. 
Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here. Reviewer #1: Yes Reviewer #2: Yes Reviewer #3: Yes

5. Review Comments to the Author. Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: THE STUDY VALIDATE THE APPLICATION OF XPS FOR THE REAL-TIME SOIL NUTRIENT ANALYSIS. IT IS WELL ACCOMPLISHED WITH NECESSARY METHODS. the conclusion is supported with the necessary results. authors approach to use mechine learning for the data analysis is to be appreciated.

Response to Comment: We very much appreciate the reviewer’s recommendation and constructive suggestions on how this MS may be revised to be more informative.

Reviewer #2: General commends. The author did good piece of work on machine learning helps to find percentage of carbon and nitrogen presence in the soil. XRF is capable of estimating properties like carbon and nitrogen. the work on artificial neural networks was well explained. The author addresses following question

1. In conclusion part authors discussing reason it may be avoid and rewrite.

Response to comment no. 1: We agree that the conclusions of the manuscript need not give reasons, and we have changed the initial wording “These results are significant for the following reasons:” so that it now reads “These results are significant in that:…”.

2. From your study, How MIR + machine learning useful to derive finding of Mg and Mn concentrations? 
Response to comment no. 2: When we revised the manuscript, we considered the concern raised and have edited the discussion section to include a statement that “Forest regressions provided comparable results for MIR and XRF for Mg (Table 3), while Mn had weaker cross-validation results. That said, the general pattern observed for MIR was that it generally performed well on elements that were not transition metals; with the exception being Fe, likely due to its higher abundance” in lines 342-345.

3. Mentation accuracy difference (Percentage) Tracer 5i pXRF 900F4166 and 900F4473 instrument.

Response to comment no. 3: The difference between 900F4166 and 900F4473 isn’t accuracy, but rather photon flux: the XRF tube in 900F4473 generated more X-rays per unit time for the energy range of light elements (1 - 4 keV). This could be due to any number of manufacturing differences, such as the thickness of the Rh anode in the tube, the thickness of the Be window used on either the tube or the detector, or small differences in geometry that improve fluorescence in a given energy range. It is precisely these small manufacturing differences that produce the need for individual instrument calibrations. This is described at the end of the second paragraph on page 9, lines 327 to 333.

4. Line number 211, calibrations, 67% was chosen randomly what could be the reason for choosing 67%. If more 80 % and above does the values change?

Response to comment no. 4: Random sampling is the simplest way of selecting samples, as it creates a subset that follows the statistical distribution of the original dataset. The actual value used for cross-validation is ultimately about tradeoffs: what is the minimal number of samples needed to train a model to generalize to a larger data set? A 20% validation split (i.e. 80% of samples used for training) would produce a higher R2 and a slope closer to 1 for both the training and test data sets. 
The more restrictive 33% cross-validation sample (i.e. 67% used for training data) was used given the significance of the claims: accurately quantifying N or C using XRF (and P or S using MIR) is a notable expansion of each instrument’s capability. The cross-validation split was used to stress-test these claims and demonstrate the generalizability of the models, rather than to get the highest possible R2 or the slope closest to 1. It would be interesting to rerun these data at a 20% split, but it would take at least two weeks and thus extend beyond the review time (as well as being a costly allocation of computer resources). We have added text to clarify the rationale behind this particular cross-validation split choice in lines 231-234. Randomized cross-validation is in general an unbiased method; it is also efficient, as more samples are required to achieve the representativeness of the data. This method is commonly used as it is easy to carry out and unbiased. Shepherd & Walsh (2002) and McCarty et al. (2002) are exemplar studies in which this sampling approach has been used in spectral modelling, where each dataset is first randomly split into a calibration and a validation set (75% and 25%, respectively). In general, although an increase in calibration set size could increase the performance of the model (and thus increasing the calibration set to 80% would increase the accuracy of the validation data), it has been found that in many datasets a calibration sample size >75% does not provide much improvement to model prediction, and it also means that only 25% of the samples need to be fully analysed to provide a good calibration set. Following this approach, for our study we randomly chose 67% of the samples for calibration and 33% for validation, as the dataset was split based on the unique reference properties and on sample types (Table 1 & Figure 2).

Shepherd KD, Walsh MG. 2002. 
Development of reflectance spectral libraries for characterization of soil properties. Soil Science Society of America Journal 66:988–998. DOI 10.2136/sssaj2002.9880.

McCarty GW, Reeves JB, Reeves VB, Follett RF, Kimble JM. 2002. Mid-infrared and near-infrared diffuse reflectance spectroscopy for soil carbon measurement. Soil Science Society of America Journal 66:640–646. DOI 10.2136/sssaj2002.6400.

Reviewer #3: Comments to the Authors

1). The authors have tried to find alternative sources for Comprehensive Nutrient Analysis in Agricultural Organic Amendments through Non-Destructive Assays Using Machine Learning. The work done by the authors is commendable and applaudable. It is considered commendable since this study provides an alternative solution for portable spectrometry in combination with machine learning a scalable solution to provide comprehensive nutrient analysis for organic amendments.

1). Abstract: Avoid abbreviations in the abstract

Response to comment no. 1: Many thanks for this comment. The abbreviations have been corrected for the elements Carbon and Nitrogen.

2). Introduction: The authors have arranged the literatures and context relating the significance of choosing the objective and scope of the study. The following comments in this section are Line 26 - Small-scale and family farmers – Define.

Response to comment no. 2: Thanks for the comments. This reference to the family farms has been deleted. Although it can be argued that limited areas under smallholder farming systems are closer to the homestead, family farms are those small plots nearing the farmhouse where most recycling of organic waste occurs and a diverse range of crops is produced.

3). Line 27 – Authors are to include some literature related to food supply in developing countries. 
Readers will be interested to understand about what is food supply in developing countries through some literatures in introduction section.

Response to comment no. 3: Thank you for this comment. This would be too broad for the present scope of the manuscript and out of context with the study. However, literature on the above-mentioned food supply dynamics in Sub-Saharan African countries has been included in lines 32-43, with particular emphasis on the need for establishing effective quality assurance mechanisms.

4). Line 109 – The authors are recommended to add certain background studies related to macro and micronutrients. Further, additional details for C and N, as well as ash in this study are also encouraged. This information will act as the state of the art for the readers and also compare with the current research findings of the authors.

Response to comment no. 4: The reviewer refers here to the last line of the objectives, and it would be rather unconventional to add references there. However, the requested descriptions and references are available in the text at lines 99 to 104 and 108 to 130.

5). Materials and Methods: The authors have presented the methodology in a standard and technical aspects. However there are certain facts that are to be improved and included to increase the reader’s interest.

Response to comment no. 5: This is a very general comment which is difficult to address. We do believe that we introduced the method from a technical point of view but also in relation to its possible uses, so we thought this should create interest for the paper.

6). Line 176 – Though a detailed explanation on experimental model has been provided. Authors are encouraged to provide real time pictures of the experimental study carried out to increase the reader’s interest (If available). In addition, an image of the field used in this study can be included in the manuscript. 
Response to comment no. 6: We could include images of the equipment in action, taken in the laboratory or to simulate field as we did not take measurements in the field for this study. We would ask the editor to decide if such images are something the journal would like to include - we have included S1 – S3 Photos after the S1 and S2 Figures in the supporting information for consideration.

7). Results
This section has been explained in appropriate manner.

Response to comment no. 7: We very much appreciate the reviewer comment.

8). Discussion
This section has been discussed in detail by the authors and the literatures stated in this section have been covered well related to the objective and scope of this study.

Response to comment no. 8: Thanks for this comment.

9). Conclusion
Include solid findings with quantifiable results. Add the scope and future directions in brief.

Response to comment no. 9: We also very much appreciate the reviewer’s constructive suggestions and recommendations to add the scope and future directions to the conclusion. This has been done at the end of the conclusion, lines 412-414 and 429-431.

10). General comments to authors
I encourage and recommend the authors to also provide site images, to increase the curiosity in readers. I would like to recommend minor revision of this study and accept this manuscript in its present form.

Response to comment no. 10: We also very much appreciate the reviewer’s constructive suggestions and recommendations on including images (S1 – S3 Photos) in the supporting information to manuscript to be more informative and we have undertaken to include the same as supplementary to the MS and let the editor decide on whether these can be included, as outlined in the response to reviewer question 6 above.

Submitted filename: Response to Reviewers.docx

10 Nov 2020

Comprehensive nutrient analysis in agricultural organic amendments through non-destructive assays using machine learning

PONE-D-20-15361R1

Dear Dr. Erick K Towett,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication.
For more information, please contact onepress@plos.org.

Kind regards,
Balasubramani Ravindran, Ph.D
Academic Editor
PLOS ONE

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed
Reviewer #2: All comments have been addressed
Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes
Reviewer #2: Partly
Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: (No Response)
Reviewer #2: Yes
Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data (e.g. participant privacy or use of data from a third party), those must be specified.

Reviewer #1: Yes
Reviewer #2: Yes
Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes
Reviewer #2: Yes
Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)
Reviewer #2: The Authors answers all the question hence we can proceed for publication.
Reviewer #3: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No
Reviewer #2: Yes: Dr G. RAMALINGAM
Reviewer #3: No

1 Dec 2020

PONE-D-20-15361R1

Comprehensive nutrient analysis in agricultural organic amendments through non-destructive assays using machine learning

Dear Dr. Towett:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,
PLOS ONE Editorial Office Staff
on behalf of
Dr. Balasubramani Ravindran
Academic Editor
PLOS ONE
