| Literature DB >> 32322619 |
Agus Arip Munawar1,2, Yuswar Yunus1,2, Purwana Satriyo1,2.
Abstract
Presented paper describes spectroscopic dataset and calibration models database of near infrared spectroscopy (NIRS) used to predict agricultural soil fertility properties. Near infrared spectra data in form of absorbance spectrum were acquired in wavelength range from 1000 to 2500 nm for a total of 40 bulk soil samples amounted of 10 g per each bulk. Soil fertility properties, presented as soil nitrogen (N), phosphorus (P). potassium (K), soil pH, magnesium (Mg) and calcium (Ca), were measured by means of wet chemical analysis. Calibration models, used to predict those soil fertility parameters were developed using two different regression algorithms namely principal component regression (PCR) and partial least square regression (PLSR) respectively. Prediction performance can be evaluated and justified by looking their statistical indicators: correlation of determination (R2), correlation coefficient (r), root mean square error (RMSE) and residual predictive deviation (RPD). Spectra data can also be corrected in order to improve and enhance prediction performance. Obtained NIRS dataset and models database can be used as a rapid and simultaneous method to determine agricultural soil fertility properties.Entities:
Keywords: Calibration model; Datasets; NIRS; Prediction; Soil
Year: 2020 PMID: 32322619 PMCID: PMC7163313 DOI: 10.1016/j.dib.2020.105469
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1near infrared absorbance spectra after first derivative (D1) at different soil depth.
Fig. 2Prediction performance for soil N and P prediction using two different calibration approach: PCR and PLSR.
Fig. 3Prediction performance for soil K and pH prediction using two different calibration approach: PCR and PLSR.
Fig. 4Prediction performance for soil Mg and Ca prediction using two different calibration approach: PCR and PLSR.
Prediction performance for all calibration models database to determine soil fertility properties.
| Soil properties | Calibration models | Statistical Indicators | |||
|---|---|---|---|---|---|
| R2 | r | RMSEC | RPD | ||
| N | PCR | 0.85 | 0.92 | 0.07 | 2.00 |
| PLSR | 0.87 | 0.93 | 0.04 | 3.50 | |
| P | PCR | 0.93 | 0.96 | 2.97 | 3.86 |
| PLSR | 0.99 | 0.99 | 2.12 | 5.41 | |
| K | PCR | 0.88 | 0.94 | 0.25 | 2.04 |
| PLSR | 0.90 | 0.95 | 0.19 | 2.68 | |
| pH | PCR | 0.92 | 0.96 | 0.83 | 2.66 |
| PLSR | 0.93 | 0.96 | 0.77 | 2.87 | |
| Mg | PCR | 0.91 | 0.95 | 2.84 | 1.67 |
| PLSR | 0.94 | 0.97 | 2.14 | 2.22 | |
| Ca | PCR | 0.90 | 0.95 | 3.66 | 1.75 |
| PLSR | 0.93 | 0.96 | 3.14 | 2.04 | |
Ca: soil calcium, K: potassium, Mg: magnesium, N: nitrogen, P: phosphorus, pH: soil pH, PCR: principal component regression, PLSR: partial least square regression, r: correlation coefficient, R2: coefficient of determination, RMSEC: root mean square error in calibration, RPD: residual predictive deviation index.
Descriptive statistics data of actual measured soil fertility properties.
| Descriptive statistics | N | P | K | pH | Mg | Ca |
|---|---|---|---|---|---|---|
| Mean | 0.15 | 14.49 | 0.88 | 5.78 | 6.43 | 7.45 |
| Max | 0.52 | 40.92 | 2.58 | 11.21 | 18.07 | 20.18 |
| Min | 0.02 | 1.68 | 0.26 | 2.57 | 0.31 | 0.39 |
| Range | 0.50 | 39.24 | 2.32 | 8.64 | 17.76 | 19.79 |
| Std. Deviation | 0.14 | 11.46 | 0.51 | 2.21 | 4.75 | 6.41 |
| Variance | 0.02 | 131.26 | 0.26 | 4.89 | 22.60 | 41.08 |
| RMS | 0.21 | 18.38 | 1.02 | 6.18 | 7.96 | 9.78 |
| Skewness | 1.25 | 0.97 | 1.44 | 0.73 | 0.43 | 0.68 |
| Kurtosis | 0.47 | -0.16 | 2.14 | -0.12 | -0.52 | -0.95 |
| Median | 0.09 | 9.77 | 0.69 | 5.31 | 6.54 | 5.55 |
| Q1 | 0.04 | 5.86 | 0.55 | 4.23 | 1.48 | 2.00 |
| Q3 | 0.24 | 21.92 | 1.16 | 6.75 | 9.28 | 12.68 |
Ca: calcium, K: potassium, Mg: magnesium, N: nitrogen, P: phosphorus, Q1: first quartile, Q3: third quartile.
Fig. 5Spectra data for soil samples projected onto PCA and Hotelling T2 ellipse for outlier detection.
Fig. 6Prediction performance of calibration models to predict soil fertility properties.
| Subject | Agricultural and Biological Sciences |
| Specific subject area | Spectroscopy, non-invasive and rapid method for soil fertility properties determination |
| Type of data | Table |
| How data were acquired | Spectral datasets of soil samples were acquired using a benchtop Fourier transform near infrared (NIR) spectroscopy ( |
| Data format | Raw |
| Parameters for data collection | Spectra datasets of soil samples were used to predict fertility parameters in form of soil nitrogen content (N), phosphorus (P), potassium (K), soil pH (pH), magnesium (Mg) and calcium (Ca). Soil samples were collected per 5 cm at top soil from 0 to 20 cm depth. |
| Description of data collection | Calibration models database were established and obtained by regressing near infrared spectroscopic data as independent variables (X) and actual soil fertility properties: N, P, K, pH, Mg and Ca as dependent variables (Y). calibration models were carried out by means of two different regression approaches namely principal component regression (PCR) and partial least square regression (PLSR). During calibration modelling, n-fold cross validation was applied to generate more robust and accurate prediction results. Cross validation can also be performed using leverage validation or test matrix methods. Moreover, generated NIR model database can be evaluated and tested using unknown independent samples datasets. |
| Data source location | Near infrared spectra dataset of soil samples and their fertility properties were collected at the Department of Agricultural Engineering and Department of Soil Science, |
| Data accessibility | Dataset are available on this article and can be found in Mendeley repository data: |