Literature DB >> 34095376

On-site substrate characterization in the anaerobic digestion context: A dataset of near infrared spectra acquired with four different optical systems on freeze-dried and ground organic waste.

Margaud Pérémé1,2,3, Alexandre Mallet1,4,5,3, Lorraine Awhangbo1,3, Cyrille Charnier5, Jean-Michel Roger4,3, Jean-Philippe Steyer1, Éric Latrille1,3, Ryad Bendoula4.   

Abstract

The near infrared spectra of thirty-three freeze-dried and ground organic waste samples of various biochemical composition were collected on four different optical systems, including a laboratory spectrometer, a transportable spectrometer with two measurement configurations (an immersed probe, and a polarized light system) and a micro-spectrometer. The provided data contains one file per spectroscopic system including the reflectance or absorbance spectra with the corresponding sample name and wavelengths. A reference data file containing carbohydrates, lipid and nitrogen content, biochemical methane potential (BMP) and chemical oxygen demand (COD) for each sample is also provided. This data enables the comparison of the optical systems for predictive model calibration based for example on Partial Least Squares Regression (PLS-R) [1], but could be used more broadly to test new chemometrics methods. For example, the data could be used to evaluate different transfer functions between spectroscopic systems [2]. This dataset enabled the research work reported by Mallet et al. 2021 [3].
© 2021 The Author(s). Published by Elsevier Inc.

Entities:  

Keywords:  Chemometrics; Compact spectrometers; Instrument standardization; Near infrared spectroscopy; Organic waste

Year:  2021        PMID: 34095376      PMCID: PMC8166774          DOI: 10.1016/j.dib.2021.107126

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table NIRFlex N-500 FT-NIR (BUCHI, France) Immersed probe consisting of two fibers (one for illumination, the other for signal collection) plugged to a Rocket FTNIR-L1–025–2TE (Arcoptics, Switzerland) Polarized light system (PoLiS) plugged to a Rocket FTNIR-L1–025–2TE (Arcoptics, Switzerland) Neospectra-micro + Raspberry-Pi (Si-Ware, Egypt) Value of the Data Near infrared spectroscopy provides a fast and non-destructive methodology to characterize solid organic waste substrates involved in bioprocesses such as anaerobic digestion or composting. This unique dataset allows to compare the analytical performance of different compact spectroscopic systems for organic waste characterization including a handheld micro spectrometer, and two potential at-line systems with an immersed probe or polarized light system [4]. These spectroscopic systems can be compared to a laboratory spectrometer considered as the standard instrument of reference. The dataset could be used as well for instrument standardization [2] to test different strategies for building transfer models between instruments such as piecewise direct standardization (PDS) [5], or transfer by orthogonal projection (TOP) [6]. The instrument standardization from benchtop spectrometers to portable spectrometers is of high interest for researchers wishing to use calibration models on the field [7]. Researchers in chemometrics or bioprocesses can benefit from this data, as it allows to build predictive models on organic waste. These models could find their application in anaerobic digestion monitoring.

Data Description

Data provided in this article consists of near infrared spectra of 33 organic waste samples of various biochemical composition, acquired on four different optical systems: an FT-NIR laboratory spectrometer (NIRFlex N-500, Buchi), a FT-NIR compact spectrometer (Rocket, Arcoptics) with two configurations (a contact immersed probe and a polarized light system PoLiS), and an FT-NIR micro spectrometer (Neospectra-micro, Si-Ware) based on MEMS FT-NIR technology. Each sample was scanned in triplicates on each optical system. Depending on the system, absorbance or reflectance spectra were obtained (Fig. 1). Each collected signals type was put into a separate file: the signals from the laboratory spectrometer (“lab_spectrometer_spectra_absorbance_nm.tab”), the signals from the compact spectrometer with the immersed probe (“immersed_probe_spectra_absorbance_nm.tab”), the three signals from the compact spectrometer with the PoLiS system (“Polis_Rbs_spectra_reflectance_nm.tab”, “Polis_Rss_spectra_reflectance_nm.tab”, “Polis_Rms_spectra_reflectance_nm.tab”), and the signals from the micro-spectrometer (“microspectrometer_spectra_absorbance_nm.tab”). In each of these files, the first column corresponds to the name of the substrate, and the first row header corresponds to the wavelengths in nm.
Fig. 1

Raw spectra obtained on the 33 organic waste samples with A) the laboratory spectrometer (absorbance, in nm, referred as Alab), B) the compact spectrometer with an immersed probe (absorbance, in nm, referred as Aip), C) the micro spectrometer (absorbance, in nm, referred as Aμ), and the spectra obtained using the compact spectrometer plugged to the polarized light system PoLiS with D) the single scattered signals (reflectance, in nm, referred as Rss), E) the multi-scattered signal (reflectance, in nm, referred as Rms) and F) the total backscattered signal (reflectance, in nm, referred as Rbs).

Raw spectra obtained on the 33 organic waste samples with A) the laboratory spectrometer (absorbance, in nm, referred as Alab), B) the compact spectrometer with an immersed probe (absorbance, in nm, referred as Aip), C) the micro spectrometer (absorbance, in nm, referred as Aμ), and the spectra obtained using the compact spectrometer plugged to the polarized light system PoLiS with D) the single scattered signals (reflectance, in nm, referred as Rss), E) the multi-scattered signal (reflectance, in nm, referred as Rms) and F) the total backscattered signal (reflectance, in nm, referred as Rbs). In addition, a reference data file containing chemical information about each sample (carbohydrate, lipid and Nitrogen content, biochemical methane potential and chemical oxygen demand) is provided, as shown in Fig. 2. The reference data is consolidated into one file called “reference_data.tab”, with the first column that corresponds to substrate names, and the first row header that corresponds to the reference variable name (including its unit).
Fig. 2

Histograms of reference values obtained on the 33 organic waste samples using NIRS prediction model. For each subplot, both the mean value (μ) and standard deviation value (σ) are provided; and the dashed line represents the mean.

Histograms of reference values obtained on the 33 organic waste samples using NIRS prediction model. For each subplot, both the mean value (μ) and standard deviation value (σ) are provided; and the dashed line represents the mean. Finally, a python script is provided to show how a partial least squares regression (PLS-R) can be applied on this data. The script runs a train/test random split, followed by a k-fold cross-validation on train dataset to find optimal latent variable number of the PLS model and finally plots predictions and observed values on test set to evaluate obtained model performance. The python script name is “data_usage_example.py”, and a documentation on how to use this script can be found in the text file “README.md”.

Experimental Design, Materials and Methods

Samples and reference data

The 33 freeze-dried and ground samples were gathered from different anaerobic digestion plant in France (Fig. 3). The substrates cover a wide range of waste types including animal manure, animal waste, crop residues, food waste, and wastewater treatment plant waste (Table 1). Reference chemical data were obtained using a characterization method based on NIRS as described in Charnier et al. [8]. The standard errors of prediction evaluated on an independent test set were 53 mgO2.gTS−1 for carbohydrates content, 3.2 × 10−2 g.gTS−1 for lipid content, 8.6 × 10−3 g.gTS−1 for nitrogen content and 83 mgO2.gTS−1 for chemical oxygen demand.
Fig. 3

Map (focused on France) showing the positions of the anaerobic digestion plants from where the substrates were collected.

Table 1

Substrate characteristics (name, waste type and origin).

Substrate nameWaste typeOrigin
goat_manureAnimal manurePlant 7
chicken_manureAnimal manurePlant 7
cow_manureAnimal manurePlant 10
pig_slurryAnimal manurePlant 8
horse_manureAnimal manurePlant 5
dairy_sludgeAnimal wastePlant 1
duck_slurryAnimal wastePlant 1
charcuterie_grease_tankAnimal wastePlant 1
slaughterhouse_sludgeAnimal wastePlant 1
gelatinous_waterAnimal wastePlant 1
corn_wasteCrop residuesPlant 5
grape_marcCrop residuesPlant 5
wheat_derivativeCrop residuesPlant 8
corn_derivativeCrop residuesPlant 8
sunflower_derivativeCrop residuesPlant 8
clover_silageCrop residuesPlant 9
strawCrop residuesPlant 9
overpressed_beet_pulpCrop residuesPlant 9
eggFood wastePlant 6
food_industrie_wasteFood wastePlant 11
ready_meal_greaseFood wastePlant 1
biscuit_doughFood wastePlant 1
ready_meal_wasteFood wastePlant 1
lactoserumFood wastePlant 5
vegetablesFood wastePlant 4
chocolateFood wastePlant 4
vegetables_hydrolysisFood wastePlant 3
lemon_pulpFood wastePlant 1
water_treatment_plant_grease_tankWastewater treatmentPlant 1
sewage_sludgeWastewater treatmentPlant 2
paper_mill_wasteWastewater treatmentPlant 9
water_treatment_plant_flotting_sludgeWastewater treatmentPlant 3
Map (focused on France) showing the positions of the anaerobic digestion plants from where the substrates were collected. Substrate characteristics (name, waste type and origin).

Near infrared spectra acquisition

Laboratory spectrometer

For the laboratory spectrometer reference, freeze-dried and ground samples were scanned in reflectance over 12,500 - 4000 cm−1 (1000–2500 nm), with a resolution of 8 cm−1, using a BUCHI NIR-Flex N-500 solids spectrophotometer fitted with a vial accessory (Buchi, Flawil, Switzerland). Each sample was measured three times and shaken between each replicate. Furthermore, each measurement consisted of an average of 96 scans performed during vial rotation to obtain a representative measurement. White reference background signal was collected on a Spectralon® (99% reflectance). Measurements were obtained and absorbance was calculated.

Compact spectrometer with immersed probe

Both the immersed probe (Section 2.2.2) and PoLiS (Section 2.2.3) measurements were acquired using a compact Arcoptix FT-NIR Rocket spectrometer and an Ocean Optics’ HL-2000 halogen lamp as light source. Samples were scanned in reflectance mode over 3800 to 11,000 cm−1 (900–2500 nm) with a resolution of 4 cm−1. Each sample was measured on three different spots, and each measurement is an average of ten scans. The dark current (signal recovered without light) In(λ) is recorded at the beginning of the measurement session and automatically subtracted from the measured intensity. A white reference (SRS99, Spectralon®) was used as a reference to standardize spectra from non-uniformities of all components of the instrumentation (light source, fibers, spectrometer) every hour. The reflected light intensity was recorded and the absorbance signal was computed:

Compact spectrometer with polis system

For the PoLiS measurements, the incident light cone was s-polarized using a wire-grid polarizer (Thorlabs WP25L-UB). The reflected light was split into an s-polarized image and a p-polarized image with a calcite Wollaston polarizer (Thorlabs WP10P) providing respectively parallel III(λ) and perpendicular I⊥(λ) light intensities. The system resulted in three different signals: the single scattering reflectance , the multiple scattering reflectance , and the total backscattering reflectance :While the immersed probe is a contact probe (sample is placed directly in contact with probe), the PoLiS system is a distant measurement system placed at 5 cm from the sample.

Micro spectrometer

Finally, the samples were analyzed using a micro spectrometer Neospectra-micro (Si-Ware, Egypt) mounted on a Raspberry Pi single-board computer. The instrument was controlled using a connection to a PC and the SpectraMOST software was used. Samples were scanned in reflectance mode over 3921 cm−1 to 7407 cm−1 (1350–2550 nm) and a resolution of 66 cm−1. Each sample was measured on three different spots, and each measurement was obtained with a scan time of 28 s. A white reference (SRS99, Spectralon®) signal was collected before each measurement. For each sample, the intensity signal was collected and the absorbance signal was computed:

Ethics Statement

Authors declare that here is no ethical issues regarding this dataset.

CRediT Author Statement

Margaud Pérémé: Data curation, Writing – original draft, Methodology, Investigation; Alexandre Mallet: Conceptualization, Methodology, Software, Resource, Writing – review & editing, Supervision; Lorraine Awhangbo: Conceptualization, Methodology, Supervision; Cyrille Charnier: Writing – review & editing, Supervision; Jean-michel Roger: Writing – review & editing, Supervision; Jean-philippe Steyer: Writing – review & editing, Supervision; Éric Latrille: Conceptualization, Methodology, Resource, Data curation, Writing – review & editing, Supervision; Ryad Bendoula: Conceptualization, Methodology, Resource, Writing – review & editing, Supervision.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships which have or could be perceived to have influenced the work reported in this article.
SubjectVIS-NIR Spectroscopy
Specific subject areaOptical system comparison, organic waste characterization
Type of dataTableFigureScript
How data were acquiredData was acquired on the following FT-NIR spectrometers:

NIRFlex N-500 FT-NIR (BUCHI, France)

Immersed probe consisting of two fibers (one for illumination, the other for signal collection) plugged to a Rocket FTNIR-L1–025–2TE (Arcoptics, Switzerland)

Polarized light system (PoLiS) plugged to a Rocket FTNIR-L1–025–2TE (Arcoptics, Switzerland)

Neospectra-micro + Raspberry-Pi (Si-Ware, Egypt)

Data formatRawAnalyzedPresented as .tab files (table) and .py files (script)
Parameters for data collection33 solid organic waste substrates of different biochemical composition were analyzed on four different optical systems: a benchtop laboratory spectrometer, a compact spectrometer with two measurement configurations (contact immersed probe and polarized light system), and a micro spectrometer.
Description of data collection33 solid organic waste samples were freeze-dried and ground, and scanned on four different optical systems.Depending on the optical system, absorbance or reflectance values are provided. For each sample, triplicate (x3) spectra were acquired to enhance spectral representativeness.In parallel, a characterization method based on NIRS allowed to obtained reference values for each of these samples: carbohydrates content, lipids content, nitrogen content, chemical oxygen demand (COD), and biochemical methane potential (BMP)
Data source locationInstitution: LBE, INRAECity/Town/Region: NarbonneCountry: France
Data accessibilityRepository name: Data Inrae (Dataverse)Dataset name: On-site substrate characterization in the anaerobic digestion context: a dataset of near infrared spectra acquired with four different optical systems on freeze-dried and ground organic wastesData identification number: 10.15454/SQQTUUDirect URL to data: https://doi.org/10.15454/SQQTUU
Related research articleA. Mallet, M. Pérémé, L. Awhangbo, C. Charnier, J.-M. Roger, J.-P. Steyer, É. Latrille, R. Bendoula, Fast at-line characterization of solid organic waste: Comparing analytical performance of different compact near infrared spectroscopic systems with different measurement configurations, Waste Manag. 126 (2021) 664–673. https://doi.org/10.1016/j.wasman.2021.03.045.
  4 in total

1.  Fast at-line characterization of solid organic waste: Comparing analytical performance of different compact near infrared spectroscopic systems with different measurement configurations.

Authors:  Alexandre Mallet; Margaud Pérémé; Lorraine Awhangbo; Cyrille Charnier; Jean-Michel Roger; Jean-Philippe Steyer; Éric Latrille; Ryad Bendoula
Journal:  Waste Manag       Date:  2021-04-16       Impact factor: 7.145

2.  Fast characterization of solid organic waste content with near infrared spectroscopy in anaerobic digestion.

Authors:  Cyrille Charnier; Eric Latrille; Julie Jimenez; Margaux Lemoine; Jean-Claude Boulet; Jérémie Miroux; Jean-Philippe Steyer
Journal:  Waste Manag       Date:  2016-11-02       Impact factor: 7.145

3.  Transfer of a near infrared spectroscopy laboratory application to an online process analyser for in situ monitoring of anaerobic digestion.

Authors:  L Christian Krapf; Dieter Nast; Andreas Gronauer; Urs Schmidhalter; Hauke Heuwinkel
Journal:  Bioresour Technol       Date:  2012-11-15       Impact factor: 9.642

4.  A Review of Calibration Transfer Practices and Instrument Differences in Spectroscopy.

Authors:  Jerome J Workman
Journal:  Appl Spectrosc       Date:  2017-10-25       Impact factor: 2.388

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.