| Literature DB >> 35408697 |
Hafeez Ur Rehman1, Valeria Tafintseva1, Boris Zimmermann1, Johanne Heitmann Solheim1, Vesa Virtanen2, Rubina Shaikh3,4, Ervin Nippolainen3, Isaac Afara3, Simo Saarakkala2, Lassi Rieppo2, Patrick Krebs5, Polina Fomina5, Boris Mizaikoff5, Achim Kohler1.
Abstract
Preclassification of raw infrared spectra has often been neglected in scientific literature. Separating spectra of low spectral quality, due to low signal-to-noise ratio, presence of artifacts, and low analyte presence, is crucial for accurate model development. Furthermore, it is very important for sparse data, where it becomes challenging to visually inspect spectra of different natures. Hence, a preclassification approach to separate infrared spectra for sparse data is needed. In this study, we propose a preclassification approach based on Multiplicative Signal Correction (MSC). The MSC approach was applied on human and the bovine knee cartilage broadband Fourier Transform Infrared (FTIR) spectra and on a sparse data subset comprising of only seven wavelengths. The goal of the preclassification was to separate spectra with analyte-rich signals (i.e., cartilage) from spectra with analyte-poor (and high-matrix) signals (i.e., water). The human datasets 1 and 2 contained 814 and 815 spectra, while the bovine dataset contained 396 spectra. A pure water spectrum was used as a reference spectrum in the MSC approach. A threshold for the root mean square error (RMSE) was used to separate cartilage from water spectra for broadband and the sparse spectral data. Additionally, standard noise-to-ratio and principle component analysis were applied on broadband spectra. The fully automated MSC preclassification approach, using water as reference spectrum, performed as well as the manual visual inspection. Moreover, it enabled not only separation of cartilage from water spectra in broadband spectral datasets, but also in sparse datasets where manual visual inspection cannot be applied.Entities:
Keywords: OPUS; PCA; quality spectra; quantum cascade lasers; sparse spectra; spectral preclassification; water spectrum
Mesh:
Substances:
Year: 2022 PMID: 35408697 PMCID: PMC9000438 DOI: 10.3390/molecules27072298
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Examples of water and cartilage spectra. (a) Cartilage spectrum from dataset 1 (human), (b) Pure water spectrum.
Figure 2Raw spectra of Human dataset in range from 800 to 1900 cm. Highlighted spectra in red color represent water/analyte-poor spectra, whereas the remaining spectra are analyte-rich (cartilage) spectra.
Figure 3Results of preclassification using dataset 1 broadband spectra. (a,d,g) Water/low absorbance (analyte-poor) spectra and (c,f,i) cartilage/analyte-rich spectra identified by the MSC approach with water spectrum as reference. (b,e,h) PCA scores obtained by different thresholds, 0.13 for (b), 0.144 for (c), and 0.152 for (h). c and w in PCA scores correspond to cartilage and water spectrum.
Figure 4Visual presentation of broad band spectrum after using signal-to-noise ratio (a) Water/Analyte-poor Spectra (b) Cartilage/Analyte-rich Spectra.
Figure 5Visual presentation of Human spectral data (set 2) 7 WNs after using MSC preclassification approach, (a) raw spectra, (b) water/analyte-poor spectra, and (c) cartilage/analyte-rich spectra.