| Literature DB >> 34089321 |
Nosa Agbonkonkon1, Greg Wojciechowski1, Derek A Abbott1, Sara P Gaucher1, Daniel R Yim1, Andrew W Thompson1, Michael D Leavell1.
Abstract
Recent innovations in synthetic biology, fermentation, and process development have decreased time to market by reducing strain construction cycle time and effort. Faster analytical methods are required to keep pace with these innovations, but current methods of measuring fermentation titers often involve manual intervention and are slow, time-consuming, and difficult to scale. Spectroscopic methods like near-infrared (NIR) spectroscopy address this shortcoming; however, NIR methods require calibration model development that is often costly and time-consuming. Here, we introduce two approaches that speed up calibration model development. First, generalized calibration modeling (GCM) or sibling modeling, which reduces calibration modeling time and cost by up to 50% by reducing the number of samples required. Instead of constructing analyte-specific models, GCM combines a reduced number of spectra from several individual analytes to produce a large pool of spectra for a generalized model predicting all analyte levels. Second, randomized multicomponent multivariate modeling (RMMM) reduces modeling time by mixing multiple analytes into one sample matrix and then taking the spectral measurements. Afterward, individual calibration methods are developed for the various components in the mixture. Time saved from the use of RMMM is proportional to the number of components or analytes in the mixture. When combined, the two methods effectively reduce the associated cost and time for calibration model development by a factor of 10.Entities:
Keywords: Fermentation; Generalized calibration model (GCM); NIR spectroscopy; RMMM; Sibling model
Mesh:
Year: 2021 PMID: 34089321 PMCID: PMC9113423 DOI: 10.1093/jimb/kuab033
Source DB: PubMed Journal: J Ind Microbiol Biotechnol ISSN: 1367-5435 Impact factor: 4.258
Fig. 1Schematic of current wet chemistry method showing its long turnaround time as well as traditional individualized NIR model method. GCM and RMMM time savings are also depicted.
Fig. 2Calibration models for individual organic acids. This individualized model method is the traditional mode for NIR model development.
Calibration Model Parameters and Quality Metrics for the Individual Organic Acid Models and the Generalized Model
| Name | Rank |
| RMSECV (M) | Spectral range (cm–1) | Math pretreatment | RPD |
|---|---|---|---|---|---|---|
| Glutaric acid | 6 | 0.9984 | 0.0045 | 6348–5315 | SNV | 24.7 |
| Tartaric acid | 8 | 0.9984 | 0.0035 | 7505–6796, 4428–4242 | First derivative + SNV | 25.4 |
| Succinic acid | 5 | 0.9954 | 0.0072 | 8458–7498, 6101–5446 | SNV | 15 |
| Adipic acid | 5 | 0.9676 | 0.017 | 9403–7498 | First derivative | 5.59 |
| Generalized model | 11 | 0.9915 | 0.0103 | 9403–5446 | First derivative | 10.8 |
Fig. 3The organic acid GCM combining spectra from samples containing the individual organic acids is shown in (a). The organic acid GCM with glutaric, tartaric, and adipic acids is shown in (b). Model (b) was used to predict concentrations of succinic acid, which was not included in its calibration set. The regression plot (c) shows good correlation between measured and predicted succinic acid concentrations.
Fig. 4Individual α-bisabolol and farnesene models are shown in (a) and (b). The terpene GCM from the combination of spectra taken from samples containing either α-bisabolol and farnesene (c) shows a model that is as good as the two individual models. The calibration range of farnesene was higher than that of α-bisabolol, and combining both analytes into the terpene GCM allowed extension of the α-bisabolol range. (d) Compares α-bisabolol concentrations in a set of samples generated at pilot scale predicted by the terpene GCM generated from lab-scale samples.
Calibration Model Parameters and Quality Metrics of the Individual α-Bisabolol and Farnesene Models and the Terpene GCM
| Name | Rank |
| RMSECV (g/kg) | Spectral range (cm–1) | Math pretreatment | RPD |
|---|---|---|---|---|---|---|
| α-Bisabolol | 9 | 0.9929 | 2.240 | 8454–7498, 6102–5446 | First derivative + SNV | 11.8 |
| Farnesene | 7 | 0.9990 | 1.310 | 7506–6094 | SNV | 31.3 |
| Terpenes | 15 | 0.9958 | 2.090 | 7506–5446 | SNV | 15.4 |
Summary Results of the NIR Calibration Models for Molecules in Groups A and B, Showing the Metrics Used to Determine Model Quality. Different Spectral Regions, Math Pretreatment, and Ranks Were Used
| Group | Name | Rank |
| RMSECV (g/kg) | Spectral range (cm–1) | Math pretreatment | RPD |
|---|---|---|---|---|---|---|---|
| A | Cadaverine | 9 | 0.9767 | 0.84 | 9403–6094, 4605–4420 | First derivative + SNV | 6.6 |
| 2-Phenylethanol | 10 | 0.9610 | 1.41 | 9403–7336, 6310–5785 | First derivative | 5.1 | |
| α-Bisabolol | 5 | 0.9902 | 3.57 | 6012–5447 | First derivative + SNV | 10.3 | |
| B | Squalene | 6 | 0.9735 | 1.16 | 6102–5770 | Second derivative | 6.2 |
| Oleic acid | 7 | 0.9753 | 1.28 | 10,391–9588, 8794–7992, 6395–5592 | MSC | 6.4 | |
| 4 | 0.9951 | 0.534 | 6102–5770 | First derivative + MSC | 14.2 |
Fig. 5Calibration models of the molecules in groups A and B generated using RMMM.