| Literature DB >> 32392841 |
Pradeep Kumar Anand1, Dong Ryeol Shin2, Mudasar Latif Memon3.
Abstract
In this paper, we present an architecture of a personalized glucose monitoring system (PGMS). PGMS consists of both invasive and non-invasive sensors on a single device. Initially, blood glucose is measured invasively and non-invasively, to train the machine learning models. Then, paired data and corresponding errors are divided scientifically into six different clusters based on blood glucose ranges as per the patient's diabetic conditions. Each cluster is trained to build the unique error prediction model using an adaptive boosting (AdaBoost) algorithm. Later, these error prediction models undergo personalized calibration based on the patient's characteristics. Once, the errors in predicted non-invasive values are within the acceptable error range, the device gets personalized for a patient to measure the blood glucose non-invasively. We verify PGMS on two different datasets. Performance analysis shows that the mean absolute relative difference (MARD) is reduced exceptionally to 7.3% and 7.1% for predicted values as compared to 25.4% and 18.4% for measured non-invasive glucose values. The Clarke error grid analysis (CEGA) plot for non-invasive predicted values shows 97% data in Zone A and 3% data in Zone B for dataset 1. Moreover, for dataset 2 results echoed with 98% and 2% in Zones A and B, respectively.Entities:
Keywords: adaptive boosting; clustering; diabetic care; error prediction model; machine learning; non-invasive blood glucose monitoring; personalized calibration
Year: 2020 PMID: 32392841 PMCID: PMC7278000 DOI: 10.3390/diagnostics10050285
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Figure 1Clarke error grid analysis (CEGA) plot with different zones.
Regulations for accurate blood glucose measurement.
| Regulation | Glucose | Acceptable | Device | MARD |
|---|---|---|---|---|
| US Food and Drug Administration (FDA) [ | Entire range | −15%~15% | > 95% | < 5% |
| −20%~20% | > 99% | < 1% | ||
| European Medicines [ | ≥100 mg/dL | −15%~15% | > 95% | < 5% |
| <100 mg/dL | −15~15 mg/dL | > 95% | < 5% |
FDA approved invasive blood glucose monitors with accuracies.
| Product Name | Manufacturer | Accuracy |
|---|---|---|
| Contour Next [ | Bayer | 100% |
| Accu-Check Aviva Plus [ | Roche | 99% |
| Walmart ReliOn Confirm [ | Arkray | 97% |
| CVS Advanced [ | AgaMatrix | 96% |
| FreeStyle Lite [ | Abbott Diabetes Care | 96% |
Comparison of non-invasive sensor technologies with their accuracies.
| Sensing Technology | MARD | CEGA Plot (%) | ||
|---|---|---|---|---|
| Zone A | Zone A and B | Zone C, D, and E | ||
| Infrared spectroscopy [ | - | 84.0 | - | - |
| Impedance spectroscopy [ | - | 56.0 | 93.0 | 7.0 |
| Diffuse reflectance [ | - | 87.5 | 95.8 | 4.2 |
| Raman spectroscopy [ | - | 86.7 | - | - |
| Optical coherence tomography [ | 11.5 | 83.0 | 99.0 | 1.0 |
| Photo-acoustic spectroscopy [ | 11.8 | 82.7 | 100.0 | 0.0 |
| Multi-sensor1 [ | 8.3 | 90.0 | 100.0 | 0.0 |
| Multi-sensor2 [ | 8.8 | 92.7 | 100.0 | 0.0 |
| Multi sensor3 [ | 22.4 | 60.0 | 96.0 | 4.0 |
Multi-sensor1 consists of near-infrared and impedance spectroscopy. Multi-sensor2 consists of near-infrared and photo-acoustic spectroscopy. Multi sensor3 consists of thermal, electromagnetic, ultrasonic.
Non-invasive and minimally-invasive blood glucose monitors with their accuracies.
| Product Name | Manufacturer | Sensing Technology | Accuracy |
|---|---|---|---|
| GlucoTrack [ | Integrity Application | Multi-technology | MARD: 23.4% |
| Combo Glucometer | CNOGA Medical | Near-Infrared | MARD: 17.1% |
| SugerBEAT [ | Nemaura Medical | Reverse | MARD: 13.8% |
| Symphony [ | Echo Therapeutics | Sonophoresis | MARD: 12.3% |
| Wizmi [ | Wear2b Ltd. | NIR spectroscopy | MARD: 7.2% |
| Eversense [ | Senseonics | Fluorescence | MARD: 14.8% |
Figure 2Personalized glucose monitoring system (PGMS) structural diagram.
Figure 3Software flowchart of the PGMS.
Baseline dataset 1.
| Parameters | Unit | GlucoTrack [ | Dataset 1 |
|---|---|---|---|
| Invasive Range | mg/dl | 65~492 | 65~492 |
| Non-invasive range | mg/dl | 80~352 | 80~352 |
| Number of paired data | - | 1772 | 918 |
| MARD | % | 23.4 | 23.9 |
| Minimum Error | % | −221 | −221 |
| Maximum Error | % | 61 | 61 |
Baseline dataset 2.
| Parameters | Unit | CoG [ | Dataset 2 |
|---|---|---|---|
| Invasive Range | mg/dl | 37~458 | 37~458 |
| Non-invasive range | mg/dl | 40~428 | 40~428 |
| Number of paired data | - | 730 | 470 |
| Minimum Error | % | −131 | −131 |
| Maximum Error | % | 65 | 65 |
| MARD | % | 17.1 | 17.4 |
Error and MARD reduction by different initial approaches.
| Parameter | Unit | AdaBoost | AdaBoost + K-Means Clustering | ||
|---|---|---|---|---|---|
| Initial Error | Final Error | Initial Error | Final Error | ||
| Minimum Error | % | −60.1 | −60.7 | −61 | −57 |
| Maximum Error | % | 149.3 | 143.5 | 149 | 139 |
| MARD | % | 27.4 | 26.3 | 26.8 | 25.1 |
Clusters formed based on domain-knowledge.
| Blood Glucose Range (mg/dL) | Cluster Name |
|---|---|
| 50–80 | Hypoglycemia |
| 81–115 | No diabetic |
| 116–150 | Pre-diabetic |
| 151–180 | Diabetic |
| 181–250 | Highly diabetic |
| > 250 | Critically diabetic |
Optimized hyperparameters values.
| Hyperparameters | Ranges | For Dataset 1 | For Dataset 2 |
|---|---|---|---|
| Regressor type | Decision Tree | Decision Tree | Decision Tree |
| Depth | 1~100 | 10 | 20 |
| Estimators | 10~500 | 200 | 150 |
| Learning rate | 0.0001~1 | 0.7 | 0.008 |
| Loss function | Linear, Square | Exponential | Linear |
| Random state | 1~5 | 3 | 1 |
Results of the PGMS applied to dataset 1.
| Cluster | Paired Data | Parameter | Unit | Initial Error | Final Error |
|---|---|---|---|---|---|
| <80 | 4 | Minimum | % | −152 | −23 |
| Maximum | % | −92 | 2 | ||
| MARD | % | 124.6 | 12.6 | ||
| 81–115 | 32 | Minimum | % | −147 | −30 |
| Maximum | % | 20 | 19 | ||
| MARD | % | 48.3 | 7.9 | ||
| 116–150 | 51 | Minimum | % | −94 | −16 |
| Maximum | % | 21 | 13 | ||
| MARD | % | 31.2 | 6.0 | ||
| 151–180 | 42 | Minimum | % | −81.0 | −18.3 |
| Maximum | % | 34.8 | 7.1 | ||
| MARD | % | 21.8 | 5.1 | ||
| 181–250 | 93 | Minimum | % | −64.9 | −21.5 |
| Maximum | % | 52.5 | 16.3 | ||
| MARD | % | 16.6 | 7.5 | ||
| >250 | 54 | Minimum | % | −12 | −32 |
| Maximum | % | 43 | 22 | ||
| MARD | % | 17.2 | 9.0 | ||
| Total | 276 | Minimum * | % | −152 | −32 |
| Maximum † | % | 53 | 22 | ||
| Overall MARD ‡ | % | 25.4 | 7.3 |
* Minimum is lowest out of 6 clusters. † Maximum is the highest out of 6 clusters. ‡ Overall MARD is the weighted MARD of 6 clusters.
Figure 4(a) Non-invasive measured (in green color) vs. non-invasive predicted (in red color) with respect to reference values (in blue color). (b) Percentage error non-invasive measured (in green color) vs. non-invasive predicted (in red color) for dataset 1.
Figure 5Clarke error grid analysis (CEGA) plot for the dataset 1. (a) For non-invasive measured values, (b) for non-invasive predicted values.
CEGA plot summary for the dataset 1.
| Zones | Non-Invasive Measured Values | Non-Invasive Predicted Values | ||
|---|---|---|---|---|
| Number | % | Number | % | |
| Zone A | 149 | 54 | 267 | 97 |
| Zone B | 115 | 42 | 9 | 3 |
| Zone C | 9 | 3 | 0 | 0 |
| Zone D | 3 | 1 | 0 | 0 |
| Zone E | 0 | 0 | 0 | 0 |
Results of the PGMS applied to the dataset 2.
| Cluster | Paired Data | Parameter | Unit | Initial Error | Final Error |
|---|---|---|---|---|---|
| <80 | 12 | Minimum | % | −43.3 | −20.6 |
| Maximum | % | 27.4 | 33.8 | ||
| MARD | % | 11.3 | 10.9 | ||
| 81–115 | 26 | Minimum | % | −130.9 | −49.8 |
| Maximum | % | 16.7 | 16.6 | ||
| MARD | % | 31.7 | 9.8 | ||
| 116–150 | 39 | Minimum | % | −64.0 | −15.4 |
| Maximum | % | 65.1 | 26.0 | ||
| MARD | % | 17.0 | 6.5 | ||
| 151–180 | 30 | Minimum | % | −56.1 | −14.2 |
| Maximum | % | 40.1 | 8.8 | ||
| MARD | % | 15.6 | 4.5 | ||
| 181–250 | 27 | Minimum | % | −50.5 | −19.6 |
| Maximum | % | 38.6 | 17.1 | ||
| MARD | % | 16.5 | 7.3 | ||
| >250 | 9 | Minimum | % | −10.4 | −17.0 |
| Maximum | % | 33.7 | 7.5 | ||
| MARD | % | 11.1 | 5.6 | ||
| Total | 143 | Minimum * | % | −131 | −50 |
| Maximum † | % | 65 | 34 | ||
| Overall MARD ‡ | % | 18.4 | 7.1 |
* Minimum is the lowest out of 6 clusters. † Maximum is the highest out of 6 clusters. ‡ Overall MARD is the weighted MARD of 6 clusters.
Figure 6(a) Non-invasive measured (in green color) vs. non-invasive predicted (in red color) with respect to reference values (in blue color). (b) Percentage error non-invasive measured (in green color) vs. non-invasive predicted (in red color) for dataset 2.
Figure 7Clarke error grid analysis (CEGA) plot for the dataset 2. (a) For non-invasive measured values, (b) for non-invasive predicted values.
CEGA plot summary for dataset 2.
| Zones | Non-Invasive Measured Values | Non-Invasive Predicted Values | ||
|---|---|---|---|---|
| Number | % | Number | % | |
| Zone A | 99 | 69 | 140 | 98 |
| Zone B | 41 | 29 | 3 | 2 |
| Zone C | 2 | 1 | 0 | 0 |
| Zone D | 1 | 1 | 0 | 0 |
| Zone E | 0 | 0 | 0 | 0 |
Performance comparison of PGMS with other non-invasive or minimal-invasive monitors.
| Non-Invasive Measurement System | MARD | CEGA Plot | ||
|---|---|---|---|---|
| Zone A | Zone B | |||
| PGMS | AdaBoost | 26.3% | - | - |
| K-Means Clustering + AdaBoost | 25.1% | - | - | |
| Domain-knowledge Clustering + AdaBoost | 7.1% | 98% | 2% | |
| GlucoTrack | 23.4% | 57% | 39% | |
| CoG | 17.1% | 86.2% | 12.6% | |
| SugarBEAT | 13.8% | - | - | |
| Symphony | 12.3% | 81.7% | 18.3% | |
| Wizmi | 7.2% | 93% | 7% | |
| Eversense | 14.8% | - | - | |