| Literature DB >> 29764796 |
Stephane Aris-Brosou1, James Kim2, Li Li2, Hui Liu2.
Abstract
BACKGROUND: Vendors in the health care industry produce diagnostic systems that, through a secured connection, allow them to monitor performance almost in real time. However, challenges exist in analyzing and interpreting large volumes of noisy quality control (QC) data. As a result, some QC shifts may not be detected early enough by the vendor, but lead a customer to complain.Entities:
Keywords: CART; QC chemistry results; adaptive boosting; complaint data; post market surveillance
Year: 2018 PMID: 29764796 PMCID: PMC5974458 DOI: 10.2196/medinform.9960
Source DB: PubMed Journal: JMIR Med Inform
List of the fields logged by e-Connectivity (that includes quality control, QC data) and by the customer complaint system. Corresponding abbreviations are shown.
| Data and Abbreviation | Short description | |
| Assay | Abbreviation of assay name (recoded here) | |
| J Number | Unique identifier assigned to each analyzer placed | |
| F Concentration | Concentration of solute (assay); QCa result | |
| Units | Unit of measured concentration (mmol/L) | |
| F Concentration (SI) | Concentration of solute (assay); QC result | |
| Units SI | Unit of measured concentration (SI) | |
| Reagent Lot Number | Reagent lot number | |
| S Gen | Manufacturing generation number | |
| S Lot | Manufacturing lot number | |
| ERF Lot | Electrolyte reference fluid lot | |
| IWF Lot | Immuno-wash fluid lot number | |
| Control Lot Number | Performance verifier lot number | |
| Cal Curve ID | Calibration curve ID | |
| Result ID | Unique identifier (encrypted) of QC result | |
| Sample Name | Unique identifier (encrypted) of sample name | |
| Time Metering | Time stamp of concentration log through e-Connectivity | |
| Total Dilution | Dilution factor | |
| Operator Dilution | Operator requested dilution | |
| Body Fluid | Fluid type (serum, plasma, or urine) | |
| Create Audit Date | Time stamp of when complaint was placed | |
| Call Subject | Same as assay in e-Connectivity | |
| Call Area | Classification of concern or problem of the product or the analyzer-generated condition | |
| Resolution | Term describing how the complaint was resolved | |
| Complaint Number | Unique identifier of each complaint | |
| Customer Number | Unique identifier of each customer | |
| J Number | Analyzer serial number | |
| Lot number | Reagent lot number | |
| Region | Geographic region where complaint was placed | |
| Call Status | Current call status of complaint (closed or open) | |
| Problem description | Free-text field describing the complaint | |
aQC: quality control.
Figure 1Feature definitions based on a typical sample logged in e-Connectivity. Assay concentrations (here for assay A) are plotted as a function of time. Horizontal blue lines show the modes of the density of sample means (our estimated verifiers). Vertical gray lines show timing of maintenance activities (change of calibration curves, etc). The orange vertical line shows when the customer placed a call—for “accuracy high” (ACCH; indicates the measured concentration is suspected of being higher than the actual value) in this example. The concentration reading just before this call (“#1”) and 10 e-Connectivity logs before it (“#10”) are indicated in red. Our machine-learning (ML) algorithms (in red) aim at learning the signatures (in purple) of call areas (orange) from a training set, to be able to identify those call areas, before a customer complains.
List of the features used in the predictive modeling. Note that a “cutoff” represents the time when a customer calls in the case of “positive samples” (when there is an actual complaint), or the time drawn at random in the case of “negative samples” (see Methods).
| Feature name | Definition |
| MostRecentConcentration | Assay concentration reading just before cutoff |
| TwoMostRecentConcentrationMean | Mean concentration for the two readings before cutoff |
| FiveMostRecentConcentrationMean | Mean concentration for the five readings before cutoff |
| TenMostRecentConcentrationMean | Mean concentration for the ten readings before cutoff |
| TwoMostRecentConcentrationSD | SD of concentration for the two readings before cutoff |
| FiveMostRecentConcentrationSD | SD of concentration for the five readings before cutoff |
| TenMostRecentConcentrationSD | SD of concentration for the ten readings before cutoff |
| NbPriorSGenChange | Number of S Gen changes before cutoff (since start of QC sample) |
| NbPriorSLotChange | Number of S Lot changes before cutoff |
| NbPriorERFLotChange | Number of ERF Lot changes before cutoff |
| NbPriorIWFLotChange | Number of IWF Lot changes before cutoff |
| NbPriorContLotNumChange | Number of Control Lot Number changes before cutoff |
| NbPriorCalCurveChange | Number of Calibration Curve changes before cutoff |
| TimeSinceLastSGenChange | Time elapsed since last S Gen change before cutoff |
| TimeSinceLastSLotChange | Time elapsed since last S Lot change before cutoff |
| TimeSinceLastERFLotChange | Time elapsed since last ERF Lot change before cutoff |
| TimeSinceLastIWFLotChange | Time elapsed since last IWF Lot change before cutoff |
| TimeSinceLastContLotNumChange | Time elapsed since last Control Lot Number change before cutoff |
| TimeSinceLastCalCurveChange | Time elapsed since last Calibration Curve change before cutoff |
| TimeToComplain | Time elapsed since last e-Connectivity log before cutoff |
Figure 2Empirical cumulative distribution function (ECDF) of customer complaints. The ECDF was plotted for the five assays considered. The horizontal gray bars represent the first, second, and third quartiles. Each assay is color-coded as shown (inset).
Figure 3Distribution of prediction error rates for the binned quality check (QC)–only data. Error rates are shown as derived from the cross-validation analyses, where the data were split 2500 times (see Methods). Results are shown for both classifiers, Classification and Regression Trees (CART; broken lines) and adaptive boosting (solid lines), over the five assays considered for the 90-day data with all features (a) or with TimeToComplain removed (b) and likewise for the 45-day data with (c) or not (d) all features. Each assay is color-coded as shown.
Figure 4Feature importance under adaptive boosting for the binned quality control (QC)–only data. Importance of the features are shown as radar charts, over the five assays considered. Each assay is color-coded as shown. Top panels are for the whole 90-day datasets, whereas the bottom panels are for the 45-day datasets. Left panels include all feature; right panels exclude TimeToComplain from the models.