| Literature DB >> 31218780 |
Brian Hazlehurst1, Carla A Green1, Nancy A Perrin1, John Brandes1, David S Carrell2, Andrew Baer3, Angela DeVeaugh-Geiss4, Paul M Coplan4,5.
Abstract
PURPOSE: To enhance automated methods for accurately identifying opioid-related overdoses and classifying types of overdose using electronic health record (EHR) databases.Entities:
Keywords: electronic health records; methods; natural language processing; opioid overdose; pharmacoepidemiology
Mesh:
Substances:
Year: 2019 PMID: 31218780 PMCID: PMC6772185 DOI: 10.1002/pds.4810
Source DB: PubMed Journal: Pharmacoepidemiol Drug Saf ISSN: 1053-8569 Impact factor: 2.890
Sampling description for the three datasets
| Sampling Goal | Events Identified by: | KPNW | KPW | |||
|---|---|---|---|---|---|---|
| Base Population | Development Dataset | Validation Dataset | Base Population | Portability Dataset | ||
| Suspected overdose cases | Opioid overdose codes | 2271 | 483 | 848 | 750 | 188 |
| Adverse effects codes | 254 | 78 | 102 | 0 | 0 | |
| At risk for overdose | Pain, mental health, substance abuse codes | 87 550 | 222 with ≥30 d supply of ER/LA opioids | 373 with ≥30 d supply of ER/LA opioids | 33 438 | 247 |
| 223 with ≤30 d supply of ER/LA opioids | 372 with ≤30 d supply of ER/LA opioids | 0 | ||||
| Total | 1006 | 1695 | 34 188 | 435 | ||
Abbreviations: KPNW, Kaiser Permanente Northwest; KPW, Kaiser Permanente Washington.
Figure 1The overdose rule tree. A “rule tree” specifies a complex set of constraints that must be matched in the encounter data (text data within a single note, in this case) to generate a positive classification using the MediClass system. A rule tree is rooted by a single rule. A rule tree can be used to define a class alone or in combination with other rule trees.Each node in the tree is either a single rule (marked with a version number and shown in bold font) or it is one or more unified medical language system (UMLS) concepts (shown as plain font labels with no version numbers).Terms (not shown here) are child nodes of concepts, which help define how a concept is matched, using linguistic manipulations, against a sequence of tokens found in the text data. Terms are provided by the UMLS, as well as by custom additions found through trial and error in the development process, and constitute a lexicon of clinical expressions grouped by the concepts that they represent. Also not shown are proximity and ordering constraints, which govern relationships between concepts that are grouped by a rule. For example, a proximity constraint enforces a maximum allowable distance between any tokens (linguistic primitives of the text note) that participate in the identification of concepts within a rule.Every rule has one or more child nodes—each child node is connected to its parent by either the Boolean AND relation (shown with a solid line) or the Boolean OR relation (shown with a dashed line). For the parent node to match the data, all of the AND children and at least one of the OR children must match the data.Rules can take the following modifier: ! = Boolean NOT (ie, the reported match status of the rule is inverted from what is determined by normal match criteria for the rule). Concepts can take the following modifiers, which define constraints on how terms are matched in the text data: [−] = “negated form only” (ie, only terms of the concept asserted as negative in the text will create a match) [+] = “positive form only” (ie, only terms of the concept not asserted as negative in the text will create a match)
Classifications and their alignment with gold standard chart review
| NLP Only Classification | Chart Review Gold Standard Comparator | |
|---|---|---|
| Event type, irrespective of substance involved | Intentional overdose | Intentional overdose = clearly or possible |
| Unintentional overdose (excludes intentional overdose) | Unintentional overdose | |
| Overdose of any type (combines intentional and unintentional overdose) | Unintentional overdose or intentional overdose = clearly or possible | |
| Adverse drug reaction—ADR (excludes any overdose) | Adverse drug reaction | |
| Substance involved in overdose or ADR |
Heroin | Heroin involved = yes or possible |
|
Opioid only (excludes heroin) | A single opioid event (excludes heroin) | |
|
Polysubstance including opioid (excludes heroin) | A polydrug, opioid event (excludes heroin) | |
| Any opioid (excludes heroin, includes polysubstance) | A single or polydrug opioid event (excludes heroin) | |
| Substance abuse involved in opioid‐related overdose |
Prescription medication abuse (whether prescribed or not) |
Opioid or non‐opioid prescription med abuse = yes, |
|
Substance abuse (including alcohol abuse or presence) |
Alcohol present = yes | |
|
Illicit drug abuse |
Abuse of non‐prescribed substances = yes | |
|
Any substance abuse |
Opioid or non‐opioid prescription med or non‐prescribed substance abuse or alcohol present = yes | |
| Patient error in opioid‐related overdose or ADR |
Patient error (excludes all abuse as defined above) |
Opioid or non‐opioid medication‐ taking error = yes |
Abbreviation: NLP, natural language processing.
Evaluation of NLP application using development dataset
| Development Dataset (KPNW) N = 1006 | ||||||
|---|---|---|---|---|---|---|
| NLP‐only Classification | n | Performance Compared with Gold Standard Chart Review n | ||||
| Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | NPV (95% CI) | |||
| Event type, irrespective of substance involved | Intentional overdose | 74 | .81 (.70‐.89) | .98 (.96‐.99) | .83 (.72‐.91) | .97 (.96‐.99) |
| Unintentional overdose | 158 | .71 (.63‐.78) | .94 (.91‐.96) | .79 (.71‐.85) | .91 (.87‐.93) | |
| Overdose of any type | 232 | .80 (.74‐.85) | .93 (.90‐.95) | .87 (.81‐.91) | .89 (.85‐.92) | |
| Adverse drug reaction—ADR | 79 | .24 (.15‐.35) | .93 (.90‐.95) | .32 (.21‐.45) | .89 (.87‐.92) | |
| Substance involved in overdose or ADR | Heroin | 19 | .84 (.60‐.96) | 1.0 (.99‐1.0) | .80 (.56‐.93) | 1.0 (.99‐1.0) |
| Opioid only (excludes heroin) | 98 | .37 (.27‐.47) | .94 (.92‐.95) | .39 (.29‐.50) | .93 (.91‐.95) | |
| Polysubstance including opioid (excludes heroin) | 175 | .57 (.49‐.64) | .96 (.94‐.97) | .75 (.67‐.82) | .91 (.89‐.93) | |
| Any opioid (excludes heroin, includes polysubstance) | 273 | .72 (.66‐.77) | .96 (.94‐.97) | .88 (.83‐.92) | .90 (.87‐.92) | |
| Substance abuse involved in opioid‐related overdose | Prescription med abuse (whether prescribed or not) | 56 | .38 (.25‐.51) | .96 (.94‐.98) | .50 (.34‐.66) | .94 (.92‐.96) |
| Substance abuse (including alcohol abuse or presence) | 45 | .62 (.47‐.76) | .97 (.95‐.98) | .58 (.43‐.72) | .97 (.95‐.98) | |
| Illicit drug abuse | 84 | .46 (.36‐.58) | .97 (.97‐.98) | .68 (.55‐.80) | .92 (.90‐.94) | |
| Any substance abuse | 129 | .67 (.59‐.75) | .96 (.94‐.97) | .81 (.72‐.87) | .92 (.89‐.94) | |
| Patient error in opioid‐related overdose or ADR | Patient error | 21 | .33 (.15‐.57) | .98 (.96‐.99) | .33 (.15‐.57) | .98 (.96‐.99) |
Abbreviations: KPNW, Kaiser Permanente Northwest; NLP, natural language processing; NPV, negative predictive value; PPV, positive predictive value.
n is the count as determined by gold standard chart review, includes only events for which EHR source data were available.
Evaluation of NLP application using validation dataset
| Validation dataset (KPNW), n = 1696 | ||||||
|---|---|---|---|---|---|---|
| NLP‐only Classification | n | Performance Compared with Gold Standard Chart Review, n | ||||
| Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | NPV (95% CI) | |||
| Event type, irrespective of substance involved | Intentional overdose | 122 | .74 (.65‐.81) | .97 (.95‐.98) | .82 (.73‐.88) |
|
| Unintentional overdose | 212 | .66 (.59‐.72) |
| .73 (.66‐.79) |
| |
| Overdose of any type | 334 | .78 (.73‐.82) | .89 (.85‐.92) | .86 (.82‐.90) |
| |
| Adverse drug reaction—ADR | 93 | .31 (.22‐.42) | .94 (.92‐.96) | .44 (.32‐.57) | .90 (.87‐.92) | |
| Substance involved in overdose or ADR | Heroin | 54 | .70 (.56‐.82) | .99 (.98‐1.0) | .86 (.72‐.92) |
|
| Opioid only (excludes heroin) | 166 | .27 (.21‐.35) | .88 (.85‐.90) | .40 (.31‐.50) |
| |
| Polysubstance including opioid (excludes heroin) | 199 | .62 (.55‐.69) |
|
| .85 (.82‐.88) | |
| Any opioid (excludes heroin, includes polysubstance) | 365 | .75 (.70‐.79) |
| .86 (.81‐.89) | .76 (.72‐.80) | |
| Substance abuse involved in opioid‐related overdose | Prescription med abuse (whether prescribed or not) | 87 | .37 (.27‐.48) |
| .43 (.31‐.55) | .91 (.88‐.93) |
| Substance abuse (including alcohol abuse or presence) | 42 | .67 (.50‐.80) |
|
| .98 (.96‐.98) | |
| Illicit drug abuse | 107 | .49 (.39‐.58) |
| .54 (.44‐.64) | .91 (.88‐.93) | |
| Any substance abuse | 187 | .72 (.65‐.78) |
| .71 (.64‐.77) | .90 (.87‐.92) | |
| Patient error in opioid‐related overdose or ADR | Patient error | 33 | .30 (.16‐.49) | .97 (.96‐.98) | .37 (.20‐.58) | .97 (.95‐.98) |
Note. Bold values indicate significantly different measures (P < .05, chi‐square test) from results using development dataset (see Table 3).
Abbreviations: KPNW, Kaiser Permanente Northwest; NLP, natural language processing; NPV, negative predictive value; PPV, positive predictive value.
n is the count as determined by gold standard chart review, includes only events for which EHR source data were available.
Evaluation of NLP application using portability dataset
| Portability dataset (KPW), n = 435 | ||||||
|---|---|---|---|---|---|---|
| NLP‐only Classification | Performance Compared with Gold Standard Chart Review n | |||||
| n | Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | NPV (95% CI) | ||
| Event type, irrespective of substance involved | Intentional overdose | 31 | .65 (.45‐.80) | .99 (.96‐1.0) | .83 (.62‐.95) | .96 (.93‐.98) |
| Unintentional overdose | 43 |
| .96 (.92‐.98) | .68 (.49‐.82) | .93 (.89‐.95) | |
| Overdose of any type | 74 |
| .97 (.93‐.98) | .86 (.74‐.93) | .90 (.86‐.94) | |
| Adverse drug reaction—ADR | 5 |
|
|
|
| |
| Substance involved in overdose or ADR | Heroin | 5 |
| .99 (.97‐1.0) |
| .99 (.96‐1.0) |
| Opioid only (excludes heroin) | 26 | .38 (.21‐.59) |
| .37 (.20‐.58) |
| |
| Polysubstance including opioid (excludes heroin) | 42 | .48 (.32‐.63) |
| .71 (.51‐.86) |
| |
| Any opioid (excludes heroin, includes polysubstance) | 68 | .69 (.57‐.79) |
| .85 (.88‐.95) |
| |
| Substance abuse involved in opioid‐related overdose | Prescription med abuse (whether prescribed or not) | 18 | .28 (.11‐.54) | .98 (.95‐.99) | .45 (.18‐.75) | .96 (.92‐.98) |
| Substance abuse (including alcohol abuse or presence) | 18 | .44 (.22‐.69) | .98 (.95‐.99) | .53 (.27‐.78) | .97 (.94‐.98) | |
| Illicit drug abuse | 25 | .28 (.13‐.50) | .99 (.97‐1.0) | .70 (.35‐.92) | .94 (.90‐.96) | |
| Any substance abuse | 44 | .52 (.37‐.67) | .98 (.95‐.99) | .82 (.62‐.93) | .92 (.88‐.95) | |
| Patient error in opioid‐related overdose or ADR | Patient error | 5 | 0.0 (0.0‐.54) | .99 (.97‐1.0) | 0.0 (0.0‐.80) | .98 (.96‐.99) |
Note. Bold values indicate measures that are significantly different (P < .05, chi‐square test) from results using the development dataset (see Table 3).
Abbreviations: KPNW, Kaiser Permanente Northwest; NLP, natural language processing; NPV, negative predictive value; PPV, positive predictive value.
n is the count as determined by gold standard chart review, includes only events for which EHR source data were available.