Literature DB >> 31020755

Identifying and classifying opioid-related overdoses: A validation study.

Carla A Green¹, Nancy A Perrin^1,2, Brian Hazlehurst¹, Shannon L Janoff¹, Angela DeVeaugh-Geiss³, David S Carrell⁴, Carlos G Grijalva⁵, Caihua Liang^6,7, Cheryl L Enger^6,7, Paul M Coplan^8,9.

Abstract

PURPOSE: The study aims to develop and validate algorithms to identify and classify opioid overdoses using claims and other coded data, and clinical text extracted from electronic health records using natural language processing (NLP).
METHODS: Primary data were derived from Kaiser Permanente Northwest (2008-2014), an integrated health care system (~n > 475 000 unique individuals per year). Data included International Classification of Diseases, Ninth Revision (ICD-9) codes for nonfatal diagnoses, International Classification of Diseases, Tenth Revision (ICD-10) codes for fatal events, clinical notes, and prescription medication records. We assessed sensitivity, specificity, positive predictive value, and negative predictive value for algorithms relative to medical chart review and conducted assessments of algorithm portability in Kaiser Permanente Washington, Tennessee State Medicaid, and Optum.
RESULTS: Code-based algorithm performance was excellent for opioid-related overdoses (sensitivity = 97.2%, specificity = 84.6%) and classification of heroin-involved overdoses (sensitivity = 91.8%, specificity = 99.0%). Performance was acceptable for code-based suicide/suicide attempt classifications (sensitivity = 70.7%, specificity = 90.5%); sensitivity improved with NLP (sensitivity = 78.7%, specificity = 91.0%). Performance was acceptable for the code-based substance abuse-involved classification (sensitivity = 75.3%, specificity = 79.5%); sensitivity improved with the NLP-enhanced algorithm (sensitivity = 80.5%, specificity = 76.3%). The opioid-related overdose algorithm performed well across portability assessment sites, with sensitivity greater than 96% and specificity greater than 84%. Cross-site sensitivity for heroin-involved overdose was greater than 87%, specificity greater than or equal to 99%.
CONCLUSIONS: Code-based algorithms developed to detect opioid-related overdoses and classify them according to heroin involvement perform well. Algorithms for classifying suicides/attempts and abuse-related opioid overdoses perform adequately for use for research, particularly given the complexity of classifying such overdoses. The NLP-enhanced algorithms for suicides/suicide attempts and abuse-related overdoses perform significantly better than code-based algorithms and are appropriate for use in settings that have data and capacity to use NLP.

Entities: Chemical Disease Gene Species

Keywords: abuse; algorithms; heroin; methods; opioid overdose; pharmacoepidemiology; suicide

Mesh：

Substances：
Analgesics, Opioid
Heroin

Year: 2019 PMID： 31020755 PMCID： PMC6767606 DOI： 10.1002/pds.4772

Source DB: PubMed Journal: Pharmacoepidemiol Drug Saf ISSN： 1053-8569 Impact factor: 2.890

OODs can be identified using coded insurance claims data or electronic health records. Heroin‐involved OODs can be accurately identified. OODs that are suicides/suicide attempts can be identified with adequate accuracy using coded data. OODs involving substance abuse can be identified with adequate accuracy using coded data. Algorithms for classifying suicides/suicide attempts and substance abuse‐involved overdoses can be significantly improved using data derived from natural language processing of clinical text in electronic health records.

INTRODUCTION

Opioid use disorders and fatal and nonfatal opioid‐related overdoses (OODs) are significant public health problems.1, 2, 3, 4, 5, 6, 7, 8 Initiatives to reduce prescription opioid‐related risks include clinical guidelines,9 restricted access to extended‐release/long‐acting (ER/LA) opioids,10, 11, 12 added abuse‐deterrent properties,13, 14, 15 FDA's Risk Evaluation and Mitigation Strategies (REMS),16, 17, 18, 19 opioid management plans,10, 20, 21 and state prescription drug monitoring programs.22, 23, 24 Accurate identification of OODs is essential to quantify the burden of the problem, evaluate risk‐reduction strategies, monitor population‐level outcomes, and improve prevention and quality of care. Furthermore, differentiation of type, such as suicides or heroin‐related overdoses, is needed to target and evaluate specific interventions. To date, few studies have validated methods used to identify OODs.25, 26, 27, 28 The purpose of this study was to improve upon and conduct a full validation of a previously developed algorithm to identify overdose events26 and to develop and validate algorithms that classify types of overdoses (eg, heroin‐related and suicides). The original study was limited in scope, excluding events that occurred within 3 days of surgery, and only assessed positive predictive value (PPV). The present study attempted to improve algorithm performance using (a) additional coded data, including a sample of individuals without identified OODs; (b) data extracted with natural language processing (NLP) of EHR clinical notes; (c) tests in three additional health care systems; and (d) additional performance measures.

METHODS

Study populations, data sources, and sampling procedures

Kaiser Permanente Northwest (KPNW) was the primary site for algorithm development and validation; additional sites allowed performance assessment in other data systems. KPNW is an integrated health plan providing comprehensive inpatient and outpatient medical care, including addiction and mental health treatment to members in Oregon and southwest Washington State. KPNW served about 500 000 members at study end and is demographically representative of its service area. The study population included all members with any eligibility from January 1, 2008 to December 31, 2014. Data sources included administrative, clinical, inpatient and outpatient records, claims, and clinical care information received from non‐KPNW settings.

Samples

Mutually exclusive samples were constructed for development and validation. Suspected OOD cases composed approximately 56% of each sample, defined by a patient and a point in time (based on ICD9‐CM/ICD10‐CM codes), using the previously developed OOD algorithm (the “OOD algorithm”; Table 1) and opioid‐related adverse effects codes. The remainder of each sample (approximately 44%) consisted of “at‐risk” individuals likely to have an OOD but for whom no suspected OODs were identified. Individuals at risk were identified using diagnoses commonly comorbid with opioid/other substance use disorders. Diagnoses were chosen on the basis of prior research and existing literature and included the following: (a) substance abuse diagnoses, (b) mental health diagnoses, and (c) diagnoses associated with substance abuse (see Data S1). Individuals with at least two diagnoses from two categories, without suspected OODs, were considered at risk. We drew stratified random samples of at‐risk individuals, half with 0–29 days' supply of ER/LA opioids in the prior year, and half with greater than or equal to 30 days' supply. At‐risk periods were converted to events/nonevents for analyses.

Table 1

ICD‐9 and ICD‐10 codes used in the initial algorithm to detect opioid overdosesa

	ICD‐9 Code	ICD‐10 Code
Poisoning by opium (alkaloids) unspecified	965.00
Poisoning by heroin	965.01
Poisoning by methadone	965.02
Poisoning by other opiates and related narcotics	965.09
Accidental poisoning by heroin	E850.0
Accidental poisoning by methadone	E850.1
Accidental poisoning by other opiates and related narcotics	E850.2
COD: poisoning by opiates and related narcoticsa	9650
COD: poisoning by opium		T40.0
COD: poisoning by heroin		T40.1
COD: poisoning by other opioids		T40.2
COD: poisoning by methadone		T40.3
COD: poisoning by other synthetic narcotic		T40.4
COD: accidental poisoning by and exposure to narcotics and psychodysleptics, not elsewhere classified		X42
COD: intentional self‐poisoning by and exposure to narcotics and psychodysleptics, not elsewhere classified		X62
COD: undetermined poisoning by and exposure to narcotics and psychodysleptics, not elsewhere classified		Y12

Note. COD: cause of death; ICD‐9/10: International Classification of Diseases, Ninth/Tenth Revision.

Presence of any of these codes was considered indicative of the presence of an OOD.

Though the study period was prior to the nationwide switch to ICD‐10 diagnostic codes, ICD‐10 cause‐of‐death codes were in use nationwide throughout the study period.

ICD‐9 and ICD‐10 codes used in the initial algorithm to detect opioid overdosesa Note. COD: cause of death; ICD‐9/10: International Classification of Diseases, Ninth/Tenth Revision. Presence of any of these codes was considered indicative of the presence of an OOD. Though the study period was prior to the nationwide switch to ICD‐10 diagnostic codes, ICD‐10 cause‐of‐death codes were in use nationwide throughout the study period.

Development sample

We created the development sample first, using OOD cases previously identified, to maximize information available for development. Cases included events identified using opioid‐related International Classification of Diseases, Ninth/Tenth Revisions (ICD‐9/10) poisoning codes and opioid‐related adverse effects codes. We included the latter to provide information that would help differentiate OODs from adverse effects. Together, these codes together formed the “suspected overdose” stratum in the sample.

Validation sample

We randomly selected algorithm‐identified OOD events and at‐risk individuals not used in development, proportionate to the number of suspected overdoses (approximately 56%) and at‐risk cases (approximately 44%) in that sample. This “balancing” allowed performance comparisons between development and validation samples. Once development was complete, performance was assessed in the validation sample.

Datasets

The development sample included 1006 events (suspected OODs and at‐risk individuals) from 977 unique people, of which 872 (845 unique people) had EHR information available for chart audit. These 872 events comprised the development dataset. The validation sample included 1696 suspected OOD events and 1136 of which had EHR data available for audit (1100 unique people). These 1136 events formed the validation dataset. Table 2 shows counts of events by type in each sample, the base populations, and sample sizes.

Table 2

Development and validation sample descriptions

Sampling Goal	Events Identified by:	Base Population	Development Sample	Validation Sample
Suspected overdose cases	Opioid poisoning diagnostic codes	2,271	483	848
Suspected overdose cases	Opioid adverse effects diagnostic codes	254	78	103
At risk for overdose	Pain, mental health, and substance abuse diagnostic codes	87,550	222 with ≥30 days supply of ER/LA opioids	373 with ≥30 days supply of ER/LA opioids
At risk for overdose	Pain, mental health, and substance abuse diagnostic codes	87,550	223 with ≤30 days supply of ER/LA opioids	372 with ≤30 days supply of ER/LA opioids
Total			1006	1696

Note. ER/LA: extended‐release/long‐acting.

Development and validation sample descriptions Note. ER/LA: extended‐release/long‐acting.

Portability sites

Kaiser Permanente Washington (KPWA) provides coverage and care to approximately 760 000 individuals in Puget Sound and Spokane. KPWA's researchers have access to EHR and claims data from integrated health plan members. Limiting factors for some analyses were lack of chart auditor access to chemical dependency records and hospital data that were restricted to insurance claims. Optum. Data were derived from Optum's Integrated Database, which included 12.3 million individuals and links the following data: ambulatory, inpatient, medical claims, prescription, and practice management. Chronologic profiles were created from claims data, EHR structured data, and EHR notes extracted using NLP. Because profile data had limitations, we relaxed criteria for ambiguous cases, allowing “possible” responses for outcomes that were coded yes/no at other sites. Tennessee Medicaid (TennCare) is a managed Medicaid program, covering about 1.2 million Medicaid‐eligible, state residents. TennCare maintains an enrollee registry, records of patient‐provider encounters, pharmacy benefits and usage. Chart audits were restricted to inpatient hospital encounter records to facilitate record access; lack of access to outpatient records, particularly mental health follow‐up after suicides/suicide attempts and outpatient chemical dependency treatment, made these classifications difficult for auditors.

Chart audit process

Chart audit data were considered the “gold standard” against which algorithm results were compared. Index dates were dates of suspected OODs or, for those at risk, dates of the second qualifying diagnosis. We examined clinical records of at‐risk patients for overdoses for a 2‐year period centered upon the index date (greater than one OOD allowed per person). If no overdoses were found, event dates for at‐risk individuals became dates of qualifying diagnoses; individuals were coded as having no overdose on that date. At KPNW, professional chart auditors determined whether suspected OODs were actual OODs, and classified confirmed OODs according to whether or not they involved: (a) substance abuse; (b) misuse of prescribed medications (ie, therapeutic use not consistent with directions); (c) heroin; (d) inpatient pain management or anesthesia; (e) patient errors; (f) clinician prescribing errors; (g) alcohol; (h) other substances; and (i) suicide/attempted suicide. We completed 100% duplicate review and resolved discrepancies with review by clinicians and team discussion. Portability sites completed audits using similar procedures, though some adaptations were needed to account for site and data differences. Detailed chart audit procedures, including definitions, are available in Data S2.

Algorithm performance standards

Algorithm performance was evaluated using sensitivity, specificity, PPV, NPV, and F‐scores. No universal standards exist for sensitivity or specificity because acceptability depends on context and use. We expected OODs and heroin involvement to be readily identifiable, thus set sensitivity and specificity of 85% as acceptable and 90% as excellent for these algorithms. We expected other classifications (abuse, suicides/attempts, misuse, polysubstance involvement, and medication error) to be more difficult, thus set 75% as acceptable for sensitivity/specificity. F‐scores measure the accuracy of the algorithm and vary from 0 to 1, with 1 representing perfect fit. We considered F‐score values of 0.90 or greater to indicate excellent performance. Chi‐square tests assessed differences between performance in development and validation datasets.

Code‐based algorithm development procedures

We used the least absolute shrinkage and selection operator (LASSO) regression29, 30 to select variables based on predicted probabilities of case status, and classification and regression trees (CART) with random forest (500 trees)31 to evaluate cutoff values for continuous variables. Variables selected using LASSO were entered into logistic regression analyses to estimate final parameters. Predicted probabilities from logistic regression analyses were used to classify events. Parameters were then applied to the validation dataset and compared with chart audit findings.

OOD algorithm

We began by forcing the ICD‐9/10 codes from our prior algorithm26 into the model (Table 1), then tested the following additional variables: prescribed opioids in the 9 months prior to index date (immediate release [IR], ER/LA, or both); substance abuse diagnoses; opioid withdrawal diagnoses; weight change; and hospitalizations. None improved performance. The best fitting model used a binary variable coded “1” if any ICD‐9/10 codes listed in Table 1 were found.

OODs with heroin involvement

Among chart‐audit‐confirmed OODs in the development sample (n = 423), codes shown in Table 3 were tested to predict heroin involvement. The best fitting model performed well, and used a binary variable coded “1” if E965.01, E850.0, or T40.1 were present.

Table 3

ICD‐9 and ICD‐10 codes tested in the model to predict if an opioid overdose was related to heroin

E950.0	Suicide and self‐inflicted poisoning by solid or liquid substances
E965.01	Poisoning by heroin
E850.0	Accidental poisoning by heroin
T40.1	COD: poisoning by heroin

Note. COD: cause of death; ICD‐9/10: International Classification of Diseases, Ninth/Tenth Revision.

ICD‐9 and ICD‐10 codes tested in the model to predict if an opioid overdose was related to heroin Note. COD: cause of death; ICD‐9/10: International Classification of Diseases, Ninth/Tenth Revision.

Suicides/suicide attempt‐related OODs

We modeled overdoses in the development sample (n = 423) according to whether or not they were suicides/attempted suicides (see Tables 4 and 5 for codes evaluated). The initial model assessed predictors identified in the Mental Health Research Network's work predicting suicides/suicide attempts.32 We created a binary variable if any suicide‐related codes in Table 4 were present, binary variables for single or multiple episode depression diagnoses, and interaction terms for suicide and recurrent depression. Next, we tested diagnoses for alcohol use disorders, anxiety disorders, bipolar disorders, tobacco and drug use disorders, other psychoses, and schizophrenia spectrum disorders (Table 5). The best fitting model included main effects for suicide codes, single episode depression, recurrent depression, alcohol use disorder (AUD), nonalcohol substance use disorder (SUD), and three interaction terms for suicide with multiple episode depression, AUD and SUD.

Table 4

ICD‐9 diagnostic codes evaluated to predict whether or not an opioid overdose was intentional

Suicide/Suicide Attempt
E950.0	Suicide and self‐inflicted poisoning by solid or liquid substances
E950.1	Barbiturates
E950.2	Sedatives and hypnotics
E950.3	Tranquilizers and other psychotropic agents
E950.4	Other specified drugs and medicinal substances
E950.5	Unspecified drug or medicinal substance
E950.9	Other and unspecified solid and liquid substances
E956	Suicide and self‐inflicted injury by cutting and piercing instrument
E958.8	Suicide and self‐inflected injury by other specified means
E958.9	Suicide and self‐inflicted injury by unspecified means
V62.84	Suicidal ideation

Note. ICD‐9: International Classification of Diseases, Ninth Revision.

Table 5

Diagnoses tested for predicting suicides/suicide attempts

Diagnoses	ICD‐9 Codes
Alcohol use disorders	291.x, 303.x, 305.0
Anxiety disorders	300.0, 300.2, 300.3, 309.20, 309.21, 309.24, 309.81
Bipolar disorders	296.0, 296.1, 296.4–296.7, 296.80, 296.81, 296.89
Drug use disorders	292.x, 304.x, 305.2–305.9
Other psychoses	297.1, 297.3, 298.8, 298.9, 301.22
Schizophrenia spectrum disorders	295.x
Tobacco use disorder	305.1, 649.0, 989.84, V15.82

Note. ICD‐9: International Classification of Diseases, Ninth Revision.

ICD‐9 diagnostic codes evaluated to predict whether or not an opioid overdose was intentional Note. ICD‐9: International Classification of Diseases, Ninth Revision. Diagnoses tested for predicting suicides/suicide attempts Note. ICD‐9: International Classification of Diseases, Ninth Revision.

Substance use involved unintentional OODs

For unintentional overdoses (n = 268), we developed an algorithm to detect substance abuse involvement, using logistic regression to evaluate diagnostic codes listed in Table 6. The final model included two terms: (a) a binary indicator of substance abuse coded “1” if any ICD‐9 code related to heroin was present, or if there were no dispenses for an ER/LA or IR opioid in the year prior to the overdose (including the event date), and (b) indication of opioid abuse from ICD‐9 codes in the 2 years prior to the event.

Table 6

Diagnoses and other variables tested for identifying substance abuse involvement in opioid‐related overdoses

Diagnoses and Other Variables	ICD‐9 and ICD‐10 Codes
Poisoning by heroin	965.01
Accidental poisoning by heroin	E850.0
Poisoning by heroin	T40.1(COD)
Poisoning by opium (alkaloids), unspecified	965.00
Poisoning by methadone	965.02
Poisoning by other opiates/narcotics	965.09
Opioid dependence	304.00–304.03, 304.70–304.72
Opioid abuse	305.50–305.53
Drug use disorders	292.x, 304.x, 305.2–305.9
Alcohol use disorders	291.x, 303.x, 305.0
Tobacco use disorders	305.1, 649.0, 989.84, V15.82
Count of unique opioid prescribers in the 2 years prior to event	n/a
Count of early ER/LA opioid dispenses in the 2 years prior to event (early refill defined as two consecutive fills of an ER/LA where the number of days between prescriptions was less than or equal to 85% of the days' supply in the first prescription)	n/a
Count of early IR opioid dispenses in 2 years prior to the event	n/a
Medicaid/Medicare dual eligibility	n/a

Note. COD: cause of death; ER/LA: extended‐release/long‐acting; ICD‐9/10: International Classification of Diseases, Ninth/Tenth Revision.

Diagnoses and other variables tested for identifying substance abuse involvement in opioid‐related overdoses Note. COD: cause of death; ER/LA: extended‐release/long‐acting; ICD‐9/10: International Classification of Diseases, Ninth/Tenth Revision.

NLP‐enhanced algorithm development

We attempted to enhance code‐based algorithm performance with indicators extracted using NLP of EHR clinical notes. The companion paper (Hazlehurst et al33) provides details about development and validation of NLP‐derived variables. Each NLP‐derived variable was tested using logistic regression analyses to determine whether or not its addition improved performance beyond that of each respective code‐based algorithm. We used DeLong's test for two correlated receiver operator curves to compare areas under the curve for code‐based and NLP‐enhanced models. Table 7 shows definitions of NLP‐derived binary variables to code‐based algorithms for testing. The NLP classification “polysubstance including opioid” was broken into six binary subcomponents: named opioids; general “narcotics”; named opioid‐interacting medications; named recreational/illicit drugs; alcohol; named over‐the‐counter medications. The NLP classification “substance abuse” was broken into two subcomponents: alcohol or substance abuse noted; alcohol presence mentioned.

Table 7

Natural language processing (NLP) classifications and their alignment with gold standard chart review

	NLP‐only Classification	Chart Review Gold Standard Comparator
Event type, irrespective of substance involved	Intentional overdose (suicide/suicide attempt)	Intentional overdose = clearly or possible
	Unintentional overdose (excludes intentional overdose)	Unintentional overdose
	Overdose of any type (combines intentional and unintentional overdose)	Unintentional overdose or intentional overdose = clearly or possible
	Adverse drug event (excludes any overdose)	Adverse drug reaction
Substance involved in overdose or adverse drug event	Heroin	Heroin involved = yes or possible
	NLP identifies an overdose or adverse drug event in combination with heroin, regardless of other opioid or nonopioid prescription or over‐the‐counter medication	Heroin involved = yes or possible
	Opioid only (excludes heroin)	A single opioid event (excludes heroin)
	NLP identifies an overdose or adverse drug event in combination with a named opioid (or generic “narcotic”) in the absence of heroin and additional nonopioid prescription or over‐the‐counter medications.	A single opioid event (excludes heroin)
	Polysubstance including opioid (excludes heroin)	A polydrug, opioid event (excludes heroin)
	NLP identifies an overdose or adverse drug event in combination with a named opioid (or generic “narcotic”) AND additional nonopioid prescription or over‐the‐counter medications, but in the absence of heroin.	A polydrug, opioid event (excludes heroin)
	Any opioid (excludes heroin, include polysubstance)	A single or polydrug opioid event (excludes heroin)
Substance abuse involved in opioid‐related overdose	Prescription medication abuse (whether prescribed or not)	Opioid or nonopioid prescription med abuse = yes, AND NOT heroin = yes or possible in an opioid overdose event
	NLP identifies an opioid overdose in combination with abuse of medications (ie, nontherapeutic goals/actions noted about prescription medications), or documented conclusion of abuse by clinician.
	Substance abuse (including alcohol abuse or presence)	Alcohol present = yes in an opioid overdose event
	NLP identifies an opioid overdose in combination with notations of alcohol abuse or just alcohol present	Alcohol present = yes in an opioid overdose event
	Illicit drug abuse NLP identifies an opioid overdose in combination with heroin or other named recreational drugs (marijuana, cocaine, and methamphetamine).	Abuse of nonprescribed substances = yes in an opioid overdose event
	Any substance abuse	Opioid or nonopioid prescription med or nonprescribed substance abuse or alcohol present = yes in an opioid overdose event
	Any of the above types of abuse.
Patient error in opioid‐related overdose or adverse drug event	Patient error (EXCLUDES ALL ABUSE above)	Opioid or nonopioid medication‐taking error = yes AND NOT ABUSE AS DEFINED ABOVE
	NLP identifies an opioid overdose or adverse drug event in combination with mention of mistake/accident in taking medications

Natural language processing (NLP) classifications and their alignment with gold standard chart review

Portability assessment

Portability sites contributed data from commercial insurance, Medicaid and integrated health care settings. The OOD algorithm was applied to each site's population from 2008 to 2014 (2008‐2013 for TennCare). Classification algorithms were tested at sites with adequate data. At each site, a random sample of approximately 250 suspected OODs was selected for inclusion. The remaining approximately 250 individuals per sample were selected using the same criteria used at KPNW to identify at‐risk individuals. At KPWA, 435/500 cases had EHR information available (159 confirmed OODs); 500 cases were audited at Optum (258 confirmed OODs), and 516 at TennCare (240 OODs). Classification algorithms demonstrating acceptable performance at KPNW were implemented at portability sites, comparing results with chart audit. Lack of adequate numbers of cases prevented testing some classification algorithms: heroin involvement at TennCare; substance abuse involvement at KPWA (code‐based, NLP‐enhanced). The NLP‐enhanced algorithm for suicides/suicide attempts was only tested at KPWA as other sites did not have NLP capability. Mean age and sex in development and validation datasets were similar (48.8 and 46.9 years; 39.5% and 39.6% male, respectively). Confirmed OOD prevalence was 48.5% in the development dataset (n = 423) and 53.3% (n = 605) in the validation dataset. The validation dataset had a slightly higher prevalence of heroin‐involved OODs than the development dataset (16.2% vs. 11.8%) and similar prevalence of intentional overdoses (36.6% development and 34.2% validation), and substance use involved unintentional overdoses (41.4% and 43.7%, respectively). The study protocol was reviewed and approved by all sites' institutional review boards.

RESULTS

Table 8 summarizes algorithm performance in development and validation datasets for algorithms reaching acceptable performance during development. The OOD algorithm performed well: sensitivity (97.2%), specificity (84.6%), PPV (87.4%), and NPV (96.5%). There were no differences between development and validation datasets on sensitivity, PPV, or NPV, though specificity declined. The heroin‐involved classification also performed well in validation: sensitivity was 91.8%, specificity 99.0%, PPV 94.7%, and NPV 98.4%. Some measures of performance (specificity and PPV) were significantly (P ≤ 0.018) better in the validation compared with the development dataset (Table 8).

Table 8

Cross validation of models with 95% confidence intervals and significance tests comparing the performance in the two samples

	Development Dataset	Validation Dataset	P Value for the Difference
Opioid‐related overdose
Sensitivity (95% CI)	97.9 (96.0‐99.0)	97.2 (95.5‐98.4)	0.493
Specificity (95% CI)	88.9 (85.6‐91.6)	84.6 (81.3‐87.5)	0.028
PPV (95% CI)	89.2 (86.4‐91.5)	87.4 (85.0‐89.3)	0.3421
NPV (95% CI)	97.8 (95.9‐98.8)	96.5 (94.5‐97.8)	0.205
F‐score	0.93	0.92
OODa classified as heroin involved
Sensitivity (95% CI)	94.0 (83.5‐98.8)	91.8 (84.6‐96.4)	0.635
Specificity (95% CI)	96.8 (94.5‐98.3)	99.0 (97.7‐99.7)	0.018
PPV (95% CI)	79.7 (69.1‐87.3)	94.7 (88.3‐97.7)	0.004
NPV (95% CI)	99.2 (97.6‐99.7)	98.4 (97.0‐99.2)	0.330
F‐score	0.86	0.93
OOD classified as suicide/suicide attempt
Sensitivity (95% CI)	77.4 (70.0‐83.7)	70.5 (63.8‐76.7)	0.142
Specificity (95% CI)	88.1 (83.6‐91.7)	90.2 (86.9‐92.9)	0.380
PPV (95% CI)	79.0 (82.8‐84.0)	78.9 (73.3‐83.6)	0.995
NPV (95% CI)	87.1 (83.4‐90.1)	85.5 (82.6‐87.9)	0.551
F‐score	0.78	0.74
OOD classified as abuse involved
Sensitivity (95% CI)	82.0 (73.6‐88.6)	75.3 (68.2‐81.5)	0.184
Specificity (95% CI)	83.4 (76.7‐88.9)	79.5 (73.6‐84.6)	0.329
PPV (95% CI)	77.8 (70.9‐83.4)	74.0 (68.5‐78.9)	0.462
NPV (95% CI)	86.8 (81.4‐90.7)	80.5 (76.0‐84.4)	0.117
F‐score	0.80	0.75
Overdose classified as suicide/suicide attempt using NLP‐enhanced algorithm
Sensitivity (95% CI)	88.4 (82.3‐93.0)	78.7 (72.5‐84.1)	0.016
Specificity (95% CI)	91.8 (87.8‐94.8)	91.0 (87.7‐93.6)	0.707
PPV (95% CI)	86.2 (80.6‐90.3)	81.9 (76.7‐86.2)	0.278
NPV (95% CI)	93.2 (89.8‐95.5)	89.2 (86.3‐91.5)	0.079
F‐score	0.87	0.80
Overdose classified as involving abuse using NLP‐enhanced algorithm
Sensitivity (95% CI)	90.1 (83.0‐95.0)	80.5 (73.8‐86.1)	0.030
Specificity (95% CI)	79.6 (72.5‐85.6)	76.3 (70.2‐81.8)	0.449
PPV (95% CI)	75.8 (69.5‐81.1)	72.5 (67.4‐77.2)	0.517
NPV (95% CI)	91.9 (86.6‐95.2)	83.4 (78.7‐87.3)	0.023
F‐score	0.82	0.76

Note. NLP: natural language processing; OOD: opioid‐related overdose.

Opioid‐related overdose.

Cross validation of models with 95% confidence intervals and significance tests comparing the performance in the two samples Note. NLP: natural language processing; OOD: opioid‐related overdose. Opioid‐related overdose. For the classification identifying suicides/suicide attempts, performance was acceptable for the code‐based model in both datasets and did not differ significantly (P ≥ 0.142) between the two. Sensitivity was 70.5%, specificity 90.2%, PPV 78.9%, and NPV 85.5% for the coded‐based algorithm. Performance of the code‐based classification for substance abuse‐involved OODs was also acceptable: Sensitivity was 75.3%, specificity 79.5%, PPV 74.0%, and NPV 80.5%. Performance did not differ between development and validation datasets (P ≥ 0.117). Models classifying opioid misuse, patient medication errors, and polysubstance involvement did not reach acceptable (greater than or equal to 0.75%) levels of sensitivity and specificity so were not validated. We identified few events involving clinician error and few inpatient events making modeling unfeasible for these classifications. For inpatient overdose/oversedation, we adopted a different identification strategy (see Green et al, companion paper34).

NLP‐enhanced models

The addition of NLP variables did not improve performance of the code‐based OOD algorithm or the heroin algorithm, though NLP‐enhanced models outperformed code‐based models for classifying suicides/suicide attempts (sensitivity = 78.7%, specificity = 91.0%, PPV = 81.9%, and NPV = 89.2%) and those involving substance abuse (sensitivity = 80.5%, specificity = 76.3%, PPV = 72.5%, and NPV = 83.4%). Performance declined for the NLP‐enhanced models in the validation dataset compared with the development dataset (Table 8), but validation results remained above acceptable limits for suicides/suicide attempts and substance abuse‐involved overdoses for sensitivity, specificity, and NPV.

Algorithm portability performance

Table 9 presents performance across portability sites. The OOD algorithm performed well, with sensitivity greater than 96%, specificity greater than 84%, and F‐scores greater than 0.92. Cross‐site sensitivity for sites with adequate data to test the heroin‐involved classification was greater than 87%, specificity greater than 99%, and F‐scores greater than or equal to 0.90. The code‐based algorithm for suicides/suicide attempts performed equally at KPNW and KPW (F‐scores of 0.74), though there was a decline in sensitivity at Optum and TennCare, likely a result of data limitations. The NLP‐enhanced algorithm for suicides/attempts performed better than the code‐based algorithm and also performed better at KPWA than at KPNW (F‐scores of 0.85 and 0.80, respectively). The substance abuse‐involved algorithm performed poorly at Optum and TennCare. Upon further investigation, it appeared that chart audit determinations for substance abuse‐related OODs at these two sites were made difficult by data limitations.

Table 9

Final results of portability assessment of code‐based and NLP‐enhanced algorithms

	KPNW	KPW	Optum	TennCare
OOD
Sensitivity	97.2	100.0	96.9	99.2
Specificity	84.6	89.2	100.0	92.4
PPV	87.4	84.1	100.0	91.9
NPV	96.5	100.0	96.9	99.2
F‐score	0.92	0.92	0.98	0.95
Heroin
Sensitivity	91.8	87.5	98.5	N/A
Specificity	99.0	99.3	100.0	N/A
PPV	94.7	93.3	100.0	N/A
NPV	98.4	98.5	99.5	N/A
F‐score	0.93	0.90	0.99	N/A
Suicide/suicide attempt
Sensitivity	70.5	74.1	63.2	44.9
Specificity	90.2	86.7	91.0	87.1
PPV	78.9	74.1	81.1	64.5
NPV	85.5	86.7	80.1	75.1
F‐score	0.74	0.74	0.67	0.53
Suicide/suicide attempt—NLP enhanced
Sensitivity	78.7	81.5	N/A	N/A
Specificity	91.0	95.2	N/A	N/A
PPV	81.9	89.8	N/A	N/A
NPV	89.2	90.9	N/A	N/A
F‐score	0.80	0.85	N/A	N/A
Abuse involved
Sensitivity	75.3	N/A	67.1	41.5
Specificity	79.5	N/A	31.9	62.5
PPV	74.0	N/A	46.2	29.0
NPV	80.5	N/A	52.6	74.4
F‐score	0.76	N/A	0.55	0.34

Final results of portability assessment of code‐based and NLP‐enhanced algorithms

DISCUSSION

The code‐based OOD algorithm, using ICD‐9 diagnostic codes and death data using ICD‐10 codes, has excellent performance across health systems, whether applied to EHR‐based databases or commercial or Medicaid claims databases. Few inpatient events were identified using the algorithm, suggesting that it does not confound inpatient overdoses that are typically medically related with accidental or intentional overdoses occurring elsewhere. Given the excellent performance of the code‐based OOD algorithm, the NLP‐enhanced algorithm was unable to improve performance. In short, our results show that a simple code‐based algorithm can be used to accurately identify overdoses in widely differing settings. Similarly, the code‐based algorithm classifying overdoses as heroin‐involved showed excellent performance across settings when adequate numbers of heroin‐related events were available for testing. As with the OOD algorithm, however, there was little room from improved performance in an NLP‐enhanced model. Identifying suicides/attempted suicides presented a greater challenge, though the code‐based algorithm performed adequately. Performance was not as good at Optum or TennCare, but PPV and NPV were acceptable in all but TennCare, where reviews were focused on inpatient health care encounters rather than both inpatient and outpatient data. Other sites had access to OOD‐related follow‐up visits that often provided information necessary to determine when an OOD was a suicide/suicide attempt. The NLP‐enhanced algorithm significantly improved performance for detecting suicides/suicide attempts suggesting that including NLP‐derived data from clinical notes is beneficial when available. Performance of the NLP‐enhanced model was good in both sites with the necessary data. The code‐based algorithm for substance abuse involvement showed reasonable, moderate performance in KPNW, particularly given the complexity of identifying substance abuse with coded data alone. With room for improvement and the likelihood that clinicians document suspicions about substance abuse in clinical notes rather than using diagnostic codes, we expected and found that the NLP‐enhanced algorithm performance was better. Unfortunately, chart auditors did not have access to chemical dependency records at KPWA, preventing detection of adequate numbers of abuse‐related overdoses to test the model there. Nevertheless, results in KPNW suggest that using NLP to identify substance abuse‐involved overdoses will be more fruitful than code‐based algorithms alone. As a result of limited cases for some classifications, we were unsuccessful with code‐based or NLP‐enhanced models classifying prescription medication misuse or patient errors. Other limitations included that development was based primarily on data extracted from a single integrated health care system. Portability assessments designed to overcome this limitation provide important confirmation for some outcomes but limitations in sample sizes and data sources at portability sites made some comparisons unfeasible. Also, NLP‐enhanced algorithms rely on a specific NLP system (MediClass) for extracting information from clinical notes. Although the “knowledge” used by MediClass is easily extracted for use by other NLP systems, results may not be identical.

CONCLUSIONS

To our knowledge, this work is the most comprehensive validation of algorithms developed to identify and classify OODs. The code‐based OOD algorithm shows excellent performance across different health care systems using ICD‐9 encounter codes. The same holds true for classifying overdoses involving heroin. Algorithms for identifying opioid‐related suicides/suicide attempts and substance abuse‐involved overdoses perform adequately, particularly given the complexity of identifying these types of OODs. The NLP‐enhanced algorithms for suicides/suicide attempt‐related overdoses and abuse‐related overdoses substantially enhance classification, which should improve ascertainment in settings with NLP capacity. Finally, we used a conservative strategy for evaluating algorithm performance by using an at‐risk sample of noncases. A random sample of the population would likely have resulted in no identified opioid overdoses and in better algorithm performance but would not have allowed us to learn from missed cases as part of algorithm development. Additional research is now needed to translate and assess the algorithms for use with ICD‐10 encounter data.

ETHICS STATEMENT

The study was reviewed and approved by the Kaiser Permanente Northwest Institutional Review Board for the Protection of Human Subjects. Participating sites reviewed the study and ceded oversight to the KPNW IRB.

CONFLICT OF INTEREST

This project was conducted as part of a Food and Drug Administration (FDA)‐required postmarketing study of extended‐release and long‐acting opioid analgesics (https://www.fda.gov/downloads/Drugs/DrugSafety/InformationbyDrugClass/UCM484415.pdf) funded by the Opioid Postmarketing Consortium (OPC). The OPC is composed of companies that hold NDAs of extended‐release and long‐acting opioid analgesics and, at the time of publication, they included the following companies: Allergan; Assertio Therapeutics, Inc; BioDelivery Sciences, Inc; Collegium Pharmaceutical, Inc; Daiichi Sankyo, Inc; Egalet Corporation; Endo Pharmaceuticals, Inc; Hikma Pharmaceuticals USA Inc; Janssen Pharmaceuticals, Inc; SpecGX, LLC; Pernix Therapeutics Holdings, Inc; Pfizer, Inc; and Purdue Pharma, LP. The study was designed in collaboration between OPC members and independent investigators with input from FDA. Investigators maintained intellectual freedom in terms of publishing final results. This study was registered with ClinicalTrials.gov as study NCT02667197 on January 28, 2016. All authors received research funding from the OPC. Drs Green and Perrin and Ms Janoff received prior funding from Purdue Pharma, LP to carry out related research. Dr Green provided research consulting to the OPC. Kaiser Permanente Center for Health Research (KPCHR) staff had primary responsibility for study design, though OPC members provided comments on the protocol. The protocol and statistical analysis plan were reviewed by FDA, revised following review, and then approved. All algorithm development and validation analyses were conducted by KPCHR; analyses of algorithm portability were completed by each participating site. KPCHR staff made all final decisions regarding publication and content, though OPC members reviewed and provided comments on the manuscript. Drs DeVeaugh‐Geiss and Coplan were employees of Purdue Pharma, LP at the time of the study. Dr Carrell has received funding from Pfizer Inc and Purdue Pharma, LP to carry out related research. Dr Grijalva has served as a consultant for Pfizer, and Merck for unrelated work; he has also received funding from NIH, AHRQ, CDC, FDA, and Sanofi‐Pasteur. Drs Liang and Enger are employees of Optum. Data S1 Supplement A: ICD‐9 codes used to identify the sample at‐risk for opioid‐related overdoses. Click here for additional data file. Data S2 Supplement B. Chart Audit Procedures Figure 1. Overview of general chart audit procedures. Table 1. Categories used by chart auditors to classify types of opioid‐related overdoses. Table 2. Results of chart audits in the KPNW development sample. Includes both suspected overdoses and review of at‐risk individuals. Click here for additional data file.

28 in total

1. Predicting Suicide Attempts and Suicide Deaths Following Outpatient Visits Using Electronic Health Records.

Authors: Gregory E Simon; Eric Johnson; Jean M Lawrence; Rebecca C Rossom; Brian Ahmedani; Frances L Lynch; Arne Beck; Beth Waitzfelder; Rebecca Ziebell; Robert B Penfold; Susan M Shortreed
Journal: Am J Psychiatry Date: 2018-05-24 Impact factor: 18.112

2. Opioid risk management: understanding FDA mandated risk evaluation and mitigation strategies (REMS).

Authors: Paul Alexander Sloan
Journal: J Opioid Manag Date: 2009 May-Jun

3. Vital signs: overdoses of prescription opioid pain relievers---United States, 1999--2008.

Authors:
Journal: MMWR Morb Mortal Wkly Rep Date: 2011-11-04 Impact factor: 17.586

4. Increases in Drug and Opioid-Involved Overdose Deaths - United States, 2010-2015.

Authors: Rose A Rudd; Puja Seth; Felicita David; Lawrence Scholl
Journal: MMWR Morb Mortal Wkly Rep Date: 2016-12-30 Impact factor: 17.586

5. Opioid prescriptions for chronic pain and overdose: a cohort study.

Authors: Kate M Dunn; Kathleen W Saunders; Carolyn M Rutter; Caleb J Banta-Green; Joseph O Merrill; Mark D Sullivan; Constance M Weisner; Michael J Silverberg; Cynthia I Campbell; Bruce M Psaty; Michael Von Korff
Journal: Ann Intern Med Date: 2010-01-19 Impact factor: 25.391

6. Clinical guidelines for the use of chronic opioid therapy in chronic noncancer pain.

Authors: Roger Chou; Gilbert J Fanciullo; Perry G Fine; Jeremy A Adler; Jane C Ballantyne; Pamela Davies; Marilee I Donovan; David A Fishbain; Kathy M Foley; Jeffrey Fudin; Aaron M Gilson; Alexander Kelter; Alexander Mauskop; Patrick G O'Connor; Steven D Passik; Gavril W Pasternak; Russell K Portenoy; Ben A Rich; Richard G Roberts; Knox H Todd; Christine Miaskowski
Journal: J Pain Date: 2009-02 Impact factor: 5.820

7. Abuse rates and routes of administration of reformulated extended-release oxycodone: initial findings from a sentinel surveillance sample of individuals assessed for substance abuse treatment.

Authors: Stephen F Butler; Theresa A Cassidy; Howard Chilcoat; Ryan A Black; Craig Landau; Simon H Budman; Paul M Coplan
Journal: J Pain Date: 2012-11-03 Impact factor: 5.820

8. Development of an algorithm to identify serious opioid toxicity in children.

Authors: Cecilia P Chung; S Todd Callahan; William O Cooper; Katherine T Murray; Kathi Hall; Judith A Dudley; C Michael Stein; Wayne A Ray
Journal: BMC Res Notes Date: 2015-07-04

9. Using natural language processing of clinical text to enhance identification of opioid-related overdoses in electronic health records data.

Authors: Brian Hazlehurst; Carla A Green; Nancy A Perrin; John Brandes; David S Carrell; Andrew Baer; Angela DeVeaugh-Geiss; Paul M Coplan
Journal: Pharmacoepidemiol Drug Saf Date: 2019-06-19 Impact factor: 2.890

10. Prescription drug monitoring programs and other interventions to combat prescription opioid abuse.

Authors: Bharath Chakravarthy; Shyam Shah; Shahram Lotfipour
Journal: West J Emerg Med Date: 2012-11

24 in total

1. Opioid prescribing history prior to heroin overdose among commercially insured adults.

Authors: Pooja Lagisetty; Kun Zhang; Rebecca L Haffajee; Lewei Allison Lin; Jason Goldstick; Rebecca Brownlee; Amy Bohnert; Marc R Larochelle
Journal: Drug Alcohol Depend Date: 2020-05-12 Impact factor: 4.492

2. Risk and protective factors for repeated overdose after opioid overdose survival.

Authors: Brian Suffoletto; Amy Zeigler
Journal: Drug Alcohol Depend Date: 2020-02-05 Impact factor: 4.492

3. Opioid overdose survivors: Medications for opioid use disorder and risk of repeat overdose in Medicaid patients.

Authors: Stephen Crystal; Molly Nowels; Hillary Samples; Mark Olfson; Arthur Robin Williams; Peter Treitler
Journal: Drug Alcohol Depend Date: 2022-01-10 Impact factor: 4.492

4. Comparative Analysis of Patient Distress in Opioid Treatment Programs using Natural Language Processing.

Authors: Fatemeh Shah-Mohammadi; Wanting Cui; Keren Bachi; Yasmin Hurd; Joseph Finkelstein
Journal: Biomed Eng Syst Technol Int Jt Conf BIOSTEC Revis Sel Pap Date: 2022-02

5. Stepping up to the Canadian opioid crisis: a longitudinal analysis of the correlation between socioeconomic status and population rates of opioid-related mortality, hospitalization and emergency department visits (2000-2017).

Authors: Mhd Wasem Alsabbagh; Martin Cooke; Susan J Elliott; Feng Chang; Noor-Ul-Huda Shah; Marco Ghobrial
Journal: Health Promot Chronic Dis Prev Can Date: 2022-06 Impact factor: 2.725

6. Prescription Opioid Dispensing Patterns Prior to Heroin Overdose in a State Medicaid Program: a Case-Control Study.

Authors: Daniel M Hartung; Kirbee A Johnston; Sara Hallvik; Gillian Leichtling; Jonah Geddes; Christi Hildebran; Shellie Keast; Brian Chan; P Todd Korthuis
Journal: J Gen Intern Med Date: 2020-09-15 Impact factor: 5.128

7. Identifying individuals with opioid use disorder: Validity of International Classification of Diseases diagnostic codes for opioid use, dependence and abuse.

Authors: Pooja Lagisetty; Claire Garpestad; Angela Larkin; Colin Macleod; Derek Antoku; Stephanie Slat; Jennifer Thomas; Victoria Powell; Amy S B Bohnert; Lewei A Lin
Journal: Drug Alcohol Depend Date: 2021-02-13 Impact factor: 4.492

Review 8. Can antiepileptic efficacy and epilepsy variables be studied from electronic health records? A review of current approaches.

Authors: Barbara M Decker; Chloé E Hill; Steven N Baldassano; Pouya Khankhanian
Journal: Seizure Date: 2021-01-13 Impact factor: 3.184

Review 9. Characterizing trends in methamphetamine-related health care use when there is no ICD code for "methamphetamine use disorder".

Authors: Riley D Shearer; Nathan D Shippee; Tyler N A Winkelman
Journal: J Subst Abuse Treat Date: 2021-03-19

10. Associations of opioid prescription dose and discontinuation with risk of substance-related morbidity in long-term opioid therapy.

Authors: Patrick D Quinn; Zheng Chang; Matthew J Bair; Martin E Rickert; Robert D Gibbons; Kurt Kroenke; Brian M D'Onofrio
Journal: Pain Date: 2022-04-01 Impact factor: 7.926