| Literature DB >> 35647478 |
Ang Li1, Wilson L da Costa2, Danielle Guffey3, Emily M Milner4, Anthony K Allam4, Karen M Kurian4, Francisco J Novoa4, Marguerite D Poche4, Raka Bandyo5,6, Carolina Granada1, Courtney D Wallace6, Neil A Zakai7, Christopher I Amos2,3.
Abstract
Background: Research on venous thromboembolism (VTE) that relies only on the International Classification of Diseases (ICD) can misclassify outcomes. Our study aims to discover and validate an improved VTE computable phenotype for people with cancer.Entities:
Keywords: administrative claims; health care; natural language processing; neoplasms; venous thromboembolism; venous thrombosis
Year: 2022 PMID: 35647478 PMCID: PMC9130880 DOI: 10.1002/rth2.12733
Source DB: PubMed Journal: Res Pract Thromb Haemost ISSN: 2475-0379
FIGURE 1Cohort construction and data sources. CPT, Current Procedural Terminology; HCPCS, Healthcare Common Procedure Coding System; ICD, International Classification of Diseases
FIGURE 2Study design for computable phenotype derivation. ICD, International Classification of Diseases; NLP, natural language processing; POS, part‐of‐speech; VTE, venous thromboembolism
FIGURE 3Study design for computable phenotype validation. ICD, International Classification of Diseases; NLP, natural language processing; VTE, venous thromboembolism
Performance of the ICD‐based algorithms after each exclusion filter in derivation data set
| ICD Category | No exclusion | Exclusion 1 | Exclusion 2 | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Total | PPV VTE, n (%) | PPV ±90d, n (%) | Total | PPV VTE, n (%) | PPV ±90d, n (%) | Total | PPV VTE | PPV ±90d | |
| Acute PE | 324 | 270 (83) | 252 (78) | 374 | 307 (82) | 279 (75) | 289 | 258 (89) | 241 (83) |
| Acute LE‐DVT | 228 | 146 (64) | 139 (61) | 284 | 180 (63) | 167 (59) | 202 | 142 (70) | 134 (66) |
| Acute UE‐DVT | 98 | 81 (83) | 78 (80) | 119 | 96 (81) | 88 (74) | 100 | 85 (85) | 78 (78) |
| Non‐specific VTE | 134 | 59 (44) | 47 (35) | 78 | 53 (68) | 46 (59) | 62 | 42 (68) | 36 (58) |
| Historic VTE | 173 | 75 (43) | 53 (31) | ||||||
| Chronic VTE | 43 | 31 (72) | 26 (60) | ||||||
| Total | 1000 | 662 (66) | 595 (60) | 855 | 636 (74) | 580 (68) | 653 | 527 (81) | 489 (75) |
Abbreviations: ICD, International Classification of Diseases; IVC, inferior vena cava; LE‐DVT, lower‐extremity deep vein thrombosis; PE, pulmonary embolism; PPV, positive predictive value; UE‐DVT, upper‐extremity deep vein thrombosis; VTE, venous thromboembolism.
See Table S1 for detailed list of code included.
No exclusion: use first ICD code after date of cancer diagnosis. Column “PPV VTE” indicates how many patients had acute VTE after cancer diagnosis regardless of timing; column “PPE ±90d” indicates how many patients had acute VTE within 90 days of first given ICD code.
Exclusion 1 (wrong ICD code): use first ICD code after excluding chronic VTE codes, history VTE codes, and a subset of nonspecific VTE codes from consideration of “acute VTE” (see Table S4).
Exclusion 2 (carryover effect): use first ICD code after excluding patients with known ICD codes for VTE (any time) or received therapeutic anticoagulation (up to 90 d) before date of cancer diagnosis + exclusion 1.
Exclusion 3 (rule out effect): use first ICD code after excluding patients with outpatient encounter UNLESS having 2+ codes >30 d and <180 d apart or receiving therapeutic anticoagulation at time of encounter (−7 d to +30 d) + exclusion 1 + exclusion 2.
Exclusion 4 (anticoagulation, IVC filter and death): use first ICD code that had anticoagulation, IVC filter placement, or death within 30 d regardless of inpatient or outpatient encounter +exclusion 1 + exclusion 2.
Performance of ICD and NLP‐based algorithms in validation data set
| Unweighted VTE (n = 800) | Weighted VTE (n = 8507) | Weighted Sensitivity, % | Weighted Specificity, % | Weighted PPV, % | Weighted NPV, % | Weighted c statistic | |||
|---|---|---|---|---|---|---|---|---|---|
| No | Yes | No | Yes | ||||||
| Previous ICD algorithm | |||||||||
| Predicted no (0) | 512 | 51 | 7604 | 253 | 71.7 (68.9‐74.6) | 99.7 (99.6‐99.8) | 96.7 (95.2‐97.7) | 96.8 (96.4‐97.2) | 0.86 (0.84‐0.87) |
| Predicted yes (1) | 8 | 229 | 22 | 641 | |||||
| Current ICD algorithm | |||||||||
| Predicted no (0) | 505 | 22 | 7584 | 172 | 80.8 (78.1‐83.1) | 99.4 (99.3‐99.6) | 94.5 (92.9‐96.2) | 97.8 (97.4‐98.1) | 0.90 (0.89‐0.91) |
| Predicted yes (1) | 15 | 258 | 42 | 722 | |||||
| Current NLP algorithm | |||||||||
| Predicted no (0) | 497 | 28 | 7420 | 94 | 89.5 (87.1‐91.2) | 97.3 (96.9‐97.6) | 79.5 (77.1‐81.9) | 98.7 (98.5‐99.0) | 0.93 (0.92‐0.94) |
| Predicted yes (1) | 23 | 252 | 206 | 800 | |||||
| ICD + NLP algorithm | |||||||||
| Predicted no (0) | 488 | 7 | 7356 | 35 | 96.1 (94.7‐97.2) | 97.0 (96.6‐97.3) | 78.8 (76.2‐81.2) | 99.5 (99.3‐99.7) | 0.98 (0.97‐0.98) |
| Predicted yes (1) | 26 | 36 | 218 | 203 | |||||
| 74.3 (71.5‐77.1) | 99.8 (99.6‐99.9) | 97.5 (96.0‐98.4) | 97.1 (96.7‐97.5) | ||||||
| Predicted yes (2) | 6 | 237 | 16 | 640 | |||||
Acute VTE ICD‐9‐CM and ICD‐10‐CM codes are listed in Table S1 (after excluding certain nonspecific, historic, and chronic codes). Therapeutic anticoagulation is defined as the presence of an administered (inpatient/infusion center) or prescribed (outpatient) direct oral anticoagulant (DOAC) (any dose), oral warfarin (any dose), subcutaneous enoxaparin (>1.3 mg/kg if daily or >0.8 mg/kg for twice‐daily frequency), subcutaneous fondaparinux (5‐10 mg), intravenous (IV) heparin (continuous drip only excluding flushes or pushes), or subcutaneous heparin (>7500 mg) within 7 days before and 30 days after suspected VTE diagnosis. IVC filter is defined by CPT codes 37191, 37620, 36005, and 36010. Relevant radiology reports are defined as any contrast scan or Doppler ultrasound with CPT codes listed in Table S2.
Abbreviations: ICD‐9‐CM, International Classification of Diseases, Ninth Revision, Clinical Modification; ICD‐10‐CM, International Classification of Diseases, Tenth Revision, Clinical Modification; NLP, natural language processing; NPV, negative predictive value; PPV, positive predictive value; VTE, venous thromboembolism.
Previous ICD algorithm (Sanfilippo et al,4 ICD‐9‐CM converted to ICD‐10‐CM): first of (any inpatient or outpatient acute VTE code) with (therapeutic anticoagulation or IVC filter or death −7 d to +30 d) within 365 d.
Current ICD algorithm: first of (any inpatient acute VTE code) or (2+ outpatient acute VTE code >30 d and <180 d) or (any outpatient acute VTE code with therapeutic anticoagulation −7 d to +30 d) within 365 d.
Current NLP algorithm: first relevant radiology impression predicted to be positive for VTE based on rule‐based NLP prediction.
ICD +NLP algorithm: 1st of either ICD or NLP algorithm within 365d. Predicted no (0) indicates both algorithms did not identify VTE; predicted yes (1) indicates one of the two algorithms identified VTE; predicted yes (2) indicates both algorithms identified VTE.
Weighted events are slightly different due to decimal rounding.
Number ranges in paratheses refer to 95% Confidence Intervals (CI) for the weighted estimates.