| Literature DB >> 34313748 |
Guoqian Jiang1, Sanket S Dhruva2, Jiajing Chen3, Wade L Schulz4,5, Amit A Doshi6, Peter A Noseworthy7, Shumin Zhang8, Yue Yu9, H Patrick Young10, Eric Brandt3, Keondae R Ervin11, Nilay D Shah12, Joseph S Ross5,10, Paul Coplan13,14, Joseph P Drozda3.
Abstract
OBJECTIVE: The study sought to conduct an informatics analysis on the National Evaluation System for Health Technology Coordinating Center test case of cardiac ablation catheters and to demonstrate the role of informatics approaches in the feasibility assessment of capturing real-world data using unique device identifiers (UDIs) that are fit for purpose for label extensions for 2 cardiac ablation catheters from the electronic health records and other health information technology systems in a multicenter evaluation.Entities:
Keywords: RWE; UDI; cardiac ablation catheters; informatics analysis; medical device evaluation; real-world evidence; unique device identifier
Mesh:
Year: 2021 PMID: 34313748 PMCID: PMC8449615 DOI: 10.1093/jamia/ocab117
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.A data flow diagram illustrating a typical process for the use of unique device identifiers (UDIs) for collecting device exposure data and clinical data. EHR: electronic health record; GUDID: Global Unique Device Identification Database; IT: information technology; J&J: Johnson & Johnson; PCORnet: Patient-Centered Research Network; YNHH: Yale New Haven Hospital.
AF and VT patient counts by disease subtype (Note that 1 patient may have more than 1 diagnosis)
| AF | Paroxysmal AF | Persistent AF | Permanent AF | Unspecified and other AF | VT | Ischemic VT | Nonischemic VT | |
|---|---|---|---|---|---|---|---|---|
| Mercy (01/01/2014-02/20/2020) | 169 062 | 88 387 | 11 898 | 31 753 | 145 903 | 24 401 | 16 379 | 8022 |
| Mayo Clinic (01/01/2014-12/31/2019) | 133 298 | 60 999 | 12 372 | 21 800 | 98 839 | 20 920 | 13 114 | 7806 |
| YNHH (02/01/2013-08/13/2019) | 54 821 | 15 007 | 3594 | 14 961 | 21 259 | 14 104 | 9599 | 4505 |
| Total |
|
|
|
|
|
|
|
|
AF: atrial fibrillation; VT: ventricular tachycardia; YNHH: Yale New Haven Hospital.
Device counts for AF patients by brand-specific subtypes of interest
| Paroxysmal AF | Persistent AF | |||
|---|---|---|---|---|
| ThermoCool ST | ThermoCool STSF | ThermoCool ST (treatment catheter) | ThermoCool STSF (control catheter) | |
| Mercy (01/01/2014-02/20/2020) | 377 | 408 | 251 | 492 |
| Mayo Clinic (01/01/2014-12/31/2019) | 625 | 248 | 233 | 100 |
| YNHH (02/01/2013-08/13/2019) | 96 | 135 | 65 | 115 |
| Total | 1098 | 791 | 549 | 707 |
AF: atrial fibrillation; ST: Smarttouch; STSF: Smarttouch Surround Flow; YNHH: Yale New Haven Hospital.
Validation of the AF subtype cases identified using ICD codes against the prospective nurse-abstracted registry data at Mayo Clinic
| Code | Vocabulary | Term | Paroxysmal AF in registry | Persistent AF in registry | Total |
|---|---|---|---|---|---|
| 427.31 | ICD-9 | AF | 304 (41.6) | 427 (58.4) | 731 (100) |
| I48.0 | ICD-10 | Paroxysmal AF | 260 (52.4) | 236 (47.6) | 496 (100) |
| I48.1 | ICD-10 | Persistent AF | 52 (29.5) | 124 (70.5) | 176 (100) |
| I48.2 | ICD-10 | Chronic AF | 4 (19.0) | 17 (81.0) | 21 (100) |
| I48.91 | ICD-10 | Unspecified AF | 251 (41.8) | 349 (58.2) | 600 (100) |
Values are n (%). ICD-9 codes were used prior to October 2015 and ICD-10 codes thereafter.
AF: atrial fibrillation; ICD: International Classification of Diseases; ICD-9: International Classification of Diseases–Ninth Revision; ICD-10: International Classification of Diseases–Tenth Revision;
Summary of predictive values for ICD codes by AF type at Mercy as compared with an natural language processing tool
| Paroxysmal AF (%) | Persistent AF (%) | Chronic AF (%) | |
|---|---|---|---|
| Sensitivity (recall) | 82.80 | 62.70 | 74.80 |
| Specificity | 86.50 | 95.90 | 90.80 |
| Positive predictive value (precision) | 94.20 | 80.40 | 87.60 |
| Negative predictive value | 65.70 | 90.60 | 80.70 |
AF: atrial fibrillation; ICD: International Classification of Diseases.
The CDM implementation status of 3 health systems: Mayo Clinic, Mercy, and YNNH
| CDM Implementation Status | Mayo Clinic | Mercy | YNHH |
|---|---|---|---|
| i2b2 Star Schema | X | X | |
| PCORnet CDM | X | X | |
| OMOP CDM | X (in progress) | X | |
| Sentinel CDM | X | ||
| FHIR | X | X | X |
CDM: common data model; CPT: Current Procedural Terminology; FHIR: Fast Healthcare Interoperability Resources; i2b2: Informatics for Integrating Biology and the Bedside; OMOP: Observational Medical Outcomes Partnership; PCORnet: Patient-Centered Research Network; YNHH: Yale New Haven Hospital.
Figure 2.The maturity level analysis results by 3 sites for the key technologies used in the data capture and transformation. Maturity level consists of 5 levels (ie, 1 = conceptual, 2 = reactive, 3 =structured, 4 = complete, and 5 = advanced). CDM: common data model; NLP: natural language processing; UDI: unique device identifier; YNHH: Yale New Haven Hospital.
Successes and challenges from informatics analysis
| Successes vs Challenges | |
|---|---|
| Use of UDIs
Data capture Data transformation Maturity | Successes:
The use of UDIs had been planned in the proposal stage, which had been envisioned as a key method to identify device exposure data. The use of UDIs is particularly effective in identifying brand-specific devices and relevant device exposure data as targeted in the NESTcc test case (see details in text). |
| Challenges:
The source of UDI information varies by healthcare system, requiring tailored approaches to extracting it and linking it to EHRs. UDIs are documented in different health IT systems and efforts are needed to identify and link them with clinical data in EHR systems. UDI implementation in health IT systems is uneven across sites. For example, data on medical devices used in YNHH prior to October 2017 are currently not readily available and were not routinely captured within the EHR. | |
| Use of Standardized Codes
Data capture Data transformation Maturity | Successes:
The algorithms for identifying conditions and outcome endpoints mainly rely on ICD codes. The advantage of the approach is that data can be readily collected across NCs. Data quality validation using registry data and chart review as a “gold standard” is an important component in the study design. |
| Challenges:
Coming to an agreement on standard computable covariate and outcome definitions took more time than we foresaw and requires domain-specific (cardiac electrophysiologist) clinical expertise. Data validation is a complex task for which we had not planned. We were able only to assess positive predictive values for study outcomes in small samples due to time and funding constraints. The accuracy of the algorithms using only ICD-10 codes is not optimal for some outcomes (largely owing to carry over of past diagnoses into subsequent healthcare visits) and more complex algorithms will need to be explored in the future study, eg, only using primary diagnosis vs also include secondary diagnosis codes, applying the requirement of no reported diagnosis prior to the index procedure, only including inpatient events for some outcomes such as stroke, adding additional data types (eg, procedures, medications) and using unstructured clinical notes searched by NLP. Refinement of the algorithms to define rehospitalization and reason for rehospitalization with consensus across NCs is required for future work. One key issue is that important diagnoses (eg, arrhythmias, stroke) are carried forward for a significant period of time (eg once a patient is diagnosed with AF, they may continue to carry this diagnosis into the future even though the arrhythmia may not necessarily have recurred, especially in ambulatory care visits); this makes ascertaining arrhythmia recurrence using diagnosis codes a challenge as an effectiveness study outcome and will require algorithm development (eg, restricting stroke events to inpatient diagnoses), refinement, and validation for use in a regulatory grade study. Simply examining all diagnoses from ICD-10 codes during follow-up will lead to misclassification and thereby low positive predictive value. Some of the “gold standard” measures used in the validation of AF diagnoses had not been validated themselves so their diagnostic accuracy is unknown, ie, the ablation registry at Mayo Clinic and the NLP probe at Mercy YNHH uses an internal coding for procedures, which are not all mapped to the standard CPT codes, and often less specific, multiple procedure records can exist for the same procedure with some lag in entry time. Some of these records can persist even when the procedure did not take place, and in some instances, more than 1 ablation procedure may have taken place. These issues may require manual chart review to resolve, which can be time-consuming. | |
| Use of NLP technology
Data capture Data transformation Maturity | Successes:
We have successfully leveraged NLP to identify covariates like left ventricular ejection fraction from echocardiogram reports, and to validate atrial fibrillation patient phenotypes (see details in text). The value of the NLP technology in adding additional data points for improving accuracy of phenotyping algorithms has been realized (see details in text). |
| Challenges:
Requiring advanced expertise in using existing NLP tools or developing fit-for-purpose NLP algorithms. Lacking NLP solutions that are portable across sites. | |
| Use of CDMs
Data capture Data transformation Maturity | Advantages:
The OMOP CDM has specified a device exposure table, with a field to capture UDI information. i2b2 star schema is a generic model that can handle device data by leveraging device vocabularies in its ontology cell. PCORnet CDM is working on expanding the model to capture UDI and device exposure data. Sentinel CDM is designed primarily for insurance claims data and contains no device data. CDMs can be used for standardizing data collection and analysis process across sites, facilitating meaningful collaborations. |
| Challenges:
The implementation of CDMs often requires significant time and effort to extract and convert data from clinical data information systems, such as EHRs and laboratory information systems, to the format required to load into each CDM. Multiple CDMs may be difficult to maintain for each health system and the health systems may implement different CDMs, thus decreasing the value of use of CDMs. The CDMs lack definitive rules for storing the UDI, and therefore, more generic identifiers such as a device identifier without a product identifier may be present in these fields. |
CDM: common data model; CPT: Current Procedural Terminology; EHR: electronic health record; i2b2: Informatics for Integrating Biology and the Bedside; ICD-10: International Classification of Diseases–Tenth Revision; IT: information technology; NC: network collaborator; NESTcc: National Evaluation System for Health Technology Coordinating Center; NLP: natural language processing; OMOP: Observational Medical Outcomes Partnership; PCORnet: Patient-Centered Research Network; UDI: unique device identifier; YNHH: Yale New Haven Hospital.