| Literature DB >> 31660955 |
Marcel von Lucadou1, Thomas Ganslandt2, Hans-Ulrich Prokosch3, Dennis Toddenroth3.
Abstract
BACKGROUND: The secondary use of electronic health records (EHRs) promises to facilitate medical research. We reviewed general data requirements in observational studies and analyzed the feasibility of conducting observational studies with structured EHR data, in particular diagnosis and procedure codes.Entities:
Keywords: Availability; Completeness; Correctness; Currency; Data quality; Electronic health record; Granularity; Observational study; Retrospective study
Mesh:
Year: 2019 PMID: 31660955 PMCID: PMC6819452 DOI: 10.1186/s12911-019-0939-0
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Overview of available eligibility criteria and other patient characteristics. A patient characteristic was considered available if it had been documented at least once. A detailed overview of exposure (= E) and outcome (= O) variables can be overviewed in detail in Table 1
Study descriptions and detailed overview of investigated outcome and exposure variables from Fig. 1
| 1. DPS study | Exposure | A | C | Outcome | A | C | ||
Assessing the relationship between typical risk factors and the severity of Dupuytren’s contracture | ||||||||
| Comorbidities | Epilepsy Diabetes mellitus | X | – | Classifications | Iselin | – | X | |
| Lifestyle | Alcohol Smoking | X | – | |||||
| Demography | Sex | X | – | Classifications | Iselin at initial or recurrent diagnosis of Dupuytren’s contracture | – | X | |
| Demography | Mean age at initial or recurrent diagnosis of Dupuytren’s contracture | X | – | |||||
| Classifications | Iselin (stage of Dupuytren’s contracture) | – | X | Complications | Postoperative Intraoperative | X | X | |
| 2. DG study | Exposure | A | C | Outcome | A | C | ||
Perforation numbers after endoscopic esophageal balloon dilatation and bougination N = 423 | Perforation | Perforation No Perforation | X | – | Length of hospital stay | X | – | |
| 3. DRO study | Exposure | A | C/R | Outcome | A | C/R | ||
Comparison of two radiochemotherapies protocols in locally advanced pancreatic cancer N = 13 | Radiotherapy Chemotherapy | Radiotherapy + Gemcitabine and cisplatin | X | X | Classifications | Common toxicity criteria | – | – |
Radiotherapy + 5-Fluororuacil and mitomycin C | – | – | Death | X | – | |||
| Additive chemotherapy | X | X | Death | X | – | |||
“X” = available, “-” = unavailable, A: Administrative data (structured data), C: Clinical data (structured and unstructured data), R: Registry data (structured data)
Overview of the computed patient characteristic distributions from our EHR structured data in comparison to the original study
| DPS study | |||||
| Patient characteristics | Publication (P) | EHR (E) | |||
DP: 1956–2006 (50 years) | DP: Nov 1999 – Sep 2016 (17 years) | ||||
| Demography | Sex | Male, n (%) | 2579 (88.4) | 327 (79.6) | |
| Female, n (%) | 340 (11.6) | 84 (20.4) | |||
| Mean age | Male, years | 57.62 | 61.39 | ||
| Female, years | 62.62 | 64.57 | |||
| Complications | Nerve injury, n (%) | 108 (3.7) | 3 (0.73) | ||
| Tendon injury, n (%) | 5 (0.2) | 2 (0.49) | |||
| Skin necrosis, n (%) | 76 (2.6) | 0 (0) | |||
| Infection, n (%) | 94 (3.2) | 1 (0.24) | |||
| Bleeding, n (%) | 35 (1.2) | 7 (1.7) | |||
| Comorbidities | Diabetes mellitus, n (%) | 306 (10.5) | 25 (6.1) | ||
| Epilepsy, n (%) | 39 (1.3) | 5 (1.2) | |||
DP: 1988–2006 (18 years) | DP: Nov 1999 – Sep 2016 (17 years) n = 411 | ||||
| Demography | Sex | Male, n (%) | 977 (87.3) | 327 (79.6) | |
| Female, n (%) | 142 (12.7) | 84 (20.4) | |||
| Affected digit/s | One digit, n (%) | 505 (45.1) | 100 (24.3) | ||
| More than one digit, n (%) | 614 (54.9) | 263 (64) | |||
| Operation technique | Limited fasciectomy, n (%) | 1061 (94.8) | 5 (1.2) | ||
| Total fasciectomy, n (%) | 58 (5.2) | 0 (0) | |||
| Additional amputation, n (%) | 13 (1.2) | 2 (0.5) | |||
| Recurrence, n (%) | 145 (13) | 67 (16.3) | |||
| Lifestyle | Smoking, n (%) | 185 (16.5) | 11 (2.7) | ||
| Alcohol, n (%) | 236 (21.1) | 4 (1) | |||
| Both, n (%) | 76 (6.8) | 0 (0) | |||
| Simultaneous operations | Carpal tunnel, n (%) | 14 (1.3) | 33 (8.0) | ||
| Trigger finger, n (%) | 26 (2.3) | 44 (10.7) | |||
| DG study | |||||
| Patient characteristics | Publication (P) | EHR (E) | |||
DP: Jan 2002 – Dec 2011 (9 years) | DP: Dec 2000 – Sep 2016 (15 years) n = 423 | ||||
| Demography | Sex | Male, n (%) | 269 (73.1) | 297 (70.2) | |
| Female, n (%) | 99 (26.9) | 126 (29.8) | |||
| Mean Age | Total, years (range) | 61.3 (21–94) | 62.5 (19–96) | ||
| Balloon Dilatation, years | 61.8 | 57.7 | |||
| Bougination, years | 63.4 | 63.5 | |||
Balloon dilatation, Bougination | Patient with balloon dilatation, n (%) | 68 (18.5) | 69 (16.3) | ||
| Patient with bougination, n (%) | 300 (81.5) | 354 (83.7) | |||
| Sessions of balloon dilatation, n | 211 | 136 | |||
| Sessions of bougination, n | 1286 | 1281 | |||
| Total sessions, n | 1497 (4.07 per patient) | 1417 (3.35 per patient) | |||
| Perforation | Balloon dilatations, n | 0 | 0 | ||
| Bougination, n | 8 | 5 | |||
| Occurence, n | During first session: 4 During second session: 4 | During first session: 3 During second session: 1 During later sessions: 1 | |||
| Detection, n | During procedure: 4 After procedure: 4 | No specification: 5 | |||
| Rate per procedure, % | 0.53 | 0.35 | |||
| Rate per patient, % | 2.17 | 1.18 | |||
| Death, n | 1 | 0 | |||
| Length of hospital stay | With perforation, days (range) | 21.3 (9–41) | 19.2 (4–36) | ||
| Without perforation, days (range) | 6.8 (2–13) | 9.1 (1–80) | |||
| DRO Study | |||||
| Patient characteristics | Publication (P) | EHR (E) | |||
DP: unknown Gemcitabine and cisplatin | DP: Jan 2008 – May 2015 Gemcitabine and cisplatin n = 13 | ||||
| Demography | Sex | Male, n (%) | 36 (62) | 10 (76.9) | |
| Female, n (%) | 22 (38) | 3 (23.1) | |||
| Mean Age, years | 63 | 60.5 | |||
| Tumor | Location | Head, n (%) | 40 (69) | 6 (46.2) | 3 (23.1) (no specification) |
| Body, n (%) | 9 (16) | 1 (7.7) | |||
| Tail, n (%) | 1 (2) | 0 (0) | |||
| Head body, n (%) | 5(9) | 3 (23.1) | |||
| Body tail, n (%) | 3 (5) | ||||
| Diagnosed by | Biopsy, n (%) | 41 (71) | 1 (7.7) | ||
| Operation, n (%) | 15 (29) | 0 (0) | |||
| Median Duration | Radiotherapy, days | 42 | 42 | ||
| Survival | Alive at analysis, n | 2 | 10 | ||
Note that the DPS study included two observational periods. DP = data period
Critical appraisal of the included study populations and discussion of contextual data quality issues in patient characteristics (see orange boxes, Fig. 1)
| DPS study | |||
| Issues in the study population | |||
| To ensure the condition had been treated during the considered encounter, we only included subjects with a primary diagnosis of Dupuytren disease. The sensitivity of this approach may be limited insofar as some patients with another primary diagnosis also might have been eligible. | |||
| Patient characteristics | DQ violation | Issue | |
| Recurrence | Granularity | A label for a recurrence of Dupuytren’s contracture was not available. We therefore implemented a computed phenotype by considering the diagnosis of Dupuytren as a recurrence when documented during a subsequent encounter. This phenotype may be biased in certain circumstances, as neither the diagnosis nor the procedure codes specified the affected hand. Accordingly, we could not differentiate between a recurrence and a novel affection of the other hand. Further, the patient could have already undergone surgery in a different hospital, so that his seeming first encounter may in fact represent a recurrence. | |
| Operation | Technique | Completeness Granularity | The actually corresponding procedure codes for the described operation techniques in the original study were not frequently used in our EHR, which instead employed different procedure codes; this suggests that documentation habits may have affected frequency estimates. We were unable to clearly ascertain which procedure codes represented treatment of conditions that had been documented via simultaneous diagnostic codes. |
| Simultaneous | Granularity | We computed the distribution of simultaneous surgeries by counting the secondary diagnoses of carpal tunnel and trigger finger. The corresponding procedure codes described surgical techniques that could be applied to treat several different hand conditions, so that we could not determine whether these diagnoses had been treated during the encounter. | |
| Complications | Currency Granularity | Counting complications would require interpretations of plausible temporal and causal relationships, which we were not always able to infer from observable codes. When a subject had received more than one intervention during an encounter, for example, it was difficult to determine which of the corresponding clinical events happened first and caused each other. | |
| Lifestyle | Completeness Granularity | Information on lifestyle features such as alcohol and smoking was not frequently coded in our EHR. The diagnostic codes did not clarify whether, for example, a patient had started or stopped smoking. We noticed an inconsistent use of these codes throughout patient encounters, which can be interpreted as a changing diagnostic status or incompleteness (see Additional file | |
| Affected digits | Completeness Granularity | There was no diagnostic code available that specified the number of affected digits. To bypass this problem, we computed a phenotype from procedure codes that had specified the number of digits operated on. The phenotype provided incomplete numbers, which may point to low utilization of the included procedure codes in our EHR. | |
| DG study | |||
| Issues in the study population | |||
| – | |||
| Eligibility criteria | DQ violation | Issue | |
| Esophageal stenosis | Completeness | We noticed an inconsistent coding of esophagus stenosis. Adding the diagnostic code for esophagus stenosis to the eligibility criteria as described in the publication would decrease the study population size. | |
| Patient characteristics | DQ violation | Issue | |
| Indication | Granularity Correctness | In some instances, we were unable to determine and count the indications for bougination or dilatation, as the etiology of the treated esophagus stenosis formation was occasionally not well reflected in the codes. Some patients had multiple diagnoses that could be considered as a potential indication. According to free-text notes, some subjects developed a postsurgical or postradiation esophagus stenosis after tumor treatment, but had only a tumor diagnosis coded. | |
| Perforation: Occurrence, clinic, management | Currency Granularity | The computed number of intervention-associated perforations might be subject to bias, insofar as we could not retrace whether the endoscopic procedure carried out on the esophagus preceded the complication, or whether the coded perforation was a remnant of earlier medical history. Further, we could not determine the subsequent management of perforations and the associated clinical picture, as the diagnostic codes and their time stamps did not permit us to extrapolate causal chains and temporal sequences. | |
| Death | Granularity | We had data on inpatient death dates. Information on the cause of death was not available. | |
| DRO study | |||
| Issues in the study population | |||
| We filtered for patients who received the exact same chemotherapy protocol, varying by a maximum of +/− 2 days, which excludes subjects with a delayed administration or subjects with additional chemotherapy after the treatment. Further, since the disease stage and tumor histology type was not reflected in the codes, we could not tell if the radiochemotherapy in our recruited study population had been administrated for the primary tumor, distant metastasis, or for a different tumor. | |||
| Eligibility criteria | DQ violation | Issue | |
| Locally advanced pancreatic cancer | Granularity | No specific diagnostic code was available for this tumor entity. | |
| Radiotherapy | Granularity | No specific procedure code was available for the radiotherapy protocol. | |
| Chemotherapy | Granularity | No specific procedure code was available for the chemotherapy protocol. | |
| Patient characteristics | DQ violation | Issue | |
| Tumor | Location | Correctness Granularity | We computed the distribution of tumor location distribution from coded tumor location of the first encounter. For some subjects, the tumor location remained unspecified at this point. Further, we noticed a changing tumor location in subsequent encounters (see Additional file |
| Diagnosed by operation/biopsy | Completeness | Only patient data from the university hospital in Erlangen were available for our analysis. Consequently, operations or biopsies performed at other hospitals could not be included in our distribution computation. | |
| Death | Completeness Granularity | We only had data on death dates of patients who had died in the hospital. Information on the cause of death was not available. | |
| Radiochemotherapy: Cause of adjustment | Granularity Currency | The cause of radiochemotherapy adjustment required interpreting temporal and causal relationships which could not be inferred from the codes. | |
We evaluated “correctness” if a corresponding gold standard data source was available, and “completeness” if the corresponding patient characteristic distribution had been calculated. DQ= Data quality