| Literature DB >> 29602429 |
Daniel Leightley1, Zoe Chui2, Margaret Jones3, Sabine Landau4, Paul McCrone5, Richard D Hayes6, Simon Wessely7, Nicola T Fear8, Laura Goodwin9.
Abstract
BACKGROUND: Electronic Healthcare Records (EHRs) are created to capture summaries of care and contact made to healthcare services. EHRs offer a means to analyse admissions to hospitals for epidemiological research. In the United Kingdom (UK), England, Scotland and Wales maintain separate data stores, which are administered and managed exclusively by devolved Government. This independence results in harmonisation challenges, not least lack of uniformity, making it difficult to evaluate care, diagnoses and treatment across the UK. To overcome this lack of uniformity, it is important to develop methods to integrate EHRs to provide a multi-nation dataset of health.Entities:
Keywords: Big data; Data linkage; Electronic health records; Hospital admission; Hospital episode statistics; Secondary care
Mesh:
Year: 2018 PMID: 29602429 PMCID: PMC5887874 DOI: 10.1016/j.ijmedinf.2018.02.012
Source DB: PubMed Journal: Int J Med Inform ISSN: 1386-5056 Impact factor: 4.046
Fig. 1Overview of the data linkage process and formation of patient-level dataset.
Defines the terminology used by data providers, data periods of interest and number of variables requested.
| Terminology | NHS Digital | Secure Anonymised Information Linkage (SAIL) | Information Services Division (ISD) | |
|---|---|---|---|---|
| Accident and Emergency (A&E) | Year range | 2007/08–2013/14 | 2009/10–2013/14 | 2003/04–2013/14 |
| Dataset | Accident and Emergency | Emergency Department Data Set | Accident and Emergency | |
| Variable Count | 142 | 69 | 42 | |
| Admitted Patient Care (APC) | Year range | 2003/04–2013/14 | 2003/04–2013/14 | 2003/04–2013/14 |
| Dataset | Admitted Patient Care | Patient Episode Database for Wales | Scottish Morbidity Records 01 | |
| Variable Count | 265 | 115 | 36 | |
| Outpatient | Year Range | 2003/04–2013/14 | 2004/05–2013/14 | 2003/04–2013/14 |
| Dataset | Outpatient | Outpatient | Outpatient | |
| Variable Count | 96 | 46 | 32 | |
Stipulates the definition used for associating variables to a commonality category.
| Category | Criteria |
|---|---|
| Admittance/Discharge | Variables that provide information on the admission and discharge of a patient. This includes admission and discharge date, episode information, time of admission, source of admission and destination of patient upon discharge. |
| Diagnosis/Classification | Variables that provide information on the diagnosis of the patient, including ICD-10 coding and date of diagnosis, A&E coding or local diagnoses coding system. |
| Treatment/Procedure/Investigation | Variables that provide information on the treatment/procedures undertaken, including OPCS Classification of Interventions and Procedures version 4 coding, or local diagnoses coding system. |
| Care Provider | Variables that provide information on the provider of care, including geographical location and provider type. |
| Care Speciality | Variables that provide information on the speciality of care, including consultant association, department of care and clinical staff role. |
| Costing/Resources | Variables that provide information on the cost of care, including cost of treatment, staffing costs and direct costs incurred by the care provider. |
Completeness of KCMHR demographics (n = 8,602). 1NHS/CHI number is complete if length is greater than 8. CHI number serves the same purpose as NHS number for Scotland. 2Valid if length 1 or greater. 3Valid if date of birth is present e.g. DD/MM/YY or DD/MM/YYYY.% represents row percentages.
| NHS Number/CHI Number1 | Initial | Forename2 | Surname2 | Gender | Date of Birth3 | |
|---|---|---|---|---|---|---|
| Variable Completeness | 6877 (79.95%) | 8179 (95.08%) | 8413 (97.8%) | 8602 (100%) | 8602 (100%) | 8597 (99.94%) |
| Missing Values | 1725 (20.05%) | 423 (4.92%) | 189 (2.2%) | 0 | 0 | 5 (0.06%) |
Fig. 2Data extraction, number of participants matched and the total number of EHR for each data source divided by those with and without an NHS/CHI number. Note: A participant can appear in multiple nation data providers. Percentage figures are cascading, where the percentage is out of the preceding value.
Represents the number of episodes and participant numbers for each department and nation. Percent values represent the percentage of the matched sample (n = 6336).
| Department | NHS Digital | Secure Anonymised Information Linkage | Information Services Division | |
|---|---|---|---|---|
| Accident and Emergency | Episodes | 6775 | 392 | 343 |
| Participant | 2873 (45.44%) | 163 (2.77%) | 206 (3.25%) | |
| Admitted Patient Care | Episodes | 7516 | 444 | 577 |
| Participant | 2970 (46.77%) | 176 (19.3%) | 251 (3.96%) | |
| Outpatient | Episodes | 41,026 | 1703 | 2276 |
| Participant | 4300 (67.87%) | 240 (3.79%) | 435 (6.87%) | |
Variable completeness for a sample of common variables formed in England, Scotland and Wales.
| Data Source | Common Variable (national assigned variable name) | NHS Digital | Information Services Division | Secure Anonymised Information Linkage |
|---|---|---|---|---|
| Accident & Emergency | Reason for Admission | 42.44% | 67.93% | 58.93% |
| Attendance Category (attendance_cat) | 100% | 100% | 100% | |
| Admission Time (time_arrival) | 87.13% | 93.76% | 90.51% | |
| Arrival Mode (arrival_mode) | 100% | 97.96% | 100% | |
| Provider Code (provider_code) | 100% | 100% | 100% | |
| Admitted Patient Care | Primary (1st) Diagnosis (diag_01) | 100% | 100% | 100% |
| Admission Source (admin_source) | 98.59% | 100% | 100% | |
| Main Speciality (main_speciality) | 100% | 97.13% | 98.08 | |
| Operation (oper_01) | 100% | 72.62% | 68.52% | |
| Discharge Method (dist_meth) | 100% | 77.34% | 79.09% | |
| Outpatient | Attend (attended) | 100% | 77.07% | 80.28% |
| Attended Type (attend_type) | 100% | 100% | 100% | |
| Main Speciality (main_ speciality) | 100% | 100% | 100% | |
| Referral Source (ref_source) | 100% | 78.60% | 100% | |
| Diagnosis (diag_01) | 97.98% | 100% | 100% |
Represents a group of variables which describe the anatomical area, side body and presenting diagnosis.
“Unknown and unspecified causes of morbidity” assigned to 87.57% of all appointments.
Most common ICD-10 codes assigned during Admitted Patient Care visit with gender comparison.
| ICD-10 Code | Description | Occurrence (n = participants) | Male (n = 874) | Female (n = 109) | |
|---|---|---|---|---|---|
| 1 | Z51 | Other medical care | 490 (n = 116) | 99 (85.34%) | 17 (14.66%) |
| 2 | Z86 | Personal history of certain other diseases | 336 (n = 190) | 159 (83.68%) | 31 (16.32%) |
| 3 | I10 | Essential (primary) hypertension | 295 (n = 151) | 140 (92.72%) | 11 (7.28%) |
| 4 | Z37 | Outcome of delivery | 281 (n = 198) | 0 | 198 (100%) |
| 5 | Z30 | Contraceptive management | 272 (n = 262) | 247 (94.27%) | 15 (5.73%) |
| 6 | R10 | Abdominal and pelvic pain | 266 (n = 221) | 155 (70.14%) | 66 (29.86%) |
| 7 | M23 | Internal derangement of knee | 265 (n = 217) | 208 (95.85%) | 9 (4.15%) |
| 8 | F17 | Mental and behavioural disorders due to use of tobacco | 259 (n = 190) | 168 (88.42%) | 22 (11.58%) |
| 9 | Z72 | Problems related to lifestyle | 253 (n = 197) | 183 (92.89%) | 14 (7.11%) |
| 10 | M54 | Dorsalgia | 222 (n = 123) | 102 (82.93%) | 21 (17.07%) |