| Literature DB >> 33215077 |
Hugo De Oliveira1,2, Martin Prodel1, Ludovic Lamarsalle1, Matt Inada-Kim3,4,5, Kenny Ajayi6, Julia Wilkins6, Sara Sekelj6, Sue Beecroft7, Sally Snow6, Ruth Slater6, Andi Orlowski6,8.
Abstract
OBJECTIVE: The "Bow-tie" optimal pathway discovery analysis uses large clinical event datasets to map clinical pathways and to visualize risks (improvement opportunities) before, and outcomes after, a specific clinical event. This proof-of-concept study assesses the use of NHS Hospital Episode Statistics (HES) in England as a potential clinical event dataset for this pathway discovery analysis approach.Entities:
Keywords: HES database; data mining; hospitals/statistics and numerical data; process mining; sepsis
Year: 2020 PMID: 33215077 PMCID: PMC7660952 DOI: 10.1093/jamiaopen/ooaa039
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Data structuration protocol
| Requirement | Structuration procedure |
|---|---|
| Handling missing values in the unprocessed HES dataset |
Hospital stays with no discharge date and no length of stay (LOS) were labeled as day care (LOS < 1 day). 34 episodes not related to any hospital stay were removed from the dataset. |
| Removing redundant variables | Non-informative administrative or accounting variables captured in the unprocessed HES dataset were removed, as was demographic information. |
| Standardizing data input fields | Missing fields |
| Timestamp conversion | Timestamps were formatted into yyyy-mm-dd. The smallest unit considered was days. |
| Labeling and grouping diagnostic codes | Unprocessed medical data (HES episodes) were labeled using ICD-10 codes. Published methods were used to combine ICD-10 codes into one of 220 coded event classes. |
| Creating a time-ordered event sequence for each HES ID |
Episodes relating to a single stay were grouped into one coded event record (from the 220 ICD-10 coded events). A set of coded event records relating to each HES ID was time-ordered in a dataset. |
For example, LOS, discharge date and arrival method are included within HES for inpatient stays, but are not relevant for outpatient episodes.
Figure 1.Constructive, iterative, optimal pathway discovery algorithm. The score for each model describes the representativeness of the model. A “new” current solution is only adopted if it scores more highly than the current model. All Tabu neighbors (those that have recently been visited) are censored.
Hospital episode data included in the “bow-tie” analysis
| Patients | Hospital episodes | Hospital spells | Coded events |
|---|---|---|---|
| 76 523 | 4 509 000 | 964 000 | 580 000 |
Episodes, hospital spells, and coded events are the total combined number of events from the analyzed period: 2 years prior and 1 year after the index hospitalization for sepsis.
Figure 2.Bow-tie graph of the coded events in the 2 years before and 1 year after the index hospitalization for sepsis. The “bow-tie” graph is read from left to right, with circles representing event nodes of the process model (ie, coded events). The links (or edges) from each circle represent the time-ordered sequence of one coded event node following another. The sizes of nodes and links are proportional to the number of patients following this pathway. Note: The coded event “septicemia” contains a number of additional sepsis-related codes in addition to A40 or A41 (and their derivatives). See Supplementary Materials for full details of the HES ICD-10 codes included in this coded event.
Figure 3.(A) Coded events in the 2 years before the index hospitalization for sepsis (with patient numbers). (B) Coded events in the 1 year after the index hospitalization for sepsis (with patient numbers).
Allocation of the top 10 SOS codes to the coded event categories
| Coded event class categories | Suspicion of sepsis codes |
|---|---|
| Pneumonia |
J18.1 Lobar pneumonia, unspecified J18.9 Pneumonia, unspecified J18.0 Bronchopneumonia, unspecified |
| Septicemia | A41.9 Sepsis, unspecified |
| Urinary tract infections (UTI) | N39.0 Urinary tract infection, site not specified |
| Aspiration pneumonitis | J69.0 Pneumonitis due to food and vomit |
| Chronic obstructive pulmonary disease and bronchiectasis (“chronic pulmonary disease”) | J44.0 Chronic obstructive pulmonary disease with acute lower respiratory infection |
| Other lower respiratory disease | J22.X Unspecified acute lower respiratory infection |
| Skin and subcutaneous tissue infections (“skin infection”) | L03.1 Cellulitis of other parts of limb |
| Other gastrointestinal disorders (“gastrointestinal disorders”) | K63.1 Perforation of intestine (non-traumatic) |
Coded events used in the present study have been mapped to the top 10 SOS codes. These SOS codes provide an indication of patients at high risk of sepsis who should undergo proactive screening and are a key target for improving the detection and treatment of sepsis. Shaded boxes indicate the coded event classes represented in the final sepsis discovery model.