Literature DB >> 30642263

Developing outcome, process and balancing measures for an emergency department longitudinal patient monitoring system using a modified Delphi.

Marie E Ward1, Abel Wakai2, Ronald McDowell3, Fiona Boland4, Eoin Coughlan5, Moayed Hamza1, John Browne5, Ronan O'Sullivan6, Una Geary7, Fiona McDaid8, Éidín Ní Shé1, Frances J Drummond9, Conor Deasy10, Eilish McAuliffe11.   

Abstract

BACKGROUND: Early warning score systems have been widely recommended for use to detect clinical deterioration in patients. The Irish National Emergency Medicine Programme has developed and piloted an emergency department specific early warning score system. The objective of this study was to develop a consensus among frontline healthcare staff, quality and safety staff and health systems researchers regarding evaluation measures for an early warning score system in the Emergency Department.
METHODS: Participatory action research including a modified Delphi consensus building technique with frontline hospital staff, quality and safety staff, health systems researchers, local and national emergency medicine stakeholders was the method employed in this study. In Stage One, a workshop was held with the participatory action research team including frontline hospital staff, quality and safety staff and health systems researchers to gather suggestions regarding the evaluation measures. In Stage Two, an electronic modified-Delphi study was undertaken with a panel consisting of the workshop participants, key local and national emergency medicine stakeholders. Descriptive statistics were used to summarise the characteristics of the panellists who completed the questionnaires in each round. The mean Likert rating, standard deviation and 95% bias-corrected bootstrapped confidence interval for each variable was calculated. Bonferroni corrections were applied to take account of multiple testing. Data were analysed using Stata 14.0 SE.
RESULTS: Using the Institute for Healthcare Improvement framework, 12 process, outcome and balancing metrics for measuring the effectiveness of an ED-specific early warning score system were developed.
CONCLUSION: There are currently no published measures for evaluating the effectiveness of an ED early warning score system. It was possible in this study to develop a suite of evaluation measures using a modified Delphi consensus approach. Using the collective expertise of frontline hospital staff, quality and safety staff and health systems researchers to develop and categorise the initial set of potential measures was an innovative and unique element of this study.

Entities:  

Keywords:  Early warning score system; Emergency department; Evaluation measures; Longitudinal patient monitoring

Mesh:

Year:  2019        PMID: 30642263      PMCID: PMC6332627          DOI: 10.1186/s12873-018-0220-3

Source DB:  PubMed          Journal:  BMC Emerg Med        ISSN: 1471-227X


Background

Longitudinal patient monitoring systems (Early Warning Score (EWS) or Track and Trigger system (TTS)) have been widely recommended for use to detect clinical deterioration in patients [1]. The emergency department (ED) is a unique healthcare environment. ED patients are likely to be unknown to ED clinical staff and present with undifferentiated symptoms. They usually have to be managed with limited clinical information, through small windows of time and focus. Existing longitudinal patient monitoring systems developed for hospital inpatients may not be suitable for the ED [2, 3]. The early recognition of patient deterioration is also a key patient safety strategy for ED patients. It enables timely clinical intervention and transfer to a higher level of care in order to prevent adverse patient outcomes [4, 5]. A recent systematic review found that early warning systems ‘seem to predict adverse outcomes in adult patients of varying acuity presenting to the ED but there is a lack of high quality comparative studies to examine the effect of using early warning systems on patient outcomes’ [6]. There is also a lack of published measures for evaluating the effectiveness of longitudinal patient monitoring systems in the ED setting and the challenges of developing such measures have been outlined [7]. The Irish National Emergency Medicine Programme (EMP), aimed at improving the safety and quality of ED patient care, developed and piloted an ED-specific longitudinal patient monitoring system known as ED-ACE where ACE is an acronym for Adult Clinical Escalation. This study is part of a larger research programme, which saw the first full-scale iterative implementation of ED-ACE in a large urban acute hospital [7]. The main objective of this study was to develop a consensus among key stakeholders in the hospital frontline healthcare staff, quality and safety (Q&S) experts and health system researchers for outcome, process and balancing measures to measure the effectiveness of ED-ACE.

Methods

Methodological approach

The methodology to develop the evaluation measures was a two-stage process. Stage One consisted of a workshop with the research programme’s Participatory Action Research Group (PAR) group. All 13 members were invited to attend. Ten members attended (Table 1). The purpose of the workshop was to build on the collective professional experience of the members to gather suggestions regarding the evaluation measures to be used for the implementation of ED-ACE. Participants were informed about and asked to consider the Institute for Healthcare Improvement (IHI) framework for developing outcome, process and balancing measures [8]. The ‘stickies’ method was used to allow each individual to generate suggested outcome, process and balancing measures [9].
Table 1

Study Participants

PAR Workshop ParticipantsDelphi Participants
The 10 workshop participants included the following:Director Centre for Nurse Education;Assistant Director of Nursing;Consultant in Intensive Care;ED Business Manager; ED Clinical Nurse Manager; ED Clinical Nurse Facilitator;EM Consultant and Clinical Lead for the project; Professor of Health Systems and joint PI on project; Senior Research Fellow in Human Factors; Postdoctoral ResearcherRound 1:22 nursing staff, 19 medical staff, 5 academics and researchers, 2 managers and 1 Health and Social Care Professional.Round 2:12 nursing staff, 19 medical staff, 3 academics and researchers, 4 managers and 1 Health and Social Care Professional.Characteristics of Delphi panellists are outlined in Table 2.
Study Participants In Stage Two, an electronic modified-Delphi study was undertaken to develop a consensus on a suite of measures to be used for the evaluation of ED-ACE [10]. The evaluation workshop was the modification on the Delphi and the rest of the process proceeded as a standard Delphi. A multidisciplinary Delphi panel was created to include the research team (including Health Systems, Epidemiology and Public Health, Patient Q&S, and Human Factors researchers); members of the project’s Scientific Advisory Group; Consultants, Registrars, Advance Nurse Practitioners (ANPs) from the ED at the planned implementation site who had not been involved in the research; all members of the EMP and the EMP’s Emergency Medicine Nursing Interest Group (ENIG); the Lead EM Consultant and EM nursing leads in all similar-sized EDs in Ireland. Fifty-eight professionals in total were invited to participate, 49 of which participated in Round 1 and 39 in Round 2 (Tables 1 and 2).
Table 2

Characteristics of Delphi panellists

Round OneRound Two
Number invited to participate5858
Number of respondents49 (84.5%)39 (67.2%)
Professional backgroundNursing22 (44.9%)12 (30.8%)
Medical19 (38.8%)19 (48.7%)
Academic /Faculty3 (6.1%)2 (5.1%)
Managers2 (4.1%)4 (10.3%)
Researchers2 (4.1%)1 (2.6%)
Health and Social Care Professionals1 (2.0%)1 (2.6%)
Currently working in an EDYes38 (77.5%)31 (79.5%)
Based at hospital where ED-ACE implementation study being conductedYes17 (34.7%)16 (41.0%)
Characteristics of Delphi panellists The Delphi approach was used in this study to reach consensus because other commonly used consensus group methods (for example, focus groups) were not cost-effective and feasible due to the fact that the panel for this study represented diverse geographical locations, making it impractical and costly to meet in person [11]. Other reasons for using the Delphi consensus approach for this study include participant anonymity (to each other, though not to the study’s lead researcher) and the avoidance of groupthink or domination that might arise in a face-to-face discussion [12]. Using the multi-disciplinary expertise of the PAR group to develop and categorise the initial set of potential measures was an innovative and unique element of this study. The evaluation measures for the Delphi study were entered into SurveyMonkey software (https://www.surveymonkey.com/) to create an online/web-based electronic questionnaire that was used for both rounds of the Delphi study. A pilot study was then carried out based on three members of the research team assessing the content and flow of the draft questionnaire as well as ensuring the clarity of the measures and their categorisation. The feedback comments from the pilot study were used to create the final study questionnaire and an email containing the web link to the questionnaire was sent to all the Delphi panellists. This email also included a cover letter outlining the overall study objectives, how the initial list of measures was developed and an explanation of the Delphi process. Participation was on a voluntary basis and in keeping with the Delphi process participants were assured that their responses would be anonymous. Completion of the questionnaire was also taken as consent to participate. As the completion of the study questionnaire was anonymous, a background section called ‘Source of Expertise’ (Part A) was included in the questionnaire to capture background information (Please see Additional file 1: Appendix A for the Delphi R1 questionnaire).

Data and statistical analysis

Descriptive statistics were used to summarise the characteristics of the panellists who completed the questionnaires in each round. The mean Likert rating, standard deviation and 95% bias-corrected bootstrapped confidence interval (CI) for each variable was calculated. Greatorex and Dexter [13] concluded that, although statistics such as the mean and standard deviation assume an interval scale, the mean can be understood to represent group opinion and the standard deviation the amount of disagreement within the panel. Missing data was due to dropout with some panellists failing to progress through consecutive sections of the questionnaire (see below) although where panellists ranked measures within a section they ranked all measures. All responses and rankings were included in the analyses. Ensuring participant anonymity meant it was not possible to analyse at the individual (panellist) level change in relation to how measures, which were included in both rounds of the Delphi process, were ranked. However, where a variable reached the ‘high agreement’ threshold in R2 but not in R1, a one-sided test was used to determine whether the proportion of panellists who rated the variable as ‘Important’ or ‘Very Important’ in R2 was significantly higher than in R1. Bonferroni corrections were applied to take account of multiple testing. Data were analysed using Stata 14.0 SE.

Results

Stage one (workshop)

An initial list of 90 potential measures was developed at the workshop. After duplicates were removed there were 73 potential measures remaining (Fig. 1). A scoping literature review was conducted to explore evaluation measures used in other studies of longitudinal patient monitoring systems being implemented in the ED setting. Two additional measures were added following the literature review.
Fig. 1

Flowchart of development of measures

Flowchart of development of measures Following the workshop, the list of proposed measures was categorised into outcome, process and balancing measures and more specifically into the following categories: Treatment process measures examining the treatment process of patients in the ED and how that might be affected by ED-ACE. Implementation process measures exploring the implementation, receipt and setting of implementing ED-ACE and help in the interpretation of the outcome results. Outcome Measures to determine if ED-ACE results in an improvement in patient outcomes. Balancing measures exploring the use of ED-ACE from different dimensions and the possible impact of its use on other areas of the ED and the wider hospital system.

Stage two (Delphi consensus process)

The first round (R1) Delphi questionnaire contained 75 potential measures which participants were asked to rate on a 5-point Likert scale as very unimportant, unimportant, neither unimportant or important, important, very important for the evaluation of ED-ACE. This phase of the research was about determining the perceived importance of each measure by the key stakeholders. Assessing measures for feasibility and determining minimum data sets for measures would come later. At least 70% of Delphi panellists had to rate a potential measure in the ‘high agreement range’ (‘important’ and ‘very important’), for the measure to be selected for inclusion in the final suite of measures [11]. Participants were also invited to add any additional measures they felt should to be included in the Delphi round two (R2) questionnaire. The R1 questionnaire was emailed to 58 participants in August 2015. Personalised emails were sent to all the participants external to the project to improve the response rate. Two separate email reminders were sent after 3 weeks and then after one further week. The survey closed on 15th September 2015. Responses to R1 were analysed and proposed additional measures were collated (Fig. 1 provides a flowchart of the entire process). For R2 all 58 R1 participants were again invited to participate. This time they were asked to rate 27 potential measures. As in R1, personal email invitations to participate and reminders to the Delphi panellists were sent in R2.

Characteristics of the panel

Table 2 details the characteristics of the Delphi panellists. The same group of 58 stakeholders were invited to participate in both rounds of the Delphi study. The response rates were 84.5% for R1 and 67.2% for R2. In R1 thirty-nine panellists rated the treatment process measures, thirty-five the implementation process measures, thirty-four the outcome measures and thirty-one the balancing measures. In R2 thirty-five panellists rated the treatment process measures, thirty-two the implementation process measures, and thirty-one both the outcome measures and the balancing measures.

Stability of items ranked in both rounds

Twenty two variables, which did not meet the threshold for inclusion following R1, were revised if considered necessary for clarification, and panellists were asked to rate these again in R2 (in addition to other variables suggested by the Panellists in R1). 13 of these 22 variables were rated as ‘Important’ or ‘Very Important’ following R2 and these are listed in Table 3. However the proportion of panellists who ranked the variables as ‘Important’ or ‘Very Important’ in R2 was only significantly higher than in R1 for five of the variables. One of the 22 variables, “Number of patients who are in ED waiting for in-patient beds”, had the highest mean Likert score of all balancing measure variables across both rounds (4.33, 95%CI (3.98,4.63)).
Table 3

Variables from R1 attaining ‘high agreement’ in R2

Variable domainRound OneRound Twop-value
No of respondentsVariableNo (%) panellists ranking variable Important/ Very importantNo of respondentsVariableNo (%) panellists ranking variable Important/ Very important
Treatment39Time of completion of admitting/consulting team assessment to time of ED departure26 (66.7%)35Time of completion of admitting/consulting team assessment in the ED to time of ED departure25 (71.4%)0.66
Treatment39ICU admission rate22 (56.4%)35ICU admission rate30 (85.7%)0.01
Implementation35ICU referral rate23 (65.7%)32ICU referral rate25 (78.1%)0.78
Implementation35Resuscitation room activity level22 (62.9%)32Resuscitation room activity level30 (93.8%)0.01
Implementation35Number of times ISBAR communication tool was used to communicate the need for escalation22 (62.9%)32Number of times ISBAR communication tool was used to communicate the need for escalation25 (78.1%)0.52
Implementation35Health Professionals Work Index (HPWI) survey to measure autonomy and control over practice; work place relationships; managerial support and availability of resources20 (57.1%)32 Staff perception of availability of support and resources 27 (84.4%)0.04
Implementation35Number of people being triaged18 (51.4%)32Number of patients triaged27 (84.4%)0.01
Implementation35Minnesota Job Satisfaction Questionnaire to measure job satisfaction16 (45.7%)32 Job satisfaction for ED staff 30 (93.8%)< 0.001
Treatment (Round 1) /utcome (Round 2)39Admission to Intensive Care Unit (ICU) within 2 days of having been assessed and treated and deemed appropriate for admission to a hospital ward from ED25 (64.1%)31Admission to Intensive Care Unit (ICU) within 2 h of having been assessed and treated and deemed appropriate for admission to the ICU from the ED22 (71.0%)0.55
Outcome34In-hospital mortality rate23 (67.6%)31In-hospital mortality rate24 (77.4%)0.38
Balancing32Service delivery measured through e.g. resources (beds, equipment etc.) against recommended requirements22 (68.8%)30Service delivery measured through e.g. available resources (beds, equipment etc.) against recommended requirements23 (76.7%)0.72
Balancing32Average length of stay (AVLOS) in hospital for patients who come through ED22 (68.8%)30Average length of stay (AVLOS) in hospital for patients who come through ED23 (76.7%)0.72
Balancing32Number of patients waiting for in-patient beds21 (65.6%)30Number of patients who are in the ED waiting for in-patient beds24 (80.0%)0.30
Variables from R1 attaining ‘high agreement’ in R2

ED-ACE evaluation measures

A suite of 69 process (treatment and implementation process), outcome and balancing measures for evaluating the effectiveness of ED-ACE were developed (Fig. 1). Table 4 details the three measures in Delphi R1 and R2 with the highest mean Likert ratings per category thus leaving a set of the 12 highest-ranked measures. The purpose of using the Delphi technique was to reach consensus on which process, outcome and balancing measures would be the most important to evaluate ED-ACE. Table 4 therefore includes both those measures that reached consensus in R1, and therefore excluded from R2, as well as those reaching consensus in R2 across the four categories of process (treatment and implementation process), outcome and balancing measures [8]. Thus, while some clinical treatment variables may have rated higher than the top three balancing variables it was considered important to represent the top three variables across the four categories.Additional file 2 Appendix 2 provides a list of the mean Likert ratings for all of the proposed measures, the 69 that reached consensus as the most important and the remaining 34 that did not reach consensus.
Table 4

The proposed 12 measures with the highest mean ratings per IHI framework category

IHI CategoryHighest Mean Rated VariableMean Rating(95% CI)
(A) Treatment Process MeasuresEarly detection and treatment of patients at risk of sepsis4.95 [4.82, 4.97]
Early identification and treatment of life-threatening complications4.90 [4.82, 5.00]
Early detection and treatment of patients with chest pain at risk of myocardial infarction4.85 [4.74, 4.95]
(B) Implementation Process MeasuresNumber of patients who deteriorated as identified by ED-ACE4.63 [4.43, 4.77]
Number of patients whose care was escalated as a result of using ED-ACE4.54 [4.20, 4.74]
Number of re-triages that took place as a result of using D-ACE4.31 [4.08, 4.49]
(C) Outcome MeasuresReduction in the number of serious incidents in the ED4.59 [4.33, 4.77]
Reduction in the number of unexpected deaths in the ED4.41 [4.24. 4.62]
Prevalence of deterioration in ED patients4.38 [4.06, 4.68]
(D) Balancing MeasuresNumber of patients who are in the ED waiting for in-patient beds4.33 [3.98, 4.63]
Service delivery measured through actual staffing levels against recommended staffing levels4.25 [3.97, 4.53]
Staff adherence to treatment guidelines (e.g. treatment guidelines for acute stroke and acute myocardial infarction)4.19 [3.88, 4.53]
The proposed 12 measures with the highest mean ratings per IHI framework category

Discussion

The objective of this study was to reach a consensus regarding evaluation measures to measure the effectiveness of an ED-specific longitudinal patient monitoring system. While 69 measures reached consensus we would like to focus on the top- three ranked measures across the four IHI recommended categories for evaluating the implementation of a new Q&S initiative. The highest-ranked treatment measures in this study relate to life-threatening clinical conditions. This has face validity as the essence of the specialty of emergency medicine is to manage acute and urgent illness and injury. Early detection and treatment of patients at risk of sepsis is the highest-ranked measure in this study; this is not surprising given 60% of hospital deaths in Ireland have an infection or sepsis diagnosis and there has been considerable work done nationally and internationally on improving detection and treatment of sepsis [14]. Considering approximately 1 in 25 patients attending an Irish ED may have sepsis [15] and 1 in 100 patients attending an Irish ED may have severe sepsis or septic shock [16], this measure has high clinical relevance. Chest pain accounts for 5–20% of all ED admissions [17]. Causes of chest pain range from the benign (e.g., musculoskeletal chest pain) to potentially life-threatening conditions (e.g., acute coronary syndrome). The early detection and treatment of patients with chest pain at risk of myocardial infarction was rated highly in this study. The highest-ranked implementation process measures in this study were the number of patients who deteriorated, whose care was escalated, and who were re-triaged as a result of using ED-ACE. The highest-ranked outcome measures related to the reduction in the number of serious incidents and unexpected deaths in the ED and prevalence of deterioration in ED patients. These outcome measures are consistent with the underlying principle of longitudinal patient monitoring systems, to detect and prevent patient deterioration [1]. Balancing measures are meant to detect any unintended consequences of implementing a new intervention [18]. The highest-ranked balancing measures related to the number of patients who are in the ED waiting for in-patient beds. This may reflect concerns about the possibility of worsening chronic crowding in most EDs if use of ED-ACE resulted in more patients being transferred from the ED waiting room to the already crowded clinical care areas within the ED. The second highest ranked balancing measure was service delivery as measured by actual staffing levels against recommended staffing levels. Considering ED staffing is the single most important factor in providing a high quality, timely and clinically effective service to patients [19], this may reflect concerns about the ED staffing resource needed to carry out regular monitoring of ED patients. The third balancing measure chosen, staff adherence to treatment guidelines (e.g., for acute stroke), may also reflect concerns about the adequacy of ED staffing resource to implement ED-ACE while concomitantly delivering patient care consistent with existing clinical guideline recommendations. Given resource limitations and ED crowding it is possible that shortcuts may be taken in implementing all components of ED-ACE. This is something that would need to be monitored during implementation.

Limitations

Firstly, valid and reliable measures depend on the availability of high quality data [20] and while this study is the first step in developing consensus on a suite of measures, a separate study is required to determine how feasible and cost-effective it will be to collect the minimum data set required for implementing the measures [21]. Secondly, while we considered the views of frontline staff and researchers, there was no patient representation in the consensus development process. Thirdly, while the response rate to R1 was high (84.5%) this did drop to 67.2% in R2. In order to maintain the rigour of this technique, a response rate of 70% is suggested, but as Hasson et al. [22] note to achieve this, the researcher must know the identity of respondents, and non-respondents must be pursued individually. Given that our study was anonymous and we could not pursue participants individually a response rate of 67.2% can be considered acceptable. Also given that our study was anonymous we were unable to examine changes within individual respondents between rounds. Finally, the study investigators assumed that all the Delphi panellists had relevant knowledge regarding the evidence base for all the measures that reached consensus. However, it is conceivable that not all the panellists were aware of the evidence supporting all the measures that reached agreement.

Conclusion

The Delphi technique has been used previously to develop quality of clinical care indicators for EDs [20]. In this study it has proved effective in contributing to the development of a suite of 12 treatment and implementation process, outcome and balancing measures for measuring the effectiveness of an ED-specific longitudinal patient monitoring system. In our knowledge, this is the first study to develop a suite of measures to evaluate the effectiveness of an ED-specific longitudinal patient monitoring system. While acknowledging the limitations outlined above, we consider this study a necessary starting point for the development of valid and reliable measures to evaluate the effectiveness of ED-ACE. Appendix 1: Delphi Instrument Round 1 This contains the full delphi questionnaire for Round 1 of the study (PDF 240 kb) Appendix 2: Delphi Measures Mean Likert Scores This provides a list of the mean Likert ratings for all of the proposed measures, the 69 that reached consensus as the most important and the remaining 34 that did not reach consensus. (DOCX 39 kb)
  17 in total

Review 1.  Research guidelines for the Delphi survey technique.

Authors:  F Hasson; S Keeney; H McKenna
Journal:  J Adv Nurs       Date:  2000-10       Impact factor: 3.187

Review 2.  An accessible analytical approach for investigating what happens between the rounds of a Delphi study.

Authors:  J Greatorex; T Dexter
Journal:  J Adv Nurs       Date:  2000-10       Impact factor: 3.187

3.  Current use of early warning scores in UK emergency departments.

Authors:  James R Griffiths; Elizabeth M Kidney
Journal:  Emerg Med J       Date:  2011-10-01       Impact factor: 2.740

4.  The Prevalence of Severe Sepsis or Septic Shock in an Irish Emergency Department.

Authors:  C McNevin; R McDowell; F Fitzpatrick; R O'Sullivan; A Wakai
Journal:  Ir Med J       Date:  2018-02-09

Review 5.  The Delphi technique: a worthwhile research approach for nursing?

Authors:  H P McKenna
Journal:  J Adv Nurs       Date:  1994-06       Impact factor: 3.187

6.  Consensus methods: characteristics and guidelines for use.

Authors:  A Fink; J Kosecoff; M Chassin; R H Brook
Journal:  Am J Public Health       Date:  1984-09       Impact factor: 9.308

7.  Prevention of deterioration in acutely ill patients in hospital.

Authors:  Colin Steen
Journal:  Nurs Stand       Date:  2010 Aug 11-17

Review 8.  [Chest pain units. Organization and protocol for the diagnosis of acute coronary syndromes].

Authors:  Julián Bayón Fernández; Eduardo Alegría Ezquerra; Xavier Bosch Genover; Adolfo Cabadés O'Callaghan; Ignacio Iglesias Gárriz; José Julio Jiménez Nácher; Félix Malpartida De Torres; Ginés Sanz Romero
Journal:  Rev Esp Cardiol       Date:  2002-02       Impact factor: 4.753

9.  Design and Implementation of a Trauma Care Bundle at a Community Hospital.

Authors:  Ryan Andres; Elan Hahn; Steffen de Kok; Rafi Setrak; Jeffrey Doyle; Allison Brown
Journal:  BMJ Qual Improv Rep       Date:  2017-05-17

Review 10.  The effectiveness of physiologically based early warning or track and trigger systems after triage in adult patients presenting to emergency departments: a systematic review.

Authors:  Francesca Wuytack; Pauline Meskell; Aislinn Conway; Fiona McDaid; Nancy Santesso; Fergal G Hickey; Paddy Gillespie; Adam J N Raymakers; Valerie Smith; Declan Devane
Journal:  BMC Emerg Med       Date:  2017-12-06
View more
  1 in total

1.  Wavelet Transform Artificial Intelligence Algorithm-Based Data Mining Technology for Norovirus Monitoring and Early Warning.

Authors:  Xucheng Fan; Na Xue; Zhiguo Han; Chao Wang; Heer Ma; Yaoqin Lu
Journal:  J Healthc Eng       Date:  2021-09-17       Impact factor: 2.682

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.