| Literature DB >> 28624991 |
Nicolas S Marjanovic1,2, Agathe De Simone3, Guillaume Jegou3, Erwan L'Her4,5.
Abstract
BACKGROUND: This study aimed to provide a new global and comprehensive evaluation of recent ICU ventilators taking into account both technical performances and ergonomics.Entities:
Year: 2017 PMID: 28624991 PMCID: PMC5474229 DOI: 10.1186/s13613-017-0285-2
Source DB: PubMed Journal: Ann Intensive Care ISSN: 2110-5820 Impact factor: 6.925
Fig. 1The concept of devices’ global evaluation. In order to assess ‘efficacy’ of a device, we considered that a global ergonomics evaluation required evaluating concomitantly ‘efficiency’, ‘engagement’, ‘ease of use’ and ‘tolerance to error’, as these four dimensions may be considered as interdependent. Tolerance to error was evaluated through the objective tasks completion scenarios. While considering that an easy-to-use device should be easily managed by a skilled physician, but not familiar to that specific device, we took particular attention to included naive subjects in the evaluation. Bench testings do explore the most important technical determinants of the efficiency of a device (tidal volume accuracy, triggering, etc.). Efficiency assessment might also include interfaces’ performance evaluation. For such sake, we used pupillary diameter variation which can be considered as a determinant of stress. Other eye-tracking tools such as blinking measurements or heat mapping may also have been used. Engagement during use of the device was evaluated through the use of psycho-cognitive scales, combined with physiological parameters measurements. Heart rate variability (HRV) may also have been measured for such sake
Fig. 2Box plot of tidal volume (VT) (a), positive end-expiratory pressure (PEEP) (b) and pressure support (PS) (c). Dotted lines represent the 10% error range. Black line represents exact VT value delivery. Values are provided as median and interquartile. A p value equal or below 0.05 was considered significant. *p < 0.05; § p < 0.05 as compared to S1, PB980 and Servo-U. † p < 0.005 as compared to R860, PB980 and Servo-U. There was a significant difference in terms of VT delivery precision between devices (a; p = 0.0498). All devices except S1 depicted a median VT value within the 10% error range. PB980 had the lowest error in terms of VT delivery, but Servo-U had the highest precision in response to respiratory mechanics modifications. V500 and V680 had relatively low error, but low precision in response to respiratory mechanics modifications. While median PEEP delivery was within the reliability range for all devices (b), V500 was significantly different to the other devices in terms of precision. S1 had the lowest error, but with rather low precision. Servo-U, PB980 and R860 had low error and high precision. When considering mean values, three devices delivered PS values higher than the reliability range (Servo-U: 13 ± 6%; PB980: 11 ± 12%; S1: 17 ± 14.4%; p < 0.001). Pressure support delivery was higher than the 10% error range for two devices (Servo-U and S1). Precision in response to respiratory mechanics modifications was low for PB980, S1 and R860
Fig. 5Task Load Index and System Usability Scale scores. Dotted line represents the mean value across all scores (Avea excluded). SUS consists of a ten-item questionnaire and assesses usability from different aspects: effectiveness (ability of users to complete tasks); efficiency (level of resource used in performing tasks); and satisfaction (subjective reactions to using the system). SUS score has a range of 0–100, the highest score being the best value (‘simple to use’). NASA-TLX is a multidimensional tool developed for mental workload evaluation. It explores three dimensions dependent on user perception of the task (mental workload, temporal workload and physical workload) and three dimensions dependent on the interaction between the subject and the task itself, which may be mostly related to the interface (effort, performance and frustration). An individual weighting of these dimensions by letting the subjects compare them pairwise enables a decrease in the inter-/intra-individual variability of the overall score. The higher the TLX, the lower the ergonomics. Our reference device (Avea) had the best TLX and SUS scores, and V680 the worst (p = 0.049). For usability (SUS), a difference between our reference device (Avea) and S1 was also observed. *p value <0.05
Objective tasks completion rate
| Device | Power on (%) | Start ventilation (%) | Inspiratory flow setting (%) | Ventilatory mode modification (%) | Cycling setting (%) | NIV mode activation (%) |
|---|---|---|---|---|---|---|
| Carefusion Avea | 92 | 100 | 100 | 100 | 75 | 75 |
| Dräger V500 | 100 | 92 | 100 | 100 | 92 | 50 |
| Covidien PB980 | 100 | 100 | 100 | 91 | 100 | 100 |
| Philips V680 | 100 | 100 | 50§ | 91 | 91 | 40† |
| Hamilton S1 | 91 | 91 | 18§ | 82 | 91 | 91 |
| GE R860 | 100 | 100 | 100 | 91 | 100 | 91 |
| Maquet Servo-U | 36* | 100 | 0¤ | 100 | 45‡ | 27† |
Results are presented as the different objective tasks success rate, expressed as the percentage of successful attempts. ICU’s physicians with mechanical ventilation’s knowledge had to complete 11 specific tasks of variable clinical importance for each ventilator, 4 mainly dedicated to monitoring and 7 to setting. Overall provides results of the entire bunch of test (first value), or after powering on/switching off tasks exclusion (second value)
Among all ventilators, our reference device the Avea had the better success rate. In between our comparison of the six ventilators, Covidien PB980 had the better results and the Servo-U the worst. A minority of users could power on Servo-U and always over the predefined 120 s time range. Settings reading represented the more difficult task, whatever the device, while their terminology was highly different from one device to the other
* p < 0.01 as compared to others. § p < 0.001 as compared to Avea, V500 and PB980. ¤ p < 0.01 as compared to V680. ‡ p < 0.05 as compared to others except Avea. † p < 0.05 as compared to PB980 S1 and R860. ‖ p < 0.001 as compared to Avea and PB980. # p < 0.01 as compared to others, except S1 and V680