| Literature DB >> 36040787 |
Cabella Lowe1, Mitchell Browne1, William Marsh2, Dylan Morrissey1,3.
Abstract
BACKGROUND: Musculoskeletal disorders negatively affect millions of patients worldwide, placing significant demand on health care systems. Digital technologies that improve clinical outcomes and efficiency across the care pathway are development priorities. We developed the musculoskeletal Digital Assessment Routing Tool (DART) to enable self-assessment and immediate direction to the right care.Entities:
Keywords: acceptability; digital health; digital technology; eHealth; mHealth; mobile health; mobile phone; musculoskeletal; physiotherapy triage; triage; usability
Mesh:
Year: 2022 PMID: 36040787 PMCID: PMC9472040 DOI: 10.2196/38352
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 7.076
Figure 1The Digital Assessment Routing Tool mobile health system.
Figure 2Integration of the DART mobile health system within an existing musculoskeletal disorder pathway. DART: Digital Assessment Routing Tool.
Figure 3DART usability study iterative, convergent mixed methods design. New participants were recruited for each testing round. Participants raising specific issues in previous rounds were invited individually to review and provide feedback on changes. DART: Digital Assessment Routing Tool; eHEALS: eHealth Literacy Scale; ISO: International Organization for Standardization.
Figure 4Data collection methods used to assess DART performance against the International Organization for Standardization 9241-210-2019 and International Organization for Standardization 30071-1-2019 standards constructs of effectiveness, efficiency, and satisfaction [30] and accessibility [31]. DART: Digital Assessment Routing Tool.
Figure 5The convergent mixed methods design, where both data types are collected simultaneously to allow the analysis and grading of usability problems, thus informing the next system iteration.
Usability problem grading criteria, adapted from guidance issued by The Food and Drug Administration [49].
| Grade | Impact | Frequency | Implications | Action |
| 1 | High | High, moderate, or low | Prevents effective use of the system | Address in next study iteration |
| 2 | Moderate or low | High or moderate | Affects the quality of system delivery | Address in next study iteration |
| 3 | Moderate or low | Low or moderate | Minor issues for several users or a small number of users highlighting concerns important to them | Document and address in later development |
| 4 | Low | Low | Small issues that, if resolved, could improve user satisfaction | Document and address in later development |
Participant characteristics (N=22).
| Characteristic | Daily internet users | Infrequent internet users | ESOLa,b | All groups | |
| Total sample, n (%) | 19 (86) | 3 (14) | 6 (27) | 22 (100) | |
|
| |||||
|
| Values, mean (SD) | 47.6 (15.7) | 55 (11.4) | 41 (8.5) | 48.6 (15.2) |
|
| Values, range | 20-77 | 47-68 | 31-55 | 20-77 |
| Sex (male), n (%) | 9 (41) | 1 (5) | 3 (14) | 10 (46) | |
|
| |||||
|
| Values, mean (SD) | 29 (8) | 25 (4) | 26 (12.3) | 28.8 (7.8) |
|
| Values, range | 8-38 | 21-29 | 8-37 | (8-38) |
aESOL: English for speakers of other languages.
bAll ESOL participants were also daily internet users.
ceHEALS: eHealth Literacy Scale.
Recruitment matrix showing minimum quotas and number of participants recruited by characteristics of interest (N=22)a.
| Characteristic | Daily internet user (n=19) | Infrequent internet user (n=3) | |||
|
| Quota | Enrolled, n (%) | Quota | Enrolled, n (%) | |
|
| |||||
|
| 18-54 | 2-4 | 7 (37) | 1-3 | 2 (67) |
|
| 55-74 | 2-4 | 10 (53) | 1-3 | 1 (33) |
|
| ≥75 | 1-3 | 1 (5) | 2-4 | 0 (0) |
|
| |||||
|
| Male | Minimum 6 | 7 (37) | Minimum 4 | 1 (33) |
|
| Female | Minimum 6 | 10 (53) | Minimum 4 | 2 (67) |
|
| |||||
|
| Non-ESOL | Minimum 6 | 15 (79) | Minimum 6 | 3 (100) |
|
| ESOL | Minimum 2 | 6 (32) | Minimum 2 | 0 (0) |
aTotal study participants quota was 20.
bESOL: English for speakers of other languages.
Figure 6Relationship between age and eHealth literacy scores across all participants. (eHEALS scores range from 0 to 40 scale, with higher scores being better). eHEALS: eHealth Literacy Scale.
Figure 7Body sites selected by participants. The number of selections represents the total of the front, back, and either side of a given body site. The Digital Assessment Routing Tool algorithms are designed to assess for musculoskeletal disorder conditions that occur or refer to pain in the selected body site.
Figure 8Number of usability problems across testing rounds by grade. The incidence and problem grading changed over the 5 rounds of testing, with grade 1 and 2 problems being negated or reduced to a lower grade. All grade 3 and 4 issues were documented, reviewed, and prioritized for future Digital Assessment Routing Tool development.
Figure 9Grade 1 and 2 usability problem themes and underlying subthemes. MSD: musculoskeletal disorder.
Grade 1 and 2 usability problem themes, subthemes, and participant quotes over 5 rounds of testinga.
| Underlying theme and subthemes | Usability problem grade of subtheme (testing round) | Participant quotes | |||
|
| |||||
|
| Incorrect body site selected |
1 (1) 2 (2) 2 (3) 4 (4) 4 (5) |
“If people are like me, they don't read things properly, especially at the beginning. Was there an option to start again, because people might mess up?” [DART005] “The only time that I felt slightly lacking in confidence was on the body site. That was the only time I wasn’t sure the system would grab the information I clicked.” [DART002] | ||
|
| Question inappropriately triggering recommendation—systemic inflammatory disease and central nervous system condition |
1 (1) 1 (2) 0 (3) 0 (4) 0 (5) |
“I think that came because I said that I'm stiff in the morning, that eight minutes or whatever.” [DART006] “So I read weakness, not severe weakness. So, I can stand on it and support my weight, but it just hurts like hell, rather than not being able to support myself.” [DART015] | ||
|
| |||||
|
| Unable to select secondary body site |
0 (1) 2 (2) 2 (3) 0 (4) 0 (5) |
“I think if I could have put more evidence in, then I’d be more likely to follow the recommendation at the end because I think it was relevant to me.” [DART006] “I suppose you could differentiate slightly more between the source point of the pain and the consequences for your other limbs like, you know, I knew very well that it was bad at the back that was causing my inability to walk. So maybe distinction between primary pain and a secondary or referred pain might be useful.” [DART010] | ||
|
| Impact of existing and previous MSDc |
2 (1) 0 (2) 0 (3) 0 (4) 4 (5) |
“I don’t want to waste the GP’s time or my time waiting for an appointment to be told what I already know. So, in my two cases, it wasn’t so much about diagnosis is more of an okay, this has returned. We know the course of action.” [DART019] “I wasn't sure whether sometimes we’re talking about what it’s like when it’s really bad, or what it’s like in general.” [DART005] | ||
|
| Impact of mental health status |
0 (1) 2 (2) 3 (3) 3 (4) 3 (5) |
“I think one way of making it better is also seeing how it affects someone psychologically as well. I think that this is something which can sometimes be overlooked, but I think it's important to see how it is impacting on someone's emotional wellbeing?” [DART001] “When you seek sort of medical advice, or you have a condition that gives you worry and anxiety, probably you expect a little bit more than just sort of self-treatment.” [DART016] | ||
|
| Recommendations not sufficiently personalized |
4 (1) 2 (2) 0 (3) 0 (4) 0 (5) |
“I guess, it might need to be a little bit more personalized recommendations depending on what people choose.” [DART001] “I suppose the only thing that might dissuade people, would be that if they were users of it, and it came up with the same sort of end page every time.” [DART002] | ||
|
| |||||
|
| Specific questions (work status, previous treatment, and comorbidities) |
2 (1) 3 (2) 4 (3) 0 (4) 4 (5) |
“You were distinguishing between people who were employed, and people who are not employed. It just seemed to me as though there was quite a large category of people lumped together in that one box and maybe it would be better to differentiate them a bit more, so that they did actually tick retired or they ticked student” [DART010] “Where you were asked whether you'd had surgery or physio, it just was rather a broad question. I thought maybe it should have been a tick box for that to show which one you’d had.” [DART004] “You’re a little bit unsure about whether it’s really registered to your osteoporosis.” [DART010] | ||
aUsability problems were clustered into subthemes based on specific areas of DART functionality. Problem grades were reduced in severity over testing rounds as problems were negated or reduced during DART iterations (grade 1 is the most severe, and grade 4 is the least severe).
bDART: Digital Assessment Routing Tool.
cMSD: musculoskeletal disorder.
Display of quantitative data by International Organization for Standardization 9241-210-2019 standard constructs (effectiveness, efficiency, and satisfaction) over 5 testing roundsa.
| Construct, goal, and testing round | Result | ||||
|
| |||||
|
|
| ||||
|
|
| Round 1 | 13 (100) | ||
|
|
| Round 2 | 11 (100) | ||
|
|
| Round 3 | 11 (100) | ||
|
|
| Round 4 | 10 (100) | ||
|
|
| Round 5 | 5 (100) | ||
|
|
| ||||
|
|
| Round 1 | 11 (85) | ||
|
|
| Round 2 | 5 (45) | ||
|
|
| Round 3 | 11 (100) | ||
|
|
| Round 4 | 10 (100) | ||
|
|
| Round 5 | 10 (100) | ||
|
|
| ||||
|
|
| Round 1 | 13 (100) | ||
|
|
| Round 2 | 9 (82) | ||
|
|
| Round 3 | 11 (100) | ||
|
|
| Round 4 | 8 (80) | ||
|
|
| Round 5 | 5 (100) | ||
|
|
| ||||
|
|
| Round 1 | 12 (92) | ||
|
|
| Round 2 | 8 (73) | ||
|
|
| Round 3 | 11 (100) | ||
|
|
| Round 4 | 8 (80) | ||
|
|
| Round 5 | 4 (80) | ||
|
| |||||
|
|
| ||||
|
|
|
| |||
|
|
|
| Values, mean (SD) | Not recorded | |
|
|
|
| Values, range | Not recorded | |
|
|
|
| |||
|
|
|
| Values, mean (SD) | 5.7 (5.35) | |
|
|
|
| Values, range | 1-18 | |
|
|
|
| |||
|
|
|
| Values, mean (SD) | 5.4 (4.54) | |
|
|
|
| Values, range | 1-15 | |
|
|
|
| |||
|
|
|
| Values, mean (SD) | 3.5 (1.5) | |
|
|
|
| Values, range | 1-5 | |
|
|
|
| |||
|
|
|
| Values, mean (SD) | 7.4 (2.13) | |
|
|
|
| Values, range | 3-17 | |
|
|
|
| |||
|
|
|
| Values, mean (SD) | 5.2 (4.44) | |
|
|
|
| Values, range | 1-18 | |
|
|
| ||||
|
|
| Round 1 | 0 | ||
|
|
| Round 2 | 0 | ||
|
|
| Round 3 | 0 | ||
|
|
| Round 4 | 0 | ||
|
|
| Round 5 | 0 | ||
|
|
| ||||
|
|
| Round 1 | 1 | ||
|
|
| Round 2 | 2 | ||
|
|
| Round 3 | 2 | ||
|
|
| Round 4 | 2 | ||
|
|
| Round 5 | 6 | ||
|
| |||||
|
|
| ||||
|
|
|
| |||
|
|
|
| Values, n (%) | 5 (23) | |
|
|
|
| Values, mean (SD) | 91.6 (4.23) | |
|
|
|
| Margin of error | 4.46 or –4.46 | |
|
|
|
| |||
|
|
|
| Values, n (%) | 6 (27) | |
|
|
|
| Values, mean (SD) | 87 (10.23) | |
|
|
|
| Margin of error | 12.72 or –12.72 | |
|
|
|
| |||
|
|
|
| Values, n (%) | 5 (23) | |
|
|
|
| Values, mean (SD) | 79.5 (16.91) | |
|
|
|
| Margin of error | 21.02 or –21.02 | |
|
|
|
| |||
|
|
|
| Values, n (%) | 2 (9) | |
|
|
|
| Values, mean (SD) | 78.8 (18.75) | |
|
|
|
| Margin of error | N/Af | |
|
|
|
| |||
|
|
|
| Values, n (%) | 4 (18) | |
|
|
|
| Values, mean (SD) | 78.8 (5.73) | |
|
|
|
| Margin of error | 9.11 or –9.11 | |
aQuantitative data show the number of participants in each round and the percentage that achieved the construct theme.
bTime taken to complete an assessment (time taken to reach a disposition was not measured during round 1, as the “think-aloud” method of data capture was prioritized at this stage); number of system errors where the participant was unable to navigate to the end of the assessment because of a system technical error; backsteps where the participant moved back to the previous question.
cDART: Digital Assessment Routing Tool.
dSystem Usability Scale scores by round, group of interest, and across all groups.
eResponses were scored on a 5-point Likert scale (1=strongly disagree and 5=strongly agree) and converted to a score of between 0 and 4, with 4 being the most positive usability rating. Converted scores for all participants are multiplied by 2.5 to give a range of possible total values from 0 to 100. We used 90% CI to allow the benchmarking of the overall DART System Usability Scale score with other studies using this value [46].
fN/A: not applicable.
System Usability Scale score per group for construct 3 (satisfaction) of the International Organization for Standardization 9241-210-2019 standard.
| System Usability Scalea score per group | Daily internet users (n=19) | Infrequent internet users (n=3) | ESOLb internet users (n=6) | All participants (n=22) |
| Values, mean (SD) | 86.5 (4.48) | 70.8 (5.44) | 78.1 (4.60) | 84.3 (12.73) |
| Margin of error | 1.78 or –1.78 | 9.17 or –9.17 | 3.79 or –3.79 | 4.67 or –3.79 |
aResponses were scored on a 5-point Likert scale (1=strongly disagree and 5=strongly agree) and converted to a score of between 0 and 4, with 4 being the most positive usability rating. Converted scores for all participants are multiplied by 2.5 to give a range of possible total values from 0 to 100. We used 90% CI to allow the benchmarking of the overall Digital Assessment Routing Tool System Usability Scale score with other studies using this value [46].
bESOL: English for speakers of other languages.
Figure 10Combined measures of effectiveness by testing round. Results are displayed as the percentage of total assessments to allow comparison, as there were different numbers of participants and assessments in each round. The percentage of assessments in each round resulted in a positive response to the following queries: (1) whether a disposition was achieved, (2) whether it was a clinically correct disposition, (3) whether the participant would trust the disposition, and (4) whether the participant would act on the disposition.
Figure 11Efficiency (time to complete an assessment). Time taken for participants to complete assessments. A total of 16 participants completed 34 assessments in total across rounds 2 to 5. Time was not recorded in round 1, as participants were encouraged to use the “think-aloud” technique.
Figure 12Measures of efficiency (backsteps) by testing round. Number of times a participant moved back a step in the question set to review their previous question and response, with backsteps shown as a percentage of the total number of questions in the assessment.
Figure 13Measure of satisfaction by testing round. SUS: System Usability Scale.
Display of qualitative data by International Organization for Standardization 9241-210-2019 standard constructs (effectiveness, efficiency, and satisfaction)a.
| Construct and goal | Participant quotes | ||
|
| |||
|
| Assessment results for a recommendation being given |
“I found it really user friendly and I found I could read the questions quite quickly and just give an answer and move on.” [DART018] | |
|
| Assessment results for a correct clinical recommendation; |
“I expected the area [selected body part] that I chose to change color, I would do it a different color, red or something like that.” [DART005] | |
|
| Assessment of whether the participant would trust |
“It might be easier if you just say have a secondary field to sort of like give your secondary issues as well. You know, sometimes it's just not, it's like the neck runs into the arm and lower parts, but it can be different things as well.” [DART014] “It might make people feel a bit more confident that they've done it right.” [DART015] | |
|
| Assessment of whether the participant would act upon |
“I think if I could have put more evidence in, then I’d be more likely to follow the recommendation at the end, because I think it was relevant to me.” [DART006] | |
|
| |||
|
| Time taken to reach recommendation (minutes) |
“It was very quick. And I quite like that it has one thing for one page, which is a very short question, it gives you a few options, and then you answer so you don't have to go through long text questions, one after the other. So, it just takes you very quickly step by step. And it's quite, I don’t know, for me, it was super easy and clear to answer questions.” [DART017] | |
|
| DARTb system errors |
“I found it really simple system to use very, very easy and had no problems at all.” [DART010B] | |
|
| DART system backsteps |
“That was a question about whether I’d been off work for a long time and if I’m employed or self-employed, something that I didn’t find quite straightforward.” [DART020] | |
|
| |||
|
| System Usability Scale score per round |
“If I had this actual system, I would have saved £150 in cash and probably three months of pain had I been able to access it when I had my problems with my back.” [DART013] “It’s done me a favor actually, because I was in two minds whether to try and get a private injection, whether to go to an osteopath or physio. I think it might save me money in the long run.” [DART014] | |
aParticipant quotes provide a deeper understanding of system performance and usability problems.
bDART: Digital Assessment Routing Tool.