| Literature DB >> 33787894 |
Martin Hensher1,2, Paul Cooper1,2, Sithara Wanni Arachchige Dona1,2, Mary Rose Angeles1,2, Dieu Nguyen1,2, Natalie Heynsbergh1,3, Mary Lou Chatterton1,2, Anna Peeters1.
Abstract
OBJECTIVE: The study sought to review the different assessment items that have been used within existing health app evaluation frameworks aimed at individual, clinician, or organizational users, and to analyze the scoring and evaluation methods used in these frameworks.Entities:
Keywords: assessment criteria; evaluation framework; health apps; scoring and scaling
Year: 2021 PMID: 33787894 PMCID: PMC8263081 DOI: 10.1093/jamia/ocab041
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.PRISMA (Preferred Reporting Items for Systematic Reviews and Meta-Analyses) diagram.
The distribution of frameworks across studies
| Framework | Name of the framework |
|---|---|
| 1. Studies that used a single existing framework for app evaluation (n = 10) |
MARS (n = 1) APA (n = 2) CRAAP (n = 1) ORCHA-24 (n = 1) SUMI (n = 1) SUS (n = 1) Psychological Component Checklist (n = 1) A synoptic framework (n = 1) The APPLICATION scoring system (n = 1) |
| 2. Studies that self-developed a framework for evaluation (n = 63) |
MARS (n = 1) uMARS (n = 1) The Health IT Usability, Evaluation Model (Health-ITUEM) (n = 1) Expert-Based Utility Evaluation (n = 1) The APPLICATION scoring system (n = 1) App Chronic Disease Checklist (n = 1) Nutrition App Quality Evaluation (AQEL) (n = 1) Enlight (tool for mobile and Web-based eHealth interventions) (n = 1) mHealth Emergency Strategy Index (n = 1) MedAd-AppQ Medication Adherence App Quality assessment tool (n = 1) Digital Health Scorecard (n = 1) Design and Evaluation of Digital Health Intervention Frameworks (n = 1) The mobile Health App Trustworthiness (mHAT) checklist) (n = 1) Ranked health (n = 1) PsyberGuide (n = 1) No particular name (n = 48) |
| 2.1 Frameworks that influenced to develop new framework |
MARS (n = 2) Persuasive system design principles (n = 1) Nielsen Usability Model (n = 1) Technology Acceptance Model (n = 1) |
| 2.2 guidelines that used to develop new framework |
U.S. Public Health Services Clinical Practice Guidelines (n = 1) UK BTS/SIGN, U.S. EPR-3, and international GINA guidelines (n = 1) |
| 3. Studies that used a combination of self-developed and existing frameworks for evaluation (n = 6) |
Brief DISCERN Instrument (n = 1) Silber scale (n = 2) Health-ITUES (n = 2) Tool used by Cruz—tool for measuring the compliance with Android and iOS guidelines (n = 1) Tool for measuring the User QoE by Martines-Perez 2013 (n = 1) Abbott Scale for Interactivity (n = 1) The Health On the Net Code Criteria (n = 1) The Technology Acceptance Model (n = 1) Usability framework of TURF (n = 1) Chinese Guideline for the Management of Hypertension (n = 1) The Anxiety and Depression Association of America (n = 1) PsyberGuide (n = 1) |
| 4. Use of survey tools | Use of an existing or self- developed surveys (n = 10) |
| 5. Not relevant | Review or opinion papers (n = 8) |
APA: American Psychiatric Association; BTS: British Thoracic Society; CRAAP: Currency, Relevance, Authority, Accuracy, and Purpose; EPR-3: Expert Panel Report 3; GINA: Global Initiative for Asthma; Health-ITUES: Health Information Technology Usability Evaluation Scale; MARS: Mobile App Rating Scale; ORCHA-24: Organisation for the Review of Care and Health Applications–24-Question Assessment; SIGN: Scottish Intercollegiate Guidelines Network; SUMI: Standardized Software Usability Measurement Inventory; SUS: System Usability Scale; TURF: Task, User, Representation and Function;
Commonly identified domains from health app evaluation frameworks
|
|
|
|
|---|---|---|
| 01 | Clarity of purpose of the app | A clear statement of the intended purpose of the app as well as the specificity of the users or the disease. |
| 02 | Developer credibility | Transparency of the app development and testing process, and accountability and credibility of the app developer, funders, affiliations, and sponsors. |
| 03 | Content/information validity | Readability, credibility, characteristics, quality, and accuracy of the information in the health app. The ability to tailor the app content per user preference and using simple language. |
| 04 | User experience | The overall experience of using an app in terms of its user friendliness, design features, functionalities, and ability to consider user preference through personalization function. |
| 05 | User engagement/adherence and social support | The extent of how apps maintain user retention using functionalities such as gamification, forums, and the use of behavior techniques as well as the extent of social support. |
| 06 | Interoperability | Data sharing and data transfer capabilities of the health apps. |
| 07 | Value | Perceived benefits and advantages associated with the use of health app. |
| 08 | Technical features and support | Health apps that are free from defects, errors, bugs, and quantity and timely updates. Technical support and service quality provided within the app. |
| 09 | Privacy/security/ethical/legal | Privacy and security domains pertain to data protection, cybersecurity, and encryption mechanisms for the storage and data transmission. Legalities of the health app that look at whether the health apps adhere to guidelines and have disclaimers concerning on clinical accountability. |
| 10 | Accessibility | This pertains to the ability of health apps to capture a wider audience and bridge the gap in access to health apps and healthcare services for vulnerable populations/people with disabilities. |
Distribution of domains discussed across studies by year
| Domain | Number of studies reported on each domain by year | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 2011 | 2012 | 2013 | 2014 | 2015 | 2016 | 2017 | 2018 | 2019 | 2020 | |
| Clarity of purpose of the app | 0 | 1 | 0 | 2 | 1 | 2 | 2 | 5 | 4 | 2 |
| Developer credibility | 0 | 1 | 0 | 3 | 1 | 6 | 1 | 3 | 6 | 3 |
| Content/information validity | 1 | 2 | 2 | 5 | 8 | 10 | 4 | 13 | 9 | 4 |
| User experience | 1 | 2 | 4 | 8 | 10 | 13 | 9 | 16 | 12 | 5 |
| User engagement/adherence and social support | 1 | 0 | 1 | 1 | 3 | 8 | 3 | 6 | 8 | 1 |
| Interoperability | 0 | 2 | 1 | 0 | 1 | 1 | 1 | 6 | 3 | 2 |
| Value | 0 | 1 | 1 | 3 | 3 | 5 | 5 | 6 | 5 | 2 |
| Technical features and support | 0 | 1 | 1 | 1 | 1 | 2 | 1 | 6 | 7 | 2 |
| Privacy/security/ethical/legal | 0 | 1 | 2 | 1 | 2 | 3 | 2 | 11 | 7 | 3 |
| Total identified studies | 1 | 3 | 4 | 10 | 11 | 16 | 10 | 20 | 16 | 6 |
Figure 2.Categories of app assessment criteria respective to identified domains. AE: adverse events; HC: health care.
The number of unique assessment criteria per domain
| Domain | Number of unique questions per domain | Number of objective questions | Number of subjective questions |
|---|---|---|---|
| Clarity of purpose of the app | 13 | 10 | 3 |
| Developer credibility | 24 | 23 | 1 |
| Content/information validity | 77 | 52 | 25 |
| User experience | 137 | 75 | 62 |
| User engagement/adherence and social support | 51 | 24 | 27 |
| Interoperability | 4 | 3 | 1 |
| Value | 48 | 15 | 33 |
| Technical features and support | 14 | 13 | 1 |
| Privacy/security/ethical/legal | 51 | 43 | 8 |
| Accessibility | 11 | 11 | 0 |
| Total | 430 | 269 | 161 |
Figure 3.Frequency distribution of evaluative scaling methods (N = 97).
Figure 4.Frequency distribution of scoring mechanisms (N = 97).