| Literature DB >> 34855618 |
Antonio Muro-Culebras1,2, Adrian Escriche-Escuder1,2, Jaime Martin-Martin1,2, Cristina Roldán-Jiménez1,2, Irene De-Torres3, Maria Ruiz-Muñoz1,2, Manuel Gonzalez-Sanchez1,2, Fermin Mayoral-Cleries4, Attila Biró5, Wen Tang6, Borjanka Nikolova7, Alfredo Salvatore8, Antonio Ignacio Cuesta-Vargas1,2,9.
Abstract
BACKGROUND: There are several mobile health (mHealth) apps in mobile app stores. These apps enter the business-to-customer market with limited controls. Both, apps that users use autonomously and those designed to be recommended by practitioners require an end-user validation to minimize the risk of using apps that are ineffective or harmful. Prior studies have reviewed the most relevant aspects in a tool designed for assessing mHealth app quality, and different options have been developed for this purpose. However, the psychometric properties of the mHealth quality measurement tools, that is, the validity and reliability of the tools for their purpose, also need to be studied. The Consensus-based Standards for the Selection of Health Measurement Instruments (COSMIN) initiative has developed tools for selecting the most suitable measurement instrument for health outcomes, and one of the main fields of study was their psychometric properties.Entities:
Keywords: assessment; eHealth; mHealth; mobile apps; mobile health; mobile phone; questionnaire design; rating; smartphone
Mesh:
Year: 2021 PMID: 34855618 PMCID: PMC8686474 DOI: 10.2196/15433
Source DB: PubMed Journal: JMIR Mhealth Uhealth ISSN: 2291-5222 Impact factor: 4.773
Consensus-based Standards for the Selection of Health Measurement Instruments definitions of domains, measurement properties, and aspects of measurement properties [18].
| Domain | Measurement property | Aspect of a measurement property | Definition | |
| Reliability | N/Aa | N/A | The degree to which the measurement is free from measurement error | |
|
| The extent to which scores for patients who have not changed are the same for repeated measurement under several conditions: for example, using different sets of items from the same HR-PROsb (internal consistency) over time (test-retest), by different persons on the same occasion (interrater), or by the same persons (ie, raters or responders) on different occasions (intrarater) | |||
|
| Internal consistency | N/A | The degree of the interrelatedness among the items | |
|
| Reliability | N/A | The proportion of the total variance in the measurements which is due to | |
|
| Measurement error | N/A | The systematic and random error of a patient’s score that is not attributed to true changes in the construct to be measured | |
|
| The degree to which an HR-PRO instrument measures the construct it purports to measure | |||
|
|
| The degree to which the content of an HR-PRO instrument is an adequate reflection of the construct to be measured | ||
|
|
| Face validity | The degree to which (the items of) an HR-PRO instrument indeed looks as though they are an adequate reflection of the construct to be measured | |
|
|
| The degree to which the scores of an HR-PRO instrument are consistent with the hypotheses (for instance, with regard to internal relationships, relationships to the scores of other instruments, or differences between relevant groups) based on the assumption that the HR-PRO instrument validly measures the construct to be measured | ||
|
|
| Structural validity | The degree to which the scores of an HR-PRO instrument are an adequate reflection of the dimensionality of the construct to be measured | |
|
|
| Hypotheses testing | Idem construct validity | |
|
|
| Cross-cultural validity | The degree to which the performance of the items on a translated or culturally adapted HR-PRO instrument are an adequate reflection of the performance of the items of the original version of the HR-PRO instrument | |
|
| Criterion validity | N/A | The degree to which the scores of an HR-PRO instrument are an adequate reflection of a | |
|
| The ability of an HR-PRO instrument to detect change over time in the construct to be measured | |||
|
| Responsiveness | N/A | Idem responsiveness | |
| Interpretabilityd | N/A | N/A | Interpretability is the degree to which one can assign qualitative meaning, that is, clinical or commonly understood connotations, to an instrument’s quantitative scores or change in scores | |
aN/A: not applicable.
bHR-PRO: health-related patient-reported outcome.
cThe word true must be seen in the context of the classical test theory, which states that any observation is composed of 2 components: a true score and an error associated with the observation. True is the average score that would be obtained if the scale were given an infinite number of times. It refers only to the consistency of the score and not to its accuracy.
dInterpretability is not considered a measurement property but is an important characteristic of a measurement instrument.
Figure 1Flowchart of the selection process. COSMIN: Consensus-based Standards for the Selection of Health Measurement Instruments.
Main characteristics of the mobile health apps quality measurement tools included in this review.
| Measurement tool | Validation | Year | Language | Cross-cultural adaptation available | Dimensions of the measurement tool | Number of items |
| Mobile App Rating Scale | 60 mental health apps | 2015 | English | Italian and Spanish | 5: engagement; functionality; esthetics; information quality; and subjective app quality | 23 |
| iSYScore index | 257 health apps | 2016 | Spanish | —a | 3: popularity and interest; trust and quality; and usefulness | 14 |
| User version of the Mobile App Rating Scale | 2 health apps | 2016 | English | — | 5: engagement; functionality; esthetics; information; and subjective app quality | 20 |
| Health information technology usability evaluation scale | 100 adults who tested the use of a mobile health app | 2015 | English | — | 4: quality of work life; perceived usefulness; perceived ease of use; and user control | 20 |
| Measurement scales for perceived usefulness and perceived ease of use | 2 studies: 112 users and 2 systems and 40 users and 2 systems | 1998 | English | — | 2: perceived usefulness and perceived ease of use | 12 |
| The mHealth app usability questionnaire for interactive mHealth apps (patient version) | 2 health apps | 2019 | English | — | 3: Ease of use and satisfaction; System information arrangement; Usefulness | 21 |
| The mHealth app usability questionnaire for stand-alone mHealth apps (patient version) | 2 health apps | 2019 | English | — | 3: ease of use; interface and satisfaction; and usefulness | 18 |
| System Usability Scale | 3 studies: 20 people, 206 studies using System Usability Scale, and 9000 System Usability Scale questionnaires | 1986 | English | Portuguese, Spanish, French, German, Persian, and Malay | 1: usability | 10 |
aNot analyzed.