Literature DB >> 34322643

Translation and validation of the Italian version of the user version of the Mobile Application Rating Scale (uMARS).

Simone Morselli1,2, Arcangelo Sebastianelli1,2, Alexander Domnich3, Chiara Bucchi1, Pietro Spatafora1,2, Andrea Liaci1,2, Luca Gemma1,2, Stavros Gravas4, Donatella Panatto5, Stoyan Stoyanov6, Sergio Serni1,2, Mauro Gacci1,2.   

Abstract

BACKGROUND: Health sciences are steadily developing apps to help people to adopt correct lifestyles and to help physicians to monitor patients with chronic diseases. However, a properly validated tool that can evaluate patients' perception of apps is still lacking in many languages. In English, a validated questionnaire, called User Version of the Mobile Application Rating Scale (uMARS), is currently available. We translated the uMARS into Italian and validated our version.
METHODS: The uMARS questionnaire was translated from English to Italian by an official translator, and then administered to 100 smartphone users in order to evaluate the same app at times 1 and 2 (after 2 weeks). Paired t-test, Pearson Correlation Coefficient, Intraclass Correlation Coefficient (ICCs) and Cronbach's Alpha were used to evaluate the reliability and validity of the Italian uMARS.
RESULTS: We recruited 100 subjects, 52 males (52%) and 48 females (48%), with a mean age of 22.8 (SD: 3.4). All subjects answered all questions both at time 1 and at time 2. Paired t-test showed no statistically significant difference in each answer or group of answers between times 1 and 2 (P > 0.05). Cronbach's alpha was 0.945, as all subjects answered all questions. Each question was further assessed through the Pearson correlation coefficient, which demonstrated high reliability, with significant P (< 0.05) and Pearson Coefficients higher than 0.7. Similarly, ICCs were always higher than 0.750.
CONCLUSIONS: Our results validated the Italian version of uMARS, which may become a reliable and useful tool for evaluating health apps. ©2021 Pacini Editore SRL, Pisa, Italy.

Entities:  

Keywords:  Italian; Translation; User version mobile application rating scale; Validation; uMARS

Mesh:

Year:  2021        PMID: 34322643      PMCID: PMC8283658          DOI: 10.15167/2421-4248/jpmh2021.62.1.1894

Source DB:  PubMed          Journal:  J Prev Med Hyg        ISSN: 1121-2233


Introduction

Mobile health (mHealth) is a continuously evolving field of healthcare, owing to its particularly attractive features, such as ubiquity and portability [1]. Indeed, the number of mobile subscriptions in Europe is over 1,300 per 1,000 inhabitants, and 76% of Europeans access the Internet daily [2]. Healthcare professionals (HCPs), patients and the general public are increasingly using mHealth for a vast range of purposes, including communication and consultation with health services, HCPs and patients, the acquisition of health information and monitoring [1, 3]. Today, health-related mobile applications (apps) constitute a cornerstone of mHealth, and their overall market in 2019 was estimated as > 350,000 [4]. Indeed, it is believed that more than 500 million people worldwide have downloaded at least one mHealth app to their mobile phone [5]. Despite this promising outlook, mHealth apps are often of suboptimal quality and,worryingly, have poor or no evidence base [6]. This fact may compromise users’ choices and habits, thereby affecting health-related outcomes [7]. The easiest way to assess the quality of an app is to check its “star rating”, which can immediately be done in app stores. However, this quality appraisal system is highly subjective and may be significantly skewed by the phenomenon of “information asymmetry” between specialists and lay users [8, 9]. For instance, a review of urology apps available in the common app stores demonstrated that, while some apps were of moderate-high quality, the average app-store rating was 1 star only [10]. In order to address the above-described quality issues in a more objective way, the Mobile Application Rating Scale (MARS) was developed [8]. Given the frequency of its citation in published papers), MARS is probably the most popular scale worldwide. The original English language MARS scale has already been successfully validated in several languages, including Italian, Spanish, German, Dutch and Arabic [9, 11-14]. Briefly, the original MARS instrument is composed of 23 Likert scale-based items that cover the objective quality dimensions of engagement, functionality, esthetics and information, and one subjective dimension [8]. However, the original MARS scale was developed for professional use by researchers, clinicians and other specialists, and training is required before it can be used [8]. Given that most app users have no specialist knowledge, a training-free user version of MARS (uMARS) was subsequently validated and is publicly available [15]. An Italian version of the “expert” MARS instrument was successfully validated by Domnich et al. (2016). The same research group then tried to adapt the Italian version of MARS to the user version, with regard to the assessment of the quality of an app concerning invasive pneumococcal disease; however, no formal validation was performed [16]. To address this unmet need, the present study aimed to validate the Italian version of uMARS.

Methods

THE ENGLISH VERSION OF UMARS

Like the expert version, the uMARS covers four objective dimensions (engagement, functionality, esthetics and information) and one subjective dimension. Unlike the expert version, however, it is training-free, omits three items on the information subscale and has better readability properties. Briefly, the scale consists of 20 anchored 5-point Likert-type items that are distributed as follows: engagement (N = 5), functionality (N = 4), esthetics (N = 3) and information (N = 4) and 4 items belonging to the subjective quality domain. There is also a section on perceived impact (6 items), which assesses the potential impact of a given app on users’ knowledge, behaviors, intentions, help-seeking, etc.; this is a domain that evaluates users’ perception of the usefulness of the app, thereby enabling its impact to be rated. The scoring procedure is based on the mean: (i) subscale-specific scores are obtained by averaging individual item scores on that particular subscale; (ii) an overall uMARS score is an average of the four objective dimensions [15]. During the validation process, the English version of uMARS displayed both excellent internal consistency (Cronbach’s α of 0.90) and good test-retest reliability in a period of up to three months [15].

ADAPTATION AND TRANSLATION

To meet our objective, we generally adopted the process of adaptation/translation of the Italian version of MARS [9]. Following consultations with the authors of the expert MARS version, it was deemed that the two Italian versions were fully interchangeable from the points of view of conceptual, item, semantic, operational, measurement and functional equivalence. Indeed, the uMARS has simpler wording, but at the same time shares the same topics with the expert version. The formal adaptation procedure was therefore judged redundant. The English version of uMARS was then translated into Italian by a professional bilingual translator. The resulting output was then compared with the validated expert scale and discussed by the research group, which led to only minor changes being made. A back-translation was then performed in order to verify the compatibility and accuracy of meaning between the source and target languages. The authors of the original Italian MARS scale approved the final Italian uMARS questionnaire [9]. The final version of the questionnaire is available in Appendix A.

VALIDATION PROCEDURE

To validate the final Italian uMARS scale, we roughly followed the original methodology, as described by Stoyanov et al. (2016). We aimed to test the internal consistency and test-retest reliability of the scale. To validate the questionnaire, we set a target of 100 individuals, a sample size in line with the original uMARS validation study [15]. Potential participants were selected from the University of Florence, Faculty of Medicine. Subjects who, on the day of enrollment, were able to navigate in the study target app from their devices were potentially eligible. Participants had to be sufficiently fluent in Italian. The app chosen for uMARS validation was Facebook. There were several strong arguments for this choice: this app is free, popular and does not require participants to spend the time they would normally need to familiarize themselves with a new app, and Facebook contains all of the components covered by the four uMARS domains; therefore, all items could be confidently validated. The choice of Facebook was also related to the cost of downloading some health apps and the unavailability of some health apps on various mobile operating systems. Moreover, the use of certain health apps would have excluded some subjects from participating in the study. Eligible subjects were instructed to navigate all the app functions for at least 10 min and then to rate the app by using the uMARS. In order to ascertain intra-rater reproducibility, this procedure was carried out twice: on enrollment (t1) and approximately two weeks later (t2); this time-lag was deemed an appropriate waiting period by the research team. The study was conducted in accordance with the Declaration of Helsinki. Participation in this research was voluntary and all relevant Italian and international biomedical and privacy-related guidelines were followed. As the nature of the study was neither interventional nor biomedical, participants were not exposed to any risks. Therefore, formal ethical approval for this study was deemed unnecessary. Furthermore, all participants were enrolled on a voluntary basis, and our research group guaranteed their privacy.

STATISTICAL ANALYSIS

Internal consistency was measured by means of Cronbach’s α using the following “rule of thumb” categories: excellent (≥ 0.90), good (0.80-0.89), acceptable (0.70-0.79), questionable (0.60-0.69), poor (0.50-0.59) and unacceptable (< 0.50) [17]. The reproducibility of the Italian uMARS questionnaire was evaluated through the second application of the questionnaire at t2, two weeks after t1. First, a paired t-test was calculated in order to observe variations between the two measurements, for each sub-score and total score. Moreover, for every variable, test-retest reliability was evaluated through a bivariate Pearson correlation coefficient. A Pearson correlation coefficient above 0.7 and a significance level below 0.05 were considered sufficient to assess test-retest reliability. Finally, intraclass correlation coefficients (ICCs) were computed between t1 and t2, in order to provide weighted values of rater agreement and assess proximity rather than equality of ratings. The model chosen to calculate ICCs was that of random-effects average measures with absolute agreement [18]. All statistical analyses were conducted by means of SPSS version 20.0 (SPSS Inc, Chicago, IL, USA). A P-value < 0.05 was conventionally deemed statistically significant.

Results

During the study, the target population of 100 was reached in approximately three months. The mean age of participants was 22.8 years (SD = 3.4) and the male-to-female ratio was close to 1 (48 female vs 52 male). All participants were native Italian speakers, mostly university students in their 2nd, 3rd or 4th year; 22 participants (22%) were students’ relatives: 9 (9%) with a degree, 10 (10%) with a high-school diploma and 3 (3%) without a high-school diploma. All participants filled in the uMARS questionnaire both at t1 and at t2 (100% compliance). Their principal characteristics are reported in Table I.
Tab. I.

Characteristics of participants in the validation of the Italian version of uMARS.

Sex, n (%)Female48 (48.0%)
Male52 (52.0%)
Native Italian speaker, n (%)100 (100%)
Education, n (%)High school88 (88.0%)
University9 (9.0%)
Middle school3 (3.0%)
Age, mean (SD)22.8 (3.4)
The Italian uMARS displayed high internal consistency (Cronbach’s α = 0.95), and excellent reliability (Cronbach’s α based on standardized items = 0.98). The paired t-tests demonstrated that all 20 items and all the overall (sub)scale scores were similar between t1 and t2 (P > 0.12) (Table II). In detail, the highest variability was observed for question E20, regarding the subjective evaluation of the app: “What is your overall (star) rating of the app?” t1 mean 3.7 (SD: 0.8) vs t2 mean 3.6 (SD: 1.0) p = 0.089. The lowest variability was observed in the Information Section concerning question D16 “Credibility of source: does the information within the app seem to come from a credible source?”, t1 mean 3.6 (SD: 1.1) t2 mean 3.6 (SD: 1.1) p = 1.000. Moreover, test-retest reliability showed high consistency in each sub-score and total score, with a Pearson Correlation coefficient above 0.7 and significance below 0.05 (Tab. II). Analogously, the ICCs observed were constantly high (Tab. III), confirming excellent test-retest reliability.
Tab. II.

Test-retest reliability between time 1 and time 2 in the validation of the Italian version of uMARS.

Paired t-testReliability
Time 1Time 2PPearson correlationP
A - engagementA1 - entertainment3.8 (1.0)3.7 (1.1)0.2350.8260.001
A2 - interest3.9 (1.1)3.8 (1.1)0.3520.8100.001
A3 - customization4.0 (0.9)3.8 (0.9)0.3090.8100.001
A4 - interactivity3.9 (0.9)4.0 (0.9)0.4980.7970.001
A5 - target group3.9 (1.0)3.8 (1.1)0.8220.7980.001
A - engagement total19.4 (3.6)19.1 (3.8)0.2830.8540.001
B - functionalityB6 - performance4.0 (0.9)4.0 (0.8)0.7500.8280.001
B7 - ease of use4.0 (0.9)3.9 (0.9)0.6230.7420.001
B8 - navigation4.1 (0.7)3.9 (0.8)0.1600.8030.001
B9 - gestural design4.2 (0.8)4.2 (0.6)0.6600.8430.001
B - functionality total16.3 (2.1)16.0 (2.0)0.2140.7900.001
C - estheticsC10 - layout3.8 (0.8)3.9 (0.8)0.7430.7480.001
C11 - graphics3.9 (0.8)3.8 (0.9)0.8220.7710.001
C12 - visual appeal3.8 (0.9)3.8 (1.0)0.3230.8020.001
C - esthetics total11.5 (2.0)11.4 (2.1)0.7270.7940.001
D - informationD13 - quality of information3.7 (1.0)3.7 (0.8)0.6430.7530.001
D14 - quantity of information3.9 (1.1)3.9 (1.0)0.5990.8500.001
D15 - visual information4.0 (0.9)3.9 (1.0)0.3230.7720.001
D16 - credibility of source3.6 (1.1)3.6 (1.1)1.0000.8440.001
D - information total15.1 (3.4)15.1 (3.1)1.0000.8740.001
Quality(A+b+c+d/4)15.6 (2.3)15.4 (2.3)0.3390.8810.001
E - subjective qualityE17 - recommendation to others3.7 (1.0)3.7 (1.0)0.7000.8680.001
E18 - use and relevance4.1 (1.1)3.9 (1.1)0.1670.7920.001
E19 - payment2.5 (1.6)2.5 (1.5)0.6860.8030.001
E20 - overall rating3.7 (0.8)3.5 (1.0)0.1270.7920.001
E - subjective quality total14.0 (3.2)13.6 (3.2)0.0800.8800.001
F - perceived impactF1 - awareness3.3 (1.2)3.2 (1.1)0.2940.8100.001
F2 - knowledge3.0 (1.2)3.1 (1.3)0.2300.8680.001
F3 - attitudes3.0 (1.3)3.0 (1.2)0.8220.8490.001
F4 - intention to change3.3 (1.1)3.1 (1.2)0.1280.8130.001
F5 - help seeking3.2 (1.2)3.2 (1.2)0.5380.8000.001
F6 - behavior change3.1 (1.3)3.2 (1.2)0.8000.8860.001
F - perceived impact total19.0 (6.4)18.8 (6.3)0.4880.9500.001
Total score83.8 (15.4)82.5 (14.8)0.1970.9170.001

icc: intraclass correlation coefficient; for single measures α = cronbach’s alpha. All values are scores and are reported as mean (standard deviation). Paired t-test was done as statistical analysis between time 1 and time 2.

Tab. III.

Intraclass Correlation Coefficients between Time 1 and Time 2 in the validation of the Italian version of uMARS.

Iccα
A- engagementA1 - entertainment0.850.97
A2 - interest0.830.90
A3 - customization0.810.90
A4 - interactivity0.800.97
A5 - target group0.780.91
A - engagement total0.800.89
B - functionalityB6 - performance0.820.97
B7 - ease of use0.760.87
B8 - navigation0.800.87
B9 - gestural design0.870.94
B - functionality total0.790.88
C - estheticsC10 - layout0.750.87
C11 - graphics0.770.90
C12 - visual appeal0.800.90
C - esthetics total0.790.88
D - informationD13 - quality of information0.760.85
D14 - quantity of information0.850.92
D15 - visual information0.770.87
D16 - credibility of source0.840.95
D - information total0.870.93
Quality(A+b+c+d/4)0.880.94
E -subjective qualityE17 - recommendation to others0.870.92
E18 - use and relevance0.790.86
E19 - payment0.800.85
E20 - overall rating0.810.87
E - subjective quality total0.880.94
F – perceived impactF1 - awareness0.810.87
F2 - knowledge0.860.93
F3 - attitudes0.850.92
F4 - intention to change0.810.90
F5 - help seeking0.800.89
F6 - behavior change0.880.94
F- perceived impact total0.950.97
Total score0.920.96

ICC: Intraclass Correlation Coefficient; for single measures α = Cronbach’s Alpha. All values are scores and are reported as mean (standard deviation). Paired t-test was done as statistical analysis between time 1 and time 2.

Discussion

The uMARS scale is designed to enable the quality of health-related apps to be evaluated, both within institutional bodies (e.g. researchers, scientific societies, regulatory agencies, industry) and among end-users [15]. We translated this scale into Italian and validated our version. Together with the Italian “expert” MARS questionnaire, our newly validated Italian uMARS scale completes the set of instruments for the quality evaluation of Italian health apps. Although Italy ranks relatively highly in terms of mobile subscriptions and Internet access, the overall English language proficiency of Italians is among the lowest in Europe [9, 19-21]. Therefore, original questionnaires should be translated into Italian and validated, in order to reach the majority of the population (this is also a matter of equality of access to a new technology). Moreover, the Italian version of uMARS can now also be used for wide population-based research. The validity and reliability of the Italian uMARS version proved similar to those of the original uMARS [15]. Specifically, we obtained a similarly high level of internal consistency (α ≥ 0.90), in line with the Italian “expert” version of MARS [10]. The test-retest reliability and ICCs were also high, being similar to those of the original scale created by Stoyanov et al. (2016). In our opinion, these optimal properties were obtained because: (i) we adopted the methodology of the original MARS and consulted the original uMARS and the Italian MARS scales, and (ii) subsequent approval of the text of the scale was obtained from the authors of the previously published “expert” scale [8, 9, 15]. Considering that the present study was based on a body of previous research, we were able to identify some limitations of our study. Indeed, the total sample size of 100 participants was not powered a priori [8, 9, 15]. However, in a recently published review by Bujang et al., this sample size was judged sufficient [22]. Moreover, our sample was mostly composed of young university students, who are usually very familiar with modern technology and use apps more often than older people do. Consequently, the sample might be not fully representative of the general Italian population. Finally, although the app chosen for validation had the advantage of being widely used, it was not an mHealth app; this was a potential limitation, as it might have influenced respondents’ perception of the impact of the questionnaire on health.) However, the sub- and total scores in other domains can be assumed to be reliable and correctly assessed by the questionnaire.

Conclusions

The Italian version of uMARS displayed good reliability and validity. When accompanied by the “expert” MARS version, it may be used in multipurpose research/public health projects and by developers working in the sphere of digital health intervention in Italy. Furthermore, the Italian version of uMARS can provide more equal access to the evaluation of mHealth technologies from the point of view of different stakeholders (i.e. for-profit vs non-profit). Characteristics of participants in the validation of the Italian version of uMARS. Test-retest reliability between time 1 and time 2 in the validation of the Italian version of uMARS. icc: intraclass correlation coefficient; for single measures α = cronbach’s alpha. All values are scores and are reported as mean (standard deviation). Paired t-test was done as statistical analysis between time 1 and time 2. Intraclass Correlation Coefficients between Time 1 and Time 2 in the validation of the Italian version of uMARS. ICC: Intraclass Correlation Coefficient; for single measures α = Cronbach’s Alpha. All values are scores and are reported as mean (standard deviation). Paired t-test was done as statistical analysis between time 1 and time 2.
  14 in total

Review 1.  Use of Mobile Health (mHealth) Technologies and Interventions Among Community Health Workers Globally: A Scoping Review.

Authors:  Jody Early; Carmen Gonzalez; Vanessa Gordon-Dseagu; Laura Robles-Calderon
Journal:  Health Promot Pract       Date:  2019-06-10

2.  Spanish adaptation and validation of the Mobile Application Rating Scale questionnaire.

Authors:  R Martin Payo; M M Fernandez Álvarez; M Blanco Díaz; M Cuesta Izquierdo; S R Stoyanov; E Llaneza Suárez
Journal:  Int J Med Inform       Date:  2019-06-05       Impact factor: 4.046

3.  Computing Inter-Rater Reliability for Observational Data: An Overview and Tutorial.

Authors:  Kevin A Hallgren
Journal:  Tutor Quant Methods Psychol       Date:  2012

Review 4.  Low Quality of Free Coaching Apps With Respect to the American College of Sports Medicine Guidelines: A Review of Current Mobile Apps.

Authors:  François Modave; Jiang Bian; Trevor Leavitt; Jennifer Bromwell; Charles Harris Iii; Heather Vincent
Journal:  JMIR Mhealth Uhealth       Date:  2015-07-24       Impact factor: 4.773

5.  Development and validation of the Italian version of the Mobile Application Rating Scale and its generalisability to apps targeting primary prevention.

Authors:  Alexander Domnich; Lucia Arata; Daniela Amicizia; Alessio Signori; Bernard Patrick; Stoyan Stoyanov; Leanne Hides; Roberto Gasparini; Donatella Panatto
Journal:  BMC Med Inform Decis Mak       Date:  2016-07-07       Impact factor: 2.796

6.  An eHealth Project on Invasive Pneumococcal Disease: Comprehensive Evaluation of a Promotional Campaign.

Authors:  Donatella Panatto; Alexander Domnich; Roberto Gasparini; Paolo Bonanni; Giancarlo Icardi; Daniela Amicizia; Lucia Arata; Stefano Carozzo; Alessio Signori; Angela Bechini; Sara Boccalini
Journal:  J Med Internet Res       Date:  2016-12-02       Impact factor: 5.428

7.  Current Knowledge and Adoption of Mobile Health Apps Among Australian General Practitioners: Survey Study.

Authors:  Oyungerel Byambasuren; Elaine Beller; Paul Glasziou
Journal:  JMIR Mhealth Uhealth       Date:  2019-06-03       Impact factor: 4.773

8.  A Review on Sample Size Determination for Cronbach's Alpha Test: A Simple Guide for Researchers.

Authors:  Mohamad Adam Bujang; Evi Diana Omar; Nur Akmal Baharum
Journal:  Malays J Med Sci       Date:  2018-12-28

9.  The Arabic Version of the Mobile App Rating Scale: Development and Validation Study.

Authors:  Marco Bardus; Nathalie Awada; Lilian A Ghandour; Elie-Jacques Fares; Tarek Gherbal; Tasnim Al-Zanati; Stoyan R Stoyanov
Journal:  JMIR Mhealth Uhealth       Date:  2020-03-03       Impact factor: 4.773

10.  Urology apps: a review of all apps available for urologists.

Authors:  Jonathan K Makanjuola; Amrith R Rao; Jemma Hale; Matthew Bultitude; Ben Challacombe; Prokar Dasgupta
Journal:  BJU Int       Date:  2012-05-11       Impact factor: 5.588

View more
  1 in total

1.  Designing and Implementing the MySusCof App-A Mobile App to Support Food Waste Reduction.

Authors:  Rainer Haas; Hakan Aşan; Onur Doğan; Claus Rainer Michalek; Özlem Karaca Akkan; Zeki Atıl Bulut
Journal:  Foods       Date:  2022-07-26
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.