Literature DB >> 33204203

Application of Rasch Analysis for Development and Psychometric Properties of Adolescents' Quality of Life Instruments: A Systematic Review.

Sahar Dabaghi1, Fatemeh Esmaielzadeh1, Camelia Rohani2,3.   

Abstract

BACKGROUND: Due to the importance of assessing quality of life (QoL) in healthy and ill adolescents, the evaluation of psychometric properties of these questionnaires is important.
OBJECTIVE: To investigate the application of Rasch analysis in psychometric assessment studies on adolescents' QoL instruments, and to evaluate the quality of reporting Rasch parameters in these studies.
METHODS: This systematic review was conducted by searching for papers in electronic databases PubMed, Web of Science, EMBASE, Cochrane Library and Scopus until December 2018.
RESULTS: After screening 122 papers, 31 remained in the study. Around 68% of the studies used the Rasch analysis for instrument testing and 32% for the development of new instruments. In 77.4% of studies, both classical and Rasch methods were used parallel to data analysis. In 32.2% of studies, healthy adolescents were the main target group. The most commonly used instrument in Rasch studies was, KIDSCREEN, administered in different countries. Six Rasch parameters were reported with a higher percentage in the studies. Major reported parameters of Rasch analysis were application of the software program (96.7%), test of item fit to the Rasch model (93.5%), unidimensionality (80.6%), type of the identified mathematical Rasch model (74.1%), threshold (58%) and differential item functioning (54.8%). Based on the psychometric evaluation of the QoL instruments, 71% of studies showed acceptable results.
CONCLUSION: The application of the Rasch model for psychometric assessment of adolescents' QoL questionnaires has increased in recent decades. But, there is still no strong and commonly used critical appraisal tool or guideline for the evaluation of these papers.
© 2020 Dabaghi et al.

Entities:  

Keywords:  Rasch analysis; adolescence; instrument; psychometric; quality of life; systematic review

Year:  2020        PMID: 33204203      PMCID: PMC7666979          DOI: 10.2147/AHMT.S265413

Source DB:  PubMed          Journal:  Adolesc Health Med Ther        ISSN: 1179-318X


Introduction

One of the concerns of healthcare providers is adolescents and children’s quality of life (QoL).1 Adolescence, is a transitional phase between the childhood and adulthood period. The World Health Organization (WHO) defines it as a life period between ages 10 to 19.2 This range is overlapping with the definition of the WHO for young people between 10 to 24.2 Substance abuse, injuries, obesity, chronic diseases, mental health problems, sexual and reproductive health disorders are important issues during the adolescence period.3,4 To assess physical, social and mental health as well as evaluating population-based intervention programs, outcome variables like QoL and health-related quality of life (HRQoL) are measured among adolescents and children.5 Thus, the assessment of QoL among adolescents and children is important to identify individuals with poor QoL condition and to plan intervention strategies.6 One important part of an individual’s general well-being is defined as QoL.7 As described by the WHO, QoL is an individuals’ understanding of their situations in life within the context of the culture and value systems in which they live, in relation to their goals, expectations, standards, and attitudes.8 However, HRQoL contains at least physical, psychological and social functioning aspects in the context of a disease.9 Despite various definitions for QoL and HRQoL, the researchers agree on three important features, consisting of multidimensionality, subjectivity, and dynamicity of these concepts.10 Since, the 1970s, after efforts to improve QoL, the number of instruments has increased exponentially, some of which are made for adolescents.11 In clinical and non-clinical settings, the use of QoL measurement tools can provide valuable information for the best possible individual care and evaluation of interventions.12 Considering the importance of measuring the QoL of adolescents, it is valuable to use valid and reliable questionnaires. The necessity and importance of providing sufficient information about validity and reliability of the questionnaires have been mentioned in previous studies.13,14 Thus, evaluation of the psychometric properties of these questionnaires is essential.15 To this end, various tools have been designed and are available as generic and specific-disease tools to measure QoL.16 Although, evaluation of psychometric properties of instruments is mostly done by Classical Test Theory (CTT) methods, researchers’ attention has increased towards application of more complicated models. Also, numerous researchers have recommended, that new theories should replace the classical theories in various disciplines. The Rasch model is one of the most practical and common approaches used for this purpose.17 The Rasch analysis, based on the Item Response Theory (IRT), is used for evaluation of the psychometric properties of measurement tools, and is more complex than the classical methods. Relevant findings show that this complexity is superior to the classical methods because more accurate results are obtained. There are some problems regarding the use of classical methods in which raw scores, linear combinations of these scores and responses that have ordinal scale, essentially are considered as interval-scale data. Modern analysis offers solutions to these problems. Rasch analysis is a statistical technique that can be used to convert ordinal data into interval data in questions with two or more options to answer.18 QoL Questionnaires with several subscales can be evaluated by Rasch models.19 In these models, it is assumed that, all the questions and materials of a test measuring a construct constitute a sort of ordered relationship. The second assumption in these models is that a correct answer to one question does not influence the other questions.17 Evidence shows that the number of QoL instruments is increasing, some of them specific for adolescents. Due to the importance of assessing QoL in healthy and ill adolescents, the evaluation of psychometric properties of these instruments is important.17 Thus, a systematic review was conducted on studies that investigated psychometric properties of adolescents’ QoL Instruments by Rasch analysis. The following research questions were also put forth in this review: How is the application of Rasch analysis in psychometric assessment studies on adolescents’ QoL Instruments? How is the quality of reporting Rasch parameters in selected studies?

Methods

The present study was carried out based on the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) 2009. PRISMA Statement is a critical appraisal tool and helps authors improve the reporting of systematic review and meta-analysis studies’ results.20 Selected studies included all papers reporting psychometric properties of QoL instruments in adolescents by the Rasch method. A systematic search was done by two authors independently using five electronic databases; PubMed, Web of Science, EMBASE, Cochrane Library and Scopus, with no time limit until December 2018 (Table 1). A number of key terms with the assistance of Boolean expressions (AND, OR, NOT) were assessed in the title and abstract of the identified papers. Key terms included “Quality of Life” OR “Life Quality” OR “Health-Related” OR “Health-Related Quality Of Life” OR “HRQOL” AND “Rasch analysis” OR “Rasch analyses“ OR “Rasch measurement” OR “Rasch measurement model” OR “Rasch model” OR “Rasch modeling” OR “Rasch rating scale analysis” OR “Rasch rating scale model” AND “Adolescent” OR “Teenager” OR “Teen” OR “Stripling”. The inclusion criteria for study selection were all papers published in a scientific journal in the English language, they used Rasch analysis to evaluate psychometric properties of adolescents̕ QoL instruments, and there was access to the full text of the paper. Exclusion criteria included non-English language papers, adult target population, and review papers. After removing duplicates in the first screening phase, two authors independently reviewed titles and abstracts of the studies. Papers containing the full text, were included in the review and were screened separately by two authors. In cases of disagreement between the two authors, the papers were reviewed by the third author. Two tables were designed in the research team to collect the data from different papers. The following data were extracted from all studies: author’s name, year of publication, country, aim of the study, study type, measurement instrument, item numbers, rating scales, measurement construct, sample and age, data collection procedure, study results, type of software, type of Rasch parameters, validity (Classical and Rasch methods), reliability (Classical and Rasch methods), and mixed results of both Rasch and Classical methods. Three authors individually evaluated the Rasch parameters using a Quality Identifiers (QI) checklist. All processes of studies’ selection and data extraction were discussed within the research team. The QI checklist which was introduced by Tennant and Conaghan with 10 items, was used to evaluate quality of the reporting of Rasch parameters.21,22 The criteria for this 10-item checklist are: (1) Stating the name of the software for Rasch analysis, (2) Reporting mathematical derivation of the Rasch model, (3) Evaluation of the threshold ordering, (4) Test of item fit to the Rasch model, (5) Test of person fit, (6) Testing for differential item functioning (DIF), (7) Reliability, (8) Response dependency, (9) Unidimensionality, and (10) Transformation table.21 Here there is a short explanation for each item.
Table 1

Search Syntax in Different Databases

DatabaseSearch SyntaxNumber of Outputs
PubMed(“Quality of life”[tiab] OR “Life Quality”[tiab] OR “Health-Related Quality Of Life”[tiab] OR “Health Related Quality Of Life”[tiab] OR HRQOL[tiab]) AND (“Rasch analysis”[tiab] OR “Rasch analyses”[tiab] OR “Rasch measurement”[tiab] OR “Rasch measurement model”[tiab] OR “Rasch model”[tiab] OR “Rasch modeling”[tiab] OR “Rasch modelling”[tiab] OR “Rasch rating scale analysis”[tiab] OR “Rasch rating scale model”[tiab] AND Adolescent[tiab] OR Teenager[tiab] OR Teen[tiab] OR Stripling[tiab])33
Embase(“Quality of life”:ab,ti OR “Life Quality”:ab,ti OR “Health-Related Quality Of Life”:ab,ti OR “Health Related Quality Of Life”:ab,ti OR HRQOL:ab,ti) AND (“Rasch analysis”:ab,ti OR “Rasch analyses”:ab,ti OR “Rasch measurement”:ab,ti OR “Rasch measurement model”:ab,ti OR “Rasch model”:ab,ti OR “Rasch modeling”:ab,ti OR “Rasch modelling”:ab,ti OR “Rasch rating scale analysis”:ab,ti OR “Rasch rating scale model”:ab,ti) AND (Adolescent:ab,ti OR Teenager:ab,ti OR Teen:ab,ti OR Stripling:ab,ti)14
Web of Science(TS=(Quality of life) OR TS=(Life Quality) OR TS=(Health-Related Quality Of Life) OR TS=(Health Related Quality Of Life) OR TS=(HRQOL)) AND (TS=(Rasch analysis) OR TS=(Rasch analyses) OR TS=(Rasch measurement) OR TS=(Rasch measurement model) OR TS=(Rasch model) OR TS=(Rasch modeling) OR TS=(Rasch modelling) OR TS=(Rasch rating scale analysis) OR TS=(Rasch rating scale model)) AND (TS= (Adolescent) OR TS=(Teenager) OR TS=(Teen) OR TS=(Stripling))50
Cochrane Library((“Quality of life”):ti,ab,kw OR (“Life Quality”):ti,ab,kw OR (“Health-Related Quality Of Life”):ti,ab,kw OR (“Health Related Quality Of Life”):ti,ab,kw OR (“HRQOL”):ti,ab,kw) AND ((“Rasch analysis”):ti,ab,kw OR (“Rasch analyses”):ti,ab,kw OR (“Rasch measurement”):ti,ab,kw OR (“Rasch measurement model”):ti,ab,kw OR (“Rasch model”):ti,ab,kw OR (“Rasch modeling”):ti,ab,kw OR (“Rasch modelling”):ti,ab,kw OR (“Rasch rating scale analysis”):ti,ab,kw OR (“Rasch rating scale model”):ti,ab,kw) AND ((“Adolescent”):ti,ab,kw OR (“Teenager”):ti,ab,kw OR (“Teen”):ti,ab,kw OR (“Stripling”):ti,ab,kw)3
SCOPUSTITLE-ABS(“Quality of life”) OR TITLE-ABS(“Life Quality”) OR TITLE-ABS(“Health-Related Quality Of Life”) OR TITLE-ABS(“Health Related Quality Of Life”) OR TITLE-ABS (HRQOL) AND TITLE-ABS (“Rasch analysis”) OR TITLE-ABS (“Rasch analyses”) OR TITLE-ABS (“Rasch measurement”) OR TITLE-ABS(“Rasch measurement model”) OR TITLE-ABS(“Rasch model”) OR TITLE-ABS(“Rasch modeling”) OR TITLE-ABS(“Rasch modelling”) OR TITLE-ABS(“Rasch rating scale analysis”) OR TITLE-ABS(“Rasch rating scale model”) AND TITLE-ABS(Adolescent) OR TITLE-ABS(Teenager) OR TITLE-ABS(Teen) OR TITLE-ABS(Stripling)22
Search Syntax in Different Databases

Stating the Name of the Software for Rasch Analysis

There are several software programs to analyze the data using the Rasch method, but the main software programs which are used in the studies are WINSTEPS, RUMM, ConQuest and WINMIRA. Each of these software programs report the findings in a slightly different process. But, the main hypothesis in all of them is testing and matching observed response pattern in the data with theoretical pattern expected by the model.21–24

Reporting Mathematical Derivation of the Rasch Model

The first step in Rasch analysis is determining and selecting the mathematical derivation of the Rasch model. If the Likert scale items have two response options the chosen model is a dichotomous model. If the items have three or more levels of response options (Polytomous model), the Rasch model will be slightly different and is commonly referred to as the Andrich Rating Scale Model (RSM) or Masters Partial Credit Model (PCM). The two models differs in the way of behaving with thresholds (probabilistic midpoint between two adjacent categories). In the RSM, any category on the response scale is modeled to be “equal” throughout all items. In other words, the gap between thresholds is considered equal across items. In fact, the threshold is a metric distance between items and should be the same across the items of a scale. Studies should explain the type of the model which was used and the reason for this selection.21–24

Evaluation of the Threshold Ordering

This evaluation applies to polytomous items using the PCM and the RSM models. It is important to identify the classification structure of response levels to the items and to ensure that answers to the items are consistent with the metric estimate of the underlying construct. This means that the transition from one response level to the next should increase the hidden property of the measure. When this does not happen, disordered thresholds have occurred. Disorder at threshold means collapse of response options (response categories) over one another, followed by disorder at response levels due to relevance, redundancy, or misallocation of the items in the instrument. In the “Rasch-Andrich thresholds” or “step calibrations”, disordering appears when some categories do not match to intervals on the latent variable. Andrich disputes that there is a requirement at the level of the item for the categories to be ordered. The latent response processes must therefore be dependent so that an outcome of being successful on the second latent process and failing the first latent process cannot occur. Therefore, a threshold cannot be passed unless all prior thresholds have been passed. Andrich imposes this order requirement through what he calls a Guttman structure.25,26 Studies should identify threshold as well as measures to eliminate the disorder.21–24

Test of Item Fit to the Rasch Model

There may be a difference between observed and expected responses in each study, so model testing can give the researcher more information. There are different fit statistics in different software programs. Although they cannot be compared to each other, there are standardized values for all individuals and they can subsequently be comparable. For example, WINSTEPS software outputs include infit, outfit, and ZSTD (standardized fit statistics). In the RUMM 2020 software, outputs are Chi-square statistic and residual statistic (standardized sum of all differences between observed and expected values aggregated across all persons) which are very similar to outfit and ZSTD in WINSTEPS software. Infit is the difference between observed and expected responses for those items with difficulty levels close to the individual’s ability. The outfit deals with the differences between all items (regardless of difficulty of the item and based on one’s ability). Thus, infit gives more weight to the responses for items close to one’s ability. Chi-square test shows the difference between expected and observed values of measured properties between groups, indicating different levels of ability. If its value is less than 0.05, then it is considered as a misfit. Another output of the RUMM 2020 is item-trait interaction Chi-square, representing the variance of measured property. If Chi-square becomes significant, hierarchical ordering of response levels varies across attributes.21–24

Test of Person Fit

Answers with unusual patterns can influence fit item levels. These incorrect responses can be triggered by unregistered diseases such as cognitive deficits in the respondent. Thus, deleting such responders can lead to improved internal construct validity of the tool. Such response patterns must be identified by high positive residuals. So, person fit is important and should be reported.21–24

Testing for DIF (Differential Item Functioning)

DIF or item bias can influence the fit model. This happens when different groups with different demographic characteristics in a sample (eg, youth, the elderly, women, and men) respond differently to the same item while equal levels of response are measured for a latent variable. The goal is not to disregard age group differences, but to have access to the same levels of a trait regardless of the age variable. There may be a difference in different demographic settings, but an important point is that, DIF should be evaluated at least in different age and sex groups in each study.21–24

Reliability

There are different outputs for reliability in different software programs. For example, the output of the WINSTEPS software is reported as “person reliability” and “person separation”. “Person reliability” indicates to how well the instrument constantly reproduces a participant’s score. A “person reliability” coefficient of 0.70 shows an acceptable level, 0.80 a good level, and 0.90 an excellent level. While a “person separation” indicates how well the instrument differentiates persons into separate ability groups. It means “reproducibility of relative measure location, therefore, the greater the person separation statistic, the more distinct strata are disclosed (not less than 2). In RUMM 2030, this is provided by the “Person Separation Index” (PSI). The PSI is equivalent to Cronbach’s alpha, though it uses person estimates in logit instead of raw scores. Similar to Cronbach’s alpha, a value close to 1 indicates high internal consistency and a value less than 0.70 indicates model misfit.21–24

Response Dependency

Response dependency arises when the responses to some items are correlated, that is, answering one item determines the answer to another. Dependent items can be determined by calculating a residual correlation matrix, which should not actually show a significant correlation.21–24

Unidimensionality

According to the Rasch model, it is assumed that, all items collectively represent a unidimensional tool and measure one trait. There are several methods used for the measurement of unidimensionality that vary depending on the software which is used in data analysis, but Rasch Principal Component Analysis (PCA) for residuals or PCAR is the common indicator for the analysis of residuals. In WINSTEPS software, the first remaining contrast is an important indicator and usually should not be higher than two (eigenvalue of <2). However, if the first contrast has an eigenvalue value >2, it indicates that another dimension may be exist. It specifies that the dimension has at least three items.27 Another way to measure it using RUMM software is reporting a series of independent t-tests guided by pairs of person measures fitted from two subsets of positive and negative loading items on the first component of the residuals. Person estimates derived from the positive set of items are compared against those derived from the negative set. A series of unique t-tests are performed to compare the probabilities for each individual. Percentage of these tests outside the range of −1.96 to +1.96 should not exceed 5%. The differences in estimates derived from the two sets of the items are normally distributed.21–24

Transformation Table

When the Rasch model is used for psychometric study of an instrument, the options of instrument items can be converted to an interval scale. In this case, a transformation table for this change must be reported in the study. Equal interval scaling is important in clinical trials for accurately reporting the change in scores and reporting responsiveness, such as minimal important differences and effect size.21–24

Results

From a total of 122 papers which were found until December 2018, 31 papers were finally included in this study. The process of the study selection is shown in Figure 1 (the PRISMA 2009 diagram). The general characteristics of the selected studies are shown in Table 2.
Figure 1

PRISMA diagram for the selection of studies.

Notes: Adapted from Moher D, Liberati A, Tetzlaff J, Altman DG. The PRISMA GROUP. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med. 2009; 6 (7): 1000097. doi: 10. 1371/journal. pmed 1000097.20

Table 2

General Characteristics of the Selected Studies with Application of Rasch Analysis in Adolescents’ QoL Instruments (n = 31)

No.Author (Year); CountryAim of the StudyStudy TypeInstrument; Item NumbersRating ScalesMeasurement AreaSample; AgeData CollectionResults
1Caronni et al(2017); Italy45Develop a new questionnaire for measuring HRQoL in young people with a spine deformityCross-sectional study; Rasch analysis was run on version 1 and 2 of the ISYQOLISYQOL; 205-pointSpine deformityAdolescents with idiopathic scoliosis or Scheuermann juvenile kyphosis (n=402); under 18 in two phases: ISYQOL-1I; 14.6±1.99; and with ISYQOL-II; (15±2.1)Self-report questionnaireTwo valid and reliable versions of the questionnaires are provided for patients with and without brace.
2Caronni et al (2014); Italy51Present Rasch analysis of the SRS-22 questionnaire and to develop a short form and Rasch approved of the SRS-7Psychometric studySRS-22; 22SRS-7; 7SRS-7: four items= 4-point;one item=3-point; one item=5-pointIdiopathic scoliosisAIS patients who never were treated before; (n=300); 10–16 yearsSelf-report questionnaireThe SRS-22 showed a poor clinimetric properties; the SRS-7 showed better targeting of the participants’ population.
3Oluboyede et al (2017); UK54Development and refinement of the WAItE, a new obesity-specific QoL measure for adolescentsTwo phases: qualitative phase (one-to-one and focus group interviews); quantitative phase (Psychometric assessments and Rasch analysis)WAItE; 75-pointOverweight and obesitytreatment-seeking above normal weight status and non-treatment-seeking (school sample- adolescents) (n=315); 11–18 yearsSelf-report questionnaireThe WAItE focuses on aspects of life affected by weight that are important to adolescents. It showed the potential for adding key information to the assessment of weight management interventions aimed in adolescents.
4Smidt et al (2010); US55Development and validation a QoL index for adolescents with skin diseasesProspective, longitudinal cohort study. Two phases: qualitative phase (expert opinions, and review of the literature); quantitative phase (psychometric assessments and Rasch analysis)Skindex-Teen; 215-pointSkin diseases (Acne, Atopic dermatitis, Nevi or halo nevi, Psoriasis, Verrucae, Alopecia or alopecia areata, Morphea or localized scleroderma, Vitiligo)Patients with a skin condition (n=205); 12–17 yearsSelf-report questionnaireValid and reliable questionnaire is provided for patients with skin diseases.
5Sapin et al (2005); France28Evaluation of psychometric properties of the VSP-A questionnaire on two populations, ill and healthyPsychometric studyVSP-A; 375-pointGlobal HRQoL in ill medical, surgical or psychiatric conditions and healthy adolescentsTwo sub-samples: the first with adolescents attending school, and the second with inpatient youths with various conditions (n= 1938); 10–17 yearsSelf-report questionnaireResults support the reliability and validity of the VSP-A as a generic report of HRQoL for adolescents.
6Basra et al (2018); UK56Development and validation of the T-QoL instrument for adolescents with skin diseasesTwo phases: qualitative phase (semi-structured interviews with adolescents); quantitative phase (psychometric assessments and Rasch analysis)T-QoL; 185-pointSkin diseases (Acne, Eczema, Psoriasis, Moles, Nonspecific dermatitis, Warts)Adolescents with skin diseases (n= 426); 12–19 yearsSelf-report questionnaireValid and reliable questionnaire is provided for patients with skin diseases.
7Ravens-Sieberer et al (2008); 13 European countries29Reliability and validity of the European KIDSCREEN-52 generic HRQoL questionnaireCross-sectional studyKIDSCREEN; 525-pointHRQoL in healthy adolescentsChildren and adolescents (n= 22,827); 8–11 yearsSelf-report questionnaireThe KIDSCREEN −52 showed a good clinimetric properties.
8Gothwal et al (2018); India40Psychometric properties of KIDSCREEN-27 and child-parent agreement regarding child’s HRQoL in children operated for CGProspective cohort with a psychometric evaluationKIDSCREEN; 27 (child and adolescent version, and parent version)5-pointCongenital GlaucomaChildren operated for CG (n= 121); 8–18 yearsSelf-report questionnaireItems of the KIDSCREEN −27 were fitted with the Rasch model, but a lack of unidimensionality was observed. There was a discordance between CG child’s self-report of HRQoL and parent’s report.
9Nielsen et al (2017); Denmark57Translate the DCGM-37 and DSM-10 questionnaires into Danish and use the Rasch model to determine their internal validity and reliability in children and adolescents with T1DTwo phases: translated into the Danish language and psychometric assessments and Rasch analysisDCGM; 37 and DSM; 105-pointT1DChildren and adolescents with T1D (n=413); 8–18 yearsSelf-report questionnaireDanish translations of the DISABKIDSDCGM- 37 demonstrated good validity and adequate reliability.
10Ng et al (2015); China30Reliability and validity of the KIDSCREEN-52 and KIDSCREEN-27Two phases: translated into the Chinese language and psychometric assessments and Rasch analysisKIDSCREEN; 27, 525-pointHRQoL of StudentsChildren and adolescents (n= 1379 + 555); 8–14 yearsSelf-report questionnaireChinese version of the KIDSCREEN demonstrated good validity and reliability.
11Jervaeus et al (2013); Sweden41Present Rasch analysis results on the KIDCSREEN-27Cohort studyKIDCSREEN; 275-pointChildhood cancerParticipants diagnosed with cancer (n= 63); 12–22 yearsTelephone- interviewThe KIDSCREEN-27, showed a good psychometric properties, with the exception of Autonomy and Parent Relations, due to non-satisfactoryunidimensionality, for use among adolescents and young adults who have survived childhood cancer.
12Jafari et al (2012); Iran31Present Rasch analysis results of the Persian version of the KIDSCREEN-27Psychometric studyKIDSCREEN; 275-pointHRQoL of StudentsChildren and adolescents (n= 1083); 8–18 yearsSelf-report questionnairePersian version of the KIDSCREEN-27 showed good validity and reliability.
13Aires et al (2011); Brazil32Psychometric properties of the Brazilian-Portuguese version of the VSP-ATwo phases: translated into the Brazilian language and psychometric assessments and Rasch analysisVSP-A; 365-pointHRQoL in high school studentsAdolescents (n= 446); 14–18 yearsSelf-report questionnaireBrazilian version of the VSP-A demonstrated good validity and reliability.
14Kook & Varni (2008); Korea24Psychometric properties of the Korean version of the PedsQL4.0Two phases: translated into the Korean language and psychometric assessments and Rasch analysisPedsQL; 235-pointQoL of studentsSchool children and adolescents (n= 1425); 8–18 yearsSelf-report questionnaireKorean version of the PedsQL showed moderate validity and reliability.
15Simeoni et al (2007); 7 European countries46Present psychometric properties of the HRQoL DISABKIDSChronic Generic Measure and to develop a short form by Rasch analysisCross-sectional studyDCGM; 375-pointHRQoL of children and adolescentsStudents with chronic health conditions (asthma, arthritis, epilepsy, cerebral palsy, diabetes, atopic dermatitis, cystic fibrosis (n= 1153); 8–16 yearsSelf-report questionnaireThe DCGM-37 showed a good clinimetric properties.
16Robitail et al (2006); 7 European countries42Psychometric properties of the European proxy KIDSCREEN-52Cross-sectional studyKIDSCREEN; 525-pointHRQoL of children and adolescents with Special Health Care NeedsChildren and adolescents with special health care needs and their proxies (parents) (n= 2505); 8–18 yearsSelf-report questionnaireThe KIDSCREEN-52 showed a good psychometric properties.
17Henry et al (2003); France52Development of the CFQ for assessing QoL in pediatric and adult patientsTwo phases: qualitative phase (expert panel and review of the literatures and interview with CF teenagers); quantitative phase (psychometric assessments and Rasch analysis)CFQ; 254-pointCystic FibrosisPatients (101 teenagers and 151 adults) and 141 parents of children with CF; 8–13 yearsSelf-report questionnaireBoth the CFQ 14 + and the CFQ child parent-proxy French questionnaires showed a good psychometric properties.
18Chae et al (2018); Korea37Psychometric properties of the Korean version of the CHAQ by applying the Rasch modelPsychometric studyCHAQ; 304-pointCPChildren with CP (n= 65); 6–15 yearsProxy-report Questionnaire;The CHAQ-30, adapted to the Korean population in assessing HRQL in children with CP showed a poor psychometric properties; item development and modification were determined to be necessary.
19Landfeldt et al (2018); UK & US47Psychometric properties of the English (UK and US) version of the PedsQL NMMPsychometric studyPedsQL NMM; 255-pointDuchenne muscular dystrophyPatients with Duchenne muscular dystrophy (n= 278); 9–23 yearsSelf-report questionnaireThe PedsQL NMM-25 showed a poor clinimetric properties.
20Bray et al (2017); Australia48Two parts: Part one-determine whether a single score as a valid reflection of the HRQoL could be derived from ESM data.Part two- external validity by comparing the measure score derived from the ESM with that of the retrospective measurePsychometric studyESM a diary method; 19 PedsQL Generic Core scales; 232-, 3-, or 4-pointDuchenne muscular dystrophyBoys with Duchenne muscular dystrophy (n= 31); 9–18 yearsSelf-report questionnaireThe ESM −19 as a reported day-to-day experience of QoL showed a good psychometric properties in children.
21Strong et al (2017); Australia49Psychometric properties of the Taiwan version of the Sizing Me Up for adolescents with normal weight and underweight children in East Asia using Rasch analysis and CTT methodsTwo phases: translated into the Taiwanese language and psychometric assessments and Rasch analysisSizing Me Up; 224-pointObese, overweight, normal weight, and underweight childrenStudents in third to sixth grades (n= 497); under 11 yearsSelf-report questionnaireThe Sizing Me Up showed a good psychometric properties for underweight and normal weight children.
22Vélez et al (2016); Columbia33Psychometric properties of the Colombian version of the KIDSCREEN-27 children and parent-proxy versions through Rasch analysisPsychometric studyKIDSCREEN; 275-pointHRQoL of ill and healthy childrenChildren and adolescents: n=321; and parents-proxies: n=1150; 8–18 yearsSelf-report questionnaire and interview for children; self-report parent-proxy questionnaireThe KIDSCREEN −27 showed a good psychometric properties.
23Ravens-Sieberer et al (2014); 13 European countries34Development and validation of a new KIDS-CAT based on KIDSCREEN experiencesPsychometric studyKIDS-CAT; KIDSCREEN 52, 27.105-pointQoL and well-being in ill and healthy studentsChildren and adolescents (n= 22,827); 8–18 yearsSelf-report and proxy versions of the questionnairesThe KIDS-CAT showed a good psychometric properties.
24Huang et al (2012); US44Develop a HRQoL tool for YASCC based on three legacy instruments (QOL-CS, SF-36, and QLACS)Psychometric studyHRQoL tool for YASCC;1237-pointChildhood cancerYoung adult survivors of childhood cancer (N=151); 23–29 yearsInterviewThe HRQOL tool for YASCC showed a poor psychometric properties.
25Park et al (2012); Korea38Evaluation of the PODCI in patients with cerebral palsy using Rasch analysisCohort studyPODCI; 39Not reportedCerebral PalsyAdolescents with CP, (n= 720); 8–14 yearsSelf-report and proxy-report versions of the questionnairesThe PODCI showed a poor psychometric properties.
26Huang et al (2011); US43Psychometric properties of a generic HRQoL instrument, the PedsQL 4.0, for children with LTCPsychometric studyPedsQL; 215-pointLife threatening conditionsParents of children with LTC (n= 257); 2–18 yearsParent proxy-report by telephone interviewThe PedsQL showed a poor psychometric properties.
27Erhart et al (2010); 7 European countries35Compare item reduction analysis based on the CTT with analysis based on the Rasch Partial Credit Model item-fit for children and adolescents’ HRQoL itemsCross-sectional studyA dimension from a preliminary KIDSCREEN; 195-pointPhysical well-being dimensionChildren and adolescents; (n= 3019); 8–18 yearsSelf-report questionnaireBoth types of item reduction analysis should be accompanied by additional analyses. However, the results support the usability of the Rasch method for developing new HRQoL measures for children and adolescents.
28Khadka, et al (2010); UK53Development and validation of the CVAQC for adolescents with a visual impairmentTwo phases: qualitative phase (focus groups and interview); quantitative phase (psychometric assessments and Rasch analysis)CVAQC; 254-pointWith and without a visual impairmentChildren and young people (N=109); 5–18 yearsSelf-report questionnaireThe CVAQC showed a good psychometric properties.
29Erhart, et al (2009); 7 European counties39Rasch Measurement Properties of the KIDSCREEN-52 in Children with CP and DIF between Children with and without CPPsychometric studyKIDSCREEN; 525-pointCerebral palsyChildren (n=3219) and parents (n=2126) in the general population (KIDSCREEN project); and children with CP (n=501) and their parents (n=823) (SPARCLE project); 8–12 yearsSelf-report and parent version of the questionnaireThe KIDSCREEN-52 items were understood in the same way by children with and without CP.All items of the tool were fitted with the Rasch model.Some small deviation from unidimensionality was observed in two dimensions of parent-reported version, but not influenced on the measurement.
30Robitail et al (2007); 13 European countries36Present Rasch analysis of the KIDSCREEN and to develop a short form and Rasch approved of the KIDSCREEN-52Cross-sectional studyKIDSCREEN; 275-pointHRQoL of young peopleChildren and adolescent (n= 22,827); 8–18 yearsSelf-report questionnaireThe KIDSCREEN-27 showed a good psychometric properties.
31Bower et al (2006); China50Development of a validated cross-cultural QoL instrument, specific for children with bladder dysfunction; PinQMethodological study in three phases for instrument developmentPinQ; 212, 4-pointBladder DysfunctionChildren with Bladder Dysfunction from 10 countries (n= 156); 6–17 yearsSelf-report questionnaireThe PinQ with two factors showed a good psychometric properties.

Abbreviations: HRQoL, health-related quality of life; ISYQOL, Italian Spine Youth Quality of Life; SRS, Scoliosis Research Society; AIS, Adolescent Idiopathic Scoliosis; WAItE, Weight-Specific Adolescent Instrument for Economic-Evaluation; QoL, quality of life; VSP-A, Vécu et Santé Perçue de l’Adolescent; T-QoL, Teenagers’ Quality of Life; CG, congenital glaucoma; DCGM, DISABKIDS Chronic-Generic Module, it is a joint Chronic-Generic Module from the European project of DISABKIDS for developing HRQoL instruments in children and adolescents with chronic conditions; DSM, Diabetes-Specific Module; T1D, type 1 diabetes; PedsQL, Pediatric Quality of Life Inventory; CFQ, Cystic Fibrosis Questionnaire; CF, cystic fibrosis; CHAQ, Childhood Health Assessment Questionnaire; CP, cerebral palsy; PedsQL NMM, Pediatric Quality of Life Inventory 3.0 Neuromuscular Module; ESM, experience sampling method; CAT, computerized adaptive test; YASCC, young adult survivors of childhood cancer; PODCI, Pediatric Outcomes Data Collection Instrument; LTC, life-threatening conditions; CVAQC, Cardiff Visual Ability Questionnaire for Children; PinQ, pediatric QoL tool; CTT, classical test theory; DIF, differential item functioning.

General Characteristics of the Selected Studies with Application of Rasch Analysis in Adolescents’ QoL Instruments (n = 31) Abbreviations: HRQoL, health-related quality of life; ISYQOL, Italian Spine Youth Quality of Life; SRS, Scoliosis Research Society; AIS, Adolescent Idiopathic Scoliosis; WAItE, Weight-Specific Adolescent Instrument for Economic-Evaluation; QoL, quality of life; VSP-A, Vécu et Santé Perçue de l’Adolescent; T-QoL, Teenagers’ Quality of Life; CG, congenital glaucoma; DCGM, DISABKIDS Chronic-Generic Module, it is a joint Chronic-Generic Module from the European project of DISABKIDS for developing HRQoL instruments in children and adolescents with chronic conditions; DSM, Diabetes-Specific Module; T1D, type 1 diabetes; PedsQL, Pediatric Quality of Life Inventory; CFQ, Cystic Fibrosis Questionnaire; CF, cystic fibrosis; CHAQ, Childhood Health Assessment Questionnaire; CP, cerebral palsy; PedsQL NMM, Pediatric Quality of Life Inventory 3.0 Neuromuscular Module; ESM, experience sampling method; CAT, computerized adaptive test; YASCC, young adult survivors of childhood cancer; PODCI, Pediatric Outcomes Data Collection Instrument; LTC, life-threatening conditions; CVAQC, Cardiff Visual Ability Questionnaire for Children; PinQ, pediatric QoL tool; CTT, classical test theory; DIF, differential item functioning. PRISMA diagram for the selection of studies. Notes: Adapted from Moher D, Liberati A, Tetzlaff J, Altman DG. The PRISMA GROUP. Preferred Reporting Items for Systematic Reviews and Meta-Analyses: The PRISMA Statement. PLoS Med. 2009; 6 (7): 1000097. doi: 10. 1371/journal. pmed 1000097.20 In response to the first research question “How is the application of Rasch analysis in psychometric assessment studies on adolescents’ QoL Instruments?” the following results were obtained.

Characteristics of Studies

67.7% of the studies (n = 21) used the Rasch analysis for instrument testing and 32.3% (n = 10) for the development of new instruments. In 77.4% of studies (n = 24), both Classical and Rasch methods were used parallel to data analysis. Mean age of adolescents in selected studies was equal to 12 years old (range: 6–22 years). The most commonly used instrument in Rasch studies was KIDSCREEN, administered in different countries. An important point is that, the majority of QoL of instruments, such as KIDSCREEN (KIDSCREEN Health-Related Quality of Life Questionnaire), DCGM-37 (DISABKIDS Chronic Generic Module), PedsQL (Pediatric Quality of Life Inventory), CFQ (Cystic Fibrosis Questionnaire) cover a wide range of ages, including childhood and adolescence, making it impossible to separate study results on adolescents from children. Various tools have been used, and the main target group for evaluating validity and reliability of the QoL tools was healthy adolescents (n = 78,149) in 32.2% of studies.24,28–36 Regarding target groups of patients, 9.4% of studies were related to adolescents with Cerebral Palsy (CP), accounting for the highest number of patients.37–39 Other disorders studied, including kyphosis, skin diseases, diabetes, cancer, glaucoma, bladder dysfunction, visual impairment, Life-Threatening Conditions (LTC), Duchenne muscular dystrophy, cystic fibrosis, mental health, idiopathic scoliosis, spine deformity, mental health, underweight, obesity and overweight have also been the focus of interest. Also, 22.6% of the studies (n = 7) were cross-cultural surveys, with the highest share among European countries (Table 2).

Adolescents’ QoL Instruments

Twenty types of QoL instruments were evaluated by the Rasch method. Among various QoL instruments, there were 18 specific tools for adolescents and children; ISYQOL (Italian Spine Youth Quality of Life), WAItE (Weight-specific Adolescent Instrument for Economic evaluation), Skindex-Teen (skin disease scale), VSP-A (Vécu et Santé Percue de l’Adolescent), KIDSCREEN, T-QoL (Teenagers’ Quality of Life), DISABKIDS (Disability Kids), PedsQL (Pediatric Quality of Life Inventory), DCGM-37 DISABKIDS (Chronic Generic Module), CHAQ (The Childhood Health Assessment Questionnaire), Sizing Me Up (a self-reported weight-related QoL instrument), KIDS-CAT (computerized adaptive test), HRQOL tool for YASCC (Young Adult Survivors of Childhood Cancer), PODCI (Pediatric Outcomes Data Collection Instrument), CVAQC (Cardiff Visual Ability Questionnaire for Children), PinQ (Pediatric QoL), ESM (Experience Sampling) a diary method, SRS-7 (Scoliosis Research Society), and CFQ (Cystic Fibrosis Questionnaire). The most commonly used tool was KIDSCREEN, administered in 11 studies (35.5%) in different countries, examining its psychometric properties.29–31,33,36,39–41 This questionnaire was designed in the KIDSCREEN project in 10 dimensions with 52 questions and is suitable for measuring QoL of children and adolescents at 8–18 years of age. It has 10 dimensions, including physical well-being, psychological well-being, emotions and moods, social support and peer, parent’s relationships, self-perception, autonomy, school environment, social acceptance, and financial resources, measuring QoL across a 5-point Likert scale. The highest score for each item is 4 and the lowest score is 0.29,30,41 In the present review, studies used the short (10, 19, 27 items) and long versions (52 items) of the questionnaire.29,31,33–36,39–42 It was used as a self-report questionnaire in most studies, while telephone-interview,41,43 proxy interview,33,34,37–39,43 and face-to face interview versions33,44 have also been reported. Several types of Likert-scales were used in various studies, but 5-point Likert-scales were mostly used. Based on the psychometric assessment of the instruments, 9 studies (29%) reported unsatisfactory psychometrics results. The rest of studies reported satisfactory psychometrics results (n = 22, 71%) (Table 2). Furthermore, some short forms of the instruments, such as SRS-7, KIDSCREEN-23, and CHAQ-28 showed good psychometric results.

Rasch Parameters

In response to the second research question “How is the quality of reporting Rasch parameters in selected studies?” the following results were obtained: Tables 2 and 3 represent the detailed results for psychometric assessment of the studies. Rasch parameters using the QI checklist are shown in Table 3. The results show that most studies (n = 30, 96.7%) reported the name of the software program for Rasch analysis; WINSTEPS (n = 23, 74.2%), RUMM (n = 3, 9.7%), and WINMIRA (n = 3, 9.7%), respectively. One study (3.2%) used two software programs; WINMIRA and MULTIRA.29 Two studies (6.4%) used other software programs; DIGRAM and FACETS.48,57 Also, 15 studies (48.4%) applied PCM29,31,33–36,38,39,41,45–50 and 6 studies (19.4%) applied RSM as a mathematical Rasch model.24,28,40,51–53 One study (3.2%) used both models.42 In 8 studies (25.8%), no mathematical Rasch model was mentioned.30,32,37,43,44,54–57 As threshold is important for instruments with polytomous item responses, it was observed in 58.1% of the studies (n = 18). It was not mentioned in several studies (n=13, 41.9%).28,30,31,38,42–44,52,55 In studies with disordered threshold, poor items were treated by item deletion45,46,54 or category rescoring.33,40,41,47,51,53,56
Table 3

Psychometric Properties of the Selected QoL Studies in Adolescents (n = 31)

No.Author (Year)Rasch SoftwareMathematical Derivation of Rasch ModelRasch Parameters (Type)Validity (Rasch and Classical Methods)*Reliability (Rasch and Classical Methods)*Mixed Results (Rasch and Classical Methods)
1Caronni et al (2017)45WINSTEPSPCMInfit MnSq, Outfit MnSq, ZSTD, DIF, Unidimensionality, Threshold, person reliability, PSIPCAR,Person reliability, PSI
2Caronni et al (2014)51WINSTEPSRSMInfit, Outfit, DIF, Unidimensionality, Threshold, Person reliability, PSRPCAR, Floor-ceiling effectPerson reliability, PSR-
3Oluboyede et al (2017)54RUMMNot reportedFit residuals (item fit residuals, person fit residuals), Chi-square (difference between observed and expected responses), DIF, Item-threshold probability curve, Unidimensionality, Local dependency, PSIEFA, CFA, PAF, Floor effectPSI, Internal Consistency (Cronbach’s α), corrected item-total correlation+
4Smidt et al (2010)55WINSTEPSNot reportedFit statistics (MnSq),Separation index, Item scale reliabilityEFA, Construct, Content, Face validitySeparation index (in scale level), Internal consistency (Cronbach’s alpha), ICC, Item-scale reliability, Responsiveness+
5Sapin et al (2005)28BIGSTEPS/WINSTEPSRSMUnidimensionalityContent validity, Construct validity (inter-item correlations, item- dimension correlations, PCA), Item discriminant validity, Item-convergent validity, External validity, Clinical validity (Known-group), Floor-ceiling effectInternal consistency (Cronbach’s alpha), Sensitivity+
6Basra et al (2018)56RUMMNot reportedDIF, Item responses, Individual item fit, Unidimensionality, Threshold, PSIConvergent validity, Construct validity, Face and Content validity, EFA, CFACronbach’s alpha, ICC, PSI, Internal consistency, Sensitivity (SRM, ES)+
7Ravens-Sieberer et al (2008)29WINMIRA,MULTIRAPCMDIF, Infit, Item threshold parameter, Person threshold parameterConvergent and Discriminant validity, Construct validity (between groups by ES), CFAInternal consistency (Cronbach’s alpha), ICC+
8Gothwal et al (2018)40WINSTEPSRSMInfit, DIF, Unidimensionality, Threshold, Mean item and person locationPCAReliability (not reported the type of reliability)+
9Nielsen et al (2017)57DIGRAMNot reportedDIF, Unidimensionality, Threshold, local dependenceConstruct validity, GLLRM, overall test-of-fit (using CLR tests)Cronbach’s α, Monte Carlo method+
10Ng et al (2015)30WINSTEPSNot reportedInfit, Dimensional mapCFA, Convergent and Divergent validity, Floor-ceiling effectInternal consistency (Cronbach’s alpha), ICC+
11Jervaeus et al (2013)41WINSTEPSPCMItem and person goodness of fit statistics (MnSq residuals and standardized z-values), DIF, Unidimensionality,ThresholdPCAR, Internal scale validity, Person response validityNot reported separately-
12Jafari et al (2012)31WINSTEPSPCMInfit, Outfit, Item difficulty, CPCConvergent validity, Discriminant validityInternal consistency (Cronbach's alpha)+
13Aires et al (2011)32WINSTEPSNot reportedDIF, Infit, Unidimensionality,CFA (Internal construct validity), External construct validity (by comparison between groups), IIC, IDV, Floor-ceiling effectCronbach’s alpha, ICC+
14Kook & Varni (2008)24WINSTEPSRSMInfit, Outfit, Item hierarchy, Threshold, Person reliability, Item reliability, PSI, Item reliability indexConstruct validity (Known-groups), Structural validity (CFA), Floor-ceiling effectPerson reliability, Item reliability, PSI+
15Simeoni et al (2007)46WINSTEPSPCMInfit, UnidimensionalityCFA (Construct validity), Convergent and Discriminant validity, IIC, IDVInternal consistency (Cronbach’s alpha), ICC+
16Robitail et al (2006)42WINSTEPSPCM, RSMIinfit, UnidimensionalityCFA, Construct validity, External validity, Convergent and Divergent validity, Floor-ceiling effectCronbach’s alpha, ICC+
17Henry et al (2003)52BIGSTEPSRSMInfit, Outfit, Unidimensionality, Item calibrationConvergent and Discriminant validity, External construct validity, Clinical validityInternal consistency (Cronbach’s alpha), ICC, Responsiveness+
18Chae et al (2018)37WINSTEPSNot reportedInfit, Outfit, Item difficulty, Item hierarchy, Threshold, Separation reliability, Separation indexConstruct validity, Concurrent validitySeparation reliability (for subject, item), separation index (for subject, item)-
19Landfeldt et al (2018)47RUMMPCMPerson fit (mean fit residual), item fit (mean fit residual), Local item dependency, DIF, Unidimensionality, Item hierarchy, Threshold, PSI, Cronbach’s alphaPCARPSI, Cronbach’s alpha-
20Bray et al (2017)48FACETSPCMInfit, Outfit, Unidimensionality, Item hierarchy, Threshold, Person reliability indexConstruct validity, correlation with with the summary score from the PedsQ Generic Core scales to examine whether daily experience was representative of HRQLPerson reliability Index (internal consistency)+
21Strong et al (2017)49WINSTEPSPCMInfit, Outfit, DIF, Unidimensionality, Item hierarchy, ThresholdCFA, Concurrent validity, Known-groups validityInternal consistency (Cronbach’s alpha), ICC+
22Vélez et al (2016)33WINSTEPSPCMInfit, Outfit, DIF, Unidimensionality,Threshold, Person and Item separation,Internal scale validityInternal consistency (Person and Item separation)-
23Ravens-Sieberer et al (2014)34Reported elsewherePCMDIF, Infit, UnidimensionalityCFA, Convergent validity, Known-groups validity, Criterion validityCronbach’s alpha, ICC+
24Huang et al (2012)44WINSTEPSNot reportedInfit, Outfit, Unidimensionality, Item difficulty and threshold (item-person map), Item and Person reliability, Item and Person separationCFA, known-groups validity, Floor- ceiling effectsItem separation, PSI, Item reliability, Person reliability, Precision, Inter-item correlation+
25Park et al (2012)38WINSTEPSPCMInfit (MnSq), Unidimensionality, Average item calibrationFloor-ceiling effect, Item separation (by individual inter-item difference or average item calibration)No parameter was reported as reliability-
26Huang et al (2011)43WINSTEPSNot reportedItem hierarchy of difficulty, Item and Person Separation IndexItem-domain convergent and discriminant validity, known-groups validity, construct validityInternal consistency (Cronbach’s alpha), Item Separation Index, Person Separation Index+
27Erhart et al. (2010)35WINMIRAPCMItem fit (two methods: Infit and Q-Index as a conditional item-fit), Unidimensionality, Cross-cultural DIF, Threshold, Local independenceStructural validity (CFA), known-group validity, Cross-cultural validity, Relative validity for socio-demographic and health-related factors (ES)Corrected item-total correlation and Cronbach’s alpha (maximizing)+
28Khadka et al. (2010)53WINSTEPSRSMInfit, Outfit, ZSTD, Unidimensionality, Item hierarchy, DIF, ThresholdPerson and item estimates, Item reduction and calibration, Content validity, Ceiling effectPSI, Item Separation Reliability, ICC+
29Erhart et al. (2009)39WINMIRAPCMItem fit (two methods: Infit and Q-index), DIF, Threshold, Unidimensionality, Local dependenceCFANo parameter was reported as reliability+
30Robitail et al (2007)36WINSTEPSPCMInfit, DIF, UnidimensionalityEFA, CFA, IIC, IDV, Floor-ceiling effectCronbach’s alpha+
31Bower et al (2006)50WINSTEPSPCMInfit, UnidimentionalityEFA, Internal validity (INFIT), Floor-ceiling effectCronbach’s alpha+

Abbreviations: PCM, Masters Partial Credit Model; RSM, Andrich Rating Scale Model; DIF, differential item functioning; PSI, Person Separation Index; PSR, person separation ratio; ZSTD, standardized fit statistics; PCA, principal component analysis; PCAR, principal component analysis of residuals; SEM, structural equation modeling; CFA, confirmatory factor analysis; EFA, explanatory factor analysis; ICC, intra-class correlation coefficient; IIC, item-internal consistency; IDV, item discriminant validity; GLLRM, graphical log linear Rasch model; CLR, conditional likelihood ratio test, SRM, standardised response mean; ES, effect size, MnSq, mean square; CPC, category probability curve.

Psychometric Properties of the Selected QoL Studies in Adolescents (n = 31) Abbreviations: PCM, Masters Partial Credit Model; RSM, Andrich Rating Scale Model; DIF, differential item functioning; PSI, Person Separation Index; PSR, person separation ratio; ZSTD, standardized fit statistics; PCA, principal component analysis; PCAR, principal component analysis of residuals; SEM, structural equation modeling; CFA, confirmatory factor analysis; EFA, explanatory factor analysis; ICC, intra-class correlation coefficient; IIC, item-internal consistency; IDV, item discriminant validity; GLLRM, graphical log linear Rasch model; CLR, conditional likelihood ratio test, SRM, standardised response mean; ES, effect size, MnSq, mean square; CPC, category probability curve. Among current studies, most of them (n = 29, 93.5%) reported infit and or outfit mean square for item fit, and a few reported another statistic test (Q-index) (n = 2, 6.4%). Out of these, 2 studies (6.4%) did not address item fit. Only 8 studies (25.8%) reported person fit. 17 studies (54.8%) reported DIF, but 14 studies (45.1%) did not mention it. Based on the reliability results, PSI (n = 10, 32.2%), item separation ratio (n = 6, 19.3%), and Cronbach’s alpha by RUMM (n = 1, 3.2%) were reported in studies. In 3 studies (9.6%), indicators of reliability were not reported. Except for 6 studies (19.3%), the rest of the studies reported unidimensionality (n = 25, 80.6%). In none of the studies was a transformation table drawn (Table 3). According to the QI checklist, around 71% of the studies (n = 22) reported between 4 to 7 Rasch parameters (Table 4). Rasch parameters, including application of the software program (n = 30, 96.7%), test of item fit to the Rasch model (n = 29, 93.5%), unidimensionality (n = 25, 80.6%), type of the identified mathematical Rasch model (n = 23, 74.1%) and threshold (n = 18, 58%) were reported with a higher percentage, respectively. Reliability was reported in most studies using two different approaches; Rasch (n = 12, 38.7%) and classical methods (n = 16, 51.6%). Reliability was not reported in only three studies (9.6%).
Table 4

Evaluation of the Quality of Reporting Rasch Parameters by the QI Checklist in Studies (n = 31)

Rasch ParametersNumber of Studies (n)Percentage (%)
1Software Programa3096.7
WINSTEPS2374.2
WINMIRA39.7
RUMM 203039.7
Others26.4
Not reportedb13.2
2Mathematical derivation of Rasch model2270.1
Partial Credit Model (PCM)1548.4
Rating Scale Model (RSM)619.4
Both Methods13.2
Not reported929.0
3Threshold (for Polytomous Item Response)1858.1
4Test of Item Fit2993.5
5Test of Person Fit825.8
6DIFc1754.8
7Reliability1238.7
PSId1032.2
Item Separation Ratio619.3
Two methods (PSI, Item Separation Ratio)412.9
Cronbach’s alphae13.2
Not reported39.6
8Response Dependency516.1
9Unidimensionality2580.6
10Transformation Table00.0

Notes: aBIGSTEPS was classified in the category of the WINSTEPS, so it is a free DOS precursor to WINSTEPS; One study applied two software programs, WINMIRA and MULTIRA, this study was classified by the name of the first software; bOne study reported the name of the software elsewhere, therefore it was classified in the category of “not reported”; cDIF, Differential Item Functioning; dPSI, Person Separation Index; eBefore RUMM 2030, reliability was reported by Cronbach alpha coefficient.

Evaluation of the Quality of Reporting Rasch Parameters by the QI Checklist in Studies (n = 31) Notes: aBIGSTEPS was classified in the category of the WINSTEPS, so it is a free DOS precursor to WINSTEPS; One study applied two software programs, WINMIRA and MULTIRA, this study was classified by the name of the first software; bOne study reported the name of the software elsewhere, therefore it was classified in the category of “not reported”; cDIF, Differential Item Functioning; dPSI, Person Separation Index; eBefore RUMM 2030, reliability was reported by Cronbach alpha coefficient.

Discussion

The present study was conducted aimed at providing a systematic review of the application of Rasch analysis in psychometric assessment studies on adolescents’ QoL instruments. Overall results showed that, most of the selected studies used the Rasch method for instrument testing and fewer for instrument development. Results of this study showed that in the last decade, the number of these studies has increased. The diversity of QoL instruments using Rasch analysis is high, and KIDSCREEN has been mostly used in those studies, as it is applicable to a wide range of healthy and ill adolescents and children (10–18 years of age). This suggests that there is a place for further development and exploration of other QoL instruments in the future. Furthermore, most QoL instruments involve a wide range of ages, so they cover two stages of life, childhood and adolescence, such as KIDSCREEN, PedsQL, CFQ. In some of these instruments, it is not possible to distinguish adolescence from the childhood period, which may reduce their efficiency. In the present study, 12 types of instruments involved a wide range of ages, including childhood and adolescence (ISYQOL, KIDSCREEN, DCGM, PedsQL, CFQ, CHAQ, Sizing Me Up, KIDCAT, HRQOL, PODCI, CVAQC, and PinQ). Evidence shows that there are a limited number of critical appraisal tools and available guidelines for writing papers in the field of Rasch21,22 and this can probably have an influence on the reporting of Rasch studies. The results of our study indicate that authors have reported a various number of Rasch parameters in their studies. Some parameters, such as application of the software program and test of item fit to the Rasch model were reported in more than 93% of studies. The type of identified mathematical Rasch model was described in more than 74% of studies, and unidimensionality, threshold and DIF were reported in around 80%, 58% and 54% of studies, respectively. But, parameters, such as response dependency, were reported less (around 16%), and the transformation table had not been reported in the selected studies. This is not a surprising result, so the results of an earlier systematic review in the field of rheumatology showed that the application of the transformation table was limited. From 1991 to 2012 in two periods, it was reported as static with a proportion of 11.4% and 12.2%, respectively.21 As the transformation table is a report about converting ordinal scales to the interval for the estimation of responsiveness and clinical changes in outcome variables, this limit can influence the application of Rasch transformed interval measurement in clinical practice. But, according to the study design and type of the QoL instrument, there is a possibility that it has not been in the objective of those studies. However, this can be an area for further development and evaluation of outcome measures in clinical practice in future Rasch QoL studies. Furthermore, it should be noted that, the QI checklist was used in the present study as a critical appraisal tool to evaluate the quality of reporting of Rasch parameters in those studies,21 but the number of critical appraisal tools for Rasch papers is scarce. Following an extensive search by authors, only the QI checklist and the Rasch Box with 4 items at three levels in the COSMIN (Consensus‐based Standards for selection of health status Measurement Instruments) were found.59 Although, there is a guide with five headings, 22 items and a number of useful references for writing Rasch papers in the Journal of Applied Measurement,58 it has been presented as a general guide and not in the format of a critical appraisal tool. In this study, the QI checklist was applied for assessment of the Rasch parameters in the studies, because of the variety of items and ease of use. QI checklist seems to be a good tool for criticizing Rasch analysis studies, but it is not widely used, so its validity and reliability has not been entirely confirmed. One of the criticisms of this 10-item checklist is an unclear response level for items, and another one is about the method of scoring. However, it seems that there is a requirement for more rigorous critical appraisal tools and guidelines to evaluate the quality of Rasch papers and Rasch parameters. Evidence shows that, few studies have used the QI checklist, and it is recommended to conduct further research on this tool and to identify the strengths and weaknesses of it. Therefore, application of the Rasch model in adolescents’ QoL studies needs a criterion for using necessary parameters, as available evidence indicates that, some papers have only reported several parameters in adolescents’ QoL instruments.

Limitations

There were some limitations in this study. The current systematic review was limited to English language papers, thus information in other languages was not reported in this study. Furthermore, Gray literature, consisting of dissertations and congress abstracts, was not included in the study. Also, we assessed only the presence of Rasch parameters according to the QI checklist without evaluating the quality of parameters’ analysis.

Conclusions

Application of the Rasch model for psychometric assessment of adolescents’ QoL instruments has increased in recent decades. There were 20 types of adolescents’ QoL instruments which were evaluated by the Rasch models. The most commonly used instrument was, KIDSCREEN, administered in different countries. Six Rasch parameters were reported with a higher percentage in the studies, including application of the software program (96.7%), test of item fit to the Rasch model (93.5%), unidimensionality (80.6%), type of the identified mathematical Rasch model (74.1%), threshold (58%) and DIF (54.8%). Our results indicate that although Rasch analysis is a robust method in adolescents’ QoL studies due to documented instruments’ measurement functioning, no access to the strong critical appraisal tools may influence the high power application of it.
  46 in total

Review 1.  The Rasch measurement model in rheumatology: what is it and why use it? When should it be applied, and what should one look for in a Rasch paper?

Authors:  Alan Tennant; Philip G Conaghan
Journal:  Arthritis Rheum       Date:  2007-12-15

Review 2.  Quality of life in older ages.

Authors:  Gopalakrishnan Netuveli; David Blane
Journal:  Br Med Bull       Date:  2008-02-15       Impact factor: 4.291

3.  Development of the Cystic Fibrosis Questionnaire (CFQ) for assessing quality of life in pediatric and adult patients.

Authors:  Bernadette Henry; Pierre Aussage; Cécile Grosskopf; Jean-Marie Goehrs
Journal:  Qual Life Res       Date:  2003-02       Impact factor: 4.147

4.  Can in-the-moment diary methods measure health-related quality of life in Duchenne muscular dystrophy?

Authors:  Paula Bray; Anita C Bundy; Monique M Ryan; Kathryn N North
Journal:  Qual Life Res       Date:  2016-11-03       Impact factor: 4.147

5.  Development and validation of Skindex-Teen, a quality-of-life instrument for adolescents with skin disease.

Authors:  Aimee C Smidt; Jin-Shei Lai; David Cella; Sapna Patel; Anthony J Mancini; Sarah L Chamlin
Journal:  Arch Dermatol       Date:  2010-08

6.  Cross-cultural adaptation and psychometric properties of the Brazilian-Portuguese version of the VSP-A (Vécu et Santé Perçue de l'Adolescent), a health-related quality of life (HRQoL) instrument for adolescents, in a healthy Brazilian population.

Authors:  Mariana T Aires; Pascal Auquier; Stephane Robitail; Guilherme L Werneck; Marie-Claude Simeoni
Journal:  BMC Pediatr       Date:  2011-01-27       Impact factor: 2.125

7.  Field testing of a European quality of life instrument for children and adolescents with chronic conditions: the 37-item DISABKIDS Chronic Generic Module.

Authors:  Marie-Claude Simeoni; Silke Schmidt; Holger Muehlan; David Debensason; Monika Bullinger
Journal:  Qual Life Res       Date:  2007-04-03       Impact factor: 3.440

8.  Recent advances in analysis of differential item functioning in health research using the Rasch model.

Authors:  Curt Hagquist; David Andrich
Journal:  Health Qual Life Outcomes       Date:  2017-09-19       Impact factor: 3.186

9.  Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement.

Authors:  David Moher; Alessandro Liberati; Jennifer Tetzlaff; Douglas G Altman
Journal:  PLoS Med       Date:  2009-07-21       Impact factor: 11.069

Review 10.  Systematic review of health-related quality of life models.

Authors:  Tamilyn Bakas; Susan M McLennon; Janet S Carpenter; Janice M Buelow; Julie L Otte; Kathleen M Hanna; Marsha L Ellett; Kimberly A Hadler; Janet L Welch
Journal:  Health Qual Life Outcomes       Date:  2012-11-16       Impact factor: 3.186

View more
  2 in total

1.  Evaluation of two weight stigma scales in Malaysian university students: weight self-stigma questionnaire and perceived weight stigma scale.

Authors:  Wan Ying Gan; Serene En Hui Tung; Ruckwongpatr Kamolthip; Simin Ghavifekr; Paratthakonkun Chirawat; Ira Nurmala; Yen-Ling Chang; Janet D Latner; Ru-Yi Huang; Chung-Ying Lin
Journal:  Eat Weight Disord       Date:  2022-04-26       Impact factor: 3.008

2.  Cross-cultural adaptation and validation of the Italian Spine Youth Quality of Life (ISYQOL) questionnaire's Arabic version.

Authors:  Salah M Fallatah; Shaker Emam; Ghamid Al-Ghamdi; Faisal Almatrafi
Journal:  Medicine (Baltimore)       Date:  2021-12-10       Impact factor: 1.817

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.