| Literature DB >> 25492701 |
Emmanuelle Anthoine1,2,3, Leïla Moret4,5, Antoine Regnault6,7, Véronique Sébille8,9, Jean-Benoit Hardouin10,11.
Abstract
PURPOSE: New patient reported outcome (PRO) measures are regularly developed to assess various aspects of the patients' perspective on their disease and treatment. For these instruments to be useful in clinical research, they must undergo a proper psychometric validation, including demonstration of cross-sectional and longitudinal measurement properties. This quantitative evaluation requires a study to be conducted on an appropriate sample size. The aim of this research was to list and describe practices in PRO and proxy PRO primary psychometric validation studies, focusing primarily on the practices used to determine sample size.Entities:
Mesh:
Year: 2014 PMID: 25492701 PMCID: PMC4275948 DOI: 10.1186/s12955-014-0176-2
Source DB: PubMed Journal: Health Qual Life Outcomes ISSN: 1477-7525 Impact factor: 3.186
Figure 1Flow chart of selection process.
Figure 2Repartition of the articles according to thresholds recommended in the literature. a: According to thresholds of subject to item ratio. b: According to thresholds of sample size.
Psychometric properties definitions in the field of health-related assessment
|
|
|
|---|---|
| Content validity | The ability of an instrument to reflect the domain of interest and the conceptual definition of a construct. In order to claim content validity, there is no formal statistical testing, but item generation process should include a review of published data and literature, interviews from targeted patients and an expert panel to approach item relevance [ |
| Face validity | The ability of an instrument to be understandable and relevant for the targeted population. It concerns the critical review of an instrument after it has been constructed and generally includes a pilot testing [ |
| Construct validity | The ability of an instrument to measure the construct that it was designed to measure. A hypothetical model has to be formed, the constructs to be assessed have to be described and their relationships have to be postulated. If the results confirm prior expectations about the constructs, the instrument may be valid [ |
| Convergent validity | Involves that items of a subscale correlate higher than a threshold with each other, or with the total sum-score of their own subscale [ |
| Divergent validity | Involves that items within any one subscale should not correlate too highly with external items or with the total sum-score of another subscale [ |
| Known group validity | The ability of an instrument to be sensitive to differences between groups of patients that may be anticipated to score differently in the predicted direction [ |
| Criterion validity | The assessment of an instrument against the true value, or a standard accepted as the true value. It can be divided into concurrent validity and predictive validity [ |
| Concurrent validity | The association of an instrument with accepted standards [ |
| Predictive validity | The ability of an instrument to predict future health status or test results. Future health status is considered as a better indicator than the true value or a standard [ |
| Reliability | Determining that a measurement yields reproducible and consistent results [ |
| Internal consistency | The ability of an instrument to have interrelated items [ |
| Repeatability | (Test-retest reliability) The ability of the scores of an instrument to be reproducible if it is used on the same patient while the patient’s condition has not changed (measurements repeated over time) [ |
| Responsiveness | The ability of an instrument to detect change when a patient’s health status improves or deteriorates [ |
General description of journals and scales
|
| ||
|---|---|---|
|
| ||
| Main topic of journal | Clinical | 66.7% (76) |
| Methodological | 8.8% (10) | |
| Psychological | 17.5% (20) | |
| Other | 7.0% (8) | |
| Impact factor | Mean (SD); median; range | 2.53 (1.55); 2.25; [0.38; 11.01] |
|
| ||
| Concept of interest | Quality of life | 29.8% (34) |
| Behavior/Attitude | 21.1% (24) | |
| Social psychological functioning | 15.8% (18) | |
| Satisfaction | 13.1% (15) | |
| Symptom severity | 10.6% (12) | |
| Knowledge/Literacy | 6.1% (7) | |
| Physical functioning | 3.5% (4) | |
| Number of dimensions | Mean (SD); median; range | 3.7 (2.7); 3; [1; 13] |
| Number of items per dimension | Mean (SD); median; range | 11.8 (32); 7; [1; 340] |
| Type of measurement scale | Dichotomous | 2.6% (3) |
| Likert ordinal | 84.2% (96) | |
| Nominal | 3.5% (4) | |
| Numeric rating scale | 6.2% (7) | |
| Several kinds | 0.9% (1) | |
| Not mentioned | 2.6% (3) |
Data are percentages (n) and otherwise indicated.
Standard deviation (SD).
Sample size determination
|
| ||
|---|---|---|
| Sample size included | Mean (SD); Median; range | 509 (1094); 207; [24; 7906] |
| Subject to item ratio | Mean (SD); Median; range | 28 (67); 10; [1; 527] |
|
| Yes | 9.6% (11) |
| Justification | 4.4% (5) | |
| No | 86.0% (98) |
Standard deviation (SD).