| Literature DB >> 31801463 |
C M W Gaasterland1, M C Jansen van der Weide2, K C B Roes3, J H van der Lee2.
Abstract
BACKGROUND: Goal Attainment Scaling (GAS) is an instrument that is intended to evaluate the effect of an intervention by assessing change in daily life activities on an individual basis. However, GAS has not been validated adequately in an RCT setting. In this paper we propose a conceptual validation plan of GAS in the setting of rare disease drug trials, and describe a hypothetical trial where GAS could be validated.Entities:
Keywords: Drug trials; Goal attainment scaling; Rare diseases; Validation
Year: 2019 PMID: 31801463 PMCID: PMC6894223 DOI: 10.1186/s12874-019-0866-x
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Example of GAS goal topics and scores in (fictitious) patients with mitochondrial disease
| Goals | -2 score | 0 score (expected outcome level) | + 2 score |
|---|---|---|---|
| Being able to cook for me and my partner | The patient cooks once a week or less | The patient cooks three times a week | The patient cooks five times a week or more |
| Being able to watch a film at night without falling asleep | The patient can watch a series episode of 25 min without falling asleep | The patient can watch a series episode of 50 min without falling asleep | The patient can watch a film of 2 h without falling asleep |
| Being able to dress myself independently | The patient needs help with putting on a shirt and pants | Patient can put on a shirt, but needs help with pants | The patient can dress him/herself without any help |
Fig. 1Visualization of the differences between GAS and classical measurement instruments
The differences in properties of between GAS compared to other and classical measurement instruments
| Classical measurement instruments | Goal Attainment Scaling |
|---|---|
| Items and scoring options are fixed | Items (goals) and scoring options (goal attainment levels) are different for each patient |
| Instrument can be used for many patients in many settings | Patients make their own personalized instrument for one occasion together with their clinician |
| Repeated measurements are possible, for example before and after an intervention, that can be compared | Only one measurement is possible, after an intervention or after a defined period of time |
| The score is ‘anchored’, e.g. to a general population reference range, or to a cut-off score for a diagnosis, and is interpretable on individual level | The score is not ‘anchored’ and can only be interpreted as a comparison of group means in a randomized setting |
| One or more unidimensional scales quantifying underlying constructs | Quantifies change due to an intervention (or over time) that is relevant to patients |
Proposal for ways to assess aspects of measurement properties of GAS in a blinded placebo-controlled RCT
| COSMIN Domain | Quality aspect | COSMIN definition | Definition of quality aspect as applicable to GAS | Measurable on GAS step | How to assess on individual level*** | How to assess on trial level |
|---|---|---|---|---|---|---|
| Validity | Content validity | The degree to which the content of an HR-PRO instrument is an adequate reflection of the construct to be measured | The degree to which goals capture patient outcomes that can be expected to be influenced by treatment. | Selecting goals [ | Evaluation of the goals by independent clinical experts**** | |
| Content validity | The degree to which the content of an HR-PRO instrument is an adequate reflection of the construct to be measured | Levels are determined consistently, ensuring that they are centered around an appropriate value of zero and proportionally ordered. | Defining the levels [ | Evaluation of the goal achievement levels by independent clinical experts**** | ||
| Construct validity | The degree to which the scores of an HR-PRO instrument are consistent with hypotheses (for instance with regard to internal relationships, relationships to scores of other instruments, or differences between relevant groups) based on the assumption that the HRPRO instrument validly measures the construct to be measured | The underlying construct (one or more than one for every patient) is the attainment of goals that are chosen to reflect the effect that is expected from an intervention on a functional level. Construct validity is the degree to which the scores of GAS are consistent with hypotheses. | Assessing goal attainment [ | Hypothesis testing: Comparison of scores on individual goals with change scores on other measurement instruments that measure a construct that is similar to one or more of the chosen goals, such as a function scale or a balance test* | Hypothesis testing: Comparison between two groups. It is expected that the mean GAS scores of two randomized groups receiving effective or non-effective interventions will differ in favor of the group receiving the effective intervention. | |
| Reliability | Intra rater reliability | The extent to which scores for patients who have not changed are the same for repeated measurement by the same persons on different occasions | Goal attainment is assessed consistently when performed by the same rater for the same patient in the same condition repeatedly (possibly hypothetically). | Assessing goal attainment [ | Repeated assessment of video recording of the assessment of the goal attainment level ** by the same rater | |
| Inter rater reliability | The extent to which scores for patients who have not changed are the same for repeated measurement by different persons on the same occasions | Goal attainment is assessed consistently when performed by different raters for the same patient in the same condition. | Assessing goal attainment [ | Assessment of the video recording of the assessment of the goal attainment level** by one or more independent raters | ||
| Inter trial reliability | Goal attainment scaling leads to consistent results when implemented in repeated implementations of the same trial. | Assessing goal attainment [ | Replication between trials or within one trial (with a split-half design); comparison of mean differences between intervention groups |
It is not feasible to compare all chosen goals with other measurement instruments, as goals may vary widely in a heterogeneous population
** In the video recording the goal score should not be made explicit
*** The score of GAS is only a change score, and cannot be interpreted causally at the individual level. However, content validity of GAS can only be assessed on an individual level, since every patient individually chooses his or her own goals. The same goes for reliability and aspects of our proposed construct validity: since all goals differ on an individual level, this has to be assessed at the individual level, in order to ensure the validity and interpretability of GAS on group level
**** According to the original COSMIN methodology, this should be done very extensively with qualitative interviews and/or focus groups. However, in the setting of rare diseases, this is not feasible
GAS might be validated by using it in a double-blind randomized clinical trial with some design additions for validation, for example in a placebo-controlled trial investigating a potential new drug for mitochondrial disease. We will describe a hypothetical trial, that serves as an example to further clarify our validation proposal. In this hypothetical trial protocol, 80 patients are randomly allocated to an experimental or a placebo arm. At baseline the goals are selected and the levels of attainment are defined by each patient together with their treating physician. The expected effect of the intervention and the relevance of the goals for the patient are the main leading factors for this selection. Additional baseline measurements contain the Mitochondrial Disease Adult Scale (NMDAS), a semi-quantitative clinical rating scale that monitors the extensive clinical spectrum of mitochondrial disease [ For the evaluation of content validity, an external expert on mitochondrial diseases with access to the clinical files of the patients evaluates the content validity of the selected goals and of the attainment levels set at baseline. For the selected goals it is important that they align with the severity of the patient’s disease and impairments. Also, there has to be a relation between the chosen goals and the expected effect of the intervention. The validity of the selected goals and the attainment levels each can be scored on a 5-point scale, 1 indicating low validity and 5 indicating high validity. A mean score of for example 3.5 or more can be considered to denote adequate content validity. Reliability is addressed as follows: The videotaped assessment of the goal attainment is evaluated by two independent assessors to measure the inter-rater reliability [ To assess construct validity, correlation coefficients are calculated between the GAS scores and the change scores from baseline to follow-up of the NMDAS and the BBS. We expect that there will be a correlation between the goal attainment scores of the patients, and the NMDAS and BBS, since these two validated measurement instruments are often included in studies on mitochondrial disease. The constructs that they measure will probably also be reflected by at least some of the goals chosen by patients with mitochondrial disease. We also expect that GAS may measure some other aspects of the disease that are not captured within these two instruments. Therefore, a moderate correlation is expected (rho between 0.3 and 0.5) between the GAS scores and the change scores of the related other measurement instruments. This will show that GAS measures aspects that are related to the disease, but also something else that is not yet captured by standard measurement instruments. When there is a statistically significant difference in GAS scores between the placebo group and the experimental group, this is considered as an indication of the construct validity of GAS on trial level, and of the efficacy of the new drug. |