| Literature DB >> 17942384 |
Cornelia M Ruland1, Suzanne Bakken, Jo Røislien.
Abstract
Recently there has been a proliferation of interactive tailored patient assessment (ITPA) tools. However, evidence of the reliability and validity of these instruments is often missing, which makes their value in research studies questionable. Because several of the common methods to evaluate instrument reliability and validity are not applicable to interactive tailored patient assessments, informatics researchers may benefit from some guidance on which methods of reliability and validity assessment they can appropriately use. This paper describes the main differences between interactive tailored patient assessments and assessment instruments based on psychometric, or classical test, theory; it summarizes the measurement techniques normally used to ascertain the validity and reliability of assessment instruments based on psychometric theory; it discusses which methods are appropriate for interactive tailored patient assessments and which are not; and finally, it illustrates the application of some of the feasible techniques with a case study that describes how the reliability and validity of the tailored symptom assessment instrument called Choice were evaluated.Entities:
Mesh:
Year: 2007 PMID: 17942384 PMCID: PMC2047281 DOI: 10.2196/jmir.9.3.e22
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Differences Between Traditional Measurement Instruments and Interactive Tailored Patient Assessments
| Understanding characteristics of populations; generalizability | Understanding characteristics of individuals | |
| Research | Clinical practice; to tailor patient care / advice to each individual | |
| 1. Each subscale measures one latent concept at a time. Different concepts are contained in internally consistent subscales. | May capture patients’ symptom and problem experiences on different dimensions | |
| Every respondent completes more or less the same set of questions. | Every respondent completes a different set of questions, based on initial item selection. | |
| Parsimony: to explain the greatest amount of variance in the concept measured with the fewest numbers of items. | Comprehensiveness: to help patients find a close match between the item description and their actual experience. |
Psychometric concepts, definitions, and methods
| Internal consistency | Average intercorrelation among items | Cronbach alpha, split-half | Inappropriate due to highly variable number of assessment items among respondents |
| Test-retest | Association between measurements on the same respondents at multiple points in time using the same version of the measurement instrument; coefficient of stability | Correlation between two measurements | Inappropriate if concept being measured changes over time; otherwise appropriate. Even small changes over time might fundamentally change the patient’s response to the interactive tailored patient assessment. |
| Alternate forms | Association between measurements on the same respondents at multiple points in time using two forms of the “same” measurement instrument; coefficient of equivalence | Correlation between two measurements | Inappropriate if concept being measured changes over time; otherwise appropriate. Due to the nature of the interactive tailored patient assessment, with possibly detailed items, coming up with an alternate form might be difficult. |
| Content | Extent to which a specific measure depicts a domain of content | Literature review, expert review | Appropriate |
| Criterion-related | Extent of correlation between the test and the criterion | Concurrent validity (test and criterion at same point in time); predictive (test and criterion at a future point in time) | Appropriate. Be aware that it might be difficult to find a sensible criterion when many issues are addressed simultaneously, as often is the case. |
| Construct | Extent to which a particular measure performs in accordance with theoretically derived hypotheses concerning the concepts (or constructs) being measured | Factor analysis, convergent validation, discriminant validation, known group differences, multitrait-multimethod matrix | Factor analysis is often inappropriate due to variable number of assessment items among respondents, or the large sample size that otherwise would be required. Other methods are usually appropriate. |