Literature DB >> 17484039

Evaluating measurement equivalence using the item response theory log-likelihood ratio (IRTLR) method to assess differential item functioning (DIF): applications (with illustrations) to measures of physical functioning ability and general distress.

Jeanne A Teresi1, Katja Ocepek-Welikson, Marjorie Kleinman, Karon F Cook, Paul K Crane, Laura E Gibbons, Leo S Morales, Maria Orlando-Edelen, David Cella.   

Abstract

BACKGROUND: Methods based on item response theory (IRT) that can be used to examine differential item functioning (DIF) are illustrated. An IRT-based approach to the detection of DIF was applied to physical function and general distress item sets. DIF was examined with respect to gender, age and race. The method used for DIF detection was the item response theory log-likelihood ratio (IRTLR) approach. DIF magnitude was measured using the differences in the expected item scores, expressed as the unsigned probability differences, and calculated using the non-compensatory DIF index (NCDIF). Finally, impact was assessed using expected scale scores, expressed as group differences in the total test (measure) response functions.
METHODS: The example for the illustration of the methods came from a study of 1,714 patients with cancer or HIV/AIDS. The measure contained 23 items measuring physical functioning ability and 15 items addressing general distress, scored in the positive direction.
RESULTS: The substantive findings were of relatively small magnitude DIF. In total, six items showed relatively larger magnitude (expected item score differences greater than the cutoff) of DIF with respect to physical function across the three comparisons: "trouble with a long walk" (race), "vigorous activities" (race, age), "bending, kneeling stooping" (age), "lifting or carrying groceries" (race), "limited in hobbies, leisure" (age), "lack of energy" (race). None of the general distress items evidenced high magnitude DIF; although "worrying about dying" showed some DIF with respect to both age and race, after adjustment.
CONCLUSIONS: The fact that many physical function items showed DIF with respect to age, even after adjustment for multiple comparisons, indicates that the instrument may be performing differently for these groups. While the magnitude and impact of DIF at the item and scale level was minimal, caution should be exercised in the use of subsets of these items, as might occur with selection for clinical decisions or computerized adaptive testing. The issues of selection of anchor items, and of criteria for DIF detection, including the integration of significance and magnitude measures remain as issues requiring investigation. Further research is needed regarding the criteria and guidelines appropriate for DIF detection in the context of health-related items.

Entities:  

Mesh:

Year:  2007        PMID: 17484039     DOI: 10.1007/s11136-007-9186-4

Source DB:  PubMed          Journal:  Qual Life Res        ISSN: 0962-9343            Impact factor:   4.147


  15 in total

1.  Differential item functioning in a Spanish translation of the PTSD checklist: detection and evaluation of impact.

Authors:  Maria Orlando; Grant N Marshall
Journal:  Psychol Assess       Date:  2002-03

2.  Test bias in a cognitive test: differential item functioning in the CASI.

Authors:  Paul K Crane; Gerald van Belle; Eric B Larson
Journal:  Stat Med       Date:  2004-01-30       Impact factor: 2.373

3.  Demographic variation in SF-12 scores: true differences or differential item functioning?

Authors:  John A Fleishman; William F Lawrence
Journal:  Med Care       Date:  2003-07       Impact factor: 2.983

Review 4.  Different approaches to differential item functioning in health applications. Advantages, disadvantages and some neglected topics.

Authors:  Jeanne A Teresi
Journal:  Med Care       Date:  2006-11       Impact factor: 2.983

Review 5.  Differential item functioning on the Mini-Mental State Examination. An application of the Mantel-Haenszel and standardization procedures.

Authors:  Neil J Dorans; Edward Kulick
Journal:  Med Care       Date:  2006-11       Impact factor: 2.983

6.  Measurement in a multi-ethnic society. Overview to the special issue.

Authors:  Jeanne A Teresi; Anita L Stewart; Leo S Morales; Sidney M Stahl
Journal:  Med Care       Date:  2006-11       Impact factor: 2.983

Review 7.  Item and scale differential functioning of the Mini-Mental State Exam assessed using the Differential Item and Test Functioning (DFIT) Framework.

Authors:  Leo S Morales; Claudia Flowers; Peter Gutierrez; Marjorie Kleinman; Jeanne A Teresi
Journal:  Med Care       Date:  2006-11       Impact factor: 2.983

8.  Modern psychometric methods for detection of differential item functioning: application to cognitive assessment measures.

Authors:  J A Teresi; M Kleinman; K Ocepek-Welikson
Journal:  Stat Med       Date:  2000 Jun 15-30       Impact factor: 2.373

9.  Test of item-response bias in the CES-D scale. experience from the New Haven EPESE study.

Authors:  S R Cole; I Kawachi; S J Maller; L F Berkman
Journal:  J Clin Epidemiol       Date:  2000-03-01       Impact factor: 6.437

10.  Depressive symptoms of whites and African Americans aged 60 years and older.

Authors:  J J Gallo; L Cooper-Patrick; S Lesikar
Journal:  J Gerontol B Psychol Sci Soc Sci       Date:  1998-09       Impact factor: 4.077

View more
  30 in total

1.  Occurrences and sources of Differential Item Functioning (DIF) in patient-reported outcome measures: Description of DIF methods, and review of measures of depression, quality of life and general health.

Authors:  Jeanne A Teresi; Mildred Ramirez; Jin-Shei Lai; Stephanie Silver
Journal:  Psychol Sci Q       Date:  2008

2.  Analysis of differential item functioning in the depression item bank from the Patient Reported Outcome Measurement Information System (PROMIS): An item response theory approach.

Authors:  Jeanne A Teresi; Katja Ocepek-Welikson; Marjorie Kleinman; Joseph P Eimicke; Paul K Crane; Richard N Jones; Jin-Shei Lai; Seung W Choi; Ron D Hays; Bryce B Reeve; Steven P Reise; Paul A Pilkonis; David Cella
Journal:  Psychol Sci Q       Date:  2009

3.  Differential item functioning and health assessment.

Authors:  Jeanne A Teresi; John A Fleishman
Journal:  Qual Life Res       Date:  2007-04-19       Impact factor: 4.147

4.  Are comparisons the answer to understanding behavioral aspects of aging in racial and ethnic groups?

Authors:  Keith E Whitfield; Jason C Allaire; Rhonda Belue; Christopher L Edwards
Journal:  J Gerontol B Psychol Sci Soc Sci       Date:  2008-09       Impact factor: 4.077

Review 5.  Modifying measures based on differential item functioning (DIF) impact analyses.

Authors:  Jeanne A Teresi; Mildred Ramirez; Richard N Jones; Seung Choi; Paul K Crane
Journal:  J Aging Health       Date:  2012-03-15

6.  Measurement invariance of the SF-12 across European-American, Latina, and African-American postpartum women.

Authors:  Tamer F Desouky; Pablo A Mora; Elizabeth A Howell
Journal:  Qual Life Res       Date:  2012-07-25       Impact factor: 4.147

7.  Do measures of depressive symptoms function differently in people with spinal cord injury versus primary care patients: the CES-D, PHQ-9, and PROMIS®-D.

Authors:  Karon F Cook; Michael A Kallen; Charles Bombardier; Alyssa M Bamer; Seung W Choi; Jiseon Kim; Rana Salem; Dagmar Amtmann
Journal:  Qual Life Res       Date:  2016-07-14       Impact factor: 4.147

8.  Language-related differential item functioning between English and German PROMIS Depression items is negligible.

Authors:  H Felix Fischer; Inka Wahl; Sandra Nolte; Gregor Liegl; Elmar Brähler; Bernd Löwe; Matthias Rose
Journal:  Int J Methods Psychiatr Res       Date:  2016-10-16       Impact factor: 4.035

9.  Statistical approaches to harmonize data on cognitive measures in systematic reviews are rarely reported.

Authors:  Lauren E Griffith; Edwin van den Heuvel; Isabel Fortier; Nazmul Sohel; Scott M Hofer; Hélène Payette; Christina Wolfson; Sylvie Belleville; Meghan Kenny; Dany Doiron; Parminder Raina
Journal:  J Clin Epidemiol       Date:  2014-12-08       Impact factor: 6.437

10.  Differential item functioning of the Boston Naming Test in cognitively normal African American and Caucasian older adults.

Authors:  Otto Pedraza; Neill R Graff-Radford; Glenn E Smith; Robert J Ivnik; Floyd B Willis; Ronald C Petersen; John A Lucas
Journal:  J Int Neuropsychol Soc       Date:  2009-07-02       Impact factor: 2.892

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.