| Literature DB >> 28122727 |
Jürgen Barth1,2, Wout E L de Boer1, Jason W Busse3,4,5, Jan L Hoving6,7, Sarah Kedzia1, Rachel Couban4, Katrin Fischer8, David Y von Allmen1, Jerry Spanjer9,10, Regina Kunz11.
Abstract
OBJECTIVES: To explore agreement among healthcare professionals assessing eligibility for work disability benefits.Entities:
Mesh:
Year: 2017 PMID: 28122727 PMCID: PMC5283380 DOI: 10.1136/bmj.j14
Source DB: PubMed Journal: BMJ ISSN: 0959-8138

Fig 1 Identification of studies assessing inter-rater agreement of evaluation of disability
Methodological quality of included studies
| Raters blinded to findings of others | Risk for order effect (sequence of examination) | Appropriate statistical measure of agreement | |||
|---|---|---|---|---|---|
| de Kort, 1992 | Yes | No risk | Yes | ||
| Dickmann, 2007 | Yes | No risk | NA | ||
| Elder, 1994 | Unclear | No risk | Yes | ||
| Ikezawa, 2010 | Yes | No risk | Yes | ||
| Ingravallo, 2008 | Yes | No risk | Yes | ||
| Lederer, 1998 | Yes | No risk | NA | ||
| Rudbeck, 2011 | Yes | No risk | Yes | ||
| Schellart, 2013 | Unclear | Unclear | Yes | ||
| Schreuder, 2012 | Unclear | No risk | Unclear | ||
| Slebus, 2010 | Yes | No risk | Yes | ||
| Spanjer, 2008 | Yes | No risk | Yes | ||
| Spanjer, 2009 | Unclear | No risk | Yes | ||
| Spanjer, 2010 | Yes | No risk | Yes | ||
| Dell-Kuster, 2014 | NA | NA | Yes | ||
| Lax, 2004 | NA | NA | NA | ||
| Okpaku, 1994 | NA | NA | NA | ||
| Berns, 2007 | Unclear | Yes | Yes | ||
| Chopra, 2002 | Probably yes | No | Yes | ||
| Daradkeh, 1994 | Unclear | Unclear | Yes | ||
| Hannula, 2006 | No | Yes | Yes | ||
| Hill, 1989 | Yes | Unclear | Yes | ||
| Mundo, 2010 | Yes | Unclear | Yes | ||
| Nozu, 1995 | Unclear | Unclear | Yes | ||
Generalisability of study findings to real world of insurance medicine*
| Recruitment strategy (for claimants) | Recruitment success | Verisimilitude | Range of experience in raters | Specific training for work capacity assessment | Training for study purposes | No of cases | No of raters | Generalisability | |
|---|---|---|---|---|---|---|---|---|---|
| de Kort, 1992 | Random sample | NA | Records of real patients | Narrow | Yes | No | 180 | 5 | Yes |
| Dell-Kuster, 2014 | Consecutive sample | >80% | Real patients | Wide | No | No | 3562 | Unclear | Yes |
| Dickmann, 2007 | NA | NA | Video case scenario | Narrow | Yes | No | 1 | 22 | Probably no |
| Elder, 1994 | NA | NA | Written case scenarios | Unclear | Yes | No | 10 | 35 | Probably no |
| Ikezawa, 2010 | NA | NA | Written case scenarios | Wide | Yes | No | 3 | 36 | Probably yes |
| Ingravallo, 2008 | Random sample | >80% | Real patients | Narrow | Yes | Yes | 15 | 16 | Yes |
| Lax, 2004 | Random sample | >80% | Records of real patient | Narrow | Yes | No | 23 | 2 | Yes |
| Lederer, 1998 | Any other recruitment | NA | Records of real patients | Wide | Yes | No | 1 | 103 | Probably yes |
| Okpaku, 1994 | Unclear, not reported | NA | Records of real patients | Narrow | Yes | Yes | 144 | 6 | Probably yes |
| Rudbeck, 2011 | NA | NA | Written case scenarios | Unclear | Yes | No | 8 | 11 | Probably no |
| Schellart, 2013 | NA | NA | Video case with actor | Wide | Yes | Yes | 4 | 40 | Probably no |
| Schreuder, 2012 | NA | NA | Written case scenarios | Unclear | Yes | No | 132 | 5 | Probably no |
| Slebus, 2010 | NA | NA | Written case scenarios | Narrow | Yes | Mixed, unclear | 5 | 51 | Probably no |
| Spanjer, 2008 | Random sample | NA | Records of real patient | Wide | Yes | No | 12 | 12 | Yes |
| Spanjer, 2009 | Random sample | NA | Records of real patients | Narrow | Yes | No | 30 | 27 | Yes |
| Spanjer, 2010 | Any other recruitment | >50-80% | Real patients | Narrow | Yes | No | 62 | 16 | Yes |
NA=not applicable.
*44% of studies rated as generalisable, 19% as probably generalisable, and 37% as probably not generalisable (see appendix 2 for details).
Included studies on evaluation of disability from insurance setting*
| Context of study | Disease/course | Retrospective time frame/prospective time frame for judgment | |
|---|---|---|---|
| de Kort, 1992, Netherlands | Random selection of 180 applicants (from national database of 101 754 individuals) for one of three jobs in public domain (administration, prison security, cleaning, catering) who underwent pre-employment assessment. 90 applicants (30 from each job category) had been judged (temporarily) unfit and 90 cases were used as reference cases with similar diagnosis. Study explores a) agreement among panel physicians and b) agreement between panel and decision of government occupational health and safety service | Mixed (individuals judged fit and unfit for job)/chronic | Unclear, not reported/unclear not reported |
| Dell-Kuster, 2014, Switzerland | Single centre study on 3562 real life disability assessments about reliability of medical experts and family physicians of claimants judging work ability of claimants | Mixed conditions/chronic disease | Long term history/long term ≥6 months |
| Dickmann, 2007, Germany | Fictional medical assessment of female administrative person with depression was videotaped and circulated to 22 psychiatric experts. Experts rated work ability in last job and in any suitable alternative job in context of claim for social security benefits | Mental disorder (depression)/chronic disease | Long term history/long term ≥6 months |
| Elder, 1994, UK | 35 medical experts attending scientific conference for medical experts rated 10 short case histories of real patients who applied for early retirement. Experts decided on acceptance or rejection of early retirement request | Mixed: psychiatric illness; somatic illness; somatic illness with psychiatric comorbities/mixed | Long term history/long term ≥ 6 months |
| Ikezawa, 2010, Canada | In survey with 3 case vignettes from real cases, 36 clinicians made return to work recommendations. Clinicians worked in major rehabilitation facility operated by Alberta Workers Compensation Board | Fracture, dislocation and low back pain (musculoskeletal)/injury or work accident | Long term history/short term <6 months |
| Ingravallo, 2008, Italy | 4 medical commissions with 4 experts each reviewed 16 patients from database of 150 patients with different degrees of narcolepsy | Narcolepsy/chronic disease | Long term history/long term ≥6 months |
| Lax, 2004, US | Single centre study in publicly funded occupational health centre where medical staff regularly evaluate patients for occupational illnesses compared 23 patients with mixed medical diagnoses who had recently undergone IME for evaluation of workers’ compensation claim with findings of health centre staff | Somatic disease, almost all occupational illnesses, including musculoskeletal disorders/injury or work incident | Long term history/long term ≥6 months |
| Lederer, 1998, Germany | Quality assurance study in public administration in Germany, with public health physicians who regularly perform medical expertises on civil servants. These 103 physicians evaluated single claim file and reported their judgment on permanent work disability | Neurological disease (stroke with right sighted hemiplegia/chronic disease | Long term history/long term ≥6 months |
| Okpaku, 1994, US | Independent rehabilitation team who was familiar in social security administration (SSA) assessments re-evaluated 158 adults with mental health problems, by imitating SSA approach. Cases all claimants or recipients of social security benefits. Judgments of rehabilitation team compared with those of SSA team | Mental disorder/chronic disease | Long term history/unclear, not reported, mixed |
| Rudbeck, 2011, Denmark | 11 medical specialists in social medicine rated work ability of eight 8 case histories of real patients | Mental or musculoskeletal; patients with mental problems can also report on back pain/chronic disease | Long term history/mix of short term and long term |
| Schellart, 2013, Netherlands | 43 Dutch insurance physicians applied insurance medicine guidelines for depression to four videos of actor claimants with depression. Insurance physicians used list of functional abilities before and after training to apply guidelines. Subanalysis of randomised controlled trial on training | Mental disorder (depression)/chronic disease | Long term history/long term ≥6 months |
| Schreuder, 2012, Netherlands | 5 occupational physicians assessed readiness to work of employees (n=132) who had been sicklisted for three weeks because of mental or musculoskeletal disorders | Mental and musculoskeletal disorders; other disorders/unclear or no information | Short term/short term ≥6 months |
| Slebus, 2010, Netherlands | 25 insurance physicians compared assessment of work ability in 5 patients with major depressive disorders with and without specific “checklist for work ability.” Subanalysis of randomised controlled trial on training | Mental disorders/chronic disease | Not reported/long term ≥6 months |
| Spanjer, 2008, Netherlands | 12 insurance physicians used disability assessment structured interview reports of 12 claimants with mental or physical disorder. Reports assessed on functional information system and mental ability list | Mental and somatic disorders/chronic disease | Long term history/long term ≥6 months |
| Spanjer, 2009, Netherlands | Insurance physicians (n=27) assessed 30 claimants for disability benefits with musculoskeletal disorders. Different information about claimants was provided: exclusively medical; exclusively functional; mixed | Musculoskeletal disorders/chronic disease | Unclear, not reported, mixed/long term ≥6 months |
| Spanjer, 2010, Netherlands | 16 insurance physicians from Dutch social insurance office trained partly in disability assessment structured interview assessed 62 real cases. Subanalysis of randomised controlled trial on training | Musculoskeletal disorders/chronic disease | Not reported/long term ≥6 months |
*In insurance setting, health professionals make judgments on work disability based on functional limitations, which include normative judgments, often from societal perspective.
Characteristics of studies investigating eligibility for work disability benefits
| Insurance setting (n=16) | Research setting (n=7) | |
|---|---|---|
| Health conditions: | ||
| Mental disorders | 38% | 100% |
| Musculoskeletal disorders | 25% | — |
| Mixed (somatic and mental disorders) | 38% | — |
| Course of disease or injury: | ||
| Acute diseases | — | 14% |
| Acute and chronic diseases | 6% | 28% |
| Chronic diseases | 75% | 43% |
| Chronic injuries | 13% | — |
| No information/unclear | 6% | 14% |
| Composition of patient population: | ||
| Single disorders (such as narcolepsy, stroke, depression, low back pain, psychosis, depression, anxiety, schizophrenia) | 31% | 71% |
| Mixed disorders | 69% | 29% |
| Reference time frame before evaluation for judgments on health condition: | ||
| Short term period | 6% | 71% |
| Long term period | 69% | — |
| Not reported | 25% | 29% |
| Prognostic time frame: | ||
| Short term (<6 months) | 13% | 29% |
| Long term (≥6 months) | 69% | — |
| Mixed | 6% | — |
| Not reported | 13% | 71% |
| Use of tools to facilitate rating of work disability: | ||
| Professional expertise only | 63% | — |
| ≥1 rating or reporting instruments | 37% | 100% |
Outcomes used in insurance setting to assess global rating of disability for work
| Outcome measure | Quantification |
|---|---|
| Global rating of work ability | Scale from 100-0% |
| Work ability | 3 categories: |
| Health related work ability | 4 categories: intact or slightly/much/extremely reduced |
| List of functional abilities | Sum score; range not reported |
| Global rating of work ability | Scale from 100% (status as before depressive disorder) to 0% (“inability to work”) |
| Global rating of fit for work | 3 categories: fit/doubt fit for work/unfit for work |
| Recommend return to work | 3 categories: return to previous work/return to modified work/no return to work |
| Recommend fit for work | 2 categories: yes |
| Readiness and ability of employee to return to work | 2 categories: high |
| Approval or decline of application for early retirement because of ill health | 4 categories: accept/reject/other action/no response |
| Decision on disability benefit | Scale on % disability from 100-0% |
| Approval for social security benefit | Social security administration—2 categories: yes |
| Severity of handicap | 3 categories: no handicap/handicap/severe handicap |
| Agreement among occupational health professional and medical expert on 4 disability items | 3 categories: full/partial/disagreement |
| Reduction in working hours | Hours/day |
Included studies on evaluation of disability in research setting*
| Context of study | Disease/course | Retrospective time frame/prospective time frame for judgment | |
|---|---|---|---|
| Berns, 2007, US | 2 practitioners rated 29 of larger sample with bipolar disorders with newly developed multidimensional scale of independent functioning (MSIF). Study conducted in single centre | Mental (bipolar disorders)/chronic disease | Short term/not reported |
| Chopra, 2002, Australia | 2 clinicians assessed feasibility and reliability of international classification of impairments, disability, and handicap (ICIDH-II) in 20 patients with psychotic disorders. Multicentre study | Mental (psychosis)/acute disease | Short term/short term <6 months |
| Daradkeh, 1994, United Arab Emirates | 2 psychiatrists with experience in rating disability assessment schedule (DAS, based on axis V of ICD-10, with one dimension of work) reviewed 42 psychiatric patients with different informants (such as family). Single centre study | Mixed mental inpatient and outpatients/mixed | Short term/not reported |
| Hannula, 2006, Finland | Group of researchers developed rating scale based on social adjustment scale with focus on social and occupational functioning (SOFAS). Four clinically trained professionals administered rating scale to 39 videotaped interviews of consecutive patients from Helsinki Psychotherapy Study | Mental: anxiety and mood disorders/mixed | Short term/short term <6 months |
| Hill, 1989, UK | Authors developed adult personality functioning assessment (APFA) with work as one subdomain. 3 raters used APFA for assessment 21 audiotaped standardised interviews with client | Mental/chronic disease | Unclear, not reported, mixed/unclear, not reported, mixed |
| Mundo, 2010, Italy | 18 raters assessed 180 inpatients with Kennedy Axis V (K Axis), which is equivalent to global assessment of functioning (GAF). One subscale covers occupational skills | Mixed mental disorders/unclear / no information | Short term/unclear, not reported, mixed |
| Nozu, 1995, Japan | 3 experts assessed schizophrenic outpatients who started occupational therapy at Tokyo Metropolitan Chubu Comprehensive Mental Health Centre with newly developed Work-Personality Insufficiency Rating Scale | Patients with schizophrenia/chronic disease | Unclear/unclear, not reported, mixed |
*In research setting, researchers who develop instruments tend to standardise their research environment when judging occupational functioning. Normative considerations or societal perspective are not part of their judgments.
Outcomes used in research setting to assess global rating of disability for work
| Outcome measure | Quantification |
|---|---|
| Global rating about functioning within work environment | 7 item Likert scale, 1 (normal functioning)-7 (total disability) |
| Adult personality functioning assessment | 6 point scale, 0-5, higher values indicate worse functioning |
| Occupational functioning | 6 point scale, “no dysfunction” to “maximum dysfunction” |
| Occupational functioning | Scale 100-0, higher values indicate better functioning |
| Occupational skills | Scale 100-0, higher values indicate better occupational skills |
| Global rating for remunerative employment | 5 item scale: no to complete or extreme problem |
| Employability | No information |
Reproducibility among experts stratified by level of inter-rater reliability
| Study | Use of rating or reporting instrument | Outcome | Outcome measure and IRR findings | Generalisability to real world disability evaluation |
|---|---|---|---|---|
| Excellent to good | ||||
| Schellart | Yes | Functional work ability | ICC 0.65 | Probably no |
| Slebus | Yes | Functional work ability | ICC 0.64 | Probably no |
| Ikezawa | Yes | Recommend return to work | % agreement 82.4% | Probably yes |
| Spanjer 2008 | Yes | Reduction in working hours | κ 0.8 | Yes |
| Spanjer 2009 | Yes | Reduction in working hours | κ 0.86 | Yes |
| Lederer | NR | Remaining work ability; limitations in work performance (single case) | Frequency of agreement: 91%; 86% | Probably yes |
| Fair to poor | ||||
| De Kort | NR | Fit for work | κ 0.38 | Yes |
| Dickmann | NR | Work ability in last job (single case): <3 hours; 3-6 hours; | Frequency of agreement: 27%; 36%; 37% | Probably no |
| Elder | NR | Early retirement | κ 0.24 | Probably no |
| Ingravallo | NR | Disability benefit | κ −0.10-0.35 | Yes |
| Rudbeck | NR | Health related work ability | κ 0.33 | Probably no |
| Schreuder | NR | Readiness and ability to return to work | κ 0.14 | Probably no |
| Spanjer 2010 | Yes | Reduction in working hours | ICC 0.53 | Yes |
| Dell-Kuster | NR | Work ability: last job; alternative job | Agreement: 51%; 20% | Yes |
| Lax | NR | Agreement on 4 disability items: full; partial; no agreement | Frequency of agreement*: 4%; 34%; 78% | Yes |
| Okpaku | NR | Approval for social security benefits | Frequency of agreement: yes/no decisions 40% | Probably yes |
| Excellent to good | ||||
| Berns | Yes | Functioning in work environment | ICC 0.86 | NA |
| Chopra | Yes | Remunerative employment | κ 0.62 | NA |
| Hannula | Yes | Occupational functioning | ICC 0.91 | NA |
| Hill | Yes | Dysfunctioning in work as social role | ICC 0.76 | NA |
| Mundo | Yes | Occupational skills | ICC 0.75 | NA |
| Nozu | Yes | Employability | ICC 0.88 | NA |
| Fair to poor | ||||
| Daradkeh | Yes | Occupational functioning | κ 0.53 | NA |
NR=not reported; ICC=intraclass correlation; IRR=inter-rater reliability; NA=not applicable.
*Total >100%.