Iosief Abraha, Alessandro Montedori, Diego Serraino, Massimiliano Orso, Gianni Giovannini, Valeria Scotti, Annalisa Granata, Francesco Cozzolino, Mario Fusco, Ettore Bidoli.
Abstract
OBJECTIVE: To define the accuracy of administrative datasets in identifying primary diagnoses of breast cancer based on the International Classification of Diseases (ICD) 9th or 10th revision codes.
Keywords: accuracy; administrative database; breast cancer; sensitivity and specificity; systematic review; validity
Year: 2018 PMID: 30037859 PMCID: PMC6059263 DOI: 10.1136/bmjopen-2017-019264
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1. Study screening process.
Characteristics of included studies
| First author, year of publication | Period of data collection | Country | Records evaluated (N) | Source population | Type of administrative data | Diagnostic codes | Algorithm | Reference standard |
| Fisher | 1984–1985 | USA | 33 cases (any position); 24 cases (first position) from each of 239 hospitals | All National Medical beneficiaries. | Medicare claim database: inpatient hospital discharge. | 174–174.9 | (a) Diagnosis in any position. | Medical records review |
| McBean | 1986–1987 | USA | 5744 cases | Persons 65 years of age and older living in the five states participating in the SEER Program. | Medicare claim database: inpatient hospital data. | 174–174.9 | New cases of breast cancer with diagnostic codes in any position. | Cancer Registry (SEER) |
| Solin | 1988–1989 | USA | 469 cases | Women aged ≥21 years enrolled in US Healthcare in the southeastern Pennsylvania region. | Claims database included inpatient hospital stays, short procedure unit stays and professional services. Each claim comprised an ICD-9 diagnosis code and a CPT-4 procedure code. | 174–174.9; 233.0 | (A) Initial algorithm based on new case of breast carcinoma+one or more of the following: (1) mastectomy, (2) partial mastectomy with lymphadenectomy, (3) excision, breast biopsy or partial mastectomy+lymphadenectomy, (4) excision, breast biopsy or partial mastectomy+the diagnosis of carcinoma of the breast, (5) excision, breast biopsy or partial mastectomy followed by radiation therapy treatment or (6) excision, breast biopsy or partial mastectomy followed by chemotherapy treatment. | Medical records review |
| Warren | 1989 | USA | 3454 cases | All women aged 65+ years with one or more hospitalisations with a diagnosis of breast cancer in Medicare. | Medicare Hospital Inpatient (ICD-9-CM). | 174–174.9; 233.0 | (i) Any hospitalisation breast cancer (ICD-9-CM 174–174.9 and 233.0) as principal diagnosis. | Cancer Registry (SEER) |
| Solin | 1993–1994 | USA | 177 cases | All women aged ≥65 years enrolled in the US Healthcare (Pennsylvania and New Jersey). | Claims database included hospital inpatient, short procedure unit stays and professional services (ICD-9 diagnosis code, and the CPT-4 procedure code). | 174–174.9; 233.0 | This study was performed to evaluate prospectively a previously published algorithm | Medical records review |
| McClish | 1986–1989 | USA | 3690 cases | All residents aged 65+ years diagnosed with breast cancer (Virginia). | MEDPAR (Medicare) inpatient hospital claim database (ICD-9-CM). | 174–174.9; 233.0 | Incident cases of breast cancer ICD-9-CM 174; V174.9; 233 and 233.0. | Cancer Registry |
| Cooper | 1984–1993 | USA | 71 862 cases | All women aged 64+ years with breast cancer (Atlanta, Detroit, Seattle-Puget Sound, San Francisco Oakland, Connecticut, Hawaii, Iowa, New Mexico and Utah). | (1) MEDPAR (Medicare): inpatient hospital claim database (ICD-9-CM and specific procedural codes ICD-9-CM and HCPCS/CPT-4). | 174–174.9 | | Cancer Registry (SEER) |
| Warren | 1992 | USA | Women residing in the SEER states n=659 260; cases=6784. | All Medicare eligible women residing in one of five SEER states who were age 65 years and older as of 1 January 1992. | Medicare inpatient and physician claim database (ICD-9). | 174–174.9; 233.0 | | Cancer Registry (SEER) |
| Leung | 1994–1996 | USA | 1033 cases | All women aged 21 years or older who were enrolled in Health Net (California). | Claims database includes claims received for inpatient hospital stays, short procedure unit stays and professional services (code ICD-9-CM). | 174–174.9; 233.0 | Basic breast cancer diagnosis and one of the following: (1) mastectomy; (2) partial mastectomy with lymphadenectomy; (3) excision, breast biopsy or partial mastectomy+lymphadenectomy; (4) excision, breast biopsy or partial mastectomy+diagnosis of carcinoma; (5) excision, breast biopsy or partial mastectomy followed by radiation therapy or (6) excision, breast biopsy or partial mastectomy followed by chemotherapy. | Medical chart review |
| Freeman | 1990–1992 | USA | 7464 cases; 1415 controls | Females aged 65–74 years (in 1992) diagnosed with breast cancer (San Francisco/Oakland, Detroit, Atlanta and Seattle and the states of Connecticut, Iowa, New Mexico, Utah and Hawaii). | ICD-9 inpatient record; outpatient record; physician claim (Medicare). | 174–174.9; 233.0; V103 | Model 1 | Cancer Registry (SEER) |
| Wang | 1989–1991 | USA | 8872 cases | All women aged 20 years and older who were enrolled in either Medicaid or Medicare and PAAD (New Jersey State). | Medicaid inpatient files. | ICD-9 code: not reported | New cases of breast cancer: | Cancer registry |
| Koroukian | 1997–1998 | USA | 2635 incident cases | Women aged 40 years or older (Ohio). | Medicaid claims and enrolment files. ICD-9-CM. | 174–174.9; 233.0 | Incident of breast cancer (ICD-9 174.0–174.9 233.0) and combinations of diagnosis and procedures codes (chemotherapy or radiation therapy, mastectomy, lumpectomy). | Cancer Registry (OCISS) |
| Ganry | 1998 | France | 198 incident cases | All women aged 15 years or older who were diagnosed or treated (in the Amiens University Hospital and five general hospitals) of the Somme area. | French hospital database adapted from the Diagnosis Related Group (DRG). | ICD-9 code: not reported | New case of breast cancer—at least one of the following criteria: (a) breast cancer as primary diagnosis, alone or with (i) mastectomy; (ii) partial mastectomy with lymphadenectomy or (iii) excision, breast biopsy or partial mastectomy for procedures; (b) breast cancer as secondary diagnosis, with (i) chemotherapy as principal diagnosis or (ii) without specific procedures (excluding prevalent cases: women with history of breast cancer between 1991 and 1997). | Cancer registry (French Somme Area) |
| Nattinger | Validation set: 1994; training set: 1995 | USA | 7607 cases and 120 317 controls | Training set: claims from 7700 SEER-Medicare breast cancer subjects (age 65+ years) diagnosed in 1995, and 124 884 controls. Validation set: claims from 7607 | Random sample; ICD-9 inpatient record; outpatient record; physician claim (Medicare). | 174–174.9; 233.0 | Four-step algorithm: | Cancer Registry (SEER) |
| Penberthy | 1995 | USA | 249 cases | Women aged 65+ years with breast cancer diagnosis. | (a) Inpatient Medicare; (b) inpatient or Part B claims. | ICD-9 174 | Six case definitions: A1) diagnosis first position; A2) inpatient diagnosis in any position; A3) inpatient diagnosis in any position+inpatient surgical procedure; B1) inpatient diagnosis in any position+inpatient surgical procedure; B2) inpatient diagnosis in any position+inpatient surgical procedure OR a diagnostic procedure+diagnosis+a surgery or chemotherapy or radiation therapy procedure in an outpatient or physician office record within 4 months of a diagnostic procedure; B3) inpatient diagnosis in any position OR a diagnosis+a surgery or chemotherapy or radiation therapy procedure in an outpatient or physician office record. | Cancer Registry (Virginia State); medical chart review |
| Setoguchi | 1997–2000 | USA | 2004 cases | Subjects aged 65+ years Medicare recipients in Pennsylvania. | Medicare inpatient hospital claim and drug benefit programme data. | Unclear | Four algorithms based on the combination of the following: (a) ICD-9 diagnosis codes ( | Cancer Registry (Pennsylvania |
| Baldi | 2000 training set, 2001 validation set | Italy | 925 cases | All residents in Piedmont region. | Regional inpatient administrative database. | 174–174.9; 233.0 | Algorithms based on combination between (i) ICD-9-CM diagnosis breast cancer (invasive 174.0–174.9, in situ 233.0); and ICD-9-CM procedures code incisional breast biopsy: 85.12 excision or destruction of breast tissue: 85.20–85.25 subcutaneous mastectomy: 85.33–85.36 mastectomy: 85.41–85.48 | Cancer Registry (Piedmont Region) |
| Couris | 2002 | France | 995 cases | Women aged 20 years or older living in one of the nine French districts covered by a cancer registry in 2002. | Inpatient hospital administrative data (French National Institute of | C50.0 to C50.9 | (a) Principal diagnosis for invasive breast cancer—ICD-10 codes C50.0 to C50.9; (b) principal diagnosis+specific surgery procedures. | French cancer registries |
| Yuen | 2002–2005 | Italy | 11 615 cases | Women aged 20 years or older with incident breast cancer (Emilia-Romagna region). | Regional administrative database (Hospital discharge files). | 174–174.9; 233.0 | Women having a diagnosis code for cancer as well as a principal or secondary surgical code for lumpectomy or mastectomy: principal or secondary procedure indicating excision or destruction of breast tissue (ICD-9-CM code 85.20–85.25) or mastectomy (ICD-9-CM code 85.41–85.48); and principal or secondary diagnosis of carcinoma in situ of the breast (ICD-9-CM code 233.0) or malignant neoplasm of the breast (ICD-9-CM code 174.0–174.9). | Cancer registry (AIRTUM) |
| Kemp | 2004–2008 | Australia | 2039 women with invasive breast tumour | Women aged 45+ years who had completed breast cancer-related items in the baseline survey of the 45 and up study (New South Wales). | i) Administrative hospital separations records (ICD-10-AM); ii) outpatient medical service claims; iii) prescription medicines claims and iv) the 45 and up study baseline survey. | C50.0 to C50.9 | Principal inpatient diagnosis of invasive breast cancer using ICD-10- AM codes C50.0-C50.9. | Cancer Registry |
| Sato | 2011 | Japan | 50 056 women included in the study cohort (633 with breast cancer) | Women with no prior cancer-related history, from the claims data at a single institution between 1 January and 31 December 2011. | ICD for oncology, third edition (ICD-O-3): topography code of breast cancer (C500 to C506, C508, C509). | C50.0 to C50.9 | 14 definitions starting from (1) breast cancer alone and subsequent addition of (2) diagnosis code related to breast cancer (3) diagnostic imaging code (4) biopsy code (5) marker test code (6) surgery code (7) chemotherapy code (8) medication code (9) radiation procedure code (10) the other code related to breast cancer (11) diagnosis code related to breast cancer or marker test code (12) surgery, chemotherapy, medication or radiation procedure code (13) diagnosis code related to the breast cancer, marker test code, surgery, chemotherapy, medication or radiation procedure code (14) ≥3 diagnoses of breast cancer. | Cancer Registry |
AIRTUM, Associazione Italiana dei Registri Tumori; CPT-4, Current Procedural Terminology-4; HCPCS, Healthcare Common Procedure Coding System; ICD, International Classification of Diseases; MEDPAR, Medicare Annual Demographic Files, the Medicare Provider Analysis and Review; OCISS, Ohio Cancer Incidence Surveillance System; SEER, Surveillance, Epidemiology, and End Results Program.
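Several of the algorithms in the table above pair a breast cancer diagnosis code with a breast surgery procedure code. A minimal Python sketch of such a rule, using the ICD-9-CM ranges listed in the Baldi and Yuen rows, might look as follows. The function names and the input shape (lists of dotted code strings for one hospital record) are illustrative assumptions and do not come from any of the included studies.

```python
def _in_range(code: str, low: float, high: float) -> bool:
    """Return True if a dotted numeric code such as '85.21' lies in [low, high]."""
    try:
        value = float(code)
    except ValueError:
        # Non-numeric codes (for example V codes) never match a numeric range.
        return False
    return low <= value <= high


def has_breast_cancer_diagnosis(diagnoses: list[str]) -> bool:
    """ICD-9-CM 174.0-174.9 (invasive) or 233.0 (in situ), in any position."""
    return any(_in_range(d, 174.0, 174.9) or d == "233.0" for d in diagnoses)


def has_breast_surgery(procedures: list[str]) -> bool:
    """ICD-9-CM procedures 85.20-85.25 (excision) or 85.41-85.48 (mastectomy)."""
    return any(
        _in_range(p, 85.20, 85.25) or _in_range(p, 85.41, 85.48)
        for p in procedures
    )


def is_candidate_case(diagnoses: list[str], procedures: list[str]) -> bool:
    """Flag a record only when both a diagnosis and a surgery code are present."""
    return has_breast_cancer_diagnosis(diagnoses) and has_breast_surgery(procedures)


if __name__ == "__main__":
    # Hypothetical records, for illustration only.
    print(is_candidate_case(["174.4"], ["85.21"]))  # diagnosis plus excision
    print(is_candidate_case(["174.4"], ["45.13"]))  # diagnosis without breast surgery
```

Requiring a procedure code in addition to the diagnosis narrows the set of flagged records, which is the trade-off the diagnosis-plus-procedure algorithms in the table are built around.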
Accuracy results by initial algorithm in the 21 included studies
| Study ID | Initial algorithm | Sensitivity (95% CI) | Specificity (95% CI) | PPV (95% CI) | NPV (95% CI) |
| ICD-9 | | | | | |
| Fisher | BCD in primary position | 96 (79 to 100) | 88 (70 to 98) | | |
| McBean | BCD in any position | 97 | 96 | | |
| Solin | BCD in primary position | | | 88 (85 to 91) | |
| Warren | BCD in primary position | 94 (93 to 95) | 97 | 83 (78 to 89) | |
| McClish | BCD in primary position | 83 (82 to 84) | 91 (90 to 93) | | |
| Solin | BCD (unclear position) | | | 83 (77 to 87) | |
| Leung | BCD in primary position | | | 84 (82 to 87) | |
| Warren | BCD in primary position | 57 (55 to 59) | 99 (99 to 99) | 91 (90 to 93) | 99 (98 to 100) |
| Cooper | BCD in primary position | 68 (68 to 69) | | | |
| Freeman | BCD in primary position | 68 (66 to 70) | | 74 (72 to 76) | |
| Wang | BCD (unclear position) | 89 (88 to 90) | | | |
| Koroukian | BCD in primary position | 69 (66 to 71) | | 15 (13 to 17) | |
| Ganry | BCD in primary position | 85 (80 to 90) | 100 (100 to 100) | 98 (94 to 99) | 100 (100 to 100) |
| Nattinger | BCD in primary position | 80 (79 to 81) | 100 (100 to 100) | 89 (87 to 92) | |
| Penberthy | BCD in primary position | 53 | 96 | | |
| Setoguchi | ≥1 BCD in primary position | 87 (86 to 89) | 100 (100 to 100) | 50 (49 to 52) | 100 (100 to 100) |
| Baldi | BCD in primary position+surgical procedures | 74 (71 to 77) | | 90 (87 to 92) | |
| Yuen | BCD in primary position+surgical procedures | 85 (84 to 86) | 99 (99 to 99) | 91 (90 to 91) | |
| ICD-10 | | | | | |
| Couris | BCD in primary position | 69 (66 to 72) | 99 (99 to 99) | 57 (54 to 60) | 100 (100 to 100) |
| Kemp | BCD in primary position | 86 (85 to 88) | 99 (99 to 99) | 86 (84 to 87) | 100 (100 to 100) |
| Sato | BCD in primary position | 99 (98 to 100) | 99 (93 to 99) | 66 (63 to 69) | |
BCD, breast cancer diagnosis; ICD, International Classification of Diseases; NPV, negative predictive value; PPV, positive predictive value.
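The four measures in the table above all derive from a 2×2 cross-classification of the administrative algorithm against the reference standard. The sketch below shows one way to compute them with 95% confidence intervals. The Wilson score interval is an assumption made here for illustration (the source does not state which interval method the studies used), and the counts in the example are hypothetical.

```python
from math import sqrt


def wilson_ci(successes: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a proportion (assumed method, see lead-in)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = successes / total
    denom = 1 + z * z / total
    centre = (p + z * z / (2 * total)) / denom
    half = z * sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return max(0.0, centre - half), min(1.0, centre + half)


def proportion_with_ci(successes: int, total: int) -> tuple[float, float, float]:
    """Return (estimate, lower bound, upper bound)."""
    lower, upper = wilson_ci(successes, total)
    return successes / total, lower, upper


def accuracy_measures(tp: int, fp: int, fn: int, tn: int) -> dict:
    """tp/fp/fn/tn: algorithm result cross-classified against the reference standard."""
    return {
        "sensitivity": proportion_with_ci(tp, tp + fn),  # true cases that were flagged
        "specificity": proportion_with_ci(tn, tn + fp),  # non-cases that were not flagged
        "ppv": proportion_with_ci(tp, tp + fp),          # flagged records that are true cases
        "npv": proportion_with_ci(tn, tn + fn),          # unflagged records that are non-cases
    }


if __name__ == "__main__":
    # Hypothetical counts, not taken from any included study.
    for name, (est, lo, hi) in accuracy_measures(80, 20, 20, 880).items():
        print(f"{name}: {100 * est:.0f} ({100 * lo:.0f} to {100 * hi:.0f})")
```

Studies that validate only the records flagged by the algorithm (for example, by chart review of flagged cases) observe only true and false positives, which is why several rows report a PPV and nothing else.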
Results of studies validating diagnoses of breast cancer (first row) and surgical procedures (subsequent rows)
| # | Author/year | Algorithms invasive breast cancer | Sensitivity | Specificity | PPV | NPV |
| 1 | Solin | Diagnosis incident cases | – | – | 88 (85 to 91) | – |
| 2 | Solin 1994 | Mastectomy | – | – | 95 (92 to 98) | – |
| 3 | Solin 1994 | Partial mastectomy with lymphadenectomy | – | – | 96 (91 to 100) | – |
| 4 | Solin 1994 | Excision and lymphadenectomy | – | – | 100 (100 to 100) | – |
| 1 | Solin | Initial algorithm: diagnosis incident cases | – | – | 83 (78 to 89) | – |
| 2 | Solin 1997 | Initial algorithm: mastectomy | – | – | 95 (90 to 100) | – |
| 3 | Solin 1997 | Initial algorithm: partial mastectomy with lymphadenectomy | – | – | 95 (88 to 100) | – |
| 4 | Solin 1997 | Initial algorithm: excision and lymphadenectomy | – | – | 92 (83 to 100) | – |
| 5 | Solin 1997 | Best algorithm: diagnosis incident cases | – | – | 84 (79 to 90) | – |
| 6 | Solin 1997 | Best algorithm: mastectomy | – | – | 82 (76 to 88) | – |
| 7 | Solin 1997 | Best algorithm: partial mastectomy with lymphadenectomy | – | – | 84 (78 to 89) | – |
| 8 | Solin 1997 | Best algorithm: excision and lymphadenectomy | – | – | 84 (78 to 89) | – |
| 1 | McClish | incident cases identified in MEDPAR | 83 (82 to 84) | – | – | – |
| 2 | McClish 1997 | incident cases identified in VCR | 82 (81 to 83) | – | – | – |
| 3 | McClish 1997 | aggregated (VCR+MEDPAR) | 97 (96 to 97) | – | – | – |
| 4 | McClish 1997 | MEDPAR definitive surgical therapy | 80 (79 to 81) | – | – | – |
| 5 | McClish 1997 | VCR definitive surgical therapy | 87 (86 to 88) | – | – | – |
| 1 | Leung | Initial algorithm: diagnosis | – | – | 84 (82 to 87) | – |
| 2 | Leung 1999 | Mastectomy | – | – | 92 (90 to 95) | – |
| 3 | Leung 1999 | Partial mastectomy with lymphadenectomy | – | – | 98 (96 to 100) | – |
| 4 | Leung 1999 | Excision, breast biopsy or partial mastectomy plus lymphadenectomy | – | – | 92 (87 to 98) | – |
| 1 | Cooper | First set of analysis (increase in SE including in order inpatient other diagnosis, surgical, part B, etc): inpatient, first position diagnostic codes | 68 (68 to 69) | – | – | – |
| 2 | Cooper 1999 | First set of analysis: inpatient, surgical | 79 (79 to 79) | – | – | – |
| 3 | Cooper 1999 | First set of analysis: part B, first position | 91 (91 to 91) | – | – | – |
| 4 | Cooper 1999 | First set of analysis: part B, surgical | 94 (93 to 94) | – | – | – |
| 5 | Cooper 1999 | Second set of analysis (increase in SE including in order part B other diagnosis, surgical, inpatient, etc): part B, first position | 66 (66 to 66) | – | – | – |
| 6 | Cooper 1999 | Second set of analysis: part B, other diagnostic codes | 77 (77 to 77) | – | – | – |
| 7 | Cooper 1999 | Second set of analysis: part B, surgical | 81 (81 to 81) | – | – | – |
| 8 | Cooper 1999 | Second set of analysis: inpatient, first position | 91 (91 to 91) | – | – | – |
| 9 | Cooper 1999 | Second set of analysis: inpatient, surgical | 94 (93 to 94) | – | – | – |
| 1 | Freeman | Primary diagnosis: hospital inpatient in Medicare Provider Analysis (MEDPAR) | 68 (66 to 70) | – | 74 (72 to 76) | – |
| 2 | Freeman 2000 | Mastectomy hospital inpatient | 53 (51 to 56) | – | 73 | – |
| 3 | Freeman 2000 | Partial mastectomy hospital inpatient | 7 (4 to 11) | – | 64 | – |
| 4 | Freeman 2000 | Excisional biopsy hospital inpatient | 8 (5 to 12) | – | 56 | – |
| 5 | Freeman 2000 | Incisional biopsy hospital inpatient | 8 (5 to 11) | – | 73 | – |
| 1 | Ganry | Hospitalisation with breast cancer as primary diagnosis: mastectomy | – | – | 100 (100 to 100) | – |
| 2 | Ganry 2003 | Hospitalisation with breast cancer as primary diagnosis: partial mastectomy with lymphadenectomy | – | – | 100 (100 to 100) | – |
| 3 | Ganry 2003 | Hospitalisation with breast cancer as primary diagnosis: biopsy/excision plus the diagnosis of carcinoma | – | – | 100 (100 to 100) | – |
| 1 | Kemp | Diagnosis of invasive breast cancer | 86 (85 to 88) | 100 (100 to 100) | 86 (84 to 87) | 100 (100 to 100) |
| 2 | Kemp 2013 | Lumpectomy | 61 (59 to 63) | 99 (99 to 99) | 52 (50 to 54) | 99 (99 to 99) |
| 3 | Kemp 2013 | Mastectomy | 33 (31 to 35) | 100 (100 to 100) | 71 (68 to 74) | 99 (99 to 99) |
| 4 | Kemp 2013 | Lumpectomy OR mastectomy | 84 (83 to 86) | 99 (99 to 99) | 56 (55 to 58) | 100 (100 to 100) |
MEDPAR, Medicare Annual Demographic Files, the Medicare Provider Analysis and Review; NPV, negative predictive value; PPV, positive predictive value; VCR, Virginia Cancer Registry.
Results of studies that combined surgical procedures followed by chemoradiation or radiation therapy with diagnosis of breast cancer
| N | Author/year | Algorithms invasive breast cancer | Sensitivity | Specificity | PPV |
| 1 | Solin | Diagnosis incident cases | – | – | 88 (85 to 91) |
| 2 | Solin 1994 | Excision followed by radiation therapy | – | – | 94 (88 to 99) |
| 3 | Solin 1994 | Excision followed by chemotherapy | – | – | 94 (88 to 100) |
| 1 | Solin | Initial algorithm: diagnosis incident cases | – | – | 83 (78 to 89) |
| 2 | Solin 1997 | Initial algorithm: excision followed by radiation therapy treatment | – | – | 97 (91 to 100) |
| 3 | Solin 1997 | Initial algorithm: excision followed by chemotherapy | – | – | 90 (77 to 100) |
| 4 | Solin 1997 | Best algorithm: diagnosis incident cases | – | – | 84 (79 to 90) |
| 5 | Solin 1997 | Best algorithm: excision followed by radiation therapy treatment | – | – | 84 (78 to 89) |
| 6 | Solin 1997 | Best algorithm: excision followed by chemotherapy | – | – | 84 (78 to 89) |
| 1 | Leung | Initial algorithm: diagnosis | – | – | 84 (82 to 87) |
| 2 | Leung 1999 | Excision, breast biopsy or partial mastectomy followed by radiation therapy | – | – | 96 (94 to 98) |
| 3 | Leung 1999 | Excision, breast biopsy or partial mastectomy followed by chemotherapy | – | – | 93 (90 to 97) |
| 1 | Koroukian | Incident breast cancer | – | – | 15 (13 to 17) |
| 2 | Koroukian 2003 | Breast cancer diagnosis, chemotherapy or radiation therapy | – | – | 34 (29 to 39) |
| 3 | Koroukian 2003 | Breast cancer diagnosis, lumpectomy, chemotherapy or radiation therapy | – | – | 85 (78 to 92) |
| 1 | Ganry | Hospitalisation with breast cancer (primary diagnosis): without any procedure | – | – | 91 (81 to 100) |
| 2 | Ganry 2003 | Hospitalisation with breast cancer (secondary diagnosis): chemotherapy as primary diagnosis | – | – | 98 (94 to 100) |
| 1 | Sato | Diagnosis of breast cancer | 99 (99 to 100) | 99 (93 to 100) | 66 (63 to 69) |
| 2 | Sato 2015 | Diagnosis of breast cancer+diagnosis code related to the breast cancer, marker test code, surgery, chemotherapy, medication or radiation procedure code | 97 (96 to 100) | 100 (100 to 100) | 83 (80 to 85) |
PPV, positive predictive value.
Range of sensitivities and PPVs stratified by administrative data source, type of ICD code and country of origin
| | Range of sensitivities | Range of PPVs |
| Administrative data source | | |
| Inpatient (primary position only) | 53%–99% (18 studies) | 15%–98% (19 studies) |
| Outpatient (outpatient diagnosis only) | 9% (1 study) | 19% (1 study) |
| Type of ICD | | |
| ICD-9 (initial algorithm) | 53%–97% (15 studies) | 15%–98% (16 studies) |
| ICD-10 (initial algorithm) | 69%–99% (3 studies) | 57%–86% (3 studies) |
| Country of origin | | |
| USA (initial algorithm) | 53%–97% (12 studies) | 15%–96% (13 studies) |
| Italy (initial algorithm) | 74%–85% (2 studies) | 90%–91% (2 studies) |
| France (initial algorithm) | 69%–85% (2 studies) | 57%–98% (2 studies) |
| Japan (initial algorithm) | 99% (1 study) | 66% (1 study) |
| Australia (initial algorithm) | 86% (1 study) | 86% (1 study) |
| Accuracy over time | | |
| Before 2001 (initial algorithm) | 57%–97% (7 studies) | 74%–96% (9 studies) |
| After 2000 (initial algorithm) | 53%–99% (11 studies) | 15%–98% (10 studies) |
ICD, International Classification of Diseases; PPV, positive predictive value.
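Each range in the table above is the minimum and maximum of the per-study point estimates within a stratum. As a small worked check, the sketch below reproduces the ICD-10 row from the three ICD-10 studies (Couris, Kemp and Sato) using the initial-algorithm sensitivities and PPVs reported earlier. The dictionary layout and the function name are illustrative assumptions.

```python
# Point estimates (%) for the initial algorithm in the three ICD-10 studies,
# copied from the accuracy table earlier in this document.
icd10_initial = {
    "Couris": {"sensitivity": 69, "ppv": 57},
    "Kemp": {"sensitivity": 86, "ppv": 86},
    "Sato": {"sensitivity": 99, "ppv": 66},
}


def value_range(studies: dict, measure: str) -> tuple[int, int, int]:
    """Return (minimum, maximum, number of studies reporting the measure)."""
    values = [row[measure] for row in studies.values() if measure in row]
    if not values:
        raise ValueError(f"no study reports {measure}")
    return min(values), max(values), len(values)


if __name__ == "__main__":
    for measure in ("sensitivity", "ppv"):
        low, high, n = value_range(icd10_initial, measure)
        print(f"{measure}: {low}%-{high}% ({n} studies)")
    # sensitivity: 69%-99% (3 studies)
    # ppv: 57%-86% (3 studies)
```

A range built this way reflects only the extreme studies in each stratum and ignores study size, so a single outlier can set either bound.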