| Literature DB >> 33377908 |
Megan Hoopes1, Robert Voss1, Heather Angier2, Miguel Marino2, Teresa Schmidt1, Jennifer E DeVoe2, Jeffrey Soule3, Nathalie Huguet2.
Abstract
BACKGROUND: Many cancer survivors receive primary care in community health centers (CHCs). Cancer history is an important factor to consider in the provision of primary care, yet little is known about the completeness or accuracy of cancer history data contained in CHC electronic health records (EHRs).Entities:
Mesh:
Year: 2021 PMID: 33377908 PMCID: PMC8246795 DOI: 10.1093/jnci/djaa210
Source DB: PubMed Journal: J Natl Cancer Inst ISSN: 0027-8874 Impact factor: 13.506
Figure 1.Diagram of electronic health record (EHR) linkage to California, Oregon, and Washington state cancer registries and resulting subgroups for validation analyses.
Cross-tabulation of patient-level cancer ascertainment by EHR and cancer registries, overall and by state
| Cancer history in EHR data | Cancer in registry | Total | |
|---|---|---|---|
| Yes | No | ||
| 3 states combined, No. (%) | |||
| Yes | 28 361 (1.9) | 22 730 (1.5) | 51 091 (3.4) |
| No | 23 616 (1.6) | 1 437 605 (95.1) | 1 461 221 (96.6) |
| Total | 51 977 (3.4) | 1 460 335 (96.6) | 1 512 312 (100) |
| California, No. (%) | |||
| Yes | 11 012 (1.4) | 9972 (1.3) | 20 984 (2.7) |
| No | 11 749 (1.5) | 737 154 (95.7) | 748 903 (97.3) |
| Total | 22 761 (3.0) | 747 126 (97.0) | 769 887 (100) |
| Oregon, No. (%) | |||
| Yes | 14 499 (2.6) | 11 107 (2.0) | 25 606 (4.6) |
| No | 8894 (1.6) | 520 162 (93.8) | 529 056 (95.4) |
| Total | 23 393 (4.2) | 531 269 (95.8) | 554 662 (100) |
| Washington, No. (%) | |||
| Yes | 2850 (1.4) | 2114 (1.1) | 4964 (2.5) |
| No | 2973 (1.5) | 190 993 (96.0) | 193 966 (97.5) |
| Total | 5823 (2.9) | 193 107 (97.1) | 198 930 (100) |
Counts are distinct patients with any cancer. Some cancers were identified by multiple registries, so the single-state EHR cancer and overall patient count values do not sum to the EHR total. EHR = electronic health record.
Agreement of EHR and registry cancer ascertainment: full study, by state, EHR source, and CHC use
| Sample | Kappa | Strength of agreement | Sensitivity | Specificity | Positive predictive value |
|---|---|---|---|---|---|
| Three states combined | 0.535 | Moderate | 0.546 | 0.984 | 0.555 |
| California | 0.489 | Moderate | 0.484 | 0.987 | 0.525 |
| Oregon | 0.573 | Moderate | 0.620 | 0.979 | 0.566 |
| Washington | 0.515 | Moderate | 0.489 | 0.989 | 0.574 |
| EHR | 0.524 | Moderate | 0.508 | 0.987 | 0.576 |
| EHR | 0.063 | Slight | 0.038 | 0.998 | 0.376 |
| PCP assigned | 0.5643 | Moderate | 0.611 | 0.981 | 0.554 |
| PCP not assigned | 0.3735 | Fair | 0.292 | 0.994 | 0.566 |
| Years established | 0.6077 | Substantial | 0.666 | 0.977 | 0.594 |
| Years established | 0.3513 | Fair | 0.343 | 0.991 | 0.381 |
| Overall visit rate | 0.5668 | Moderate | 0.738 | 0.972 | 0.484 |
| Overall visit rate | 0.5181 | Moderate | 0.449 | 0.990 | 0.660 |
Cancer registry is gold standard. CHC = community health center; EHR = electronic health record; PCP = primary care provider.
Refers to section(s) of EHR in which patient’s cancer history was documented.
Time between patient’s earliest and most recent encounter.
Based on ambulatory visits in 2016-2018.
Figure 2.Percentage of leading cancers identified by source of ascertainment. Total number of cases ascertained from either source presented in parentheses. Width of bars is proportional to this combined case count. EHR = electronic health record.
Site-specific agreement of electronic health record and registry cancer cases
| Cancer site | Cases coascertained, No. | Kappa | Strength of agreement | Sensitivity | Specificity | Positive predictive value |
|---|---|---|---|---|---|---|
| Prostate (male) | 4890 | 0.658 | Substantial | 0.681 | 0.9958 | 0.644 |
| Bladder | 1538 | 0.651 | Substantial | 0.646 | 0.9995 | 0.658 |
| Breast (female) | 6139 | 0.606 | Substantial | 0.586 | 0.9959 | 0.637 |
| Kidney and renal pelvis | 1125 | 0.598 | Moderate | 0.605 | 0.9995 | 0.592 |
| Testis (male) | 451 | 0.560 | Moderate | 0.626 | 0.9993 | 0.508 |
| Lung and bronchus | 1635 | 0.538 | Moderate | 0.517 | 0.9992 | 0.563 |
| Larynx | 163 | 0.521 | Moderate | 0.463 | 0.9999 | 0.595 |
| Esophagus | 185 | 0.518 | Moderate | 0.573 | 0.9999 | 0.473 |
| Colon and rectum | 2135 | 0.512 | Moderate | 0.511 | 0.9987 | 0.517 |
| Thyroid | 998 | 0.503 | Moderate | 0.494 | 0.9994 | 0.514 |
| Pancreas | 273 | 0.460 | Moderate | 0.437 | 0.9998 | 0.487 |
| Endometrium (female) | 668 | 0.437 | Moderate | 0.340 | 0.9995 | 0.615 |
| Hodgkin lymphoma | 310 | 0.432 | Moderate | 0.585 | 0.9996 | 0.343 |
| Ovary (female) | 450 | 0.422 | Moderate | 0.510 | 0.9991 | 0.361 |
| Liver and intrahepatic bile duct | 524 | 0.412 | Moderate | 0.537 | 0.9993 | 0.335 |
| Myeloma | 151 | 0.396 | Fair | 0.549 | 0.9998 | 0.310 |
| Non-Hodgkin lymphoma | 763 | 0.393 | Fair | 0.503 | 0.9989 | 0.323 |
| Melanoma of the skin | 1443 | 0.381 | Fair | 0.339 | 0.9988 | 0.439 |
| Stomach | 157 | 0.375 | Fair | 0.318 | 0.9999 | 0.458 |
| Leukemia | 381 | 0.371 | Fair | 0.496 | 0.9994 | 0.297 |
| Oral cavity and pharynx | 457 | 0.353 | Fair | 0.331 | 0.9995 | 0.380 |
| Soft tissue including heart | 145 | 0.353 | Fair | 0.395 | 0.9998 | 0.320 |
| Vulva (female) | 125 | 0.316 | Fair | 0.213 | 0.9999 | 0.610 |
| Anus, anal canal, and anorectum | 184 | 0.304 | Fair | 0.202 | 0.9999 | 0.617 |
| Cervix uteri (female) | 523 | 0.237 | Fair | 0.339 | 0.9973 | 0.184 |
| Brain and other nervous system | 328 | 0.218 | Fair | 0.159 | 0.9996 | 0.350 |
| Other | 746 | 0.107 | Slight | 0.162 | 0.9946 | 0.084 |
Cancer registry is gold standard.
Figure 3.Distribution of year of diagnosis and age at diagnosis among matched cases, electronic health record (EHR) vs cancer registry. Comparisons include matched patient*sites (same cancer site and patient in both EHR and registry data), n = 27 026. We imputed mid-year for dates where only the year was known and mid-month if day was unknown.
Prevalence and characteristics of patients with cancer history according to EHR and cancer registry, stratified by data source
| Patient characteristics | Patients with a history of cancer | ASMD | |
|---|---|---|---|
| Documented in EHR, % (n = 51 091) | Documented in cancer registry, % (n = 51 977) | ||
| Age on January 1, 2019, y | 0.074 | ||
| 18-29 | 1.8 | 1.8 | |
| 30-39 | 4.9 | 4.3 | |
| 40-49 | 8.6 | 7.2 | |
| 50-59 | 18.2 | 17.3 | |
| 60-69 | 29.7 | 29.3 | |
| 70-79 | 21.5 | 22.6 | |
| ≥80 | 15.4 | 17.4 | |
| Sex | 0.020 | ||
| Female | 56.0 | 55.5 | |
| Male | 44.0 | 44.5 | |
| Race or ethnicity | 0.119 | ||
| Hispanic | 13.4 | 12.2 | |
| Non-Hispanic White | 72.9 | 71.2 | |
| Non-Hispanic Black | 3.4 | 4.6 | |
| Non-Hispanic Asian | 2.8 | 4.2 | |
| Other and unknown | 7.6 | 7.9 | |
| Preferred language | 0.131 | ||
| English | 84.7 | 84.2 | |
| Spanish | 9.1 | 7.3 | |
| Other | 6.2 | 8.6 | |
| Charlson Comorbidity Index score (excluding cancer component) | 0.229 | ||
| 0 | 33.7 | 44.6 | |
| 1 | 18.0 | 15.3 | |
| 2-3 | 22.0 | 18.6 | |
| 4-6 | 17.8 | 14.6 | |
| ≥7 | 8.5 | 6.9 | |
| Federal poverty level | 0.085 | ||
| ≤138% | 41.5 | 39.3 | |
| ≥139% | 16.1 | 14.0 | |
| Missing or unknown | 42.5 | 46.8 | |
| Primary payer type | 0.100 | ||
| Medicaid | 25.1 | 23.5 | |
| Medicare | 39.5 | 37.2 | |
| Private | 14.8 | 14.8 | |
| Uninsured | 19.4 | 23.8 | |
| Other or missing | 1.3 | 0.7 | |
| Primary care provider assigned | 0.271 | ||
| Yes | 89.2 | 79.4 | |
| No | 10.8 | 20.6 | |
| Average annual encounter rate, 2016-2018 | 0.352 | ||
| No visits in 2016-2018 | 38.9 | 56.1 | |
| <1/y | 12.3 | 11.9 | |
| 1-2/y | 15.9 | 11.5 | |
| 2-5/y | 18.8 | 12.0 | |
| 5-10/y | 10.0 | 5.9 | |
| >10/y | 4.1 | 2.6 | |
| 3-Year encounter rate, 2016-2018, mean (SD) | 3.9 (7.2) | 3.5 (6.8) | 0.065 |
All characteristics obtained from EHR data. Time-varying characteristics assigned as of latest encounter date, unless otherwise specified. ASMD = absolute standardized mean difference; EHR = electronic health record.
ASMD greater than 0.1 indicates meaningful distributional difference.