| Literature DB >> 20145734 |
Grace L Smith1, Ya-Chen T Shih, Sharon H Giordano, Benjamin D Smith, Thomas A Buchholz.
Abstract
BACKGROUND: In epidemiologic studies, cancer stage is an important predictor of outcomes. However, cancer stage is typically unavailable in medical insurance claims datasets, thus limiting the usefulness of such data for epidemiologic studies. Therefore, we sought to develop an algorithm to predict cancer stage based on covariates available from claims-based data.Entities:
Year: 2010 PMID: 20145734 PMCID: PMC2818641 DOI: 10.1186/1742-5573-7-1
Source DB: PubMed Journal: Epidemiol Perspect Innov ISSN: 1742-5573
Study Sample Patient Characteristics, N = 77,306
| Predictor Variable | % of All Patients |
|---|---|
| Age, mean (SD) | 76 (7) |
| White race | 94 |
| Stage I | 51 |
| Stage II | 26 |
| Stage III | 10 |
| Stage IV | 4 |
| Stages I-III but T or N unknown | 9 |
| Axillary LN involvement | 19 |
| Metastatic disease | 14 |
| No. visits to surgeon, mean (SD) | 4 (3) |
| No. visits to medical oncologist, mean (SD) | 4 (9) |
| No. visits to radiation oncologist, mean (SD) | 3 (5) |
| Imaging (CT, MRI, PET, or bone scan) | 25 |
| Radiation therapy | 44 |
| Breast conserving surgery | 45 |
| Mastectomy | 49 |
| Axillary LN dissection | 72 |
| Chemotherapy (any agent) | 18 |
| Doxorubicin | 7 |
| Paclitaxel | 3 |
| No. physician visits, mean (SD) | 14 (12) |
| Screening mammography | 78 |
| Influenza vaccine | 34 |
| No. hospital admission for any cause, mean (SD) | 1 (1) |
| Charlson comorbidity score | |
| 0 comorbid conditions | 69 |
| 1 comorbid condition | 18 |
| 2 or more comorbid conditions | 8 |
| Unknown | 5 |
Abbreviations: CT computed tomography; LN lymph nodes; MRI magnetic resonance imaging PET positron emission tomography; SD standard deviation
a As indicated by Medicare claims codes; thus percentage of patients with code for metastatic disease not equal to patients with Stage IV disease.
Figure 1Distribution of calculated probabilities (%) for patients with predicted stages I-III and stage IV disease (a). Distribution of calculated probabilities for patients with predicted stages I-II and stage III disease (b). A calculated probability of 5% corresponds to a cutpoint of 0.05. The histograms are normalized to 100%.
Figure 2Receiver Operating Curve (ROC) for equation to predict stage IV disease.
Figure 3Receiver Operating Curve (ROC) for equation to predict stage I-III disease.