| Literature DB >> 22822041 |
Justin A Strauss1, Chun R Chao, Marilyn L Kwan, Syed A Ahmed, Joanne E Schottinger, Virginia P Quinn.
Abstract
OBJECTIVE: Significant limitations exist in the timely and complete identification of primary and recurrent cancers for clinical and epidemiologic research. A SAS-based coding, extraction, and nomenclature tool (SCENT) was developed to address this problem.Entities:
Mesh:
Year: 2012 PMID: 22822041 PMCID: PMC3638182 DOI: 10.1136/amiajnl-2012-000928
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1Process diagram for a SAS-based coding, extraction, and nomenclature tool (SCENT).
Figure 2Sample pathology report text following preprocessing by a SAS-based coding, extraction, and nomenclature tool (SCENT).
Figure 3Sample pathology report text following preprocessing and code assignment by a SAS-based coding, extraction, and nomenclature tool (SCENT).
Figure 4Sample chart review form used by abstractors to classify the pathology reports of breast and prostate cancer patients.
Pathology report classifications of a SAS-based coding, extraction, and nomenclature tool (SCENT) and Cerner's CoPathPlus coding, as compared with abstractor review
| Benign | Cancer recurrence | Other primary cancer | Suspicious | κ Value | |||||
| % | N | % | N | % | N | % | N | ||
| Breast cancer (abstractor) | (436) | (32) | ( | ( | |||||
| SCENT | |||||||||
| Benign | 99.8 | 435 | – | – | – | – | 25.0 | 1 | 0.96 |
| Cancer recurrence | – | – | 100.0 | 32 | – | – | – | – | |
| Other primary cancer | 0.2 | 1 | – | – | 100.0 | 18 | 50.0 | 2 | |
| Suspicious | – | – | – | – | – | – | 25.0 | 1 | |
| CoPathPlus | |||||||||
| Benign | 97.2 | 424 | 12.5 | 4 | 22.2 | 4 | 50.0 | 2 | 0.72 |
| Cancer recurrence | 0.7 | 3 | 84.4 | 27 | – | – | – | – | |
| Other primary cancer | – | – | – | – | 55.6 | 10 | 25.0 | 1 | |
| Suspicious | 2.1 | 9 | 3.1 | 1 | 22.2 | 4 | 25.0 | 1 | |
| Prostate cancer (abstractor) | (356) | (29) | (36) | ( | |||||
| SCENT | |||||||||
| Benign | 99.4 | 354 | – | – | 5.6 | 2 | – | – | 0.95 |
| Cancer recurrence | – | – | 96.6 | 28 | 2.8 | 1 | – | – | |
| Other primary cancer | 0.6 | 2 | 3.4 | 1 | 91.7 | 33 | – | – | |
| Suspicious | – | – | – | – | – | – | 100.0 | 4 | |
| CoPathPlus | |||||||||
| Benign | 96.1 | 342 | 10.3 | 3 | 33.3 | 12 | 50.0 | 2 | 0.65 |
| Cancer recurrence | 0.3 | 1 | 72.4 | 21 | 5.6 | 2 | 25.0 | 1 | |
| Other primary cancer | 1.1 | 4 | 13.8 | 4 | 52.8 | 19 | – | – | |
| Suspicious | 2.5 | 9 | 3.4 | 1 | 8.3 | 3 | 25.0 | 1 | |
Includes the pathology reports of randomly sampled breast and prostate cancer patients from 6 months after diagnosis and subsequent to primary treatment. Contralateral breast cancers were considered to be recurrences.
Validation metrics for a SAS-based coding, extraction, and nomenclature tool (SCENT) and Cerner's CoPathPlus coding, as compared with abstractor review of pathology reports
| Sensitivity | Specificity | PPV | NPV | |
| Breast cancer | ||||
| SCENT | 1.00 (0.93 to 1.00) | 0.99 (0.98 to 1.00) | 0.94 (0.85 to 0.98) | 1.00 (0.99 to 1.00) |
| CoPathPlus | 0.74 (0.60 to 0.84) | 0.99 (0.98 to 1.00) | 0.90 (0.77 to 0.96) | 0.97 (0.95 to 0.98) |
| Prostate cancer | ||||
| SCENT | 0.97 (0.89 to 0.99) | 0.99 (0.98 to 1.00) | 0.97 (0.89 to 0.99) | 0.99 (0.98 to 1.00) |
| CoPathPlus | 0.71 (0.59 to 0.80) | 0.98 (0.96 to 0.99) | 0.88 (0.77 to 0.95) | 0.95 (0.92 to 0.97) |
Includes the pathology reports of randomly sampled breast and prostate cancer patients from 6 months after diagnosis and subsequent to primary treatment.
Shown with Wilson's 95% CI.
NPV, negative predictive value; PPV, positive predictive value.