| Literature DB >> 33381748 |
Anobel Y Odisho1, Briton Park2, Nicholas Altieri2, John DeNero3, Matthew R Cooperberg1,4, Peter R Carroll1, Bin Yu2,3,5.
Abstract
OBJECTIVE: Cancer is a leading cause of death, but much of the diagnostic information is stored as unstructured data in pathology reports. We aim to improve uncertainty estimates of machine learning-based pathology parsers and evaluate performance in low data settings.Entities:
Keywords: cancer; information extraction; machine learning; natural language processing; pathology; prostate cancer
Year: 2020 PMID: 33381748 PMCID: PMC7751177 DOI: 10.1093/jamiaopen/ooaa029
Source DB: PubMed Journal: JAMIA Open ISSN: 2574-2531
Data elements extracted from pathology reports
| Data elements | Description |
|---|---|
| Document classifier algorithm fields | |
| Gleason GradePrimary, secondary, tertiary | Histologic grading of tumor aggressiveness based on the Gleason grading system. Each specimen is assigned a primary, secondary, and occasionally a tertiary score, each of which are whole numbers from 1 to 5 |
| Tumor histologic type | Primary histologic type, such as acinar adenocarcinoma, ductal adenocarcinoma, and small cell neuro-endocrine carcinoma |
| Cribriform pattern | Whether the cells exhibit a cribriform growth pattern (Gleason 4 only) |
| Treatment effect | Indicator whether there is evidence of a prior treatment, such as hormone treatment or radiation therapy |
| Margin status for tumor | To evaluate surgical margins, the entire prostate surface is inked after removal. The surgical margins are designated as “negative” if the tumor is not present at the inked margin and “positive” if tumor is present at the inked margin |
| Margin status for benign glands | The benign margins are designated as “positive” if there are benign prostate glands present at the inked margin and “negative” otherwise |
| Perineural invasion | Whether cancer cells were seen surrounding or tracking along a nerve fiber within the prostate |
| Seminal vesicle invasion | Invasion of tumor into the seminal vesicle |
| Extraprostatic extension | Presence of tumor beyond the prostatic capsule |
| Lymph node status | Whether tumor was identified in lymph nodes |
| Token extractor algorithm fields | |
| Pathologic stage classification T (primary tumor) N (regional lymph nodes) M (distant metastasis) | Based on American Joint Committee on Cancer TNM staging system for prostate cancer. Based on the edition used in each report (5th–8th edition) |
| Tumor volume | Amount of tumor identified in prostate specimen (cubic centimeters) |
| Prostate weight | Overall weight of the prostate (g) |
Weighted F1 scores for classification fields and mean accuracy for token extractor fields on full training data sample (n = 2066)
| Data elements | Logistic regression | AdaBoost classifier | Random forest | SVM | CNN | LSTM | Majority class accuracy |
|---|---|---|---|---|---|---|---|
| Gleason grade—primary | 0.978 | 0.971 | 0.941 | 0.932 | 0.981 | 0.628 | 0.709 |
| Gleason grade—secondary | 0.958 | 0.943 | 0.913 | 0.912 | 0.968 | 0.576 | 0.467 |
| Gleason grade—tertiary | 0.923 | 0.930 | 0.844 | 0.886 | 0.930 | 0.741 | 0.901 |
| Tumor histology | 0.989 | 0.995 | 0.995 | 0.993 | 0.995 | 0.994 | 0.991 |
| Cribriform pattern | 0.963 | 0.981 | 0.963 | 0.968 | 0.987 | 0.966 | 0.997 |
| Treatment effect | 0.981 | 0.979 | 0.981 | 0.981 | 0.981 | 0.973 | 0.985 |
| Tumor margin status | 0.941 | 0.953 | 0.888 | 0.918 | 0.950 | 0.630 | 0.799 |
| Benign margin status | 0.977 | 0.975 | 0.972 | 0.981 | 0.978 | 0.967 | 0.997 |
| Perineural invasion | 0.944 | 0.978 | 0.938 | 0.929 | 0.972 | 0.613 | 0.771 |
| Seminal vesicle invasion | 0.943 | 0.974 | 0.940 | 0.965 | 0.976 | 0.784 | 0.904 |
| Extraprostatic extension | 0.954 | 0.953 | 0.882 | 0.939 | 0.961 | 0.778 | 0.712 |
| Lymph node status | 0.983 | 0.952 | 0.983 | 0.973 | 0.986 | 0.824 | 0.570 |
| Mean weighted F1 across classification models | 0.961 | 0.965 | 0.937 | 0.948 | 0.972 | 0.790 | 0.817 |
| T stage | 0.951 | 0.954 | 0.948 | – | – | – | – |
| N stage | 0.954 | 0.954 | 0.948 | – | – | – | – |
| M stage | 0.972 | 0.969 | 0.969 | – | – | – | – |
| Estimate tumor volume | 0.605 | 0.765 | 0.873 | – | – | – | – |
| Prostate weight | 0.846 | 0.855 | 0.914 | – | – | – | – |
| Mean accuracy for token extractor models | 0.866 | 0.899 | 0.930 | – | – | – | – |
CNN, convolutional neural network; LSTM, long short-term memory neural network; SVM, support vector machine.
Mean weighted F1 score ± standard deviation for classification models for classification models and mean accuracy ± standard deviation for token extractor models on increasing numbers of reports (n) after 5 trials
| Model |
|
|
|
|
|
|---|---|---|---|---|---|
| Classification models (mean weighted F1 score across all classification fields ± SD) | |||||
| Logistic | 0.781 ± 0.175 | 0.846 ± 0.117 | 0.875 ± 0.090 | 0.911 ± 0.059 | 0.934 ± 0.041 |
| AdaBoost | 0.829 ± 0.140 | 0.878 ± 0.100 | 0.907 ± 0.066 | 0.928 ± 0.049 | 0.945 ± 0.034 |
| Random forest | 0.795 ± 0.169 | 0.835 ± 0.128 | 0.867 ± 0.101 | 0.882 ± 0.088 | 0.901 ± 0.070 |
| SVM | 0.738 ± 0.214 | 0.763 ± 0.209 | 0.786 ± 0.194 | 0.842 ± 0.112 | 0.860 ± 0.140 |
| CNN | 0.720 ± 0.225 | 0.790 ± 0.163 | 0.851 ± 0.122 | 0.893 ± 0.086 | 0.935 ± 0.055 |
| LSTM | 0.688 ± 0.205 | 0.729 ± 0.187 | 0.743 ± 0.203 | 0.739 ± 0.214 | 0.739 ± 0.212 |
| Token extractor models (mean accuracy across all token extractor fields ± SD) | |||||
| Logistic | 0.844 ± 0.085 | 0.897 ± 0.079 | 0.892 ± 0.096 | 0.902 ± 0.087 | 0.896 ± 0.092 |
| Adaptive boost | 0.877 ± 0.097 | 0.892 ± 0.080 | 0.890 ± 0.084 | 0.896 ± 0.082 | 0.890 ± 0.092 |
| Random forest | 0.897 ± 0.180 | 0.898 ± 0.064 | 0.915 ± 0.054 | 0.920 ± 0.041 | 0.924 ± 0.038 |
CNN, convolutional neural network; LSTM, long short-term memory neural network; SVM, support vector machine.
Upper: classifier accuracy and expected calibration error for boosting before and after isotonic calibration and Lower: expected calibration error for random forest model before and after isotonic calibration
| Data elements | Weighted-F1 | ECE | Isotonic weighted-F1 | Isotonic ECE |
|---|---|---|---|---|
| Classification calibration | ||||
| Gleason grade—primary | 0.95 | 0.03 | 0.93 | 0.03 |
| Gleason grade—secondary | 0.94 | 0.08 | 0.92 | 0.14 |
| Gleason grade—tertiary | 0.91 | 0.05 | 0.91 | 0.03 |
| Tumor histology | 0.99 | 0.009 | 0.99 | 0.007 |
| Cribriform pattern | 0.995 | 0.007 | 0.995 | 0.017 |
| Treatment effect | 0.99 | 0.007 | 0.99 | 0.003 |
| Tumor margin status | 0.96 | 0.15 | 0.94 | 0.013 |
| Benign margin status | 0.994 | 0.007 | 0.995 | 0.019 |
| Perineural invasion | 0.95 | 0.26 | 0.96 | 0.02 |
| Seminal vesicle invasion | 0.987 | 0.16 | 0.97 | 0.02 |
| Extraprostatic extension | 0.96 | 0.12 | 0.96 | 0.01 |
| Lymph node status | 0.96 | 0.04 | 0.98 | 0.01 |
|
| ||||
| Data elements | ECE | Isotonic ECE | ||
|
| ||||
| Extractor calibration | ||||
| T stage | 0.155 | 0.016 | ||
| N stage | 0.144 | 0.013 | ||
| M stage | 0.007 | 0.005 | ||
| Estimated volume of tumor | 0.221 | 0.021 | ||
| Prostate weight | 0.278 | 0.033 | ||