| Literature DB >> 28852119 |
Evani Radiya-Dixit1, David Zhu2, Andrew H Beck3.
Abstract
Misclassification of breast lesions can result in either cancer progression or unnecessary chemotherapy. Automated classification tools are seen as promising second opinion providers in reducing such errors. We have developed predictive algorithms that automate the categorization of breast lesions as either benign usual ductal hyperplasia (UDH) or malignant ductal carcinoma in situ (DCIS). From diagnosed breast biopsy images from two hospitals, we obtained 392 biomarkers using Dong et al.'s (2014) computational tools for nuclei identification and feature extraction. We implemented six machine learning models and enhanced them by reducing prediction variance, extracting active features, and combining multiple algorithms. We used the area under the curve (AUC) of the receiver operating characteristic (ROC) curve for performance evaluation. Our top-performing model, a Combined model with Active Feature Extraction (CAFE) consisting of two logistic regression algorithms, obtained an AUC of 0.918 when trained on data from one hospital and tested on samples of the other, a statistically significant improvement over Dong et al.'s AUC of 0.858. Pathologists can substantially improve their diagnoses by using it as an unbiased validator. In the future, our work can also serve as a valuable methodology for differentiating between low-grade and high-grade DCIS.Entities:
Mesh:
Year: 2017 PMID: 28852119 PMCID: PMC5575012 DOI: 10.1038/s41598-017-10324-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1(a) An underfitting model. (b) An ideal model that identifies the underlying relationship of the data. (c) An overfitting model.
The performances of the six machine learning models with all features (column 2) and with the active features (column 3) in terms of the AUC.
| Algorithm | C-score for classification with all features | C-score for classification with active features | V-score for classification with active features |
|---|---|---|---|
| L1-regularized LR | 0.931 | 0.921 | 0.897 |
| LR w/early stopping | 0.904 | 0.923 | 0.884 |
| Random forest | 0.854 | 0.878 | 0.666 |
| Convolutional neural network | 0.779 | 0.850 | 0.650 |
| Conditional inference forest | 0.801 | 0.822 | Did not run |
| Multi-layer perceptron | 0.695 | 0.489 | Did not run |
The V-scores for classification with the active features (column 4) indicate each model’s generalizability. We used 1000 seeds to account for the random number variance.
The performance of Dong et al.’s model, our two LR algorithms with active features, and our CAFE model.
| Algorithm | C-score | V-score |
|---|---|---|
| L1-regularized LR from Dong | 0.931 | 0.858 |
| L1-regularized LR with active features | 0.921 (SD of 0.0064) | 0.897 |
| LR with early stopping and active features | 0.923 (SD of 0.0020) | 0.884 |
| CAFE model | 0.921 | 0.918 |
Figure 2The receiver operating characteristic (ROC) curve of our CAFE model of the combined and optimized L1-regularized and early stopping logistic regression algorithms is graphed in blue. This model achieved a V-score AUC of 0.918. The ROC curve of Dong et al.’s model of the L1-regularized logistic regression algorithm is in red. Their model achieved a V-score AUC of 0.858.
The performance of Dong et al.’s model as well as the L1-regularized LR, random forest, and convolutional neural network models with active features after switching the datasets for training and testing.
| Algorithm | V-score for Switched Datasets |
|---|---|
| L1-regularized LR from Dong | 0.757 |
| L1-regularized LR with active features | 0.827 |
| Random forest with active features | 0.662 |
| Convolutional neural network with active features | 0.631 |
A summary of the various machine learning models that were applied to automate the lesion classification.
| Model | Type | Implementation | Layers |
|---|---|---|---|
| L1-regularized LR | Regression | R (glmnet) | 2 (input, output) |
| LR with early stopping | Regression | Python (Theano) | 2 (input, output) |
| MLP | Deep learning | R (neuralnet) | 3 (input, hidden layer, output) |
| CNN | Deep learning | Python (Theano) | 5 (filtering, pooling, MLP) |
| Random forests | Tree-based learning | R (randomForest) | Not applicable |
| Conditional inference forests | Tree-based learning | R (party) | Not applicable |
Figure 3An example decision tree with two features.