| Literature DB >> 32166073 |
Hanie Azary1, Monireh Abdoos2.
Abstract
BACKGROUND: Breast cancer is one of the most common cancers in women. Mammogram images have an important role in the treatment of various states of this cancer. In recent years, machine learning methods have been widely used for tumor segmentation in mammogram images. Pixel-based segmentation methods have been presented using both supervised and unsupervised learning approaches. Supervised learning methods are usually fast and accurate, but they usually use a large number of labeled data. Besides, providing these samples is very hard and usually expensive. Unsupervised learning methods do not require the labels of the training data for decision making and they completely ignore the prior knowledge that may lead to a low performance. Semi-supervised learning methods which use a small number of labeled data solve the problem of providing the high number of samples in supervised methods, while they usually result in a higher accuracy in comparison to the unsupervised methods.Entities:
Keywords: Bayes classifier; co-training algorithm; mammogram images; support vector machine classifier; tumor segmentation
Year: 2020 PMID: 32166073 PMCID: PMC7038743 DOI: 10.4103/jmss.JMSS_62_18
Source DB: PubMed Journal: J Med Signals Sens ISSN: 2228-7477
Figure 1Direction of run-length matrix
The list of features used for each pixel
| Statistic features |
| 1. Mean |
| 2. Variance |
| 3. Absolute deviation |
| 4. Standard deviation |
| Run-length matrix |
| 1. SRE |
| 2. LRE |
| 3. GLN |
| 4. RLN |
| 5. RP |
| 6. LGRE |
| 7. HGRE |
| 8. SRLGE |
| 9. SRHGE |
| 10. LRLGE |
| 11. LRHGE |
SRE – Short-run emphasis; LRE – Long-run emphasis; GLN – Gray-level nonuniformity; RLN – Run-length nonuniformity; RP – Run percentage; LGRE – Low gray-level-run emphasis; HGRE – High gray-level-run emphasis; SRLGE – Short-run low gray-level emphasis; SRHGE – Short-run high gray-level emphasis; LRLGE – Long-run low gray-level emphasis; LRHGE – Long-run high gray-level emphasis
Figure 2An example for Fisher discriminant analysis: (a) The data before transformation and (b) the same data after transformation
Figure 3The co-training algorithm
Figure 4Tumor segmentation procedure, according to the co-training algorithm
Figure 5(a) An example of region of interest for a test image, (b) the output of Bayes method, (c) the output of support vector machine method, and (d) the output of co-training algorithm
Figure 6Receiver operating characteristic curve of compare performance supervised learning and semi-supervised learning method
The comparison between the proposed co-training algorithm and watershed and also region growing segmentation test images
| Accuracy (%) | Co-training method | Watershed segmentations | Region growing |
|---|---|---|---|
| Test image 1 | 91.68 | 90.05 | 89.80 |
| Test image 2 | 92.67 | 91.35 | 91.78 |
| Test image 3 | 89.52 | 82.12 | 79.22 |
| Test image 4 | 91.00 | 92.28 | 90.58 |
| Test image 5 | 76.19 | 79.01 | 79.81 |
Comparison of performance of the supervised learning and the semi-supervised learning methods
| Learning approaches | Supervised learning method | Semi-supervised learning method (co-training algorithm) | ||||
|---|---|---|---|---|---|---|
| Test evaluation using different training samples | ||||||
| 200 | 0 | 27,054 | 200 | 450 | 27,054 | |
| Learning algorithm | SVM | Bayes | Co-training | |||
| Accuracy (%) | 43.17 | 87.52 | 94.04 | |||
SVM – Support vector machine
The performance of supervised and semi-supervised methods according to mean and standard deviation
| SVM | Bayes | Co-training | ||||
|---|---|---|---|---|---|---|
| Mean (%) | SD | Mean (%) | SD | Mean (%) | SD | |
| Accuracy | 79.56 | 7.75 | 78.67 | 7.80 | 80.54 | 7.77 |
| PPV | 83.73 | 20.53 | 89.11 | 16.69 | 87.19 | 17.67 |
| NPV | 65.34 | 29.67 | 55.89 | 30.47 | 63.37 | 29.71 |
| Sensitivity | 83.00 | 9.87 | 78.85 | 10.05 | 82.14 | 9.98 |
| Specificity | 80.10 | 11.99 | 84.35 | 9.80 | 83.67 | 9.89 |
| A-z | 0.76 | 0.07 | 0.77 | 0.06 | 0.78 | 0.07 |
SVM – Support vector machine; SD – Standard deviation; PPV – Positive predictive value; NPV – Negative predictive value