| Literature DB >> 32462017 |
Said Boumaraf1, Xiabi Liu1, Chokri Ferkous2, Xiaohong Ma3.
Abstract
Mammography remains the most prevalent imaging tool for early breast cancer screening. The language used to describe abnormalities in mammographic reports is based on the Breast Imaging Reporting and Data System (BI-RADS). Assigning a correct BI-RADS category to each examined mammogram is a strenuous and challenging task for even experts. This paper proposes a new and effective computer-aided diagnosis (CAD) system to classify mammographic masses into four assessment categories in BI-RADS. The mass regions are first enhanced by means of histogram equalization and then semiautomatically segmented based on the region growing technique. A total of 130 handcrafted BI-RADS features are then extracted from the shape, margin, and density of each mass, together with the mass size and the patient's age, as mentioned in BI-RADS mammography. Then, a modified feature selection method based on the genetic algorithm (GA) is proposed to select the most clinically significant BI-RADS features. Finally, a back-propagation neural network (BPN) is employed for classification, and its accuracy is used as the fitness in GA. A set of 500 mammogram images from the digital database for screening mammography (DDSM) is used for evaluation. Our system achieves classification accuracy, positive predictive value, negative predictive value, and Matthews correlation coefficient of 84.5%, 84.4%, 94.8%, and 79.3%, respectively. To our best knowledge, this is the best current result for BI-RADS classification of breast masses in mammography, which makes the proposed system promising to support radiologists for deciding proper patient management based on the automatically assigned BI-RADS categories.Entities:
Mesh:
Year: 2020 PMID: 32462017 PMCID: PMC7238352 DOI: 10.1155/2020/7695207
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Mammogram samples of the four BI-RADS categories taken from the DDSM database: (a) B-2 (A_2001_1.RIGHT_MLO), (b) B-3 (B_3099_1.LEFT_CC), (c) B-4 (B_3390_1.LEFT_CC), and (d) B-5 (C_0176_1.LEFT_MLO). The extracted regions of interest (ROIs) are shown in the upper middle of each image.
Figure 2Block diagram of the proposed CAD system.
Figure 3Preprocessing stage: (a) full mammogram image, (b) ROI obtained by cropping (a), (c) final obtained ROI after applying HE on (b), (d) original histogram, and (e) enhanced histogram.
Figure 4Final segmented ROIs after applying HE and the proposed semiautomatic segmentation method: (a) B-2 sample, (b) B-3 sample, (c) B-4 sample, and (d) B-5 sample.
Figure 5Mass lesion descriptors according to BI-RADS mammography [38].
Summary of all quantified BI-RADS features used in this study.
| BI-RADS category | Descriptors | Feature number | Feature name |
|---|---|---|---|
| Shape | Round | 1-9 (9 features) | Continuity |
|
| |||
| Margin∗ | Circumscribed | 12-32 (21 features) | Kurtosis |
|
| |||
| Density∗ | High | 33-130 (98 features) | Angular second moment |
|
| |||
| Additional features | — | 10-11 (2 features) | Mass size |
Figure 6General architecture of the modified GA-based feature selection method.
Figure 7Example of the performed crossover operator.
Confusion matrix of the best feature subset for classification.
| BI-RADS categories | Predicted | |||
|---|---|---|---|---|
| Actual | B-2 | B-3 | B-4 | B-5 |
| B-2 |
| 2 | 1 | 0 |
| B-3 | 3 |
| 4 | 2 |
| B-4 | 2 | 5 |
| 7 |
| B-5 | 0 | 3 | 2 |
|
Figure 8Evaluation metrics computed from the best classification result.
Comparison with existing state-of-the art counterparts.
| Method | Dataset | A_Sn (%) | A_Sp (%) | PPV (%) | NPV (%) | MCC (%) | Accuracy (%) | Year |
|---|---|---|---|---|---|---|---|---|
| [ | 46 DDSM | 76 | 84 | — | — | — | 76.67 | 2015 |
| [ | 480 DDSM | 83 | 94 | 83.9 | 94.6 | 78.4 | 83.85 | 2016 |
| [ | 410 INbreast | — | — | — | — | — | 83.4 | 2018 |
| Ours | 500 DDSM | 84.5 | 94.25 | 84.4 | 94.8 | 79.3 | 84.5 | 2020 |
A_Sn: average sensitivity; A_Sp: average specificity.
Figure 9Training and testing accuracies versus the best feature subset.
Summary of the 46 selected features as the best feature subset using our modified genetic feature selection method.
| BI-RADS category | Feature name | Feature significance |
|---|---|---|
| Shape | Irregularity | (i) Irregularity: this feature is extracted based on active contour method (aka snakes) [ |
|
| ||
| Margin | Kurtosis “variance” kurtosis “Kurtosis” | (i) These features are quantified from a set of waveforms by wavelet analysis used in [ |
|
| ||
| Density | Angular second moment “skewness” | Is a measure of the asymmetry of the “angular second moment feature” distribution about its mean. |
|
| ||
| Additional features | Mass size | (i) The mass size is represented by the zone included inside the contour and is computed in mm2. |