| Literature DB >> 24330643 |
Edward J Kendall1, Michael G Barnett, Krista Chytyk-Praznik.
Abstract
BACKGROUND: Diagnostic performance in breast screening programs may be influenced by the prior probability of disease. Since breast cancer incidence is roughly half a percent in the general population there is a large probability that the screening exam will be normal. That factor may contribute to false negatives. Screening programs typically exhibit about 83% sensitivity and 91% specificity. This investigation was undertaken to determine if a system could be developed to pre-sort screening-images into normal and suspicious bins based on their likelihood to contain disease. Wavelets were investigated as a method to parse the image data, potentially removing confounding information. The development of a classification system based on features extracted from wavelet transformed mammograms is reported.Entities:
Mesh:
Year: 2013 PMID: 24330643 PMCID: PMC4029799 DOI: 10.1186/1471-2342-13-43
Source DB: PubMed Journal: BMC Med Imaging ISSN: 1471-2342 Impact factor: 1.930
Figure 1Panel A. Digitized film image masked to tissue. Panels B - E are Db2 wavelet coefficient maps at 2nd level of decomposition. B. horizontal detail, C. vertical detail, D. diagonal detail, E. approximation. Features were calculated from the tissue area.
Mean performance of statistical features across all 11 wavelet bases tested
| | ||||||
|---|---|---|---|---|---|---|
| M | 89.2 | 26.6 | 50.3 | 86.8 | 24.1 | 44.4 |
| σ | 94 | 27.6 | 52.8 | 87 | 27.7 | 46.9 |
| S | 90.8 | 29.4 | 52.7 | 91.9 | 20.6 | 43.6 |
| K | 92.8 | 23.7 | 49.8 | 93.5 | 16.8 | 41.6 |
| M + σ | 97.4 | 33.9 | 57.9 | 89.7 | 23.7 | 45 |
| M + S | 97.2 | 38.1 | 60.5 | 91.9 | 25.1 | 46.8 |
| M + K | 96.1 | 35.6 | 58.5 | 94 | 19.6 | 43.6 |
| σ + S | 95.6 | 29.1 | 54.3 | 93.2 | 23.1 | 45.8 |
| σ + K | 96.3 | 28.5 | 54.1 | 94.3 | 18 | 42.6 |
| S + K | 94.2 | 32.2 | 55.7 | 93.9 | 19.5 | 43.6 |
§M - mean, σ- standard deviation, S - skewness, K - kurtosis.
*Sensitivity is defined as TP/(TP + FN).
†Specificity is defined as TN/(TN + FP).
‡Classification rate is defined as (TP + TN)/(TP + TN + FP + FN).
Comparison of the performance of wavelet bases on the DDSM dataset
| Haar | M-h1 | M-d1 | S-h3 | 99.2 | 36.6 | 60.3 |
| Db 2 | M-h3 | M-d8 | S-h5 | 97.4 | 42.7 | 63.4 |
| Db 4 | M-h8 | M-d1 | S-h5 | 95.2 | 20.8 | 49 |
| Db 8 | M-h6 | S-v8 | S-d3 | 97.5 | 40.4 | 62 |
| Bior 1.5 | M-d4 | S-h6 | --- | 96.9 | 38.8 | 60.8 |
| Bior 2.2 | M-h5 | M-v2 | S-d2 | 98.8 | 44.8 | 65.2 |
| Bior 2.8 | M-d4 | S-d2 | S-a5 | 92.9 | 46.9 | 64.4 |
| Bior 3.7 | M-d4 | S-h4 | S-d4 | 98.9 | 28.1 | 54.9 |
| Bior 4.4 | M-h1 | M-d4 | S-d2 | 96.1 | 43 | 63.1 |
| Bior 5.5 | M-h6 | M-d5 | S-d2 | 98.5 | 38.1 | 61 |
| Bior 6.8 | M-v3 | M-d4 | S-d2 | 98 | 39 | 61.3 |
§Wavelet basis notation: Dbn where n describes the number of coefficients used in the wavelet. Db2 encodes polynomials with two coefficients, i.e. constant and linear components. Biorm.n where n describes the order for decomposition and m is the order used for reconstruction.
*Sensitivity is defined as TP/(TP + FN).
†Specificity is defined as TN/(TN + FP).
‡Classification rate is defined as (TP + TN)/(TP + TN + FP + FN).
The best triplet feature was selected for each wavelet.
Comparison of the performance of wavelet bases on the MIAS dataset
| Haar | S-h1 | K-a2 | K-a8 | 90.8 | 32.7 | 51.5 |
| Db2 | K-a3 | | | 93.9 | 14.1 | 39.9 |
| Db4 | K-h5 | | | 94.9 | 9.3 | 37.0 |
| Db8 | S-h3 | S-a4 | K-d4 | 91.8 | 27.3 | 48.2 |
| Bior 1.5 | K-h3 | K-a1 | K-a8 | 94.9 | 13.7 | 39.9 |
| Bior 2.2 | K-a2 | | | 94.9 | 14.1 | 40.3 |
| Bior 2.8 | S-d5 | K-a4 | | 94.9 | 27.3 | 49.2 |
| Bior 3.7 | S-d6 | K-d8 | K-a7 | 93.9 | 23.9 | 46.5 |
| Bior 4.4 | K-h6 | K-a2 | K-a5 | 93.9 | 16.1 | 41.3 |
| Bior 5.5 | K-a1 | | | 93.9 | 14.1 | 39.9 |
| Bior 6.8 | S-h3 | K-h7 | K-a3 | 94.9 | 22.0 | 45.5 |
§Wavelet basis notation: Dbn where n describes the number of coefficients used in the wavelet. Db2 encodes polynomials with two coefficients, i.e. constant and linear components. Biorm.n where n describes the order for decompositiona and m is the order used for reconstruction.
*Sensitivity is defined as TP/(TP + FN).
†Specificity is defined as TN/(TN + FP).
‡Classification rate is defined as (TP + TN)/(TP + TN + FP + FN).
The best triplet feature was selected for each wavelet.
Figure 2A sequential classification network. Images deemed suspicious were passed to a subsequent classifier for re-analysis. As images moved along the chain, the confidence that they were truly suspicious grew.
Figure 3A four-tap branched network. Classifiers were tuned to preferentially detect calcifications or masses. Tuning refers to selecting the feature set to optimize sensitivity for the anomaly.
Figure 4A six-tap branched network. Classifiers were tuned to preferentially detect calcifications or masses. Tuning refers to selecting the feature set to optimize sensitivity for the anomaly.
Performance of sequential classifiers using the DDSM database
| Haar | 390 | 5 | 675 | 644 | 97.9 | 61 |
| Bior 3.7 | 268 | 2 | 407 | 642 | 98.8 | 72.1 |
| Bior 2.2 | 103 | 4 | 304 | 638 | 94 | 77.5 |
| Bior 6.8 | 131 | 7 | 173 | 631 | 91.9 | 85.7 |
| Db 2 | 87 | 10 | 86 | 621 | 84.1 | 92.2 |
| Bior 5.5 | 20 | 6 | 67 | 615 | 67 | 93.8 |
| Bior 1.5 | 21 | 8 | 45 | 607 | 61.5 | 95.7 |
| Db 8 | 15 | 11 | 30 | 596 | 45.5 | 97 |
§Wavelet basis notation: Dbn where n describes the number of coefficients used in the wavelet. Db2 encodes polynomials with two coefficients, i.e. constant and linear components. Biorm.n where n describes the order for decompositiona and m is the order used for reconstruction.
Suspicious images were passed to the next classifier in the chain.
Performance of sequential classifiers on the MIAS database
| Bior 2.8 | 56 | 5 | 149 | 93 | 84.3 | 56.6 |
| Bior 6.8 | 23 | 0 | 126 | 93 | 100.0 | 60.7 |
| Bior 3.7 | 9 | 1 | 117 | 92 | 81.1 | 62.2 |
| Haar | 50 | 5 | 67 | 87 | 82.7 | 73.1 |
| Db 8 | 5 | 3 | 62 | 84 | 44.3 | 73.9 |
§Wavelet basis notation: Dbn where n describes the number of coefficients used in the wavelet. Db2 encodes polynomials with two coefficients, i.e. constant and linear components. Biorm.n where n describes the order for decompositiona and m is the order used for reconstruction.
Suspicious images were passed to the next classifier in the chain.
Performance of branched network classification
| DDSM | | | | | | | | | |
| four-tap | 748 | 814 | 251 | 1 | 1.0 | 0.764 | 0.861 | 0.975 | 1 |
| six-tap | 648 | 840 | 225 | 1 | 1.0 | 0.789 | 0.868 | 0.979 | 1 |
| MIAS | | | | | | | | | |
| 4 tap | 98 | 95 | 110 | 0 | 1.0 | 0.463 | 0.637 | 0.975 | 1 |
| 6 tap | 98 | 134 | 71 | 0 | 1.0 | 0.654 | 0.766 | 0.979 | 1 |
Segmentation of mammograms containing masses from those containing calcifications
| DDSM | | | | | | | | | |
| four- tap | 814 | 1 | 407 | 108 | 239 | 145 | 0.764 | 0.993 | 1.000 |
| six- tap | 840 | 1 | 407 | 1078 | 239 | 119 | 0.770 | 0.993 | 1.000 |
| | | | | | | | | | |
| MIAS | | | | | | | | | |
| 4 tap | 95 | 0 | 71 | 86 | 23 | 28 | 0.463 | 0.947 | 1.000 |
| 6 tap | 134 | 0 | 75 | 66 | 23 | 5 | 0.654 | 1.000 | 1.000 |