| Literature DB >> 30555254 |
Farahnaz Sadoughi1, Zahra Kazemy1, Farahnaz Hamedan1, Leila Owji1, Meysam Rahmanikatigari2, Tahere Talebi Azadboni1.
Abstract
Breast cancer is the most common cancer among women around the world. Despite enormous medical progress, breast cancer has still remained the second leading cause of death worldwide; thus, its early diagnosis has a significant impact on reducing mortality. However, it is often difficult to diagnose breast abnormalities. Different tools such as mammography, ultrasound, and thermography have been developed to screen breast cancer. In this way, the computer helps radiologists identify chest abnormalities more efficiently using image processing and artificial intelligence (AI) tools. This article examined various methods of AI using image processing to diagnose breast cancer. It was a review study through library and Internet searches. By searching the databases such as Medical Literature Analysis and Retrieval System Online (MEDLINE) via PubMed, Springer, IEEE, ScienceDirect, and Gray Literature (including Google Scholar, articles published in conferences, government technical reports, and other materials not controlled by scientific publishers) and searching for breast cancer keywords, AI and medical image processing techniques were extracted. The results were provided in tables to demonstrate different techniques and their results over recent years. In this study, 18,651 articles were extracted from 2007 to 2017. Among them, those that used similar techniques and reported similar results were excluded and 40 articles were finally examined. Since each of the articles used image processing, a list of features related to the image used in each article was also provided. The results showed that support vector machines had the highest accuracy percentage for different types of images (ultrasound =95.85%, mammography =93.069%, thermography =100%). Computerized diagnosis of breast cancer has greatly contributed to the development of medicine, is constantly being used by radiologists, and is clear in ethical and medical fields with regard to its effects. Computer-assisted methods increase diagnosis accuracy by reducing false positives.Entities:
Keywords: artificial intelligence techniques; breast cancer; breast cancer screening techniques; medical image processing
Year: 2018 PMID: 30555254 PMCID: PMC6278839 DOI: 10.2147/BCTT.S175311
Source DB: PubMed Journal: Breast Cancer (Dove Med Press) ISSN: 1179-1314
Figure 1Stages of systematic review.
Advantages and disadvantages of various imaging techniques in breast cancer
| Imaging method | Application | Advantage | Disadvantage |
|---|---|---|---|
| Mammography | Golden standard imaging and diagnosis of breast cancer early stages | • It uses low levels of X-rays for imaging | • Radiation risk and other risks |
| Ultrasound | Suitable for dense and soft tissues | • Widely available and accessible | • Quality and interpretation of the image depends highly on the skill of the person doing the scan |
| Thermography | Suitable for muscle tissue | • Noninvasive | • Physicians can have difficulty interpreting the images because of the low quality and low resolution of the images taken by the first generation of the medical infrared imaging cameras |
Note: Data adapted from Kerlikowske et al,23 Heywang-Köbrunner et al,33 and Qi and Diakides.34
Abbreviation: DCIS, ductal carcinoma in situ.
Figure 2Stages of cancer detection by image processing.
Note: Data from Pradeep et al16 and Lin et al.19
Preprocessing techniques
| Preprocessing image techniques | |
|---|---|
| FPN | FPN is the result of differences in responsivity of the detectors to incoming irradiance. It is a common problem when working with FPA. FPN for a particular configuration can be recovered from a blackbody image for later subtraction from the thermogram sequence. |
| Badpixels | A badpixel can be defined as an anomalous pixel behaving differently from the rest of the array. For instance, a dead pixel remains unlit (black), while a hot pixel is permanently lit (white). In any case, badpixels do not provide any useful information and only contribute to deteriorate the image contrast. A map of badpixels is generally known from the FPA manufacturer or they can be detected manually or automatically; the value at badpixel locations is then replaced by the average value of neighboring pixels. |
| Vignetting | Vignetting is another source of noise on thermograms that cause a darkening of the image corners with respect to the image center due to limited exposure. It depends on both pixel location and temperature difference with respect to the ambient. A correction procedure has been proposed. |
| Temperature calibration | A transformation function is used to convert the grayscale values |
| Noise smoothing | One of the most useful preprocessing (and postprocessing) techniques is noise smoothing. For instance, neighbor processing can be performed bypassing a mask or kernel through the image. More elaborate noise removal techniques are available. |
Note: Data adapted from Ibarra-Castanedo et al.17
Abbreviations: FPA, focal plane array; FPN, fixed pattern noise; IR, infrared.
Image segmentation methods
| Name of segmentation method | Description of method | Benefits | Limitations |
|---|---|---|---|
| Edge detection method | Depends on discontinuity detection; generally aims to situate points with less or more rapid gray-level changes | • A human-like approach | • Not good with images where edges are unclearly defined |
| Thresholding method | Requires images with diffident sharp edges, each of which fits a single area | • No need for prior knowledge about the image | • Not good for image with no clear peaks |
| Region-dependent method | • The same pixel in nearby areas are similar | • Does well if region homogeneity norm is painless to define | • The amount of memory and calculation time is very costly |
| Fuzzy method | Uses operators, mathematics, properties and rules of fuzzy inference | • Fuzzy membership function can be used to represent the degree of low properties or linguistic phrases | • Determining the fuzzy membership function is not easy |
| Neural network method | Uses for clustering or categorization | • Does not require writing tedious programs | • Need a lot of time to train |
Note: Data adapted from Pradeep et al.16
The most important texture features16,19
| Texture features |
|---|
| Mean |
| Skewness |
| Entropy |
| SD |
| Line likeness |
| Homogeneity |
| Smoothness |
| Correlation |
| Sum average |
| Sum entropy |
| Variance |
| Kurtosis |
| Energy |
| LBP |
| Regularity |
| Contrast |
| Coarseness |
| Kurtosis |
| Sum variance |
| Difference variance |
Note: Data adapted from Pradeep et al16 and Lin et al.19
Abbreviation: LBP, local binary pattern.
Artificial intelligence techniques for classifying breast cancer
| Methods | Images | Attributes | Advantages | Disadvantages | Reported performance | Reference |
|---|---|---|---|---|---|---|
| SVM | Medical infrared thermal imaging | • Energy, contrast, correlation, sum of squares: variance, inverse difference moment, sum variance, sum entropy, entropy, difference variance, difference entropy, information measures of correlation 1, information measures of correlation 2 | • Use of curve let before feature extraction | – | Accuracy: 90.91 | |
| SVM | Ultrasound | • Histogram features | • Combined textural and morphological features | – | Accuracy: 95.85 | |
| Fuzzy SVM | Ultrasound | • One pixel distance in the mean of the information measure of correlation ( | • Using of fractal dimensions | • In practical classification problems, the effects of the samples in training dataset may be different | Accuracy: 94.25 | |
| Cascade-forward back-propagation artificial neural network | Mammography | • Autocorrelation | • Features, extracted from the known mammogram images, are stored in the knowledge base | – | Accuracy: 67.8 | |
| Feed-forward back-propagation artificial neural network | Mammography | • Autocorrelation | • Using image registration techniques | – | Accuracy: 92.8% | |
| Mammography | • GLCM features (angular second moment, entropy, contrast, local homogeneity, correlation, shade, prominence, variance, sum average, sum entropy, difference entropy, sum variance, and difference variance) | • Texture analysis of tissue surrounding shows promising results in computer-aided diagnosis of breast cancer and may contribute to the reduction in unnecessary biopsies | • Completion of the proposed method should include the investigation of additional classification schemes and textural features, as well as validation over a larger dataset | Accuracy: 64.0–89.0 | ||
| SVM | Mammography | • Wavelet features | • Intensity- and shape-driven features have the greatest role on the later classification stage | • Obvious since increasing the sensitivity will yield in a system that identifies more regions as abnormal in order to achieve higher sensitivity | Accuracy: 93.06 | |
| Genetic algorithm and back-propagation neural network | Medical infrared thermal imaging | • Age | • This method is beneficial to patients with and without symptoms | • Still need to improve the methods | Accuracy: 70 | |
| SVM | Medical infrared thermal imaging | • Kurtosis | • Generates input–output mapping functions which can be used for classification or for regression | • Improved by extracting better texture features and by using a larger sample size | Accuracy: 88.10 | |
| Medical infrared thermal imaging | • Angular second moment (energy) | • Detection using texture and features and minimum variance quantization is based on the data acquisition protocol parameter and the image statistics | – | Accuracy: 92.5 | ||
| Naive Bayes | Medical infrared thermal imaging | • Same above | • Requires a small amount of training data to estimate the parameters necessary | • SVM classification results are better than the naive Bayes classification results | Accuracy: 80.0 | |
| Deep learning | Mammogram | • Density | • Classifying between malignant and benign masses | • Heavily depends on the distribution and/or diversity of the ROIs (or cases) in the specific training and testing datasets | ROC curve: 0.790±0.019 |
Abbreviations: C, circularity (the ratio of square of the perimeter to the tumor area); GLCM, gray-level co-occurrence matrix; GLNU, gray-level no uniformity; GLRLM, gray-level run length matrix; k-NN, k-nearest neighbor; LRE, long-run emphasis; LTEM, laws texture energy measures; MRF, Markov Random Fields; NRV, normalized residual value; RLNU, run length non-uniformity; ROC, receiver operating characteristic; ROI, region of interest; RPERC, run percentage; RS, overlap ratio; SRE, short-run emphasis; SVM, support vector machine; US, ultrasound.