| Literature DB >> 23762182 |
Cruz-Ramírez Nicandro1, Mezura-Montes Efrén, Ameca-Alducin María Yaneli, Martín-Del-Campo-Mena Enrique, Acosta-Mesa Héctor Gabriel, Pérez-Castro Nancy, Guerra-Hernández Alejandro, Hoyos-Rivera Guillermo de Jesús, Barrientos-Martínez Rocío Erandi.
Abstract
Breast cancer is one of the leading causes of death among women worldwide. There are a number of techniques used for diagnosing this disease: mammography, ultrasound, and biopsy, among others. Each of these has well-known advantages and disadvantages. A relatively new method, based on the temperature a tumor may produce, has recently been explored: thermography. In this paper, we will evaluate the diagnostic power of thermography in breast cancer using Bayesian network classifiers. We will show how the information provided by the thermal image can be used in order to characterize patients suspected of having cancer. Our main contribution is the proposal of a score, based on the aforementioned information, that could help distinguish sick patients from healthy ones. Our main results suggest the potential of this technique in such a goal but also show its main limitations that have to be overcome to consider it as an effective diagnosis complementary tool.Entities:
Mesh:
Year: 2013 PMID: 23762182 PMCID: PMC3674659 DOI: 10.1155/2013/264246
Source DB: PubMed Journal: Comput Math Methods Med ISSN: 1748-670X Impact factor: 2.238
Names, definitions, and values of variables. In the experiments the positive value is discretized to 1 and the negative value is discretized to 0. All the values of qualitative variables are given by the image analyst.
| Variable name | Definition | Variable value | Variable type |
|---|---|---|---|
| Asymmetry | Temperature difference (in Celsius) between the right and the left breasts | If difference < 1°C, then value = 5, difference between 1°C and 2°C, the value is 10, and difference > 2°C, the value is 15 | Nominal (5, 10, 15) |
|
| |||
| Thermovascular network | Number of veins with the highest temperature | If the visualization is abundant vascularity, the value is 15, if it is moderate, the value is 10, and if it is slight, the value is 5 | Nominal (5, 10, 15) |
|
| |||
| Curve pattern | Heat area under the breast | If heat visualized is abundant, the value is 15, if it is moderate, the value is 10, and if it is slight, the value is 5 | Nominal (5, 10, 15) |
|
| |||
| Hyperthermia | Hottest point of the breast | If there is at least one hottest point, the value is 20 and otherwise the value is 0 | Binary (0, 20) |
|
| |||
| 2c | Temperature difference between the hottest points of the two breasts | If difference between 1 and 10, the value is 10, difference between 11 and 15, the value is 15, difference between 16 and 20, the value is 20 and if difference > 20, the value is 25 | Nominal (10, 15, 20, 25) |
|
| |||
| F unique | Amount of hottest points | If sum = 1, the value is 40, if sum = 2, the value is 20, if sum = 3, the value is 10, and if sum > 3, the value is 5 | Nominal (5, 10, 20, 40) |
|
| |||
| 1c | Hottest point in only one breast | If the hottest point is only one breast, the value is 40 and if the hottest point is both breasts, the value is 20 | Binary (20, 40) |
|
| |||
| Furrow | Furrows under the breasts | If the furrow is visualized, the value is positive; if not,the value is negative | Binary (0, 1) |
|
| |||
| Pinpoint | Veins going to the hottest points of the breasts | If the veins are visualized, the value is positive; if not, the value is negative | Binary (0, 1) |
|
| |||
| Hot center | The center of the hottest area | If the center of the hottest is visualized, the value is positive; if not, the value is negative | Binary (0, 1) |
|
| |||
| Irregular form | Geometry of the hot center | If the hot center is visualized like a nongeometrical figure, the value is positive; if not, the value is negative | Binary (0, 1) |
|
| |||
| Histogram | Histogram in form of a isosceles triangle | If the histogram is visualized as a triangle form, the value is positive; if not, the value is negative | Binary (0, 1) |
|
| |||
| Armpit | Difference temperature between the 2 armpits | If the difference = 0, the value in both is negative; if not, the value is positive; consequently the other is negative | Binary (0, 1) |
|
| |||
| Breast profile | Visually altered profile | If an altered profile is visualized abundantly, the value is 3, if it is moderate, value is 2, if it is small, the value is 1, and if it does not exist, the value is 0 | Binary (0, 1) |
|
| |||
| Score | The sum of values of the previous 14 variables | If the sum < 160, then the value is negative for cancer; if the sum ≥ 160, the value is positive for cancer | Binary (0, 1) |
|
| |||
| Age | Age of patient | If the age < 51, the value is 1, if the age between 51 and 71, the value is 2, and if age > 71, the value is 3 | Binary (0, 1) |
|
| |||
| Outcome | The result is obtained via open biopsy | The values are cancer or no-cancer | Binary (0, 1) |
Figure 3Bayesian network built by procedure of Hill-Climber using the 98-case database. Only variable furrow is directly related to the outcome. Once the variable furrow is known, all the other variables are independent of the class.
Figure 4Bayesian network built by procedure of Repeated Hill-Climber using the 98-case database. Only variable furrow is directly related to the outcome. Once the variable furrow is known, all the other variables are independent of the class.
Parameter values for Hill-Climber and Repeated Hill-Climber.
| Parameters | Hill-Climber | Repeated Hill-Climber |
|---|---|---|
| The initial structure NB (Naïve Bayes) | False | False |
| Number of parents | 100,000 | 100,000 |
| Runs | — | 10 |
| Score type | MDL | MDL |
| Seed | — | 1 |
| Arc reversal | True | True |
Figure 1Thermal image showing the temperature of the color-coded breasts. The red and gray tones represent hotter areas.
Figure 2Breast thermography procedure.
Accuracy, sensitivity, and specificity results for the three Bayesian network classifiers presented in Section 3.2.1.
| Naïve Bayes | Hill-Climber | Repeated Hill-Climber | |
|---|---|---|---|
| Accuracy | 71.88% (±12.61) | 76.10% (±7.10) | 76.12% (±7.19) |
| Sensitivity | 82% (74–91) | 97% (94–100) | 99% (96–100) |
| Specificity | 37% (15–59) | 0% (0-0) | 0% (0-0) |
Accuracy, sensitivity, and specificity of artificial neural network, decision trees ID3 and C4.5 for the thermography.
| Artificial neural network | Decision tree ID3 | Decision tree C4.5 | |
|---|---|---|---|
| Accuracy | 67.47% (±15.65) | 73.19% (±12.84) | 75.50% (±6.99) |
| Sensitivity | 82% (73–91) | 87% (79–94) | 94% (88–99) |
| Specificity | 33% (13–53) | 29% (9–48) | 0% (0-0) |
Confusion matrix of Naïve Bayes.
| Cancer | Noncancer | Total | |
|---|---|---|---|
| Cancer | TP 65 | FN 12 | 77 |
| Noncancer | FP 14 | TN 7 | 21 |
|
| |||
| 98 | |||
TP: true positive, FP: false positive, FN: false negative, TN: true negative.
Confusion matrix of Hill-Climber.
| Cancer | Non-cancer | Total | |
|---|---|---|---|
| Cancer | TP 75 | FN 2 | 77 |
| Non-cancer | FP 21 | TN 0 | 21 |
|
| |||
| 98 | |||
TP: true positive, FP: false positive, FN: false negative, TN: true negative.
Confusion matrix of Repeated Hill-Climber.
| Cancer | Non-cancer | Total | |
|---|---|---|---|
| Cancer | TP 76 | FN 1 | 77 |
| Non-cancer | FP 21 | TN 0 | 21 |
|
| |||
| 98 | |||
TP: true positive, FP: false positive, FN: false negative, TN: true negative.
Confusion matrix of artificial neural network.
| Cancer | Non-cancer | Total | |
|---|---|---|---|
| Cancer | TP 58 | FN 19 | 77 |
| Non-cancer | FP 15 | TN 6 | 21 |
|
| |||
| 98 | |||
TP: true positive, FP: false positive, FN: false negative, TN: true negative.
Confusion matrix of decision tree ID3.
| Cancer | Non-cancer | Total | |
|---|---|---|---|
| Cancer | TP 67 | FN 10 | 77 |
| Non-cancer | FP 15 | TN 6 | 21 |
|
| |||
| 98 | |||
TP: true positive, FP: false positive, FN: false negative, TN: true negative.
Confusion matrix of decision tree C4.5.
| Cancer | Non-cancer | Total | |
|---|---|---|---|
| Cancer | TP 76 | FN 1 | 77 |
| Non-cancer | FP 21 | TN 0 | 21 |
|
| |||
| 98 | |||
TP: true positive, FP: false positive, FN: false negative, TN: true negative.
Figure 5Decision tree C4.5 using the 98-case database.