| Literature DB >> 28083100 |
Alan Chan1, Jack A Tuszynski2.
Abstract
Breast cancer is one of the most prevalent types of cancer today in women. The main avenue of diagnosis is through manual examination of histopathology tissue slides. Such a process is often subjective and error-ridden, suffering from both inter- and intraobserver variability. Our objective is to develop an automatic algorithm for analysing histopathology slides free of human subjectivity. Here, we calculate the fractal dimension of images of numerous breast cancer slides, at magnifications of 40×, 100×, 200× and 400×. Using machine learning, specifically, the support vector machine (SVM) method, the F1 score for classification accuracy of the 40× slides was found to be 0.979. Multiclass classification on the 40× slides yielded an accuracy of 0.556. A reduction of the size and scope of the SVM training set gave an average F1 score of 0.964. Taken together, these results show great promise in the use of fractal dimension to predict tumour malignancy.Entities:
Keywords: automatic image slide analysis; cancer prediction; fractal dimension; tumour malignancy
Year: 2016 PMID: 28083100 PMCID: PMC5210682 DOI: 10.1098/rsos.160558
Source DB: PubMed Journal: R Soc Open Sci ISSN: 2054-5703 Impact factor: 2.963
Figure 1.An image of the Koch snowflake, a fractal with fractal dimension d≈ 1.26. From https://commons.wikimedia.org/wiki/File:Flocke.PNG, licensed under Creative Commons.
The distribution of images in BreaKHIS, from [18].
| magnification | benign | malignant | total |
|---|---|---|---|
| 40× | 625 | 1370 | 1995 |
| 100× | 644 | 1437 | 2081 |
| 200× | 623 | 1390 | 2013 |
| 400× | 588 | 1232 | 1820 |
| total | 2480 | 5429 | 7909 |
| no. patients | 24 | 58 | 82 |
Figure 2.Representative benign and malignant images at 40× magnification. Benign: adenosis (A), fibroadenoma (F), phyllodes tumour (PT) and tubular adenoma (TA). Malignant: ductal carcinoma (DC), lobular carcinoma (LC), mucinous carcinoma (MC) and papillary carcinoma (PC).
The distribution of images of malignant tumours in BreaKHIS, from [18].
| magnification | DC | LC | MC | PC | total |
|---|---|---|---|---|---|
| 40× | 864 | 156 | 205 | 145 | 1370 |
| 100× | 903 | 170 | 222 | 142 | 1437 |
| 200× | 896 | 163 | 196 | 135 | 1390 |
| 400× | 788 | 137 | 169 | 138 | 1232 |
| total | 3451 | 626 | 792 | 560 | 5429 |
| no. patients | 38 | 5 | 9 | 6 | 58 |
Figure 3.Image of a 40× slide of ductal carcinoma, after binarization and edge detection.
Figure 4.Fractal dimensions for 40× images.
Figure 7.Fractal dimensions for 400× images.
Classification accuracy for benign versus malignant images at all magnifications.
| magnification | TP | TN | F1 |
|---|---|---|---|
| 40× | 0.990 | 0.968 | 0.979 |
| 100× | 0.983 | 0.090 | 0.165 |
| 200× | 0.974 | 0.090 | 0.165 |
| 400× | 0.940 | 0.146 | 0.253 |
Benign versus malignant classification accuracy with different training sets.
| BT | MT | TP | TN | F1 |
|---|---|---|---|---|
| A | DC | 0.988 | 0.953 | 0.970 |
| A | LC | 0.993 | 0.957 | 0.975 |
| A | MC | 0.979 | 0.967 | 0.973 |
| A | PC | 0.999 | 0.945 | 0.971 |
| F | DC | 0.982 | 0.930 | 0.955 |
| F | LC | 0.975 | 0.946 | 0.960 |
| F | MC | 0.995 | 0.922 | 0.957 |
| F | PC | 0.999 | 0.901 | 0.947 |
| PT | DC | 1.000 | 0.913 | 0.956 |
| PT | LC | 0.993 | 0.942 | 0.967 |
| PT | MC | 0.999 | 0.913 | 0.954 |
| PT | PC | 1.000 | 0.895 | 0.945 |
| TA | DC | 0.992 | 0.956 | 0.974 |
| TA | LC | 0.974 | 0.975 | 0.974 |
| TA | MC | 0.994 | 0.960 | 0.977 |
| TA | PC | 0.999 | 0.954 | 0.976 |
| mean | 0.964 |
The distribution of images of benign tumours in BreaKHIS, from [18].
| magnification | A | F | TA | PT | total |
|---|---|---|---|---|---|
| 40× | 114 | 253 | 109 | 149 | 625 |
| 100× | 113 | 260 | 121 | 150 | 644 |
| 200× | 111 | 264 | 108 | 140 | 623 |
| 400× | 106 | 237 | 115 | 130 | 588 |
| total | 444 | 1014 | 453 | 569 | 2368 |
| no. patients | 4 | 10 | 3 | 7 | 24 |