| Literature DB >> 31385114 |
Eduardo Fleury1,2, Karem Marcomini3.
Abstract
BACKGROUND: The purpose of this work was to evaluate computable Breast Imaging Reporting and Data System (BI-RADS) radiomic features to classify breast masses on ultrasound B-mode images.Entities:
Keywords: Breast neoplasms; Machine Learning; Neural networks (computer); Support vector machine; Ultrasonography
Year: 2019 PMID: 31385114 PMCID: PMC6682836 DOI: 10.1186/s41747-019-0112-7
Source DB: PubMed Journal: Eur Radiol Exp ISSN: 2509-9280
Histopathology of the 206 solid lesions at percutaneous core-biopsy
| Bening lesions | Malignant lesions | |||||
|---|---|---|---|---|---|---|
| Type | Number | Percentage (%) | Type | Number | Percentage (%) | |
| Fibroadenoma | 71 | 49.3 | Ductal carcinoma | 1 | 1.6 | |
| Fibocystic changes | 48 | 33.3 | Invasive ductal carcinoma | GI | 9 | 14.5 |
| Phyllodes tumour | 3 | 2.1 | GII | 33 | 53.2 | |
| Papylary lesion | 3 | 2.1 | GIII | 14 | 22.6 | |
| Other$ | 19 | 13.2 | Invasive lobular carcinoma | 5 | 8.1 | |
| Total | 144 | 100.0 | Total | 62 | 100.0 | |
Steatonecrosis, mastitis, fat tissue
Fig. 1Area difference between a breast malignant mass and its equivalent ellipse
Fig. 2Original contour (white), inside contour (red), and outline contour (blue) of a lesion
Fig. 3Convex hull (red contour) of a malignant lesion (white contour), with peaks on the distance vector. Vconvex are marked by stars
Fig. 4Example of segmentation of a fibroadenoma. Original (a) and segmented (b) images (to classify the lesions, the segmented image was along the plane best representing the mass that was used, in this case, the long axis)
Performance of different feature combinations using the decision tree method
| Features | Sensitivity (%) | Specificity (%) | Area under the curve |
|---|---|---|---|
| {6} | 73.2 | 69.1 | 0.652 |
| {6, 1} | 71.2 | 69.2 | 0.653 |
| {6, 1, 2} | 71.9 | 73.3 | 0.720 |
| {6, 1, 2, 8} | 70.6 | 75.0 | 0.744 |
{1}: area difference with equivalent ellipse; {2}: orientation; {6}: area difference between the convex hull and tumour; {8}: entropy
Performance of different feature combinations using the multilayer perceptron method
| Features | Sensitivity (%) | Specificity (%) | Area under the curve |
|---|---|---|---|
| {5} | 67.5 | 78.7 | 0.759 |
| {5, 2} | 68.4 | 79.2 | 0.789 |
| {5, 2, 1} | 68.8 | 84.1 | 0.799 |
| {5, 2, 1, 3} | 66.2 | 71.7 | 0.806 |
{1}: area difference with equivalent ellipse; {2}: orientation; {3}: average of difference vector; {5}: average of distance vector
Performance of different feature combinations using the random forest method
| Features | Sensitivity (%) | Specificity (%) | Area under the curve |
|---|---|---|---|
| {4} | 62.2 | 71.7 | 0.697 |
| {4, 8} | 72.3 | 74.6 | 0.760 |
| {4, 8, 5} | 72.6 | 72.6 | 0.778 |
| {4, 8, 5, 2} | 72.7 | 75.9 | 0.811 |
{2}: orientation; {4}: number of peaks on the distance vector (NumPeaks); {5}: average of distance vector; {8}: entropy
Performance of different feature combinations using the linear discriminant analysis method
| Features | Sensitivity (%) | Specificity (%) | Area under the curve |
|---|---|---|---|
| {5} | 59.5 | 87.4 | 0.770 |
| {5, 2} | 76.0 | 69.8 | 0.818 |
{2}: orientation; {5}: average of distance vector
Performance of different feature combinations using the support vector machine method
| Features | Sensitivity (%) | Specificity (%) | Area under the curve |
|---|---|---|---|
| {4} | 64.3 | 80.5 | 0.746 |
| {4, 2} | 67.1 | 76.2 | 0.798 |
| {4, 2, 10} | 67.1 | 78.8 | 0.807 |
| {4, 2, 10 ,8} | 68.6 | 76.2 | 0.814 |
| {4, 2, 10 ,8 ,1} | 71.4 | 76.9 | 0.840 |
{1}: difference area with equivalent ellipse; {2}: orientation; {4}: number of peaks on the distance vector; {8}: entropy; {10}: lesion size
Performance of five different machine learning methods for classifying 206 solid breast lesion on ultrasound images
| Method | Features | Sensitivity | Specificity | AUC | |||
|---|---|---|---|---|---|---|---|
| Point estimate (%) | 95% CI | Point estimate (%) | 95% CI | Point estimate | 95% CI | ||
| Decision tree | {6, 1, 2, 8} | 70.6 | 0.5889–0.8008 | 75.0 | 0.6231–0.8448 | 0.744 | 0.677–0.774 |
| Multilayer perceptron | {5, 2, 1, 3} | 66.2 | 0.5462–0.7612 | 71.7 | 0.5843–0.8203 | 0.806 | 0.677–0.839 |
| Random forest | {4, 8, 5, 2} | 72.7 | 0.5983–0.8181 | 75.9 | 0.593–0.811 | 0.811 | 0.710–0.892 |
| Linear discriminant analysis | {5, 2} | 76.0 | 0.6212–0.8345 | 69.8 | 0.6156–0.8316 | 0.818 | 0.6667–0.9444 |
| Support vector machine | {4, 2, 10, 8, 1} | 71.4 | 0.6479–0.8616 | 76.9 | 0.6148–0.8228 | 0.840 | 0.6667–0.9762 |
{1}: area difference with equivalent ellipse; {2}: orientation; {3}: average of difference vector; {4}: number of peaks on the distance vector; {5}: average of distance vector; {6}: area difference between the convex hull and tumour; {7}: echogenicity; {8}: entropy; {9}: shadow; {10}: lesion size. CI Confidence interval
Performance of different machine learning methods obtained by Shan et al. [8]
| Method | Features | Sensitivity (%) | Specificity (%) | Area under the curve |
|---|---|---|---|---|
| Decision tree | {4, 3, 10, 2, 7} | 74.0 | 82.0 | 0.803 |
| Multilayer perceptron | {4, 3, 2, 6, 5, 1} | 78.0 | 78.2 | 0.823 |
| Random forest | {6, 10, 2, 9, 3} | 75.3 | 82.0 | 0.828 |
| Support vector machine | {4, 2, 6, 3, 10} | 77.3 | 78.2 | 0.842 |