| Literature DB >> 35054354 |
Matteo Interlenghi1, Christian Salvatore1,2, Veronica Magni3, Gabriele Caldara2, Elia Schiavon1, Andrea Cozzi3, Simone Schiaffino4, Luca Alessandro Carbonaro5,6, Isabella Castiglioni7,8, Francesco Sardanelli3,4.
Abstract
We developed a machine learning model based on radiomics to predict the BI-RADS category of ultrasound-detected suspicious breast lesions and support medical decision-making towards short-interval follow-up versus tissue sampling. From a retrospective 2015-2019 series of ultrasound-guided core needle biopsies performed by four board-certified breast radiologists using six ultrasound systems from three vendors, we collected 821 images of 834 suspicious breast masses from 819 patients, 404 malignant and 430 benign according to histopathology. A balanced image set of biopsy-proven benign (n = 299) and malignant (n = 299) lesions was used for training and cross-validation of ensembles of machine learning algorithms supervised during learning by histopathological diagnosis as a reference standard. Based on a majority vote (over 80% of the votes to have a valid prediction of benign lesion), an ensemble of support vector machines showed an ability to reduce the biopsy rate of benign lesions by 15% to 18%, always keeping a sensitivity over 94%, when externally tested on 236 images from two image sets: (1) 123 lesions (51 malignant and 72 benign) obtained from two ultrasound systems used for training and from a different one, resulting in a positive predictive value (PPV) of 45.9% (95% confidence interval 36.3-55.7%) versus a radiologists' PPV of 41.5% (p < 0.005), combined with a 98.0% sensitivity (89.6-99.9%); (2) 113 lesions (54 malignant and 59 benign) obtained from two ultrasound systems from vendors different from those used for training, resulting into a 50.5% PPV (40.4-60.6%) versus a radiologists' PPV of 47.8% (p < 0.005), combined with a 94.4% sensitivity (84.6-98.8%). Errors in BI-RADS 3 category (i.e., assigned by the model as BI-RADS 4) were 0.8% and 2.7% in the Testing set I and II, respectively. The board-certified breast radiologist accepted the BI-RADS classes assigned by the model in 114 masses (92.7%) and modified the BI-RADS classes of 9 breast masses (7.3%). In six of nine cases, the model performed better than the radiologist did, since it assigned a BI-RADS 3 classification to histopathology-confirmed benign masses that were classified as BI-RADS 4 by the radiologist.Entities:
Keywords: breast cancer; core needle biopsy; machine learning; positive predictive value; radiomics; sensitivity; ultrasound (US)
Year: 2022 PMID: 35054354 PMCID: PMC8774734 DOI: 10.3390/diagnostics12010187
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Histopathology of the 834 breast masses included in the study.
| Malignant or Benign | Histopathology Type | Number | Percentage |
|---|---|---|---|
| Benign | Fibroadenoma | 146 | 34.0% |
| Sclerosing lesions/adenosis | 64 | 14.9% | |
| Normal breast tissue | 38 | 8.8% | |
| Inflammatory lesions | 36 | 8.4% | |
| Papilloma (no atypia) | 27 | 6.3% | |
| Cysts, ductal ectasia, or seromas | 37 | 8.6% | |
| Usual ductal hyperplasia | 17 | 3.9% | |
| Atypical ductal hyperplasia | 8 | 1.9% | |
| Fibroadenomatoid changes | 23 | 5.3% | |
| Other benign findings | 34 | 7.9% | |
|
|
|
| |
| Malignant | Invasive ductal carcinoma | 304 | 75.2% |
| Invasive lobular carcinoma | 42 | 10.4% | |
| Ductal carcinoma in situ | 19 | 4.7% | |
| Other malignancies originating from breast tissues | 35 | 8.7% | |
| Other malignancies (metastases from non breast tissues) | 4 | 1.0% | |
|
|
|
|
Technical details and composition of the three image sets.
| Dataset | US System | US | Total | Mean Pixel Size (Range) (mm) | Malignant Lesions | Mean Pixel Size (Range) (mm) | Benign | Mean Pixel Size (Range) (mm) |
|---|---|---|---|---|---|---|---|---|
|
| Esaote MyLab 6100 | 273 | 277 | 0.098 | 156 | 0.103 | 121 | 0.092 |
| (0.046–0.154) | (0.046–0.154) | (0.046–0.139) | ||||||
| Esaote MyLab 6150 | 311 | 318 | 0.091 | 142 | 0.095 | 176 | 0.088 | |
| (0.046–0.123) | (0.046–0.123) | (0.046–0.123) | ||||||
| Esaote MyLab 6440 | 2 | 3 | 0.068 | 1 | 0.068 | 2 | 0.068 | |
| (0.068–0.068) | (0.068–0.068) | (0.068–0.068) | ||||||
|
| Esaote MyLab 6100 | 59 | 59 | 0.091 | 20 | 0.094 | 39 | 0.090 |
| (0.046–0.109) | (0.062–0.108) | (0.046–0.108) | ||||||
| Esaote MyLab 6150 | 63 | 63 | 0.097 | 31 | 0.101 | 32 | 0.092 | |
| (0.048–0.139) | (0.048–0.139) | (0.062–0.123) | ||||||
| Esaote MyLab 7340002 | 1 | 1 | 0.106 | 0 | – | 1 | 0.106 | |
| (0.106–0.106) | (0.106–0.106) | |||||||
|
| Samsung RS80A | 86 | 86 | 0.065 | 44 | 0.068 | 42 | 0.062 |
| (0.040–0.110) | (0.050–0.110) | (0.040–0.090) | ||||||
| Siemens | 26 | 27 | 0.067 | 10 | 0.069 | 17 | 0.066 | |
| (0.030–0.080) | (0.060–0.070) | (0.030–0.080) |
US, ultrasound.
Figure 1Ensemble of support vector machines: proportion of correctly predicted benign and malignant lesions versus percentage of voting from the support vector machines.
Performances of the ensemble of support vector machines in the external testing datasets.
| Performance Metric |
|
|
|---|---|---|
| SVM sensitivity (95% CI) | 98.0% (89.6%–99.9%) | 94.4% (84.6%–98.8%) |
| SVM NPV (95% CI) | 92.9% (66.1%–99.8%) | 75.0% (42.8%–94.5%) |
| SVM PPV (95% CI) | 45.9% ** (36.3%–55.7%) | 50.5% ** (40.4%–60.6%) |
| Radiologists’ PPV | 41.5% ** (32.7%–50.7%) | 47.8% ** (38.3%–57.4) |
| SVM specificity (95% CI) | 18.1% ** (10.0%–28.9%) | 15.3% ** (7.2%–27.0%) |
| Radiologists’ specificity | 0.0% | 0.0% |
SVM, support vector machines; CI, confidence interval; NPV, negative predictive value; PPV, positive predictive value. ** indicates a p-value < 0.005, calculated considering chance/random classification. Note: the 0% radiologists’ specificity is an obliged result determined by the inclusion of breast masses that were all referred to ultrasound-guided core needle biopsy.
Ensemble of support vector machines. Top 25 most relevant predictors sorted in descending order of relevance.
| Rank | Feature Family | Feature |
|---|---|---|
|
| Morphology | Perimeter-to-area ratio ** |
|
| Morphology | Maximum diameter ** |
|
| Morphology | Compactness ** |
|
| Morphology | Acircularity ** |
|
| Morphology | Perimeter ** |
|
| Morphology | Area ** |
|
| Morphology | Center of mass shift ** |
|
| Morphology | Circularity * |
|
| Neighborhood grey tone difference matrix | Strength ** |
|
| Neighborhood grey tone difference matrix | Coarseness ** |
|
| Neighborhood grey tone difference matrix | Contrast |
|
| Neighborhood grey tone difference matrix | Busyness * |
|
| Grey-level size zone matrix | Zone size non-uniformity ** |
|
| Grey-level size zone matrix | Grey-level non-uniformity glszm ** |
|
| Neighboring grey-level dependence matrix | Dependence count non-uniformity ** |
|
| Neighboring grey-level dependence matrix | Low-dependence low-grey-level emphasis |
|
| Grey-level run length matrix | Grey-level non-uniformity |
|
| Grey-level run length matrix | Run length non-uniformity |
|
| Intensity-based statistics | Minimum |
|
| Intensity-based statistics | Energy |
|
| Intensity-based statistics | Variance |
|
| Intensity-based statistics | Quartile coefficient |
|
| Intensity-based statistics | 10th percentile |
|
| Intensity histogram | 10th percentile |
|
| Grey-level co-occurrence matrix | First measure of information correlation |
* denotes statistical significance at 0.05 (adjusted with Bonferroni–Holm correction). ** denotes a statistical significance at 0.005 (adjusted with Bonferroni–Holm correction).
Figure 2Ensemble of support vector machines: violin plots and boxplots of the most relevant radiomic predictors ranked from 1 to 15. Green: benign class. Red: malignant class. ** denotes a statistical significance at 0.005 (adjusted with Bonferroni–Holm correction).
Figure 3Representative examples of two benign lesions according to histological diagnosis, as classified by the developed radiomic machine learning system. First row (a): true negative (benign lesion correctly classified as < 2% likelihood of cancer); second row (b): false positive (benign lesion incorrectly classified as > 2% likelihood of cancer). ROIs were manually defined by the expert breast radiologist to segment the suspicious breast lesion. Radiomic features: (a) compactness 0.807; acircularity 0.113; center of mass shift 3.495; zone size non-uniformity 743.6; (b) compactness 0.718; acircularity 0.181; center of mass shift 4.458; zone size non-uniformity 2255.5. Histopathology: (a) cyst; (b) fibroepithelial proliferation.
Figure 4Representative examples of two malignant lesions according to histological diagnosis, as classified by the developed radiomic machine learning system. First row (a): true positive (malignant lesion correctly classified as >2% likelihood of cancer); second row (b): false negative (malignant lesion incorrectly classified as <2% likelihood of cancer). ROIs were manually defined by the expert breast radiologist to segment the suspicious breast lesion. Radiomic features: (a) compactness 0.569; acircularity 0.326; center of mass shift 3.997; zone size non-uniformity 2600.2; (b) compactness 0.614; acircularity 0.276; center of mass shift 5.861; zone size non-uniformity 2209.0. Histopathology: (a) invasive ductal carcinoma; (b) papillary carcinoma.
Ensemble of support vector machines: BI-RADS diagnostic categories predicted for breast masses of the Training and internal testing set according to histopathology groups.
| Histopathology Type | BI-RADS 3 (%) | BI-RADS 4 (%) | BI-RADS 5 (%) |
|---|---|---|---|
| Fibroadenoma | 8 (1.3) | 93 (15.6) | 0 (0.0) |
| Sclerosing lesions/adenosis | 10 (1.7) | 37 (6.2) | 2 (0.3) |
| Normal breast tissue | 6 (1.0) | 26 (4.3) | 1 (0.2) |
| Inflammatory lesions | 2 (0.3) | 26 (4.3) | 0 (0.0) |
| Papilloma (no atypia) | 1 (0.2) | 17 (2.8) | 0 (0.0) |
| Cysts, ductal ectasia, or seromas | 10 (1.7) | 13 (2.2) | 0 (0.0) |
| Usual ductal hyperplasia | 2 (0.3) | 9 (1.5) | 0 (0.0) |
| Atypical ductal hyperplasia | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Fibroadenomatoid changes | 2 (0.3) | 13 (2.2) | 0 (0.0) |
| Other benign findings | 8 (1.3) | 93 (15.6) | 0 (0.0) |
| Invasive ductal carcinoma | 5 (0.8) | 211 (35.3) | 6 (1.0) |
| Invasive lobular carcinoma | 2 (0.3) | 32 (5.4) | 1 (0.2) |
| Ductal carcinoma in situ | 4 (0.7) | 10 (1.7) | 0 (0.0) |
| Other malignancies originating from breast tissues | 2 (0.3) | 24 (4.0) | 0 (0.0) |
| Other malignancies (metastases from non breast tissues) | 0 (0.0) | 1 (0.2) | 1 (0.2) |
Ensemble of support vector machines: BI-RADS diagnostic categories predicted for breast masses of the Testing set I according to histopathology groups.
| Histopathology Type | BI-RADS 3 (%) | BI-RADS 4 (%) | BI-RADS 5 (%) |
|---|---|---|---|
| Fibroadenoma | 2 (1.6) | 19 (15.4) | 0 (0.0) |
| Sclerosing lesions/adenosis | 1 (0.8) | 5 (4.1) | 0 (0.0) |
| Normal breast tissue | 0 (0.0) | 3 (2.4) | 0 (0.0) |
| Inflammatory lesions | 2 (1.6) | 4 (3.3) | 0 (0.0) |
| Papilloma (no atypia) | 2 (1.6) | 4 (3.3) | 0 (0.0) |
| Cysts, ductal ectasia, or seromas | 2 (1.6) | 6 (4.9) | 0 (0.0) |
| Usual ductal hyperplasia | 0 (0.0) | 4 (3.3) | 0 (0.0) |
| Atypical ductal hyperplasia | 3 (2.4) | 5 (4.1) | 0 (0.0) |
| Fibroadenomatoid changes | 0 (0.0) | 1 (0.8) | 0 (0.0) |
| Other benign findings | 1 (0.8) | 8 (6.5) | 0 (0.0) |
| Invasive ductal carcinoma | 1 (0.8) | 40 (32.5) | 0 (0.0) |
| Invasive lobular carcinoma | 0 (0.0) | 3 (2.4) | 0 (0.0) |
| Ductal carcinoma in situ | 0 (0.0) | 2 (1.6) | 0 (0.0) |
| Other malignancies originating from breast tissues | 0 (0.0) | 4 (3.3) | 0 (0.0) |
| Other malignancies (metastases from non breast tissues) | 0 (0.0) | 1 (0.8) | 0 (0.0) |
Ensemble of support vector machines: BI-RADS diagnostic categories predicted for breast masses of the Testing set II according to histopathology groups.
| Histopathology Type | BI-RADS 3 (%) | BI-RADS 4 (%) | BI-RADS 5 (%) |
|---|---|---|---|
| Fibroadenoma | 1 (0.9) | 23 (20.4) | 0 (0.0) |
| Sclerosing lesions/adenosis | 2 (1.8) | 7 (6.2) | 0 (0.0) |
| Normal breast tissue | 0 (0.0) | 2 (1.8) | 0 (0.0) |
| Inflammatory lesions | 1 (0.9) | 1 (0.9) | 0 (0.0) |
| Papilloma (no atypia) | 1 (0.9) | 2 (1.8) | 0 (0.0) |
| Cysts, ductal ectasia, or seromas | 1 (0.9) | 5 (4.4) | 0 (0.0) |
| Usual ductal hyperplasia | 1 (0.9) | 1 (0.9) | 0 (0.0) |
| Atypical ductal hyperplasia | 0 (0.0) | 0 (0.0) | 0 (0.0) |
| Fibroadenomatoid changes | 1 (0.9) | 6 (5.3) | 0 (0.0) |
| Other benign findings | 1 (0.9) | 3 (2.7) | 0 (0.0) |
| Invasive ductal carcinoma | 1 (0.0) | 40 (35.4) | 0 (0.0) |
| Invasive lobular carcinoma | 0 (0.0) | 4 (3.5) | 0 (0.0) |
| Ductal carcinoma in situ | 1 (0.9) | 2 (1.8) | 0 (0.0) |
| Other malignancies originating from breast tissues | 1 (0.9) | 4 (3.5) | 0 (0.0) |
| Other malignancies (metastases from non breast tissues) | 0 (0.0) | 1 (0.9) | 0 (0.0) |