| Literature DB >> 34733774 |
Yanjie Zhao1, Rong Chen2, Ting Zhang1, Chaoyue Chen3, Muhetaer Muhelisa1, Jingting Huang1, Yan Xu4, Xuelei Ma1.
Abstract
BACKGROUND: Differential diagnosis between benign and malignant breast lesions is of crucial importance relating to follow-up treatment. Recent development in texture analysis and machine learning may lead to a new solution to this problem.Entities:
Keywords: MRI; breast lesion; differential diagnosis; linear discriminant analysis; machine learning; texture analysis
Year: 2021 PMID: 34733774 PMCID: PMC8558475 DOI: 10.3389/fonc.2021.552634
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Figure 1Flowchart of the MRI classification process by different selection methods. ROI, region of interest; GLCM, gray-level co-occurrence matrix; GLRLM, gray-level run length matrix; GLZLM, gray-level zone length matrix; NGLDM, neighborhood gray-level dependence matrix; LASSO, least absolute shrinkage and selection operator; GBDT, gradient boosting decision tree; RF, random forest; LDA, linear discriminant analysis; AUC, area under the receiver operating characteristic curve.
Baseline characteristics of the 93 patients included in the analysis.
| Characteristics | Benign lesions | Malignant lesions |
|---|---|---|
| n = 71 (%) | n = 194 (%) | |
| Mean age (years; SD) | 31.9 ± 6 | 51.9 ± 10.3 |
| Location | ||
| Left | 23 (32.4%) | 89 (45.9%) |
| Right | 43 (60.6%) | 105 (54.1%) |
| Bilateral | 5 (7.0%) | 0 (0.0%) |
| Pathology | ||
| PCM | 64 (90.1%) | |
| GM | 7 (9.9%) | |
| Non-invasive carcinoma | 11 (5.7%) | |
| Invasive carcinoma | 194 (92.2%) | |
| Others† | 4 (2.1%) |
PCM, plasma cell mastitis; GM, granulomatous mastitis.
†“Others” refers to carcinoma with medullary features, tubular carcinoma, invasive cribriform carcinoma, and invasive papillary carcinoma each in the present study.
Figure 2Two examples of the axial plane of contrast-enhanced T1-weighted MR images. (A) Malignant breast lesions. (B) Benign breast lesions.
The performance of five different models.
| Training | Validation | |||||||
|---|---|---|---|---|---|---|---|---|
| Sensitivity | Specificity | Accuracy | AUC | Sensitivity | Specificity | Accuracy | AUC | |
| Distance Correlation | 0.655 | 0.801 | 0.777 | 0.859 | 0.635 | 0.811 | 0.787 | 0.835 |
| RF | 0.753 | 0.863 | 0.839 | 0.906 | 0.675 | 0.865 | 0.825 | 0.881 |
| LASSO | 0.733 | 0.839 | 0.818 | 0.863 | 0.745 | 0.856 | 0.836 | 0.869 |
| XGBoost | 0.702 | 0.813 | 0.795 | 0.899 | 0.701 | 0.837 | 0.815 | 0.899 |
|
|
|
|
|
|
|
|
|
|
LASSO, least absolute shrinkage and selection operator; GBDT, gradient boosting decision tree; RF, random forest; LDA, linear discriminant analysis; AUC, area under the receiver operating characteristic curve.
We highlighted a relatively better performed model in bold values.
Figure 3Discriminative function of the GBDT + LDA models. Distribution of the benign and malignant breast lesions that originated from multiple dimensions were reduced and reflected to a two-dimension plane. Little overlap was observed between the distribution of benign breast lesion groups (triangles) and malignant breast lesion groups (circles) and the distribution of group centroids (squares). It suggests a qualitative separation between benign and malignant breast lesions. GBDT, gradient boosting decision tree; LDA, linear discriminant analysis.
Figure 4Examples of the 100 data analysis training cycles. Distribution of the direct LDA function determining benign and malignant breast lesions illustrates a promising performance of the GBDT + LDA models. GBDT, gradient boosting decision tree; LDA, linear discriminant analysis. (A) Distribution of the LDA function determined for the benign breast lesions for one cycle; (B) distribution of the LDA function determined for the malignant breast lesions for one cycle.