| Literature DB >> 35354835 |
Rana Khaled1, Maha Helal2, Omar Alfarghaly3, Omnia Mokhtar2, Abeer Elkorany4, Hebatalla El Kassas2, Aly Fahmy5.
Abstract
Contrast-enhanced spectral mammography (CESM) is a relatively recent imaging modality with increased diagnostic accuracy compared to digital mammography (DM). New deep learning (DL) models were developed that have accuracies equal to that of an average radiologist. However, most studies trained the DL models on DM images as no datasets exist for CESM images. We aim to resolve this limitation by releasing a Categorized Digital Database for Low energy and Subtracted Contrast Enhanced Spectral Mammography images (CDD-CESM) to evaluate decision support systems. The dataset includes 2006 images, with an average resolution of 2355 × 1315, consisting of 310 mass images, 48 architectural distortion images, 222 asymmetry images, 238 calcifications images, 334 mass enhancement images, 184 non-mass enhancement images, 159 postoperative images, 8 post neoadjuvant chemotherapy images, and 751 normal images, with 248 images having more than one finding. This is the first dataset to incorporate data selection, segmentation annotation, medical reports, and pathological diagnosis for all cases. Moreover, we propose and evaluate a DL-based technique to automatically segment abnormal findings in images.Entities:
Mesh:
Year: 2022 PMID: 35354835 PMCID: PMC8967853 DOI: 10.1038/s41597-022-01238-0
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1(a) Low-energy, (b) High-energy, and (c) Subtracted image.
Fig. 2Flow diagram of the preparation of (CDD-CESM) and the deep learning method to automatically generate the segmentation annotation.
Fig. 3Samples of low energy and subtracted CESM images from the dataset.
Descriptions of the annotations available for the dataset.
| Annotation | Description | Method | Format |
|---|---|---|---|
| Patient’s age | Age of the patient at time of examination. | Calculated from the date of birth. | Numbers |
| Side of breast | Right or left breast | Manually annotated. | Categorical |
| Breast Composition ACR category | Breast density describes the amount of fibroglandular tissue present in a breast relative to fat. | Blinded evaluation by two radiologists. | Categorical: |
| ACR BIRADS lexicon for standardized descriptors | Radiological lexicon providing the standard descriptors for evaluation of breast findings. | Blinded evaluation by two radiologists. | Mass shape, margin, and density. |
| Architectural distortion. | |||
| Asymmetries. | |||
| Calcification type, and distribution. | |||
| Mass enhancement pattern. | |||
| Non-mass enhancement pattern, and distribution | |||
| Overall BIRADS | Radiological lexicon providing the final assessment categories for evaluation of breast findings. | Blinded evaluation by two radiologists. | BIRADS 1: Normal examination |
| BIRADS 2: Benign findings | |||
| BIRADS 3: Probably benign findings <2% malignancy | |||
| BIRADS 4: Suspicious >2 but <95% malignancy | |||
| BIRADS 5: Highly suspicious of malignancy >95% | |||
| BIRADS 6: Known biopsy-proven malignancy | |||
| Type of image view | Usually two standard views are acquired for each breast: | Manually annotated. | Categorical: |
| • MLO: most important because it allows depiction of most of the breast’s tissues | • MLO | ||
| • CC: reveals medial part and external lateral portion of the breast | • CC | ||
| Tags | Labels assigned as follows: | Manually assigned and annotated by radiologist. | Categorical set of 140 unique tags. |
| • Standardized descriptors of ACR BIRADS 2013 lexicon | |||
| • Probable diagnosis | |||
| • Classification | |||
| Machine label | Two different mammography machines were used. | Manually annotated. | Machine number 1 or 2. |
| Pathology results / follow-up | Three classes: normal, benign, and malignant. | Manually annotated. | Categorical: |
| • Normal | |||
| • Benign | |||
| • Malignant |
Fig. 4(a) Example of the DL Gradcam highlights, (b) Segmentation calculated after applying a threshold on the highlights, (c) Final output after applying the white pixel intensity threshold, and (d) Hand-drawn segmentation annotation.
Fig. 5Histograms for the CDD-CESM dataset showing distribution of (a) BIRADS category for each abnormality, (b) Benign and malignant lesions.
Characteristics of the CDD-CESM dataset.
| CDD-CESM | ||
|---|---|---|
| Stats | Years | 2019–2021 |
| Sources | NCI, Cairo University | |
| No. females | 326 | |
| No. total images | 2006 | |
| No. normal images | 757 (37.4%) | |
| No. benign images | 587 (29.3%) | |
| No. malignant images | 662 (33.3%) | |
| Age (counted per patient) | <40 | 58 (17.8%) |
| 40–49 | 100 (30.7%) | |
| 50–59 | 95 (29.1%) | |
| 60–69 | 59 (18.1%) | |
| ≥70 | 14 (4.3%) | |
| Cancer Type | Invasive ductal carcinoma | 445 (67.5%) |
| Invasive lobular carcinoma | 42 (6.3%) | |
| Mixed invasive ductal carcinoma and invasive lobular carcinoma | 28 (4.2%) | |
| Ductal carcinoma insitu purely | 17 (2.5%) | |
| Inflammatory breast cancer | 40 (6%) | |
| Other | 90 (13.5%) |
The 757 normal images consist of 751 normal images and 6 post-neoadjuvant images considered normal (no residual disease proved by postoperative pathology). The age statistics are provided per number of patients.
Detailed results of our DL segmentaion model.
| Images | Overlap50 | IOU | F1 | ||
|---|---|---|---|---|---|
| Findings | Mass | 310 | 0.85 | 0.65 | 0.72 |
| Distortion | 48 | 0.87 | 0.70 | 0.79 | |
| Asymmetry | 222 | 0.87 | 0.70 | 0.78 | |
| Calcifications | 238 | 0.81 | 0.62 | 0.70 | |
| Postoperative | 159 | 0.77 | 0.61 | 0.68 | |
| Mass enhancement | 334 | 0.91 | 0.66 | 0.73 | |
| Non mass enhancement | 184 | 0.89 | 0.72 | 0.79 | |
| Image Type | DM | 665 | 0.81 | 0.64 | 0.71 |
| CM | 590 | 0.86 | 0.65 | 0.71 | |
| Pathology | Benign | 587 | 0.75 | 0.59 | 0.64 |
| Malignant | 662 | 0.90 | 0.69 | 0.77 | |
| Image View | MLO | 634 | 0.83 | 0.64 | 0.71 |
| CC | 621 | 0.83 | 0.64 | 0.71 | |
| Machine | GE | 1175 | 0.84 | 0.64 | 0.71 |
| Hologic | 80 | 0.70 | 0.60 | 0.67 | |
| Age | <40 | 240 | 0.78 | 0.65 | 0.72 |
| 40–69 | 958 | 0.83 | 0.64 | 0.70 | |
| ≥70 | 57 | 0.94 | 0.71 | 0.78 |
Fig. 6Examples of different cases and their corresponding automatic segmentations.
| Measurement(s) | Dual-Energy Contrast-Enhanced Digital Spectral Mammography |
| Technology Type(s) | digital curation |
| Sample Characteristic - Organism | Homo sapiens • Breast |
| Sample Characteristic - Location | Egypt |