| Literature DB >> 35169217 |
Peng Xue1, Jiaxu Wang1, Dongxu Qin1, Huijiao Yan2, Yimin Qu1, Samuel Seery3,4, Yu Jiang5, Youlin Qiao6.
Abstract
Accurate early detection of breast and cervical cancer is vital for treatment success. Here, we conduct a meta-analysis to assess the diagnostic performance of deep learning (DL) algorithms for early breast and cervical cancer identification. Four subgroups are also investigated: cancer type (breast or cervical), validation type (internal or external), imaging modalities (mammography, ultrasound, cytology, or colposcopy), and DL algorithms versus clinicians. Thirty-five studies are deemed eligible for systematic review, 20 of which are meta-analyzed, with a pooled sensitivity of 88% (95% CI 85-90%), specificity of 84% (79-87%), and AUC of 0.92 (0.90-0.94). Acceptable diagnostic performance with analogous DL algorithms was highlighted across all subgroups. Therefore, DL algorithms could be useful for detecting breast and cervical cancer using medical imaging, having equivalent performance to human clinicians. However, this tentative assertion is based on studies with relatively poor designs and reporting, which likely caused bias and overestimated algorithm performance. Evidence-based, standardized guidelines around study methods and reporting are required to improve the quality of DL research.Entities:
Year: 2022 PMID: 35169217 PMCID: PMC8847584 DOI: 10.1038/s41746-022-00559-z
Source DB: PubMed Journal: NPJ Digit Med ISSN: 2398-6352
Fig. 1PRISMA flowchart of study selection.
Displayed is the PRISMA (preferred reporting items for systematic reviews and meta-analyses) flow of search methodology and literature selection process.
Study design and basic demographics.
| First author and year | Participants | |||
|---|---|---|---|---|
| Inclusion criteria | Exclusion criteria | Mean or median age (SD; range) | ||
| Xiao et al.[ | Had breast lesions clearly visualized by ultrasound; Underwent biopsy and had pathological results; provided informed consent. | Patients who were pregnant or lactating; patients who had breast biopsy or were undergoing neoadjuvant chemotherapy or radiotherapy. | 389 | 46.86 (13.03; 19–84) |
| Zhang et al.[ | NR | Pathological results were neither benign nor malignant; Patients with BI-RADS 1 or 2 and abnormal mammography results; patients who were diagnosed with Paget’s disease but had no masses in the breasts. | 2062 | NR |
| Zhou et al.[ | Images were scanned under the same MR protocol; The lesion had complete pathology results; Imaging reports had definite BI-RADS category diagnosed; Lesions were a) solitary in one breast or b) in both breasts with the same BI-RADS and pathological results. | Normal or typical background parenchyma enhancement in bilateral breasts was eliminated. | 1537 | 47.5 (11.8; NR) |
| Agnes et al.[ | NR | NR | NR | NR |
| Tanaka et al.[ | women with breast masses who were referred for further examination after their initial screening examination of breast cancer and then underwent ultrasonography and pathological examination. | Typical cysts; mass lesions ≥ 4.5 cm diameter | NR | NR |
| Becker et al.[ | Patients with postsurgical scars, initially indeterminate, or malignant lesions with histological diagnoses or 2 years follow up. | Patients with normal breast ultrasound, and all patients with lesions classified as clearly benign, except for patients with prior breast-conserving surgical treatment. | 632 | 53 (15; 15–91) |
| Kyono et al.[ | Women recalled after routine breast screening between ages of 47–73 or women with a family history of breast cancer attending annual screening between ages of 40–49. | NR | 2000 | NR (NR; 47–73) |
| Qi et al.[ | NR | NR | 2047 | NR |
| Salim et al.[ | Women aged 40–74 years who were diagnosed as having breast cancer, who had a complete screening examination prior to diagnosis, had no prior breast cancer, did not have implants. | With a cancer diagnosis that had ≥ 12 months between the examination and diagnosis date. | 8805 | 54.5 (16.1; 40–74) |
| Zhang et al.[ | NR | NR | 121 | NR |
| Wang et al.[ | NR | NR | 263 | 51.4 (9.8; 28–76) |
| Li et al.[ | NR | NR | 124 | NR |
| Mckinney et al.[ | NR | Cases without follow-up were excluded from the test set. | 28953 | NR |
| Shen et al.[ | NR | NR | 1249 | NR |
| Suh et al.[ | 18 years or older and not having a history of previous breast surgery. | Subjects without medical records or pathological confirmation for a suspicious breast lesion, missing mammograms, or having poor-quality mammograms. | 1501 | 48.9 (11.1; NR) |
| O’Connell et al.[ | Adult females or males recommended for ultrasound-guided breast lesion biopsy or ultrasound follow-up with at least one suspicious lesion; age ≥ 18 years. | Unable to read and understand English at the University of Rochester; patients with diagnosis of breast cancer in the same quadrant; unwilling to undergo study procedures and informed consent. | 299 | 52.3 (NR; NR) |
| Ruiz et al.[ | Women presenting for screening with no symptoms or concerns. | Women with implants and/or a history of breast cancer. | 240 | 62 (53–66; 39–89) |
| Adachi et al.[ | Patients who underwent DCE breast MRI; patients who were diagnosed with benign or malignant lesions by pathology or a follow-up examination at more than one year. | Patients who were treated with breast surgery, hormonal therapy, chemotherapy, or radiation therapy; age ≤ 20 years. | 371 | NR |
| Samala et al.[ | NR | NR | 2242 | 51.7 (NR; 24–82) |
| Schaffter et al.[ | NR | NR | 153588 | 56.1 (NR; NR) |
| Kim et al.[ | NR | NR | 172230 | 50.3 (10; NR) |
| Wang et al.[ | All nodules of patients were newly discovered and untreated; patients had undertaken ABUS scan; definite pathological benign and malignant; the image quality of ABUS examination was good enough to show the entire margin of the lesion, no matter distinct or indistinct. | Non-nodular breast disease; ABUS artifact was obvious and the poor images quality; ABUS was not available; patients received chemotherapy, radiation therapy or surgical local resection before ABUS scan. | 264 | 54.31 (9.68; 37–75) |
| Yu et al.[ | Pathological results clearly; at least 2D mode US images available, but preferably CDFI and PW mode images. Without blurred images or color overflow. | A foreign-body in the breast; other metastatic tumors or co-infection with HIV; measurement markers, arrows, or puncture needles within the image; | 3623 | 42.5 (NR; 11–95) |
| Sasaki et al.[ | Patients undergone bilateral mammography; patients in whom ultrasonography had established the presence or absence of a lesion; patients in whom a lesion, if present, had been diagnosed as being benign or malignant by cytology or histology; normal patients in whom ultrasonography had revealed no lesion and who had been followed up for at least 1 year. | NR | 310 | 50 (NR; 20–93) |
| Zhang et al.[ | NR | NR | 2620 | NR |
| Bao et al.[ | Aged 20–65 years participated in the program. | NR | 703103 | NR (NR; 20–65) |
| Holmström et al.[ | Nonpregnant aged between 18-64 years, confirmed HIV positivity, and signed informed consent. | NR | 740 | 41.8 (10.3; 18–64) |
| Cho et al.[ | Age ≥18 years, not pregnant, had no history of cervical surgery, and had Pap test results. All lesions were pathologically confirmed by conization biopsy, and normal were defined as those with normal Pap test results. | NR | 791 | NR (NR; 18–94) |
| Bao et al.[ | Aged 25–64 years; samples were processed with liquid-based method. done with HPV testing, and diagnosed by colposcopy-directed biopsy. | NR | 2145 | 38.4 (6.7; 25–46) |
| Hu et al.[ | NR | No image, multiple colpo sessions, inadequate histology. | 9406 | 35 (NR;18–94) |
| Hunt et al.[ | Abnormal cervical screening test, age ≥18 years, intact uterine cervix, not pregnant, no known allergy to the fluorescent dye used for HRME imaging, does not belong to an indigenous Brazilian population; | unable to provide informed consent; prior treatment history; pregnant; other clinical considerations. | 1486 | 40 (12.1; NR) |
| Wentzensen et al.[ | Women aged ≥18 years referred to colposcopy. | NR | 4253 | NR |
| Xue et al.[ | Aged 24-65 years with indications for the need for colposcopy imaging and biopsy, and those who were pathologically confirmed. | Empty or invalid images, low quality, unsatisfactory images, information loss. | 19435 | NR (NR; 24–65) |
| Yu et al.[ | NR | NR | 679 | NR |
| Yuan et al.[ | NR | Without complete clinical and pathological information; without biopsies; pathologically diagnosed as invasive cervical cancer or glandular intraepithelial lesions; poor-quality colposcopy images. | 22330 | NR (NR; 20–66) |
DCE dynamic contrast enhanced, NR not reported, MRI magnetic resonance imaging, BI-RADS breast imaging reporting and data system, MR magnetic resonance, ABUS automated breast ultrasound, CDFI color doppler flow imaging, PW pulsed wave, HIV human immunodeficiency virus, HRME high-resolution microendoscopy, DS dual stained.
*20 studies included in the meta-analysis.
Indicators, algorithms and data sources.
| First author and year | Indicator definition | Algorithm | Data source | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Device | Exclusion of poor-quality imaging | Heatmap provided | Algorithm architecture | Transfer learning applied | Source of data | Number of images for training/internal/external | Data range | Open access data | |
| Xiao et al.[ | Ultrasound | NR | No | CNN | No | Prospective study, data from Peking Union Medical College Hospital. | NR/NR/451 | 2018.01–2018.12 | No |
| Zhang et al.[ | Ultrasound | Yes | No | CNN | Yes | Retrospective study, training data from Harbin Medical University Cancer Hospital; external data from the First Affiliated Hospital of Harbin Medical University. | 2822/447/210 | NR | No |
| Zhou et al.[ | MRI | Yes | Yes | DenseNet | No | Retrospective study, data from Chinese University of Hong Kong. | 1230/307/NR | 2013.03–2016.12 | No |
| Agnes et al.[ | Mammography | NR | No | MA-CNN | No | Retrospective study, data from mini-Mammographic Image Analysis Society database. | 322/NR/NR | NR | Yes |
| Tanaka et al.[ | Ultrasound | NR | Yes | VGG19, ResNet152 | Yes | Retrospective study, data from Japan Association of Breast Thyroid Sonology. | 1382/154/NR | 2011.11–2015.12 | No |
| Becker et al.[ | Ultrasound | NR | Yes | CNN | No | Retrospective study, data from university hospital of Zurich, Switzerland. | 445/192/NR | 2014.01–2014.12 | No |
| Kyono et al.[ | Mammography | NR | No | CNN | No | Retrospective study, data from UK National Health Service Breast Screening Program Centers. | 1800/200/NR | NR | No |
| Qi et al.[ | Ultrasound | NR | Yes | GoogLeNet | Yes | Retrospective study, data from West China Hospital, Sichuan University. | 6786/1359/NR | 2014.10–2017.08 | No |
| Salim et al.[ | Mammography | NR | No | ResNet-34, MobileNet | No | Retrospective study, data from secondary analysis of a population-based mammography screening cohort in Swedish Cohort of Screen-Age Women. | NR/NR/113663 | 2008–2015 | No |
| Zhang et al.[ | Ultrasound | NR | No | Deep polynomial networks | No | Retrospective study, data source is not clear. | NR/NR/NR | NR | No |
| Wang et al.[ | Ultrasound | NR | No | Inception-v3 CNN | No | Retrospective study, data from Jeonbuk National University Hospital. | 252/64/NR | 2012.03–2018.03 | No |
| Li et al.[ | Ultrasound | NR | No | YOLO-v3 | No | Retrospective study, data from Peking University People’s Hospital. | 3124/10812/NR | 2018.10–2019.03 | No |
| Mckinney et al.[ | Mammography | NR | No | CNN | No | Retrospective study, data 1 from two screening centers in England, data 2 from one medical center in USA. | 25856/NR/3097 | 2001–2018 | No |
| Shen et al.[ | Mammography | NR | Yes | VGG, Resnet | Yes | Retrospective study, data from CBIS-DDSM website. | 2102/376/NR | NR | Yes |
| Suh et al.[ | Mammography | Yes | Yes | DenseNet-169, EfficientNet-B5 | No | Retrospective study, data from Hallym University Sacred Heart Hospital. | 2701/301/NR | 2007.02–2015.05 | No |
| O’Connell et al.[ | Ultrasound | NR | No | CNN | No | Prospective study, data from University of Rochester and University Hospital Palermo, Italy. | NR/NR/299 | 2018–2019 | No |
| Ruiz et al.[ | Mammography | Yes | No | CNN | No | Retrospective study, data from two institutes in the US and Europe. | NR/NR/240 | 2013–2017 | No |
| Adachi et al.[ | MRI | NR | No | RetinaNet | No | Retrospective study, data from Tokyo Medical and Dental University hospital. | 286/85/NR | 2014.03–2018.10 | No |
| Samala et al.[ | Mammography | NR | No | ImageNet DCNN | Yes | Retrospective study, data from University of Michigan Health System and the Digital Database for Screening Mammography. | 1335/907/NR | 2001–2006 | No |
| Schaffter et al.[ | Mammography | NR | No | Faster-RCNN | No | Retrospective study, data from Kaiser Permanente Washington and Karolinska Institute. | 100974/43257/166578 | 2016.09–2017.11 | No |
| Kim et al.[ | Mammography | NR | Yes | ResNet-34 | No | Retrospective study, data from five institutions in South Korea, USA. | 166968/3262/320 | 2000.01–2018.12 | No |
| Wang et al.[ | Ultrasound | Yes | No | 3D U-Net | No | Retrospective study, data from the First Affiliated Hospital of Xi’an Jiao tong University. | 254/73/NR | 2018.06–2019.05 | No |
| Yu et al.[ | Ultrasound | Yes | No | ResNet50, FPN | No | Retrospective study, data from 13 Chinese hospitals. | 7835/7813/NR | 2016.01–2019.12 | No |
| Sasaki et al.[ | Mammography | NR | No | Transpara | No | Retrospective study, data from Sagara Hospital Affiliated Breast Center, Japan. | NR/NR/620 | 2018.01–2018.10 | No |
| Zhang et al.[ | Mammography | NR | Yes | MVNN | No | Retrospective study, data from Digital Database for Screening Mammography. | 5194/512/NR | NR | Yes |
| Bao et al.[ | Cytology | NR | No | DL | No | Retrospective study. data from a cervical cancer screening program. | 103793/NR/69906 | 2017.01–2018.12 | No |
| Holmström et al.[ | Cytology | NR | No | CNN | No | Retrospective study, data from a rural clinic in Kenya. | 350/390/NR | 2018–2019 | No |
| Cho et al.[ | Colposcopy | NR | Yes | Inception-Resnet-v2, Resnet-152 | No | Retrospective study, data from three university affiliated hospitals. | 675/116/NR | 2015–2018 | No |
| Bao et al.[ | Cytology | NR | No | VGG16 | No | Retrospective study, data from eight tertiary hospitals in China. | 15083/NR/2145 | 2017.05–2018.10 | No |
| Hu et al.[ | Cervicography | NR | Yes | Faster R-CNN | Yes | Retrospective study, data from Guanacaste costa Rica cohort. | 744/8917/NR | 1993–2001 | No |
| Hunt et al.[ | Microendoscopy | NR | Yes | CNN | No | Prospectively study, data from Barretos Cancer Hospital. | 870/616/NR | NR | No |
| Wentzensen et al.[ | Cytology | NR | No | CNN4, Inception-v3 | No | Retrospective study, data from Kaiser Permanente Northern California and the University of Oklahoma. | 193/409/NR | 2009–2014 | No |
| Xue et al.[ | Colposcopy | Yes | No | U-Net, YOLO | Yes | Retrospective study, data from six multicenter hospitals across China. | 77788/23479/NR | 2018.01–2018.12 | No |
| Yu et al.[ | Colposcopy | NR | No | C-GCNN, GRU | No | Retrospective study, data from First Affiliated Hospital of the University of Science and Technology of China. | 3802/951/NR | 2013.07–2016.09 | No |
| Yuan et al.[ | Colposcopy | Yes | No | ResNet, U-Net, MASK R-CNN | Yes | Retrospective study, data from Women’s Hospital, School of Medicine, Zhejiang University. | 40194/4466/NR | 2013.08–2019.05 | No |
NR not reported, CNN convolutional neural network, DL deep learning, YOLO you only look once, DNN deep neural network, DCNN deep convolutional neural network, MRI magnetic resonance imaging, DenseNet dense convolutional network, MA-CNN multiattention convolutional neural network, VGG visual geometry group network, ResNet deep residual network, FPN feature pyramid networks, MVNN multiview feature fusion neural network, GRU gate recurrent Unit.
*20 studies included in the meta-analysis.
Fig. 2Pooled overall performance of DL algorithms.
a Receiver operator characteristic (ROC) curves of all studies included in the meta-analysis (20 studies with 55 tables), and b ROC curves of studies reporting the highest accuracy (20 studies with 20 tables).
Fig. 3Pooled performance of DL algorithms using different validation types.
a Receiver operator characteristic (ROC) curves of studies with internal validations (15 studies with 40 tables), b ROC curves of studies with external validations (8 studies with 15 tables).
Fig. 4Pooled performance of DL algorithms using different cancer types.
a Receiver operator characteristic (ROC) curves of studies in detecting breast cancer (10 studies with 36 tables), and b ROC curves of studies in detecting cervical cancer (10 studies with 19 tables).
Fig. 5Pooled performance of DL algorithms using different imaging modalities.
a Receiver operator characteristic (ROC) curves of studies using mammography (4 studies with 15 tables), b ROC curves of studies using ultrasound (4 studies with 17 tables), c ROC curves of studies using cytology (4 studies with 6 tables), and d presented ROC curves of studies using colposcopy (4 studies with 11 tables).
Fig. 6Pooled performance of DL algorithms versus human clinicians and human clinicians using the same sample.
a Receiver operator characteristic (ROC) curves of studies using DL algorithms (11 studies with 29 tables), and b ROC curves of studies using human clinicians (11 studies with 18 tables).
Fig. 7Summary estimate of pooled performance using forest plot.
Data presented forest plot of studies included in the meta-analysis (20 studies).
Methods of model training and validation.
| First author and year | Focus | Reference standard | Type of internal validation | External validation | DL versus clinician |
|---|---|---|---|---|---|
| Xiao et al.[ | Breast cancer | Histopathology | NR | Yes | Yes |
| Zhang et al.[ | Breast cancer | Histopathology, immunohistochemistry | Random split-sample validation | Yes | No |
| Zhou et al.[ | Breast cancer | Histopathology, expert consensus | Random split-sample validation | No | Yes |
| Agnes et al.[ | Breast cancer | Histopathology | NR | No | No |
| Tanaka et al.[ | Breast cancer | Histopathology, two-year follow-up | Random split-sample validation | No | No |
| Becker et al.[ | Breast cancer | Histopathology, two-year follow-up | Random split-sample validation | No | No |
| Kyono et al.[ | Breast cancer | Histopathology, follow-up, expert consensus | Ten-fold cross validation | No | No |
| Qi et al.[ | Breast cancer | Histopathology | Random split-sample validation | No | No |
| Salim et al.[ | Breast cancer | Histopathology, two-year follow-up | NR | Yes | Yes |
| Zhang et al.[ | Breast cancer | Histopathology | NR | No | No |
| Wang et al.[ | Breast cancer | Histopathology, two-year follow-up | Five-fold cross validation | No | No |
| Li et al.[ | Breast cancer | Histopathology | Five-fold cross validation | No | No |
| Mckinney et al.[ | Breast cancer | Histopathology, multiple years of follow-up | NR | Yes | No |
| Shen et al.[ | Breast cancer | Histopathology | Random split-sample validation | No | No |
| Suh et al.[ | Breast cancer | Histopathology | Random split-sample validation | No | No |
| O’Connell et al.[ | Breast cancer | Histopathology, two-year follow-up | NR | Yes | No |
| Ruiz et al.[ | Breast cancer | Histopathology, one-year follow-up | NR | Yes | Yes |
| Adachi et al.[ | Breast cancer | Histopathology, at least one-year follow-up | Random split-sample validation | No | Yes |
| Samala et al.[ | Breast cancer | Histopathology | N-fold cross validation | No | No |
| Schaffter et al.[ | Breast cancer | Histopathology, follow-up | Random split-sample validation | Yes | No |
| Kim et al.[ | Breast cancer | Histopathology, at least one-year follow-up | Random split-sample validation | Yes | Yes |
| Wang et al.[ | Breast cancer | Histopathology | Random split-sample validation | No | No |
| Yu et al.[ | Breast cancer | Histopathology | Random split-sample validation | No | No |
| Sasaki et al.[ | Breast cancer | Histopathology, cytology, at least one-year follow-up | NR | Yes | Yes |
| Zhang et al.[ | Breast cancer | Histopathology | NR | No | No |
| Heling Bao et al.[ | Cervical cancer | Histopathology | NR | Yes | Yes |
| Holmström et al.[ | Cervical cancer | Histopathology | NR | No | Yes |
| Cho et al.[ | Cervical cancer | Histopathology | Random split-sample validation | No | No |
| Bao et al.[ | Cervical cancer | Histopathology | NR | Yes | No |
| Hu et al.[ | Cervical cancer | Histopathology | Random split-sample validation | No | No |
| Hunt et al.[ | Cervical cancer | Histopathology | Random split-sample validation | No | Yes |
| Wentzensen et al.[ | Cervical cancer | Histopathology | Random split-sample validation | No | Yes |
| Xue et al.[ | Cervical cancer | Histopathology | Random split-sample validation | No | Yes |
| Yu et al.[ | Cervical cancer | Histopathology | Random split-sample validation | No | No |
| Yuan et al.[ | Cervical cancer | Histopathology | Random split-sample validation | No | No |
NR not reported, DL deep learning. *20 studies included in the meta-analysis.
*20 studies included in the meta-analysis.