Literature DB >> 32584382

Accuracy and Efficiency of Deep-Learning-Based Automation of Dual Stain Cytology in Cervical Cancer Screening.

Nicolas Wentzensen1, Bernd Lahrmann2, Megan A Clarke1, Walter Kinney3, Diane Tokugawa4, Nancy Poitras4, Alex Locke4, Liam Bartels5,6, Alexandra Krauthoff5,6, Joan Walker7, Rosemary Zuna7, Kiranjit K Grewal4, Patricia E Goldhoff4, Julie D Kingery4, Philip E Castle8, Mark Schiffman1, Thomas S Lorey4, Niels Grabe2,5,6.   

Abstract

BACKGROUND: With the advent of primary human papillomavirus testing followed by cytology for cervical cancer screening, visual interpretation of cytology slides remains the last subjective analysis step and suffers from low sensitivity and reproducibility.
METHODS: We developed a cloud-based whole-slide imaging platform with a deep-learning classifier for p16/Ki-67 dual-stained (DS) slides trained on biopsy-based gold standards. We compared it with conventional Pap and manual DS in 3 epidemiological studies of cervical and anal precancers from Kaiser Permanente Northern California and the University of Oklahoma comprising 4253 patients. All statistical tests were 2-sided.
RESULTS: In independent validation at Kaiser Permanente Northern California, artificial intelligence (AI)-based DS had lower positivity than cytology (P < .001) and manual DS (P < .001) with equal sensitivity and substantially higher specificity compared with both Pap (P < .001) and manual DS (P < .001), respectively. Compared with Pap, AI-based DS reduced referral to colposcopy by one-third (41.9% vs 60.1%, P < .001). At a higher cutoff, AI-based DS had similar performance to high-grade squamous intraepithelial lesions cytology, indicating a risk high enough to allow for immediate treatment. The classifier was robust, showing comparable performance in 2 cytology systems and in anal cytology.
CONCLUSIONS: Automated DS evaluation removes the remaining subjective component from cervical cancer screening and delivers consistent quality for providers and patients. Moving from Pap to automated DS substantially reduces the number of colposcopies and also achieves excellent performance in a simulated fully vaccinated population. Through cloud-based implementation, this approach is globally accessible. Our results demonstrate that AI not only provides automation and objectivity but also delivers a substantial benefit for women by reduction of unnecessary colposcopies.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 32584382      PMCID: PMC7781458          DOI: 10.1093/jnci/djaa066

Source DB:  PubMed          Journal:  J Natl Cancer Inst        ISSN: 0027-8874            Impact factor:   13.506


Advances in digital imaging and machine learning can revolutionize cancer screening, diagnosis, and treatment by improving accuracy and reproducibility of image assessment and streamlining clinical workflow (1–4). With its requirement for high throughput and fast turnaround and its dependence on microscopic and visual technologies, automation can play a major role in improving the efficiency of cervical cancer screening. Many countries are currently switching from Pap cytology to high-risk human papillomavirus (HPV) screening (5–7). Although a negative HPV test provides great reassurance of low cervical cancer risk over the next decade (8–10), only a small subset of women with a positive HPV test require further evaluation. To avoid overburdening the system with HPV-positive women, additional triage is required for colposcopy referral (11, 12). Current triage strategies include partial HPV genotyping and Pap cytology (7, 13). The limited sensitivity and reproducibility of cytology require laborious quality control procedures and frequent retesting (14, 15). Improving the efficiency of cervical cancer screening is particularly important for vaccinated populations due to lower disease prevalence and higher demands for screening test performance. A promising triage strategy is concomitant detection of p16 and Ki-67 in the same cell (p16/Ki-67 dual stain [DS]), 2 markers that are closely linked to cervical carcinogenesis and HPV oncoprotein actions. The HPV oncoprotein E7 interrupts cell cycle control by releasing E2F, activating p16 expression. The coexpression of p16 and Ki-67, a cell proliferation marker, in the same cell is specific to HPV-related carcinogenesis. DS has shown greater accuracy for detection of HPV-related precancers compared with cytology (16–21). Currently, artificial intelligence (AI) algorithms mostly try to match manual reading accuracy to improve automation but do not offer a substantial improvement for patients. Automated scanning and deep-learning evaluation of DS slides can improve throughput, reproducibility, and accuracy of the assay for better risk stratification and a direct benefit to women (8, 22, 23). To achieve this, we developed the CYTOREADER system that combines whole-slide scanning with automated evaluation of DS cytology slides. Cloud-based evaluation provides ample computational capacity and storage space and can provide diagnostic procedures where sufficient personnel, expertise, or infrastructure is lacking. We evaluated the clinical performance of CTYOREADER in 4253 slides from 3 epidemiological studies of HPV-positive cervical and anal precancers.

Methods

General Approach

CYTOREADER uses whole-slide scanners (Hamamatsu Nanozoomers HT, XR, and S360) for imaging of ThinPrep (Hologic) or SurePath (Becton Dickinson, BD) slides, 2 widely used liquid-based cytology technologies. CYTOREADER is a cloud-based system (Google Cloud Platform) that can also run as a local installation. Training of deep-learning algorithms for automated DS evaluation was performed using small areas (tiles) from whole slides containing individual or small numbers of epithelial cells. For training of the deep-learning algorithms, tiles from training slides were manually evaluated for DS-positive cells by 3 observers (Supplementary Figure 1, available online).

Deep Learning

Two deep-learning approaches (Convolutional Neural Network with 4 layers [CNN4] and Inception-v3 with 48 layers [IncV3]) were developed sequentially as shown in Figure 1 and described in Supplementary Methods (available online). The algorithms determine the number of DS-positive cells on a slide by detecting the number of tiles above a certain likelihood threshold. A slide is considered positive if the number of DS-positive cells on a slide exceeds a certain cutoff. Training and validation were conducted on the tile level and the slide level. First, a training set from 450 patients was selected for which the clinical endpoint cervical intraepithelial neoplasia grade 3 or greater (CIN3+) was unblinded. Tiles were selected for initial training (80%) and validation (20%) of the algorithm. The deep-learning network provides a likelihood for each tile above which it is considered positive (0.5 for CNN4 and 0.4 for IncV3). The resulting candidate CNN was applied on the slide level on training slides. A cutoff of positive tiles is used to determine slide positivity (≥3 tiles per cell for CNN4 and ≥2 tiles per cell for IncV3). From misclassified slides, false-positive or false-negative tiles were extracted and fed back into the original CNN training to optimize classification accuracy of the CNN. A final locked CNN was applied on the patient level on the blinded validation set comprising 3803 slides. CNN4 showed good performance in Thinprep slides but not in Surepath slides. Subsequently, a second algorithm (IncV3) was trained specifically for Surepath slides (Supplementary Methods, available online). We published a GitHub repository and created a web page at https://github.com/stcmedhub/dual_stain_dl with a source code description of the models and the installation instructions.
Figure 1.

Study design. AI = artificial intelligence; CNN = convolutional neural network; CIN3+ = cervical intraepithelial neoplasia grade 3 or worse; DS = dual stain.

Study design. AI = artificial intelligence; CNN = convolutional neural network; CIN3+ = cervical intraepithelial neoplasia grade 3 or worse; DS = dual stain.

Study Populations

The Biopsy Study is a population-based study of women aged 18 years or older referred to colposcopy at the University of Oklahoma Health Sciences Center between 2009 and 2011 (24). We included DS slides from 602 women as previously described (19). The study population was split into a representative training set (193 slides with 741 DS-positive and 953 DS-negative tiles) and a validation set of 409 slides (Figure 1). This study was approved by the University of Oklahoma and National Cancer Institute (NCI) institutional review boards (IRB); written informed consent was obtained from all participants before study enrollment. The Anal Cancer Screening Study (ACSS) was based at the San Francisco Kaiser Permanente Northern California (KPNC) Anal Cancer Screening Clinic. HIV-positive men who have sex with men 18 years or older were enrolled at KPNC between 2009 and 2010. DS slides from 318 men were generated as previously described (25). From 19 training slides, 445 DS-positive and 532 DS-negative tiles were used for training (Figure 1). This study was approved by the KPNC and NCI IRBs; written informed consent was obtained from all participants before study enrollment. At KPNC, DS was evaluated for triage of HPV-positive women between 2012 and 2015 in a population of women aged 25 years and older who were undergoing routine cervical cancer screening (16). From a screening population of more than 300 000 women in a year, 3333 slides from HPV-positive women were included. From 238 training slides, 8215 DS-positive and 9739 DS-negative tiles were used for training (Figure 1). The study was approved by the KPNC IRB and was exempted from institutional review at the NCI by the Office of Human Subjects Research. Patient consent was waived because deidentified discarded specimens were used in this study.

Clinical Endpoints

All studies followed routine clinical practice at the respective institutions. Cytology was classified by the Bethesda System: negative for intraepithelial lesions or malignancy, atypical squamous cells of undetermined significance, low-grade squamous intraepithelial lesions, and high-grade squamous intraepithelial lesions (HSIL) (26). Final diagnosis was established by histopathology classified according to the cervical intraepithelial neoplasia (CIN) scale for cervical endpoints, which indicates the extent of dysplastic cells in the cervical epithelium: no indication for biopsy, normal CIN, grade 1 (CIN1), grade 2 (CIN2), grade 3 (CIN3), and cancer. We grouped adenocarcinoma in situ with CIN3. For anal disease endpoints, the comparable anal intraepithelial neoplasia nomenclature (AIN) was used.

p16/Ki-67 Staining and Evaluation

For the Biopsy Study and ACSS, slides were prepared from residual PreservCyt material using a T2000 processor (Hologic, Bedford, MA). For the KPNC study, slides were prepared from residual SurePath tubes according to the manufacturer’s instructions (BD, Sparks, MD). Immunostaining of cervical cytology slides for p16/Ki-67 was performed using the CINtec Plus Kit (Roche, Tucson, AZ) according to the manufacturer’s instructions. DS-trained cytotechnologists reviewed all slides; a slide was considered positive if 1 or more cervical epithelial cell(s) stained both with a brown cytoplasmic stain (p16) and a red nuclear (Ki-67) irrespective of morphologic abnormalities. Slides from the Biopsy Study and ACSS were stained and evaluated at Roche mtm laboratories AG, Heidelberg, Germany, whereas slides from the Kaiser DS study were stained and evaluated at KPNC. HPV testing with partial genotyping (HPV16 and HPV18) at KPNC was based on the cobas assay (Roche, Pleasanton, CA).

Statistical Analysis

We created boxplots and calculated medians to show the distribution of DS-positive cells in cytology and histology categories. We compared differences in DS cell counts in ordinal cytology and histology categories using 1-way analysis of variance. The primary endpoint for the Biopsy Study and the Kaiser Study was CIN3 or greater (CIN3+). For ACSS, the primary endpoint was AIN2 or AIN3 (AIN2+). Receiver operator characteristics curve analysis was conducted for the number of DS-positive cells against the primary endpoints, and the area under the curve (AUC) was calculated. Sensitivity and specificity coordinates for manual DS evaluation and cytology were plotted on the receiver operator characteristics curve for comparison. We calculated percentage positivity, sensitivity, specificity, and Youden’s index in the Biopsy Study and ACSS for the cutoff determined by CNN4 and for manual DS evaluation. In the Kaiser Study, with a representative population of women who underwent routine screening, we calculated percentage positivity, sensitivity, specificity, and positive and negative predictive values for automated and manual DS. Differences in positivity, sensitivity, and specificity were evaluated using an exact McNemar’s χ2, and differences in predictive values were evaluated using the R package DTComPair, using the generalized score statistic (27). To evaluate clinical efficiency of each strategy, we estimated the number of CIN3+ detected for different cutoffs of DS-positive cells and the ratio of the number of tests and colposcopies per case of CIN3+ detected. We also evaluated the theoretical performance of automated DS in a fully vaccinated population by excluding all women who were positive for HPV16 and/or HPV18 from the analysis. Analyses were performed in SPSS, Stata, and R. All statistical tests were 2-sided and P less than .05 was considered statistically significant.

Results

Automated Detection of DS-Positive Cells in Colposcopy and Anoscopy Populations

We developed a deep-learning algorithm for automated detection of DS-positive cells on ThinPrep slides from 2 referral populations (Biopsy Study and ACSS), including 212 training slides with 1186 DS-positive and 1485 DS-negative tiles (Figure 1). We evaluated the algorithm in independent validation slides from both studies (Figure 1). In both studies, we observed an increase in the number of DS-positive cells by increasing severity of cytology and histology, with higher absolute DS-positive cell numbers in ACSS (P < .001 for all comparison; Supplementary Figure 2, available online). In the Biopsy Study validation set with 53 CIN3+, the AUC for detecting CIN3+ based on automated DS using CNN4 was 0.74 (Figure 2). At a cutoff of 3 DS-positive cells, the CNN4 algorithm had marginally lower positivity (58% vs 63%, respectively, P = .06) with comparable sensitivity (P = 1.0) and marginally higher specificity compared with manual DS (40.6% vs 45.7%, respectively, P = .07) (Table 1).
Figure 2.

Receiver operating curve characteristics analysis of number of dual stain (DS)-positive cells detected by CYTOREADER for detection of cervical precancer in the Biopsy Study and anal precancer in the Anal Cancer Screening Study. AUC = area under the curve; AIN2+ = anal intraepithelial neoplasia grade 2 or worse; CIN3+ = cervical intraepithelial neoplasia grade 3 or worse.

Table 1.

Accuracy for cervical and anal precancer based on manual and automated detection of DS-positive cells on ThinPrep slides in a colposcopy population (Biopsy Study, N = 409) and an anoscopy population (ACSS, N = 299)

EvaluationPositive
AUCSensitivity
Specificity
Youden’s index
% P a % (95% CI) P a % (95% CI) P a
Biopsy Study validation set (CIN3+)
 Manual63.1Ref87.0 (75.6 to 93.6)Ref40.5 (35.6 to 45.7)Ref0.27
 CNN457.9.060.7487.0 (75.6 to 93.6)1.045.6 (40.5 to 50.8).070.33
ACSS validation set (AIN2+)
 Manual71.0Ref92.8 (82.2 to 96.5)Ref36.1 (30.3 to 42.4)Ref0.29
 CNN4b62.9<.0010.7791.3 (80.2 to 95.4)1.046.1 (40.0 to 52.6)<.0010.37

Two-sided McNemar’s test. ACCSS = Anal Cancer Screening Study; AIN2+ = anal intraepithelial neoplasia grade 2 or worse; AUC = area under the curve; CIN3+ = cervical intraepithelial neoplasia grade 3 or worse; DS = dual stain; CNN = convolutional neural network; Ref = referent.

CNN4 cutoff for Biopsy: 3 or more cells; CNN4 cutoff for Anal: 3 or more cells.

Receiver operating curve characteristics analysis of number of dual stain (DS)-positive cells detected by CYTOREADER for detection of cervical precancer in the Biopsy Study and anal precancer in the Anal Cancer Screening Study. AUC = area under the curve; AIN2+ = anal intraepithelial neoplasia grade 2 or worse; CIN3+ = cervical intraepithelial neoplasia grade 3 or worse. Accuracy for cervical and anal precancer based on manual and automated detection of DS-positive cells on ThinPrep slides in a colposcopy population (Biopsy Study, N = 409) and an anoscopy population (ACSS, N = 299) Two-sided McNemar’s test. ACCSS = Anal Cancer Screening Study; AIN2+ = anal intraepithelial neoplasia grade 2 or worse; AUC = area under the curve; CIN3+ = cervical intraepithelial neoplasia grade 3 or worse; DS = dual stain; CNN = convolutional neural network; Ref = referent. CNN4 cutoff for Biopsy: 3 or more cells; CNN4 cutoff for Anal: 3 or more cells. In the ACSS validation set with 69 AIN2+, the AUC for detecting AIN2+ based on automated evaluation of DS slides with CNN4 was 0.77 (Figure 2). At a cutoff of 3 DS-positive cells, the positivity of the CNN4 algorithm was lower (63% vs 71%, respectively, P = .001) with comparable sensitivity (P = 1.0) and higher specificity compared with manual DS (36.1% vs 46.1%, respectively, P = .001) (Table 1).

Automated Detection of DS-Positive Cells in an HPV Screening Population

We developed the deep-learning algorithm for SurePath slides using a training set of 238 slides from the Kaiser study with 8215 DS-positive and 9739 DS-negative tiles and applied it in an independent validation set of slides from 3095 women. We observed an increase of DS-positive cells with increasing severity of cytology and histology (Supplementary Figure 3, available online). In the Kaiser validation study including 218 CIN3+, the AUC for detecting CIN3+ based on automated evaluation of DS slides was 0.82 (Figure 3). At a cutoff of 2 cells, the positivity of the algorithm was statistically significantly lower (42% vs 50%, respectively, P < .001) with equal sensitivity (P = .4) but statistically significantly higher specificity (61.5% vs 52.6%, respectively, P < .001) compared with the manual DS. At a cutoff of 100 cells, accuracy approached HSIL cytology that allows for immediate treatment according to current management guidelines (Figure 3). Automated DS provided better risk stratification compared with Pap cytology and manual DS (Figure 4): more women were reassured of a lower risk compared with the other strategies (58% for automated DS vs 50% for manual DS and 40% for cytology), and risk among positives was higher.
Figure 3.

Receiver operating curve characteristics analysis of number of dual stain (DS)-positive cells detected by CYTOREADER for detection of cervical precancer in a human papillomavirus screening population in Kaiser Permanente Northern California. ASC-US = Atypical Squamous Cells of Undetermined Significance; AUC = area under the curve; HSIL = high-grade squamous intraepithelial lesions.

Figure 4.

Absolute risk of precancer for Pap cytology, manual dual stain (DS), and automated DS. ASCUS+= positive for Atypical Squamous Cells of Undetermined Significance or greater cytology results. The dotted lines show clinical action risk thresholds for colposcopy referral (4% risk) and immediate treatment (50% risk).

Receiver operating curve characteristics analysis of number of dual stain (DS)-positive cells detected by CYTOREADER for detection of cervical precancer in a human papillomavirus screening population in Kaiser Permanente Northern California. ASC-US = Atypical Squamous Cells of Undetermined Significance; AUC = area under the curve; HSIL = high-grade squamous intraepithelial lesions. Absolute risk of precancer for Pap cytology, manual dual stain (DS), and automated DS. ASCUS+= positive for Atypical Squamous Cells of Undetermined Significance or greater cytology results. The dotted lines show clinical action risk thresholds for colposcopy referral (4% risk) and immediate treatment (50% risk).

Clinical Efficiency of Automated DS Evaluation

We compared the clinical efficiency of Pap cytology, manual DS, and automated DS at 2 cutoffs (2 or more cells and 1 or more cells) for triage of HPV-positive women (Table 2). All DS strategies achieved equal or better sensitivity for detection of CIN3+ compared with Pap cytology while reducing unnecessary colposcopic referrals. Automated DS reduced overall referral to colposcopy by one-third for the primary automated cutoff of 2 cells (41.9% for automated DS vs 60.1% for cytology, P < .001). Automated DS at a cutoff of 2 or more cells had similar sensitivity but statistically significantly higher specificity compared with manual DS evaluation (61.5% vs 52.6%, P < .001). Automated DS detection at a cutoff of 1 or more cells achieved the highest sensitivity of all strategies, with statistically significantly higher specificity and lower colposcopy referral compared with Pap cytology. Automated DS at a cutoff of 2 or more cells had the most favorable ratio of colposcopies per CIN3+ detected compared with the least favorable ratio for the current standard, Pap cytology (6.8 vs 9.9, respectively). Extrapolating this to the full Kaiser screening population, out of 300 000 women screened annually, approximately 30 000 would test HPV-positive and more than 18 000 would be referred to colposcopy using the current approach with Pap cytology, while only 12 570 would be referred to colposcopy using automated DS. We also estimated the performance of automated DS in a fully vaccinated population. Similar to the overall evaluation, automated DS showed equal sensitivity and lower colposcopy referral compared with Pap cytology with even higher specificity (Supplementary Table 1, available online).
Table 2.

Accuracy for cervical precancer based on Pap cytology and manual and automated detection of DS-positive cells on SurePath slides in the Kaiser Validation Study (N = 3095)

EvaluationColposcopy referral, No. (%) P a (cytology/manual DS)Sensitivity, % (95% CI) P a (cytology/manual DS)Specificity, % (95% CI) P a (cytology/manual DS)PPV, % (95% CI) P b (cytology/ manual DS)NPV, % (95% CI) P b (cytology/manual DS)Colposcopies per CIN3+ detected, No. (95% CI)
Pap cytology (188 CIN3+)1860 (60.1)Ref85.8 (81.2 to 90.5)Ref41.9 (40.1 to 43.7)Ref10.1 (8.7 to 11.5)Ref97.5 (96.6 to 98.3)Ref9.9 (8.7 to 11.3)
Manual DS (197 CIN3+)1536 (49.6)<.001/ Ref90.0 (86.0 to 93.9).2/Ref52.6 (50.8 to 54.5)<.001/ Ref12.6 (11.0 to 14.3)<.001/ Ref98.6 (98.0 to 99.2).02/Ref7.8 (6.8 to 8.9)
Automated DS (2 cells) (192 CIN3+)1298 (41.9)<.001/ <.00188.1 (82.5 to 91.7).6/.461.5 (59.7 to 63.3)<.001/ <.00114.8 (12.9 to 16.8)<.001/ <.00198.5 (97.8 to 99.0).03/.86.8 (5.9 to 7.7)
Automated DS (1 cell) (201 CIN3+)1741 (56.3).007/ <.00191.8 (87.3 to 95.1).05/.546.5 (44.6 to 48.3).06/ <.00111.5 (10.1 to 13.1).002/ <.00198.7 (97.9 to 99.2).01/.58.7 (7.6 to 9.9)

Two-sided McNemar’s test. CIN3+ = cervical intraepithelial neoplasia grade 3 or worse; DS = dual stain; NPV = negative predictive value; PPV = positive predictive value; Ref = referent.

Two-sided generalized score statistic.

Accuracy for cervical precancer based on Pap cytology and manual and automated detection of DS-positive cells on SurePath slides in the Kaiser Validation Study (N = 3095) Two-sided McNemar’s test. CIN3+ = cervical intraepithelial neoplasia grade 3 or worse; DS = dual stain; NPV = negative predictive value; PPV = positive predictive value; Ref = referent. Two-sided generalized score statistic.

Discussion

Using a rigorous study design, we developed a novel deep-learning–based image analysis platform for automated evaluation of DS cytology. In a large population of women undergoing HPV-based cervical cancer screening, we show that automated evaluation of DS slides dramatically increases the efficiency of cervical cancer screening by substantially reducing unnecessary colposcopies compared with current standards and similarly achieves excellent performance in a simulated fully vaccinated population. Thus, CYTOREADER exceeds human diagnostic accuracy and serves as an example of AI achieving improvements beyond the automation of a human standard. Our results demonstrate how automation and machine learning can transform cervical cancer screening that is currently undergoing major changes. HPV testing for cervical cancer screening is an objective and reliable approach directly linked to the carcinogenic process (28). HPV-negative women are at very low risk of developing precancer or cancer over the next decade and screening intervals can be extended (8–10). Yet most HPV infections are transient, and women require additional tests to decide who needs further evaluation or treatment (11, 12). Pap cytology is recommended and approved for triage of HPV-positive women but suffers from subjectivity, lack of reproducibility, and relatively low sensitivity (14). Our previous study comparing manual DS to cytology together with the current results demonstrates that automated DS evaluation can supplant and improve the role of Pap cytology for triage of HPV-positive women and should also be evaluated for postcolposcopy and posttreatment surveillance (16). Compared with Pap cytology, manual DS has higher accuracy and can provide longer reassurance against disease when a test is negative, while the risks to patients do not differ from Pap cytology, because the same sample type is used (17, 21). We previously showed that the few DS-negative CIN3s are more likely to have no HPV16/18 and no high-grade cytology, suggesting that these cases are less likely to progress (16). Automated DS evaluation can provide a completely objective cervical cancer–screening approach, improving efficiency and reducing harms and cost related to false-positive screening results. Furthermore, by demonstrating that AI-based DS detection works for anal cytology, we show the robustness of the imaging and analysis platform. Importantly, our approach is also suited for vaccinated populations, where it may achieve even higher specificity and counterbalance the lower disease prevalence in vaccinated women (29). Automated DS evaluation immediately quantifies the number of DS-positive cells on a slide, allowing tailoring positivity cutoffs for specific clinical decisions. Current guidelines give an option for immediate treatment in women with HSIL cytology, who have a very high probability of having underlying CIN3+ (30). A higher cutoff of DS-positive cells could be used to guide treatment decisions. Moving forward, additional criteria can be developed to expand slide assessment; for example, the presence of abnormal glandular cells to identify adenocarcinoma precursors, which is a particular challenge for Pap cytology (31). Digitization of glass slides paired with automated evaluation in the cloud can provide high-throughput triage of HPV-positive women with inherent objectivity. Furthermore, the functionality of CYTOREADER can provide an assisted diagnostics mode for evaluating DS slides. The automatic algorithm can be used for presenting all DS-positive cells found on a slide ranked by the likelihood that a cell is DS-positive to accelerate slide evaluation. Similarly, CYTOREADER can be used for quality control of a program that is based on manual DS evaluation. Successful implementation of CYTOREADER requires an infrastructure for high-quality staining, full-slide scanning, and running the machine-learning algorithm. However, slide preparation, scanning, and slide evaluation can be geographically separated, providing high-quality cervical cancer screening and triage in locations that currently do not have infrastructure and training to achieve reliable DS evaluation given a reliable courier system is available. Compared with manual evaluation of DS slides, the automated evaluation requires access to scanning infrastructure but may require a smaller cytotechnology workforce. Scanners are increasingly available in pathology laboratories and can process large batches of slides with limited need for a skilled operator (22, 23). Studies are warranted to evaluate if DS is amenable to self-collected specimens, a sampling strategy that is important for low-resource settings. Future efforts also need to evaluate how long a negative automated DS result provides reassurance against precancer and how automated DS can be used in women undergoing surveillance. We conducted a large, well-powered study to evaluate performance of automated DS for triage of HPV-positive women. However, some limitations should be noted. In contrast to the large KPNC study on HPV triage based on SurePath slides, 2 studies using ThinPrep slides were comparably small, and they were conducted in colposcopy/anoscopy populations. Future studies need to evaluate automated DS in larger HPV screening populations using ThinPrep slides. Also, the positivity and sensitivity of cytology at KPNC is much higher compared with other settings, which may affect the comparison of clinical efficiency estimates. Our approach to train and validate both on the tile level and the slide level with ground truth disease endpoints sets our work apart from other deep-learning approaches in digital pathology that focus on replicating a subjective evaluation. We recognize that there is substantial subjectivity underlying histologic endpoints of cervical disease (15). In our study, we minimized the impact by relying on the most reproducible correlate of cervical precancer, CIN3, as our primary endpoint for evaluation of triage of HPV-positive women. Our work also emphasizes the importance of integrating epidemiology and AI with the availability of population bases studies to improve medical diagnostics beyond automation. It has been proposed for a long time that “digital pathology” will become an important cornerstone of future health care. Despite this vision, image analysis currently does not contribute substantially to routine clinical practice and to the benefit of the patient. The automated evaluation of DS cytology slides has substantially improved accuracy and efficiency compared with Pap cytology and serves as an important example for introducing digital pathology and deep learning into clinical practice. This approach has the potential to substantially improve screening program performance, potentially affecting millions of women testing HPV-positive in cervical cancer screening each year.

Funding

This work was supported by the Intramural Research Program of the US National Cancer Institute, National Institutes of Health, Department of Health and Human Services.

Notes

Role of the funder: The funder had no role in the design, of the study; the collection, analysis, and interpretation of the data; the writing of the manuscript; and the decision to submit the manuscript for publication. Conflict of interest: Drs. Wentzensen and Schiffman are employed by the National Cancer Institute (NCI), which has received cervical cancer screening assays in-kind or at reduced cost from BD and Roche for studies that Drs. Wentzensen and Schiffman are conducting. Dr Goldhoff reported receiving grants from the National Institutes of Health (NIH)/NCI during the conduct of the study. Dr Castle reported receiving cervical screening tests and diagnostics from Roche, Becton Dickinson, Cepheid, and Arbor Vita Corp at a reduced cost or no cost for research. Dr Kingery reported receiving grants from the NIH and the NCI during the conduct of the study and receiving grants from the NIH and the NCI outside the submitted work. Dr Grewal reported receiving grants from the NIH and the NCI during the conduct of the study and grants from the NIH and the NCI outside the submitted work. Dr Lorey reported receiving grants from the NIH and the NCI during the conduct of the study and grants from the NIH and the NCI outside the submitted work. No other disclosures were reported. Data accessibility statement: The code is available at: https://github.com/stcmedhub/dual_stain_dl. Author contributions: All authors contributed substantially to the conception and design of the study, the acquisition of data, or the analysis and interpretation. All authors drafted or provided critical revision of the article and provided final approval of the version to publish. Click here for additional data file.
  31 in total

Review 1.  Human papillomavirus testing in the prevention of cervical cancer.

Authors:  Mark Schiffman; Nicolas Wentzensen; Sholom Wacholder; Walter Kinney; Julia C Gage; Philip E Castle
Journal:  J Natl Cancer Inst       Date:  2011-01-31       Impact factor: 13.506

Review 2.  Eurogin roadmap 2017: Triage strategies for the management of HPV-positive women in cervical screening programs.

Authors:  Kate Cuschieri; Guglielmo Ronco; Attila Lorincz; Laurie Smith; Gina Ogilvie; Lisa Mirabello; Francesca Carozzi; Heather Cubie; Nicolas Wentzensen; Peter Snijders; Marc Arbyn; Joe Monsonego; Silvia Franceschi
Journal:  Int J Cancer       Date:  2018-02-08       Impact factor: 7.396

3.  p16/Ki-67 Dual Stain Cytology for Detection of Cervical Precancer in HPV-Positive Women.

Authors:  Nicolas Wentzensen; Barbara Fetterman; Philip E Castle; Mark Schiffman; Shannon N Wood; Eric Stiemerling; Diane Tokugawa; Clara Bodelon; Nancy Poitras; Thomas Lorey; Walter Kinney
Journal:  J Natl Cancer Inst       Date:  2015-09-15       Impact factor: 13.506

4.  Human papillomavirus genotyping, human papillomavirus mRNA expression, and p16/Ki-67 cytology to detect anal cancer precursors in HIV-infected MSM.

Authors:  Nicolas Wentzensen; Stephen Follansbee; Sylvia Borgonovo; Diane Tokugawa; Lauren Schwartz; Thomas S Lorey; Vikrant V Sahasrabuddhe; Brandon Lamere; Julia C Gage; Barbara Fetterman; Teresa M Darragh; Philip E Castle
Journal:  AIDS       Date:  2012-11-13       Impact factor: 4.177

5.  Performance of p16/Ki-67 immunostaining to detect cervical cancer precursors in a colposcopy referral population.

Authors:  Nicolas Wentzensen; Lauren Schwartz; Rosemary E Zuna; Katie Smith; Cara Mathews; Michael A Gold; R Andy Allen; Roy Zhang; S Terence Dunn; Joan L Walker; Mark Schiffman
Journal:  Clin Cancer Res       Date:  2012-06-06       Impact factor: 12.531

6.  Reassurance against future risk of precancer and cancer conferred by a negative human papillomavirus test.

Authors:  Julia C Gage; Mark Schiffman; Hormuzd A Katki; Philip E Castle; Barbara Fetterman; Nicolas Wentzensen; Nancy E Poitras; Thomas Lorey; Li C Cheung; Walter K Kinney
Journal:  J Natl Cancer Inst       Date:  2014-07-18       Impact factor: 13.506

Review 7.  Triage of HPV positive women in cervical cancer screening.

Authors:  Nicolas Wentzensen; Mark Schiffman; Timothy Palmer; Marc Arbyn
Journal:  J Clin Virol       Date:  2015-11-28       Impact factor: 3.168

8.  Prediction of Allogeneic Hematopoietic Stem-Cell Transplantation Mortality 100 Days After Transplantation Using a Machine Learning Algorithm: A European Group for Blood and Marrow Transplantation Acute Leukemia Working Party Retrospective Data Mining Study.

Authors:  Roni Shouval; Myriam Labopin; Ori Bondi; Hila Mishan-Shamay; Avichai Shimoni; Fabio Ciceri; Jordi Esteve; Sebastian Giebel; Norbert C Gorin; Christoph Schmid; Emmanuelle Polge; Mahmoud Aljurf; Nicolaus Kroger; Charles Craddock; Andrea Bacigalupo; Jan J Cornelissen; Frederic Baron; Ron Unger; Arnon Nagler; Mohamad Mohty
Journal:  J Clin Oncol       Date:  2015-08-03       Impact factor: 44.544

Review 9.  Cervical cancer screening following prophylactic human papillomavirus vaccination.

Authors:  Eduardo L Franco; Jack Cuzick
Journal:  Vaccine       Date:  2008-03-14       Impact factor: 3.641

Review 10.  Artificial intelligence in cancer imaging: Clinical challenges and applications.

Authors:  Wenya Linda Bi; Ahmed Hosny; Matthew B Schabath; Maryellen L Giger; Nicolai J Birkbak; Alireza Mehrtash; Tavis Allison; Omar Arnaout; Christopher Abbosh; Ian F Dunn; Raymond H Mak; Rulla M Tamimi; Clare M Tempany; Charles Swanton; Udo Hoffmann; Lawrence H Schwartz; Robert J Gillies; Raymond Y Huang; Hugo J W L Aerts
Journal:  CA Cancer J Clin       Date:  2019-02-05       Impact factor: 508.702

View more
  17 in total

Review 1.  False Negative Results in Cervical Cancer Screening-Risks, Reasons and Implications for Clinical Practice and Public Health.

Authors:  Anna Macios; Andrzej Nowakowski
Journal:  Diagnostics (Basel)       Date:  2022-06-20

2.  The Improving Risk Informed HPV Screening (IRIS) Study: Design and Baseline Characteristics.

Authors:  Julia C Gage; Tina Raine-Bennett; Mark Schiffman; Megan A Clarke; Li C Cheung; Nancy E Poitras; Nicole E Varnado; Hormuzd A Katki; Philip E Castle; Brian Befano; Malini Chandra; Greg Rydzak; Thomas Lorey; Nicolas Wentzensen
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2021-11-17       Impact factor: 4.090

3.  Bayesian deep learning for reliable oral cancer image classification.

Authors:  Bofan Song; Sumsum Sunny; Shaobai Li; Keerthi Gurushanth; Pramila Mendonca; Nirza Mukhia; Sanjana Patrick; Shubha Gurudath; Subhashini Raghavan; Imchen Tsusennaro; Shirley T Leivon; Trupti Kolur; Vivek Shetty; Vidya R Bushan; Rohan Ramesh; Tyler Peterson; Vijay Pillai; Petra Wilder-Smith; Alben Sigamani; Amritha Suresh; Moni Abraham Kuriakose; Praveen Birur; Rongguang Liang
Journal:  Biomed Opt Express       Date:  2021-09-20       Impact factor: 3.562

4.  Designing low-cost, accurate cervical screening strategies that take into account COVID-19: a role for self-sampled HPV typing2.

Authors:  Kayode Olusegun Ajenifuja; Jerome Belinson; Andrew Goldstein; Kanan T Desai; Silvia de Sanjose; Mark Schiffman
Journal:  Infect Agent Cancer       Date:  2020-10-14       Impact factor: 2.965

5.  Novel concepts in cervical cancer screening: a comparison of VIA, HPV DNA test and p16INK4a/Ki-67 dual stain cytology in Western Kenya.

Authors:  Elkanah Omenge Orang'o; Edwin Were; Oliver Rode; Kapten Muthoka; Michael Byczkowski; Heike Sartor; Davy Vanden Broeck; Dietmar Schmidt; Miriam Reuschenbach; Magnus von Knebel Doeberitz; Hermann Bussmann
Journal:  Infect Agent Cancer       Date:  2020-10-02       Impact factor: 2.965

6.  Deep-Learning-Based Evaluation of Dual Stain Cytology for Cervical Cancer Screening: A New Paradigm.

Authors:  Annekathryn Goodman
Journal:  J Natl Cancer Inst       Date:  2021-11-02       Impact factor: 13.506

7.  The Orderly Incorporation of Continuing Technologic Advances Into Cervical Cancer Screening.

Authors:  Mark Schiffman; Nicolas Wentzensen
Journal:  J Natl Cancer Inst       Date:  2021-03-01       Impact factor: 13.506

Review 8.  Artificial intelligence in oncology: Path to implementation.

Authors:  Isaac S Chua; Michal Gaziel-Yablowitz; Zfania T Korach; Kenneth L Kehl; Nathan A Levitan; Yull E Arriaga; Gretchen P Jackson; David W Bates; Michael Hassett
Journal:  Cancer Med       Date:  2021-05-07       Impact factor: 4.452

9.  STRIDES - STudying Risk to Improve DisparitiES in Cervical Cancer in Mississippi - Design and baseline results of a Statewide Cohort Study.

Authors:  Carolann Risley; Mary W Stewart; Kim R Geisinger; Laree M Hiser; Jody C Morgan; Kenyata J Owens; Krishna Ayyalasomayajula; Rhonda M Rives; Ashish Jannela; Dianne E Grunes; Lei Zhang; Mark Schiffman; Nicolas Wentzensen; Megan A Clarke
Journal:  Prev Med       Date:  2021-07-20       Impact factor: 4.018

10.  The Application of Artificial Intelligence-Assisted Colposcopy in a Tertiary Care Hospital within a Cervical Pathology Diagnostic Unit.

Authors:  Aleksandra Zimmer-Stelmach; Jan Zak; Agata Pawlosek; Anna Rosner-Tenerowicz; Joanna Budny-Winska; Michal Pomorski; Tomasz Fuchs; Mariusz Zimmer
Journal:  Diagnostics (Basel)       Date:  2022-01-04
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.