| Literature DB >> 32755460 |
Bridie S Thompson1, Sam Hardy2, Nirmala Pandeya1,3, Jean Claude Dusingize1, Adele C Green1,4, Athon Millane3, Daniel Bourke5, Ronald Grande5, Cameron D Bean5, Catherine M Olsen1,6, David C Whiteman1,6.
Abstract
PURPOSE: Keratinocyte cancers are exceedingly common in high-risk populations, but accurate measures of incidence are seldom derived because the burden of manually reviewing pathology reports to extract relevant diagnostic information is excessive. Thus, we sought to develop supervised learning algorithms for classifying basal and squamous cell carcinomas and other diagnoses, as well as disease site, and incorporate these into a Web application capable of processing large numbers of pathology reports.Entities:
Year: 2020 PMID: 32755460 PMCID: PMC7469600 DOI: 10.1200/CCI.19.00152
Source DB: PubMed Journal: JCO Clin Cancer Inform ISSN: 2473-4276
Calculations Used in the Experiment and Validations
Data Fields in Output Data From Pathology Classifier Web Application
Accuracy of Final Algorithm for Diagnosis Classification in Test Split of Training Data Set in Development
Test Results for Accuracy of Algorithm Prediction for Diagnosis
Test Results for Accuracy of Algorithm Prediction for Site
FIG A1.Test results for agreement (F1 score) and discordance of diagnoses between the predicted labels (algorithm derived classification) and true labels (actual diagnosis). Histologic names for labels are detailed in Table A1.
FIG A2.Test results for agreement (F1 score) and discordance of site between the predicted labels (algorithm-predicted site) and true labels (actual site). Anatomic site names for labels are detailed in Table A2.
Accuracy of Classifying at Least One Case of the Diagnosis in Each Report and Agreement Between Algorithm-Derived and Manual Review (gold standard) Sample of Reports in QSkin Study Participants and External Study Participants (STAR study)
Count of Each Diagnosis for Each Person and Agreement Between Algorithm-Derived Extraction and Manual Review (gold standard) From 3 Validation Sources
Accuracy of Classifying at Least One Keratinocyte Cancer at Each Site in a Report and Agreement Between Algorithm-Derived Extraction and Manual Review (gold standard) Sample of Reports