Giulio Napolitano1, Colin Fox, Richard Middleton, David Connolly. 1. Northern Ireland Cancer Registry, Centre for Public Health, Queen's University of Belfast, Mulhouse Building, Grosvenor Road, Belfast BT12 6BJ, Northern Ireland, UK. g.napolitano@qub.ac.uk
Abstract
OBJECTIVE: To evaluate precision and recall rates for the automatic extraction of information from free-text pathology reports. To assess the impact that implementation of pattern-based methods would have on cancer registration completeness. METHOD: Over 300,000 electronic pathology reports were scanned for the extraction of Gleason score, Clark level and Breslow depth, by a number of Perl routines progressively enhanced by a trial-and-error method. An additional test set of 915 reports potentially containing Gleason score was used for evaluation. RESULTS: Values for recall and precision of over 98 and 99%, respectively, were easily reached. Potential increase in cancer staging completeness of up to 32% was proved. CONCLUSIONS: In cancer registration, simple pattern matching applied to free-text documents can be effectively used to improve completeness and accuracy of pathology information.
OBJECTIVE: To evaluate precision and recall rates for the automatic extraction of information from free-text pathology reports. To assess the impact that implementation of pattern-based methods would have on cancer registration completeness. METHOD: Over 300,000 electronic pathology reports were scanned for the extraction of Gleason score, Clark level and Breslow depth, by a number of Perl routines progressively enhanced by a trial-and-error method. An additional test set of 915 reports potentially containing Gleason score was used for evaluation. RESULTS: Values for recall and precision of over 98 and 99%, respectively, were easily reached. Potential increase in cancer staging completeness of up to 32% was proved. CONCLUSIONS: In cancer registration, simple pattern matching applied to free-text documents can be effectively used to improve completeness and accuracy of pathology information.
Authors: John D Osborne; Matthew Wyatt; Andrew O Westfall; James Willig; Steven Bethard; Geoff Gordon Journal: J Am Med Inform Assoc Date: 2016-03-28 Impact factor: 4.497
Authors: Anobel Y Odisho; Briton Park; Nicholas Altieri; John DeNero; Matthew R Cooperberg; Peter R Carroll; Bin Yu Journal: JAMIA Open Date: 2020-10-14
Authors: Joseph Ross Mitchell; Phillip Szepietowski; Rachel Howard; Phillip Reisman; Jennie D Jones; Patricia Lewis; Brooke L Fridley; Dana E Rollison Journal: J Med Internet Res Date: 2022-03-23 Impact factor: 7.076
Authors: Okechinyere J Achilonu; Elvira Singh; Gideon Nimako; René M J C Eijkemans; Eustasius Musenge Journal: Biomed Res Int Date: 2022-01-20 Impact factor: 3.411