John J Frazier1, Corey D Stein1, Eugene Tseytlin1, Tanja Bekhuis1. 1. jjf60@pitt.edu , Fellow of the American Academy of Oral and Maxillofacial Pathology, Diplomate of the American Board of Oral and Maxillofacial Pathology, and National Library of Medicine Fellow; cds51@pitt.edu , Researcher; tseytlin@pitt.edu , Systems Developer; (Featured), tcb24@pitt.edu , Assistant Professor, Department of Biomedical Informatics and Department of Dental Public Health; School of Medicine and School of Dental Medicine, University of Pittsburgh, 5607 Baum Boulevard, Suite 514, Pittsburgh, PA 15206-3701.
Abstract
OBJECTIVE: To support clinical researchers, librarians and informationists may need search filters for particular tasks. Development of filters typically depends on a "gold standard" dataset. This paper describes generalizable methods for creating a gold standard to support future filter development and evaluation using oral squamous cell carcinoma (OSCC) as a case study. OSCC is the most common malignancy affecting the oral cavity. Investigation of biomarkers with potential prognostic utility is an active area of research in OSCC. The methods discussed here should be useful for designing quality search filters in similar domains. METHODS: The authors searched MEDLINE for prognostic studies of OSCC, developed annotation guidelines for screeners, ran three calibration trials before annotating the remaining body of citations, and measured inter-annotator agreement (IAA). RESULTS: We retrieved 1,818 citations. After calibration, we screened the remaining citations (n = 1,767; 97.2%); IAA was substantial (kappa = 0.76). The dataset has 497 (27.3%) citations representing OSCC studies of potential prognostic biomarkers. CONCLUSIONS: The gold standard dataset is likely to be high quality and useful for future development and evaluation of filters for OSCC studies of potential prognostic biomarkers. IMPLICATIONS: The methodology we used is generalizable to other domains requiring a reference standard to evaluate the performance of search filters. A gold standard is essential because the labels regarding relevance enable computation of diagnostic metrics, such as sensitivity and specificity. Librarians and informationists with data analysis skills could contribute to developing gold standard datasets and subsequent filters tuned for their patrons' domains of interest.
OBJECTIVE: To support clinical researchers, librarians and informationists may need search filters for particular tasks. Development of filters typically depends on a "gold standard" dataset. This paper describes generalizable methods for creating a gold standard to support future filter development and evaluation using oral squamous cell carcinoma (OSCC) as a case study. OSCC is the most common malignancy affecting the oral cavity. Investigation of biomarkers with potential prognostic utility is an active area of research in OSCC. The methods discussed here should be useful for designing quality search filters in similar domains. METHODS: The authors searched MEDLINE for prognostic studies of OSCC, developed annotation guidelines for screeners, ran three calibration trials before annotating the remaining body of citations, and measured inter-annotator agreement (IAA). RESULTS: We retrieved 1,818 citations. After calibration, we screened the remaining citations (n = 1,767; 97.2%); IAA was substantial (kappa = 0.76). The dataset has 497 (27.3%) citations representing OSCC studies of potential prognostic biomarkers. CONCLUSIONS: The gold standard dataset is likely to be high quality and useful for future development and evaluation of filters for OSCC studies of potential prognostic biomarkers. IMPLICATIONS: The methodology we used is generalizable to other domains requiring a reference standard to evaluate the performance of search filters. A gold standard is essential because the labels regarding relevance enable computation of diagnostic metrics, such as sensitivity and specificity. Librarians and informationists with data analysis skills could contribute to developing gold standard datasets and subsequent filters tuned for their patrons' domains of interest.
Authors: Nancy L Wilczynski; K Ann McKibbon; Stephen D Walter; Amit X Garg; R Brian Haynes Journal: J Am Med Inform Assoc Date: 2012-09-27 Impact factor: 4.497
Authors: Geert-Jan Geersing; Walter Bouwmeester; Peter Zuithoff; Rene Spijker; Mariska Leeflang; Karel G M Moons; Karel Moons Journal: PLoS One Date: 2012-02-29 Impact factor: 3.240
Authors: Esther M M van de Glind; Barbara C van Munster; René Spijker; Rob J P M Scholten; Lotty Hooft Journal: J Am Med Inform Assoc Date: 2011-09-23 Impact factor: 4.497
Authors: Sophia Ananiadou; Wael Abdelkader; Tamara Navarro; Rick Parrish; Chris Cotoi; Federico Germini; Lori-Ann Linkins; Alfonso Iorio; R Brian Haynes; Lingyang Chu; Cynthia Lokker Journal: JMIR Res Protoc Date: 2021-11-29