OBJECTIVE: To identify Common Data Elements (CDEs) in eligibility criteria of multiple clinical trials studying the same disease using a human-computer collaborative approach. DESIGN: A set of free-text eligibility criteria from clinical trials on two representative diseases, breast cancer and cardiovascular diseases, was sampled to identify disease-specific eligibility criteria CDEs. In this proposed approach, a semantic annotator is used to recognize Unified Medical Language Systems (UMLSs) terms within the eligibility criteria text. The Apriori algorithm is applied to mine frequent disease-specific UMLS terms, which are then filtered by a list of preferred UMLS semantic types, grouped by similarity based on the Dice coefficient, and, finally, manually reviewed. MEASUREMENTS: Standard precision, recall, and F-score of the CDEs recommended by the proposed approach were measured with respect to manually identified CDEs. RESULTS: Average precision and recall of the recommended CDEs for the two diseases were 0.823 and 0.797, respectively, leading to an average F-score of 0.810. In addition, the machine-powered CDEs covered 80% of the cardiovascular CDEs published by The American Heart Association and assigned by human experts. CONCLUSION: It is feasible and effort saving to use a human-computer collaborative approach to augment domain experts for identifying disease-specific CDEs from free-text clinical trial eligibility criteria.
OBJECTIVE: To identify Common Data Elements (CDEs) in eligibility criteria of multiple clinical trials studying the same disease using a human-computer collaborative approach. DESIGN: A set of free-text eligibility criteria from clinical trials on two representative diseases, breast cancer and cardiovascular diseases, was sampled to identify disease-specific eligibility criteria CDEs. In this proposed approach, a semantic annotator is used to recognize Unified Medical Language Systems (UMLSs) terms within the eligibility criteria text. The Apriori algorithm is applied to mine frequent disease-specific UMLS terms, which are then filtered by a list of preferred UMLS semantic types, grouped by similarity based on the Dice coefficient, and, finally, manually reviewed. MEASUREMENTS: Standard precision, recall, and F-score of the CDEs recommended by the proposed approach were measured with respect to manually identified CDEs. RESULTS: Average precision and recall of the recommended CDEs for the two diseases were 0.823 and 0.797, respectively, leading to an average F-score of 0.810. In addition, the machine-powered CDEs covered 80% of the cardiovascular CDEs published by The American Heart Association and assigned by human experts. CONCLUSION: It is feasible and effort saving to use a human-computer collaborative approach to augment domain experts for identifying disease-specific CDEs from free-text clinical trial eligibility criteria.
Authors: C P Cannon; A Battler; R G Brindis; J L Cox; S G Ellis; N R Every; J T Flaherty; R A Harrington; H M Krumholz; M L Simoons; F J Van De Werf; W S Weintraub; K R Mitchell; S L Morrisson; R G Brindis; H V Anderson; D S Cannom; W R Chitwood; J E Cigarroa; R L Collins-Nakai; S G Ellis; R J Gibbons; F L Grover; P A Heidenreich; B K Khandheria; S B Knoebel; H L Krumholz; D J Malenka; D B Mark; C R Mckay; E R Passamani; M J Radford; R N Riner; J B Schwartz; R E Shaw; R J Shemin; D B Van Fossen; E D Verrier; M W Watkins; D R Phoubandith; T Furnelli Journal: J Am Coll Cardiol Date: 2001-12 Impact factor: 24.094
Authors: Samson W Tu; Mor Peleg; Simona Carini; Michael Bobak; Jessica Ross; Daniel Rubin; Ida Sim Journal: J Biomed Inform Date: 2010-09-17 Impact factor: 6.317
Authors: Ryan R Brinkman; Mélanie Courtot; Dirk Derom; Jennifer M Fostel; Yongqun He; Phillip Lord; James Malone; Helen Parkinson; Bjoern Peters; Philippe Rocca-Serra; Alan Ruttenberg; Susanna-Assunta Sansone; Larisa N Soldatova; Christian J Stoeckert; Jessica A Turner; Jie Zheng Journal: J Biomed Semantics Date: 2010-06-22
Authors: Ashokkumar A Patel; André Kajdacsy-Balla; Jules J Berman; Maarten Bosland; Milton W Datta; Rajiv Dhir; John Gilbertson; Jonathan Melamed; Jan Orenstein; Kuei-Fang Tai; Michael J Becich Journal: BMC Cancer Date: 2005-08-21 Impact factor: 4.430
Authors: Amy Y Wang; William J Lancaster; Matthew C Wyatt; Luke V Rasmussen; Daniel G Fort; James J Cimino Journal: AMIA Annu Symp Proc Date: 2018-04-16