Guan N Guo1,2, Jitendra Jonnagaddala1,2, Sanjay Farshid1, Vojtech Huser3, Christian Reich4, Siaw-Teng Liaw1,2. 1. School of Public Health and Community Medicine, University of New South Wales, Sydney, Australia. 2. WHO Collaborating Centre for eHealth, University of New South Wales, Sydney, Australia. 3. Lister Hill National Centre for Biomedical Communications, National Library of Medicine National Institutes of Health, Bethesda, Maryland, USA. 4. IQVIA, Cambridge, Massachusetts, USA.
Abstract
OBJECTIVE: Electronic health records are increasingly utilized for observational and clinical research. Identification of cohorts using electronic health records is an important step in this process. Previous studies largely focused on the methods of cohort selection, but there is little evidence on the impact of underlying vocabularies and mappings between vocabularies used for cohort selection. We aim to compare the cohort selection performance using Australian Medicines Terminology to Anatomical Therapeutic Chemical (ATC) mappings from 2 different sources. These mappings were taken from the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) and the Pharmaceutical Benefits Scheme (PBS) schedule. MATERIALS AND METHODS: We retrieved patients from the electronic Practice Based Research Network data repository using 3 ATC classification groups (A10, N02A, N06A). The retrieved patients were further verified manually and pooled to form a reference standard which was used to assess the accuracy of mappings using precision, recall, and F measure metrics. RESULTS: The OMOP-CDM mappings identified 2.6%, 15.2%, and 24.4% more drugs than the PBS mappings in the A10, N02A and N06A groups respectively. Despite this, the PBS mappings generally performed the same in cohort selection as OMOP-CDM mappings except for the N02A Opioids group, where a significantly greater number of patients were retrieved. Both mappings exhibited variable recall, but perfect precision, with all drugs found to be correctly identified. CONCLUSION: We found that 1 of the 3 ATC groups had a significant difference and this affected cohort selection performance. Our findings highlighted that underlying terminology mappings can greatly impact cohort selection accuracy. Clinical researchers should carefully evaluate vocabulary mapping sources including methodologies used to develop those mappings.
OBJECTIVE: Electronic health records are increasingly utilized for observational and clinical research. Identification of cohorts using electronic health records is an important step in this process. Previous studies largely focused on the methods of cohort selection, but there is little evidence on the impact of underlying vocabularies and mappings between vocabularies used for cohort selection. We aim to compare the cohort selection performance using Australian Medicines Terminology to Anatomical Therapeutic Chemical (ATC) mappings from 2 different sources. These mappings were taken from the Observational Medical Outcomes Partnership Common Data Model (OMOP-CDM) and the Pharmaceutical Benefits Scheme (PBS) schedule. MATERIALS AND METHODS: We retrieved patients from the electronic Practice Based Research Network data repository using 3 ATC classification groups (A10, N02A, N06A). The retrieved patients were further verified manually and pooled to form a reference standard which was used to assess the accuracy of mappings using precision, recall, and F measure metrics. RESULTS: The OMOP-CDM mappings identified 2.6%, 15.2%, and 24.4% more drugs than the PBS mappings in the A10, N02A and N06A groups respectively. Despite this, the PBS mappings generally performed the same in cohort selection as OMOP-CDM mappings except for the N02A Opioids group, where a significantly greater number of patients were retrieved. Both mappings exhibited variable recall, but perfect precision, with all drugs found to be correctly identified. CONCLUSION: We found that 1 of the 3 ATC groups had a significant difference and this affected cohort selection performance. Our findings highlighted that underlying terminology mappings can greatly impact cohort selection accuracy. Clinical researchers should carefully evaluate vocabulary mapping sources including methodologies used to develop those mappings.
Keywords:
Australian Medicines Terminology; anatomical therapeutic chemical classification; clinical trials; cohort selection; electronic health records
Authors: Rachel E Sherman; Steven A Anderson; Gerald J Dal Pan; Gerry W Gray; Thomas Gross; Nina L Hunter; Lisa LaVange; Danica Marinac-Dabic; Peter W Marks; Melissa A Robb; Jeffrey Shuren; Robert Temple; Janet Woodcock; Lilly Q Yue; Robert M Califf Journal: N Engl J Med Date: 2016-12-08 Impact factor: 91.245
Authors: Sheng Yu; Yumeng Ma; Jessica Gronsbell; Tianrun Cai; Ashwin N Ananthakrishnan; Vivian S Gainer; Susanne E Churchill; Peter Szolovits; Shawn N Murphy; Isaac S Kohane; Katherine P Liao; Tianxi Cai Journal: J Am Med Inform Assoc Date: 2018-01-01 Impact factor: 4.497
Authors: Agnes I Vitry; Elizabeth E Roughead; Adrian K Preiss; Philip Ryan; Emmae N Ramsay; Andrew L Gilbert; Gillian E Caughey; Sepehr Shakib; Adrian Esterman; Ying Zhang; Robyn A McDermott Journal: PLoS One Date: 2010-11-17 Impact factor: 3.240
Authors: Maria C S Inacio; Craig Hansen; Nicole L Pratt; Stephen E Graves; Elizabeth E Roughead Journal: BMJ Open Date: 2016-04-29 Impact factor: 2.692
Authors: Siaw-Teng Liaw; Harshana Liyanage; Craig Kuziemsky; Amanda L Terry; Richard Schreiber; Jitendra Jonnagaddala; Simon de Lusignan Journal: Yearb Med Inform Date: 2020-04-17