Literature DB >> 25220766

LabeledIn: cataloging labeled indications for human drugs.

Ritu Khare1, Jiao Li2, Zhiyong Lu3.   

Abstract

Drug-disease treatment relationships, i.e., which drug(s) are indicated to treat which disease(s), are among the most frequently sought information in PubMed®. Such information is useful for feeding the Google Knowledge Graph, designing computational methods to predict novel drug indications, and validating clinical information in EMRs. Given the importance and utility of this information, there have been several efforts to create repositories of drugs and their indications. However, existing resources are incomplete. Furthermore, they neither label indications in a structured way nor differentiate them by drug-specific properties such as dosage form, and thus do not support computer processing or semantic interoperability. More recently, several studies have proposed automatic methods to extract structured indications from drug descriptions; however, their performance is limited by natural language challenges in disease named entity recognition and indication selection. In response, we report LabeledIn: a human-reviewed, machine-readable and source-linked catalog of labeled indications for human drugs. More specifically, we describe our semi-automatic approach to derive LabeledIn from drug descriptions through human annotations with aids from automatic methods. As the data source, we use the drug labels (or package inserts) submitted to the FDA by drug manufacturers and made available in DailyMed. Our machine-assisted human annotation workflow comprises: (i) a grouping method to remove redundancy and identify representative drug labels to be used for human annotation, (ii) an automatic method to recognize and normalize mentions of diseases in drug labels as candidate indications, and (iii) a two-round annotation workflow for human experts to judge the pre-computed candidates and deliver the final gold standard. In this study, we focused on 250 highly accessed drugs in PubMed Health, a newly developed public web resource for consumers and clinicians on prevention and treatment of diseases. These 250 drugs corresponded to more than 8000 drug labels (500 unique) in DailyMed in which 2950 candidate indications were pre-tagged by an automatic tool. After being reviewed independently by two experts, 1618 indications were selected, and additional 97 (missed by computer) were manually added, with an inter-annotator agreement of 88.35% as measured by the Kappa coefficient. Our final annotation results in LabeledIn consist of 7805 drug-disease treatment relationships where drugs are represented as a triplet of ingredient, dose form, and strength. A systematic comparison of LabeledIn with an existing computer-derived resource revealed significant discrepancies, confirming the need to involve humans in the creation of such a resource. In addition, LabeledIn is unique in that it contains detailed textual context of the selected indications in drug labels, making it suitable for the development of advanced computational methods for the automatic extraction of indications from free text. Finally, motivated by the studies on drug nomenclature and medication errors in EMRs, we adopted a fine-grained drug representation scheme, which enables the automatic identification of drugs with indications specific to certain dose forms or strengths. Future work includes expanding our coverage to more drugs and integration with other resources. The LabeledIn dataset and the annotation guidelines are available at http://ftp.ncbi.nlm.nih.gov/pub/lu/LabeledIn/. Published by Elsevier Inc.

Entities:  

Keywords:  Corpus annotation; Drug indications; Drug labels; Natural language processing

Mesh:

Year:  2014        PMID: 25220766      PMCID: PMC4260997          DOI: 10.1016/j.jbi.2014.08.004

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  21 in total

1.  Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program.

Authors:  A R Aronson
Journal:  Proc AMIA Symp       Date:  2001

2.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.

Authors:  Thomas C Rindflesch; Marcelo Fiszman
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

3.  Semi-automatic semantic annotation of PubMed queries: a study on quality, efficiency, satisfaction.

Authors:  Aurélie Névéol; Rezarta Islamaj Doğan; Zhiyong Lu
Journal:  J Biomed Inform       Date:  2010-11-20       Impact factor: 6.317

4.  Drug off-target effects predicted using structural analysis in the context of a metabolic network model.

Authors:  Roger L Chang; Li Xie; Lei Xie; Philip E Bourne; Bernhard Ø Palsson
Journal:  PLoS Comput Biol       Date:  2010-09-23       Impact factor: 4.475

5.  Prescribing errors involving medication dosage forms.

Authors:  Timothy S Lesar
Journal:  J Gen Intern Med       Date:  2002-08       Impact factor: 5.128

6.  DrugBank 3.0: a comprehensive resource for 'omics' research on drugs.

Authors:  Craig Knox; Vivian Law; Timothy Jewison; Philip Liu; Son Ly; Alex Frolkis; Allison Pon; Kelly Banco; Christine Mak; Vanessa Neveu; Yannick Djoumbou; Roman Eisner; An Chi Guo; David S Wishart
Journal:  Nucleic Acids Res       Date:  2010-11-08       Impact factor: 16.971

7.  Understanding PubMed user search behavior through log analysis.

Authors:  Rezarta Islamaj Dogan; G Craig Murray; Aurélie Névéol; Zhiyong Lu
Journal:  Database (Oxford)       Date:  2009-11-27       Impact factor: 3.451

8.  Pathway-based drug repositioning using causal inference.

Authors:  Jiao Li; Zhiyong Lu
Journal:  BMC Bioinformatics       Date:  2013-10-22       Impact factor: 3.169

9.  Semantic reclassification of the UMLS concepts.

Authors:  Jung-Wei Fan; Carol Friedman
Journal:  Bioinformatics       Date:  2008-07-13       Impact factor: 6.937

10.  DNorm: disease name normalization with pairwise learning to rank.

Authors:  Robert Leaman; Rezarta Islamaj Dogan; Zhiyong Lu
Journal:  Bioinformatics       Date:  2013-08-21       Impact factor: 6.937

View more
  23 in total

Review 1.  Medication-indication knowledge bases: a systematic review and critical appraisal.

Authors:  Hojjat Salmasian; Tran H Tran; Herbert S Chase; Carol Friedman
Journal:  J Am Med Inform Assoc       Date:  2015-09-02       Impact factor: 4.497

Review 2.  Community challenges in biomedical text mining over 10 years: success, failure and the future.

Authors:  Chung-Chi Huang; Zhiyong Lu
Journal:  Brief Bioinform       Date:  2015-05-01       Impact factor: 11.622

3.  Automatic extraction of drug indications from FDA drug labels.

Authors:  Ritu Khare; Chih-Hsuan Wei; Zhiyong Lu
Journal:  AMIA Annu Symp Proc       Date:  2014-11-14

Review 4.  A survey of current trends in computational drug repositioning.

Authors:  Jiao Li; Si Zheng; Bin Chen; Atul J Butte; S Joshua Swamidass; Zhiyong Lu
Journal:  Brief Bioinform       Date:  2015-03-31       Impact factor: 11.622

Review 5.  Crowdsourcing in biomedicine: challenges and opportunities.

Authors:  Ritu Khare; Benjamin M Good; Robert Leaman; Andrew I Su; Zhiyong Lu
Journal:  Brief Bioinform       Date:  2015-04-17       Impact factor: 11.622

6.  Automatic Classification of Structured Product Labels for Pregnancy Risk Drug Categories, a Machine Learning Approach.

Authors:  Laritza M Rodriguez; Dina Demner Fushman
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

7.  Evaluating electronic health record data sources and algorithmic approaches to identify hypertensive individuals.

Authors:  Pedro L Teixeira; Wei-Qi Wei; Robert M Cronin; Huan Mo; Jacob P VanHouten; Robert J Carroll; Eric LaRose; Lisa A Bastarache; S Trent Rosenbloom; Todd L Edwards; Dan M Roden; Thomas A Lasko; Richard A Dart; Anne M Nikolai; Peggy L Peissig; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2016-08-07       Impact factor: 4.497

Review 8.  Clinical information extraction applications: A literature review.

Authors:  Yanshan Wang; Liwei Wang; Majid Rastegar-Mojarad; Sungrim Moon; Feichen Shen; Naveed Afzal; Sijia Liu; Yuqun Zeng; Saeed Mehrabi; Sunghwan Sohn; Hongfang Liu
Journal:  J Biomed Inform       Date:  2017-11-21       Impact factor: 6.317

9.  Disease-drug database for pharmacogenomic-based prescribing.

Authors:  S Hussain; B B Kenigsberg; K Danahey; Y M Lee; P M Galecki; M J Ratain; P H O'Donnell
Journal:  Clin Pharmacol Ther       Date:  2016-06-08       Impact factor: 6.875

10.  A longitudinal analysis of data quality in a large pediatric data research network.

Authors:  Ritu Khare; Levon Utidjian; Byron J Ruth; Michael G Kahn; Evanette Burrows; Keith Marsolo; Nandan Patibandla; Hanieh Razzaghi; Ryan Colvin; Daksha Ranade; Melody Kitzmiller; Daniel Eckrich; L Charles Bailey
Journal:  J Am Med Inform Assoc       Date:  2017-11-01       Impact factor: 4.497

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.