Extracting genetic alteration information for personalized cancer therapy from ClinicalTrials.gov.

Jun Xu, Hee-Jin Lee, Jia Zeng, Yonghui Wu, Yaoyun Zhang, Liang-Chin Huang, Amber Johnson, Vijaykumar Holla, Ann M Bailey, Trevor Cohen, Funda Meric-Bernstam, Elmer V Bernstam, Hua Xu.

Abstract

OBJECTIVE: Clinical trials investigating drugs that target specific genetic alterations in tumors are important for promoting personalized cancer therapy. The goal of this project is to create a knowledge base of cancer treatment trials with annotations about genetic alterations from ClinicalTrials.gov.
METHODS: We developed a semi-automatic framework that combines advanced text-processing techniques with manual review to curate genetic alteration information in cancer trials. The framework consists of a document classification system to identify cancer treatment trials from ClinicalTrials.gov and an information extraction system to extract gene and alteration pairs from the Title and Eligibility Criteria sections of clinical trials. By applying the framework to trials at ClinicalTrials.gov, we created a knowledge base of cancer treatment trials with genetic alteration annotations. We then evaluated each component of the framework against manually reviewed sets of clinical trials and generated descriptive statistics of the knowledge base.
RESULTS AND DISCUSSION: The automated cancer treatment trial identification system achieved a high precision of 0.9944. Together with the manual review process, it identified 20 193 cancer treatment trials from ClinicalTrials.gov. The automated gene-alteration extraction system achieved a precision of 0.8300 and a recall of 0.6803. After validation by manual review, we generated a knowledge base of 2024 cancer trials that are labeled with specific genetic alteration information. Analysis of the knowledge base revealed a trend of increased use of targeted therapy for cancer, as well as the most frequent gene-alteration pairs of interest. We expect this knowledge base to be a valuable resource for physicians and patients who are seeking information about personalized cancer therapy.
© The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
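
The abstract describes extracting gene and alteration pairs from eligibility text and scoring the extractor by precision and recall. The following is a minimal sketch of that idea only; it is not the authors' system. The gene lexicon, the alteration pattern, the sample sentence, and the function names `extract_pairs` and `precision_recall` are all hypothetical and chosen purely for illustration.

```python
import re

# Hypothetical mini-lexicon; the abstract does not list the gene dictionary used.
GENES = ["BRAF", "EGFR", "KRAS", "ALK"]

# Hypothetical alteration vocabulary: generic keywords plus
# protein-level variants such as V600E.
ALTERATION = (
    r"(?:mutations?|amplifications?|rearrangements?|fusions?|deletions?"
    r"|[A-Z]\d+[A-Z])"
)

# A pair is recognized only when the alteration directly follows the gene name.
PAIR_RE = re.compile(
    r"\b(?P<gene>" + "|".join(GENES) + r")\b[\s-]+(?P<alt>" + ALTERATION + r")\b"
)


def extract_pairs(text):
    """Return the set of (gene, alteration) pairs found in the text."""
    return {(m.group("gene"), m.group("alt")) for m in PAIR_RE.finditer(text)}


def precision_recall(predicted, gold):
    """Compare predicted pairs with a manually reviewed gold set."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Invented eligibility sentence, not taken from any real trial record.
criteria = (
    "Patients must have a documented BRAF V600E mutation or ALK rearrangement; "
    "tumors with mutated KRAS are excluded."
)
predicted = extract_pairs(criteria)
gold = {("BRAF", "V600E"), ("ALK", "rearrangement"), ("KRAS", "mutation")}
precision, recall = precision_recall(predicted, gold)
```

In this toy case the pattern misses "mutated KRAS" because the alteration precedes the gene, so recall falls below precision. A rule-based extractor can miss phrasings like this, which is one reason a manual review step is useful alongside automated extraction.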

Keywords:  clinical trial; natural language processing; personalized cancer therapy

Year: 2016; PMID: 27013523; PMCID: PMC4926744; DOI: 10.1093/jamia/ocw009

Source DB: PubMed; Journal: J Am Med Inform Assoc; ISSN: 1067-5027; Impact factor: 4.497


  15 in total

Review 1.  Making Sense of Big Textual Data for Health Care: Findings from the Section on Clinical Natural Language Processing.

Authors:  A Névéol; P Zweigenbaum
Journal:  Yearb Med Inform       Date:  2017-09-11

2.  OCTANE: Oncology Clinical Trial Annotation Engine.

Authors:  Jia Zeng; Md Abu Shufean; Yekaterina Khotskaya; Dong Yang; Michael Kahle; Amber Johnson; Vijaykumar Holla; Nora Sánchez; Kenna R Mills Shaw; Elmer V Bernstam; Funda Meric-Bernstam
Journal:  JCO Clin Cancer Inform       Date:  2019-07

3.  Improving precision in concept normalization.

Authors:  Mayla Boguslav; K Bretonnel Cohen; William A Baumgartner; Lawrence E Hunter
Journal:  Pac Symp Biocomput       Date:  2018

4.  Knowledge bases and software support for variant interpretation in precision oncology.

Authors:  Florian Borchert; Andreas Mock; Aurelie Tomczak; Jonas Hügel; Samer Alkarkoukly; Alexander Knurr; Anna-Lena Volckmar; Albrecht Stenzinger; Peter Schirmacher; Jürgen Debus; Dirk Jäger; Thomas Longerich; Stefan Fröhling; Roland Eils; Nina Bougatf; Ulrich Sax; Matthieu-P Schapranow
Journal:  Brief Bioinform       Date:  2021-11-05       Impact factor: 11.622

5.  Precision medicine informatics.

Authors:  Lewis J Frey; Elmer V Bernstam; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2016-06-06       Impact factor: 7.942

6.  Practical Aspects of Implementing and Applying Health Care Cloud Computing Services and Informatics to Cancer Clinical Trial Data.

Authors:  Jay G Ronquillo; William T Lester
Journal:  JCO Clin Cancer Inform       Date:  2021-08

7.  Big Data Mining and Adverse Event Pattern Analysis in Clinical Drug Trials.

Authors:  Callie Federer; Minjae Yoo; Aik Choon Tan
Journal:  Assay Drug Dev Technol       Date:  2016-09-15       Impact factor: 1.738

8.  CATTLE (CAncer treatment treasury with linked evidence): An integrated knowledge base for personalized oncology research and practice.

Authors:  E Soysal; H-J Lee; Y Zhang; L-C Huang; X Chen; Q Wei; W Zheng; J T Chang; T Cohen; J Sun; H Xu
Journal:  CPT Pharmacometrics Syst Pharmacol       Date:  2017-03-13

Review 9.  Next-Generation Sequencing and the Clinical Oncology Workflow: Data Challenges, Proposed Solutions, and a Call to Action.

Authors:  Jake R Conway; Jeremy L Warner; Wendy S Rubinstein; Robert S Miller
Journal:  JCO Precis Oncol       Date:  2019-10-01

Review 10.  Personalized cancer therapy-leveraging a knowledge base for clinical decision-making.

Authors:  Ecaterina Ileana Dumbrava; Funda Meric-Bernstam
Journal:  Cold Spring Harb Mol Case Stud       Date:  2018-04-02