Literature DB >> 25618216

Representing and extracting lung cancer study metadata: study objective and study design.

Jean I Garcia-Gathright1, Andrea Oh2, Phillip A Abarca3, Mary Han3, William Sago3, Marshall L Spiegel3, Brian Wolf3, Edward B Garon3, Alex A T Bui4, Denise R Aberle4.   

Abstract

This paper describes the information retrieval step in Casama (Contextualized Semantic Maps), a project that summarizes and contextualizes current research papers on driver mutations in non-small cell lung cancer. Casama׳s representation of lung cancer studies aims to capture elements that will assist an end-user in retrieving studies and, importantly, judging their strength. This paper focuses on two types of study metadata: study objective and study design. 430 abstracts on EGFR and ALK mutations in lung cancer were annotated manually. Casama׳s support vector machine (SVM) automatically classified the abstracts by study objective with as much as 129% higher F-scores compared to PubMed׳s built-in filters. A second SVM classified the abstracts by epidemiological study design, suggesting strength of evidence at a more granular level than in previous work. The classification results and the top features determined by the classifiers suggest that this scheme would be generalizable to other mutations in lung cancer, as well as studies on driver mutations in other cancer domains.
Copyright © 2015 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Automatic summarization; Information retrieval; Quality of evidence

Mesh:

Year:  2015        PMID: 25618216      PMCID: PMC4331232          DOI: 10.1016/j.compbiomed.2015.01.004

Source DB:  PubMed          Journal:  Comput Biol Med        ISSN: 0010-4825            Impact factor:   4.589


  19 in total

1.  Identifying diagnostic studies in MEDLINE: reducing the number needed to read.

Authors:  Lucas M Bachmann; Reto Coray; Pius Estermann; Gerben Ter Riet
Journal:  J Am Med Inform Assoc       Date:  2002 Nov-Dec       Impact factor: 4.497

2.  Text categorization models for high-quality article retrieval in internal medicine.

Authors:  Yindalon Aphinyanaphongs; Ioannis Tsamardinos; Alexander Statnikov; Douglas Hardin; Constantin F Aliferis
Journal:  J Am Med Inform Assoc       Date:  2004-11-23       Impact factor: 4.497

3.  The effect of feature representation on MEDLINE document classification.

Authors:  Meliha Yetisgen-Yildiz; Wanda Pratt
Journal:  AMIA Annu Symp Proc       Date:  2005

4.  Towards automatic recognition of scientifically rigorous clinical research evidence.

Authors:  Halil Kilicoglu; Dina Demner-Fushman; Thomas C Rindflesch; Nancy L Wilczynski; R Brian Haynes
Journal:  J Am Med Inform Assoc       Date:  2008-10-24       Impact factor: 4.497

5.  An SVM-based high-quality article classifier for systematic reviews.

Authors:  Seunghee Kim; Jinwook Choi
Journal:  J Biomed Inform       Date:  2013-10-29       Impact factor: 6.317

6.  Document classification for mining host pathogen protein-protein interactions.

Authors:  Lanlan Yin; Guixian Xu; Manabu Torii; Zhendong Niu; Jose M Maisog; Cathy Wu; Zhangzhi Hu; Hongfang Liu
Journal:  Artif Intell Med       Date:  2010-05-15       Impact factor: 5.326

7.  Retrieving randomized controlled trials from medline: a comparison of 38 published search filters.

Authors:  Kathleen Ann McKibbon; Nancy Lou Wilczynski; Robert Brian Haynes
Journal:  Health Info Libr J       Date:  2009-09

8.  PreBIND and Textomy--mining the biomedical literature for protein-protein interactions using a support vector machine.

Authors:  Ian Donaldson; Joel Martin; Berry de Bruijn; Cheryl Wolting; Vicki Lay; Brigitte Tuekam; Shudong Zhang; Berivan Baskin; Gary D Bader; Katerina Michalickova; Tony Pawson; Christopher W V Hogue
Journal:  BMC Bioinformatics       Date:  2003-03-27       Impact factor: 3.169

9.  Semi-automated screening of biomedical citations for systematic reviews.

Authors:  Byron C Wallace; Thomas A Trikalinos; Joseph Lau; Carla Brodley; Christopher H Schmid
Journal:  BMC Bioinformatics       Date:  2010-01-26       Impact factor: 3.169

10.  Afatinib: emerging next-generation tyrosine kinase inhibitor for NSCLC.

Authors:  Valerie Nelson; Jacqueline Ziehr; Mark Agulnik; Melissa Johnson
Journal:  Onco Targets Ther       Date:  2013-03-05       Impact factor: 4.147

View more
  2 in total

1.  Evaluating Casama: Contextualized semantic maps for summarization of lung cancer studies.

Authors:  Jean I Garcia-Gathright; Nicholas J Matiasz; Carlos Adame; Karthik V Sarma; Lauren Sauer; Nova F Smedley; Marshall L Spiegel; Jennifer Strunck; Edward B Garon; Ricky K Taira; Denise R Aberle; Alex A T Bui
Journal:  Comput Biol Med       Date:  2017-11-03       Impact factor: 4.589

2.  Toward patient-tailored summarization of lung cancer literature.

Authors:  Jean I Garcia-Gathright; Nicholas J Matiasz; Edward B Garon; Denise R Aberle; Ricky K Taira; Alex A T Bui
Journal:  IEEE EMBS Int Conf Biomed Health Inform       Date:  2016-04-21
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.