Literature DB >> 17911777

Extracting subject demographic information from abstracts of randomized clinical trial reports.

Rong Xu1, Yael Garten, Kaustubh S Supekar, Amar K Das, Russ B Altman, Alan M Garber.   

Abstract

In order to make more informed healthcare decisions, consumers need information systems that deliver accurate and reliable information about their illnesses and potential treatments. Reports of randomized clinical trials (RCTs) provide reliable medical evidence about the efficacy of treatments. Current methods to access, search for, and retrieve RCTs are keyword-based, time-consuming, and suffer from poor precision. Personalized semantic search and medical evidence summarization aim to solve this problem. The performance of these approaches may improve if they have access to study subject descriptors (e.g. age, gender, and ethnicity), trial sizes, and diseases/symptoms studied. We have developed a novel method to automatically extract such subject demographic information from RCT abstracts. We used text classification augmented with a Hidden Markov Model to identify sentences containing subject demographics, and subsequently these sentences were parsed using Natural Language Processing techniques to extract relevant information. Our results show accuracy levels of 82.5%, 92.5%, and 92.0% for extraction of subject descriptors, trial sizes, and diseases/symptoms descriptors respectively.

Mesh:

Year:  2007        PMID: 17911777

Source DB:  PubMed          Journal:  Stud Health Technol Inform        ISSN: 0926-9630


  11 in total

1.  Automated information extraction of key trial design elements from clinical trial publications.

Authors:  Berry de Bruijn; Simona Carini; Svetlana Kiritchenko; Joel Martin; Ida Sim
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

2.  The Ontology of Clinical Research (OCRe): an informatics foundation for the science of clinical research.

Authors:  Ida Sim; Samson W Tu; Simona Carini; Harold P Lehmann; Brad H Pollock; Mor Peleg; Knut M Wittkowski
Journal:  J Biomed Inform       Date:  2013-11-13       Impact factor: 6.317

Review 3.  Recent progress in automatically extracting information from the pharmacogenomic literature.

Authors:  Yael Garten; Adrien Coulet; Russ B Altman
Journal:  Pharmacogenomics       Date:  2010-10       Impact factor: 2.533

4.  Extracting Characteristics of the Study Subjects from Full-Text Articles.

Authors:  Dina Demner-Fushman; James G Mork
Journal:  AMIA Annu Symp Proc       Date:  2015-11-05

5.  Systematic identification of pharmacogenomics information from clinical trials.

Authors:  Jiao Li; Zhiyong Lu
Journal:  J Biomed Inform       Date:  2012-04-24       Impact factor: 6.317

6.  Data extraction methods for systematic review (semi)automation: A living systematic review.

Authors:  Lena Schmidt; Babatunde K Olorisade; Luke A McGuinness; James Thomas; Julian P T Higgins
Journal:  F1000Res       Date:  2021-05-19

7.  Why Health Services Research Needs Geoinformatics: Rationale and Case Example.

Authors:  Tracy Onega; Jennifer Alford-Teaster; Steven Andrews; Craig Ganoe; Mike Perez; King David; Xun Shi
Journal:  J Health Med Inform       Date:  2014-12

Review 8.  Automating data extraction in systematic reviews: a systematic review.

Authors:  Siddhartha R Jonnalagadda; Pawan Goyal; Mark D Huffman
Journal:  Syst Rev       Date:  2015-06-15

9.  Sentence retrieval for abstracts of randomized controlled trials.

Authors:  Grace Y Chung
Journal:  BMC Med Inform Decis Mak       Date:  2009-02-10       Impact factor: 2.796

10.  Towards Evidence-based Precision Medicine: Extracting Population Information from Biomedical Text using Binary Classifiers and Syntactic Patterns.

Authors:  Kalpana Raja; Naman Dasot; Pawan Goyal; Siddhartha R Jonnalagadda
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2016-07-20
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.