Literature DB >> 34408850

Data extraction methods for systematic review (semi)automation: A living systematic review.

Lena Schmidt1,2, Babatunde K Olorisade1,3, Luke A McGuinness1, James Thomas4, Julian P T Higgins1.   

Abstract

Background: The reliable and usable (semi)automation of data extraction can support the field of systematic review by reducing the workload required to gather information about the conduct and results of the included studies. This living systematic review examines published approaches for data extraction from reports of clinical studies.
Methods: We systematically and continually search MEDLINE, Institute of Electrical and Electronics Engineers (IEEE), arXiv, and the dblp computer science bibliography databases. Full text screening and data extraction are conducted within an open-source living systematic review application created for the purpose of this review. This iteration of the living review includes publications up to a cut-off date of 22 April 2020.
Results: In total, 53 publications are included in this version of our review. Of these, 41 (77%) of the publications addressed extraction of data from abstracts, while 14 (26%) used full texts. A total of 48 (90%) publications developed and evaluated classifiers that used randomised controlled trials as the main target texts. Over 30 entities were extracted, with PICOs (population, intervention, comparator, outcome) being the most frequently extracted. A description of their datasets was provided by 49 publications (94%), but only seven (13%) made the data publicly available. Code was made available by 10 (19%) publications, and five (9%) implemented publicly available tools. Conclusions: This living systematic review presents an overview of (semi)automated data-extraction literature of interest to different types of systematic review. We identified a broad evidence base of publications describing data extraction for interventional reviews and a small number of publications extracting epidemiological or diagnostic accuracy data. The lack of publicly available gold-standard data for evaluation, and lack of application thereof, makes it difficult to draw conclusions on which is the best-performing system for each data extraction target. With this living review we aim to review the literature continually. Copyright:
© 2021 Schmidt L et al.

Entities:  

Keywords:  Data Extraction; Natural Language Processing; Reproducibility; Systematic Reviews; Text Mining

Mesh:

Year:  2021        PMID: 34408850      PMCID: PMC8361807          DOI: 10.12688/f1000research.51117.1

Source DB:  PubMed          Journal:  F1000Res        ISSN: 2046-1402


  50 in total

1.  Automatically identifying health outcome information in MEDLINE records.

Authors:  Dina Demner-Fushman; Barbara Few; Susan E Hauser; George Thoma
Journal:  J Am Med Inform Assoc       Date:  2005-10-12       Impact factor: 4.497

2.  A method of extracting the number of trial participants from abstracts describing randomized controlled trials.

Authors:  Marie J Hansen; Nana Ø Rasmussen; Grace Chung
Journal:  J Telemed Telecare       Date:  2008       Impact factor: 6.184

3.  PICO element detection in medical text without metadata: are first sentences enough?

Authors:  Ke-Chun Huang; I-Jen Chiang; Furen Xiao; Chun-Chih Liao; Charles Chih-Ho Liu; Jau-Min Wong
Journal:  J Biomed Inform       Date:  2013-07-27       Impact factor: 6.317

4.  Exploiting classification correlations for the extraction of evidence-based practice information.

Authors:  Jin Zhao; Praveen Bysani; Min-Yen Kan
Journal:  AMIA Annu Symp Proc       Date:  2012-11-03

5.  Extractive text summarization system to aid data extraction from full text in systematic review development.

Authors:  Duy Duc An Bui; Guilherme Del Fiol; John F Hurdle; Siddhartha Jonnalagadda
Journal:  J Biomed Inform       Date:  2016-10-27       Impact factor: 6.317

6.  Advancing PICO element detection in biomedical text via deep neural networks.

Authors:  Di Jin; Peter Szolovits
Journal:  Bioinformatics       Date:  2020-06-01       Impact factor: 6.937

7.  Machine learning to assist risk-of-bias assessments in systematic reviews.

Authors:  Louise A C Millard; Peter A Flach; Julian P T Higgins
Journal:  Int J Epidemiol       Date:  2015-12-08       Impact factor: 7.196

8.  Trialstreamer: A living, automatically updated database of clinical trial reports.

Authors:  Iain J Marshall; Benjamin Nye; Joël Kuiper; Anna Noel-Storr; Rachel Marshall; Rory Maclean; Frank Soboczenski; Ani Nenkova; James Thomas; Byron C Wallace
Journal:  J Am Med Inform Assoc       Date:  2020-12-09       Impact factor: 4.497

9.  Making progress with the automation of systematic reviews: principles of the International Collaboration for the Automation of Systematic Reviews (ICASR).

Authors:  Elaine Beller; Justin Clark; Guy Tsafnat; Clive Adams; Heinz Diehl; Hans Lund; Mourad Ouzzani; Kristina Thayer; James Thomas; Tari Turner; Jun Xia; Karen Robinson; Paul Glasziou
Journal:  Syst Rev       Date:  2018-05-19

10.  Mining characteristics of epidemiological studies from Medline: a case study in obesity.

Authors:  George Karystianis; Iain Buchan; Goran Nenadic
Journal:  J Biomed Semantics       Date:  2014-05-19
View more
  2 in total

Review 1.  Toward Automated Data Extraction According to Tabular Data Structure: Cross-sectional Pilot Survey of the Comparative Clinical Literature.

Authors:  Kevin Kallmes; Karl Holub; Nicole Hardy
Journal:  JMIR Form Res       Date:  2021-11-24

Review 2.  Comparative Analysis of Sedative Efficacy of Dexmedetomidine and Midazolam in Pediatric Dental Practice: A Systematic Review and Meta-Analysis.

Authors:  Ranu R Oza; Varsha Sharma; Tejas Suryawanshi; Saniya Lulla; Pavan Bajaj; Prasad Dhadse
Journal:  Cureus       Date:  2022-08-26
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.