Literature DB >> 18392657

Development of a Google-based search engine for data mining radiology reports.

Joseph P Erinjeri1, Daniel Picus, Fred W Prior, David A Rubin, Paul Koppel.   

Abstract

The aim of this study is to develop a secure, Google-based data-mining tool for radiology reports using free and open source technologies and to explore its use within an academic radiology department. A Health Insurance Portability and Accountability Act (HIPAA)-compliant data repository, search engine and user interface were created to facilitate treatment, operations, and reviews preparatory to research. The Institutional Review Board waived review of the project, and informed consent was not required. Comprising 7.9 GB of disk space, 2.9 million text reports were downloaded from our radiology information system to a fileserver. Extensible markup language (XML) representations of the reports were indexed using Google Desktop Enterprise search engine software. A hypertext markup language (HTML) form allowed users to submit queries to Google Desktop, and Google's XML response was interpreted by a practical extraction and report language (PERL) script, presenting ranked results in a web browser window. The query, reason for search, results, and documents visited were logged to maintain HIPAA compliance. Indexing averaged approximately 25,000 reports per hour. Keyword search of a common term like "pneumothorax" yielded the first ten most relevant results of 705,550 total results in 1.36 s. Keyword search of a rare term like "hemangioendothelioma" yielded the first ten most relevant results of 167 total results in 0.23 s; retrieval of all 167 results took 0.26 s. Data mining tools for radiology reports will improve the productivity of academic radiologists in clinical, educational, research, and administrative tasks. By leveraging existing knowledge of Google's interface, radiologists can quickly perform useful searches.

Entities:  

Mesh:

Year:  2008        PMID: 18392657      PMCID: PMC3043709          DOI: 10.1007/s10278-008-9110-7

Source DB:  PubMed          Journal:  J Digit Imaging        ISSN: 0897-1889            Impact factor:   4.056


  26 in total

1.  An academic radiology information system (RIS): a review of the commercial RIS systems, and how an individualized academic RIS can be created and utilized.

Authors:  E P Tamm; A Kawashima; P Silverman
Journal:  J Digit Imaging       Date:  2001-06       Impact factor: 4.056

2.  Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports.

Authors:  George Hripcsak; John H M Austin; Philip O Alderson; Carol Friedman
Journal:  Radiology       Date:  2002-07       Impact factor: 11.105

3.  Application of an XML-based document framework to knowledge content authoring and clinical information system development.

Authors:  Nathan C Hulse; Roberto A Rocha; Richard Bradshaw; Guilherme Del Fiol; Lorrie Roemer
Journal:  AMIA Annu Symp Proc       Date:  2003

Review 4.  A survey of current work in biomedical text mining.

Authors:  Aaron M Cohen; William R Hersh
Journal:  Brief Bioinform       Date:  2005-03       Impact factor: 11.622

5.  Reinventing radiology in the digital age. Part II. New directions and new stakeholder value.

Authors:  James H Thrall
Journal:  Radiology       Date:  2005-10       Impact factor: 11.105

6.  How Google is changing medicine.

Authors:  Dean Giustini
Journal:  BMJ       Date:  2005-12-24

7.  Who's overworked and who's underworked among radiologists? An update on the radiologist shortage.

Authors:  Cristian I Meghea; Jonathan H Sunshine
Journal:  Radiology       Date:  2005-07-14       Impact factor: 11.105

Review 8.  Mining literature for systems biology.

Authors:  Phoebe M Roberts
Journal:  Brief Bioinform       Date:  2006-10-10       Impact factor: 11.622

9.  Medical data mining: knowledge discovery in a clinical data warehouse.

Authors:  J C Prather; D F Lobach; L K Goodwin; J W Hales; M L Hage; W E Hammond
Journal:  Proc AMIA Annu Fall Symp       Date:  1997

10.  Towards filmless and distance radiology.

Authors:  D M Hynes; G Stevenson; C Nahmias
Journal:  Lancet       Date:  1997-08-30       Impact factor: 79.321

View more
  8 in total

1.  Bridging the text-image gap: a decision support tool for real-time PACS browsing.

Authors:  Merlijn Sevenster; Rob van Ommering; Yuechen Qian
Journal:  J Digit Imaging       Date:  2012-04       Impact factor: 4.056

2.  Intelligent image retrieval based on radiology reports.

Authors:  Axel Gerstmair; Philipp Daumke; Kai Simon; Mathias Langer; Elmar Kotter
Journal:  Eur Radiol       Date:  2012-08-04       Impact factor: 5.315

3.  Automated measurement of pediatric cranial bone thickness and density from clinical computed tomography.

Authors:  Kirk Smith; David Politte; Gregory Reiker; Tracy S Nolan; Charles Hildebolt; Chelsea Mattson; Don Tucker; Fred Prior; Sergei Turovets; Linda J Larson-Prior
Journal:  Conf Proc IEEE Eng Med Biol Soc       Date:  2012

4.  An information retrieval system for computerized patient records in the context of a daily hospital practice: the example of the Léon Bérard Cancer Center (France).

Authors:  P Biron; M H Metzger; C Pezet; C Sebban; E Barthuet; T Durand
Journal:  Appl Clin Inform       Date:  2014-03-05       Impact factor: 2.342

Review 5.  A Systematic Review on Healthcare Analytics: Application and Theoretical Perspective of Data Mining.

Authors:  Md Saiful Islam; Md Mahmudul Hasan; Xiaoyi Wang; Hayley D Germack; Md Noor-E-Alam
Journal:  Healthcare (Basel)       Date:  2018-05-23

Review 6.  Searching Data: A Review of Observational Data Retrieval Practices in Selected Disciplines.

Authors:  Kathleen Gregory; Paul Groth; Helena Cousijn; Andrea Scharnhorst; Sally Wyatt
Journal:  J Assoc Inf Sci Technol       Date:  2019-03-12       Impact factor: 2.687

7.  Searching Full-Text Anatomic Pathology Reports Using Business Intelligence Software.

Authors:  Simone Arvisais-Anhalt; Christoph U Lehmann; Justin A Bishop; Jyoti Balani; Laurie Boutte; Marjorie Morales; Jason Y Park; Ellen Araj
Journal:  J Pathol Inform       Date:  2022-02-07

8.  Evaluation of negation and uncertainty detection and its impact on precision and recall in search.

Authors:  Andrew S Wu; Bao H Do; Jinsuh Kim; Daniel L Rubin
Journal:  J Digit Imaging       Date:  2009-11-10       Impact factor: 4.056

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.