Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 RUBY: Natural Language Processing of French Electronic Medical Records for Breast Cancer Research.

Literature DB >> 35960900

RUBY: Natural Language Processing of French Electronic Medical Records for Breast Cancer Research.

Renaud Schiappa¹, Sara Contu¹, Dorian Culie², Brice Thamphya¹, Yann Chateau¹, Jocelyn Gal¹, Caroline Bailleux³, Juliette Haudebourg⁴, Jean-Marc Ferrero⁴, Emmanuel Barranger³, Emmanuel Chamorey¹.

Abstract

PURPOSE: Electronic medical records are a valuable source of information about patients' clinical status but are often free-text documents that require laborious manual review to be exploited. Techniques from computer science have been investigated, but the literature has marginally focused on non-English language texts. We developed RUBY, a tool designed in collaboration with IBM-France to automatically structure clinical information from French medical records of patients with breast cancer.
MATERIALS AND METHODS: RUBY, which exploits state-of-the-art Named Entity Recognition models combined with keyword extraction and postprocessing rules, was applied on clinical texts. We investigated the precision of RUBY in extracting the target information.
RESULTS: RUBY has an average precision of 92.8% for the Surgery report, 92.7% for the Pathology report, 98.1% for the Biopsy report, and 81.8% for the Consultation report.
CONCLUSION: These results show that the automatic approach has the potential to effectively extract clinical knowledge from an extensive set of electronic medical records, reducing the manual effort required and saving a significant amount of time. A deeper semantic analysis and further understanding of the context in the text, as well as training on a larger and more recent set of reports, including those containing highly variable entities and the use of ontologies, could further improve the results.

Entities: Chemical

Mesh：

Year: 2022 PMID： 35960900 PMCID： PMC9470144 DOI： 10.1200/CCI.21.00199

Source DB: PubMed Journal: JCO Clin Cancer Inform ISSN： 2473-4276

Keyword Cloud
References

9 in total

1. Does adoption of electronic health records improve the quality of care management in France? Results from the French e-SI (PREPS-SIPS) study.

Authors: Morgane Plantier; Nathalie Havet; Thierry Durand; Nicolas Caquot; Camille Amaz; Pierre Biron; Irène Philip; Lionel Perrier
Journal: Int J Med Inform Date: 2017-04-04 Impact factor: 4.046

2. Machine Learning Methods to Extract Documentation of Breast Cancer Symptoms From Electronic Health Records.

Authors: Alexander W Forsyth; Regina Barzilay; Kevin S Hughes; Dickson Lui; Karl A Lorenz; Andrea Enzinger; James A Tulsky; Charlotta Lindvall
Journal: J Pain Symptom Manage Date: 2018-02-27 Impact factor: 3.612

3. Using machine learning to parse breast pathology reports.

Authors: Adam Yala; Regina Barzilay; Laura Salama; Molly Griffin; Grace Sollender; Aditya Bardia; Constance Lehman; Julliette M Buckley; Suzanne B Coopey; Fernanda Polubriaginof; Judy E Garber; Barbara L Smith; Michele A Gadd; Michelle C Specht; Thomas M Gudewicz; Anthony J Guidi; Alphonse Taghian; Kevin S Hughes
Journal: Breast Cancer Res Treat Date: 2016-11-08 Impact factor: 4.872

4. Automatically extracting cancer disease characteristics from pathology reports into a Disease Knowledge Representation Model.

Authors: Anni Coden; Guergana Savova; Igor Sominsky; Michael Tanenblatt; James Masanz; Karin Schuler; James Cooper; Wei Guan; Piet C de Groen
Journal: J Biomed Inform Date: 2008-12-27 Impact factor: 6.317

5. Labeling for Big Data in radiation oncology: The Radiation Oncology Structures ontology.

Authors: Jean-Emmanuel Bibault; Eric Zapletal; Bastien Rance; Philippe Giraud; Anita Burgun
Journal: PLoS One Date: 2018-01-19 Impact factor: 3.240

6. Validation of natural language processing to extract breast cancer pathology procedures and results.

Authors: Arika E Wieneke; Erin J A Bowles; David Cronkite; Karen J Wernli; Hongyuan Gao; David Carrell; Diana S M Buist
Journal: J Pathol Inform Date: 2015-06-23

Review 7. Clinical Natural Language Processing in languages other than English: opportunities and challenges.

Authors: Aurélie Névéol; Hercules Dalianis; Sumithra Velupillai; Guergana Savova; Pierre Zweigenbaum
Journal: J Biomed Semantics Date: 2018-03-30

8. Electronic Medical Record Search Engine (EMERSE): An Information Retrieval Tool for Supporting Cancer Research.

Authors: David A Hanauer; Jill S Barnholtz-Sloan; Mark F Beno; Guilherme Del Fiol; Eric B Durbin; Oksana Gologorskaya; Daniel Harris; Brett Harnett; Kensaku Kawamoto; Benjamin May; Eric Meeks; Emily Pfaff; Janie Weiss; Kai Zheng
Journal: JCO Clin Cancer Inform Date: 2020-05

9. Automatic extraction of cancer registry reportable information from free-text pathology reports using multitask convolutional neural networks.

Authors: Mohammed Alawad; Shang Gao; John X Qiu; Hong Jun Yoon; J Blair Christian; Lynne Penberthy; Brent Mumphrey; Xiao-Cheng Wu; Linda Coyle; Georgia Tourassi
Journal: J Am Med Inform Assoc Date: 2020-01-01 Impact factor: 4.497

9 in total