Literature DB >> 32813561

Natural Language Processing of Serum Protein Electrophoresis Reports in the Veterans Affairs Health Care System.

Justine H Ryu1, Andrew J Zimolzak2,3.   

Abstract

PURPOSE: Serum protein electrophoresis (SPEP) is a clinical tool used to screen for monoclonal gammopathy, thus it is a critical tool in the evaluation of patients with multiple myeloma. However, SPEP laboratory results are usually returned as short text reports, which are not amenable to simple computerized processing for large-scale studies. We applied natural language processing (NLP) to detect monoclonal gammopathy in SPEP laboratory results and compared its performance at multiple hospitals using both a rules-based manual system and a machine-learning algorithm.
METHODS: We used the data from the VA Corporate Data Warehouse, which comprises data from 20 million unique individuals. SPEP reports were collected from July to December 2015 at 5 Veterans Affairs Medical Centers. Of these reports, we annotated the presence or absence of monoclonal gammopathy in 300 reports. We applied a machine learning-based NLP and a manual rules-based NLP to detect monoclonal gammopathy in SPEP reports at each of the hospitals, then applied the model from 1 hospital to each of the other hospitals.
RESULTS: The learning system achieved an area under the receiver operating characteristic curve of 0.997, and the rules-based system achieved an accuracy of 0.99. When a model trained on 1 hospital's data was applied to a different hospital, however, accuracy varied greatly, and the learning-based models performed better than the rules-based model.
CONCLUSION: Binary classification of short clinical texts such as SPEP reports may be a particularly attractive target on which to train highly accurate NLP systems.

Entities:  

Year:  2020        PMID: 32813561      PMCID: PMC7477876          DOI: 10.1200/CCI.19.00167

Source DB:  PubMed          Journal:  JCO Clin Cancer Inform        ISSN: 2473-4276


  24 in total

1.  Automatic detection of acute bacterial pneumonia from chest X-ray reports.

Authors:  M Fiszman; W W Chapman; D Aronsky; R S Evans; P J Haug
Journal:  J Am Med Inform Assoc       Date:  2000 Nov-Dec       Impact factor: 4.497

2.  Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports.

Authors:  George Hripcsak; John H M Austin; Philip O Alderson; Carol Friedman
Journal:  Radiology       Date:  2002-07       Impact factor: 11.105

3.  Automated encoding of clinical documents based on natural language processing.

Authors:  Carol Friedman; Lyudmila Shagina; Yves Lussier; George Hripcsak
Journal:  J Am Med Inform Assoc       Date:  2004-06-07       Impact factor: 4.497

4.  Recommendations for standardized reporting of protein electrophoresis in Australia and New Zealand.

Authors:  Jillian Tate; Grahame Caldwell; James Daly; David Gillis; Margaret Jenkins; Sue Jovanovich; Helen Martin; Richard Steele; Louise Wienholt; Peter Mollee
Journal:  Ann Clin Biochem       Date:  2012-03-08       Impact factor: 2.057

5.  Natural language processing to extract medical problems from electronic clinical documents: performance evaluation.

Authors:  Stéphane Meystre; Peter J Haug
Journal:  J Biomed Inform       Date:  2005-12-05       Impact factor: 6.317

6.  Serum and urine electrophoresis for detection and identification of monoclonal proteins.

Authors:  Margaret A Jenkins
Journal:  Clin Biochem Rev       Date:  2009-08

Review 7.  Cancer Phenotype Development: A Literature Review.

Authors:  Pei Wang; Maryam Garza; Meredith Zozus
Journal:  Stud Health Technol Inform       Date:  2019

Review 8.  Machine Learning in Medicine.

Authors:  Alvin Rajkomar; Jeffrey Dean; Isaac Kohane
Journal:  N Engl J Med       Date:  2019-04-04       Impact factor: 91.245

Review 9.  International Myeloma Working Group updated criteria for the diagnosis of multiple myeloma.

Authors:  S Vincent Rajkumar; Meletios A Dimopoulos; Antonio Palumbo; Joan Blade; Giampaolo Merlini; María-Victoria Mateos; Shaji Kumar; Jens Hillengass; Efstathios Kastritis; Paul Richardson; Ola Landgren; Bruno Paiva; Angela Dispenzieri; Brendan Weiss; Xavier LeLeu; Sonja Zweegman; Sagar Lonial; Laura Rosinol; Elena Zamagni; Sundar Jagannath; Orhan Sezer; Sigurdur Y Kristinsson; Jo Caers; Saad Z Usmani; Juan José Lahuerta; Hans Erik Johnsen; Meral Beksac; Michele Cavo; Hartmut Goldschmidt; Evangelos Terpos; Robert A Kyle; Kenneth C Anderson; Brian G M Durie; Jesus F San Miguel
Journal:  Lancet Oncol       Date:  2014-10-26       Impact factor: 41.316

Review 10.  Mining electronic health records: towards better research applications and clinical care.

Authors:  Peter B Jensen; Lars J Jensen; Søren Brunak
Journal:  Nat Rev Genet       Date:  2012-05-02       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.