Stéphane M Meystre1, Paul M Heider1, Youngjun Kim1, Matthew Davis2, Jihad Obeid1, James Madory3, Alexander V Alekseyenko1. 1. Biomedical Informatics Center, Medical University of South Carolina, Charleston, South Carolina, USA. 2. Information Solutions, Medical University of South Carolina, Charleston, South Carolina, USA. 3. Department of Pathology, Medical University of South Carolina, Charleston, South Carolina, USA.
Abstract
OBJECTIVE: The COVID-19 (coronavirus disease 2019) pandemic response at the Medical University of South Carolina included virtual care visits for patients with suspected severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. The telehealth system used for these visits only exports a text note to integrate with the electronic health record, but structured and coded information about COVID-19 (eg, exposure, risk factors, symptoms) was needed to support clinical care and early research as well as predictive analytics for data-driven patient advising and pooled testing. MATERIALS AND METHODS: To capture COVID-19 information from multiple sources, a new data mart and a new natural language processing (NLP) application prototype were developed. The NLP application combined reused components with dictionaries and rules crafted by domain experts. It was deployed as a Web service for hourly processing of new data from patients assessed or treated for COVID-19. The extracted information was then used to develop algorithms predicting SARS-CoV-2 diagnostic test results based on symptoms and exposure information. RESULTS: The dedicated data mart and NLP application were developed and deployed in a mere 10-day sprint in March 2020. The NLP application was evaluated with good accuracy (85.8% recall and 81.5% precision). The SARS-CoV-2 testing predictive analytics algorithms were configured to provide patients with data-driven COVID-19 testing advices with a sensitivity of 81% to 92% and to enable pooled testing with a negative predictive value of 90% to 91%, reducing the required tests to about 63%. CONCLUSIONS: SARS-CoV-2 testing predictive analytics and NLP successfully enabled data-driven patient advising and pooled testing.
OBJECTIVE: The COVID-19 (coronavirus disease 2019) pandemic response at the Medical University of South Carolina included virtual care visits for patients with suspected severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infection. The telehealth system used for these visits only exports a text note to integrate with the electronic health record, but structured and coded information about COVID-19 (eg, exposure, risk factors, symptoms) was needed to support clinical care and early research as well as predictive analytics for data-driven patient advising and pooled testing. MATERIALS AND METHODS: To capture COVID-19 information from multiple sources, a new data mart and a new natural language processing (NLP) application prototype were developed. The NLP application combined reused components with dictionaries and rules crafted by domain experts. It was deployed as a Web service for hourly processing of new data from patients assessed or treated for COVID-19. The extracted information was then used to develop algorithms predicting SARS-CoV-2 diagnostic test results based on symptoms and exposure information. RESULTS: The dedicated data mart and NLP application were developed and deployed in a mere 10-day sprint in March 2020. The NLP application was evaluated with good accuracy (85.8% recall and 81.5% precision). The SARS-CoV-2 testing predictive analytics algorithms were configured to provide patients with data-driven COVID-19 testing advices with a sensitivity of 81% to 92% and to enable pooled testing with a negative predictive value of 90% to 91%, reducing the required tests to about 63%. CONCLUSIONS: SARS-CoV-2 testing predictive analytics and NLP successfully enabled data-driven patient advising and pooled testing.
Keywords:
data science [L01.305]; machine learning [g17.035.250.500]; medical informatics [L01.313.500]; natural language processing (nlp) [L01.224.050.375.580]
Authors: Paul A Harris; Robert Taylor; Robert Thielke; Jonathon Payne; Nathaniel Gonzalez; Jose G Conde Journal: J Biomed Inform Date: 2008-09-30 Impact factor: 6.317
Authors: Stéphane M Meystre; Paul M Heider; Youngjun Kim; Matthew Davis; Jihad Obeid; James Madory; Alexander V Alekseyenko Journal: J Am Med Inform Assoc Date: 2021-12-28 Impact factor: 7.942
Authors: Thomas Struyf; Jonathan J Deeks; Jacqueline Dinnes; Yemisi Takwoingi; Clare Davenport; Mariska Mg Leeflang; René Spijker; Lotty Hooft; Devy Emperador; Sabine Dittrich; Julie Domen; Sebastiaan R A Horn; Ann Van den Bruel Journal: Cochrane Database Syst Rev Date: 2020-07-07
Authors: Jihad S Obeid; Matthew Davis; Matthew Turner; Stephane M Meystre; Paul M Heider; Edward C O'Bryan; Leslie A Lenert Journal: J Am Med Inform Assoc Date: 2020-08-01 Impact factor: 4.497
Authors: Alistair E W Johnson; Tom J Pollard; Lu Shen; Li-Wei H Lehman; Mengling Feng; Mohammad Ghassemi; Benjamin Moody; Peter Szolovits; Leo Anthony Celi; Roger G Mark Journal: Sci Data Date: 2016-05-24 Impact factor: 6.444
Authors: Stéphane M Meystre; Paul M Heider; Youngjun Kim; Matthew Davis; Jihad Obeid; James Madory; Alexander V Alekseyenko Journal: J Am Med Inform Assoc Date: 2021-12-28 Impact factor: 7.942
Authors: Guang Lu; Martin Businger; Christian Dollfus; Thomas Wozniak; Matthes Fleck; Timo Heroth; Irina Lock; Janna Lipenkova Journal: Int J Data Sci Anal Date: 2022-10-06