| Literature DB >> 33778666 |
Ricardo C Cury1, Istvan Megyeri1, Tony Lindsey1, Robson Macedo1, Juan Batlle1, Shwan Kim1, Brian Baker1, Robert Harris1, Reese H Clark1.
Abstract
BACKGROUND: Coronavirus disease 2019 (COVID-19) has spread quickly throughout the United States (US) causing significant disruption in healthcare and society. Tools to identify hot spots are important for public health planning. The goal of our study was to determine if natural language processing (NLP) algorithm assessment of thoracic computed tomography (CT) imaging reports correlated with the incidence of official COVID-19 cases in the US.Entities:
Keywords: big data; chest CT; computed tomography; machine learning; natural language processing; public health; viral outbreak
Year: 2021 PMID: 33778666 PMCID: PMC7977750 DOI: 10.1148/ryct.2021200596
Source DB: PubMed Journal: Radiol Cardiothorac Imaging ISSN: 2638-6135
Correlation matrix demonstrating correlation among the variables.
Figure 1.Weekly time series - Correlation of the two NLP models (Imaging findings NLP and COVID NLP) versus number of official COVID-19 cases per week: Temporal course on a week by week basis demonstrating the relationship between the progression and early rise of the NLP positive Chest CT studies, followed by the increase in the number of official COVID-19 cases and subsequent increase in COVID-19 deaths. There was a strong correlation with the COVID NLP when compared to the number of official COVID-19 cases on a weekly basis (r2=0.91, p<0.005).
Figure 2.Official COVID-19 cases and NLP models by state: Correlation between the number of cases detected by the NLP models and the number of official COVID-19 cases on a state level. All 50 US states, Washington D.C. and Puerto Rico are included in the analysis. There was a strong correlation with the COVID NLP model when compared to new COVID-19 cases by State (r2=0.92, p<0.005).
Figure 3A-panel and Video 1. Temporal progression using a Machine-Learning based geomap comparing the rise overtime of the Viral pneumonia NLP (more general NLP) and COVID NLP (more specific NLP) with the number of official COVID-19 cases (Period January 1March 1st snapshot: note below the progression of the Viral pneumonia NLP cases (gray areas in the states) which correlates with the flu season and the appearance of the COVID NLP cases (blue circles), as well as slightly increase in the number of official COVID-19 cases (yellow circles).
Figure 4A:First forecast model based on an artificial neural network to learn from the NLP models and historical data (independent variables) to predict future new daily COVID-19 cases (dependent variable).
Figure 4B:Second model utilizes forecasting methodology to model time series data with complex seasonal patterns using exponential smoothing with no constraints to create detailed, long-term forecasts.