| Literature DB >> 32449766 |
Jihad S Obeid1,2, Matthew Davis3, Matthew Turner3, Stephane M Meystre2,4, Paul M Heider2, Edward C O'Bryan5, Leslie A Lenert2,6.
Abstract
OBJECTIVE: In an effort to improve the efficiency of computer algorithms applied to screening for coronavirus disease 2019 (COVID-19) testing, we used natural language processing and artificial intelligence-based methods with unstructured patient data collected through telehealth visits.Entities:
Keywords: AI; COVID-19; artificial intelligence; risk assessment; text analytics
Mesh:
Year: 2020 PMID: 32449766 PMCID: PMC7313981 DOI: 10.1093/jamia/ocaa105
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1.Top 10 words that are overrepresented in patients who tested positive for COVID-19 (coronavirus disease 2019), showing relevant words expressed by patients during the virtual care visit intake process.
Figure 2.The area under the receiver-operating characteristic curve (AUC) of the convolutional neural network for predicting SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2)–positive results based on the text content of the virtual care visit notes.
Mean values for AUC, precision, recall, and F1 score based on repeated balanced test sets
| Model | AUC (95% CI) | Precision | Recall | F1 score |
|---|---|---|---|---|
| CNN | 0.732 (0.697-0.767) | 0.754 | 0.453 | 0.566 (0.541-0.586) |
| LR | 0.707 (0.665-0.739) | 0.800 | 0.227 | 0.354 (0.329-0.377) |
AUC: area under the receiver-operating characteristic curve; CI: confidence interval; CNN: convolutional neural network; LR: logistic regression.
Analysis of discriminant power of the model
| Category | Tested | Positive | % Positive |
|---|---|---|---|
| High | 475 | 289 | 60.84 |
| Medium | 1,915 | 127 | 6.63 |
| Low | 9,401 | 244 | 2.60 |
| Total | 11,791 | 660 | 5.60 |