| Literature DB >> 33519109 |
Talha Burak Alakus1, Ibrahim Turkoglu2.
Abstract
The SARS-CoV2 virus, which causes COVID-19 (coronavirus disease) has become a pandemic and has expanded all over the world. Because of increasing number of cases day by day, it takes time to interpret the laboratory findings thus the limitations in terms of both treatment and findings are emerged. Due to such limitations, the need for clinical decisions making system with predictive algorithms has arisen. Predictive algorithms could potentially ease the strain on healthcare systems by identifying the diseases. In this study, we perform clinical predictive models that estimate, using deep learning and laboratory data, which patients are likely to receive a COVID-19 disease. To evaluate the predictive performance of our models, precision, F1-score, recall, AUC, and accuracy scores calculated. Models were tested with 18 laboratory findings from 600 patients and validated with 10 fold cross-validation and train-test split approaches. The experimental results indicate that our predictive models identify patients that have COVID-19 disease at an accuracy of 86.66%, F1-score of 91.89%, precision of 86.75%, recall of 99.42%, and AUC of 62.50%. It is observed that predictive models trained on laboratory findings could be used to predict COVID-19 infection, and can be helpful for medical experts to prioritize the resources correctly. Our models (available at (https://github.com/burakalakuss/COVID-19-Clinical)) can be employed to assists medical experts in validating their initial laboratory findings, and can also be used for clinical prediction studies.Entities:
Keywords: Artificial intelligence; COVID-19; Coronavirus; Deep learning; SARS-CoV2
Year: 2020 PMID: 33519109 PMCID: PMC7833512 DOI: 10.1016/j.chaos.2020.110120
Source DB: PubMed Journal: Chaos Solitons Fractals ISSN: 0960-0779 Impact factor: 5.944
18 Laboratory findings of the patients in the dataset.
| Laboratory Findings | Hematocrit, hemoglobin, platelets, red blood cells, lymphocytes, leukocytes, basophils, eosinophils, monocytes, serum glucose, neutrophils, urea, C reactive protein, creatinine, potassium, sodium, alanine transaminase, aspartate transaminase |
Parameters of each DL classifier.
| Parameters | ANN | CNN | LSTM | RNN | CNNLSTM | CNNRNN |
|---|---|---|---|---|---|---|
| Number of units | 32,16,8 | 512,256 | – | – | 512,256 | 512,256 |
| Number of layers | 1,2,3 | 1,2 | 1 | 1 | 1,2 | 1,2 |
| Activation function | ReLU | ReLU | ReLU | ReLU | ReLU | ReLU |
| Learning rate | 1e-3 | 1e-3 | 1e-3 | 1e-3 | 1e-3 | 1e-3 |
| Loss function | Binary crossentropy | Binary crossentropy | Binary crossentropy | Binary crossentropy | Binary crossentropy | Binary crossentropy |
| Number of epoch | 250 | 250 | 250 | 250 | 250 | 250 |
| Optimizer | SGD | SGD | SGD | SGD | SGD | SGD |
| Decay | 1e-5 | 1e-5 | 1e-5 | 1e-5 | 1e-5 | 1e-5 |
| Momentum | 0.3 | 0.3 | 0.3 | 0.3 | 0.3 | 0.3 |
| Number of fully connected units | – | 2048,1024 | 2048,1024 | 2048,1024 | 2048,1024 | 2048,1024 |
| Number of fully connected layers | – | 1,2 | 1,2 | 1,2 | 1,2 | 1,2 |
| Number of LSTM units | – | – | 512 | – | 512 | 512 |
| Number of RNN units | – | – | – | 512 | – | 512 |
| Dropout | – | – | – | 0.25 | 0.15 | 0.15 |
Fig. 1Flowchart of this study. The orange icon indicates the dataset, which is laboratory findings in this study. The pink ones represent the deep learning models including, ANN, CNN, RNN, LSTM, CNNLSTM, and CNNRNN. All of these models were used to predict the No findings and COVID-19 patients. AUC, Accuracy, Precision, Recall, and F1-Scores were applied to evaluate the results. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Evaluation results of all deep learning application models with 10 fold cross-validation approach.
| Accuracy | F1-Score | Precision | Recall | AUC | |
|---|---|---|---|---|---|
| ANN | 0.8600 | 0.9134 | 0.8855 | 0.9578 | 0.5615 |
| CNN | 0.8800 | 0.9038 | 0.8948 | 0.9248 | 0.6149 |
| CNNLSTM | 0.8416 | 0.9001 | 0.8926 | 0.9214 | 0.5889 |
| CNNRNN | 0.8566 | 0.9120 | 0.8977 | 0.9423 | 0.6408 |
| LSTM | |||||
| RNN | 0.8416 | 0.9061 | 0.8783 | 0.9604 | 0.5245 |
Fig. 2Evaluation results of all deep learning models with 10 fold cross-validation approach.
Evaluation results of all deep learning application models with train-test split approach.
| Accuracy | F1-Score | Precision | Recall | AUC | |
|---|---|---|---|---|---|
| ANN | 0.8690 | 0.8713 | 0.8713 | 0.8713 | 0.85 |
| CNN | 0.8735 | 0.8856 | 0.8847 | 0.8867 | 0.80 |
| CNNLSTM | |||||
| CNNRNN | 0.8624 | 0.8755 | 0.8755 | 0.8755 | 0.69 |
| LSTM | 0.9034 | 0.8997 | 0.8997 | 0.8998 | 0.83 |
| RNN | 0.8400 | 0.8427 | 0.8428 | 0.8427 | 0.83 |
Fig. 3AUC values of all deep learning application models with train-test split approach.
Comparison of evaluation results.
| Study | Dataset Location | AI Technique | Classifier | Accuracy | AUC | F1-Score |
|---|---|---|---|---|---|---|
| Wenzhou Central Hospital and Cangnan People's Hospital in Wenzhu, China | Machine learning | SVM | 80.00% | – | – | |
| Hospital Israelita Albert Einstein at Sao Paulo, Brazil | Machine learning | SVM, RF | – | 0.87 | 0.72 | |
| Hospital Israelita Albert Einstein at Sao Paulo, Brazil | Machine learning | XGB | – | 0.66 | – | |
| This work | Hospital Israelita Albert Einstein at Sao Paulo, Brazil |