| Literature DB >> 34900126 |
Ching-Hsuan Liu1,2, Cheng-Hua Lu1, Liang-Tzung Lin1,3.
Abstract
The emergence of the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), which is the etiologic agent of the coronavirus disease 2019 (COVID-19) pandemic, has dominated all aspects of life since of 2020. Research studies on the virus and exploration of therapeutic and preventive strategies has been moving at rapid rates to control the pandemic. In the field of bioinformatics or computational and structural biology, recent research strategies have used multiple disciplines to compile large datasets to uncover statistical correlations and significance, visualize and model proteins, perform molecular dynamics simulations, and employ the help of artificial intelligence and machine learning to harness computational processing power to further the research on COVID-19, including drug screening, drug design, vaccine development, prognosis prediction, and outbreak prediction. These recent developments should help us better understand the viral disease and develop the much-needed therapies and strategies for the management of COVID-19.Entities:
Keywords: Artificial intelligence; COVID-19; Disease prediction; Drug design; Drug screening; Machine learning; SARS-CoV-2; Vaccine development
Year: 2021 PMID: 34900126 PMCID: PMC8650801 DOI: 10.1016/j.csbj.2021.11.040
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Summary of machine learning models developed for disease prediction.
| Readout | Parameters | Algorithm | Sensitivity (Recall) | Specificity | Precision (PPV) | F1-score | Accuracy | AUROC | Test Cohort | Ref. |
|---|---|---|---|---|---|---|---|---|---|---|
| Mortality | 33 clinical parameters | Random forest | 85.71 % | 92.45% | – | – | 89.47% | 0.921 | No | |
| Mortality | 45 proteins | Bayesian network | 92.68% | 86% | – | – | 89.01% | 0.953 | No | |
| Mortality | CRP, BUN, serum calcium, serum albumin, lactic acid | SVM | 91% | 91% | 62.5% | – | – | 0.93 | No | |
| In-hospital mortality | Age, lymphocyte, D-dimer, CRP, creatinine (ALDCC) | Logistic regression | 0.91 ± 0.03 | 0.78 ± 0.04 | 0.92 ± 0.03 | 0.92 ± 0.03 | 0.91 ± 0.03 | 0.992 | Yes | |
| In-hospital mortality | Age, hs-CRP, lymphocyte, d-dimer | Logistic regression | 0.839 | 0.794 | – | – | – | 0.881 | Yes | |
| In-hospital mortality | LDH, neutrophils, lymphocyte, hs-CRP, age (LNLCA) | Logistic regression | 92 ± 2.6% | 92 ± 3% | – | – | – | 0.991 | Yes | |
| In-hospital mortality | PTA, urea, WBC, IL-2r, indirect bilirubin, myoglobin, FgDP | LASSO logistic regression | 98% | 91% | – | – | – | 0.997 | No | |
| In-hospital mortality | Disease severity, age, hs-CRP, LDH, ferritin, IL-10 | Simple-tree XGBoost | >85% | – | >90% | >0.90 | >0.90 | 1.000 | Yes | |
| Disease severity | 28 blood and urine parameters | SVM | – | – | – | – | 0.8148 | – | Yes | |
| Disease severity | Different biomarker combinations | Penalized logistic regression | >82% | >71% | >87% | – | >85% | – | Yes |
BUN, blood urea nitrogen; CRP, c-reactive protein; FgDP, fibrinogen degradation products; hs-CRP, high-sensitivity C-reactive protein; IL-2r, interleukin-2 receptor; IL-10, interleukin-10; LASSO, least absolute shrinkage and selection operator; LDH, lactate dehydrogenase; MCHC, mean corpuscular hemoglobin concentration; PPV, positive predictive value; PTA, prothrombin; SVM, support vector machine; WBC, white blood cell activity; XGBoost, eXtreme Gradient Boosting.
Fig. 1Applications of computational and structural biology and artificial intelligence (AI) in the COVID-19 pandemic. Created with BioRender.com