| Literature DB >> 34988543 |
Thomas Linden1,2, Frank Hanses3,4, Daniel Domingo-Fernández1, Lauren Nicole DeLong1,2, Alpha Tom Kodamullil1, Jochen Schneider5, Maria J G T Vehreschild6, Julia Lanznaster7, Maria Madeleine Ruethrich8, Stefan Borgmann9, Martin Hower10, Kai Wille11, Torsten Feldt12, Siegbert Rieg13, Bernd Hertenstein13, Christoph Wyen14, Christoph Roemmele15, Jörg Janne Vehreschild6, Carolin E M Jakob16, Melanie Stecher17, Maria Kuzikov4, Andrea Zaliani4, Holger Fröhlich1,2.
Abstract
Despite available vaccinations COVID-19 case numbers around the world are still growing, and effective medications against severe cases are lacking. In this work, we developed a machine learning model which predicts mortality for COVID-19 patients using data from the multi-center 'Lean European Open Survey on SARS-CoV-2-infected patients' (LEOSS) observational study (>100 active sites in Europe, primarily in Germany), resulting into an AUC of almost 80%. We showed that molecular mechanisms related to dementia, one of the relevant predictors in our model, intersect with those associated to COVID-19. Most notably, among these molecules was tyrosine kinase 2 (TYK2), a protein that has been patented as drug target in Alzheimer's Disease but also genetically associated with severe COVID-19 outcomes. We experimentally verified that anti-cancer drugs Sorafenib and Regorafenib showed a clear anti-cytopathic effect in Caco2 and VERO-E6 cells and can thus be regarded as potential treatments against COVID-19. Altogether, our work demonstrates that interpretation of machine learning based risk models can point towards drug targets and new treatment options, which are strongly needed for COVID-19.Entities:
Keywords: Covid19; Drug repositioning; Explainable ai; Machine learning; Precision medicine
Year: 2021 PMID: 34988543 PMCID: PMC8677630 DOI: 10.1016/j.ailsci.2021.100020
Source DB: PubMed Journal: Artif Intell Life Sci ISSN: 2667-3185
Overview of patient demographics in LEOSS.
| Age | |
|---|---|
| 18 - 25 years | 181 |
| 26 - 35 years | 472 |
| 36 - 45 years | 540 |
| 46 - 55 years | 907 |
| 56 - 65 years | 1125 |
| 66 - 75 years | 981 |
| 76 - 85 years | 1231 |
| missing | 242 |
| Gender | |
| Male | 3229 |
| Female | 2218 |
| missing | 232 |
| Ethnicity | |
| Caucasian | 4225 |
| missing | 1195 |
| Asian & Pacific Islander | 155 |
| African & African American | 98 |
| Hispanic or Latino | 6 |
| Country | |
| Germany | 5411 |
| Turkey | 65 |
| Belgium | 40 |
| Czechia | 33 |
| Latvia | 27 |
| Other | 26 |
| GBR | 23 |
| Italy | 19 |
| Spain | 15 |
| France | 11 |
| Austria | 9 |
Fig. 1Kaplan-Meier plot of COVID-19 patients in LEOSS. The plot shows the estimated survival function according to the well-known product limit estimator, see section “Methods” [32]. The gray area depicts the 95% confidence interval.
Fig. 2(a) Model prediction performance measured via Uno's C-index on held out test sets (COX = elastic net penalized Cox proportional hazards regression; WEI = elastic net penalized Weibull accelerated failure time regression; XGBSE = XGBoost Survival Embeddings; RSF = Random Survival Forest; DEEPSURV = DeepSurv); (b) model calibration error measured via Integrated Brier Score (IBS) on held out test sets; (c) model prediction performance as function of time on held out test sets with 95% confidence interval, with integrated AUC (iAUC) denoting the mean (standard error) AUC over time. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 3Feature importance using absolute SHAP values: (a) top 10 predictors; (b) cumulative influence per feature modality. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Partial dependence plots for most influential predictors. Boxplots show the distribution of patient specific hazard ratios per variable category. The red horizontal line defines the reference. The hazard ratio describes by which factor the median lifetime is expected to change compared to reference. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 5Regorafenib (panels A and C) and Sorafenib (panels B and D) activities measured in different cell lines (Vero-E6 cells upper panels; Caco2 cells lower panels) as percentage inhibition of viral cytopathic effect normalized to Remdesivir as positive control (100%). Cells in wells were treated with SARS CoV-2 virus, and drugs were administered after 48 or 96 h after infection. Subsequently, cells were stained, washed and counted if alive. Some signs of toxicity on Caco2 cells (lower panels) started to surface at higher drug concentrations and this might be the reason for the higher observed variance of triplicates. The slightly negative relative inhibition shown in panel D is caused by plate control differences within plates.