| Literature DB >> 35692512 |
Priyanka De1, Vinay Kumar1, Supratik Kar2, Kunal Roy1, Jerzy Leszczynski2.
Abstract
The worldwide burden of coronavirus disease 2019 (COVID-19) is still unremittingly prevailing, with more than 440 million infections and over 5.9 million deaths documented so far since the SARS-CoV-2 (severe acute respiratory syndrome coronavirus 2) pandemic. The non-availability of treatment further aggravates the scenario, thereby demanding the exploration of pre-existing FDA-approved drugs for their effectiveness against COVID-19. The current research aims to identify potential anti-SARS-CoV-2 drugs using a computational approach and repurpose them if possible. In the present study, we have collected a set of 44 FDA-approved drugs of different classes from a previously published literature with their potential antiviral activity against COVID-19. We have employed both regression- and classification-based quantitative structure-activity relationship (QSAR) modeling to identify critical chemical features essential for anticoronaviral activity. Multiple models with the consensus algorithm were employed for the regression-based approach to improve the predictions. Additionally, we have employed a machine learning-based read-across approach using Read-Across-v3.1 available from https://sites.google.com/jadavpuruniversity.in/dtc-lab-software/home and linear discriminant analysis for the efficient prediction of potential drug candidate for COVID-19. Finally, the quantitative prediction ability of different modeling approaches was compared using the sum of ranking differences (SRD). Furthermore, we have predicted a true external set of 98 pharmaceuticals using the developed models for their probable anti-COVID activity and their prediction reliability was checked employing the "Prediction Reliability Indicator" tool available from https://dtclab.webs.com/software-tools. Though the present study does not target any protein of viral interaction, the modeling approaches developed can be helpful for identifying or screening potential anti-coronaviral drug candidates. Supplementary information: The online version contains supplementary material available at 10.1007/s11224-022-01975-3.Entities:
Keywords: COVID-19; In silico approaches; Quantitative structure–activity relationship; Read-across; SARS-CoV-2
Year: 2022 PMID: 35692512 PMCID: PMC9171098 DOI: 10.1007/s11224-022-01975-3
Source DB: PubMed Journal: Struct Chem ISSN: 1040-0400 Impact factor: 1.795
Fig. 1The observed versus predicted pIC50 plots of all four PLS models
Descriptors appearing in the four PLS models
| nROR | Functional group counts | Number of aliphatic ether groups | Positive | 4 |
| F06[C–Cl] | 2D atom pairs | Frequency of C – Cl at topological distance 6 | Positive | 4 |
| NsNH2 | Atom-type E-state indices | Number of atoms of type sNH2 | Negative | 4 |
| VE1sign_Dz(p) | 2D matrix-based descriptors | Coefficient sum of the last eigenvector from Barysz matrix weighted by polarizability | Negative | 1 |
| nRCOOR | Functional group counts | Number of aliphatic esters | Negative | 1 |
| VE1_B(e) | 2D matrix-based descriptors | Coefficient sum of the last eigenvector (absolute values) from Burden matrix weighted by Sanderson electronegativity | Negative | 1 |
| VE1_H2 | 2D matrix-based descriptors | Coefficient sum of the last eigenvector (absolute values) from reciprocal squared distance matrix | Negative | 1 |
Fig. 2Features increasing or decreasing the antiviral activity against SARS-CoV-2
Fig. 3Variable importance plot of four PLS models (M1–M4)
Fig. 4Loading plots of all four PLS models (M1–M4)
Statistical qualities of all four PLS models along with their consensus predictions (the best metric values are shown in bold)
| IM1 | 0.672 | 0.612 | 0.487 | 0.203 | 0.184 | 0.831 | 0.831 | 0.668 | 0.110 | 0.139 | 0.114 | 0.906 |
| IM2 | 0.663 | 0.607 | 0.474 | 0.250 | 0.194 | 0.834 | 0.834 | 0.675 | 0.109 | 0.157 | 0.138 | 0.907 |
| IM3 | 0.668 | 0.608 | 0.480 | 0.218 | 0.189 | 0.839 | 0.839 | 0.706 | 0.100 | 0.154 | 0.135 | 0.912 |
| IM4 | 0.665 | 0.604 | 0.477 | 0.209 | 0.187 | 0.826 | 0.826 | 0.705 | 0.104 | 0.153 | 0.132 | 0.906 |
| CM0 | - | - | - | - | - | 0.838 | 0.838 | 0.691 | 0.103 | 0.151 | 0.133 | 0.910 |
| CM1 | - | - | - | - | - | 0.838 | 0.838 | 0.691 | 0.103 | 0.151 | 0.133 | 0.910 |
| CM2 | - | - | - | - | - | |||||||
| CM3 | - | - | - | - | - | |||||||
Comparison between classical QSAR models and their corresponding read-across predictions (the best metric values are shown in bold)
| M1 | PLSR | - | - | - | - | - | 0.831 | 0.831 | 0.139 | 0.179 |
| RA-ED | 1.5 | 1.5 | 10 | 0.5 | 0.0 | 0.879 | 0.878 | 0.127 | 0.152 | |
| RA-GK | 0.893 | 0.893 | 0.121 | 0.143 | ||||||
| RA-LK | ||||||||||
| M2 | PLSR | - | - | - | - | - | 0.834 | 0.834 | 0.157 | 0.178 |
| RA-ED | 1 | 1 | 10 | 0.6 | 0.0 | 0.870 | 0.870 | 0.135 | 0.152 | |
| RA-GK | ||||||||||
| RA-LK | 0.911 | 0.911 | 0.119 | 0.132 | ||||||
| M3 | PLSR | - | - | - | - | - | 0.839 | 0.839 | 0.154 | 0.175 |
| RA-ED | 0.75 | 1.5 | 10 | 0.5 | 0.0 | 0.862 | 0.862 | 0.142 | 0.162 | |
| RA-GK | ||||||||||
| RA-LK | 0.892 | 0.892 | 0.132 | 0.144 | ||||||
| M4 | PLSR | - | - | - | - | - | 0.826 | 0.826 | 0.153 | 0.182 |
| RA-ED | 0.75 | 1.75 | 10 | 0.6 | 0.0 | 0.722 | 0.722 | 0.163 | 0.230 | |
| RA-GK | 0.931 | 0.931 | 0.100 | 0.115 | ||||||
| RA-LK | ||||||||||
Fig. 5Comparative plot of the scaled SRD values of the different modeling approaches
Fig. 6Cross-validated SRD plotting: maximum, minimum, and median SRD values for all the modeling approaches
Qualitative validation parameters for the training and test sets for LDA model
| Training | 33 | 0.824 | 0.875 | 0.848 | 0.875 | 0.848 | 0.849 | 0.699 | 0.697 |
| Test | 11 | 0.60 | 1 | 0.818 | 1 | 0.750 | 0.775 | 0.671 | 0.621 |
Fig. 7Features contributing to the anti-SARS-CoV-2 activity according to the classification model