Literature DB >> 27490990

AVCpred: an integrated web server for prediction and design of antiviral compounds.

Abid Qureshi1, Gazaldeep Kaur1, Manoj Kumar1.   

Abstract

Viral infections constantly jeopardize the global public health due to lack of effective antiviral therapeutics. Therefore, there is an imperative need to speed up the drug discovery process to identify novel and efficient drug candidates. In this study, we have developed quantitative structure-activity relationship (QSAR)-based models for predicting antiviral compounds (AVCs) against deadly viruses like human immunodeficiency virus (HIV), hepatitis C virus (HCV), hepatitis B virus (HBV), human herpesvirus (HHV) and 26 others using publicly available experimental data from the ChEMBL bioactivity database. Support vector machine (SVM) models achieved a maximum Pearson correlation coefficient of 0.72, 0.74, 0.66, 0.68, and 0.71 in regression mode and a maximum Matthew's correlation coefficient 0.91, 0.93, 0.70, 0.89, and 0.71, respectively, in classification mode during 10-fold cross-validation. Furthermore, similar performance was observed on the independent validation sets. We have integrated these models in the AVCpred web server, freely available at http://crdd.osdd.net/servers/avcpred. In addition, the datasets are provided in a searchable format. We hope this web server will assist researchers in the identification of potential antiviral agents. It would also save time and cost by prioritizing new drugs against viruses before their synthesis and experimental testing.
© 2016 John Wiley & Sons A/S.

Entities:  

Keywords:  zzm321990QSARzzm321990; algorithm; antiviral compounds; drug design; inhibition; prediction

Mesh:

Substances:

Year:  2016        PMID: 27490990      PMCID: PMC7162012          DOI: 10.1111/cbdd.12834

Source DB:  PubMed          Journal:  Chem Biol Drug Des        ISSN: 1747-0277            Impact factor:   2.817


Antiviral compounds (AVCs) inhibit the development of viruses in the host cell and are relatively harmless to the host.1 They can be natural, for example, antivirals found in turmeric2 and eucalyptus oil,3 or synthetic, for example, zidovudine (a nucleoside analog)4 and Tamiflu (neuraminidase inhibitor).5 Many compounds and drugs have also been tested and found to be useful in restricting the growth of certain viruses.6, 7 Scientists are endeavoring to broaden the range of antivirals to other families of viruses.8 However, designing safe and effective antiviral drugs is a difficult task due to the high genetic diversity and consequent drug resistance in viruses.9 Initially, antivirals were discovered using traditional trial‐and‐error methods.10 However, it was a very lengthy process for discovering effective antivirals.10, 11 Later, research on virology helped to identify many target pathways to block viral multiplication.12, 13 Scientists are now using rational drug design strategies for developing antivirals that target the viruses at different stages of their life cycles.14 During the past decade, many new drugs have been successfully identified in controlling the viral replication in host cells, for example, maraviroc (inhibits human immunodeficiency virus or HIV entry), pleconaril (inhibits picornavirus uncoating), acyclovir (inhibits herpesvirus replication), and oseltamivir (inhibits influenza release).9, 15 To save time and money for discovering a new drug, researchers have widely used various computational methods to screen virtual libraries of compounds before the synthesis and animal testing of chemicals. Among the different approaches, quantitative structure–activity relationship (QSAR) is mostly used.16, 17, 18 In this approach, relationships connecting molecular descriptors and activity are used to predict the property of other molecules.19 Molecular descriptors transform the chemical information (structure and linking of groups) of a molecule into simple numbers.20 QSAR‐based virtual screening is an effective computational technique leading toward identification and design of novel antiviral agents.21 Lately, many dedicated bioinformatic resources have been developed for antivirals. For example, in the area of RNA interference resources published are VIRsiRNAdb—antiviral siRNAs resource for about 42 disease causing viruses,22 HIVsirDB—anti HIV siRNAs database,23 VIRsiRNApred—antiviral siRNA inhibition efficacy predictor,24 and VIRmiRNA—database of virus encoded miRNAs including antiviral miRNAs.25 Similarly, for peptide‐based antivirals, a few web servers have also been created like AVPdb—collection of antiviral peptides targeting more than 60 medically important viruses,26 HIPdb—HIV inhibiting peptide repository,27 and AVPpred—predictor of antiviral activity of peptides.28. Many general depositories provide information of antiviral molecules. For example, ChEMBL,29 PubChem—a database of molecules and their activities,30 ZINC—database of commercial compounds for virtual screening,31 and DrugBank—a knowledgebase for drugs and drug targets.32 In addition, there are a few QSAR studies targeting specific viral proteins.33, 34, 35, 36, 37, 38, 39, 40, 41 However, till date there is no web server/software, which can regressively predict the percentage inhibition value of a compound against different human viruses under a single platform. To cater this need, we developed AVCpred, a web server for prediction and design of antiviral compounds. In this method, we used previously known AVCs against HIV, hepatitis C virus (HCV), hepatitis B virus (HBV), human herpesvirus (HHV) and 26 other viruses with experimentally validated percentage inhibition from ChEMBL, a large‐scale bioactivity database for drug discovery.29 This was followed by descriptor calculation and selection of best performing molecular descriptors. The latter were then used as input for support vector machine (in regression mode) to develop QSAR models for different viruses as well as a general model for other viruses. We have integrated these models in the AVCpred web server, which will be helpful for virtual screening of AVCs and designing new compounds to target the viruses.

Methods and Materials

Datasets

In this study, we have used different datasets of AVCs having experimentally verified percent inhibition values against HIV, HCV, HHV, HBV and a general dataset having AVCs against 26 human viruses. The data were obtained from the ChEMBL resource ( https://www.ebi.ac.uk/chembl/). The desired data were fetched using target browser (taxonomy tree) as well as target search using keywords such as HIV, HCV, HBV, HHV, virus, viral, viruses. Initially, among the AVCs, the majority of data belonged to HIV (1383 compounds), HCV (803 compounds), HHV (473 compounds), HBV (416 compounds), and other viruses (1635 compounds). After filtering entries with desired information and removing redundant entries, we were left with 389 compounds for HIV, 467 in case of HCV, 124 for HHV, 112 against HBV, and 1391 AVCs targeting the 26 viruses (Table 1 and Table S1). These datasets were used for descriptor selection and model development. The datasets are available along with references on the web server and can be downloaded from this URL: http://crdd.osdd.net/servers/avcpred/datasets.php.
Table 1

Creation of datasets for the development of prediction models

S. no.VirusOverall dataData filtera
Percent inhibition1 Reference2 Non‐redundant3
1Human immunodeficiency virus (HIV)1383594535389
2Hepatitis C virus (HCV)803648618467
3Hepatitis B virus (HBV)416284283112
4Human herpesvirus (HHV)473312278124
5General (26 viruses)b 5684266216351391

Data from ChEMBL were filtered, and only compounds with [1] percent inhibition, [2] reference, and [3] non‐redundant SMILES were considered.

The general dataset is comprised of below viruses with unique number of AVCs in brackets: Dengue virus 1,1 dengue virus 2,16 enterovirus,30 human adenovirus 5,41 human cox B1,4 human cox B5,21 human echovirus 13,3 human echovirus 9,2 human enterovirus 71,19 human enterovirus C,1 human polio virus 1,4 human rhinovirus,1 human rhinovirus 14,29 human rhinovirus 1B,18 human rhinovirus 2,2 human T lymphotropic virus,42 influenza A,36 influenza A (H1N1),16 influenza B,1 monkeypox virus,1 respiratory syncytial virus,4 Rift Valley fever virus (Cercopithecidae),1 sandfly fever Sicilian virus,2 SARS coronavirus,23 simian virus 40,45 Sindbis virus,4 vaccinia virus,12 vaccinia virus WR,22 variola virus,1 vesicular stomatitis virus,63 West Nile virus,17 yellow fever virus.51

Creation of datasets for the development of prediction models Data from ChEMBL were filtered, and only compounds with [1] percent inhibition, [2] reference, and [3] non‐redundant SMILES were considered. The general dataset is comprised of below viruses with unique number of AVCs in brackets: Dengue virus 1,1 dengue virus 2,16 enterovirus,30 human adenovirus 5,41 human cox B1,4 human cox B5,21 human echovirus 13,3 human echovirus 9,2 human enterovirus 71,19 human enterovirus C,1 human polio virus 1,4 human rhinovirus,1 human rhinovirus 14,29 human rhinovirus 1B,18 human rhinovirus 2,2 human T lymphotropic virus,42 influenza A,36 influenza A (H1N1),16 influenza B,1 monkeypox virus,1 respiratory syncytial virus,4 Rift Valley fever virus (Cercopithecidae),1 sandfly fever Sicilian virus,2 SARS coronavirus,23 simian virus 40,45 Sindbis virus,4 vaccinia virus,12 vaccinia virus WR,22 variola virus,1 vesicular stomatitis virus,63 West Nile virus,17 yellow fever virus.51

Descriptor calculation

To develop virus specific as well as general QSAR models, we computed about 18000 chemical descriptors (1D, 2D, and 3D), including geometric, constitutional, electrostatic, topological, hydrophobic, binary fingerprints, using PaDEL, an open‐source software to calculate molecular descriptors and fingerprints.42

Feature selection

To improve the speed of calculation, we selected the most essential descriptors using ‘RemoveUseless’ filter followed by ClassifierSubsetEval (attribute evaluator) with BestFirst (search method) module available in Weka package.43 ClassifierSubsetEval evaluates attribute subsets on training/testing data using a classifier to estimate the merit of a set of attributes.44, 45 The selected descriptors were then used to develop the QSAR models (Table S3).

Machine learning

We developed individual QSAR models for each of the 4 viruses (HIV, HCV, HHV, and HBV) as well as a general model comprising 26 different viruses using SMOreg algorithm46 in Weka machine learning software43 freely available at http://www.cs.waikato.ac.nz/ml/weka. SMOreg implements the support vector machine in regression mode. In SMOreg, Pearson VII function‐based universal kernel (Puk) and RegSMOImproved optimizer were used along with parameters such as (i) the regularization constant/complexity value (c) that allows trade‐off between training error and margin, (ii) the omega exponent value (ω) that controls peak half‐width, and (iii) the sigma bandwidth value (σ) that controls peak tailing factor.47, 48 Simultaneously, software SVMlight (freely available at http://svmlight.joachims.org) was employed for machine learning in classification mode. In SVMlight, radial basis function (RBF) kernel was used with parameters (i) gamma (g) that defines how far the influence of a single training example reaches and (ii) complexity constant (c) that allows trade‐off amid training error and margin.49 Selected molecular descriptors and fingerprints were used as input features for the development of QSAR models.

Evaluation

In order to evaluate performance of our models, we employed a number of statistical parameters including Pearson's correlation coefficient, coefficient of determination, mean absolute error root‐mean‐square error, sensitivity, specificity, accuracy, and Mathew's correlation coefficient as briefly described below. The Pearson's correlation coefficient (R) is a measure of correlation between two variables.where n is the size of test set, and E pred and E act is the predicted and actual efficacy of AVCs respectively. A value of 1 denotes total positive correlation, 0 is no correlation, and −1 is total negative correlation. The coefficient of determination (R 2) indicates how well data fit a statistical model. An R 2 of 1 indicates that the model perfectly fits the data, while an R 2 of 0 means that the model does not fit the data at all. The mean absolute error (MAE) measure indicates how close the predictions are to the eventual outcomes.where E pred is the prediction, E act the true value, and | E pred – E act | the absolute error. MAEs are negatively oriented scores; that is, lower values are better. The root‐mean‐square error (RMSE) measures the average magnitude of the error.RMSEs are also negatively oriented scores; that is, lower values are better. Sensitivity (Sn) or the true positive rate measures the percentage of correctly identified positives.An ideal predictor would be expressed as 100% sensitive. Specificity (Sp) or the true negative rate measures the percentage of correctly identified negativesAn ideal predictor would be expressed as 100% specific. Accuracy (Ac) is the percentage of correct results (i.e. both true positives and true negatives) among the total number of cases.An ideal predictor would be expressed as 100% accurate. The Matthew's correlation coefficient (MCC) is used in machine learning to evaluate the performance of binary classifications.In the above Eqs. (4), (5), (6), (7), TP, FP, TN, and FN represent the true positives, false positives, true negatives, and false negatives respectively. Its value ranges from −1 to 1 and a value close to 1 means a better prediction.

Results

Performance of QSAR models

In order to identify the most effective features or descriptors of antiviral drugs, we computed the correlation between selected chemical features of antiviral drugs and their percent inhibition using comprehensive pharmacological screening datasets from ChEMBL29 (Figure 1).
Figure 1

Schematic diagram demonstrating workflow of AVCpred

Schematic diagram demonstrating workflow of AVCpred After attribute selection, the relevant descriptors were 45 for HIV, 52 for HCV, 15 for HBV, 20 for HHV, and 65 for rest of the viruses. A combination of selected chemical descriptors like partial charge, atom‐type electrotopological state, extended topochemical atom, chi cluster, weighted path, and fingerprints based on substructure, graph, path, and extended features including PubChem and Klekota‐Roth were found to be useful in prediction. The selected descriptors were then used to develop the QSAR models (Table S3). During 10‐fold cross‐validation, we achieved maximum Pearson's correlation coefficient (PCC) of 0.72 in case of HIV; 0.74 in case of HCV; 0.66 in case of HBV; 0.68 in case of HHV; and 0.71 in case of rest of the viruses. Also during validation on independent dataset, we achieved a maximum PCC of 0.63, 0.65, 0.61, 0.64, and 0.67, respectively (Table 2). Other statistical parameters used in the development of QSAR models are depicted in Table S2. A scatter plot between actual and predicted efficacy in each case is shown in Figure 2.
Table 2

Pearson correlation values obtained for each viral dataset on their respective QSAR models

S. no.VirusAntiviral compoundsNo. of selected descriptorsPearson's correlation coefficient (PCC)
TotalTrainingValidationTrainingValidation
(10x)a
1Human immunodeficiency virus (HIV)38935138450.720.63
2Hepatitis C virus (HCV)46742146520.740.65
3Hepatitis B virus (HBV)11210111150.660.61
4Human herpesvirus (HHV)12411212200.680.64
5General (26 viruses)b 13911252139650.710.67

10‐fold cross‐validation.

The general dataset is comprised of below viruses with unique number of AVCs in brackets: Dengue virus 1,1 dengue virus 2,16 enterovirus,30 human adenovirus 5,41 human cox B1,4 human cox B5,21 human echovirus 13,3 human echovirus 9,2 human enterovirus 71,19 human enterovirus C,1 human polio virus 1,4 human rhinovirus,1 human rhinovirus 14,29 human rhinovirus 1B,18 human rhinovirus 2,2 human T lymphotropic virus,42 influenza A,36 influenza A (H1N1),16 influenza B,1 monkeypox virus,1 respiratory syncytial virus,4 Rift Valley fever virus (Cercopithecidae),1 sandfly fever Sicilian virus,2 SARS coronavirus,23 simian virus 40,45 Sindbis virus,4 vaccinia virus,12 vaccinia virus WR,22 variola virus,1 vesicular stomatitis virus,63 West Nile virus,17 yellow fever virus.51

Figure 2

Scatter plot between actual and predicted percentage inhibition on independent validation datasets of (A) HIV, (B) HCV, (C) HBV, (D) HHV, and (E) general (26 viruses)

Pearson correlation values obtained for each viral dataset on their respective QSAR models 10‐fold cross‐validation. The general dataset is comprised of below viruses with unique number of AVCs in brackets: Dengue virus 1,1 dengue virus 2,16 enterovirus,30 human adenovirus 5,41 human cox B1,4 human cox B5,21 human echovirus 13,3 human echovirus 9,2 human enterovirus 71,19 human enterovirus C,1 human polio virus 1,4 human rhinovirus,1 human rhinovirus 14,29 human rhinovirus 1B,18 human rhinovirus 2,2 human T lymphotropic virus,42 influenza A,36 influenza A (H1N1),16 influenza B,1 monkeypox virus,1 respiratory syncytial virus,4 Rift Valley fever virus (Cercopithecidae),1 sandfly fever Sicilian virus,2 SARS coronavirus,23 simian virus 40,45 Sindbis virus,4 vaccinia virus,12 vaccinia virus WR,22 variola virus,1 vesicular stomatitis virus,63 West Nile virus,17 yellow fever virus.51 Scatter plot between actual and predicted percentage inhibition on independent validation datasets of (A) HIV, (B) HCV, (C) HBV, (D) HHV, and (E) general (26 viruses) In addition, we also checked the performance of our models developed using classification mode of machine learning. During 10‐fold cross‐validation, we achieved maximum Matthew's correlation coefficient (MCC) of 0.91 in case of HIV; 0.93 in case of HCV; 0.70 in case of HBV; 0.89 in case of HHV; and 0.71 in case of rest of the viruses. Similar performance was shown on the independent datasets (Table 3). The machine learning parameters used for model development are shown in Table 4. Receiver operating characteristic (ROC) plots illustrating the performance of the QSAR models are shown in Figure 3.
Table 3

Performance of QSAR models obtained for each viral dataset using classification mode of machine learning

S. no.VirusTraining/Testing (10‐fold)Validation
SensitivitySpecificityAccuracyMCCSensitivitySpecificityAccuracyMCC
1HIV94.3096.4095.100.9188.1082.3086.100.70
2HCV96.9096.4096.600.9386.6187.2086.800.73
3HBV87.1081.6085.800.7087.2080.4084.300.69
4HHV93.4092.3093.500.8987.1091.3088. 60.77
5General (26 viruses)88.3082.2085.700.7181.7082.1081.900.64
Table 4

Machine learning parameters selected for the development of the QSAR models

S. no.ModelParameters
SMOregSVMlight
KernelOptimizer c ωσKernel g c
1HIVPukRegSMOImproved423RBF0.02200
2HCVPukRegSMOImproved555RBF0.0250
3HBVPukRegSMOImproved0.10.30.3RBF0.001300
4HHVPukRegSMOImproved333RBF0.1100
5General (26 viruses)PukRegSMOImproved335RBF0.0150

Abbreviations: Puk: Pearson VII function‐based universal kernel. RegSMOImproved: optimizer for algorithm speed improvement. c: regularization constant/complexity parameter allows trade‐off between training error and margin. ω: omega exponent value (controls half‐width of the peak) σ: sigma bandwidth value (controls tailing factor of the peak). RBF: radial basis function g: parameter gamma in RBF kernel.

Figure 3

ROC curves depicting performance of QSAR models for (A) HIV, (B) HCV, (C) HBV, (D) HHV, and (E) general (26 viruses)

Performance of QSAR models obtained for each viral dataset using classification mode of machine learning Machine learning parameters selected for the development of the QSAR models Abbreviations: Puk: Pearson VII function‐based universal kernel. RegSMOImproved: optimizer for algorithm speed improvement. c: regularization constant/complexity parameter allows trade‐off between training error and margin. ω: omega exponent value (controls half‐width of the peak) σ: sigma bandwidth value (controls tailing factor of the peak). RBF: radial basis function g: parameter gamma in RBF kernel. ROC curves depicting performance of QSAR models for (A) HIV, (B) HCV, (C) HBV, (D) HHV, and (E) general (26 viruses)

Web server

The QSAR models have been integrated into a freely available and easy to use web server, ‘AVCpred’, where users can predict the antiviral potential of their query molecules against the different viruses in terms of percent inhibition value. AVCpred web server includes the following modules:

Submission

This allows users to submit on or more molecules at a time. Users have to choose the viruses on which they want to test their query chemical compounds. On submission, it returns with percent inhibition values against the selected viruses. Also users can view the different properties of the query molecule such as structure, charge, molecular weight, logP value, hydrogen and Lipinski bond donors/acceptors, rigid and rotatable bonds to identify drug‐like molecular structures (Figure 4).
Figure 4

AVCpred submission form with output

AVCpred submission form with output

Design analogs

It has been found that analogs of known chemical compounds are sometimes more effective than the parent molecule.50 In order to identify potent analogs of an existing AVC, we have included the ‘Design analogs’ tool, where user can design analogs based on given building blocks and predict their inhibition on the viruses.

Draw structure

Using the ‘Draw tool’, one can sketch the structure of the query molecule using Marvin editor (Figure 5). This tool also gives the predicted percent inhibition values against the different viruses. In addition, one can view the various properties of the query structure.
Figure 5

Web interface of ‘ pred Draw’ tool

Web interface of ‘ pred Draw’ tool

Search

AVCpred also provides the users a search tool to browse the compounds used in our datasets. In this module, different compounds targeting the viruses are stored in a database. The records can be readily searched, filtered/sorted, and downloaded via the web interface.

Implementation

AVCpred has been developed using the open‐source LAMP (Linux‐Apache‐MySQL‐PHP) system. The prediction software runs on Red Hat Enterprise Linux 5 environment using Apache http://httpd server.

Discussion

To inhibit viral growth, the antiviral molecules or drugs target different phases of viral life cycle such as fusion, integration, replication, maturation and should be relatively non‐toxic to the host organism.51, 52 Each stage can be targeted using AVCs that can, for example, inhibit entry receptors (CD4, CCR5) or viral enzymes (protease, neuraminidase).53, 54, 55 Various AVCs are currently in medical use, and new ones are in clinical trials.56, 57 Finding new and improved viral inhibitors is a major concern in the treatment of deadly human viruses.58, 59 However, discovery of novel AVCs is a tedious process.60 To speed up the identification of new AVCs, a computational approach using QSAR method is a rational strategy to decrease cost and time efforts in the wet laboratory.20 QSAR techniques have been widely used in drug designing and further identification of lead molecules.17 Although there are many QSAR studies pertaining to different types of viral protein inhibitors, they are very specific in their approach and deal with a particular class of inhibitors such as endonuclease inhibitors33 in which 40 compounds were used and reached a correlation of 0.76, thiourea derivatives34 where 85 compounds had a correlation of 0.92, protease inhibitors39 in which 170 compounds had a correlation of 0.60–0.83, and flavonoid inhibitors38 where 20 compounds had a correlation of 0.75–0.97 etc. (Table 5). In most of the cases, the studies are carried out on a limited number of inhibitors. Due to this reason, they predict the inhibitors that are similar to the compound type with a high correlation, but do not work on other dissimilar inhibitors for the same target virus. To address these limitations, AVCpred models have been developed using diverse and large number of inhibitors. In the current algorithm, we have employed antiviral compound datasets from different studies due to which the overall correlation is less than above studies, yet the models are comparatively more robust to predict different classes of inhibitors. However, as new high‐throughput screening data tested under homogeneous conditions on antiviral drugs becomes available, performance of the QSAR method can be improved.
Table 5

Existing QSAR studies pertaining to antiviral compounds

S. no.Compound typeNo. of compoundsCorrelationTarget virusWeb server/SoftwareYearReferences
1PA endonuclease inhibitors400.76INFVNo2014 33
2Thiourea derivatives850.92HCVNo2013 34
3Integrase inhibitors770.98HIVNo2012 35
4Three different series of HBV inhibitors300.92HBVNo2010 36
5HIV‐1 entry inhibitors360.72HIVNo2010 37
6Neuraminidase flavonoid inhibitors200.75–0.97H1N1No2010 38
7Protease inhibitors1700.6–0.83HIVNo2010 39
8HPV6‐E1 helicase ATPase inhibitorsFull text not available0.92HPVNo2010 40
9Thymidine kinase N2‐phenylguanine inhibitors200.85–0.98HSVNo2000 41
Existing QSAR studies pertaining to antiviral compounds In this study, we developed virus specific as well as general prediction models to identify the likelihood of a compound being antiviral using selected chemical attributes of experimentally validated AVCs. PaDEL, an open‐source software, was used to calculate molecular descriptors and fingerprints. However, the software calculates a large number of descriptors, and hence, we used attribute selection approach to reduce their number by eliminating unrelated and extraneous descriptors to get a highly correlated descriptor set. Our analysis revealed that several chemical descriptors are important in predicting the compound inhibition activity, for example, partial charge, atom‐type electrotopological state, extended topochemical atom, chi cluster, weighted path, and fingerprints. We employed machine learning to train the QSAR models on different sets of experimentally validated data. These models were validated on independent datasets, not used during training, and were found to have satisfactory performance. We used the pharmacological data from the ChEMBL resource for training/testing the models developed for general as well as specific viruses. These models were integrated in an open‐source web server for evaluation and screening of antiviral compounds. The applicability domain of the QSAR models was demonstrated using Williams plot (Figure 6) in which standardized residuals are plotted against leverages.61 If the standardized residual of a compound is greater than three times standard deviation units (±3σ), the compound is treated as an outlier. The warning value of leverage (h*) is considered as 3p/n, where p is the number of model descriptors plus one and n is the number of training compounds.62, 63 If the leverage of a compound exceeds h*, it is regarded as dissident. The plots demonstrate that the leverages of majority of the compounds do not surpass the critical value (h*) in the regression models, and hence, the compounds are within the chemical domain, implying that the predictivity of the models is reliable.
Figure 6

Applicability domain plots of the QSAR models for (A) HIV, (B) HCV, (C) HBV, (D) HHV, and (E) general (26 viruses)

Applicability domain plots of the QSAR models for (A) HIV, (B) HCV, (C) HBV, (D) HHV, and (E) general (26 viruses) The web server also provides useful services like designing analogs based on given building blocks and drawing structure to sketch novel compounds and predict their inhibition potential against multiple viruses. The AVCpred algorithm is hoped to assist the researchers in discovering novel antiviral compounds as well as virtually check the effect of modifications on existing drugs.

Conclusions

AVCpred is the first web‐based algorithm for prediction of AVCs based on experimentally validated datasets. Five prediction models pertaining to HIV, HCV, HHV, HBV, and a general one were implemented in the web server to make comprehensive predictions. In addition, tools for drug design, virtual screening, and collection of existing AVCs have also been integrated. This web server would be helpful for researchers working for the development of antiviral therapeutics.

Competing interests

The authors declare that they have no competing interests. Click here for additional data file.
  59 in total

Review 1.  Rational approaches to the design of antiviral agents based on S-adenosyl-L-homocysteine hydrolase as a molecular target.

Authors:  S Liu; M S Wolfe; R T Borchardt
Journal:  Antiviral Res       Date:  1992-09       Impact factor: 5.970

2.  A QSAR study of HIV protease inhibitors using theoretical descriptors.

Authors:  Subhash C Basak; Denise Mills; Rajni Garg; Barun Bhhatarai
Journal:  Curr Comput Aided Drug Des       Date:  2010-12       Impact factor: 1.606

3.  Multi-target spectral moment: QSAR for antiviral drugs vs. different viral species.

Authors:  Francisco J Prado-Prado; Fernanda Borges; Eugenio Uriarte; Lazaro G Peréz-Montoto; Humberto González-Díaz
Journal:  Anal Chim Acta       Date:  2009-08-25       Impact factor: 6.558

Review 4.  Antiviral drug discovery: broad-spectrum drugs from nature.

Authors:  J P Martinez; F Sasse; M Brönstrup; J Diez; A Meyerhans
Journal:  Nat Prod Rep       Date:  2015-01       Impact factor: 13.423

Review 5.  Next-generation sequencing technology in clinical virology.

Authors:  M R Capobianchi; E Giombini; G Rozera
Journal:  Clin Microbiol Infect       Date:  2013-01       Impact factor: 8.067

6.  AIDS therapy. First tentative signs of therapeutic promise.

Authors:  K Wright
Journal:  Nature       Date:  1986 Sep 25-Oct 1       Impact factor: 49.962

7.  Novel structural analogues of piperine as inhibitors of the NorA efflux pump of Staphylococcus aureus.

Authors:  Ashwani Kumar; Inshad Ali Khan; Surrinder Koul; Jawahir Lal Koul; Subhash Chandra Taneja; Intzar Ali; Furqan Ali; Sandeep Sharma; Zahid Mehmood Mirza; Manoj Kumar; Pyare Lal Sangwan; Pankaj Gupta; Niranjan Thota; Ghulam Nabi Qazi
Journal:  J Antimicrob Chemother       Date:  2008-03-10       Impact factor: 5.790

8.  PubChem BioAssay: 2014 update.

Authors:  Yanli Wang; Tugba Suzek; Jian Zhang; Jiyao Wang; Siqian He; Tiejun Cheng; Benjamin A Shoemaker; Asta Gindulyte; Stephen H Bryant
Journal:  Nucleic Acids Res       Date:  2013-11-05       Impact factor: 16.971

Review 9.  A review on antibacterial, antiviral, and antifungal activity of curcumin.

Authors:  Soheil Zorofchian Moghadamtousi; Habsah Abdul Kadir; Pouya Hassandarvish; Hassan Tajik; Sazaly Abubakar; Keivan Zandi
Journal:  Biomed Res Int       Date:  2014-04-29       Impact factor: 3.411

10.  Designing of promiscuous inhibitors against pancreatic cancer cell lines.

Authors:  Rahul Kumar; Kumardeep Chaudhary; Deepak Singla; Ankur Gautam; Gajendra P S Raghava
Journal:  Sci Rep       Date:  2014-04-14       Impact factor: 4.379

View more
  15 in total

1.  SMEpred workbench: A web server for predicting efficacy of chemicallymodified siRNAs.

Authors:  Showkat Ahmad Dar; Amit Kumar Gupta; Anamika Thakur; Manoj Kumar
Journal:  RNA Biol       Date:  2016-09-07       Impact factor: 4.652

2.  Targeting non-structural proteins of Hepatitis C virus for predicting repurposed drugs using QSAR and machine learning approaches.

Authors:  Sakshi Kamboj; Akanksha Rajput; Amber Rastogi; Anamika Thakur; Manoj Kumar
Journal:  Comput Struct Biotechnol J       Date:  2022-06-30       Impact factor: 6.155

3.  aBiofilm: a resource of anti-biofilm agents and their potential implications in targeting antibiotic drug resistance.

Authors:  Akanksha Rajput; Anamika Thakur; Shivangi Sharma; Manoj Kumar
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

4.  Bioactivity Comparison across Multiple Machine Learning Algorithms Using over 5000 Datasets for Drug Discovery.

Authors:  Thomas R Lane; Daniel H Foil; Eni Minerali; Fabio Urbina; Kimberley M Zorn; Sean Ekins
Journal:  Mol Pharm       Date:  2020-12-16       Impact factor: 4.939

5.  S2DV: converting SMILES to a drug vector for predicting the activity of anti-HBV small molecules.

Authors:  Jinsong Shao; Qineng Gong; Zeyu Yin; Wenjie Pan; Sanjeevi Pandiyan; Li Wang
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

6.  Exploring general-purpose protein features for distinguishing enzymes and non-enzymes within the twilight zone.

Authors:  Yasser B Ruiz-Blanco; Guillermin Agüero-Chapin; Enrique García-Hernández; Orlando Álvarez; Agostinho Antunes; James Green
Journal:  BMC Bioinformatics       Date:  2017-07-21       Impact factor: 3.169

7.  HIVprotI: an integrated web based platform for prediction and design of HIV proteins inhibitors.

Authors:  Abid Qureshi; Akanksha Rajput; Gazaldeep Kaur; Manoj Kumar
Journal:  J Cheminform       Date:  2018-03-09       Impact factor: 5.514

Review 8.  Recent Advances in Understanding, Diagnosing, and Treating Hepatitis B Virus Infection.

Authors:  Magda Rybicka; Krzysztof Piotr Bielawski
Journal:  Microorganisms       Date:  2020-09-15

9.  Prediction of repurposed drugs for Coronaviruses using artificial intelligence and machine learning.

Authors:  Akanksha Rajput; Anamika Thakur; Adhip Mukhopadhyay; Sakshi Kamboj; Amber Rastogi; Sakshi Gautam; Harvinder Jassal; Manoj Kumar
Journal:  Comput Struct Biotechnol J       Date:  2021-05-24       Impact factor: 7.271

10.  Better understanding and prediction of antiviral peptides through primary and secondary structure feature importance.

Authors:  Abu Sayed Chowdhury; Sarah M Reehl; Kylene Kehn-Hall; Barney Bishop; Bobbie-Jo M Webb-Robertson
Journal:  Sci Rep       Date:  2020-11-06       Impact factor: 4.996

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.