| Literature DB >> 35922786 |
Linda Erlina1,2, Rafika Indah Paramita3,4, Wisnu Ananta Kusuma5,6, Fadilah Fadilah1,2, Aryo Tedjo1,2, Irandi Putra Pratomo2,7, Nabila Sekar Ramadhanti8, Ahmad Kamal Nasution8, Fadhlal Khaliq Surado8, Aries Fitriawan8, Khaerunissa Anbar Istiadi2,9, Arry Yanuar10.
Abstract
BACKGROUND: The number of COVID-19 cases continues to grow in Indonesia. This phenomenon motivates researchers to find alternative drugs that function for prevention or treatment. Due to the rich biodiversity of Indonesian medicinal plants, one alternative is to examine the potential of herbal medicines to support COVID therapy. This study aims to identify potential compound candidates in Indonesian herbal using a machine learning and pharmacophore modeling approaches.Entities:
Keywords: 3CLPro; COVID-19; Indonesian Herbal Compounds; Machine Learning; Molecular Docking; Pharmacophore Modeling; SARS-CoV-2
Mesh:
Substances:
Year: 2022 PMID: 35922786 PMCID: PMC9347098 DOI: 10.1186/s12906-022-03686-y
Source DB: PubMed Journal: BMC Complement Med Ther ISSN: 2662-7671
Fig. 1Study Workflow
List of potential protein target related to COVID-19
| Virus-based protein | Host-based protein | |||||
|---|---|---|---|---|---|---|
| PDB/Uniprot ID | Protein | Reference | Uniprot name | Uniprot ID | Protein | Reference |
| 6LU7:A | 3CLPro | [ | ACE2 | Q9BYF1 | ACE2 | [ |
| PLpro_SARS-CoV-2 | PLPro | [ | AKT1 | P31749 | AKT | [ |
| K4LC41 | ||||||
| PYRD | Q02127 | DHODH | [ | |||
| yp_009725307.1 | RdRp | [ | PPIA | P62937 | ||
| PPIG | Q13427 | |||||
| 6M0J:E | Spike-ACE2 | [ | FKBP5 | Q13451 | ||
| 6LZG:A | FKBP4 | Q02790 | ||||
| 6VSB | FKBP2 | P26885 | ||||
| 6M0J:A | ||||||
| CYP5 | P52013 | PPIASE | [ | |||
| FKB1B | P68106 | |||||
| PPIB | P23284 | |||||
| PPIC | P45877 | |||||
| PPIH | O43447 | |||||
| FKB1A | P62942 | |||||
| IL6RB | P40189 | IL-6 | [ | |||
List of potential drug explored from SuperTarget database
| Drug | Protein Target |
|---|---|
| Moexipril hydrochloride | ACE2 |
| Arsentrioxide | AKT1 |
| Arthrocine | |
| Celecoxib | |
| Erlotinib | |
| Gefitinib | |
| Imatinib Mesylate | |
| Lapatinib ditosylate | |
| Simvastatin | |
| Sorafenibum | |
| Sunitinib | |
| Atovaquone | PYRD, PPIA, PPIG |
| Essigsaeure | |
| Huanghuahaosu | |
| Hydroxycinchophene | |
| Leflunomide | |
| Rapamycin | FKBP5, FKBP4, FKBP2, FKB1B, FKB1A |
| Athylenglykol | FKBP4 |
| Methylsulfinylmethane | |
| Dithiothreitol | CYP5_CAEEL |
| Carboxypyrrolidine | PPIB, PPIC, PPIH |
| Pimecrolimus | FKB1A |
| Tacrolimus | |
| Thiabendazole |
Fig. 2Support vector machine method (x is the data, w is the weight vector, b is the bias score, ε is the minimum error) [28]
Fig. 3a schematic representation of model training and validation approaches for our proposed machine learning methodology
Fig. 4The scheme of predicting Indonesian herbal compounds using the optimal and validated model that was generated by RF, MLP, and SVM in training and validation phase. The class probability score was averaged from three methods. The counter was conducted to indicate the number of methods which predict positive results. The decision was determined based on the criteria of the class probability ≥ 0.5 or at least predicted by two methods.
The performance of each model calculated using 30% of dataset that was excluded from training set
| Method | Performance Measure | Value |
|---|---|---|
| Multilayer Perceptron (MLP) | AUC | 0.98405 |
| F-measure | 0.98254 | |
| Precision | 0.96628 | |
| Recall | 0.99936 | |
| Accuracy | 0.98321 | |
| Random Forest (RF) | AUC | 0.98734 |
| F-measure | 0.98608 | |
| Precision | 0.97255 | |
| Recall | 1 | |
| Accuracy | 0.98665 | |
| Support Vector Machine (SVM) | AUC | 0.99919 |
| F-measure | 0.99911 | |
| Precision | 0.99847 | |
| Recall | 0.99975 | |
| Accuracy | 0.99915 |
The predicted potential compounds targeting 3CLPro, PLPro, and RdRp
| No | Protein Target | Herbal Compound |
|---|---|---|
| 1. | 3CLPro | Amaranthine, 8-Methylthio-octyl glucosinolate, Arabinopyrano, Peonidin 3-(4’arabinosylglucoside), Quercetin 3-(2G-rhamnosylrutinoside), Sinigrin, Hesperidin, Myricetin-3-glucoside, (+)-2,3-Dihydro-9-hydroxy-2 [1-(6-sinapinoyl)beta-D-glucosyloxy-1-methylethyl]-7H-propanoat, Cyanidin-3-sophoroside-5-glucoside, Scutellarein-6,4’-dimethyl ether-7-(6”-acetylglucoside),, Spiraeoside, Glucoputranjivin, Isoforskolin, Kaempferol 3-alpha-D-arabinopyranoside |
| 2. | PLPro | 8-Methylthio-octyl glucosinolate, Sinigrin, Glucoputranjivin |
| 3. | RdRp | 8-Methylthio-octyl glucosinolate, Arabinopyrano, Peonidin 3-(4’arabinosylglucoside), Quercetin 3-(2G-rhamnosylrutinoside), Theviridoside, Sinigrin, Hesperidin, Myricetin-3-glucoside, , (+)-2,3-Dihydro-9-hydroxy-2 [1-(6-sinapinoyl)beta-D-glucosyloxy-1-methylethyl]-7H-propanoat y, Cyanidin-3-sophoroside-5-glucoside, Catalpol, Scandoside, Scutellarein-6,4’-dimethyl ether-7-(6”-acetylglucoside), Spiraeoside, Geniposide, Oleoside, Majoroside, Glucoputranjivin, Isoforskolin, Kaempferol 3-alpha-D-arabinopyranoside |
Fig. 5a 3D structure complex of the main protease and N3 inhibitor (N-[(5-Methylisoxazol-3-Yl)Carbonyl]Alanyl-L-Valyl-N~1~-((1R,2Z)-4-(Benzyloxy)-4-Oxo-1-{[(3R)-2-Oxopyrrolidin-3-Yl]Methyl}But-2-Enyl)-L-Leucinamide), b pharmacophore feature of the N3 inhibitor in the main protease
Fig. 6Pharmacophore model from LBDD analysis. a Pharmacophore feature of the best pharmacophore model, b validation parameters of the best pharmacophore model.
The top-30 of hit compounds from LBDD methods
| No | Compound Name | No | Compound Name |
|---|---|---|---|
| 1 | Kaempferol 3-alpha-D-arabinopyranoside | 16 | Catalpol |
| 2 | Isoforskolin | 17 | Cyanidin-3-sophoroside-5-glucoside |
| 3 | Glucoputranjivin | 18 | (+)-2,3-Dihydro-9-hydroxy-2 [1-(6-sinapinoyl)beta-D-glucosyloxy-1-methylethyl]-7H-propanoat |
| 4 | Loganic Acid | 19 | Myricetin 3-glucoside |
| 5 | Majoroside | 20 | Hesperidin |
| 6 | Oleoside | 21 | Azadirachtin A |
| 7 | Geniposide | 22 | 1-Caffeoyl-beta-D-glucose |
| 8 | Glucobrassicin | 23 | Sinigrin |
| 9 | Spiraeoside | 24 | Theviridoside |
| 10 | Alizarin | 25 | Quercetin 3-(2G-rhamnosylrutinoside) |
| 11 | Morindone | 26 | Peonidin 3-(4’arabinosylglucoside) |
| 12 | Casuarinin | 27 | trans-p-Sinapoyl-b-D-glucopyranoside |
| 13 | Scutellarein-6,4’-dimethyl ether-7-(6”-acetylglucoside) | 28 | 6,8-Di-C-beta-D-arabinopyranosyl apigenin |
| 14 | Scandoside methyl ester | 29 | 8-Methylthio-octyl glucosinolate |
| 15 | beta-Glucogallin | 30 | Amaranthine |
Molecular docking results of 14 hit (overlapped) compounds against the main protease of SARS-CoV-2
| No | Compound name | Binding Energy (ΔG) | Sources |
|---|---|---|---|
| 1 | Cyanidin-3-sophoroside-5-glucoside | -6.52 | |
| 2 | Geniposide | -7.04 | |
| 3 | Hesperidin | -8.72 | |
| 4 | Isoforskolin | -6.88 | |
| 5 | Kaempferol 3,4'-di-O-methylether (Ermanin) | -8.51 | |
| 6 | Majoroside | -7.03 | |
| 7 | Myricetin-3-glucoside | -8.26 | |
| 8 | Oleoside | -6.52 | |
| 9 | Peonidine 3-(4’-arabinosylglucoside) | -8.52 | |
| 10 | Quercetin 3-(2G-rhamnosylrutinoside) | -8.56 | |
| 11 | Rhamnetin 3-mannosyl-(1-2)-alloside | -8.48 | |
| 12 | Sinigrin | -5.19 | |
| 13 | Spiraeoside | -7.97 | |
| 14 | Theviridoside | -7.13 | |
| 15 | Lopinavir | -9.41 | Antiviral drug (positive control) |
Fig. 7Interaction of ligands with receptor (3CLpro / main protease); red quarter circles were residue of protein that have non-covalent bond interaction with ligand; residues that written in green colour were residue which had hydrogen bonds interaction with ligand (written with its distance as well). a Lopinavir; b Hesperidin; c Kaempferol-3,4'-di-O-methyl ether (Ermanin); d Myricetin-3-glucoside; e Peonidine 3-(4’-arabinosylglucoside); f Quercetin 3-(2G-rhamnosylrutinoside); g Rhamnetin 3-mannosyl-(1-2)-alloside (visualization software using LigPlot [69])