| Literature DB >> 36034833 |
Wisnu Ananta Kusuma1,2, Zulfahmi Ibnu Habibi1, Muhammad Fahmi Amir1, Aulia Fadli1, Husnul Khotimah1, Vektor Dewanto1, Rudi Heryanto2,3.
Abstract
Jamu is an Indonesian traditional herbal medicine that has been practiced for generations. Jamu is made from various medicinal plants. Each plant has several compounds directly related to the target protein that are directly associated with a disease. A pharmacological graph can form relationships between plants, compounds, and target proteins. Research related to the prediction of Jamu formulas for some diseases has been carried out, but there are problems in finding combinations or compositions of Jamu formulas because of the increase in search space size. Some studies adopted the drug-target interaction (DTI) implemented using machine learning or deep learning to predict the DTI for discovering the Jamu formula. However, this approach raises important issues, such as imbalanced and high-dimensional dataset, overfitting, and the need for more procedures to trace compounds to their plants. This study proposes an alternative approach by implementing bipartite graph search optimization using the branch and bound algorithm to discover the combination or composition of Jamu formulas by optimizing the search on a plant-protein bipartite graph. The branch and bound technique is implemented using the search strategy of breadth first search (BrFS), Depth First Search, and Best First Search. To show the performance of the proposed method, we compared our method with a complete search algorithm, searching all nodes in the tree without pruning. In this study, we specialize in applying the proposed method to search for the Jamu formula for type II diabetes mellitus (T2DM). The result shows that the bipartite graph search with the branch and bound algorithm reduces computation time up to 40 times faster than the complete search strategy to search for a composition of plants. The binary branching strategy is the best choice, whereas the BrFS strategy is the best option in this research. In addition, the the proposed method can suggest the composition of one to four plants for the T2DM Jamu formula. For a combination of four plants, we obtain Angelica Sinensis, Citrus aurantium, Glycyrrhiza uralensis, and Mangifera indica. This approach is expected to be an alternative way to discover the Jamu formula more accurately.Entities:
Keywords: branch and bound; diabetes mellitus; drug–target interaction; graph traversing; jamu
Year: 2022 PMID: 36034833 PMCID: PMC9403330 DOI: 10.3389/fphar.2022.978741
Source DB: PubMed Journal: Front Pharmacol ISSN: 1663-9812 Impact factor: 5.988
FIGURE 1Illustration of three networks of plants, compounds, and proteins, respectively represented by T, C, and P. We define three networks from different databases to get the relationship between plants and protein. Network A connects plants from KNApSAcK, compounds from KNApSAcK and PubChem, and proteins from PubChem BioAssay. Network B connects compounds in network A, compounds in network B taken from ChemmineTools, and the target protein is the same as that in Network A. Network C connects proteins from Usman et al. (2020), proteins from Uniprot, and compounds from PubChem BioAssay.
Results of data acquisition from the various databases.
| Network | Data | Data resources | Results |
|---|---|---|---|
| A | Plants | KNApSAcK | - 711 plants |
| Compound | KNApSAcK | - obtained 4926 compounds from 711 plants with 7725 interactions of plant–compound | |
| - only 581 plants have at least one compound | |||
| Compound | PubChem | - only 2780 of 4926 compounds have CID and are categorized as compound | |
| - only 541 plants have at least one compound | |||
| Target protein | PubChem BioAssay | - obtained 2308 target proteins with 131.798 interactions of compound–protein | |
| - only 1063 compounds have at least one target protein | |||
| B | Compound | ChemmineTools | - obtained 9647 compounds from the expansion of network A |
| Target protein | PubChem BioAssay | - obtained 2465 target protein from 9647 compound | |
| C | Target protein |
| - 21 target proteins associated with T2DM |
| - The score of betweenness centrality (BC) and closeness centrality (CC) | |||
| Target protein | UniProt | - MGI to GI id conversion for each target protein | |
| Compound | Pubchem BioAssay | - obtained 803 compounds have interaction with 14 target proteins of T2DM |
Protein weight normalization results.
| Gene | BC | CC | AVG | NORM |
|---|---|---|---|---|
| INS | 0.3211 | 0.6250 | 0.4731 | 1.000 |
| AKT1 | 0.2435 | 0.5128 | 0.3782 | 0.799 |
| TCF7L2 | 0.2003 | 0.5714 | 0.3859 | 0.816 |
| KCNJ11 | 0.1342 | 0.5000 | 0.3171 | 0.670 |
| UBC | 0.1097 | 0.4878 | 0.2987 | 0.632 |
| PPARG | 0.0952 | 0.5128 | 0.3040 | 0.643 |
| GCGR | 0.0780 | 0.4762 | 0.2771 | 0.586 |
| INSR | 0.0775 | 0.5000 | 0.2888 | 0.610 |
| IAPP | 0.0526 | 0.4348 | 0.2437 | 0.515 |
| SOCS3 | 0.0518 | 0.4348 | 0.2433 | 0.514 |
| EP300 | 0.0443 | 0.4167 | 0.2305 | 0.487 |
| PPARA | 0.0311 | 0.4082 | 0.2197 | 0.464 |
| WFS1 | 0.0186 | 0.4444 | 0.2315 | 0.489 |
| APOE | 0.0163 | 0.3846 | 0.2004 | 0.424 |
| FOXO1 | 0.0096 | 0.3704 | 0.1900 | 0.402 |
| STAT3 | 0.0066 | 0.3509 | 0.1787 | 0.378 |
| PTH | 0.0044 | 0.3509 | 0.1776 | 0.375 |
| CTLA4 | 0.0000 | 0.3448 | 0.1724 | 0.364 |
| MTNR1B | 0.0000 | 0.3922 | 0.1961 | 0.414 |
| PRKACA | 0.0000 | 0.3390 | 0.1695 | 0.358 |
| SOD3 | 0.0000 | 0.3448 | 0.1724 | 0.364 |
FIGURE 2Searching strategy (Morrison et al., 2016)
FIGURE 3Branching strategy (Morrison et al., 2016)
FIGURE 4Data with weight and profit.
FIGURE 5Lower bounds pruning rule.
FIGURE 6Use of the queue data structure in tree tracing.
FIGURE 7Use of the stack data structure on the tree.
FIGURE 8Priority queue with (binary) heap tree.
FIGURE 9Wide branching strategy.
Complete data on the computational time for each strategy.
| The number of plant (k) | BrFS | DFS | BFS | Wide branching | Complete search |
|---|---|---|---|---|---|
| 2 | 0.25 | 0.256 | 0.31 | 1.14 | 0.98 |
| 3 | 11.40 | 11.64 | 18.70 | 48.25 | 169.12 |
| 4 | 476.21 | 483.75 | 1070.58 | 2106.60 | 20285.02 |
FIGURE 10Comparison of search space area in log(n) units.
Execution time for composition k plants.
| Combination/Composition | Time to k (sec) | Time to- (k+1)/(k) (sec) |
|---|---|---|
| 1 | 0.005 | 68 |
| 2 | 0.34 | 744.11 |
| 3 | 253 | 131.28 |
| 4 | 33214 | — |
Best results of the composition of plants for the Jamu formula.
| Composition of plant | Latin name | Formula score |
|---|---|---|
| 2 |
| 5.26512 |
| 3 |
| 5.77630 |
| 4 |
| 6.13136 |