| Literature DB >> 35536529 |
Anish Nag1, Nikhil Dhull2, Ashmita Gupta2.
Abstract
Tea (Camellia sinensis L.) is considered as to be one of the most consumed beverages globally and a reservoir of phytochemicals with immense health benefits. Despite numerous advantages, tea compounds lack a robust multi-disease target study. In this work, we presented a unique in silico approach consisting of molecular docking, multivariate statistics, pharmacophore analysis, and network pharmacology approaches. Eight tea phytochemicals were identified through literature mining, namely gallic acid, catechin, epigallocatechin gallate, epicatechin, epicatechin gallate (ECG), quercetin, kaempferol, and ellagic acid, based on their richness in tea leaves. Further, exploration of databases revealed 30 target proteins related to the pharmacological properties of tea compounds and multiple associated diseases. Molecular docking experiment with eight tea compounds and all 30 proteins revealed that except gallic acid all other seven phytochemicals had potential inhibitory activities against these targets. The docking experiment was validated by comparing the binding affinities (Kcal mol-1) of the compounds with known drug molecules for the respective proteins. Further, with the aid of the application of statistical tools (principal component analysis and clustering), we identified two major clusters of phytochemicals based on their chemical properties and docking scores (Kcal mol-1). Pharmacophore analysis of these clusters revealed the functional descriptors of phytochemicals, related to the ligand-protein docking interactions. Tripartite network was constructed based on the docking scores, and it consisted of seven tea phytochemicals (gallic acid was excluded) targeting five proteins and ten associated diseases. Epicatechin gallate (ECG)-hepatocyte growth factor receptor (PDB id 1FYR) complex was found to be highest in docking performance (10 kcal mol-1). Finally, molecular dynamic simulation showed that ECG-1FYR could make a stable complex in the near-native physiological condition.Entities:
Keywords: Diseases; Molecular docking; Network pharmacology; Phytochemicals; Simulation; Statistics; Tea
Year: 2022 PMID: 35536529 PMCID: PMC9086669 DOI: 10.1007/s11030-022-10437-1
Source DB: PubMed Journal: Mol Divers ISSN: 1381-1991 Impact factor: 3.364
List of selected tea phytochemicals and corresponding chemical classes
| S/N | Phytochemicals | IUPAC name | Classes | PubChem IF | Structures |
|---|---|---|---|---|---|
| 1 | (−)-Epicatechin (EC) | (2R,3R)-2-(3-hydroxy-4-methoxyphenyl)-3,4-dihydro-2H-chromene-3,5,7-triol | Flavonoids-flavanols | 14,332,898 |
|
| 2 | (−)-Epicatechin gallate (ECG) | [(2R,3R)-2-(3,4-dihydroxyphenyl)-5,7-dihydroxy-3,4-dihydro-2H-chromen-3-yl] 3,4,5-trihydroxybenzoate | Flavonoids-flavanols | 107,905 |
|
| 3 | (−)-Epigallocatechin gallate (EGCG) | [(2R,3R)-5,7-dihydroxy-2-(3,4,5-trihydroxyphenyl)-3,4-dihydro-2H-chromen-3-yl] 3,4,5-trihydroxybenzoate | Flavonoids-flavanols | 65,064 |
|
| 4 | (+)-Catechin | (2R,3S)-2-(3,4-dihydroxyphenyl)-3,4-dihydro-2H-chromene-3,5,7-triol | Flavonoids-flavanols | 9064 |
|
| 5 | Quercetin | 2-(3,4-dihydroxyphenyl)-3,5,7-trihydroxychromen-4-one | Flavonoids-flavanols | 5,280,343 |
|
| 6 | Kaempferol | 3,5,7-trihydroxy-2-(4-hydroxyphenyl)chromen-4-one | Flavonoids-flavanols | 5,280,863 |
|
| 7 | Ellagic acid | 6,7,13,14-tetrahydroxy-2,9-dioxatetracyclo[6.6.2.04,16.011,15]hexadeca-1(15),4,6,8(16),11,13-hexaene-3,10-dione | Tannins | 5,281,855 |
|
| 8 | Gallic acid | 3,4,5-trihydroxybenzoic acid | Benzene and substituted derivatives-Hydroxybenzoic acid derivatives | 370 |
|
Fig. 1Schematic representation of the study
Target proteins and associated diseases along with control drugs
| S/N | UniProt ID | PDB ID | Name of the proteins | Diseases | Control drugs (Drugbank id) |
|---|---|---|---|---|---|
| 1 | O43570 | 1JCZ [ | Carbonic anhydrase 12 | Hyperchlorhidrosis isolated (HCHLH) (H01302, #143,860) | Benzthiazide (DB00562) |
| 2 | P00374 | 1BOZ [ | Dihydrofolate reductase | Dihydrofolate reductase (DHFR) deficiency (H01197) | Methotrexate (DB00563) |
| 3 | P00533 | 1IVO [ | Epidermal growth factor receptor | Oral cancer (H00016), Lung cancer (#211,980) | Afatinib (DB08916) |
| 4 | P00734 | 1A2C [ | Prothrombin | Prothrombin deficiency, congenital (#613,679) | Argatroban (DB08916) |
| 5 | P00918 | 12CA [ | Carbonic anhydrase 2 | Combined proximal and distal renal tubular acidosis (H00241), Osteopetrosis (H00436), Osteopetrosis, autosomal recessive 3 (#259,730) | Dorzolamide (DB00869) |
| 6 | P04626 | 1MFG [ | Receptor tyrosine-protein kinase erbB-2 | Gastric cancer (#613,659) | Brigatinib (DB12267) |
| 7 | P05129 | 2E73NP | Protein kinase C gamma type (PRKCG) | Spinocerebellar ataxia (H00063), Spinocerebellar ataxia 14 (#605,361) | Fostamatinib (DB12267) |
| 8 | P08246 | 1B0F [ | Neutrophil elastase | Neutropenic disorders (H00100), Cyclic neutropenia (#162,800) | Freselestat (DB03925) |
| 9 | P08253 | 1CK7 [ | 72 kDa type IV collagenase | Multicentric osteolysis, nodulosis, and arthropathy; mona (#259,600) | Marimastat (DB00786) |
| 10 | P08581 | 1FYR [ | Hepatocyte growth factor receptor | Renal cell carcinoma (H00021), Gastric cancer (H00018), Hepatocellular carcinoma (#114,550) | Crizotinib (DB08865) |
| 11 | P10253 | 5KZW NP | Lysosomal alpha-glucosidase | Glycogen storage diseases (GSD) (H00069), Glycogen storage disease ii (#232,300) | Acarbose (DB00284) |
| 12 | P10635 | 2F9Q [ | Cytochrome P450 2D6 | Drug metabolism, poor, cyp2d6-related (#608,902) | Panobinostat (DB06603) |
| 13 | P11802 | 2W96 [ | Cyclin-dependent kinase 4 | Cervical cancer (H00030), Melanoma, cutaneous malignant, susceptibility to, 3 (#609,048) | Palbociclib (DB09073) |
| 14 | P12821 | 1O86 [ | Angiotensin-converting enzyme | Allograft rejection (H00083), Hemorrhage, intracerebral (#614,519) | Perindopril (DB00790) |
| 15 | P14780 | 1GKC [ | Matrix metalloproteinase-9 | Penile cancer (H00025), Intervertebral disc disease (#603,932) | Marimastat (DB00786) |
| 16 | P16109 | 1G1Q [ | P-selectin | Stroke, ischemic (#601,367) | N-acetyl-alpha-neuraminic acid (DB03721) |
| 17 | P22303 | 1B41 [ | Acetylcholinesterase | Yt blood group antigen (#112,100) | Pyridostigmine (DB00545) |
| 18 | P22309 | Homologous model | UDP-glucuronosyltransferase 1–1 | Hyperbilirubinemia (H00208), Bilirubin, serum level of, quantitative trait locus 1; biliqtl1 (#601,816) | Adenine (DB00173) |
| 19 | P22310 | Homologous model | UDP-glucuronosyltransferase 1–4 | Gilbert syndrome (#143,500) | Idelalisib (DB09054) |
| 20 | P22748 | 1ZNC [ | Carbonic anhydrase 4 | Retinitis pigmentosa (H00527) | Topiramate (DB00273) |
| 21 | P24385 | 2W9Z [ | G1/S-specific cyclin-D1 | Oral cancer (H00016), Myeloma, multiple (#254,500) | Encorafenib (DB11718) |
| 22 | P27487 | 1J2E [ | Dipeptidyl peptidase 4 | Thyroid cancer (H00032) | Sitagliptin (DB01261) |
| 23 | P35354 | 5F19 [ | Prostaglandin G/H synthase 2 | Cholangiocarcinoma (H00046), Penile cancer (H00025), Esophageal cancer (H00017) | Icosapent (DB00159) |
| 24 | P36888 | 1RJB [ | Receptor-type tyrosine-protein kinase FLT3 | Acute myeloid leukemia (H00003) | Sunitinib (DB01268) |
| 25 | P42336 | 2RD0 [ | Phosphatidylinositol 4,5-bisphosphate 3-kinase catalytic subunit alpha isoform or PI3-kinase subunit alpha | Ovarian cancer (H00027), Breast cancer (#114,480) | Caffeine (DB00201) |
| 26 | P47989 | 2CKZ [ | Xanthine dehydrogenase/oxidase | Xanthinuria (H00192) | Allopurinol (DB00437) |
| 27 | Q00534 | 1BI7 [ | Cyclin-dependent kinase 6 | Stature quantitative trait locus 11 (612,223) | Palbociclib (DB09073) |
| 28 | Q04759 | 1XJD [ | Protein kinase C theta type | Type I diabetes mellitus (H00408) | Fostamatinib (DB12010) |
| 29 | Q12809 | 1BYW [ | Potassium voltage-gated channel subfamily H member 2 or hERG-1 | Short QT syndrome (H00725, #609,620), Long QT syndrome (H00720, #613,688) | Ibutilide (DB00308) |
| 30 | Q13315 | 5NP1 [ | Serine-protein kinase ATM | Chronic lymphocytic leukemia (H00005), DNA repair defects (H00094), Ataxia-telangiectasia (#208,900) | Caffeine (DB00201) |
‘H’ KEGG id; ‘#’ OMIM id; NP: Not Published
Drug-likeness of the phytochemicals
| Molecules | TPSA (Å2) | Lipinski violations | GI absorption | PGP substrate | XLOGP3 | Log S |
|---|---|---|---|---|---|---|
| Epicatechin | 110.38 | 0 | High | Yes | 0.36 | − 2.22 |
| Epicatechin gallate | 177.14 | 1 | Low | No | 1.53 | − 3.70 |
| Epigallocatechin gallate | 197.37 | 2 | Low | No | 1.17 | − 3.56 |
| Catechin | 110.38 | 0 | High | Yes | 0.36 | − 2.22 |
| Quercetin | 131.36 | 0 | High | No | 1.54 | − 3.16 |
| Kaempferol | 111.13 | 0 | High | No | 1.9 | − 3.31 |
| Ellagic acid | 141.34 | 0 | High | No | 1.1 | − 2.94 |
| Gallic acid | 97.99 | 0 | High | No | 0.7 | − 1.64 |
Binding affinity (Kcal mol−1) between compounds and target proteins as generated by Autodock Vina
| Proteins (PDB id) | EPC | ECG | EGCG | CAT | QUE | KAE | EA | GA | Drug |
|---|---|---|---|---|---|---|---|---|---|
| Carbonic anhydrase 12 (1JCZ) | − 9.5 | − 9.4 | − 9.3 | − 9.4 | − 8.6 | − 6.9 | − 9.2 | ||
| Dihydrofolate reductase (1BOZ) | − 7.8 | − 9.7 | − 8.3 | − 8.4 | − 8.2 | − 8.4 | − 5.6 | − 8.3 | |
| Epidermal growth factor receptor (1IVO) | − 7.5 | − 7.7 | − 7.7 | − 7.6 | − 7.7 | − 7.4 | − 6.0 | − 7.9 | |
| Prothrombin (1A2C) | − 8.0 | − 8.8 | − 8.9 | − 8.0 | − 8.6 | − 8.3 | − 9.1 | − 6.5 | − 9.5 |
| Carbonic anhydrase 2 (12CA) | − 7.4 | − 7.3 | − 7.1 | − 7.5 | − 7.2 | − 7.1 | − 5.9 | − 6.8 | |
| Receptor tyrosine-protein kinase erbB-2 (1MFG) | − 6.7 | − 7.0 | − 6.6 | − 6.5 | − 6.7 | − 6.4 | − 7.2 | − 5.1 | − 7.8 |
| Protein kinase C gamma type (PRKCG) (2E73) | − 6.5 | − 6.3 | − 6.3 | − 5.8 | − 6.4 | − 6.5 | − 6.5 | − 4.7 | − 6.5 |
| Neutrophil elastase (1B0F) | − 6.5 | − 6.8 | − 7.3 | − 6.6 | − 6.8 | − 6.2 | − 6.3 | − 6.4 | |
| 72 kDa type IV collagenase (1CK7) | − 8.6 | − 8.6 | − 8.6 | − 8.8 | − 8.8 | − 8.3 | − 6.6 | − 7.2 | |
| Hepatocyte growth factor receptor (1FYR) | − 9.6 | − 9.9 | − 9.4 | − 9.9 | − 9.6 | − 9.7 | − 6.1 | − 9.8 | |
| Lysosomal alpha-glucosidase (5KZW) | − 6.5 | − 7.0 | − 6.8 | − 6.5 | − 6.3 | − 6.7 | − 5.5 | − 6.6 | |
| Cytochrome P450 2D6 (2F9Q) | − 8.7 | − 8.8 | − 9.1 | − 8.9 | − 9.5 | − 9.7 | − 8.4 | − 6.3 | − 10.2 |
| Cyclin-dependent kinase 4 (2W96) | − 7.7 | − 8.6 | − 8.8 | − 8.0 | − 8.1 | − 7.9 | − 8.1 | − 6.3 | − 9.0 |
| Angiotensin-converting enzyme (1O86) | − 7.8 | − 9.0 | − 8.0 | − 8.3 | − 8.3 | − 8.9* | − 6.1 | − 6.2 | |
| Matrix metalloproteinase-9 (1GKC) | − 8.4 | − 8.2 | − 8.9 | − 8.8 | − 7.3 | − 6.5 | − 7.4 | ||
| P-selectin (1G1Q) | − 6.6 | − 7.4 | − 6.3 | − 6.7 | − 6.5 | − 7.3 | − 5.9 | − 6.2 | |
| Acetylcholinesterase (1B41) | − 8.3 | − 6.6 | − 8.3 | − 8.8 | − 9.1 | − 8.9 | − 6.7 | − 6.3 | |
| UDP-glucuronosyltransferase 1–1 (Homologous model) | − 7.7 | − 7.3 | − 7.6 | − 7.8 | − 7.7 | − 7.7 | − 6.2 | − 5.4 | |
| UDP-glucuronosyltransferase 1–4 (Homologous model) | − 8.2 | − 6.1 | − 7.0 | − 7.0 | − 7.1 | − 6.2 | − 6.5 | − 6.5 | |
| Carbonic anhydrase 4 (1ZNC) | − 6.7 | − 8.4 | − 6.9 | − 7.4 | − 7.5 | − 7.2 | − 5.4 | − 6.9 | |
| G1/S-specific cyclin-D1 (2W9Z) | − 7.2 | − 8.7 | − 7.2 | − 8.1 | − 7.1 | − 8.1 | − 6.3 | − 8.3 | |
| Dipeptidyl peptidase 4 (1J2E) | − 7.6 | − 8.1 | − 8.1 | − 8.8 | − 7.8 | − 8.4 | − 6.6 | − 8.5 | |
| Prostaglandin G/H synthase 2 (5F19) | − 9.1 | − 9.3 | − 9.3 | − 9.2* | − 9.2 | − 7.6 | − 6.7 | − 7.2 | |
| Receptor-type tyrosine-protein kinase FLT3 (1RJB) | − 6.2 | − 7.3 | − 6.2 | − 6.6 | − 6.3 | − 6.7 | − 5.0 | − 6.7 | |
| PI3-kinase subunit alpha (2RD0) | − 8.9 | − 8.6 | − 8.1 | − 7.8 | − 8.6 | − 6.7* | − 5.9 | ||
| Xanthine dehydrogenase/oxidase (2CKZ) | − 7.9 | − 7.5 | − 7.4 | − 7.9 | − 7.6 | − 6.0 | − 6.2 | ||
| Cyclin-dependent kinase 6 (1BI7) | − 7.3 | − 8.6 | − 7.7 | − 7.9 | − 7.5 | − 8.5 | − 5.5 | − 8.5 | |
| Protein kinase C theta type (1XJD) | − 7.7 | − 7.5 | − 7.6 | − 8.0 | − 7.8 | − 7.7 | − 5.1 | − 7.6 | |
| hERG-1 (1BYW) | − 5.5 | − 6.4 | − 5.7 | − 5.9 | − 6.1 | − 5.6 | − 4.5 | − 4.6 | |
| Serine-protein kinase ATM (5NP1) | − 7.9 | − 8.5 | − 8.5 | − 8.7 | − 8.7 | − 7.9 | − 5.8 | − 5.9 |
Bold highlight: Binding affinity (Kcal mol−1) greater than the control drug (row wise);
*selected for network formation, binding affinity (Kcal mol−1) greater than the control drug (column wise); EC epicatechin, ECG epicatechin gallate, EGCG epigallocatechin gallate, CAT catechin, QUE quercetin, KAE kaempferol, and EA ellagic acid, GA gallic acid
Fig. 2Superimposed three-dimensional docking interaction between control drugs and phytochemicals to target proteins (Uniprot id), along with H bond interactions; catechin (CAT), epigallocatechin gallate (EGCG), epicatechin (EC), epicatechin gallate (ECG), quercetin (QUE), kaempferol (KAE), and ellagic acid (EA)
Amino acid residues interacting with the phytochemicals
| Compound | Protein name(UniProt ID/PDB ID) | Amino acid interactions | ||
|---|---|---|---|---|
| H Bonds | Pi-H bonds | Other bonds | ||
| Control (Marimastat) | Matrix metalloproteinase-9 (P14780/1GKC) | GLY186A, LEU188A, ALA189A, PRO421A | LEU187A, HIS401A, TYR423A | |
| Epicatechin | VAL398A | |||
| Kaempferol | VAL398A, | |||
| Control (Crizotinib) | Hepatocyte growth factor receptor | TYR134A, TRP121B, GLN145B, | VAL123A, THR138B, PHE147C, PHE147D | SER137A |
| Epicatechin gallate | ||||
| Control (Methotrexate) | Dihydrofolate Reductase (P00374/1BOZ) | LEU27A, SER59A, THR56A | LEU22A, VAL115A, THR56A, ALA9A, ILE16A | |
| Epigallocatechin gallate | GLU30A, | |||
| Control (Icosapent) | Prostaglandin G/H synthase 2 (P35354/5F19) | ALA202B, HIS207B, VAL295B, HIS386B, HIS388B, LEU390B, LEU391B, PHE395B, TYR404B | ||
| Catechin | THR206B | |||
| Quercetin | THR206B, THR212B, ASN382B | |||
| Control (Perindopril) | Angiotensin-converting enzyme (P12821/1O86) | ARG124A, TYR135A, TRP220A SER517A | PRO519A | GLU123A |
| Ellagic acid | ILE204A, ALA207A, | |||
Fig. 3Statistical analysis of binding affinities (kcal mol−1), a principal component analysis and b heat map with clustering
Fig. 4Molecular alignment of phytochemicals showing pharmacophores (ACC acceptor, DON donor, AR aromatic)
Fig. 5Protein–phytochemical interacting amino acids showing pharmacophores involved: catechin (CAT), epigallocatechin gallate (EGCG), epicatechin (EC), epicatechin gallate (ECG), quercetin (QUE), kaempferol (KAE), and ellagic acid (EA)
Fig. 6Tripartite network of phytochemicals, proteins and associated diseases: catechin (CAT), epigallocatechin gallate (EGCG), epicatechin (EC), epicatechin gallate (ECG), quercetin (QUE), kaempferol (KAE), and ellagic acid (EA)
Fig. 7Molecular dynamic (MD) simulation of epicatechin gallate (ECG)-hepatocyte growth factor receptor (PDB id 1FYR) complex: a root-mean-square deviation (RMSD), b radius of gyration, c root-mean-square fluctuation (RMSF) and ligand–protein H bonds
MM-PBSA calculations of binding free energy for ECG-P00374 complex
| Types of binding energy(Kj mol−1) | Binding energy ECG-P00374 complex |
|---|---|
| ΔG_binding | − 242.09 ± 10.97 |
| ΔG_Non Polar | − 22.11 ± 0.96 |
| ΔG_Polar | 71.20 ± 6.23 |
| ΔG_Electrostatic | − 15.95 ± 4.57 |
| ΔG_Van der Waal | − 275.23 ± 10.15 |
Fig. 8Free energy terms of epicatechin gallate (ECG)-hepatocyte growth factor receptor (PDB id 1FYR) complex, a ΔG_Van der Waal, b ΔG_Electrostatic, c ΔG_Polar, d ΔG_Non-Polar and e ΔG_binding