Literature DB >> 36164436

Pharmacogenomics deliberations of 2-deoxy-d-glucose in the treatment of COVID-19 disease: an in silico approach.

Navya B Prabhu1, Chigateri M Vinay1, Kapaettu Satyamoorthy2, Padmalatha S Rai1.   

Abstract

The outbreak of COVID-19 caused by the coronavirus (SARS-CoV-2) prompted number of computational and laboratory efforts to discover molecules against the virus entry or replication. Simultaneously, due to the availability of clinical information, drug-repurposing efforts led to the discovery of 2-deoxy-d-glucose (2-DG) for treating COVID-19 infection. 2-DG critically accumulates in the infected cells to prevent energy production and viral replication. As there is no clarity on the impact of genetic variations on the efficacy and adverse effects of 2-DG in treating COVID-19 using in silico approaches, we attempted to extract the genes associated with the 2-DG pathway using the Comparative Toxicogenomics Database. The interaction between selected genes was assessed using ClueGO, to identify the susceptible gene loci for SARS-CoV infections. Further, SNPs that were residing in the distinct genomic regions were retrieved from the Ensembl genome browser and characterized. A total of 80 SNPs were retrieved using diverse bioinformatics resources after assessing their (a) detrimental influence on the protein stability using Swiss-model, (b) miRNA regulation employing miRNASNP3, PolymiRTS, MirSNP databases, (c) binding of transcription factors by SNP2TFBS, SNPInspector, and (d) enhancers regulation using EnhancerDB and HaploReg reported A2M rs201769751, PARP1 rs193238922 destabilizes protein, six polymorphisms of XIAP effecting microRNA binding sites, EGFR rs712829 generates 15 TFBS, BECN1 rs60221525, CASP9 rs4645980, SLC2A2 rs5393 impairs 14 TFBS, STK11 rs3795063 altered 19 regulatory motifs. These data may provide the relationship between genetic variations and drug effects of 2-DG which may further assist in assigning the right individuals to benefit from the treatment. Supplementary Information: The online version contains supplementary material available at 10.1007/s13205-022-03363-4.
© The Author(s) 2022.

Entities:  

Keywords:  2-Deoxy-d-glucose (2-DG); COVID-19; Drug response; SARS-CoV-2; Single nucleotide polymorphisms

Year:  2022        PMID: 36164436      PMCID: PMC9491670          DOI: 10.1007/s13205-022-03363-4

Source DB:  PubMed          Journal:  3 Biotech        ISSN: 2190-5738            Impact factor:   2.893


Introduction

The outbreak of coronavirus disease 2019 (COVID-19) caused by the new coronavirus (SARS-CoV-2) infection has prompted worldwide attempts to develop efficient molecules to treat the disease and symptoms (Samantaray et al. 2021). However, developing novel molecules culminating in translation against infections can be laborious, time-consuming, and expensive (Paul et al. 2010). Thus, identifying the therapeutically effective entity against the disease from a pre-existent clinically approved repository of molecules may be advantageous (Ciliberto et al. 2020). Virus infections such as SARS-CoV-2 reprogram the host cells to consume more glucose and upregulate metabolic activities such as glycolysis, akin to the Warburg effect and alter glycosylation to survive, replicate, and transmit infections (Mullen et al. 2021). Similar to glucose, internalization of 2-DG is facilitated by glucose transporters followed by its phosphorylation into inactive metabolite, 2-deoxyglucose-6-phosphate. This glucose deprivation in the cells leads to reduced proliferation and induction of apoptosis (Schmidt and O’Donnell 2021). 2-Deoxy-d-glucose (2-DG) has been tested to inhibit glycolysis and hence SARS-CoV-2 replication in monocytes and epithelial cells consequently leading to increased HIF-1α and reduced inflammatory mediators (Pliszka and Szablewski 2021; Medini et al. 2021; Codo et al. 2020). It was reported that 2-DG prevents viral replication by hindering virus DNA polymerase (Codo et al. 2020; Liu et al. 2021) attaching to specified receptors on the cell surface and obstructing viral invasion into the target cells; blocking viral protein synthesis, obstructing delayed phases of virus assembly (Codo et al. 2020). The metabolic processes such as glycolysis in the cytoplasm and glycosylation in the endoplasmic reticulum can be interrupted using glucose mimics such as 2-deoxy-D-glucose (2-DG) (Xi et al. 2014). By impeding viral replication, high energy requirements, and viral assembly, it could be a potential therapeutic candidate (Khurana et al. 2022). The reports from the different phases of clinical trials have shown that 2-DG aids in improving the health status of severely Covid-19-infected individuals and decreases oxygen therapy dependency. It was found that a large number of 2-DG-treated patients reported negative within 5 days (Goel 2021; Wang et al. 2021). 2-DG as an anti-viral agent has previously been reported wherein the inhibition of replication of enveloped viruses such as herpes simplex virus (Courtney et al. 1973), measles virus, respiratory syncytial virus (Hodes et al. 1975), Semiliki forest virus, and Sindbis virus (Kaluza et al. 1972) are demonstrated. In an in vivo study, 2-DG inhibited rhinovirus load and inflammation in mice (Gualdoni et al. 2018). Several proteins such as non-structural protein 1 (Nsp1), RNA-dependent RNA polymerase, 3CLpro are the attractive targets involved in COVID-19 treatment (Singh et al. 2021, 2022). In silico analysis suggested efficient binding of 2-DG with SARS-CoV-2 viral main protease 3CLpro and NSP15 endoribonuclease (Balkrishna et al. 2020). As considerable knowledge on molecular interaction between 2-DG and SARS-CoV-2 and drug response is lacking, there is an absolute requisite to integrate the information from 2DG interacting genes by in silico analysis. The genes and their products are regulated by various mechanisms that involve correlation between many processes, metabolic pathways, and regulatory factors (Vohra et al. 2021). One prevalent form of gene variants is single nucleotide polymorphisms (SNPs), where two different bases appear at a remarkable rate in human diversity (Prabhu et al. 2021). The genetic profiling based on the identified and functionally characterized SNPs is considered a “fingerprint”, possibly used to determine the risk of disease susceptibility and drug response (Shastry 2007). Many variants residing in non-coding and non-regulatory sequences are functionally silent. However, few SNPs alter the structure and function of the protein. The role of functional SNPs, which can alter the regulation and structure of the protein in relation to the effects of 2-DG, is not well understood. These functional SNPs are considered an ideal substrate for the human population in health and illness (Alwi 2005). Hence, the current study is aimed to investigate the influence of functional or regulatory SNPs on the potency and pernicious effect of 2-DG. Therefore, the main purpose of the research was to examine the impact of SNPs in the 2-DG interacting pathway genes by interrogating various bioinformatics resources and assessed the influence of SNPs on the protein stability, miRNA regulation, and cis-acting elements to evolve a relationship for pharmacogenomics purposes.

Materials and methods

Identification of interacting genes of 2-DG

The interacting genes of 2-DG were retrieved from the Comparative Toxicogenomic Database (CTD) (Grondin et al. 2021) using the parameter named chemical-gene interaction in Homo sapiens. UniProt database (Uniprot Consortium 2021) was used to retrieve the data of all the 2-DG interacting gene families, and further, these data were utilized for downstream analysis.

Pathway interaction among 2-DG interacting genes

The 2-DG interacting genes were subjected to the Cytoscape tool v3.0 Software ClueGO v2.5.8 (Bindea et al. 2009) was employed to identify the networks in the degree sorted circular layout to interpret the biological function of the selected genes. The distinct ontologies such as molecular function, pathways, and human diseases were used in the framework, and the GO terms were connected using kappa statistics based on the overlapping genes.

Retrieval and characterization of SNPs

For the selected genes, SNPs were retrieved by preferring the option variant table in the Ensembl genome browser (m.ensembl.org). The retrieved SNPs were further classified into missense variants, 5′-UTR variants, 3′-UTR variants, synonymous SNPs, intronic SNPs, splice donor, splice acceptor variants, splice region SNPs, stop retained SNPs, stop-loss SNPs, stop-gained SNPs, and non-coding transcript exon variants. Among these, missense SNPs were considered for further functional analysis.

In silico prediction of missense variants functional impacts

The selected missense variants were scrutinized utilizing six diverse tools with mutation score accessible in the Ensembl genome browser, and these included CADD (Combined Annotation-Dependent Depletion), Mutation assessor, SIFT (Sorting Intolerant from Tolerant), Revel (Rare exome variant ensemble learner), MetaLR, and PolyPhen-2 (Polymorphism Phenotyping). The SNPs characterized as “deleterious” in all the tools were carefully chosen and evaluated for their effect on protein structure and stability.

Protein modeling and mutation effect on protein stability

To interpret the effect of deleterious SNPs on protein structure, we predicted the native and mutant forms by protein modeling. The predicted model of the native form was available from the AlphaFold protein structure database (Jumper et al. 2021), and the mutant form of the protein structure was modeled using an automated protein structure homology-modeling server, SWISS-MODEL via Expasy webserver (Waterhouse et al. 2018), by considering the native predicted model as a template. The alteration in the hydrophilicity or hydrophobicity for the deleterious SNPs due to the amino acid change is presented using the hydropathy index (Kyte et al. 1982). The stability of the protein was determined based on point mutation using the CUPSAT mutation tool (Parthiban et al. 2006) of the 3D AlphaFold structure of variants retrieved from UniProt database. Using Swiss-PDB Viewer (Kaplan and Littlejohn 2001), the energy minimization using the steepest descent algorithm was performed for the mutated protein model with the corresponding amino acid substitution, compared with the native protein model, followed by total energy calculations. The root-mean-square deviation (RMSD) was calculated using align function from Pymol software to find the divergence in mutant form from the native form of the protein (Yuan et al. 2017).

Prediction of functional microRNA target SNPs

The identified 2-DG-associated genes were deployed to predict the SNPs in the microRNA binding sites that were functional using three databases. These were microRNA-related Single Nucleotide Polymorphisms v3 (miRNASNP3) (Gong et al. 2015), PolymiRTS database (Bhattacharya et al. 2014), and miRNA-related SNPs (MirSNP) database (Liu et al. 2012). The MirSNP database was utilized to investigate the miRNA binding SNP locations and their consequences on the target position. Furthermore, the PolymiRTS database was employed to obtain the variants and their concomitant miRNAs at wild and mutant alleles and assessed their effect on the target gain/loss in the 3′-UTR using the miRNASNP3 database.

SNPs at the transcription factor binding site (TFBS)

The shortlisted 2-DG interacting genes were utilized to obtain the SNPs in TFBS employing SNP2TFBS (Kumar et al. 2017). The parameter named annotated variants were employed to obtain the SNPs residing in the upstream and 5′-UTR regions. The SNPInspector in Genomatix Software Suite (https://www.genomatix.de/) was applied to predict if SNPs in TFBS generate or destroy the TF binding sites.

Enhancers SNPs

The identified 2-DG-associated genes were further utilized to analyze the influence of SNPs residing in enhancers using EnhancerDB (Kang et al. 2019) and ENCODE laboratories software HaploReg version 4.1 (Ward and Kellis 2012). The search preference comprising gene was utilized in the EnhancerDB database to retrieve the enhancer SNPs of the shortlisted genes. Further, HaploReg v4.1 was used to evaluate the regulatory motifs of the enhancer SNPs that were altered.

Results

Identification of interacting genes for 2-DG

We identified 48 interacting genes for 2-DG (Table 1) and plotted their position using the Circos ideogram. The depiction indicated the distribution of genes over 21 autosomes and X chromosome except for 13 autosome and Y chromosome (Fig. S1). The overview of plot shows 48 genes (from outer ring inwards), 5′-UTR SNPs, intronic SNPs, 3′-UTR SNPs, synonymous SNPs, missense variants, splice variants (splice region, splice donor, splice acceptor), start lost, stop-lost, stop-gained, stop-lost SNPs, inner most ring constitutes non-coding transcript exon variant and NMD transcript variant. The schematic illustration of in silico workplan is shown in Fig. 1.
Table 1

Details of selected 2DG interacting genes for downstream analysis

Sl. No.GeneGene symbolFamilyChromosomeLocationStrand
1Alpha-2-MacroglobulinA2MProtease inhibitorchr129,067,664–9,116,229Minus
2Adiponectin, C1Q And Collagen Domain ContainingADIPOQHormonechr3186,842,704–186,858,463Plus
3Amyloid Beta Precursor ProteinAPPProtease inhibitorchr2125,880,550–26,171,128Minus
4Autophagy Related 7ATG7chr311,272,309–11,564,652Plus
5BCL2 Associated X, Apoptosis RegulatorBAXchr1948,954,815–48,961,798Plus
6Beclin 1BECN1chr1742,810,132–42,833,350Minus
7BH3 Interacting Domain Death AgonistBIDchr2217,734,138–17,774,770Minus
8Caspase 3CASP3Proteasechr4184,627,696–184,649,509Minus
9Caspase 9CASP9Proteasechr115,490,832–15,526,534Minus
10DNA Damage Inducible Transcript 3DDIT3chr1257,516,588–57,521,737Minus
11Epidermal Growth Factor ReceptorEGFRKinasechr755,019,017–55,211,628Plus
12Eukaryotic Translation Initiation Factor 4E Binding Protein 1EIF4EBP1Protein synthesis inhibitorchr838,030,534–38,060,365Plus
13Erb-B2 Receptor Tyrosine Kinase 2ERBB2Kinasechr1739,687,914–39,730,426Plus
14FXYD Domain Containing Ion Transport Regulator 2FXYD2Ion transportchr11117,800,844–117,828,698Minus
15Glycogen Synthase Kinase 3 BetaGSK3BKinasechr3119,821,321–120,095,823Minus
16H2A.X Variant HistoneH2AXchr11119,093,854–119,095,467Minus
17Hexokinase 2HK2Kinasechr274,834,126–74,893,359Plus
18Heme Oxygenase 1HMOX1Oxidoreductasechr2235,380,361–35,394,214Plus
19Heat Shock Protein 90 Alpha Family Class A Member 1HSP90AA1Proteasechr14102,080,742–102,139,749Minus
20Heat Shock Protein 90 Beta Family Member 1HSP90B1Chaperonechr12103,930,107–103,953,931Plus
21Heat Shock Protein Family A (Hsp70) Member 5HSPA5Proteasechr9125,234,853–125,241,382Minus
22HtrA Serine Peptidase 2HTRA2Proteasechr274,529,405–74,533,556Plus
23Insulin Like Growth Factor 1 ReceptorIGF1RKinasechr1598,648,539–98,964,530Plus
24Interleukin 1 BetaIL1BCytokine receptorschr2112,829,751–112,836,843Minus
25InsulinINSHormonechr112,159,779–2,161,221Minus
26Integrin Subunit Beta 1ITGB1Integrinchr1032,887,273–33,005,792Minus
27Potassium Voltage-Gated Channel Subfamily H Member 2KCNH2Ion channelchr7150,944,956–150,978,321Minus
28Microtubule Associated Protein 1 Light Chain 3 AlphaMAP1LC3Achr2034,546,823–34,560,345Plus
29Microtubule Associated Protein 1 Light Chain 3 BetaMAP1LC3Bchr1687,383,953–87,404,779Plus
30Mitogen-Activated Protein Kinase 1MAPK1Kinasechr2221,759,657–21,867,680Minus
31Mitogen-Activated Protein Kinase 3MAPK3Kinasechr1630,114,105–30,123,506Minus
32Matrix Metallopeptidase 13MMP13Proteasechr11102,942,995–102,955,732Minus
33Matrix Metallopeptidase 9MMP9Proteasechr2046,008,908–46,016,561Plus
34Mechanistic Target Of Rapamycin KinaseMTORKinasechr111,106,535–11,273,497Minus
35Nitric Oxide Synthase 2NOS2Oxidoreductasechr1727,756,766–27,800,529Minus
36Poly (ADP-Ribose) Polymerase 1PARP1Glycosyltransferasechr1226,360,210–226,408,154Minus
37Ribosomal Protein S6RPS6Ribonucleoproteinchr919,375,715–19,380,236Minus
38Ribosomal Protein S6 Kinase B1RPS6KB1Kinasechr1759,893,046–59,950,574Plus
39Serpin Family B Member 5SERPINB5chr1863,476,958–63,505,085Plus
40Solute Carrier Family 2 Member 1SLC2A1Transportchr142,925,353–42,958,893Minus
41Solute Carrier Family 2 Member 2SLC2A2Transportchr3170,996,341–171,026,743Minus
42Solute Carrier Family 2 Member 3SLC2A3Transportchr127,919,230–7,936,187Minus
43Superoxide Dismutase 2SOD2Oxidoreductasechr6159,669,069–159,762,529Minus
44Sequestosome 1SQSTM1chr5179,806,393–179,838,078Plus
45Serine/Threonine Kinase 11STK11Kinasechr191,177,558–1,228,431Plus
46Tumor Necrosis FactorTNFCytokinechr631,575,565–31,578,336Plus
47Tumor Protein P53TP53Activator and repressorchr177,661,779–7,687,538Minus
48X-Linked Inhibitor of ApoptosisXIAPProtease inhibitorchr X123,859,708–123,913,976Plus

SNPs single nucleotide polymorphisms, chr chromosome

Fig. 1

Schematic representation of in silico workflow of the study

Details of selected 2DG interacting genes for downstream analysis SNPs single nucleotide polymorphisms, chr chromosome Schematic representation of in silico workflow of the study The interaction among 2-DG genes constituted a network after employing the statistical option Enrichment/Depletion test (two-sided hypergeometric test) (Fig. S2). The resulting network indicated 13 Kappa score groups such as apoptotic factor-mediated response, the intrinsic pathway for apoptosis, cytochrome C-mediated apoptotic response, interleukin-4, and interleukin-13 signaling, integration of energy metabolism, macroautophagy, purinergic signaling in leishmaniasis infection, ATF6 alpha activates chaperone genes, mTOR signaling, FOXO-mediated transcription, protease binding, collagen-binding and SARS-CoV infections (Fig. S2). It was found that 45.87% of the associated genes (CASP3, CASP9, MAPK1, MAPK3, XIAP) contributed to cytochrome C-mediated apoptotic response and 3.67% of the associated genes (BECN1, FXYD2, GSK3B, HSP90AA1, ITGB1, MAP1LC3B) contributed to SARS-CoV infections (Fig. S2).

SNPs characterization

A sum of 9,66,482 SNPs was obtained by using the Ensembl genome browser (m.ensembl.org) from human genome assembly GRCh38.p13 (1000 Genomes Project). The retrieved variants were mined which generated 1,04,034 SNPs. These shortlisted variants were further classified depending on their function. These SNPs were from 5′-UTR (295), intronic regions (27,917), 3′-UTR (1729), synonymous SNPs (519), splice variants of the genes including splice donor, acceptor, splice region (119), non-coding transcript exons (103), 8 stop-gained, stop-lost SNP (1), NMD-transcript variants (20), and 616 were missense variants (Fig. 2).
Fig. 2

Schematic representation of in silico SNP search and characterization

Schematic representation of in silico SNP search and characterization

Selection of lethal nsSNPs

Among 616 missense SNPs, SIFT analysis predicted 248 SNPs (40.25%) as “deleterious”, however, the prediction rate of mutation by PolyPhen-2 was 149 (24.18%) as “probably damaging”. CADD, Revel, Meta LR, and Mutation Assessor reported 27 SNPs (4.38%), 109 SNPs (17.69%), 116 SNPs (18.83%), and 464 SNPs (6.49%) as likely deleterious, likely disease-causing, damaging, and high, respectively (Fig. S3). A total of six diverse bioinformatic resources, such as CADD, Mutation assessor, SIFT, Revel MetaLR, and PolyPhen-2 collectively showed three lethal missense variants (Fig. 3); A2M rs201769751, rs778604418, and PARP1 rs193238922 (Table S1).
Fig. 3

Pathogenicity predictions showing common deleterious non-synonymous SNPs

Pathogenicity predictions showing common deleterious non-synonymous SNPs Out of three deleterious SNPs identified, A2M (rs778604418) and PARP1 (rs193238922) showed a change in hydrophobicity or hydrophilicity, but none of them showed a change in its polarity. The change in polarity and hydrophobicity may affect the protein structure and its activity. The divergence in free energy of unfolding between native form and mutant form of proteins known as ΔΔG is calculated by CUPSAT tool using structural environment-specific atom capability and torsion angle capability. Henceforth, the stability of the protein was identified in terms of predicted ΔΔG values (kcal/mol). Out of three deleterious SNPs, A2M (rs778604418) showed more stability with a predicted ΔΔG value of 3.35 kcal/mol and A2M (rs201769751), PARP1 (rs193238922) affects the protein stability with predicted ΔΔG value of − 5.07 kcal/mol and − 0.51 kcal/mol, respectively (Table S2). The native form of the protein A2M (AlphaFold ID: AF-P01023-F1), PARP1 (AlphaFold ID: AF-P09874-F1) was retrieved from the AlphaFold database, and the mutant form was modeled and validated using the Ramachandran plot. The mutant model showed that 95% of the amino acids were present in the favorable region and considered for further in silico analysis. The native and mutant protein forms of deleterious SNPs along with overlapping models were shown (Fig. 4). A high QMEAN score and sequence identity from the swiss model was considered for the superimposition of the mutant model over the native structure and visualized using Swiss-PDB Viewer. The total energy of mutant structures in all three polymorphisms was less compared to native protein structures. Hence, it is believed that these three deleterious SNPs may affect the protein structure and function. Further, the calculated RMSD value for A2M (rs201769751, rs778604418) and PARP1 (rs193238922) were 0.052 Å, 0.047 Å, and 0.221 Å, respectively. It is reported that the higher the RMSD value, the greater the deviation between the native and mutant forms of the protein structures, which in turn indicates the change in its functional activity. The total energy and RMSD value of native and mutant forms of all the polymorphisms are tabulated in Table S3.
Fig. 4

Native, mutant and superimposition of native and mutant modeled structures of the A2M (1) rs201769751 (2) rs778604418 (3) PARP1 rs193238922. a Structure of native protein, b enlarged structure of native protein, c structure of mutant protein, d enlarged structure of mutant protein, e superimposed model of native and mutant protein structures, f enlarged superimposed model of native and mutant protein structures

Native, mutant and superimposition of native and mutant modeled structures of the A2M (1) rs201769751 (2) rs778604418 (3) PARP1 rs193238922. a Structure of native protein, b enlarged structure of native protein, c structure of mutant protein, d enlarged structure of mutant protein, e superimposed model of native and mutant protein structures, f enlarged superimposed model of native and mutant protein structures The functional microRNA targeting SNPs were predicted using three different resources, and these were miRNASNP3, PolymiRTS, and MirSNP, which concomitantly reported 12 SNPs (rs11552192 in the BECN1, rs60393216 in the GSK3B, rs9903 in the MAP1LC3B, rs1065154, and rs10277 in the SQSTM1, rs10415095 in the STK11, rs28382747, rs28382755, rs28382752, rs28382740, rs28382742, rs17330644 in the XIAP) with the minor allele frequency (MAF) of 10% in the microRNA binding sites. It also indicates any miRNAs linked with SNPs residing in the target position would create or destroy a miRNA-mRNA binding site (Table 2).
Table 2

Details of miRNA target site SNPs with minor allele frequency > 0.1

Sl no.GeneCommon SNPs among MirSNP, miRNASNP3 and PolymiRTS databasesMAFmiRNA binding at mutant allelemiRNA binding at ancestral alleleEffect
1BECN1rs115521920.11hsa-miR-590-3pCreate
2GSK3Brs603932160.30hsa-miR-3662Break
3MAP1LC3Brs99030.18hsa-miR-3960Break
hsa-miR-4467Break
hsa-miR-4484Break
4SQSTM1rs10651540.277hsa-miR-4643Create
hsa-miR-466Create
hsa-miR-4694-3pBreak
hsa-miR-4717-3pCreate
hsa-miR-4802-3pCreate
rs102770.31hsa-miR-3178Break
hsa-miR-4634Break
5STK11rs104150950.29hsa-miR-4781-5pBreak
hsa-miR-330-3pCreate
6XIAPrs283827470.26hsa-miR-4634Break
rs283827550.26hsa-miR-143-5pCreate
hsa-miR-5693Create
rs283827520.26hsa-miR-150-5pCreate
hsa-miR-562Create
hsa-miR-5697Create
rs283827400.26hsa-miR-24-1-5pBreak
hsa-miR-24-2-5pBreak
hsa-miR-625-3pBreak
rs283827420.26hsa-miR-326Break
hsa-miR-330-5pBreak
hsa-miR-3675-5pCreate
hsa-miR-5195-5pCreate
rs173306440.26hsa-miR-3609Create
hsa-miR-548ah-5pCreate
hsa-miR-5586-3pBreak
Details of miRNA target site SNPs with minor allele frequency > 0.1 A sum of 22 SNPs was found to be in TFBS with MAF > 0.1 by SNP2TFBS; among them, 17 and 5 SNPs reside in the upstream and 5′-UTR region, respectively. Further, SNPInspector projected that rs712829 in the EGFR generates 15 TFBS; rs60221525 in the BECN1, rs4645980 in the CASP9, and rs5393 in the SLC2A2 impaired binding position for 14 transcription factors (TFs). The effect of 22 SNPs at TFBS revealed those SNPs that would generate or disrupt the positions for the binding of TFs (Table 3).
Table 3

Impact of SNPs in the transcription factor binding site with MAF > 0.1

Sl no.Gene namersIDAlleleFunctionMAFGenerated sitesDeleted sites
1A2Mrs226380A > CUTR50.47SMADSORY, GREF, GREF
2APPrs364048T > CUpstream0.13LEFF, HNFPBCDF, FKHD
3ATG7rs2594971G > A/CUpstream0.49NOLF, MZF1, E2FF, XCPE, MZF1, AP2F, GCF2, ZTRECTCF, NOLF, ZF02, ZTRE, GLIF, EGRF, SP1F, AP2F
4BECN1rs60221525C > A/GUpstream0.10ZF07, ZF02, BEDF, GLIF, INSM, CNBP, RREB, GCMFMAZF, ZF07, NDPK, ZF02, BEDF, SP1F, CTBP, KLFS, EGRF, GLIF, INSM, CNBP, RREB, NDPK
5CASP3rs114746204G > C/TUpstream0.39MAZF, KLFS, ZTRE, NDPK, EGRF, GLIF, ZF02, CTCF, SP1F, ZF07, E2FF, PURA, HEATMOKF, SP1F, BTBF, BTBF
6CASP9rs4645978C > A,TUpstream0.41CEBP, GCMF, ZF30, MAZF, NR2F, PERO, ZF07MAZF, PLAG, PAX6, SP1F, KLFS, ZTRE, E2FF, NDPK, EGRF, IKRS, SPZ1, ZF37
7CASP9rs4645980C > A,TUpstream0.41CTBPMAZF, PLAG, SP1F, KLFS, ZTRE, E2FF, NDPK, ZF42, ZF02, ZF5F, BEDF, GCF2, XCPE, EGRF
8DDIT3rs703835A > GUTR50.21HUB1, STAT, CEBP, RBPF, IKRSSTAT, CEBP
9EGFRrs712829G > C/TUTR50.22MAZF, EGRF, KLFS, CTCF, VEZF, ZF02, SP1F, NDPK, CTBP, MZF1, ZF07, BEDF, GLIF, CTBP, INSMSP1F, KLFS, ZF02, BEDF, GLIF
10EIF4EBP1rs3750243C > A/GUpstream0.23GCF2, CTCF, ZTRE, KLFS, MAZF, E2FF, NDPK, EGRFKLFS, BEDF, ZF57, EBOX, HESF, CHRE
11IL1Brs1143627G > AUpstream0.47PTBP, MYT1
12INSrs3842737T > GUpstream0.11PLAG, RXRF, ZF02, SP1F, ZF07, BEDF, KLFS, ZF37ZF02, TAIP
13INSrs689A > G,TUTR50.35PAX6, GREF, PBXC, RXRF, TALE, MYRFTF3A, NGRE, IKRS
14MAPK3rs61764202C > TUpstream0.36AP1FSP1F, KLFS, ZTRE, E2FF, HESF
15PARP1rs2793379T > A/CUpstream0.20PARF, TALE, HAND, EBOX, MYOD, NREBPDX1
16PARP1rs2077197C > G/TUpstream0.26CDXF, ABDB, E2FF, EGRF, ZF02, HBOX, RXRFPLAG, ZF02, GLIF, RREB
17RPS6rs35096177A > GUpstream0.12SORY, CLOX, DMRT, HNF6, HOXC, LEFF, AP1FSORY, ARID, HOXF, PIT1, BRNF, CART, LHXF, HOXF, LHXF, HBOX, HOMF, NKX1, NKX6
18SLC2A2rs5393T > GUpstream0.22NOLF, NGRE, BCDFLHXF, DLXF, PDX1, BRN5, BRNF, HOMF, GATA, NKX1, OCT1, CART, HOXF, NKX6, PAXH, DLXF
19SLC2A2rs5394G > AUpstream0.14MIPUTF3A, STAF
20STK11rs3795061G > CUpstream0.15CTCF, ETSFZF02, GCMF, KLFS
21TP53rs2909430C > G/TUTR50.15ZF01, IRXF, RP58, CEBPPAX6
22XIAPrs12687176C > TUpstream0.22MAZF, E2FF, SPZ1MAZF, GLIF, ZF07, SAL4

MAF minor allele frequency

Impact of SNPs in the transcription factor binding site with MAF > 0.1 MAF minor allele frequency

SNPs in enhancers

The two databases, namely, EnhancerDB and HaploReg were employed to identify SNPs in the enhancers which unanimously identified 42 SNPs residing in the introns and 1 3′-UTR SNP with MAF > 0.1. Out of 43 SNPs, rs3795063 in the STK11 gene showed 19 regulatory motifs that were altered which included CAC-binding-protein, CACD, E2A, Egr-1, Irf, Klf4, Klf7, Myc, Myf, NRSF, Pou2f2, Rad21, SMC3, SP1, SP4, TATA, UF1H3BETA, YY1, and Zfp740. The rs10861203 in the HSP90B1 gene reported 14 regulatory motifs that were altered and these included BCL, BDP1, ELF1, Egr-1, Ets, FEV, Myc, NERF1a, Nrf-2, Pax-5, STAT, TBX5, Tel2, and p300. The specifics of SNPs residing in the enhancers and their altered regulatory motifs are catalogued (Table 4).
Table 4

SNPs in enhancers and their altered regulatory motifs with MAF > 0.1

Sl noGeneChromosomers IDMAFReference AlleleAlternative AlleleEnhancer IDFunctional annotationRegulatory motifs altered
1SLC2A1chr1rs112107710.36TCenh11804IntronicRad21
2SLC2A1chr1rs75125650.14CTenh107379IntronicIrf, Maf, Mxi 1, Nkx2
3SLC2A1chr1rs7102210.43GAenh107379IntronicHNF4
4ADIPOQchr3rs8755710.34TC,Genh48780IntronicBCL, CCNT2, GATA, HDAC2, HMGN3, TAL1
5ADIPOQchr3rs556473620.30AGenh7748IntronicEBF, GR, Sox, TATA
6ATG7chr3rs119150500.40AGenh6998Intronicp300
7CASP9chr1rs46460290.41GAenh78963IntronicERalpha-a, Pax-5, TCF11::MafG, ZID
8CASP9chr1rs42335360.41CTenh98427IntronicHEY1,Pou1f1,Sox
9DDIT3chr12rs47592770.37CAenh98973IntronicZnf143,p300
10EGFRchr7rs65932070.18TCenh24214IntronicZfp691
11EIF4EBP1chr8rs96448110.33GAenh24927IntronicDbx1, Evi-1, Foxa, Foxp1, HDAC2, HMG-IY, Ncx, TATA, Zfp105
12EIF4EBP1chr8rs109585410.23GCenh40918IntronicAhR::Arnt_1, Arnt, NF-E2, NRSF, Nanog, TATA
13ERBB2chr17rs29521550.37TCenh17134IntronicBCL, Egr-1, Ets, GATA, Hsf, Maf, PU.1, Pax-5, STAT, UF1H3BETA, Zfp281, Znf143
14FXYD2chr11rs8697890.11GAenh645343'-UTRCTCF,TCF12
15GSK3Bchr3rs285366620.40GAenh20841IntronicEWSR1-FLI1, Gfi1, HDAC2, HMG-IY, Mef2
16GSK3Bchr3rs46880540.30TCenh7475IntronicCTCF, ERalpha-a, Foxk1, Irf, Rad21, SMC3, SZF1-1
17HK2chr2rs15455220.30TCenh5656IntronicMIF-1
18HK2chr2rs37717630.12CTenh18686IntronicERalpha-a, GCNF, HNF4, RXRA, SF1
19HSP90AA1chr14rs71565640.32AGenh15961IntronicMef2
20HSP90AA1chr14rs17465870.10AGenh3457IntronicDobox4
21HSP90B1chr12rs3121360.24AGenh29941IntronicAIRE
22HSP90B1chr12rs79803260.44TGenh14948IntronicMyc,Smad3,Zfp410
23HSP90B1chr12rs108612030.19GAenh86141IntronicBCL, BDP1, ELF1, Egr-1, Ets, FEV, Myc, NERF1a, Nrf-2, Pax-5, STAT, TBX5, Tel2, p300
24HTRA2chr2rs134111850.29CTenh33451IntronicPax-5
25HTRA2chr2rs729206760.19CTenh81722IntronicAP-1, Mef2, NRSF, YY1
26HTRA2chr2rs178384120.11TCenh33456IntronicFoxp3,Pou5f1
27IGF1Rchr15rs65985410.44AGenh16457IntronicNRSF,RFX5
28ITGB1chr10rs107639230.40GAenh43272IntronicCdx2,Hoxd10
29ITGB1chr10rs110093380.17AG,Tenh62030IntronicEWSR1-FLI1,TAL1,VDR
30MAP1LC3Bchr16rs45989160.43CGenh32021IntronicDBP,Irf,PU.1
31MAP1LC3Bchr16rs80522440.16GCenh32020IntronicBCL, ELF1, Egr-1, Ets, GATA, Maf, NERF1a, PU.1, Pax-5, SPIB, TEF, TFIIA, p300
32MAP1LC3Bchr16rs37946730.24GCenh115621IntronicEts,PLAG1,Pax-6
33NOS2chr17rs47962220.21AGenh17068IntronicSox
34RPS6chr9rs9447200.29AGenh25490IntronicFoxf2, Foxl1, Foxp3, Gm397, Nanog, Pou1f1, Pou2f2, Pou3f3
35RPS6KB1chr17rs110793740.20TCenh17258IntronicLBP-1,LBP-9
36RPS6KB1chr17rs99105980.10GAenh96758IntronicMsx-1,Ncx
37SERPINB5chr18rs116611840.25GAenh4892IntronicCTCF, HNF4, RXRA, Rad21, SMC3, SP1, TATA
38SOD2chr6rs69139040.11AGenh9509IntronicEts, Pax-4, Pbx3, TBX5
39SQSTM1chr5rs5027290.49ACenh22874IntronicOsr,TCF12
40SQSTM1chr5rs592030820.17CTenh22871IntronicFoxp1,RFX5
41SQSTM1chr5rs104640930.42GAenh22874IntronicSin3Ak-20,TATA
42STK11chr19rs37950630.25CG,Tenh17841IntronicCAC-binding-protein, CACD, E2A, Egr-1, Irf, Klf4, Klf7, Myc, Myf, NRSF, Pou2f2, Rad21, SMC3, SP1, Sp4, TATA, UF1H3BETA, YY1, Zfp740
43STK11chr19rs349288890.46GAenh17841IntronicERalpha-a,Rad21,Zfx

MAF minor allele frequency, chr chromosome

SNPs in enhancers and their altered regulatory motifs with MAF > 0.1 MAF minor allele frequency, chr chromosome

Discussion

Detection of therapeutically effective entity counter to the disease from a pre-existent molecule repository may substantially reduce the time and efforts against new drug discovery and clinical trial randomization. The approach of repurposing the existing drugs has resulted in the detection of a large number of effective molecules for the treatment of COVID-19 infection (Ciliberto et al. 2020). In order to simplify the overview of large number of 2-DG interacting genes that has been extracted, massive number of SNPs residing in respective genes were mined and characterized based on their location. The distribution of these SNPs was depicted by circos which is an unambiguous representation to lessen the inherent complexities and consider the density and dynamic range within huge data sets (Krzywinski et al. 2009). Further, our in silico approach has detected 80 genetic variants associated with 2-DG interacting genes using diverse bioinformatics resources. Therefore, an assessment of these variants was performed by employing various SNP prediction resources and by choosing the overlapping SNPs to overcome the false-positive findings. The pathway analysis aids in investigating interrelationships of terms and functional groups that constitute biological networks (Bindea et al. 2009). The pathway analysis of 2-DG interacting genes emphasized various processes: cytochrome C-mediated apoptotic response, interleukin-4, and interleukin-13 signaling, among others. Interestingly, the assessment also indicated susceptible gene loci for SARS-CoV infections. The pathway assessment among 2-DG interacting genes also highlighted apoptosis-related signaling mediated by the caspase family of proteins which may modify the metabolism of cells and enhance the rate of cell death (Gioti et al. 2021) and its potential role in viral infection inhibition (Plassmeyer et al. 2021). Cell death due to 2-DG in various tumor cells has been reported and could be mediated by ER stress/autophagy in HCT116 colon cancer cells or through Cytochrome C-Caspase 3-PARP axis in certain other cells (Maximchik et al. 2018). Similarly, A2M which is a key anti-inflammatory protease can induce cell proliferation when ligated to chaperon GRP78 by increasing the glucose uptake (Vandooren and Itoh 2021). GRP78 also accumulates upon ER stress induced by 2-DG thus sequentially increasing its uptake when provided in place of glucose (Kim et al. 2018). Thus, any structural alterations in A2M may determine the efficacy of 2-DG treatment. Often 3′-UTR and less frequently exon bound miRNAs silence and regulate the genes at a posttranscriptional level. The variations due to SNPs introduced into the miRNA binding regions may diminish binding affinity and consequently affect its function (Prabhu et al. 2021). We extracted the SNP information of 2-DG interacting genes to unravel the miRNA binding sites employing three databases namely miRNASNP3, MirSNP, and PolymiRTS and examined whether or not miRNAs linked polymorphisms residing in the target region would generate or disrupt a miRNA-mRNA binding region. The findings of our study showed the impact of two miRNA target SNPs (rs1065154, rs10277) residing in the SQSTM1 gene which could create or break at ancestral and mutant allele. Expression quantitative trait loci analysis is a robust technique toward determining genetic loci linked with quantitative variations in gene expression. After employing Genome-Wide Association analysis to the set of records containing approximately 3,00,000 SNPs and 48,000 mRNA expression traits from high throughput technique, researchers found 1226 significant associations, out of which 95 associations were linked to ADME of drugs. The variant rs10277 residing in the gene SQSTM1 in human liver samples reported that allele C is linked with increased transcription compared to allele T. These data broaden our understanding regarding the genetic features of inter-individual variation in gene expression in conjunction with specific prominence on pharmacogenomics (Table S4) (Schröder et al. 2011; Whirl-Carrillo et al. 2012). In this study, the influence of polymorphisms in TFBS and enhancers were also analyzed. The massive number of genetic variants detected from GWAS resides in the genome’s noncoding region and are of significant interest when located in regulatory sites such as promoters and enhancers as these variants may influence gene expression and these may play a major role in the complex traits that elicits drug response. Thus, we screened the 5′-UTR and upstream SNPs of the selected genes to verify whether the substitution of SNP allele and modified TF binding sites would possibly perturb gene regulation (Buroker 2017). Pathogenic and other exposures cause leucocytes to respond quickly, with effects ranging from cytokine generation to migration and engulfing by phagocytosis (Marsin et al. 2002; Yang et al. 2012; Wahl et al. 2012). Activation of mononuclear cells with lipopolysaccharide enhanced the production of cytokines IL-1B, IL-6, and TNF-alpha, as predicted (Fangradt et al. 2012; Freemerman et al. 2014). Accelerated glycolytic flow produces ATP quickly to meet these critical processes, which are bioenergetically expensive (Palsson-Mcdermott and O’Neill 2013; Macintyre and Rathmell 2013). For all three cytokines, the competitive glycolysis inhibitor 2-DG dramatically inhibited lipopolysaccharide-mediated generation of cytokines (Jones et al. 2015). Our findings reported rs1143627 residing in IL1B generated two TFBS for PTBP, MYT1. One of the studies proclaimed that rs1143627 residing in the gene IL1B was found to be associated with Influenza A susceptibility in humans. The findings also showed that aged adults or individuals of any age with comorbid or immunosuppressive conditions might be at a greater risk of disease development. IL1B rs1143627 was also considered to be susceptibility alleles in individuals suffering from liver fibrosis infected by the hepatitis B virus (Wu et al. 2018). Extensive data reported the role of two variants, namely, rs712829 residing in EGFR gene and rs1143627 in IL1B gene in NCI-60 cancer cell lines and human samples, highlighting the effect of genotype on neoplasms and psoriasis on the usage of diverse drugs molecules (Tables S4, S5) (Whirl-Carrillo et al. 2012). Additionally, enhancers that regulate gene expression function as rheostats for transcription, which will further tune up the levels of specific transcripts (Corradin and Scacheri 2014). Henceforth, in the current study 43 SNPs have shown a wide spectrum of altered motifs that may result in gene regulation. Due to the complexity of the infection, an apt determinative model and efficacious medication for COVID-19 infection are yet to be evolved. As the innate immune system is inadequate to produce a powerful immune response counter to the virus, multi-targeted factors that mitigate viral infection, replication, and host immune reactions are warranted. In the present study, a sum of three polymorphisms (SQSTM1 rs10277, IL1B rs1143627, EGFR rs712829) of 2-DG interacting genes may increase the susceptibility to SARS-CoV infections than other polymorphisms. However, these identified polymorphisms need to be considered by experimental validation of the likelihoods proposed in the current work is required in larger cohorts for repurposing the drug. Further, this in silico study was conducted to shed light on the pharmacogenomic concerns of 2-DG against SARS-CoV-2. We believe that the selected variants in the current study should be wisely considered to overcome adverse drug reaction and to strengthen the foundation for future medical exploration. Nevertheless, it is universally believed that an SNP acts through neighboring genes when it is most likely connected to a phenotype or illness. Therefore, it is undeniable that the present strategy may overlook certain associated genes.

Conclusions

In the current in silico study, efforts were made to identify the genetic biomarkers of 2-DG interacting genes, which may determine the risk of gene polymorphisms and drug response. The in silico data mining strategy aids predominantly in finding the drug interacting genes, and their respective pathways and supports in assessing the influence of SNPs in distinct genic regions. Eventually, the information creates an integrated foundation to delineate the intricate molecular relationships among 2-DG interacting genes and may subsequently provide insight to predict COVID-19 infection risk and treatment strategies with 2-DG. Below is the link to the electronic supplementary material. Supplementary file1 (PDF 580 KB)
  54 in total

1.  Bioenergetic analysis of human peripheral blood mononuclear cells.

Authors:  N Jones; J Piasecka; A H Bryant; R H Jones; D O F Skibinski; Nigel J Francis; C A Thornton
Journal:  Clin Exp Immunol       Date:  2015-07-24       Impact factor: 4.330

2.  SNP2TFBS - a database of regulatory SNPs affecting predicted transcription factor binding site affinity.

Authors:  Sunil Kumar; Giovanna Ambrosini; Philipp Bucher
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

3.  Clinical Predictors of Liver Fibrosis in Patients With Chronic Hepatitis B Virus Infection From Children to Adults.

Authors:  Jia-Feng Wu; Shih-Hsi Song; Chee-Seng Lee; Huey-Ling Chen; Yen-Hsuan Ni; Hong-Yuan Hsu; Tzee-Chung Wu; Mei-Hwei Chang
Journal:  J Infect Dis       Date:  2018-04-11       Impact factor: 5.226

4.  The wonders of 2-deoxy-D-glucose.

Authors:  Haibin Xi; Metin Kurtoglu; Theodore J Lampidis
Journal:  IUBMB Life       Date:  2014-02-27       Impact factor: 3.885

5.  SWISS-MODEL: homology modelling of protein structures and complexes.

Authors:  Andrew Waterhouse; Martino Bertoni; Stefan Bienert; Gabriel Studer; Gerardo Tauriello; Rafal Gumienny; Florian T Heer; Tjaart A P de Beer; Christine Rempfer; Lorenza Bordoli; Rosalba Lepore; Torsten Schwede
Journal:  Nucleic Acids Res       Date:  2018-07-02       Impact factor: 16.971

Review 6.  'Sugarcoating' 2-deoxyglucose: mechanisms that suppress its toxic effects.

Authors:  Martin C Schmidt; Allyson F O'Donnell
Journal:  Curr Genet       Date:  2020-11-02       Impact factor: 3.886

Review 7.  Animal Coronaviruses Induced Apoptosis.

Authors:  Katerina Gioti; Christine Kottaridi; Chrysa Voyiatzaki; Dimitrios Chaniotis; Theodoros Rampias; Apostolos Beloukas
Journal:  Life (Basel)       Date:  2021-02-26

8.  COVID-19: A game of drugs, vaccines, hope and… death!

Authors:  Aloka Samantaray; Edward Johnson; Nishant Kumar; Lalit Mehdiratta
Journal:  Indian J Anaesth       Date:  2021-06-22

9.  PolymiRTS Database 3.0: linking polymorphisms in microRNAs and their target sites with human diseases and biological pathways.

Authors:  Anindya Bhattacharya; Jesse D Ziebarth; Yan Cui
Journal:  Nucleic Acids Res       Date:  2013-10-24       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.