Literature DB >> 27212841

Towards finding the linkage between metabolic and age-related disorders using semantic gene data network analysis.

Mohammad Uzzal Hossain1, Abu Zaffar Shibly1, Taimur Md Omar1, Fatama Tous Zohora1, Umme Sara Santona1, Md Jakir Hossain1, Md Sadek Hosen Khoka1, Chaman Ara Keya2, Md Salimullah3.   

Abstract

A metabolic disorder (MD) occurs when the metabolic process is disturbed. This process is carried out by thousands of enzymes participating in numerous inter-dependent metabolic pathways. Critical biochemical reactions that involve the processing and transportation of carbohydrates, proteins and lipids are affected in metabolic diseases. Therefore, it is of interest to identify the common pathways of metabolic disorders by building protein-protein interactions (PPI) for network analysis. The molecular network linkages between MD and age related diseases (ARD) are intriguing. Hence, we created networks of protein-protein interactions that are related with MD and ARD using relevant known data in the public domain. The network analysis identified known MD associated proteins and predicted genes and or its products of ARD in common pathways. The genes in the common pathways were isolated from the network and further analyzed for their co-localization and shared domains. Thus, a model hypothesis is proposed using interaction networks that are linked between MD and ARD. This data even if less conclusive finds application in understanding the molecular mechanism of known diseases in relation to observed molecular events.

Entities:  

Year:  2016        PMID: 27212841      PMCID: PMC4857462          DOI: 10.6026/97320630012022

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Metabolic disorder (MD) is a cluster of metabolic risk factors characterized by obesity, elevated blood pressure, increased plasma glucose (fasting), high triglycerides in serum and decreased high-density cholesterol levels [1]. Metabolic disorder affected people are at increased risk for atherosclerosis, peripheral vascular disease, coronary heart disease, myocardial infarction, stroke, and type 2 diabetes [2-5]. These are the leading causes of disability worldwide [6]. The consequences of metabolic disorders are often treated by healthy weight, diet and physical activities [7,8]. So, evaluation of metabolic risk factors and the identification of population groups at risk of chronic diseases are essential for developing prevention strategies. Hence, the dynamic modeling of biological systems to describe various human diseases is of interest in recent years. The complex network of proteins (gene products) and their biological processes mediating interactions among them in these diseases are of importance to understand. The application of protein interaction networks to available disease datasets in the public domain allows the identification of genes and their corresponding proteins. This helps the creation of sub-networks to study network properties for the classification of diseaseassociated genes in networks. It is found that several strategies have been employed to analyze gene networks using data for protein interactions in these conditions. However, this is a complex and a challenging task to pursue [9]. The information related to the disease mechanism gleaned using data for gene networks at a system level is critical yet it is highly convoluted. This is possible by collecting relevant data followed by cleaning such data by removing redundant information for useful yet specific knowledge establishments. This is helpful for improved data analysis followed by data integration to create a reliable model of the disease under study. Thus, gene network methods have been used to gain insights into disease mechanisms [10,11], co-morbidity (anomalous conditions) [12,13], protein target identification [14-16] and biomarker detection [17,18]. The gene network based study includes elucidation of a complex system by fragmenting them into finite components (nodes or vertices) and interactions (edges). This conceptual illustration helps in the understanding of complex molecular disturbances in diseased conditions. Therefore, it is of interest to use graph theory based pathway diagrams using pertinent co-localization information with shared domain data between MD and ARD by mediating protein-protein interaction networks to identify the genes in a common pathway among disease types, states and conditions.

Methodology

Disease associated gene data collection from known literature

We gathered disease associated proteins (gene products) and or their corresponding genes related data from publically (WWW – World Wide Web) available databases such as PubMed (http://www.ncbi.nlm.nih.gov/pubmed), PubMed Central (PMC - (http://www.ncbi.nlm.nih.gov/pmc) and other open access journals maintained by several publishers across Nations. This is done through disease specific manual keyword (metabolic disorder (MD), age related disorder (ARD), relevant genes) searching, article gathering, visual scanning, reading, studying, understating, cleaning, grouping, labeling, refining, storing in simple RDBMS, and subsequent data retrieval for value addition, information enrichment and knowledge creation on the subject of the study. It should be noted that PubMed and PMC are maintained at National Centre for Biotechnology (NCBI), National Institute of Health (NIH), USA.

Disease specific network creation

We used GeneMANIA (http://www.genemania.org/) [19] to collect data related to metabolic disorder (MD) for relevant information gathering and knowledge establishment with available graphical network diagrams retrieved from the server in this study. GeneMANIA provides data for protein-protein interactions, protein-DNA interactions and or protein-gene interactions, corresponding pathways, associated reactions, available phenotypic profiles and genes expression data with corresponding known yet characterized proteins in the network. The data thus described is reasonably representative if not comprehensive.

Common pathway identification

We used the pathway database (http://www. pathwaycommons.org) [20] to identify the common pathways amongst genes of metabolic disorder (MD). Pathway commons is a collection of pathway information from multiple organisms. A comprehensive collection of biological pathways from multiple sources is provided by pathway commons and represented in a common language for genes based metabolic pathway analysis.

Pathway analysis

We used Reactome (http://www.reactome.org/), a curated, peer reviewed and an open source pathway database for exploring the pathway knowledge for common genes [21].

Gene localization and domain analysis

Genes are often expressed in the same tissue and corresponding proteins can be found in the same location. Two or more genes are linked if they are expressed in the same tissue and their corresponding proteins are found in the same cellular location. Similarly, two proteins (gene products) are linked if they have the same defined (sequence and or structure) protein domain. This is completed using InterPro, SMART and Pfam facilities in the public domain. Gene localization and domain analysis are completed using the tools at GeneMANIA (http://www.gene mania.org/).

Results & Discussion

The hypothesis describing the mechanism leading the molecular disturbances in metabolic disorders (MD) and agerelated disorders (ARD) is usually non-trivial nature. Therefore, it is important to relate a disease condition with its know yet reasonably representative associated genes for the construction of its corresponding protein-protein interaction networks. It is also of critical importance to identify genes and or their protein products that share common pathways between different disease conditions (e.g. MD and ARD). Therefore, we used textmining (keyword searching) techniques to identify genes associated with MD as shown in Figure 1. The text-mining analysis searched for respective known genes associated with 36 metabolic diseases (Table 1). Thus, genes known to be associated with each described metabolic disorder is listed.
Figure 1

A schematic representation for the linkage hypothesis between MD and ARD is shown.

Table 1

Description of common genes in pathways associated with specific diseases.

Metabolic disordersResponsible genes
DiabetesType 1HLA DQA1, DQB1, DRB1
Type 2TCF7L2, PPARG, KCNJ11, NOTCH2, WFS1, IGF2BP2, SLC30A8, JAZF1, HHEX
Phenyl-keto-nuriaPAH
Glucose galactose mal-absorptionSGLT1
Tyrosine-miaFAH, HPD, TAT
AlkaptonuriaHGD
Homo-cystinuriaCBS, MTHFR, MTR, MTRR, MMADHC
Maple syrup urine disease BCKDHA, BCKDHB, DBT
Propionic acid-emiaPCCA, PCCB
Methyl-malonic acid-emiaMUT, MMAA, MMAB, MMADHC, MCEE
Hyper-cholesterol-emiaAPOB, LDLR, LDLRAP1, PCSK9
Glycogen storage diseaseType 0 GYS1, GYS2
Type 1G6PC, SLC37A4
Type 2GAA (Pomp disease)
Type 3AGL
Type 4GBE1
>Type 5PYGM
Type 6PYGL
Type 7PFKM
Type 9PHKA1, PHKA2, PHKB, PHKG2
Type 10PGAM2 (Phospho-glycerate mutase deficiency)
Galactose-miaGALE, GALK1, GALT
Congenital glycosylation disorderALG12
Type IcALG6
Type IiiCOG5
Lesch-Nyhan syndromeHPRT1
Gaucher diseaseGBA
Tay-Sachs diseaseHEXA
Fabry diseaseGLA
Hurler syndromeIDUA
Hunter syndromeIDS
Sanfilippo syndromeGNS, HGSNAT, NAGLU, SGSH
Maroteaux-Lamy syndromeARSB
Morquio syndromeGALNS, GLB1
Refsum diseasePEX7, PHYH
Hemo-chromatosisHAMP, HFE, HFE2, SLC40A1, TFR28
The listed data were further analyzed to identify key genes associated with various diseases for network analysis and evaluation (Figure 2). This exercise identified five common pathways with predicted genes for possible disease association from the network analysis (Figure 3). Subsequently, common pathway genes with specific diseases are thus summarized (Table 2). This data shows that APOB, LDLR, APOE, LIPC genes are found in common pathways for lipid digestion, mobilization, and transport. Results show that two genes APOE, LIPC were not in MD but they shared the common pathway with APOB, LDLR responsible for Glycogen storage Disease type 0 (GSD). These two predicted genes are known to cause age-related macular degeneration (AMD) [22,23]. The BCKDHA, DBT, DLD and BCKDHB genes were found in the pathways of branched-chain amino acid catabolism and metabolism of amino acids and its derivatives. However, the predicted DLD gene was known to be associated with skin photo aging [24]. The other three genes (BCKDHA, DBT, and BCKDHB) are the causative agent of MD described as Male Syrap Urine Disease (MSUD) as given in Table 1.
Figure 2

A network of genes associated with the metabolic disorder (MD) is illustrated in this diagram. It should be noted that genes that are already known to be associated with the disorder is shown in the figure. Black nodes indicate known MD genes and gray nodes indicate genes that are predicted to be associated with the disorder.

Figure 3

An illustration showing common pathways between MD and ARD is given. (A) Common pathways of MD with predicted disorder associated genes are shown. (B) Five common pathways between MD and predicted disorder genes are illustrated.

Table 2

Description of manually gleaned common pathway genes with known specific diseases.

ID = identifier; ATP = adenosine tri-phosphate; K+ = potassium; SLC = solute carrier; ABC = ATP-binding cassette; AMD = age-related macular degeneration; HC = Hemo-chromatosis; GSDV = Glycogen storage disease type 5; GSDIX = Glycogen storage disease type 9; MSUD = Maple Syrap Urine Disease; GSD = Glycogen storage disease type 0; SP = skin photo aging; glyco-genolysis = glycogen breakdown

GenesPathway ID R-HAS(prefix)PathwaysKnown disease genesCommon pathway (predicted)Reference
DiseaseGenesGenesDisease
APOB; LDLR; APOE; LIPC73923Lipid digestion, mobilization and transportGSDAPOB APOE AMD[22]
LDLLIPCAMD[23]
BCKDHA; DBT; DLD; BCKDHB70895Branched chain amino acid catabolismMSUDBCKDHA; BCKDHB; DBTDLDSP[24]
71291Metabolism of amino acids and its derivatives
PYGM; PHKB;PHKG1; PHKA170221Glyco-genolysisGSDVPYGMPHKG1Adenoid cystic carcinoma[25]
70326Carbohydrate metabolismGSDIXPHKB; PHKA1
71387
1430728
HEPH;CP;SLC40A1425410Metal ion SLC transportersHCSLC40A1HEPHAMD[26]
917937Iron uptake and transport
425366Transport of sugars (glucose), bile salts, organic acids, metal ions and amine compounds
425407SLC-mediated trans-membrane transportCPAMD[26]
382551Trans-membrane transport of small molecules
ABCC8;KCNJ111296025ATP sensitive K+ channelsType 2 diabetesKCNJ11ABCC8Ageing[27]
1296065Inwardly rectifying K+ channels
382556ABC proteins mediated transport
422356Insulin secretion regulation
1296071K+ channels
163685Energy metabolism
112316Neuronal system
382551Trans-membrane transport of small molecules
1430728Metabolism related
The four genes (PYGM, PHKB, PHKG1, PHKA1) were found in a common pathway from the network analysis sharing glycogenolysis (glycogen breakdown) and carbohydrate metabolism. The predicted gene PHKG1 is known to be associated with adenoid cystic carcinoma [25]. The other three genes PYGM, PHKB and PHKA1 involved in glycogen storage disease type 5 (GSDV) and glycogen storage disease type 9 (GSDIX) is also observed in the network analysis. The other set of genes (HEPH, CP and SLC40A1) were also found in common pathways of metal ion SLC transporters, iron uptake and transport, glucose transport, transport of bile salts and organic acids, metal ions and amine compounds, SLC-mediated transmembrane transport and trans-membrane transport of small molecules. It should be noted that HEPH and CP are not associated with MD but they are involved with age-related macular degeneration (AMD) [25]. Two genes (ABCC8 and KCNJ11) associated with Type 2 diabetes (Table 2) and aging share a common pathway and hence have a linkage [26]. ABCC8 and KCNJ11 shared the common pathways for ATP sensitive K+ channels, inwardly rectifying K+ channels, ABC-family proteins mediated transport, regulation of insulin secretion, integration of energy metabolism, neuronal system association, trans-membrane transport of small molecules and other metabolism as given in Table 2. Common genes associated with both MD and ARD are further processed for molecular interactions using network analysis as described in the methodology section. It is further showed that these genes shared the same domain in pathway regulation (Figure 4). They are also co-localized with each other in tissues when expressed.
Figure 4

A network between co-localized and domain-sharing genes is shown. Black nodes indicate the common pathway of genes between MD and ARD.

It is of important to understand the specific molecular pathways unique to a disease to elucidate the difference in these pathways. Therefore, it is essential to construct a ‘linkage network’ between diseases that are inter-linked by one or more genes found associated with the diseases using simplified network diagrams. We illustrated a linkage network based on pathway data, domain information and co-localization analyses between MD and ARD in this report. Thus, a model hypothesis is proposed using interaction networks that are linked between MD and ARD (Figure 5).
Figure 5

Venn diagram showing MD (dark green) and ARD (dark blue) disorders, and common genes of MD (light green) and ARD (light blue) is shown. Set of common pathway of genes for MD (light green) and intersection (Ash) for set of common genes between MD (light green) and ARD (light blue), (MD) ∩ (ARD) = {APOE, LIPC, HEPH, CP, DLD, ABCC8} are shown.

Conclusion

We report data for common pathway of genes responsible for metabolic disorders (MD) and age related disorders (ARD). Data shows the linkage of genes in these diseases by analyzing their co-localization and shared domains. Pathway analysis with gene regulatory network evaluation using gene circuits and module design for further analysis of gene product interactions is essential for understanding the mechanism of the disease in relation to molecular cellular biology events.
  25 in total

1.  The human disease network.

Authors:  Kwang-Il Goh; Michael E Cusick; David Valle; Barton Childs; Marc Vidal; Albert-László Barabási
Journal:  Proc Natl Acad Sci U S A       Date:  2007-05-14       Impact factor: 11.205

2.  Drug-target network.

Authors:  Muhammed A Yildirim; Kwang-Il Goh; Michael E Cusick; Albert-László Barabási; Marc Vidal
Journal:  Nat Biotechnol       Date:  2007-10       Impact factor: 54.908

3.  Network-based drug ranking and repositioning with respect to DrugBank therapeutic categories.

Authors:  Matteo Re; Giorgio Valentini
Journal:  IEEE/ACM Trans Comput Biol Bioinform       Date:  2013 Nov-Dec       Impact factor: 3.710

4.  Metabolic syndrome as a precursor of cardiovascular disease and type 2 diabetes mellitus.

Authors:  Peter W F Wilson; Ralph B D'Agostino; Helen Parise; Lisa Sullivan; James B Meigs
Journal:  Circulation       Date:  2005-11-07       Impact factor: 29.690

5.  CpG island methylation profiling in human salivary gland adenoid cystic carcinoma.

Authors:  Achim Bell; Diana Bell; Randal S Weber; Adel K El-Naggar
Journal:  Cancer       Date:  2011-01-11       Impact factor: 6.860

6.  Genetic association of apolipoprotein E with age-related macular degeneration.

Authors:  C C Klaver; M Kliffen; C M van Duijn; A Hofman; M Cruts; D E Grobbee; C van Broeckhoven; P T de Jong
Journal:  Am J Hum Genet       Date:  1998-07       Impact factor: 11.025

7.  Pathway Commons, a web resource for biological pathway data.

Authors:  Ethan G Cerami; Benjamin E Gross; Emek Demir; Igor Rodchenkov; Ozgün Babur; Nadia Anwar; Nikolaus Schultz; Gary D Bader; Chris Sander
Journal:  Nucleic Acids Res       Date:  2010-11-10       Impact factor: 16.971

Review 8.  Metabolic syndrome: the danger signal in atherosclerosis.

Authors:  Patrick Mathieu; Philippe Pibarot; Jean-Pierre Després
Journal:  Vasc Health Risk Manag       Date:  2006

9.  Validation of candidate causal genes for obesity that affect shared metabolic pathways and networks.

Authors:  Xia Yang; Joshua L Deignan; Hongxiu Qi; Jun Zhu; Su Qian; Judy Zhong; Gevork Torosyan; Sana Majid; Brie Falkard; Robert R Kleinhanz; Jenny Karlsson; Lawrence W Castellani; Sheena Mumick; Kai Wang; Tao Xie; Michael Coon; Chunsheng Zhang; Daria Estrada-Smith; Charles R Farber; Susanna S Wang; Atila van Nas; Anatole Ghazalpour; Bin Zhang; Douglas J Macneil; John R Lamb; Katrina M Dipple; Marc L Reitman; Margarete Mehrabian; Pek Y Lum; Eric E Schadt; Aldons J Lusis; Thomas A Drake
Journal:  Nat Genet       Date:  2009-03-08       Impact factor: 38.330

10.  The impact of cellular networks on disease comorbidity.

Authors:  Juyong Park; Deok-Sun Lee; Nicholas A Christakis; Albert-László Barabási
Journal:  Mol Syst Biol       Date:  2009-04-07       Impact factor: 11.429

View more
  3 in total

1.  Finding Potential Therapeutic Targets against Shigella flexneri through Proteome Exploration.

Authors:  Mohammad Uzzal Hossain; Md Arif Khan; Abu Hashem; Md Monirul Islam; Mohammad Neaz Morshed; Chaman Ara Keya; Md Salimullah
Journal:  Front Microbiol       Date:  2016-11-22       Impact factor: 5.640

2.  Pathway based therapeutic targets identification and development of an interactive database CampyNIBase of Campylobacter jejuni RM1221 through non-redundant protein dataset.

Authors:  Mohammad Uzzal Hossain; Taimur Md Omar; Iftekhar Alam; Keshob Chandra Das; A K M Mohiuddin; Chaman Ara Keya; Md Salimullah
Journal:  PLoS One       Date:  2018-06-08       Impact factor: 3.240

3.  Exploring Genomic Variants Related to Residual Feed Intake in Local and Commercial Chickens by Whole Genomic Resequencing.

Authors:  Jie Liu; Ranran Liu; Jie Wang; Yonghong Zhang; Siyuan Xing; Maiqing Zheng; Huanxian Cui; Qinghe Li; Peng Li; Xiaoyan Cui; Wei Li; Guiping Zhao; Jie Wen
Journal:  Genes (Basel)       Date:  2018-01-24       Impact factor: 4.096

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.