Literature DB >> 25879401

Network Cluster Analysis of Protein-Protein Interaction Network-Identified Biomarker for Type 2 Diabetes.

Zhonghui Li1, Zijun Qiao1, Wenling Zheng1, Wenli Ma1.   

Abstract

Type 2 diabetes mellitus (T2DM) is a complex disease that is caused by an impairment in the secretion of β-cell insulin and by a peripheral resistance to insulin. Most patients suffering from T2DM and from obesity exhibit insulin resistance in the muscles, liver, and fat, resulting in a reduced response of these tissues to insulin. In healthy individuals, pancreatic islet β-cells secrete insulin to regulate the increase in blood glucose levels. Once these β-cells fail to function, T2DM develops. Despite the progress achieved in this field in recent years, the genetic causes for insulin resistance and for T2DM have not yet been fully discovered. The present study aims to characterize T2DM by comparing its gene expression with that of normal controls, as well as to identify biomarkers for early T2DM. Gene expression profiles were downloaded from the Gene Expression Omnibus, and differentially expressed genes (DEGs) were identified for type 2 diabetes. Furthermore, functional analyses were conducted for the gene ontology and for the pathway enrichment. In total, 781 DEGs were identified in the T2DM samples relative to healthy controls. These genes were found to be involved in several biological processes, including cell communication, cell proliferation, cell shape, and apoptosis. We constructed a protein-protein interaction (PPI) network, and the clusters in the PPI were analyzed by using ClusterONE. Six functional genes that may play important roles in the initiation of T2DM were identified within the network.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25879401      PMCID: PMC4504429          DOI: 10.1089/dia.2014.0204

Source DB:  PubMed          Journal:  Diabetes Technol Ther        ISSN: 1520-9156            Impact factor:   6.118


Introduction

Improvements in living standards and lifestyle changes have brought about an increase in the morbidity of type 2 diabetes mellitus (T2DM), a disease that seriously endangers people's lives.[1,2] According to the International Diabetes Federation, 100 million patients suffered from diabetes in the end of the 20th Century, and this number has grown rapidly to 382 million as of 2013. In 2035, the world is expected to have 592 million people affected by T2DM.[3] T2DM is a complex multifactorial disorder that is characterized by insulin resistance and impaired β-cell functions.[4-6] Numerous multifactorial mechanisms are involved in the development of insulin resistance and in the impairment of insulin secretion, including genetic and environmental factors that lead to a decline in insulin activity, followed by a chronic pancreatic β-cell dysfunction.[7,8] However, insulin resistance alone may not be a decisive pathogenic factor of T2DM because insulin resistance often exists in many people without diabetes, and, in fact, the pancreatic islets of the Langerhans play a more important role in the development of T2DM.[9] Prediabetes insulin resistance does not always lead to diabetes because a considerable proportion of individuals with prediabetes do not develop T2DM when the β-cells compensate for the insulin deficiency by secreting insulin to maintain glucose homeostasis. When insulin secretion by the β-cells cannot keep pace with the increase in insulin resistance, high blood glucose levels will occur throughout the body, and T2DM will develop.[7,8,10,11] The increase in blood glucose level in diabetes patients leads to a wide variety of long-term complications, including retinopathy, nephropathy, and cardiovascular disease. The complications associated with diabetes result in chronic tissue damage—especially of important organs of the central nervous system, the kidneys, and the heart—as well as in chronic damage to other tissues, seriously jeopardizing the normal functions of the human body. In this study, the differentially expressed genes (DEGs) between those with diabetes and normal controls were identified, and the biomarkers of diabetes were screened based on a protein–protein interaction (PPI) network. Our results may contribute to the further understanding of diabetes.

Materials and Methods

DNA microarray data

GSE25724 microarray data were downloaded from the Gene Expression Omnibus database,[12] and a microarray analysis was performed to evaluate the differences in the transcriptome of human T2DM islets samples in comparison with those of samples from subjects without diabetes. Collagenase digestion, followed by density gradient purification, was used to isolate human islets from the pancreas of organ donors, and the islets were then hand-picked and cultured for 2 days in M199 culture medium. A GeneChip® human genome U133A array (Affymetrix, Santa Clara, CA) was used to process a total of 13 arrays that were available for further analysis, including six arrays of T2DM and seven without diabetes. Pancreata were obtained with the approval of the local Ethics Committee and consent by donors' relatives. Human islets were isolated from seven organ donors without diabetes (age, 58±17 years; gender, four males/three females; body mass index, 24.8±2.5 kg/m2) and six organ donors with T2DM (age, 71±9 years; gender, three males/three females; body mass index, 26.0±2.2 kg/m2) by collagenase digestion followed by density gradient purification as previously reported.

Identification of DEGs and methylated sites

We use the Qlucore Omics Explorer version 3.0 software program (Qlucore AB, Lund, Sweden) to analyze the data from the microarray. Qlucore Omics Explorer is a next-generation bioinformatics software that can analyze and explore datasets, including those obtained from gene expression arrays, micro RNA arrays, protein arrays, and DNA methylation arrays. Qlucore Omics Explorer uses state-of-the-art mathematical and statistical methods to provide new methods to perform data analysis and data mining.

Principal component analyses and hierarchical clustering analysis of DEGs

A hierarchical clustering analysis was carried out for the DEGS to identify the differences in gene expression in a more intuitive manner. The Qlucore Omics Explorer version 3.0 software was used to generate the principal component analyses, hierarchical clustering, and a heat map.

Function and pathway enrichment analysis of DEGs

The Database for Annotation, Visualization and Integrated Discovery (DAVID) of bioinformatics resources consists of an integrated biological knowledge base and analytical tools aimed at systematically extracting biological meaning from large lists of genes or proteins.[13] DAVID was applied in order to conduct the gene ontology analysis, and the Kyoto Encyclopedia of Genes and Genomes was used to carry out the pathway enrichment analysis. A false discovery rate of <0.05 and a count of >2 were chosen as the cutoff criteria.

PPI data

In order to obtain more reliable relationships, duplicate edges and interactions with self-loops were deleted. Finally, in total, 5,233 PPIs were obtained for further investigation. The PPI data were downloaded from the Biological General Repository for Interaction Datasets (BioGRID), the Molecular INTeraction database (MINT), the IntAct molecular interaction database (IntAct), and the iRefIndex database. BioGRID houses and distributes collections of protein and genetic interactions from major model organism species. As of August 1, 2013, BioGRID (a general repository for interaction datasets) contains 703,150 interactions from 32 different species, as derived from 41,099 high-throughput and conventional focused studies. At present, the MINT team focuses on the physical interactions between proteins, storing information on molecular interactions in a structured format in the MINT database by extracting experimental details from studies published in journals. The data in the current version of MINT include 241,458 experimentally verified interactions. IntAct (an open source molecular and open interaction database) is a comprehensive, open resource database as well as an analysis system for protein interactions that can promote consistency in the interaction datasets by using the same infrastructure and annotation system. The iRefIndex database (a consolidated protein interaction database with provenance) provides an index of protein interactions available in several primary interaction databases, including BIND, BioGRID, CORUM, DIP, HPRD, IntAct, MINT, MPact, MPPI, and OPHID. These data include multiple interaction types, including physical and genetic interactions (mapped to their corresponding protein products) that are determined through a multitude of methods.

PPI network analysis

The PPI information data were imported into Cytoscape (an open source software platform), and a PPI network was easily constructed. Clustering with overlap neighborhood expansion (ClusterONE) was used in the Cytoscape network in order to discover densely connected regions in the network with possible overlap. A quality function can guide ClusterONE by growing dense regions out of small seeds. The modules were identified to have a minimum density of >0.05 and a degree of >6. A cluster with a P value of <0.001 was determined to be a new network. Several parameters from the nodes in the network were obtained by using the Network Analyzer in Cytoscape, which can be used to help us determine the hub nodes for the network.

Results

The DEGs of the GSE25724 data for human T2DM islets and those from subjects without diabetes were identified using Qlucore Omics Explorer version 3.0. As shown in Figure 1, in total, 781 genes were screened, including 741 down-regulated genes, leaving 40 genes remaining up-regulated (P=0.005, q=0.0037, fold change ≥2 times).

(A) Cluster analysis of differentially expressed genes. The colors from green to black and to red represent the expression values of the differentially expressed genes that have increased. (B) Principal cluster analysis plots. The yellow samples represent type 2 diabetes mellitus islets, and the blue samples represent islets from subjects without diabetes. Color graphics are available online at www.liebertonline.com/dia.

(A) Cluster analysis of differentially expressed genes. The colors from green to black and to red represent the expression values of the differentially expressed genes that have increased. (B) Principal cluster analysis plots. The yellow samples represent type 2 diabetes mellitus islets, and the blue samples represent islets from subjects without diabetes. Color graphics are available online at www.liebertonline.com/dia.

Hierarchical clustering analysis

As shown in Figure 1, the expression value for most DEGs for islets from subjects without diabetes was higher (deeper) than that for T2DM islets, suggesting that almost all the DEGs in islets from subjects without diabetes were down-regulated. These observations were consistent with the fact that DEG analysis indicated 741 out of 781 genes were down-regulated.

Function and pathway annotation

In Table 1, in total, 10 functions were enriched; the most significant function was that of intracellular transport (P=3.21E-16). In Table 2, the most significant pathway was the proteasome signaling pathway (P=2.16E-07). The dysregulated pathways in T2DM included pathways associated with signal transduction, such as the proteasome pathway, citrate cycle (tricarboxylic acid cycle) pathway, and the ubiquitin-mediated proteolysis pathway.
1.

Classification of Genes in the Functional Modules According to Gene Ontology Terms with

Term IDDescriptionCountP value
0046907Intracellular transport823.24E-16
0006511Ubiquitin-dependent protein catabolic process411.78E-12
0032268Regulation of cellular protein metabolic process514.26E-08
0044265Cellular macromolecule catabolic process784.87E-12
0051438Regulation of ubiquitin-protein ligase activity223.17E-11
0006084Acetyl-coenzyme A metabolic process96.22E-05
0051340Regulation of ligase activity226.94E-11
0008135Translation factor activity, nucleic acid binding177.02E-06
0031396Regulation of protein ubiquitination237.61E-10
0005739Mitochondrion878.04E-08
2.

Classification of Genes in Functional Modules According to Kyoto Encyclopedia of Gene and Genomes Terms with False Discovery Rate of <0.05 (Top 10)

TermCountP value
hsa03050:Proteasome152.16E-07
hsa03010:Ribosome161.17E-04
hsa00020:Citrate cycle (tricarboxylic acid cycle)92.85E-04
hsa04120:Ubiquitin-mediated proteolysis203.04E-04
hsa04114:Oocyte meiosis150.004212
hsa03060:Protein export40.0087
hsa03040:Spliceosome150.013725
hsa05012:Parkinson's disease150.015611
hsa00510:N-Glycan biosynthesis80.015655
hsa00071:Fatty acid metabolism70.026233
hsa00620:Pyruvate metabolism70.026233
hsa04110:Cell cycle140.028226
hsa00640:Propanoate metabolism60.03522
hsa03430:Mismatch repair50.041016
hsa03022:Basal60.04936
Classification of Genes in the Functional Modules According to Gene Ontology Terms with Classification of Genes in Functional Modules According to Kyoto Encyclopedia of Gene and Genomes Terms with False Discovery Rate of <0.05 (Top 10)

Construction of PPI network and network clustering

The PPI network was constructed with 8,731 nodes and 35,456 edges, and the DEGs were mapped into PPIs of the network. The clustering analysis was conducted in the PPI network by ClusterONE plugin in Cytoscape. The cluster with a significant value of 0.001 remained in order to construct a new network containing 295 nodes. The hub nodes with betweenness centrality and closeness centrality more than average, and a degree of more than twofold on average was kept for further analysis after the network analysis. Six of these are DEGs: ANAPC10, LAMTPR5, RRAGA, RRAGD, UBE2D2, and YWHAQ (Fig. 2).

The significant functional modules in the protein–protein interaction network. The yellow node represents differentially expressed genes, and the green node represents normal genes. The edge represents the interaction relationship among proteins. Color graphics are available online at www.liebertonline.com/dia.

The significant functional modules in the protein–protein interaction network. The yellow node represents differentially expressed genes, and the green node represents normal genes. The edge represents the interaction relationship among proteins. Color graphics are available online at www.liebertonline.com/dia.

Discussion

In the past few decades, the development of bioinformatics has enabled for targeted genetic and molecular treatments to become an important part of disease therapy.[14,15] In this study, we analyzed the gene expression profiles to select DEGs in normal subjects and T2DM patients at different stages. In total, 781 genes were identified between normal subjects and T2DM patients. Of these, 740 genes were down-regulated, and 41 were up-regulated. DAVID was used to conduct function and pathway enrichment. In addition, the interaction network was constructed for the DEGs, and the degrees of hub genes were calculated.[16] The anaphase-promoting complex (APC) is a cell cycle-regulated ubiquitin ligase (also referred to as a cyclosome) that is composed of at least 11 subunits. ANAPC10 is a core subunit of the APC, and the C-terminus of APC10 binds to CDC27/AP3, playing a role in controlling progression through mitosis and the G1 phase of the cell cycle. [17] The subunit ANANC10/APC10 may have a function in ubiquitination reactions due to the conserved core (residues of 22–161 amino acids of all 185) in its structure that is homologous to other founded domains in putative ubiquitin ligases. The low expression of this gene in human β-cells may reduce the secretion of glucagon, which results in the occurrence of T2DM.[18] The hepatitis B X-interacting protein (HBXIP/LAMTOR5) is located at the human chromosome 1p13.3 and encodes a 9.6-kDa protein. It was originally identified as a result of its interaction with the C-terminus of the hepatitis B virus X protein, which negatively regulates the activity of hepatitis B virus X protein.[19] In hepatoma cells, hepatitis B virus X protein plays a role in suppressing apoptosis initiated via the mitochondrial/cytochrome c pathway. LAMTOR5 promotes the proliferation of hepatoma cancer cells via the phosphatidylinositol 3-kinase/Akt pathway.[20-22] In our study, the low expression of this gene in T2DM patients may promote apoptosis in the pancreas islets. However, its potential role in the development of T2DM should also be investigated. The YWHAQ(14-3-3) family of proteins, including the seven isoforms β, ɛ, η, γ, σ, ζ, and τ in mammals, belongs to a family of highly conserved 30-kDa molecules[23,24] that can bind to target proteins depending on the phosphorylated Ser or Thr residues mainly within the two 14-3-3 binding motifs, RSXpSXP (mode I) and RXXXpSXP (mode II), of the target protein (pS represents phosphoserine).[25,26] The 14-3-3 proteins have a wide range of potential functions and pathological relevance,[27-33] regulating the intercellular signal processes for differentiation, development, and growth, and binding to many important target molecules, including transcription cofactors. In addition, they can prevent or mediate apoptosis and survival by controlling the localization of the potential signaling molecules between the nucleus or the mitochondrion and the cytoplasm. In our study, 14-3-3 was down-regulated in T2DM patients, and we could conclude that 14-3-3 plays an important role in the development of T2DM. The ubiquitination of proteins is mediated by a cascade of enzymes, including E1 (ubiquitin activating), E2 (ubiquitin conjugating), and E3 (ubiquitin ligases) enzymes. UBE2D2 encodes a member of the E2 enzyme family. Also, the tumor suppressor protein p53 and the peroxisomal biogenesis factor 5 are the substrates of this enzyme. UBE2D2 is an essential protein in the degradation of many regulatory and abnormal proteins,[34] and UbcH5A and UbcH5C together form one of the most active classes of E2 enzymes related to UBE2D2.[35] However, these could be associated with the degradation of several important human transcription factors, such as p53,[36] nuclear factor κB, and c-Fos.[37] Recently, it has been noticed that the ubiquitination of proteins could not be restricted to protein degradation only, but could also contain other regulatory functions in, for example, signal transduction, transcription regulation, chromatin remodeling, and DNA repair.[38] In our study, the low expression of this gene in T2MD patients may result in the degradation of insulin. RRAG A/Rag A is a functional homolog to yeast Gtr1p and is a member of the Ras superfamily of small G proteins. Moreover, while polymerase chain reaction methods were used to isolate RRAGA/Rag A/FIP-1 during the search for novel G proteins, RRAGA/RagA was observed to develop a complex with other G proteins, RRAG C/RagC and RRAG D/Rag D.[39] RRAGD is a monomeric guanine nucleotide-binding protein, or G protein, that acts as a molecular switch in numerous cell processes and signaling pathways while binding to GTP or GDP, which are small G proteins. Previous studies indicated that RRAGA is closely related to RRAGD and that both participate in the mammalian target of rapamycin signaling pathway related to the insulin signaling pathway. This suggests that RagD has a role in mTORC1 activation and that a molecular function for the FLCN tumor suppressor could potentially exist. In our study, we can conclude that the down-regulated expression of RRAGA and RRAGD is a potential biomaker for early-onset T2DM. From the above discussion, we could draw the conclusion that the six genes (ANAPC10, LAMTPR5, RRAGA, RRAGD, UBE2D2, and YWHAQ) are identified as differentially expressed and are all down-regulated in T2DM patients (Fig. 3). Meanwhile, these six DEGs are hub nodes in the PPI network (Table 3). Thus, we predict that all these six genes could be potential biomarkers of early T2DM. Furthermore, Figure 2 indicates that RRAGA, RRAGD, and LAMTOR5 are closely related. All in all, we can regard the six genes to be biomarkers for T2DM.

The line map of GSE25724 shows six genes that are all down-regulated in type 2 diabetes mellitus. Color graphics are available online at www.liebertonline.com/dia.

3.

The Six Differentially Expressed Genes and Their 30 Hub Nodes

Hub nodeCloseness centralityDegreeBetweenness centrality
ANAPC10.28416779150.00161029
ANAPC100.36020583180.08334712
ANAPC10P10.35897436160.07459775
ANAPC130.33386328150.02966666
ANAPC160.28340081135.98E-04
ANAPC20.28074866110.00957107
ANAPC40.28416779160.01794462
ANAPC50.34201954240.07845842
ANAPC70.28340081140.00149308
BUB1B0.26785714140.03080296
CDC160.34768212310.1242791
CDC200.2872777220.07109287
CDC230.28493894160.0116858
CDC260.28532609160.00281479
CDC270.28649386190.01848354
CTNNB10.40540541420.34130418
LAMTOR10.5957446890.03143739
LAMTOR50.8220.77270723
MTUS21141
RCHY10.328125270.24110276
RRAGA0.60869565100.21375661
RRAGC0.5714285780.079806
RRAGD0.4444444480.08906526
UBE2C0.3206106980.02287318
UBE2D10.42424242640.33524711
UBE2D20.43298969690.45291098
UBE2E10.33653846460.04686355
UBE2E30.328125410.05150276
UBE2N0.3030303320.06705781
YWHAQ1391
The line map of GSE25724 shows six genes that are all down-regulated in type 2 diabetes mellitus. Color graphics are available online at www.liebertonline.com/dia. The Six Differentially Expressed Genes and Their 30 Hub Nodes

Conclusions

Overall, we have constructed a PPI network of T2MD and have identified six functional modules that may play important roles in the initiation of early-onset T2MD. The hub nodes of these six modules and the genes ANAPC10, LAMTOR5, RRAGA, RRAGD, UBE2D2, and YWHAQ may serve as biomarkers of early-onset T2MD and have the potential to be targets for therapeutic intervention. However, further studies are still needed in order to confirm our findings.
  38 in total

Review 1.  14-3-3 proteins; bringing new definitions to scaffolding.

Authors:  G Tzivion; Y H Shen; J Zhu
Journal:  Oncogene       Date:  2001-10-01       Impact factor: 9.867

Review 2.  The 14-3-3 proteins: gene, gene expression, and function.

Authors:  Yasuo Takahashi
Journal:  Neurochem Res       Date:  2003-08       Impact factor: 3.996

Review 3.  14-3-3 proteins in the nervous system.

Authors:  Daniela Berg; Carsten Holzmann; Olaf Riess
Journal:  Nat Rev Neurosci       Date:  2003-09       Impact factor: 34.870

Review 4.  14-3-3 proteins--an update.

Authors:  Paulette Mhawech
Journal:  Cell Res       Date:  2005-04       Impact factor: 25.617

5.  DISC1 regulates the transport of the NUDEL/LIS1/14-3-3epsilon complex through kinesin-1.

Authors:  Shinichiro Taya; Tomoyasu Shinoda; Daisuke Tsuboi; Junko Asaki; Kumiko Nagai; Takao Hikita; Setsuko Kuroda; Keisuke Kuroda; Mariko Shimizu; Shinji Hirotsune; Akihiro Iwamatsu; Kozo Kaibuchi
Journal:  J Neurosci       Date:  2007-01-03       Impact factor: 6.167

Review 6.  Type 2 diabetes as an inflammatory disease.

Authors:  Marc Y Donath; Steven E Shoelson
Journal:  Nat Rev Immunol       Date:  2011-01-14       Impact factor: 53.106

7.  Degradation of the proto-oncogene product c-Fos by the ubiquitin proteolytic system in vivo and in vitro: identification and characterization of the conjugating enzymes.

Authors:  I Stancovski; H Gonen; A Orian; A L Schwartz; A Ciechanover
Journal:  Mol Cell Biol       Date:  1995-12       Impact factor: 4.272

8.  Identification of human APC10/Doc1 as a subunit of anaphase promoting complex.

Authors:  Y Kurasawa; K Todokoro
Journal:  Oncogene       Date:  1999-09-16       Impact factor: 9.867

9.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

Review 10.  National, regional, and global trends in fasting plasma glucose and diabetes prevalence since 1980: systematic analysis of health examination surveys and epidemiological studies with 370 country-years and 2·7 million participants.

Authors:  Goodarz Danaei; Mariel M Finucane; Yuan Lu; Gitanjali M Singh; Melanie J Cowan; Christopher J Paciorek; John K Lin; Farshad Farzadfar; Young-Ho Khang; Gretchen A Stevens; Mayuree Rao; Mohammed K Ali; Leanne M Riley; Carolyn A Robinson; Majid Ezzati
Journal:  Lancet       Date:  2011-06-24       Impact factor: 79.321

View more
  4 in total

1.  A cluster of blood-based protein biomarkers reflecting coagulation relates to the burden of cerebral small vessel disease.

Authors:  Sanne Kuipers; L Malin Overmars; Bram van Es; Jeroen de Bresser; Esther E Bron; Imo E Hoefer; L Jaap Kappelle; Charlotte E Teunissen; Geert Jan Biessels; Saskia Haitjema
Journal:  J Cereb Blood Flow Metab       Date:  2022-01-27       Impact factor: 6.960

2.  An improved DBSCAN algorithm based on cell-like P systems with promoters and inhibitors.

Authors:  Yuzhen Zhao; Xiyu Liu; Xiufeng Li
Journal:  PLoS One       Date:  2018-12-17       Impact factor: 3.240

3.  Genome-wide profiling of mRNA and lncRNA expression in dengue fever and dengue hemorrhagic fever.

Authors:  Xiao-Lan Zhong; Xiao-Ming Liao; Fei Shen; Hai-Jian Yu; Wen-Sheng Yan; Yun-Fang Zhang; Jia-Jun Ye; Zhi-Ping Lv
Journal:  FEBS Open Bio       Date:  2019-02-02       Impact factor: 2.693

4.  A systems biology-driven approach to construct a comprehensive protein interaction network of influenza A virus with its host.

Authors:  Qurat Ul Ain Farooq; Zeeshan Shaukat; Sara Aiman; Tong Zhou; Chunhua Li
Journal:  BMC Infect Dis       Date:  2020-07-06       Impact factor: 3.090

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.