Literature DB >> 30474315

Gene coexpression network analysis identified potential biomarkers in gestational diabetes mellitus progression.

Xiaomin Zhao1, Wen Li1.   

Abstract

BACKGROUND: Gestational diabetes mellitus (GDM) is one of the most common problems during pregnancy. Lack of international consistent diagnostic procedures has limit improvement of current therapeutic effectiveness. Here, we aimed to screen potential gene biomarkers that might play vital roles in GDM progression for assistance of its diagnostic and treatment.
METHODS: Gene expression profiles in four GDM placentae at first trimester, four GDM placentae at second trimester, and four normal placentae were obtained from the publicly available Gene Expression Omnibus (GEO). Weighted gene coexpression network analysis (WGCNA) indicated two gene modules, that is, black and brown module, that was significantly positively and negatively correlated with GDM progression time points, respectively. Additionally, a significant positive correlation between module membership (MM) and degree in protein-protein interaction network of brown module genes was observed.
RESULTS: KIF2C, CENPE, CCNA2, AURKB, MAD2L1, CCNB2, CDC20, PLK1, CCNB1, and CDK1 all have degree larger than 50 and MM larger than 0.9, so they might be valuable biomarkers in GDM. Gene set enrichment analysis inferred tight relations between carbohydrate metabolism or steroid biosynthesis-related processes and GDM progression.
CONCLUSIONS: All in all, our study should provide several novel references for GDM diagnosis and therapeutic.
© 2018 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.

Entities:  

Keywords:  GDM; GEO; GSEA; WGCNA; gene module

Mesh:

Substances:

Year:  2018        PMID: 30474315      PMCID: PMC6382444          DOI: 10.1002/mgg3.515

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


INTRODUCTION

Gestational diabetes mellitus (GDM) represents the most common pathoglycemia form in pregnant women that might also cause hypertension and cardiovascular disease (Oliveira et al., 2015). GDM's morbidity is about 2%–5% across worldwide (Ashwal & Hod, 2015), and which is deeply affected by pathological and environment factors. For example, dangerousness of GDM could be increased up to 3.8 times in obese pregnant women than those with normal body mass index (Pantham & Aye, 2015). Previous study even reported that social capital, such as neighborhood trust, emotional support, has obvious influences on prevalence of GDM (Mizuno et al., 2016). What is more, GDM could increase incidence of type 2 diabetes mellitus and obesity in not only women, but also their offsprings (Catalano, 2010; Coustan, 2013; Harreiter, Dovjak, & Kautzky‐Willer, 2014; Nikolic et al., 2016). Gene expression profiling has been an important mean for exploring disease mechanism and identifying valuable diagnostic and treatment biomarkers particularly after the development of gene microarray and high‐throughput sequencing technologies (Janikova et al., 2011; Kanda et al., 2015; Vu et al., 2015). As a polygenic disease, GDM progression was generally considered to be promoted by aberrant expression of multiple genes in a gestational age‐dependent manner (Uuskula et al., 2012). Insulin has been a key agent with effective treatment results for GDM mainly through regulating cholesterol transport in human placenta (Dube & Ethier‐Chiasson, 2013). Differences exist in placenta surfaces that exposed to bloodstreams of mother and fetus which corresponding to trophoblasts and endothelial cells, respectively, and could yield large amount of insulin receptor. Through interacting with those receptors, insulin could strikingly control placental gene expression shifts from mother to fetus over the time course of pregnancy, which might shed new light on exploring therapeutic targets for GDM (Hiden et al., 2006). It was also previously reported that reduced trophoblast apoptosis along with elevated inflammation could significantly result in aberrant expression of associated placental transcripts or proteins followed by GDM‐related increased placenta and newborn weights (Magee et al., 2014). Genetic variation including single nucleotide polymorphisms (SNPs) and structure variation makes up the most common variations across the human genome, and it has been applied for multiple disease diagnostics and treatments. Vitamin D is closely associated with β‐cell function and impaired glucose absorption in GDM through transportation to placenta by vitamin D‐binding protein, that is, vitamin D receptor, and SNPs in which have been previously reported to be correlated with GDM clinical parameters (Wang, Wang, et al., 2015). Several previous studies even developed SNP‐based risk score for GDM prediction. For example, through including type 2 diabetes‐related risk variants, Kawai et al. (2017) developed a risk score formula that could stably predict GDM risk; Chawla et al. (2014) proposed a model comprised of 48 SNPs along with several common clinicopathological features including ancestry, sex, gestational age, and so on, that could improve prediction of large‐for‐gestational‐age and newborn adiposity. Although the recent advancement of understanding about GDM mechanisms, the efficacy of conventional therapy is still poor and further studies for identifying valuable biomarkers are still needed. In this study, we conducted weighted gene coexpression network analysis (WGCNA) for time series gene expression profiles in GDM samples at first trimester and second trimester as well as control samples. Compared with traditional methodologies that take every transcript in the microarray alone and only capture two few information than that the microarray could provide, WGCNA takes correlations among those transcripts into account and identified potential disease‐related gene coexpression modules (GCMs) by considering associations between GCMs and disease's traits as well as intramodular associations. We identified several gene modules that closely associated with GDM progression and screened potential biomarkers via combination with protein–protein interaction (PPI) network. Additionally, carbohydrate metabolism or steroid biosynthesis‐related pathways were obtained through gene set enrichment analysis (GSEA), which has been previously reported to associate with GDM development.

MATERIALS AND METHODS

Gene expression profile dataset

We searched the Gene Expression Omnibus (GEO) with the keywords of “(gestational diabetes mellitus)” AND “Homo sapiens[porgn:__txid9606]” and only retained gene expression datasets that profiled by using Affymetrix HG‐U133 Puls2.0 platform (ref no.: GPL570). As a result, we obtained one dataset that deposited by Mikheev et al. (2008) with the accession number of GSE9984, which consisted of placenta expression profiles of four GDM patients at first trimester (45–59 days), four GDM patients at second trimester and four control samples.

Gene expression preprocessing

R and Bioconductor packages were applied for preprocessing raw gene expression profiles. Background correction, normalization, and logarithm transformation were conducted by using the affy package (Gautier, Cope, Bolstad, & Irizarry, 2004). Probes were annotated as gene symbols based on the GPL570 annotation file, and average expression value was used for genes annotated by multiple probes.

Weighted gene coexpression network analysis

Gene coexpression analysis is a powerful mean for exploring correlations among genes at specific conditions, particularly in time series analysis. Weighted gene coexpression network analysis (WGCNA) assigns a connection weight to each gene pair in the coexpression network and uses soft thresholds that should be more biologically meaningful compares with traditional methods that use binary information (0 = unconnected, 1 = connected) (Zhang & Horvath, 2005). WGCNA package is a collection of R functions for soft power selection, GCM detection, identification of module eigengene (first principle component of the module), assessment of associations between modules and clinical traits, and so on (Langfelder, 2008). Here, we screened GCM based on gene expression profiles of the 12 samples and assessed associations between modules and GDM by using pregnancy age as clinical variable.

Protein–protein interaction analysis

Protein–protein interactions among genes were obtained from the STRING database (Szklarczyk et al., 2015), containing interactions that with reliability scores according to means by which they were presented, such as high‐throughput, bioinformatics prediction, or low‐throughput methods. Here, we used reliability score >0.9 as the threshold for screening of valuable gene pairs. Cytoscape version3.6.0 (Su, Morris, Demchak, & Bader, 2014) was applied for visualizing PPI network.

Gene set enrichment analysis

Gene set enrichment analysis was performed by using the GSEA software (Subramanian, Kuehn, Gould, Tamayo, & Mesirov, 2007). “C2.CP.V6.0.ENTREZ.gmt” was used as the gene set for the analysis. Permutation number was set to 1,000, and p value <0.05 was considered as statistically significant.

RESULTS

Gene coexpression modules

Coefficient of variation (CV) of every gene between GDM samples at first‐trimester or second‐trimester pregnancy age and control samples was calculated, and expression profiles of overlapping genes between the top 5,000 genes with largest CV in GDM samples at first and second trimester were used for identification of valuable GCMs. As a result, 3,731 genes were screened and sample clustering based on those genes’ Euclidean distance separated samples into two distinct clusters that contained control and GDM samples, respectively, as shown in Figure 1a.
Figure 1

Sample and gene clustering analysis based on the 3,731 gene expression profiles. (a) Sample clustering identified two main clusters that containing gestational diabetes mellitus and control samples, respectively. (b) Gene coexpression modules (GCMs) obtained through weighted gene coexpression network analysis and GCM clusters based on their eigengenes. The red line represents the height of 0.2. (c) Visualization of GCMs before and after merging closer GCMs according to the height of 0.2. Colors indicate GCMs, and leaves represent genes

Sample and gene clustering analysis based on the 3,731 gene expression profiles. (a) Sample clustering identified two main clusters that containing gestational diabetes mellitus and control samples, respectively. (b) Gene coexpression modules (GCMs) obtained through weighted gene coexpression network analysis and GCM clusters based on their eigengenes. The red line represents the height of 0.2. (c) Visualization of GCMs before and after merging closer GCMs according to the height of 0.2. Colors indicate GCMs, and leaves represent genes A total of 33 GCMs were identified based on the soft threshold of 11. Module clustering analysis was further performed based on correlations among eigengenes of the 33 GCMs, which produced seven merged GCMs according to height of 0.2 (red line in Figure 1b). Figure 1c illustrated the GCMs before and after merging closer GCMs.

Module‐trait correlation analysis

To screen candidate GCMs that might contribute GDM progression, we used GDM patients’ pregnancy age as clinical variable and estimated their correlations with GCMs’ eigengene. As a result, brown and black modules were closely correlated with pregnancy age of GDM patients by using correlation p value ≤0.01 as threshold (Figure 2a). Additionally, we calculated correlation between every gene's module membership (MM, correlation between a specific gene and module's eigengene) and gene significance (GS, correlation between a specific gene and clinical variable) in brown and black modules. As expected, significant positive correlations between gene's MM and GS were obtained in black as well as brown module as shown in Figure 2b,c, respectively.
Figure 2

Gene coexpression module (GCM)‐gestational diabetes mellitus (GDM) progression correlation analysis. (a) A heatmap visualization of correlation between GCM and GDM patients’ pregnancy age. Numbers outside and inside brackets represent correlation coefficients and p value, respectively. (b,c) was the correlation plot of GS versus module membership for gene contained in black and brown module, respectively

Gene coexpression module (GCM)‐gestational diabetes mellitus (GDM) progression correlation analysis. (a) A heatmap visualization of correlation between GCM and GDM patients’ pregnancy age. Numbers outside and inside brackets represent correlation coefficients and p value, respectively. (b,c) was the correlation plot of GS versus module membership for gene contained in black and brown module, respectively

Protein–protein interaction network

Identification of GCMs was purely based on mathematical correlations among genes in specific module, so we further explored potential biological associations among genes in brown and black modules. As a result, a total of 842 and 2,654 gene pairs were, respectively, obtained for genes contained in black and brown module. Figure 3a,b illustrate PPI network that comprised of pairs among genes in black and brown module, respectively. Additionally, degree of brown module genes (direct neighborhood in PPI network) was significantly positively correlated with their MM (Figure 3c), which might indicate closer relation between brown module and GDM progression than that of black module. Table 1 shows the genes that with PPI degree larger than 50 and their MM.
Figure 3

Protein–protein interaction (PPI) network analysis of genes contained in black and brown modules. (a) PPI network of genes contained in the black module. (b) PPI network of genes contained in the brown module. (c) Scatter plot indicated the positive correlation between network degree and module membership of genes contained in the brown module

Table 1

Module membership (MM) and degree of genes that with degree larger than 10 in protein–protein interaction network

GeneMMDegreeGeneMMDegree
CDK10.90102AURKB0.9154
CCNB10.9076CCNA20.9553
PLK10.9269CENPE0.9052
CCNB20.9662PAFAH1B10.8652
CDC200.9562KIF2C0.9051
MAD2L10.9156
Protein–protein interaction (PPI) network analysis of genes contained in black and brown modules. (a) PPI network of genes contained in the black module. (b) PPI network of genes contained in the brown module. (c) Scatter plot indicated the positive correlation between network degree and module membership of genes contained in the brown module Module membership (MM) and degree of genes that with degree larger than 10 in protein–protein interaction network We divided samples into first trimester and control group or second trimester and control group and subjected them to GSEA. Figure 4a,b illustrated the full list of KEGG pathways and top five KEGG pathways that significantly up‐regulated in first‐trimester and second‐trimester GDM samples compared with control samples, respectively. Table 2 is the full list of KEGG pathways significantly up‐regulated in second‐trimester GDM samples. Lysine degradation, carbon pool by folate, and steroid biosynthesis pathways were found to be significantly up‐regulated in both first‐trimester and second‐trimester GDM samples. Strikingly, cancer‐related pathways, such as colorectal cancer, cell cycle, were also significantly up‐regulated in second‐trimester GDM samples.
Figure 4

Gene set enrichment analysis. (a) The full list of significantly up‐regulated KEGG pathways in first‐trimester gestational diabetes mellitus (GDM) samples compared with normal samples. (b) The top five most significantly up‐regulated KEGG pathways in second‐trimester GDM samples compared with normal samples

Table 2

Full list of KEGG pathways that significantly up‐regulated in second‐trimester gestational diabetes mellitus samples

Pathway nameESNom p value
KEGG_BIOSYNTHESIS_OF_UNSATURATED_FATTY_ACIDS0.6770
KEGG_ONE_CARBON_POOL_BY_FOLATE0.7270
KEGG_PORPHYRIN_AND_CHLOROPHYLL_METABOLISM0.5390
KEGG_COLORECTAL_CANCER0.4040
KEGG_STEROID_BIOSYNTHESIS0.7380.0192
KEGG_LYSINE_DEGRADATION0.5560.0195
KEGG_GLYCEROLIPID_METABOLISM0.4170.0200
KEGG_CELL_CYCLE0.5770.0303
KEGG_AMINOACYL_TRNA_BIOSYNTHESIS0.5670.0306
KEGG_VIBRIO_CHOLERAE_INFECTION0.4120.0446
KEGG_RNA_POLYMERASE0.4600.0497

ES: enrichment score; Nom p value: normalized p value.

Gene set enrichment analysis. (a) The full list of significantly up‐regulated KEGG pathways in first‐trimester gestational diabetes mellitus (GDM) samples compared with normal samples. (b) The top five most significantly up‐regulated KEGG pathways in second‐trimester GDM samples compared with normal samples Full list of KEGG pathways that significantly up‐regulated in second‐trimester gestational diabetes mellitus samples ES: enrichment score; Nom p value: normalized p value.

DISCUSSION

Gestational diabetes mellitus represents the most prevalence form of pathoglycemia in pregnancy that deeply affects the life of mothers as well as their offspring. In this study, we identified some potential GDM‐related genes by analyzing GDM time series gene expression profiles through WGCNA along with PPI network analysis. GSEA strikingly obtained some cancer‐related pathways in addition to several well‐known GDM‐associated pathways which were significantly up‐regulated in GDM samples compared with control samples. This study should shed some new light on the understanding of GDM mechanisms and its diagnosis or treatment. Abnormal insulin secretion and metabolism contribute greatly to GDM initiation and progression. Here, lysine degradation, carbon pool by folate, and steroid biosynthesis pathways were significantly up‐regulated in first as well as second‐trimester GDM samples compared with control samples. Lysine represents a major component of histone, and modifications such as acetylation, methylation, and so on in it play vital roles in major cellular functions, for example, posttranscriptional proteins’ modification. In addition, lysine acetylation was also previously reported to affect both immunological and metabolic pathways, which could then induce type II diabetes and cardiovascular disease (Iyer & Fairlie, 2012; Kosanam et al., 2014). Nε‐(carboxymethyl) lysine‐conjugated bovine serum albumin is an essential component of advanced glycation end products which could damage mitochondrial functions and result in reducing insulin secretion followed by the incidence of diabetes (Lo et al., 2015). Diabetes could control many aspects of endocrine, including steroidogenesis, whose perturbation could in turn induce the initiation of diabetes (Hwang, 2014; Jangir, 2014). In addition to some well‐known pathways related to GDM, we strikingly identified the up‐regulation of several cancer‐related pathways in second‐trimester GDM samples, such as colorectal cancer, cell cycle. It has been widely reported that diabetes could result in elevated colorectal cancer risk (Jarvandi & Davidson, 2013; Wu et al., 2013), which was also confirmed in a rat model (Jia et al., 2014). MM and degree are important for evaluating genes’ associations with specific trait in gene coexpression and PPI network, respectively. In this study, several genes have both high MM in brown module and high degree in PPI network, which might serve as important diagnosis or treatment biomarkers for GDM. CDK1 and CCNB1 are the top two genes with larger degree in PPI network, and CDK1 and CCNB1 interaction was previously proved to coordinates mitochondrial respiration and affect G2/M cell cycle progression (Wang et al., 2014). Cell cycle perturbation is closely associated with multi diseases’ progression including diabetes (Saavedra‐Avila et al., 2014; Wang, Fiaschi‐Taesch, et al., 2015). Besides, aberrant expression of CDK1 and CCNB1 in diabetes patients were also proved by previous studies (Page, Morris, Williams, Ruhland, & Malik, 1997; Su et al., 2015). CCNB2’s MM is the largest one compared with the other nine genes in Table 1. In Zhang's study (2017), CCNB2 was found to be significantly up‐regulated in diabetes mice compared with normal mice and was also the core gene in PPI network of DEGs. Although no previous study about direct association between some of those genes with GDM progression, lots of studies have proved associations between those genes with the well‐known GDM‐associated biological processes, such as CDC20 with metabolism (Martin, Mebarki, Paradis, Friguet, & Radman, 2014), CENPE with insulin absorption (Zhu, Ai, Wang, Xu, & Teng, 2012), and so on. So, those genes might be potential biomarkers for GDM.

CONCLUSIONS

In conclusion, we identified several potential biomarkers for GDM through WGCNA and PPI network analysis. GSEA identified some cancer‐related pathways in addition to several well‐known GDM‐related pathways, which might provide novel clues for GDM experimental research and clinical treatment.

CONFLICT OF INTEREST

I declare that there is no conflict between authors.
  42 in total

1.  affy--analysis of Affymetrix GeneChip data at the probe level.

Authors:  Laurent Gautier; Leslie Cope; Benjamin M Bolstad; Rafael A Irizarry
Journal:  Bioinformatics       Date:  2004-02-12       Impact factor: 6.937

Review 2.  Gestational diabetes mellitus and cardiovascular risk after pregnancy.

Authors:  Jürgen Harreiter; Gregor Dovjak; Alexandra Kautzky-Willer
Journal:  Womens Health (Lond)       Date:  2014-01

3.  A genetic risk score that includes common type 2 diabetes risk variants is associated with gestational diabetes.

Authors:  V K Kawai; R T Levinson; A Adefurin; D Kurnik; S P Collier; D Conway; C M Stein
Journal:  Clin Endocrinol (Oxf)       Date:  2017-05-26       Impact factor: 3.478

4.  Hepatocellular carcinoma protein carbonylation in virus C and metabolic syndrome patients.

Authors:  Fernando Ariel Martin; Mouniya Mebarki; Valérie Paradis; Bertrand Friguet; Miroslav Radman
Journal:  Free Radic Biol Med       Date:  2014-12-10       Impact factor: 7.376

5.  Diabetes induces lysine acetylation of intermediary metabolism enzymes in the kidney.

Authors:  Hari Kosanam; Kerri Thai; Yanling Zhang; Andrew Advani; Kim A Connelly; Eleftherios P Diamandis; Richard E Gilbert
Journal:  Diabetes       Date:  2014-03-27       Impact factor: 9.461

Review 6.  Lysine acetylation in obesity, diabetes and metabolic disease.

Authors:  Abishek Iyer; David P Fairlie; Lindsay Brown
Journal:  Immunol Cell Biol       Date:  2011-11-15       Impact factor: 5.126

7.  Isolation of diabetes-associated kidney genes using differential display.

Authors:  R Page; C Morris; J Williams; C von Ruhland; A N Malik
Journal:  Biochem Biophys Res Commun       Date:  1997-03-06       Impact factor: 3.575

Review 8.  Incretins, Pregnancy, and Gestational Diabetes.

Authors:  Dragana Nikolic; Khalid Al-Rasadi; Noor Al Busaidi; Khalid Al-Waili; Yajnavalka Banerjee; Khamis Al-Hashmi; Giuseppe Montalto; Ali A Rizvi; Manfredi Rizzo; Tamima Al-Dughaishi
Journal:  Curr Pharm Biotechnol       Date:  2016       Impact factor: 2.837

9.  Biological network exploration with Cytoscape 3.

Authors:  Gang Su; John H Morris; Barry Demchak; Gary D Bader
Journal:  Curr Protoc Bioinformatics       Date:  2014-09-08

10.  Transcriptional Profile of Kidney from Type 2 Diabetic db/db Mice.

Authors:  Haojun Zhang; Tingting Zhao; Zhiguo Li; Meihua Yan; Hailing Zhao; Bin Zhu; Ping Li
Journal:  J Diabetes Res       Date:  2017-01-23       Impact factor: 4.011

View more
  7 in total

1.  Placental gene network modules are associated with maternal stress during pregnancy and infant temperament.

Authors:  Vasily N Aushev; Qian Li; Maya Deyssenroth; Wei Zhang; Jackie Finik; Yasmin L Hurd; Yoko Nomura; Jia Chen
Journal:  FASEB J       Date:  2021-10       Impact factor: 5.191

2.  Identification of novel functional CpG-SNPs associated with type 2 diabetes and coronary artery disease.

Authors:  Zun Wang; Chuan Qiu; Xu Lin; Lan-Juan Zhao; Yong Liu; Xinrui Wu; Qian Wang; Wei Liu; Kelvin Li; Hong-Wen Deng; Si-Yuan Tang; Hui Shen
Journal:  Mol Genet Genomics       Date:  2020-03-11       Impact factor: 3.291

3.  Gene coexpression network analysis identified potential biomarkers in gestational diabetes mellitus progression.

Authors:  Xiaomin Zhao; Wen Li
Journal:  Mol Genet Genomic Med       Date:  2018-11-25       Impact factor: 2.183

4.  Identification of Potential Biomarkers of Type 2 Diabetes Mellitus-Related Immune Infiltration Using Weighted Gene Coexpression Network Analysis.

Authors:  Ji Zhou; Xiaoyi Zhang; Lixia Ji; Guohui Jiang
Journal:  Biomed Res Int       Date:  2022-02-09       Impact factor: 3.411

5.  Effects of Shenkang Pills on Early-Stage Diabetic Nephropathy in db/db Mice via Inhibiting AURKB/RacGAP1/RhoA Signaling Pathway.

Authors:  Fujing Wang; Jia'er Fan; Tingting Pei; Zhuo'en He; Jiaxing Zhang; Liliang Ju; Zhongxiao Han; Mingqing Wang; Wei Xiao
Journal:  Front Pharmacol       Date:  2022-02-11       Impact factor: 5.810

6.  COBLL1 and IRS1 Gene Polymorphisms and Placental Expression in Women with Gestational Diabetes.

Authors:  Przemyslaw Ustianowski; Damian Malinowski; Michał Czerewaty; Krzysztof Safranow; Maciej Tarnowski; Violetta Dziedziejko; Andrzej Pawlik
Journal:  Biomedicines       Date:  2022-08-09

7.  Integrative Analysis of Angiogenesis-Related Long Non-Coding RNA and Identification of a Six-DEARlncRNA Signature Associated with Prognosis and Therapeutic Response in Esophageal Squamous Cell Carcinoma.

Authors:  Shasha Cao; Xiaomin Wang; Xiaohui Liu; Junkuo Li; Lijuan Duan; Zhaowei Gao; Shumin Lun; Yanju Zhu; Haijun Yang; Hao Zhang; Fuyou Zhou
Journal:  Cancers (Basel)       Date:  2022-08-30       Impact factor: 6.575

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.