| Literature DB >> 28117714 |
Hao Mei1,2, Lianna Li3, Shijian Liu4, Fan Jiang5, Michael Griswold6, Thomas Mosley7.
Abstract
We performed expression studies to identify tissue non-specific genes and pathways of diabetes by meta-analysis. We searched curated datasets of the Gene Expression Omnibus (GEO) database and identified 13 and five expression studies of diabetes and insulin responses at various tissues, respectively. We tested differential gene expression by empirical Bayes-based linear method and investigated gene set expression association by knowledge-based enrichment analysis. Meta-analysis by different methods was applied to identify tissue non-specific genes and gene sets. We also proposed pathway mapping analysis to infer functions of the identified gene sets, and correlation and independent analysis to evaluate expression association profile of genes and gene sets between studies and tissues. Our analysis showed that PGRMC1 and HADH genes were significant over diabetes studies, while IRS1 and MPST genes were significant over insulin response studies, and joint analysis showed that HADH and MPST genes were significant over all combined data sets. The pathway analysis identified six significant gene sets over all studies. The KEGG pathway mapping indicated that the significant gene sets are related to diabetes pathogenesis. The results also presented that 12.8% and 59.0% pairwise studies had significantly correlated expression association for genes and gene sets, respectively; moreover, 12.8% pairwise studies had independent expression association for genes, but no studies were observed significantly different for expression association of gene sets. Our analysis indicated that there are both tissue specific and non-specific genes and pathways associated with diabetes pathogenesis. Compared to the gene expression, pathway association tends to be tissue non-specific, and a common pathway influencing diabetes development is activated through different genes at different tissues.Entities:
Keywords: diabetes; gene expression; pathway
Year: 2017 PMID: 28117714 PMCID: PMC5295038 DOI: 10.3390/genes8010044
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Characteristics of the gene expression studies.
| Study | GDS_ID | GPL_ID | Pub_ID | N_genes | Size | Contrast | Tissue |
|---|---|---|---|---|---|---|---|
| 1 | GDS3665 | GPL2986 | 16,075 | 10 | T2D vs. control | adipose | |
| 2 | GDS3980 | GPL571 | 21926180 [ | 12,778 | 21 | T2D vs. control | artery |
| 3 | GDS3874 | GPL96 | 17595242 [ | 18,552 | 117 | T1D vs. healthy and T2D vs. healthy | blood |
| GDS3875 | GPL97 | ||||||
| 4 | GDS3963 | GPL6883 | 21829658 [ | 17,476 | 24 | T2D vs. impaired fasting glucose vs. control | blood |
| 5 | GDS3656 | GPL2700 | 19706161 [ | 16,778 | 32 | T1D vs. Healthy | EPC |
| 6 | GDS3876 | GPL96 | 19549744 [ | 12,779 | 18 | obese T2D vs. obese no T2D | liver |
| 7 | GDS3883 | GPL570 | 21035759 [ | 20,539 | 17 | T2D vs. normal glucose tolerance | liver |
| 8 | GDS3681 | GPL8300 | 18719883 [ | 8861 | 20 | T2D vs. control | myotube |
| 9 | GDS3782 | GPL1352 | 20644627 [ | 20,185 | 20 | T2D vs. control | pancreas |
| 10 | GDS3882 | GPL96 | 21127054 [ | 12,779 | 13 | T2D vs. non-diabetes | pancreas |
| 11 | GDS4337 | GPL6244 | 22768844 [ | 17,323 | 63 | T2D vs. non-diabetes | pancreas |
| 12 | GDS3880 | GPL570 | 22802091 [ | 20,539 | 42 | T2D vs. pre-diabetes vs. normoglycemic control | skeletal muscle |
| 13 | GDS3884 | GPL570 | 21393865 [ | 20,539 | 50 | T2D vs. Normoglycemia with FH+ vs. Normoglycemia with FH− | skeletal muscle |
| 1 | GDS157 | GPL80 | 12436343 [ | 13,742 | 10 | insulin resistant vs. insulin sensitive | skeletal muscle |
| GDS158 | GPL98 | ||||||
| GDS160 | GPL99 | ||||||
| GDS161 | GPL100 | ||||||
| GDS162 | GPL101 | ||||||
| 2 | GDS2790 | GPL80 | 17472435 [ | 12,885 | 12 | Before vs. after Hyperinsulinemic-euglycemic clamp for nondiabetes | skeletal muscle |
| GDS2791 | GPL96 | ||||||
| 3 | GDS3181 | GPL96 | 18334611 [ | 12,779 | 36 | −60 vs. 30 vs. 240 min of Hyperinsulinemic-euglycemic clamp for nondiabetes | skeletal muscle |
| 4 | GDS3715 | GPL91 | 17709892 [ | 8768 | 110 | Diabetes vs. insulin sensitive vs. insulin resistant before and after Hyperinsulinemic-euglycemic clamp | skeletal muscle |
| 5 | GDS3781 | GPL570 | 20678967 [ | 20,539 | 39; 19 | insulin sensitive vs. insulin resistant | adipose |
| GDS3962 | |||||||
GDS_ID: the GDS ID of the expression dataset; GPL_ID: ID of the platform for generating the expression dataset; PUB_ID: the publication ID at the PubMed database; N_genes: the number of genes measured for expression level; Size: the number of samples at the study; Contrast: it presented the test of differential gene expression between two or more phenotypes by the regression method; EPC: endothelial progenitor cells; FH+: family history of diabetes; FH-: no family history of diabetes; T1D: type I diabetes; T2D: type II diabetes.
Differential gene expression and meta-analysis.
| PGRMC1 | HADH | IRS1 | MPST | ||
|---|---|---|---|---|---|
| 1 | GDS3665 | 9.17 | |||
| 2 | GDS3980 | 31.31 | 14.46 | 20.64 | |
| 3 | GDS3874/GDS3875 | 8.55 | 24.46 | 97.48 | |
| 4 | GDS3963 | 45.03 | 48.6 | ||
| 5 | GDS3656 | 68.95 | 89.07 | ||
| 6 | GDS3876 | 82.07 | 89.31 | 35.61 | |
| 7 | GDS3883 | 97.9 | 69.24 | 15.34 | |
| 8 | GDS3681 | 88.77 | 81.4 | 27.87 | |
| 9 | GDS3782 | 7.43 | 81 | 79.18 | |
| 10 | GDS3882 | 66.82 | 95.75 | 47.81 | |
| 11 | GDS4337 | 7.54 | 37.53 | ||
| 12 | GDS3880 | 95.17 | 50.5 | 41.35 | 6.55 |
| 13 | GDS3884 | 96.25 | 48.08 | 54.24 | |
| 1.03E−6 | 1.03E−6 | 0.14 | 2.87E−4 | ||
| 1 | GDS157/GDS158/GDS160/GDS161/GDS162 | 7.48 | NA | ||
| 2 | GDS2790/GDS2791 | 9.59 | 10.14 | ||
| 3 | GDS3181 | 55.03 | 16.78 | ||
| 4 | GDS3715 | 30.9 | 82.89 | 25.76 | |
| 5 | GDS3781/GDS3962 | 48.65 | 23.59 | ||
| 0.23 | 0.01 | 3.13E−7 | 3.13E−7 | ||
| 6.31E−7 | 6.28E−8 | ||||
The bold italic font indicates significantly differential gene expression (i.e., U-score ≤ 5%).
Meta-analysis p-values of significant gene sets.
| PID | GeneSet | |||
|---|---|---|---|---|
| Diabetes Studies | ||||
| 5599 | UV response | 7.72E−17 | 4.01E−08 | 3.10E−03 |
| 4914 | chronic myelogenous leukemia | 1.45E−38 | 1.16E−09 | 1.97E−05 |
| 7922 | KLF1 targets | 3.35E−26 | 3.47E−13 | 3.10E−03 |
| 5947 | SMARCA2 targets | 1.95E−25 | 4.01E−08 | 3.10E−03 |
| 6442 | Alzheimer’s disease | 1.65E−19 | 4.01E−08 | 2.87E−04 |
| 7145 | stromal stem cells | 1.88E−15 | 1.16E−09 | 2.87E−04 |
| Insulin Response Studies | ||||
| 5599 | UV response | 1.48E−18 | 3.00E−05 | 0.023 |
| 4914 | chronic myelogenous leukemia | 4.27E−08 | 3.00E−05 | 0.023 |
| 7922 | KLF1 targets | 4.55E−05 | 3.00E−05 | 0.023 |
| 5947 | SMARCA2 targets | 1.12E−04 | 0.023 | 0.023 |
| 6442 | Alzheimer’s disease | 0.042 | 1.16E−03 | 0.23 |
| 7145 | stromal stem cells | 2.14E−08 | 1.11E−03 | 0.023 |
| Joint Analysis | ||||
| 5599 | UV response | 9.46E−32 | 1.12E−10 | 1.55E−03 |
| 4914 | chronic myelogenous leukemia | 3.91E−44 | 3.41E−12 | 1.52E−05 |
| 7922 | KLF1 targets | 4.72E−29 | 1.54E−15 | 1.55E−03 |
| 5947 | SMARCA2 targets | 1.11E−27 | 6.28E−08 | 1.55E−03 |
| 6442 | Alzheimer’s disease | 1.84E−18 | 2.95E−09 | 1.55E−03 |
| 7145 | stromal stem cells | 2.93E−22 | 1.12E−10 | 1.72E−04 |
PID: ID of the significantly identified gene sets; GeneSet: name of the gene sets; Fixed_p: the unadjusted p-value by fixed-effect meta-analysis; Bin_p0: the unadjusted meta-analysis p-value by binomial test; Bin_p1: the adjusted meta-analysis p-value by binomial test.
The closest KEGG Pathway
| Gene Set | KEGG Pathway | Size | Gene | Effect | SE | adj_p | |
|---|---|---|---|---|---|---|---|
| UV response | TGF-beta signaling pathway | 86 | 22 | 0.20 | 0.03 | 1.28E−09 | <0.001 |
| chronic myelogenous leukemia | The citrate cycle | 32 | 15 | 0.35 | 0.06 | 1.11E−06 | <0.001 |
| KLF1 targets | DNA replication | 36 | 13 | 0.24 | 0.05 | 1.54E−04 | 0.016 |
| SMARCA2 targets | Nucleotide excision repair | 44 | 6 | 0.11 | 0.02 | 4.29E−04 | 0.034 |
| Alzheimer’s disease | 1. Oxidative phosphorylation | 135 | 62 | 0.35 | 0.03 | 9.20E−27 | <0.001 |
| 2. Parkinson’s disease | 133 | 60 | 0.35 | 0.03 | 2.13E−25 | <0.001 | |
| stromal stem cells | 1. PPAR signaling pathway | 69 | 9 | 0.10 | 0.02 | 3.02E−4 | 0.029 |
| 2. p53 signaling pathway | 69 | 9 | 0.10 | 0.02 | 3.02E−4 | 0.029 |
Size: the number of genes in the KEGG pathway; Gene: the number of overlapped genes between the gene set and the KEGG pathway; Effect: the higher probability for a gene of the gene set that belongs to the KEGG pathway than a random gene; SE: the standard error of the estimated effect; p: the unadjusted p-value; and adj_p: the adjusted p-value by permutation test.
Figure 1Correlation and independent analyses of gene (A) and pathway (B) expression association profiles between studies. The lower triangle is the correlation analysis and the upper triangle is the independent test. The ‘X’ indicates the analysis p-value is not significant.