| Literature DB >> 28473845 |
Helle Krogh Pedersen1, Valborg Gudmundsdottir1, Søren Brunak1,2.
Abstract
Type 2 diabetes (T2D) is a complex disease that involves multiple genes. Numerous risk loci have already been associated with T2D, although many susceptibility genes remain to be identified given heritability estimates. Systems biology approaches hold potential for discovering novel T2D genes by considering their biological context, such as tissue-specific protein interaction partners. Pancreatic islets are a key T2D tissue and many of the known genetic risk variants lead to impaired islet function, hence a better understanding of the islet-specific dysregulation in the disease-state is essential to unveil the full potential of person-specific profiles. Here we identify 3,692 overlapping pancreatic islet protein complexes (containing 10,805 genes) by integrating islet gene and protein expression data with protein interactions. We found 24 of these complexes to be significantly enriched for genes associated with diabetic phenotypes through heterogeneous evidence sources, including genetic variation, methylation, and gene expression in islets. The analysis specifically revealed ten T2D candidate genes with probable roles in islets (ANPEP, HADH, FAM105A, PDLIM4, PDLIM5, MAP2K4, PPP2R5E, SNX13, GNAS, and FRS2), of which the last six are novel in the context of T2D and the data that went into the analysis. Fifteen of the twenty-four complexes were further enriched for combined genetic associations with glycemic traits, exemplifying how perturbation of protein complexes by multiple small effects can give rise to diabetic phenotypes. The complex nature of T2D ultimately prompts an understanding of the individual patients at the network biology level. We present the foundation for such work by exposing a subset of the global interactome that is dysregulated in T2D and consequently provides a good starting point when evaluating an individual's alterations at the genome, transcriptome, or proteome level in relation to T2D in clinical settings.Entities:
Keywords: data integration; diabetes; pancreatic islets; patient network biology; protein complexes; tissue specificity
Year: 2017 PMID: 28473845 PMCID: PMC5397424 DOI: 10.3389/fgene.2017.00043
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Overview of the methodology employed. We first generated an integration scaffold of islet protein complexes in healthy tissue (A) and defined a subset of complexes with coordinated expression in islets (B) that were further benchmarked (C,D). We then identified the subset of islet complexes with potential dysregulation in the T2D state by functional convergence of 13 islet diabetic phenotype gene sets (E), followed by functional annotation and validation (F–H). For comparison, direct convergence of the islet diabetic phenotype gene sets was evaluated (E).
Description of the thirteen islet diabetic phenotype gene sets and the four islet biology related gene sets.
| T2D GWAS/rare variant | Genes in the vicinity of T2D GWAS SNPs, using a boundary of 110 kb upstream and 40 kb downstream of each gene, as well as genes harboring rare variants associated with T2D. | Morris et al., | 235 (162) |
| Glycemic GWAS/rare variant | Genes in the vicinity of GWAS SNPs (FG, BMI-adjusted FG, 2 h Glu, BMI-adjusted 2 h Glu, insulinogenic index, disposition index, proinsulin), using a boundary of 110 kb upstream and 40 kb downstream of each gene, as well as genes harboring rare variants associated with FG, proinsulin, or insulinogenic index. | Strawbridge et al., | 135 (107) |
| Glycemic gene-based | Genes associated with FG, 2 h Glu, or proinsulin using a gene-based analysis. | Scott et al., | 146 (130) |
| OMIM | Genes associated with “Diabetes mellitus, noninsulin-dependent; NIDDM” in the OMIM database (accession | 26 (24) | |
| Monogenic | MODY and other monogenic diabetes genes. | McCarthy, | 28 (28) |
| T2D eQTL | Five cis and 176 trans eQTLs in islets, based on 47 SNPs associated with T2D. | Taneera et al., | 163 (129) |
| T2D methylation (A) | Genes in differentially methylated regions that are also differentially expressed. | Dayeh et al., | 113 (88) |
| T2D methylation (B) | Genes in differentially methylated regions. | Volkmar et al., | 221 (169) |
| Co-expression | Genes that are co-expressed in islets with 2 or more of 48 T2D genes. | Taneera et al., | 231 (197) |
| Hyperglycemia expression | Differentially expressed genes in islets, in hyperglycemic vs. normoglycemic individuals. | Taneera et al., | 121 (109) |
| T2D expression (A) | Differentially expressed genes in islets, in T2D patients vs. controls. | Taneera et al., | 106 (90) |
| T2D expression (B) | Differentially expressed genes in islets, in T2D patients vs. controls. | Dominguez et al., | 174 (150) |
| T2D expression (C) | Differentially expressed genes in beta-cells, in T2D patients vs. controls. | Marselli et al., | 281 (237) |
| Islet specific | Top 30 islet specific genes. | Morán et al., | 33 (28) |
| Open chromatin | Genes with islet-selective (compared to five non-islet cell lines) open chromatin in the transcription start sites or gene-body. | Gaulton et al., | 319 (226) |
| Open chromatin clusters | Genes overlapping | Gaulton et al., | 1,512 (1,340) |
| Islet biology | Sixty-seven genes curated as important for islet cell identity and function, Supplementary Table | Pasquali et al., | 67 (57) |
2 h Glu, 2 hour glucose; BMI, body mass index; FG, fasting glucose; T2D, type 2 diabetes.
Figure 2Direct overlap of the thirteen islet diabetic phenotype gene sets. (A) Overlap in terms of gene counts. (B) Overlap in terms of percent overlap relative to the size of the shortest gene sets. (C) BH-adjusted P-values for testing significance of overlap (hypergeometric test using all 22,766 genes as background), stars are as follows: ***P ≤ 0.001, **P ≤ 0.01, *P ≤ 0.05.
Figure 3The 24 complexes with potential T2D dysregulation are enriched for diverse and relevant functions. Subset of Consensus PathDB-pathways, for which at least one protein complex is enriched with BH-adjusted P < 0.001. The pathways and complexes are clustered with Ward's hierarchical clustering using an asymmetric binary similarity measure.
Plausible novel T2D genes prioritized from the complexes with potential T2D dysregulation.
| Mitogen-activated protein kinase kinase 4 | 0 | 1 | 7.83 × 10−6 (rs929441) | AUCIns/AUCGluc | |
| PDZ and LIM domain 5 | 0 | 1 | 9.87 × 10−5 (rs17021900) | Fasting glucose | |
| Protein phosphatase 2, regulatory subunit B, epsilon isoform | 0 | 1 | 7.05 × 10−5 (rs10151995) | Fasting glucose | |
| Sorting nexin 13 | 0 | 1 | 4.02 × 10−6 (rs2723517) | HbA1c | |
| GNAS complex locus | 0 | 1 | 4.73 × 10−5 (rs6026565) | Fasting glucose, Manning | |
| Fibroblast growth factor receptor substrate 2 | 0 | 1 | 9.76 × 10−6 (rs12425398) | Fasting glucose, Manning |
Genes are prioritized if they, besides being part of a protein complex showing potential T2D dysregulation, are part of at least one of the four islet biology gene sets and harbor at least one SNP with P < 1 × 10.
Figure 4Breakdown of significantly enriched complex-trait combinations. (A) The enrichment of GWAS signals for each of the 30 significant complex-trait combinations when including all genes, excluding input genes, and excluding genes with genome-wide significant association in the given GWAS (see Section Methods for details). The genes in each complex-trait combination are colored based on P-value (i.e., minimum P-value for the SNPs mapping to the respective gene) partitioned into factor-10 groups; (B) actual count and (C) percentage distribution of gene P-values within a complex in the GWAS for the given glycemic trait. (D) Example of complexes.
Figure 5Highlevel grouping of complexes by nature of evidence driving their enrichment. Schematic visualization (top) and representative examples (bottom) for the three overall groups. The fourth theoretical category with few sources but a high percentage of genes supported by multiple sources is excluded here, as we did not observe any good examples. Group A, Complex-2; group B, Complex-14; and group C, Complex-16.