| Literature DB >> 16046815 |
Jonathan D Wren1, Harold R Garner.
Abstract
The etiological origin of type 2 diabetes mellitus (T2DM) has long been controversial. The body of literature related to T2DM is vast and varied in focus, making a broad epidemiological perspective difficult, if not impossible. A data-mining approach was used to analyze all electronically available scientific literature, over 12 million Medline records, for "objects" such as genes, diseases, phenotypes, and chemical compounds linked to other objects within the T2DM literature but were not themselves within the T2DM literature. The goal of this analysis was to conduct a comprehensive survey to identify novel factors implicated in the pathology of T2DM by statistically evaluating mutually shared associations. Surprisingly, epigenetic factors were among the highest statistical scores in this analysis, strongly implicating epigenetic changes within the body as causal factors in the pathogenesis of T2DM. Further analysis implicates adipocytes as the potential tissue of origin, and cytokines or cytokine-like genes as the dysregulated factor(s) responsible for the T2DM phenotype. The analysis provides a wealth of literature supporting this hypothesis, which-if true-represents an important paradigm shift for researchers studying the pathogenesis of T2DM.Entities:
Year: 2005 PMID: 16046815 PMCID: PMC1184044 DOI: 10.1155/JBB.2005.104
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Figure 1Identifying new relationships that are potentially implicit from known relationships. IRIDESCENT's method of analysis begins with a primary object of interest such as T2DM (black node) and then identifies all co-citations with other objects (gray nodes) observed within Medline. IRIDESCENT then examines all these directly related (gray) nodes in turn for their relationships with other objects (white nodes) that are not themselves related to the primary object. Once all of Medline is analyzed and all implicit relationships (white nodes) are identified, all relationships they share (gray nodes) with the primary node are individually scored for their statistical significance of co-occurrence and collectively scored for statistical significance against a random network model.
A nonexhaustive list of SNP studies that have reported finding a “significant” association between one or more SNPs and a cohort of T2DM patients. NCBI's sequence viewer was used to obtain flanking sequence. a denotes PMID = PubMed ID of paper publishing the SNP. b denotes that C allele was also present. c means that no dbSNP entry was found in exact map position as given, but the closest dbSNP entries had the same alleles as published, so these were used instead.
| Gene | Chr | Region | Mutation | Position | CpG? | PMIDa | Sequence | dbSNP |
| IDE | 10q23 | 3′ UTR | multiple | — | — | 12765971 | — | — |
| GFPT2 | 2p13 | 3′ UTR | multiple | — | — | 14764791 | — | — |
| IsI-1 | 5q11.2 | 5′ UTR | A→G | −47 | — | 11978668 | — | — |
| KIR6.2 | 11p15.1 | Coding | E23K | — | — | 12643262 | — | — |
| SUR1 | 11p15.1 | Coding | G→A | 3819 | — | 11030411 | — | — |
| PLA2G4A | 1q25 | Coding | F479L | — | — | 12765847 | — | — |
| PTP-1B | 20q13.1 | Coding | C→T | 981 | — | 11836311 | — | — |
| GFPT2 | 2p13 | Coding | I471V | — | — | 14764791 | — | — |
| PPARG2 | 3p25 | Coding | P12A | — | — | 12829658 | — | — |
| GLUT2 | 3q26.1 | Coding | A→G | 103 | — | 12017192 | — | — |
| PGC-1 | 4p15.1 | Coding | G482S | — | — | 12606537 | — | — |
| NR3C1 | 5q31 | Coding | A→G | 1220 | — | 12864802 | — | — |
| Syntaxin 1A | 7q11.23 | Coding | T→C | 204 | — | 11719842 | — | — |
| PRKCZ | 1p36.33 | Intron | G→A | — | — | 12970910 | — | rs436045 |
| CAPN10 | 2q37.3 | Intron | G→A | — | — | 14730479 | — | SNP43 |
| UCP2 | 11q13 | Promoter | G→A | −866 | Yb | 12915397 | CTGA | rs659366 |
| Resistin | 19p13.3 | Promoter | C→G | −180 | Y | 12829623 | AAGA | rs1862513 |
| CRP | 1q21 | Promoter | C→T | −700 | Y | 12618085 | AACA | SNP133552 |
| PTGS2 | 1q25.2 | Promoter | C→G | −766 | Y | 12920574 | TCCC | rs20417 |
| PCK1 | 20q13.31 | Promoter | C→G | −232 | Yc | 14764811 | CAAC | rs6025628 |
| GLUT2 | 3q26.2 | Promoter | A→C | −269 | N | 12017192 | AATC | None |
| APM1 | 3q27 | Promoter | A→G | −11426 | Yc | 14749263 | TCTC | rs12631446 |
| APM1 | 3q27 | Promoter | C→G | −11377 | Nc | 14749263 | ATTA | rs10937272 |
| TNF-alpha | 6p21.3 | Promoter | G→A | −308 | Nc | 12818408 | CATG | rs1800629 |
| IL-6 | 7p21 | Promoter | G→C | −174 | Y | 12589429 | TTGC | rs1800795 |
Figure 2Important relationships shared by methylation and T2DM (referred to as “NIDDM” in the system). A total of 1287 co-cited objects were identified between the two, about 959 of these reflect actual relationships of a nontrivial nature. Only relationships emphasized within this report are shown here. A full list is available online at http://innovation.swmed.edu/IRIDESCENT/NIDDM_theory.htm.
Objects linked to T2DM solely by virtue of relationships they share within Medline. At the time of analysis, none of these objects were documented in Medline to have a relationship with T2DM (shown at top as a positive control for the query). The nature of each of these implicit relationships varies and must be determined by examination of the intermediate connections. The expected value represents how many shared relationships would be expected given a randomly connected network of relationships with the same properties of the literature-derived one. The quality score represents a statistical weighting of co-mention frequency to reflect confidence that the relationship is not of a trivial nature. The observed to expected ratio (Obs/exp) provides a ranking of how statistically exceptional any given set of shared relationships is.
| Rank | Shared relationships | Implicit relationship | Quality | Expect | Obs/exp |
| — | 2105 | T2DM | 1421 | 329 | 4.32 |
| 1 | 1361 | Endotoxin | 1054 | 308 | 3.42 |
| 2 | 1312 | Hydrocortisone | 991 | 296 | 3.35 |
| 3 | 1301 | Neuroblastoma | 975 | 339 | 2.88 |
| 4 | 1287 | Methylation | 959 | 346 | 2.77 |
| 5 | 1256 | Chromatin | 938 | 339 | 2.77 |