| Literature DB >> 25268122 |
Daniele Muraro1, Douglas A Lauffenburger2, Alison Simmons1.
Abstract
Recent Genome-Wide Association Studies (GWAS) have revealed numerous Crohn's disease susceptibility genes and a key challenge now is in understanding how risk polymorphisms in associated genes might contribute to development of this disease. For a gene to contribute to disease phenotype, its risk variant will likely adversely communicate with a variety of other gene products to result in dysregulation of common signaling pathways. A vital challenge is to elucidate pathways of potentially greatest influence on pathological behaviour, in a manner recognizing how multiple relevant genes may yield integrative effect. In this work we apply mathematical analysis of networks involving the list of recently described Crohn's susceptibility genes, to prioritise pathways in relation to their potential development of this disease. Prioritisation was performed by applying a text mining and a diffusion based method (GRAIL, GPEC). Prospective biological significance of the resulting prioritised list of proteins is highlighted by changes in their gene expression levels in Crohn's patients intestinal tissue in comparison with healthy donors.Entities:
Mesh:
Year: 2014 PMID: 25268122 PMCID: PMC4182533 DOI: 10.1371/journal.pone.0108624
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Network associated with Crohn's disease and motifs.
(a) Representation of the protein interaction network obtained by prioritisation. The network presents connected components, each one being highlighted using a different colour. The giant component, namely the connected subgraph that contains the majority of the entire graph's nodes, is shown in red. (b) Sub-graphs frequency and z-scores in the network associated with Crohn's. Considering the threshold , subgraphs id78 and id4382 are over-represented (motifs), whereas subgraphs id238, id4698, id4958, id13278, id31710 are under-represented (anti-motifs).
Hubs distribution.
| Proteins list | N. proteins | N. Hubs | p-value |
| NCBI Human PPI network proteins |
|
|
|
| Prioritised proteins |
|
|
|
| Disease network proteins |
|
|
|
Table summarising the number of hub proteins in the NCBI proteome, in the list of prioritised proteins and in the same list together with their first neighbours (Disease network proteins). Over-representation of hubs is statistically significant when considering first neighbours of the prioritised list (Hypergeometric distribution p-values).
Figure 2Topological segregation.
Series of plots representing the segregation functions of over-represented categories in the network associated with Crohn's sorted from the most to the least segregated category. (a) categories within biological processes; (b) categories within protein classes; (c) categories within molecular functions and pathways.
Selected proteins.
| Protein name | N. neighbours | N. hub neighbours | p-value |
| STAT3 |
|
|
|
| JAK2 |
|
|
|
| VDR |
|
|
|
| PRDM1 |
|
|
|
| FASLG |
|
|
|
| ATF4 |
|
|
|
Table summarising the number of hub first neighbours in the selected proteins listed in section 'Results and discussion'. P-values represent the probability that the number of neighbour hubs is due to random choice and are calculated using a Fisher's exact test which compares the total number of hubs in the NCBI proteome with the number of hubs in the neighbours of the selected proteins. Of the proteins listed in the NCBI protein interaction network have a number of first neighbours which is strictly higher than the average; connectivity with these hubs is over-represented for the proteins presented.