| Literature DB >> 24289864 |
Raquel M Fernández, Marta Bleda, Berta Luzón-Toro, Luz García-Alonso, Stacey Arnold, Yunia Sribudiani, Claude Besmond, Francesca Lantieri, Betty Doan, Isabella Ceccherini, Stanislas Lyonnet, Robert Mw Hofstra, Aravinda Chakravarti, Guillermo Antiñolo, Joaquín Dopazo1, Salud Borrego.
Abstract
Despite it has been reported that several loci are involved in Hirschsprung's disease, the molecular basis of the disease remains yet essentially unknown. The study of collective properties of modules of functionally-related genes provides an efficient and sensitive statistical framework that can overcome sample size limitations in the study of rare diseases. Here, we present the extension of a previous study of a Spanish series of HSCR trios to an international cohort of 162 HSCR trios to validate the generality of the underlying functional basis of the Hirschsprung's disease mechanisms previously found. The Pathway-Based Analysis (PBA) confirms a strong association of gene ontology (GO) modules related to signal transduction and its regulation, enteric nervous system (ENS) formation and other processes related to the disease. In addition, network analysis recovers sub-networks significantly associated to the disease, which contain genes related to the same functionalities, thus providing an independent validation of these findings. The functional profiles of association obtained for patients populations from different countries were compared to each other. While gene associations were different at each series, the main functional associations were identical in all the five populations. These observations would also explain the reported low reproducibility of associations of individual disease genes across populations.Entities:
Mesh:
Year: 2013 PMID: 24289864 PMCID: PMC3879038 DOI: 10.1186/1750-1172-8-187
Source DB: PubMed Journal: Orphanet J Rare Dis ISSN: 1750-1172 Impact factor: 4.123
Figure 1Schema of the analysis strategy used to validate the findings in the Spanish population [[17]] with the available scattered information about the four additional populations present in the consortium.
Available trios for any of the chips distributed among the five country populations analyzed
| 15 | 20 | 25 | 37 | 97 | |
| 16 | 18 | 25 | 32 | 91 | |
| 5 | 1 | 9 | 11 | 26 | |
| 26 | 37 | 41 | 58 |
First and second rows refer to the trios genotyped with each specific chip (Nsp or Sty). Third file contains the trios simultaneously genotyped with both chips (Nsp + Sty). Forth row is the total number of trios used in the study for each population.
GO modules significantly associated to HSCR in the different populations analyzed with the and chips, separately
| | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Signaling | GO:0051056 | Regulation of small GTPase mediated signal transduction | Y | | Y | | Y | | Y | Y |
| GO:0046578 | Regulation of Ras protein signal transduction | Y | | Y* | | Y | | Y* | Y | |
| | | | ||||||||
| | | | | |||||||
| GO:0035023 | Regulation of Rho protein signal transduction | Y | | Y | | Y* | | | | |
| | | | | |||||||
| | | | | | | | ||||
| | | | | |||||||
| ENS formation | | | | |||||||
| | | |||||||||
| | | | ||||||||
| | | | | | ||||||
| | | | | |||||||
| | | | | | ||||||
The first column specifies the general biological process represented by the GO terms (columns 2 and 3) which were found significant in the Spanish population and constitute the initial hypothesis for the functional basis of the disease. From fourth column ahead a Y means that the GO term was significant in the corresponding population and chip (FDR-adjusted p-values < 0.05) using the PBA approach described in Methods. Y* means that although this particular term was not significant, another direct descendant or ancestor GO term in the hierarchy was significant. The last four files are GO terms (in italics) that were not initially significant in the Spanish population but were consistently significant through the populations analyzed here.
Figure 2Hierarchy of GO terms significantly associated to HSCR in the different populations analyzed with the and chips, separately (See Table2). Red color means that the GO term, originally significant in the Spanish population [17] was also significant in all the four Consortium populations in at least one of the chips (FDR-adjusted p-values < 0.05) using the PBA approach described in Methods. In Orange color are represented those initially red color GO term in which there seems to be a potential bias due to the size of the genes composing the GO. In green color there is a GO term, significant in the Spanish population and significant in only one population of the consortium (USA). The blue color correspond GO terms that were not initially significant in the Spanish population but were consistently significant through the populations analyzed here.
Figure 3Number of genes (blue line) and GO terms common to the four Consortium populations studied and the Spanish population. The X axis (in logarithmic scale) represents the list of SNPs ranked by p-value. For example, within the 50 first SNPs there is only one gene common to the five populations, AGAP3, while SNPs in different genes were defining 11 GO terms common to the five populations analyzed. The upper left square represents a detailed view of the first 100 ranked SNPs.
Figure 4Significant sub-network of 53 genes, allowing for one intermediate gene, associated to HSCR (p-value = 0.027) in the chip. The network analysis [28] is carried out by using the functional analysis options (Set enrichment analysis/NetworkMiner) of Babelomics [26] on the list of genes ranked by the p-values obtained upon the application of a conventional TDT association test as implemented in PLINK [24].
Extra-genic SNPs with a nominal p-value < 10 mapping in regions recently annotated in ENCODE [[32]]
| rs2435367 | 8.067×10-11 | 2.2083×10-06 | 5.9 kb 5′ of | DNAse hypersensitivity | | | |
| rs2505526 | 8.353×10-07 | 0.01506847 | 7.5 kb 3′ of RASGEF1A | | | Pax-4 | |
| rs16838932 | 4.459×10-07 | 0.021091873 | 421 kb 5′ of SLC39A10 | Ultraconserved region [ | | | |
| rs2659635 | 7.098×10-06 | 0,08204207 | 125 kb 5′ of XRCC6BP1 | Ultraconserved region [ | | Foxj2 | |
| rs4570660 | 7.744×10-06 | 0,08479448 | 65 kb 3′ of APOBEC1 | DNAse hypersensitivity | | Sox | |
| rs12067906 | 9.584×10-06 | 0,09994469 | 96 kb 5′ of RGS21 | DNAse hypersensitivity | RAD21 | Gfi1, HLF |
In this case all the trios have been analysed together for each chip.