| Literature DB >> 28993790 |
Ettore Mosca1, Matteo Bersanelli2, Matteo Gnocchi1, Marco Moscatelli1, Gastone Castellani2, Luciano Milanesi1, Alessandra Mezzelani1.
Abstract
Autism spectrum disorder (ASD) is marked by a strong genetic heterogeneity, which is underlined by the low overlap between ASD risk gene lists proposed in different studies. In this context, molecular networks can be used to analyze the results of several genome-wide studies in order to underline those network regions harboring genetic variations associated with ASD, the so-called "disease modules." In this work, we used a recent network diffusion-based approach to jointly analyze multiple ASD risk gene lists. We defined genome-scale prioritizations of human genes in relation to ASD genes from multiple studies, found significantly connected gene modules associated with ASD and predicted genes functionally related to ASD risk genes. Most of them play a role in synapsis and neuronal development and function; many are related to syndromes that can be in comorbidity with ASD and the remaining are involved in epigenetics, cell cycle, cell adhesion and cancer.Entities:
Keywords: autism spectrum disorder; biological networks; data integration; gene module; network diffusion
Year: 2017 PMID: 28993790 PMCID: PMC5622537 DOI: 10.3389/fgene.2017.00129
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Overlap between lists of genes harboring variations associated with ASDs.
Gene list size (diagonal) and number of genes co-occurring between pairs of lists (off-diagonal elements); lower triangle and diagonal (bottom): original gene lists; upper triangle and diagonal (top): original gene lists with only genes occurring in STRING network.
Figure 1A significantly connected gene module based on SFARIh genes. The network-diffusion based analysis of SFARIh genes (red) leads to the definition of a significantly connected gene module of 244 genes (blue border). (A) Network smoothing index and corresponding estimated p-value (p). (B) For each rank n (horizontal axis), the estimated p-value (p) that quantifies the significance of the gene network defined by the genes ranked up to the n-th rank. (C) Number of interactions (d) and normalized betweenness (B) of genes in MSFARI. (D) Visualization of MSFARI gene module, in which genes are circles and PPI are links; circle size is proportional to gene degree. (A–D) G: all genes included in the STRING network.
Number of SFARIl genes in network proximity to SFARIh genes.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 2 | 10 | 11,371 | 216 | 0.190 | 1.46·10−2 |
| 9 | 100 | 11,281 | 216 | 1.90 | 1.15·10−4 |
| 9* | 102* | 11,279* | 216* | 1.94* | 1.34·10−4* |
| 20 | 300 | 11,081 | 216 | 5.69 | 1.07·10−6 |
| 22 | 400 | 10,981 | 216 | 7.59 | 7.27·10−6 |
| 26 | 500 | 10,881 | 216 | 9.49 | 2.87·10−6 |
M, genes in network proximity to SFARIh genes; G, all genes; |M ∩ SFARIl| size of the intersection between M and SFARIl; .
Figure 2A small number of ASD risk genes is found in more than one large study. (A) Number of risk genes found in 1 or more of the six gene lists on ASD; i: original gene lists; n0: original gene lists wit only genes occurring in the STRING network. (B) Permutation-adjusted network smoothing index of the 14 genes occurring in 3 or more original lists (the number is reported between parenthesis); the asterisk (*) indicates genes of the corresponding original list.
Figure 3Genes in network proximity to ASD risk genes. The network smoothing index (S) and the corresponding p value (p) are the two factors that make up Sp. Blue: genes belonging to the corresponding original list (sources); red: genes in network proximity to sources; gray: remaining genes.
Co-occurrence of genes in network proximity to ASD risk genes from one or more sources.
|
| ||||||
|---|---|---|---|---|---|---|
|
|
|
|
|
| ||
|
|
| 691 | 68 | 14 | 2 | 775 |
|
| 89 | 15 | 4 | 1 | 109 | |
|
| 18 | 2 | 0 | 0 | 20 | |
|
| 6 | 0 | 0 | 0 | 6 | |
|
| 3 | 0 | 0 | 0 | 3 | |
|
| 807 | 85 | 18 | 3 | 913 | |
A total of 913 genes were predicted to be in network proximity to ASD risk genes of one or more studies (rows) where could appear as ASD risk genes (columns). For example, 15 genes were predicted in network proximity to ASD risk genes of 2 studies and were proposed as ASD risk genes in 1 study (n.
Figure 4Genes in network proximity to ASD risk genes from 3 or more studies. Permutation-adjusted network smoothing index (Sp) of the 29 genes predicted as functionally related to 3 or more ASD risk gene lists (the number is reported between parenthesis); the asterisk (*) indicates genes of the corresponding original list; faded colors indicate input ASD risk genes or genes with no significant Sp values.
Hub genes predicted in network proximity to ASD risk genes of three or more studies establish a significant number of interactions with ASD risk genes.
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| SRC | 20q11.23 | 0 | 5 | 532 | 82 | 2.00·10−8 | |
| TP53 | 17p13.1 | 25 | 4 | 719 | 92 | 1.36·10−5 | |
| AKT1 | 14q32.33 | 25 | 4 | 589 | 73 | 2.90·10−4 | |
| RHOA | 3p21.31 | 13 | 4 | 406 | 65 | 1.53·10−7 | |
| MAPK14 | 6p21.31 | 11 | 4 | 333 | 44 | 1.30·10−3 | |
| HDAC3 | 5q31.3 | 8 | 4 | 275 | 43 | 3.54·10−5 | |
| HSP90AA1 | 14q32.31 | 10 | 3 | 589 | 65 | 9.99·10−3 | |
| APP | 21q21.3 | 15 | 3 | 363 | 42 | 1.68·10−2 | |
| HDAC1 | 1p35.2 | 12 | 3 | 351 | 48 | 3.69·10−4 | |
| CDC42 | 1p36.12 | 9 | 3 | 336 | 56 | 2.97·10−7 | |
| GRB2 | 17q25.1 | 21 | 3 | 279 | 42 | 1.06·10−4 | |
| PRKACA | 19p13.12 | 14 | 3 | 278 | 41 | 2.00·10−4 | |
| SOS1 | 2p22.1 | 14 | 3 | 260 | 50 | 1.26·10−8 | |
| SMAD3 | 15q22.33 | 1 | 3 | 241 | 33 | 2.83·10−3 | |
| PRKACB | 1p31.1 | 22 | 3 | 213 | 28 | 9.81·10−3 | |
| CBL | 11q23.3 | 12 | 3 | 203 | 37 | 3.77·10−6 | |
| PRKACG | 9q21.11 | 7 | 3 | 185 | 27 | 2.70·10−3 | |
| RHOB | 2p24.1 | 8 | 3 | 174 | 38 | 2.19·10−8 | |
| SOS2 | 14q21.3 | 12 | 3 | 109 | 24 | 7.36·10−6 | |
| DNM2 | 19p13.2 | 19 | 3 | 84 | 21 | 3.27·10−6 |
Gene and function from GeneCards (Safran et al., .
Figure 5Genes in network proximity to ASD risk genes from each study. Top 10 genes with the highest permutation-adjusted network smoothing index (Sp) calculated in the analysis of each ASD risk gene list; the number of lists to which the gene was predicted as network proximal is reported between parenthesis; the asterisk (*) indicates genes of the corresponding original list; faded colors indicate input ASD risk genes or genes with no significant Sp values.
Figure 6A significantly connected gene module based on ASD risk genes from 6 sources. The network-diffusion based analysis of 956 ASD risk genes from 6 sources (red) leads to the definition of a significantly connected gene module (MASD) of 561 genes (blue border). (A) Network smoothing index (S) and corresponding p-value (p). (B) For each rank n (horizontal axis), the estimated p-value (p) that quantifies the significance of the gene network defined by the genes ranked up to the n-th rank is reported on the vertical axis. (C) Percent of ASD genes of original lists included in MASD. (D) Number of interactions (d) and betweenness (B) of genes in MASD. (A–D) G: all genes included in the PPI network.
Figure 7Significant pathways enriched in genes associated with ASD. (A–E) Pathways found in the analysis of each original list (i) and corresponding predicted genes (n*). (F) Heatmap of all the pathways found in the analysis of the lists reported on the rows; also pathways found analyzing the original lists without genes not occurring in the STRING network (n0) are represented; the color bar indicates the −log10(p) of the hypergeomtric p-value, adjusted for multiple hypothesis testing; the number 7 indicates p ≤ 10−7. (G) Enrichment map; vertex size is proportional to pathway significance (adjusted p-value); links are reported only for overlap coefficients >0.5; for each pathway, only the links with the top 5 most similar pathways are drawn. (A–G) See Supplementary Table 6.