| Literature DB >> 31323926 |
Noemi Di Nanni1,2, Matteo Bersanelli3,4, Francesca Anna Cupaioli1, Luciano Milanesi1, Alessandra Mezzelani1, Ettore Mosca5.
Abstract
Current studies suggest that autism spectrum disorders (ASDs) may be caused by many genetic factors. In fact, collectively considering multiple studies aimed at characterizing the basic pathophysiology of ASDs, a large number of genes has been proposed. Addressing the problem of molecular data interpretation using gene networks helps to explain genetic heterogeneity in terms of shared pathways. Besides, the integrative analysis of multiple omics has emerged as an approach to provide a more comprehensive view of a disease. In this work, we carry out a network-based meta-analysis of the genes reported as associated with ASDs by studies that involved genomics, epigenomics, and transcriptomics. Collectively, our analysis provides a prioritization of the large number of genes proposed to be associated with ASDs, based on genes' relevance within the intracellular circuits, the strength of the supporting evidence of association with ASDs, and the number of different molecular alterations affecting genes. We discuss the presence of the prioritized genes in the SFARI (Simons Foundation Autism Research Initiative) database and in gene networks associated with ASDs by other investigations. Lastly, we provide the full results of our analyses to encourage further studies on common targets amenable to therapy.Entities:
Keywords: autism spectrum disorders; biological networks; data integration; genomics; multi-omics; network diffusion
Mesh:
Year: 2019 PMID: 31323926 PMCID: PMC6651137 DOI: 10.3390/ijms20133363
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Datasets considered in this study. Selected: The number of genes for which at least a high confidence interaction with any other gene is catalogued in the STRING database (see methods). G: Genomics; E: Epigenomics; T: Transcriptomics. ** major evidence; * minor evidence.
| Type of Evidence | Description | Subjects | Number of Genes | |||
|---|---|---|---|---|---|---|
| Initial | Selected | |||||
| ** | * | ** | * | |||
| G | SFARI [ | - | 404 | 1087 | 334 | 799 |
| G | Network diffusion-based prioritization of autism risk genes identifies significantly connected gene modules [ | - | ||||
| G | Meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder [ | 15,954 | ||||
| G | Synaptic, transcriptional and chromatin genes disrupted in autism [ | 13,808 | ||||
| E | Case-control meta-analysis of blood DNA methylation and autism spectrum disorder [ | 1654 | 416 | 1444 | 272 | 955 |
| T | Gene expression profiling differentiates autism case- controls and phenotypic variants of autism spectrum disorders [ | 116 | 330 | 3045 | 256 | 2131 |
| T | Blood gene expression signatures distinguish autism spectrum disorders from controls [ | 285 | ||||
| T | Disrupted functional networks in autism underlie early brain mal-development and provide accurate classification. [ | 147 | ||||
| T | Gene expression in blood of children with autism spectrum disorder [ | 47 | ||||
Figure 1Genes in network proximity to the core genes of autism spectrum disorders (ASDs). (A) Diffusion score (Xs) normalized by its empirical p-value (horizontal axis) and number of interactions (|I|, vertical axis); only genes with p < 0.05 are shown. (B) Connected components of “core+13” network. (A,B) Blue points: 13 genes of “core+13”; pink points: core genes; yellow points: Significant genes outside “core+13” genes; red border of points: Genes supported at transcriptomic and/or epigenetic levels.
The 13 genes that closely interact with the core genes of ASDs. |I|: Number of interactors; |Ic|: Number of interactors that are core genes; p: Hypergeometric probability of observing |Ic| in a hypergeometric experiment; G: Genomics; E: Epigenomics; T: Transcriptomics; ** major; * minor; 0: No evidence; SFARI score: “minimal evidence” (4), “hypothesized but untested” (5); other modules: Reference of gene-networks studies of ASDs in which the gene is mentioned. The total number of genes considered is equal to the interactome size: 12,739 genes.
| Symbol | Description | |I| | |Ic| | |core| |
| G | E | T | SFARI Score | Other Modules |
|---|---|---|---|---|---|---|---|---|---|---|
|
| hyperpolarization activated cyclic nucleotide gated potassium channel 4 | 4 | 2 | 334 | 3.97 × 10−3 | * | * | 0 | - | - |
|
| DLG associated protein 2 | 21 | 8 | 334 | 3.10 × 10−8 | * | 0 | 0 | 4 | [ |
|
| hyperpolarization activated cyclic nucleotide gated potassium and sodium channel 2 | 4 | 1 | 334 | 1.01 × 10−1 | * | 0 | 0 | - | - |
|
| ubiquitin C | 1168 | 43 | 334 | 1.41 × 10−2 | * | 0 | 0 | - | [ |
|
| neuroligin 2 | 28 | 8 | 334 | 4.04 × 10−7 | * | 0 | 0 | 4 | [ |
|
| WD repeat domain 37 | 2 | 2 | 334 | 6.85 × 10−4 | 0 | 0 | * | - | - |
|
| myotubularin related protein 2 | 6 | 1 | 334 | 1.47 × 10−1 | * | 0 | 0 | - | - |
|
| erythrocyte membrane protein band 4.1 like 1 | 34 | 9 | 334 | 1.55 × 10−7 | * | 0 | 0 | - | [ |
|
| gamma-aminobutyric acid type A receptor alpha5 subunit | 17 | 4 | 334 | 8.43 × 10−4 | * | 0 | 0 | 5 | [ |
|
| syntaxin 1A | 78 | 10 | 334 | 3.47 × 10−5 | * | 0 | 0 | 4 | [ |
|
| erythrocyte membrane protein band 4.1 | 16 | 5 | 334 | 4.14 × 10−5 | * | 0 | ** | - | [ |
|
| calcium voltage-gated channel subunit alpha1 F | 37 | 6 | 334 | 3.63 × 10−4 | * | 0 | 0 | 4 | [ |
|
| protein kinase C alpha | 197 | 11 | 334 | 1.48 × 10−2 | * | * | * | 4 | - |
Collectively, we observed that a significant number of “core+13” genes emerged as associated with ASDs at epigenomics level (p = 2.63 × 10−4; hypergeometic test; Table 3) and at transcriptomics level; p = 1.22 × 10−3 hypergeometic test, Table 3).
Overlaps among the lists of genes associated with ASDs. G: Genomic; E: Epigenomics; T: Transcriptomics; ** major; * minor; core+13(E) and core+13(T) indicate genes belonging to the core+13 set and which are supported by E and T, respectively.
| A | B |
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|
| core+13 | core+13(E) | 347 | 1227 | 12739 | 54 | 3.27 | 2.63 × 10−4 |
| core+13 | core+13(T) | 347 | 2387 | 12739 | 88 | 6.37 | 1.22 × 10−3 |
| G | E | 1133 | 1227 | 12739 | 146 | 109 | 1.09 × 10−4 |
| G | T | 1133 | 2387 | 12739 | 235 | 212 | 3.95 × 10−2 |
| E | T | 1227 | 2387 | 12739 | 243 | 230 | 1.66 × 10−1 |
| G ** | E ** | 334 | 272 | 12739 | 15 | 7.13 | 5.47 × 10−3 |
| G ** | T ** | 334 | 256 | 12739 | 15 | 6.71 | 3.12 × 10−3 |
| E ** | T ** | 272 | 256 | 12739 | 5 | 5.47 | 6.42 × 10−1 |
Figure 2Overlaps among genes associated with ASDs by genomics, epigenomics, and transcriptomics. (A,B) G: Genomics; E: Epigenomics; T: Transcriptomics. ** major.
Figure 3Integrative multi-omics analysis. (A) Global network diffusion scores (horizontal axis) and number of interactions (vertical axis) of the top ranking genes; the vertical dashed line separates the top 275 genes belonging to the INT-MODULE (higher scores, on the right) from the other genes (lower scores, on the left). (B) Network of the top 275 genes (INT-MODULE). Green circles: shared genes; blue circles: Genes included in SFARI categories 4 and 5; red circle: LRRC46.
INT-MODULE genes SFARI. G: Genomics; E: Epigenomics; T: Transcriptomics. ** major; * minor. Im: Number of interactors within the INT-MODULE; SFARI score: “minimal evidence” (4), “hypothesized but untested” (5). Other modules: Reference of gene-networks studies that also associated the gene to ASDs.
| Symbol | Description | #Im | G | E | T | SFARI Score | Other Modules |
|---|---|---|---|---|---|---|---|
|
| BAI1-associated protein 2 | 4 | * | * | 0 | 5 | [ |
|
| calcium voltage-gated channel subunit alpha1 B | 7 | 0 | ** | 0 | 4 | [ |
|
| CREB binding protein | 43 | 0 | 0 | ** | 5 | [ |
|
| homeobox B1 | 12 | 0 | * | * | 5 | [ |
|
| inositol polyphosphate-1-phosphatase | 1 | 0 | ** | 0 | 4 | [ |
|
| inositol 1,4,5-trisphosphate receptor type 1 | 11 | * | * | 0 | 4 | [ |
|
| potassium large conductance calcium-activated channel, subfamily M, alpha member 1 | 1 | 0 | ** | 0 | 4 | [ |
|
| Ras association domain family member 5 | 0 | 0 | ** | ** | 4 | - |
|
| RNA binding motif protein 8A | 10 | 0 | ** | * | 5 | - |
|
| SH3-domain kinase binding protein 1 | 12 | * | 0 | ** | 5 | - |
Figure 4Functional characterization of the INT-MODULE. (A) Topological clusters; #1–12: Clusters of the largest connected component; #13,14: Two clusters of three and two genes, respectively; #15: The remaining four genes. (B) Enrichment (vertical axis) of each cluster in terms of genes supported by genomics (G), epigenomics (E), and transcriptomics (T): A value of 1 indicates the same proportion within the cluster and in the whole INT-MODULE.