| Literature DB >> 25409314 |
Catarina Correia1, Guiomar Oliveira2, Astrid M Vicente1.
Abstract
Genome-wide association studies (GWAS) for Autism Spectrum Disorder (ASD) thus far met limited success in the identification of common risk variants, consistent with the notion that variants with small individual effects cannot be detected individually in single SNP analysis. To further capture disease risk gene information from ASD association studies, we applied a network-based strategy to the Autism Genome Project (AGP) and the Autism Genetics Resource Exchange GWAS datasets, combining family-based association data with Human Protein-Protein interaction (PPI) data. Our analysis showed that autism-associated proteins at higher than conventional levels of significance (P<0.1) directly interact more than random expectation and are involved in a limited number of interconnected biological processes, indicating that they are functionally related. The functionally coherent networks generated by this approach contain ASD-relevant disease biology, as demonstrated by an improved positive predictive value and sensitivity in retrieving known ASD candidate genes relative to the top associated genes from either GWAS, as well as a higher gene overlap between the two ASD datasets. Analysis of the intersection between the networks obtained from the two ASD GWAS and six unrelated disease datasets identified fourteen genes exclusively present in the ASD networks. These are mostly novel genes involved in abnormal nervous system phenotypes in animal models, and in fundamental biological processes previously implicated in ASD, such as axon guidance, cell adhesion or cytoskeleton organization. Overall, our results highlighted novel susceptibility genes previously hidden within GWAS statistical "noise" that warrant further analysis for causal variants.Entities:
Mesh:
Year: 2014 PMID: 25409314 PMCID: PMC4237351 DOI: 10.1371/journal.pone.0112399
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Workflow of the strategy for network definition, validation and identification of most relevant candidate genes.
ASD
-associated proteins at the same range of gene-wise -Log10 P-values than in random sets (Empirical P values 0.001ASD dataset are interconnected in a significantly larger LCC (Empirical P values 0.001
AGP and AGRE datasets, respectively, indicates the existence of several small effect risk genes reinforcing the high genetic heterogeneity in ASD.
Figure 2Network properties of proteins selected at gene-wise P<0.1 in each ASD.
a) Comparison of percentage of direct interactions and isolated nodes between proteins selected at gene-wise P<0.1 in each GWAS dataset (red circles) vs 1000 random samples of network proteins (represented by light gray and dark gray box plots, for direct interactions and isolated nodes, respectively). The bottom and top of the box represent the 25th and 75th percentile and the extremity of the whiskers the maximum and minimum of the random samples data. b) Same comparison for the largest connected component (LCC) size.
Precision and recall were consistently higher for LCC genes relative to top GWAS genes or genes selected at P<0.1.
| Precision (%) | Recall (%) | |||
| Gene subset | AGP dataset | AGRE dataset | AGP dataset | AGRE dataset |
|
| 2.16 | 2.74 | 3.81 | 4.24 |
|
| 1.68 | 0.82 | 1.27 | 2.97 |
|
| 0.96 | 1,11 | 8.47 | 9.43 |
Precision and Recall (Percentage), by ASD dataset, of three sets of genes (genes selected at a gene wise P-value cutoff of 0.1, genes included in the LCC and the same number of GWAS top genes) against a list of known disease candidates.
Figure 3ASD top scoring gene network.
This network illustrates the 14 top scoring genes included in the ASD LCC and their first neighbors. Nodes are colored based on a score reflecting their presence in the second ASD dataset and in the 6 unrelated diseases LCCs. A darker color represents a higher score, which means a higher specificity for ASD.
Top scoring ASD network genes.
| Gene (Uniprot ID) | Description | Location | Relevant biological processes | Gene-wise association | Published studies in autism | Neurological and behavioral features in mouse models |
|
| nuclearautoantigenicsperm protein (histone-binding) | 1p34.1 | blastocyst development, cellproliferation, cell cycle | 1.470e−02 | NEXTBIO: deletion inidiopathic females(Sakai et al, 2011), significantlydownregulated in brainsamples (Chow et al.2011) | NA |
|
| peroxiredoxin 1 | 1p34.1 | redox regulation, cellproliferation | 1.760e−04 | - | NA |
|
| ribosomal proteinS6 kinase, 90kDa, polypeptide 1 | 1p36.11 | protein kinase, synaptictransmission, axon guidance, long-term potentiation, toll-likeand NGF receptor signaling pathway | 2.422e−02 | - | NNP |
|
| phosphogluconate dehydrogenase | 1p36.22 | cell redox regulation | 7.851e−02 | De novo mutation in autistic patient (O′Roak et al. 2012) | NNP |
|
| lamin A/C | 1q22 | regulation of cell migration, regulation of apoptotic process, spermatogenesis | 8.226e−02 | Nextbio:altered expression in Lymphoblastoid cells from males with autism (15q11–13 duplication) and brain samples (Chow et al. 2011; Nishimura et al. 2007) | abnormal axon morphology, abnormal myelination |
|
| protein phosphatase 1, catalytic subunit, beta isozyme | 2p23 | regulation of cell cycle, focal-adhesion, long-term potentiation, regulation of actin cytoskeleton | 3.749e−02 | - | NA |
|
| Ras association (RalGDS/AF-6) domain family member 1 | 3p21.3 | Cell cycle, response to DNA damage stimulus, positive regulation of protein ubiquitination | 1.949e−02 | - | NA |
|
| cathepsin B | 8p23.1 | regulation of apoptotic process, cellular response to thyroid hormone stimulus | 2.343e−03 | Purkinje cell degeneration, abnormalneuron apoptosis (details)neuron degeneration | |
|
| c-abl oncogene 1, non-receptor tyrosine kinase | 9q34.1 | axon guidance, regulation of cell adhesion, motility, cycle, actin cytoskeleton organization, response to DNA damage stimulus | 9.560e−02 | NextBio:altered expression in autistic brain samples (Chow et al. 2011) | abnormal cerebellum morphology, small cerebellum, abnormal cerebellum development, abnormal cerebellar foliation, ectopic Purkinje cell, abnormal cerebellar lobule formation, absent cerebellar lobules, abnormal neuron differentiation |
|
| Abl-interactor 1 previously known as spectrin SH3 domain binding protein 1 | 10p12.1 | transmembrane receptor protein tyrosine kinase signaling pathway, negative regulation of cell proliferation | 7.352e−02 | NextBio:downregulation in autistic brain samples (Chow et al. 2011) | NA |
|
| polymerase (RNA) II (DNA directed) polypeptide L, 7.6kDa | 11p15.5 | DNA repair, regulation of transcription | 7.821e−02 | - | NNP |
|
| nuclear receptor subfamily 4, group A, member 1 also known as nerve Growth factor IB (NGFIB) | 12q13.13 | nuclear transcription factor, epidermal and fibroblast growth factor receptor signaling pathway, nerve growth factor receptor signaling pathway | 4.783e−02 | NextBio:downregulation in autistic brain samples (Chow et al. 2011) | NA |
|
| Bardet-Biedl syndrome 4 | 15q22.3–q23 | centrosome organization, microtubule cytoskeleton organization, neural tube closure, dendrite, striatum, hippocampus, cerebral cortex development | 7.511e−02 | Nextbio:altered expression in lymphoblasts and brain samples (Chow et al. 2011; Nishimura et al. 2007) | abnormal neural tubemorphology/development, thincerebral cortex, abnormal basal ganglion morphologyabnormal olfactory neuron morphology, small hippocampusenlarged lateral ventriclesenlarged third ventricle |
|
| eukaryotic translation initiation factor 2, subunit 3 gamma, 52kDa | Xp22.11 | cellular protein metabolic process | 7.322e−02 | - | NNP |
List of the 14 top scoring ASD network genes, present in both ASD networks and in none of the other disorders (ASD specificity score = 1), with information on gene-wise association P-value and biological processes relevant for ASD.
NNP - No neurological phenotypes|; NA - No mouse model available.