| Literature DB >> 30886520 |
Min Woo Sun1, Anika Gupta1, Maya Varma1, Kelley M Paskov1, Jae-Yoon Jung1, Nate T Stockham1, Dennis P Wall1.
Abstract
Studies on autism spectrum disorder (ASD) have amassed substantial evidence for the role of genetics in the disease's phenotypic manifestation. A large number of coding and non-coding variants with low penetrance likely act in a combinatorial manner to explain the variable forms of ASD. However, many of these combined interactions, both additive and epistatic, remain undefined. Coalitional game theory (CGT) is an approach that seeks to identify players (individual genetic variants or genes) who tend to improve the performance-association to a disease phenotype of interest-of any coalition (subset of co-occurring genetic variants) they join. This method has been previously applied to boost biologically informative signal from gene expression data and exome sequencing data but remains to be explored in the context of cooperativity among non-coding genomic regions. We describe our extension of previous work, highlighting non-coding chromosomal regions relevant to ASD using CGT on alteration data of 4595 fully sequenced genomes from 756 multiplex families. Genomes were encoded into binary matrices for three types of non-coding regions previously implicated in ASD and separated into ASD (case) and unaffected (control) samples. A player metric, the Shapley value, enabled determination of individual variant contributions in both sets of cohorts. A total of 30 non-coding positions were found to have significantly elevated player scores and likely represent significant contributors to the genetic coordination underlying ASD. Cross-study analyses revealed that a subset of mutated non-coding regions (all of which are in human accelerated regions (HARs)) and related genes are involved in biological pathways or behavioral outcomes known to be affected in autism, suggesting the importance of single nucleotide polymorphisms (SNPs) within HARs in ASD. These findings support the use of CGT in identifying hidden yet influential non-coding players from large-scale genomic data, to better understand the precise underpinnings of complex neurodevelopmental disorders such as autism.Entities:
Keywords: autism spectrum disorder; coalitional game theory; non-coding genome
Year: 2019 PMID: 30886520 PMCID: PMC6410388 DOI: 10.1177/1178222619832859
Source DB: PubMed Journal: Biomed Inform Insights ISSN: 1178-2226
Figure 1.Data analysis flow diagram, starting from the sequenced genomes to identification of statistically significant non-coding variants through coalitional game theory. Data from human accelerated regions (red), psychiatric disorder-associated dysregulated miRNA (yellow), and schizophrenia-associated miRNA (blue) were analyzed independently.
Non-coding variants highlighted through coalitional game theory at two levels of significance.
| Significance | Chromosomal position |
|---|---|
| chr2:176990625, chr2:208297661, chr3:70655368, chr 4:182669844, chr4:35519558, | |
| chr7:25357732, chr8:116717451, |
The 30 variants in non-coding chromosomal positions fall into one of three categories: dysregulated segments associated with psychiatric disorders (underlined), schizophrenia-associated miRNA regions (italicized), and HARs. The 26 variants listed in P < .05 are the subset of the 30 variants not in P < .01. All coordinates are relative to build GRCh37.
Figure 2.A non-coding variant (rs724600) identified by CGT as significantly (P < .01) associated with ASD. The position of the variant, X:147783665, is highlighted in yellow. This variant falls within an intron of the gene AFF2 (indicated by the red bar). Alternative splicing and regulatory interactions could implicate the variant with different phenotypic outcomes.
Figure 3.Example of five non-coding variants identified by CGT as significantly (P < .05) associated with ASD. These variants all fall within the HLA-C gene on chromosome 6. Red variants are missense, and green are synonymous (adapted from the UCSC Genome Browser).