| Literature DB >> 32787845 |
Min Woo Sun1,2, Stefano Moretti3, Kelley M Paskov1, Nate T Stockham4, Maya Varma5, Brianna S Chrisman6, Peter Y Washington6, Jae-Yoon Jung1,2, Dennis P Wall7,8,9.
Abstract
BACKGROUND: Complex human health conditions with etiological heterogeneity like Autism Spectrum Disorder (ASD) often pose a challenge for traditional genome-wide association study approaches in defining a clear genotype to phenotype model. Coalitional game theory (CGT) is an exciting method that can consider the combinatorial effect of groups of variants working in concert to produce a phenotype. CGT has been applied to associate likely-gene-disrupting variants encoded from whole genome sequence data to ASD; however, this previous approach cannot take into account for prior biological knowledge. Here we extend CGT to incorporate a priori knowledge from biological networks through a game theoretic centrality measure based on Shapley value to rank genes by their relevance-the individual gene's synergistic influence in a gene-to-gene interaction network. Game theoretic centrality extends the notion of Shapley value to the evaluation of a gene's contribution to the overall connectivity of its corresponding node in a biological network.Entities:
Keywords: Autism spectrum disorder; Biological network; Coalitional game theory; Game theoretic centrality; Shapley value
Mesh:
Year: 2020 PMID: 32787845 PMCID: PMC7430867 DOI: 10.1186/s12859-020-03693-1
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Table of selected genes
| Analysis | Genes |
|---|---|
| First Analysis | A2M, NT5C1B, PGM1, ERCC1, H6PD, CCR5, VNN1, OAS3, FAM187B, FOLH1, COL6A5, ASB15, GALNT9, CYP2C19, PPIG, RAD52, |
| IFIH1, WWTR1, DNAH11, FSIP2, PIK3C2G, GJE1, WDR63, SLC25A43, APOOL, HLA-B, HLA-G, HLA-A, OPRM1, HLA-DRB1, TLR8, EGF, | |
| PNLIPRP3, GRIA1, GUCY2F, LPL, CYP2D6, COL4A6, IL12RB1, CYP2C18, GSTT2B, PSG3, GLRA4, PSG1, GPR119, GPR142, ACYP2, PPP1R3F | |
| Second Analysis | OR2T4, CTB-23I7.1, AP002856.6, SSPO, OR6C1, BPIFB5P, RP11-573D15.1, SCRN3, RP11-404K5.2, RP11-104E19.1, AC008703.1, PEBP4, |
| CSAG1, LRRIQ1, OR4Q2, ERCC6L2, OR7E5P, ZNF473, KRTAP13-2, AC007680.2, OR52B4, AP000289.6, C11orf40, TMEM254-AS1, | |
| AC023115.1, MUC19, NOS2P1, PDE4DIP, VCX3A, RP11-780M14.1, CLECL1, GAB4, CCDC7, ST3GAL6-AS1, ZNF586, OR5H8P, PKD1L2, | |
| OR4L1, MAGEE2, AC007317.1, ATP6AP1, ATP6V1B1, OR51I2, RP11-613D13.4, GSDMB, GUCY2F, GUCA1C, PRSS48 | |
| CASh | A2ML1, AC008703.1, AC093911.1, ALOX15P2, ATP13A5, BORA, BPIFB5P, C12orf60, C3orf35, CARD8, CCDC26, CCDC7, CDH15, |
| COQ10A, CTC-525D6.1, DUSP16, ERCC6L2, FAM151A, FAM81B, FLG, GBGT1, HLA-K, LGALS8, MAGEC3, MYCT1, OR2T4, OR4Q2, | |
| OR6C1, OR8B3, RBAK-RBAKDN, RP11-104E19.1, RP11-160N1.10, RP11-404K5.2, RP11-56H2.2, RP11-618I10.2, RP11-738O11.13, | |
| SLC3A1, SSPO, TCP11, TRBV6-7, TRIM48, UBXN11, YME1L1, ZNF99, AF196972.4, AP002856.6, ATP6V1B1, C10ORF68, CDRT15P1, | |
| CTB-23I7.1, CTD-2130O13.1, CTD-2509G16.2, GEN1, KRT43P, MDP1, MPRIP, NT5C1B, OR4P4, OR5M10, OR5M11, OR8I2, PRIM2, | |
| RP11-15E18.4, RP11-283G6.4, RP11-705C15.2, SSXP3, VWA7 |
Table of genes that were selected using the three different analyses described in the section, Game theory analyses
Fig. 1Common top-ranking genes among the centrality measures. Each element of the matrix represents the number of genes shared at the top 10% (left matrix) and 20% (right matrix) ranking between two centrality measures in comparison. The complete list of genes ranked by the various centrality measures can be found in “Additional File 1” under “Supplementary information”
Fig. 2Graph of protein-protein interactions between game theoretic centrality genes and high confidence ASD genes. Node color: first analysis (purple), second analysis (green), SFARI (blue), 69 genes from Ruzzo et al. 2019 (yellow), Root 66 (red)
Fig. 3Example 4x3 binary matrix. 4×3 binary matrix representing 4 genes and 3 samples
Fig. 4Example network. A graph of 14 vertices (genes) N={g1,g2,...,g14} and 11 edges (respective biological interactions)
Fig. 5Example 14x5 binary matrix. 14×5 binary matrix representing 14 genes and 5 samples
Fig. 6Example gene ranking. Table A. (left) shows the gene ranking based on microarray game. Table B. (right) shows the gene ranking based on game theoretic centrality with microarray Shapley values as weights. The genes are sorted by highest (top) to lowest (bottom) score
Fig. 7Game theoretic centrality flow diagram. Flow diagram, beginning from the whole genome sequence data to ranking genes using game theoretic centrality