| Literature DB >> 25883595 |
Yuval Itan1, Jean-Laurent Casanova2.
Abstract
Germline genetic mutations underlie various primary immunodeficiency (PID) diseases. Patients with rare PID diseases (like most non-PID patients and healthy individuals) carry, on average, 20,000 rare and common coding variants detected by high-throughput sequencing. It is thus a major challenge to select only a few candidate disease-causing variants for experimental testing. One of the tools commonly used in the pipeline for estimating a potential PID-candidate gene is to test whether the specific gene is included in the list of genes that were already experimentally validated as PID-causing in previous studies. However, this approach is limited because it cannot detect the PID-causing mutation(s) in the many PID patients carrying causal mutations of as yet unidentified PID-causing genes. In this study, we expanded in silico the list of potential PID-causing candidate genes from 229 to 3,110. We first identified the top 1% of human genes predicted by the human genes connectome to be biologically close to the 229 known PID genes. We then further narrowed down the list of genes by retaining only the most biologically relevant genes, with functionally enriched gene ontology biological categories similar to those for the known PID genes. We validated this prediction by showing that 17 of the 21 novel PID genes published since the last IUIS classification fall into this group of 3,110 genes (p < 10(-7)). The resulting new extended list of 3,110 predicted PID genes should be useful for the discovery of novel PID genes in patients.Entities:
Keywords: candidate gene identification; computational biology; disease genomics; primary immunodeficiencies; the human gene connectome
Year: 2015 PMID: 25883595 PMCID: PMC4381650 DOI: 10.3389/fimmu.2015.00142
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1The human genome and PID gene network. This figure shows all 229 known PID-causing genes (red) together with all 14,131 human protein-coding genes for which HGC-predicted biological distance information was available.
Figure 2Comparison of biological distances between PID genes and all human genes. Biological distances between PID genes (red) and all human genes (gray), according to the proportion of distances in five categories: (1) small biological distance (<10); (2) small-medium biological distance (between 10 and 20); (3) medium-large biological distance (between 20 and 30); (4) large-very large biological distance (between 30 and 40); and (5) extremely large biological distance (>40).
Figure 3Functional genomic alignment phylogeny of known PID genes and novel PID-candidate genes. A phylogenetic tree of biological distances generated by the functional genomic alignment method, showing hierarchical clustering of all known (red) and predicted (blue) PID genes.
New PID genes predicted from the list of novel PID-candidate genes.
| Predicted PID gene candidate | Known PID gene candidate | Biological distance between candidate and known | Rank of candidate in known | Route between candidate and known | |
|---|---|---|---|---|---|
| BCL10 | CARD11 | 1.001 | 1 | 0.00007 | CARD11 ↔ BCL10 |
| IRF7 | MYD88 | 1.001 | 1 | 0.00007 | MYD88 ↔ IRF7 |
| IL21 | IL21R | 1.001 | 1 | 0.00007 | IL21R ↔ IL21 |
| CTLA4 | ICOS | 1.616 | 3 | 0.00021 | ICOS ↔ CTLA4 |
| STING (TMEM173) | TBK1 | 1.001 | 3 | 0.00021 | TBK1 ↔ TMEM173 |
| NFKB1 | NFKBIA | 1.001 | 4 | 0.00028 | NFKBIA ↔ NFKB1 |
| NLRC4 | NOD2 | 1.183 | 5 | 0.00035 | NOD2 ↔ NLRC4 |
| NIK (MAP3K14) | CD40 | 1.064 | 8 | 0.00057 | CD40 ↔ MAP3K1 |
| TPP1 | TINF2 | 1.191 | 18 | 0.00127 | TINF2 ↔ TPP1 |
| JAGN1 (JAG1) | CHD7 | 2.387 | 27 | 0.00191 | CHD7 ↔ JAG1 |
| TGFBR2 | C8B | 6.441 | 27 | 0.00191 | C8B ↔ CLU ↔ TGFBR2 |
| TGFBR1 | C8B | 6.441 | 28 | 0.00198 | C8B ↔ CLU ↔ TGFBR1 |
| INO80 | ACTB | 1.582 | 35 | 0.00248 | ACTB ↔ INO80 |
| DOCK2 | RAC2 | 1.610 | 35 | 0.00248 | RAC2 ↔ DOCK2 |
| STAT4 | STAT3 | 1.104 | 35 | 0.00248 | STAT3 ↔ STAT4 |
| ADA2 (TADA2A) | TBX1 | 7.228 | 83 | 0.00587 | TBX1 ↔ C11orf30 ↔ TADA2A |
| IFIH1 | FADD | 4.141 | 87 | 0.00616 | FADD ↔ MAVS ↔ IFIH1 |
Showing the 17 (of 21) PID genes recently reported and identified in the list of PID-candidate genes described in this study.