| Literature DB >> 30279426 |
Imane Boudellioua1, Maxat Kulmanov1, Paul N Schofield2, Georgios V Gkoutos3,4,5,6,7, Robert Hoehndorf8.
Abstract
An increasing number of disorders have been identified for which two or more distinct alleles in two or more genes are required to either cause the disease or to significantly modify its onset, severity or phenotype. It is difficult to discover such interactions using existing approaches. The purpose of our work is to develop and evaluate a system that can identify combinations of alleles underlying digenic and oligogenic diseases in individual whole exome or whole genome sequences. Information that links patient phenotypes to databases of gene-phenotype associations observed in clinical or non-human model organism research can provide useful information and improve variant prioritization for genetic diseases. Additional background knowledge about interactions between genes can be utilized to identify sets of variants in different genes in the same individual which may then contribute to the overall disease phenotype. We have developed OligoPVP, an algorithm that can be used to prioritize causative combinations of variants in digenic and oligogenic diseases, using whole exome or whole genome sequences together with patient phenotypes as input. We demonstrate that OligoPVP has significantly improved performance when compared to state of the art pathogenicity detection methods in the case of digenic diseases. Our results show that OligoPVP can efficiently prioritize sets of variants in digenic diseases using a phenotype-driven approach and identify etiologically important variants in whole genomes. OligoPVP naturally extends to oligogenic disease involving interactions between variants in two or more genes. It can be applied to the identification of multiple interacting candidate variants contributing to phenotype, where the action of modifier genes is suspected from pedigree analysis or failure of traditional causative variant identification.Entities:
Year: 2018 PMID: 30279426 PMCID: PMC6168481 DOI: 10.1038/s41598-018-32876-3
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison of different variant prioritization systems for recovering biallelic variants.
| All | Interacting only | |||||
|---|---|---|---|---|---|---|
| Top pair | Top 10 pairs | Combinations | Top pair | Top 10 pairs | Interacting combinations | |
| PVP | 88 (53.7%) | 107 (65.2%) | 164 | 42 (59.2%) | 51 (71.8%) | 71 |
| CADD | 34 (20.7%) | 87 (53.1%) | 164 | 10 (14.1%) | 37 (52.1%) | 71 |
| DANN | 5 (3.1%) | 59 (36.0%) | 164 | 0 | 17 (23.9%) | 71 |
| Genomiser | 0 | 0 | 164 | 0 | 0 | 71 |
| GWAVA | 0 | 0 | 164 | 0 | 0 | 71 |
| OligoPVP | 47 (28.7%) | 59 (36.0%) | 164 | 47 (66.2%) | 59 (83.1%) | 71 |
We split the evaluation in two parts, one in which we consider all variants and another in which we only consider variants for which we have background knowledge about their interactions.
Comparison of different variant prioritization systems for recovering triallelic variants.
| All | Interacting only | |||||
|---|---|---|---|---|---|---|
| Top triple | Top 10 triple | Combinations | Top triple | Top 10 triple | Interacting combinations | |
| PVP | 10 (40.0%) | 14 (56.0%) | 25 | 7 (43.8%) | 10 (40.0%) | 16 |
| CADD | 4 (16.0%) | 9 (36.0%) | 25 | 7 (43.8%) | 12 (75.0%) | 16 |
| DANN | 0 | 6 (24.0%) | 25 | 0 | 4 (25.0%) | 16 |
| Genomiser | 0 | 0 | 25 | 0 | 0 | 16 |
| GWAVA | 0 | 0 | 25 | 0 | 0 | 16 |
| OligoPVP | 10 (40.0%) | 10 (40.0%) | 25 | 10 (62.5%) | 10 (62.5%) | 16 |
We split the evaluation in two parts, one in which we consider all variants and another in which we only consider variants for which we have background knowledge about their interactions.
Analysis of top ranks of variants by PVP summarized by disease.
| Top hit | Top 3 hits | Top 10 hits | Variants (Combinations) | |
|---|---|---|---|---|
| Familial long QT syndrome | 21 (50.0%) | 38 (90.5%) | 41 (97.6%) | 42 (21) |
| Kallmann syndrome | 18 (47.4%) | 27 (71.1%) | 27 (71.1%) | 38 (19) |
| Bardet-Biedl syndrome | 14 (36.8%) | 28 (73.7%) | 32 (84.2%) | 38 (14) |
| Alport syndrome | 14 (45.2%) | 28 (90.3%) | 29 (93.6%) | 31 (15) |
| Non-syndromic genetic deafness | 12 (50.0%) | 18 (75.0%) | 18 (75.0%) | 24 (12) |
| Oculocutaneous albinism | 6 (40.0%) | 12 (80.0%) | 12 (80.0%) | 15 (7) |
| Primary ovarian insufficiency | 2 (13.3%) | 2 (13.3%) | 2 (13.3%) | 15 (7) |
| Usher syndrome | 5 (33.3%) | 11 (73.3%) | 12 (80.0%) | 15 (7) |
| Hypodontia | 6 (50.0%) | 12 (100.0%) | 12 (100.0%) | 12 (6) |
| Others | 66 (38.2%) | 118 (68.2%) | 128 (74.0%) | 173 (81) |
Performance of PVP on all combinations present in DIDA database (All DIDA), combinations added before the PVP build date date of Feb 7th, 2017 (Old DIDA), and combinations added after the cutoff date of Feb 7th, 2017 (New DIDA).
| Biallelic | Triallelic | |||||
|---|---|---|---|---|---|---|
| Top pair | Top 10 pairs | Combinations | Top triple | Top 10 triples | Interacting combinations | |
| All DIDA | 88 (53.7%) | 107 (65.2%) | 164 | 10 (40.0%) | 14 (56.0%) | 25 |
| Old DIDA | 69 (56.6%) | 84 (86.9%) | 122 | 10 (45.5%) | 14 (63.6%) | 22 |
| New DIDA | 19 (45.2%) | 23 (54.8%) | 42 | 0 | 0 | 3 |
Cases of DIDA combinations improved by OligoPVP in comparison to PVP. OligoPVP incorporates protein-protein interactions in the prioritization of variant tuples.
| DIDA ID | Gene A | Gene B | Disease name (ORPHANET) | PVP Rank A | PVP Rank B | OligoPVP Rank |
|---|---|---|---|---|---|---|
| dd225 | PSMA3 (c.696_698delAAG) | PSMB8 (c.224C > T) | CANDLE syndrome | 8 | 1 | 1 |
| dd226 | PSMA3 (c.404 + 2T > G) | PSMB8 (c.224C > T) | CANDLE syndrome | 292 | 1 | 2 |
| dd228 | PSMB4 (c.666C > A) | PSMB8 (c.313A > C) | CANDLE syndrome | 1980 | 1 | 2 |
| dd159 | EMD (c.110_112delAGA) | LMNA (c.892C > T) | Familial atrial fibrillation | 1 | 21 | 4 |
| dd043 | SCN1A (c.5054C > T) | SCN2A (c.1571G > A) | Generalized epilepsy with febrile seizures-plus | 1 | 7 | 2 |
| dd114 | CD2AP (c.1488G > A) | NPHS2 (c.622G > A) | Familial idiopathic steroid-resistant nephrotic syndrome | 1 | 141 | 4 |
| dd053 | KCNE1 (c.379C > A) | KCNQ1 (c.1022C > T) | Familial long QT syndrome | 30 | 1 | 4 |
| dd229 | CDK5RAP2 (c.4187T > C) | CEP152 (c.3014_3015delAAinsT) | Seckel syndrome | 22 | 1 | 5 |
| dd007 | PCDH15 (c.5601_5603delAAC) | CDH23 (c.193delC) | Usher syndrome | 7 | 1 | 1 |
| dd052 | HAMP (c.212G > A) | HFE (c.845G > A) | Rare hereditary hemochromatosis | 22 | 1 | 3 |
We compare the results of applying OligoPVP to the ranks obtained using PVP on individual variants.