| Literature DB >> 25078397 |
Damian Smedley1, Sebastian Köhler1, Johanna Christina Czeschik1, Joanna Amberger1, Carol Bocchini1, Ada Hamosh1, Julian Veldboer2, Tomasz Zemojtel2, Peter N Robinson3.
Abstract
MOTIVATION: Whole-exome sequencing (WES) has opened up previously unheard of possibilities for identifying novel disease genes in Mendelian disorders, only about half of which have been elucidated to date. However, interpretation of WES data remains challenging.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25078397 PMCID: PMC4221119 DOI: 10.1093/bioinformatics/btu508
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Performance of ExomeWalker using STRING v9.05 as the source of interactome data. The bars show the percentage of exomes where the true disease gene is identified as the top hit or in the top 10 or 50 results. Either in-house or 1000 Genomes Project exomes were used. All exomes are filtered to remove synonymous, intergenic and intronic variants except for those in splice sites. In addition, variants with a MAF > 1% are excluded. Results are shown without (All) or with an AD or AR inheritance model applied. Ranking is either by Variant scoring that combines MAF and predicted pathogenicity, RWR analysis alone or ExomeWalker scoring that additionally includes evidence of protein–protein associations with other genes linked to the disease
Fig. 2.Performance of ExomeWalker using STRING v9.05 without text-mined associations as the source of interactome data. Abbreviations are as in Figure 1
List of 19 genes discovered during the year 2012 and for which a disease-causing mutation was listed in HGMD
| Gene | ID | Disease gene family | Publication date | Variant | RandomWalk | ExomeWalker | Exomiser | eXtasy |
|---|---|---|---|---|---|---|---|---|
| 5119 | Pontocerebellar hypoplasia | November 2012 ( | 3 | 66 | 3 | 1 | ||
| 64802 | Leber congenital amaurosis | September 2012 ( | 61 | 14 | 61 | 52 | 99 | |
| 9662 | Microcephaly, primary AR | May 2012 ( | 18 | 3 | 2 | 16 | ||
| 26249 | Pseudohypoaldosteronism, type II | January 2012 ( | 4 | 131 | 5 | 2 | 1 | |
| 7067 | Hypothyroidism, congenital, non-goitrous | January 2012 ( | 51 | 2 | 1 | 148 | 5 | |
| 10329 | Muscular dystrophy-dystroglycanopathy, type A | December 2012 ( | 19 | 298 | 19 | 19 | ||
| 1650 | Congenital disorders of glycosylation, type I | February 2012 ( | 22 | 9 | 23 | 17 | ||
| 87178 | Combined oxidative phosphorylation deficiency | November 2012 ( | 9 | 4 | 2 | 119 | 3 | |
| 8818 | Congenital disorders of glycosylation, type I | October 2012 ( | 3 | 1 | 1 | 2 | 4 | |
| 55690 | Mental retardation, AD | December 2012 ( | 29 | 283 | 39 | 15 | ||
| 103 | Aicardi-Goutieres syndrome | November 2012 ( | 17 | 26 | 17 | 119 | ||
| 51475 | Deafness, AR | October 2012 ( | 59 | 109 | 61 | 40 | ||
| 667 | Hereditary sensory and autonomic neuropathy | April 2012 | 18 | 108 | 18 | 132 | ||
| 137492 | Spastic paraplegia | July 2012 ( | 11 | 318 | 11 | 9 | 10 | |
| 3229 | Ectodermal dysplasia | November 2012 ( | 19 | 5 | 17 | 131 | 48 | |
| 284058 | Mental retardation, AD | April 2012 ( | 88 | 63 | 45 | 222 | 178 | |
| 2984 | Diarrhea, congenital | April 2012 ( | 18 | 4 | 1 | 124 | 13 | |
| 5216 | Amyotrophic lateral sclerosis | August 2012 ( | 46 | 68 | 50 | 166 | 91 | |
| 57680 | Autism | December 2012 ( | 80 | 95 | 80 | 192 |
Notes. The first column shows the gene symbol and the second column shows the NCBI Entrez Gene ID. The third column provides the OMIM phenotypic series to which the gene was assigned after its identification as being causative for the disease. The next three columns show the rank obtained in ExomeWalker analysis using STRING v9.0, 1000 Genomes Project exomes and mode of inheritance filtering sorted by the variant, random walk or combined ExomeWalker score. Finally, we show the ranks obtained from the Exomiser and eXtasy tools that take an alternative approach of prioritizing by phenotype. The columns Variant, RandomWalk, ExomeWalker, Exomiser and eXtasy show the ranks of the gene by each method. The eXtasy rank was obtained after preprocessing.
aVariant lost during eXtasy prioritization.
bDeletion so not suitable for eXtasy prioritization.
cNo phenotype annotations.
Fig. 3.PPA network derived from congenital disorders of glycosylation, type I (CDG-I) seed genes. The candidate genes DDOST and DMP2 (shown in blue) interact with multiple other CDG-I genes (shown as red nodes in the network) via paths of length one and two. The random walk methodology essentially integrates over all interaction paths between seed genes and a candidate gene to generate a similarity score. Although short paths such as those shown in the figure have the most influence on the score, other aspects of the global network structure are also taken into account (Köhler )