| Literature DB >> 27375772 |
Brigitte Glanzmann1, Hendri Herbst2, Craig J Kinnear3, Marlo Möller3, Junaid Gamieldien4, Soraya Bardien1.
Abstract
BACKGROUND: Whole exome sequencing (WES) has provided a means for researchers to gain access to a highly enriched subset of the human genome in which to search for variants that are likely to be pathogenic and possibly provide important insights into disease mechanisms. In developing countries, bioinformatics capacity and expertise is severely limited and wet bench scientists are required to take on the challenging task of understanding and implementing the barrage of bioinformatics tools that are available to them.Entities:
Keywords: Bioinformatics capacity; TAPER™; Variant identification; Whole exome sequencing
Year: 2016 PMID: 27375772 PMCID: PMC4929716 DOI: 10.1186/s13029-016-0056-8
Source DB: PubMed Journal: Source Code Biol Med ISSN: 1751-0473
Fig. 1The seven-level filtration framework that makes up the backbone of TAPER™. (Abbreviations: 1KGP – 1000 Genomes Project; ESP6500 – Exome Sequencing Project 6500; GERP – Genomic Evolutionary Rate Prediction Score; FATHMM – Functional Annotation Through Hidden Markov Models)
Stepwise breakdown of results obtained by TAPER™ using WES datasets for which the causal mutations have previously been identified
| Parkinson’s disease dataset 1 – | Intellectual disability and microcephaly dataset 1 – | Ataxia and myoclonic epilepsy dataset 1 – | Parkinson’s disease dataset 2 - | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Individual_1 | Individual_2 | Individual_3 | Individual_1 | Individual_2 | Individual_1 | Individual_1 | Individual_2 | Individual_3 | |
| Total number of variants in VCF file | 55 726 | 55 336 | 55 289 | 54 426 | 54 574 | 60 128 | 104 307 | 108 243 | 97 833 |
| STEP 1: Total number of variants assigned to exonic regions by wANNOVAR | 19 727 | 19 969 | 20 353 | 24 573 | 24 425 | 23 163 | 19 850 | 19 972 | 19 863 |
| STEP 2: All synonymous and non-frameshifts removed | 9 465 | 9 544 | 9 766 | 12 227 | 12 248 | 11 693 | 9 752 | 9 777 | 9 838 |
| STEP 3: Remove all variants with a frequency >1 % in 1KGP | 1 281 | 934 | 966 | 2 177 | 2 153 | 1 377 | 1 771 | 1 681 | 1 932 |
| STEP 4: Remove all variants with a frequency >1 % in ESP6500 | 917 | 797 | 819 | 1 928 | 1 906 | 941 | 1 335 | 1 445 | 1 575 |
| STEP 5: Remove all variants with negative GERP+++ scores | 718 | 615 | 651 | 1 243 | 1 261 | 688 | 1 014 | 1 126 | 1 232 |
| STEP 6: Remove all variants with FATHMM scores greater than 1.0 | 262 | 224 | 240 | 252 | 231 | 257 | 413 | 301 | 328 |
| STEP 7: Variants linked to relevant diseases | 252 | 221 | 236 | 241 | 221 | 239 | 398 | 262 | 292 |
| Variant of interest in shortlist? | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |