| Literature DB >> 28061747 |
Hashem A Shihab1, Mark F Rogers2, Michael Ferlaino2, Colin Campbell2, Tom R Gaunt3.
Abstract
BACKGROUND: Accurate methods capable of predicting the impact of single nucleotide variants (SNVs) are assuming ever increasing importance. There exists a plethora of in silico algorithms designed to help identify and prioritize SNVs across the human genome for further investigation. However, no tool exists to visualize the predicted tolerance of the genome to mutation, or the similarities between these methods.Entities:
Keywords: Genome browser; Genome tolerance; Mutation; Pathogenicity prediction; Prediction algorithm; SNVs; Variant effect prediction
Mesh:
Substances:
Year: 2017 PMID: 28061747 PMCID: PMC5219737 DOI: 10.1186/s12859-016-1436-4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
List of in silico prediction algorithms and conservation scores summarized through the Genome Tolerance Browser
|
| |
|---|---|
|
|
|
| SIFT | TransFIC (SIFT) |
| PolyPhen-2 (HumVar & HumDiv) | TransFIC (PolyPhen-2) |
| MutationAssessor | TransFIC (MutationAssessor) |
| FATHMM (Unweighted & Weighted) | FATHMM (Cancer) |
| FATHMM-MKL (Coding) | |
| MutationTaster2 | |
| PROVEAN | |
| VEST | |
| LRT | |
| MetaLR | |
| MetaSVM | |
|
| |
| CADD | |
| DANN | |
| FATHMM-MKL (Non-coding) | |
| fitCons | |
|
| |
| PhastCons (46-Way) | |
| PhyloP (46-way; vertebrate, primates and placental mammals) | |
| PhastCons (100-Way) | |
| PhyloP (100-way; vertebrate, primates and placental mammals) | |
| GERP++ | |
| SiPhy | |
Fig. 1Tolerance profile of HOXA5 shows regions of similarity between sequence-based prediction algorithms: SIFT and PROVEAN. However, subtle differences in tolerance can be observed when comparing these sequence-based algorithms with a structure-based algorithm, PolyPhen-2. Insight into potential regions of interest can be also obtained from genome-wide prediction algorithms such as FATHMM-MKL and CADD
Fig. 2A similar trend in intolerance can be observed across LDLR using sequence- and structure-based prediction algorithms, i.e. sequence-based methods tend to agree on intolerance given that they both rely on sequence conservation whereas structure-based algorithms utilize the additional structure-based properties made available to them to show a different tolerance profile. Unlike HOXA5, genome-wide prediction algorithms appear to agree on potential peaks of intolerance across the non-coding region of LDLR
Fig. 3Subtle differences between generic and cancer-specific prediction algorithms can be observed across TP53. For example, cancer-specific transformations of traditional germline prediction algorithms amplify intolerance across the entire region
Fig. 4Cancer-specific transformations of traditional germline prediction algorithms amplifies the intolerance of BRCA1