| Literature DB >> 29361909 |
R Michael Sivley1, Jonathan H Sheehan2, Jonathan A Kropski3, Joy Cogan4, Timothy S Blackwell3, John A Phillips4, William S Bush5, Jens Meiler6, John A Capra7.
Abstract
BACKGROUND: Next-generation sequencing of individuals with genetic diseases often detects candidate rare variants in numerous genes, but determining which are causal remains challenging. We hypothesized that the spatial distribution of missense variants in protein structures contains information about function and pathogenicity that can help prioritize variants of unknown significance (VUS) and elucidate the structural mechanisms leading to disease.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29361909 PMCID: PMC5781290 DOI: 10.1186/s12859-018-2010-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Identification and classification of novel pathogenic FIP variants in RTEL1. a The locations of known pathogenic (red), putatively neutral 1000 Genomes (blue), and FIP VUS (yellow) missense variants are plotted in the context of the RTEL1 protein sequence and known domains. b The locations of pathogenic, putatively neutral, and candidate variants in the RTEL1 N-terminal structural model. c Leave-one-out cross validation of the pathogenic proximity score applied to characterized RTEL1 variants yielded an improved area under the ROC curve (AUC) relative to PolyPhen2 and SIFT, but was outperformed by evolutionary conservation scores. These results demonstrate that considering the 3D spatial distribution of known pathogenic and neutral variants can identify pathogenic hotspots and assist in the classification of VUS
Pathogenicity predictions for RTEL1 missense VUS from FIP patients
| Pos | Ref | Alt | Telomere % | Segregation | PPH2 | SIFT | ConSurf | PathProx | Model |
|---|---|---|---|---|---|---|---|---|---|
| 55 | T | S | 3% | Seg | 0.00 | 1.00 |
| −0.02 | N-terminal |
| 516 | V | L | 1% | Seg | 0.05 | 0.62 |
|
| N-terminal |
| 540 | S | A | 2% | Seg |
| 0.09 |
|
| N-terminal |
| 559 | F | I | 6% | Seg |
|
|
|
| N-terminal |
| 688 | S | C | 1% | Seg |
| 0.14 |
|
| N-terminal |
| 719 | D | G | 8% | Seg | 0.03 | 0.22 | 0.21 |
| N-terminal |
| 512 | W | C | Unknown | Unknown | 0.17 | 0.48 | 0.31 |
| N-terminal |
| 161 | H | Q | Unknown | NonSeg | 0.40 | 0.16 |
| −0.13 | N-terminal |
| 397 | Q | E | 94% | NonSeg | 0.08 | 0.20 | 0.40 | −0.09 | N-terminal |
| 528 | A | E | 58% | Unknown |
|
|
|
| N-terminal |
| 574 | R | W | 45% | NonSeg |
|
|
|
| N-terminal |
| 1107 | P | L | 6% | NonSeg | 0.63 |
| −0.13 | C-terminal | |
| 1110 | F | L | Unknown | NonSeg | 0 | 1 | −0.17 | C-terminal |
Variants are grouped by evidence for pathogenicity, which is inferred from disease co-segregation and patient telomere lengths. Variants that segregate with disease and short telomeres are treated as pathogenic (Additional file 1: Figure S1). Scores in bold indicate deleterious predictions. All thresholds were applied as recommended by each method
Fig. 2Pathogenic proximity scores in RTEL1 are correlated with decreased ATPase activity in mutagenesis studies of the homologous XPD protein. Pathogenic proximity scores were calculated for each missense mutation (N = 25) using their position relative to known pathogenic and neutral missense variants in RTEL1. a Pathogenic proximity was significantly correlated with a decrease in ATPase activity (Pearson r = −0.65, p = 0.0004), but b not significantly correlated with changes in helicase activity (Pearson r = −0.36, p = 0.08) in the homologous XPD protein
Fig. 3Structural hypotheses about the effects of six segregating RTEL1 VUS. a W512 is predicted to lie on the surface of the protein. A mutation to cysteine has the potential to interfere with functionally important protein-protein interactions. b V516 forms a small well-packed hydrophobic core, which lies under a patch of positively charged surface residues. Mutation to leucine adds steric bulk and may induce structural rearrangements that disrupt DNA binding. c S540 is a polar residue predicted to lie on a surface-exposed alpha helix in the helicase II domain. Mutation to alanine may alter surface charge or cause rotation of the alpha helix. d F559 is buried in the core of the protein, in close proximity to residues predicted to form part of the DNA-binding cavity, including A621 and E591. Mutation to isoleucine removes steric bulk and is likely to leave a void in the hydrophobic core of the protein, disrupting structure and reducing stability. e D719 is predicted to fall in a surface-exposed helix. Mutation to glycine drastically reduces both the bulk and charge of the protein’s surface, and likely disrupts the helix at that point. f T55 is predicted to form part of the interface between helices 1 and 2 in RTEL1. Mutation to a serine would reduce the steric bulk and alter the packing between the two helices