| Literature DB >> 25871441 |
K Joeri van der Velde1,2, Joël Kuiper2,3, Bryony A Thompson4, John-Paul Plazzer5, Gert van Valkenhoef3, Mark de Haan1,2, Jan D H Jongbloed2, Cisca Wijmenga2, Tom J de Koning2, Kristin M Abbott2, Richard Sinke2, Amanda B Spurdle4, Finlay Macrae5,6, Maurizio Genuardi7, Rolf H Sijmons2, Morris A Swertz1,2.
Abstract
Next-generation sequencing in clinical diagnostics is providing valuable genomic variant data, which can be used to support healthcare decisions. In silico tools to predict pathogenicity are crucial to assess such variants and we have evaluated a new tool, Combined Annotation Dependent Depletion (CADD), and its classification of gene variants in Lynch syndrome by using a set of 2,210 DNA mismatch repair gene variants. These had already been classified by experts from InSiGHT's Variant Interpretation Committee. Overall, we found CADD scores do predict pathogenicity (Spearman's ρ = 0.595, P < 0.001). However, we discovered 31 major discrepancies between the InSiGHT classification and the CADD scores; these were explained in favor of the expert classification using population allele frequencies, cosegregation analyses, disease association studies, or a second-tier test. Of 751 variants that could not be clinically classified by InSiGHT, CADD indicated that 47 variants were worth further study to confirm their putative pathogenicity. We demonstrate CADD is valuable in prioritizing variants in clinically relevant genes for further assessment by expert classification teams.Entities:
Keywords: Lynch syndrome; cumulative link model; pathogenicity prediction; variant classification
Mesh:
Year: 2015 PMID: 25871441 PMCID: PMC4973827 DOI: 10.1002/humu.22798
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Figure 1Probability that a CADD score will belong to a certain InSiGHT class. The inverse logit (logit−1) was applied to each of the response variables. Classes 2 and 4 are dominated by class 3 under this model.
Figure 2Flowchart describing the steps and results of the analysis.
Figure 3Beanplot [Kampstra, 2008] showing the data points (green) and density estimation (purple) of the scaled CADD C‐score per InSiGHT class. The width of the green lines is relative to the number of data points at that score. Black horizontal lines indicate the mean per InSiGHT class; the dotted line shows the overall mean. The mean scores of classes 1–5 show a respective stepwise increase of 8.41 (σ = 7.46), 11.44 (σ = 7.72), 16.87 (σ = 9.40), 21.41 (σ = 6.13), and 29.04 (σ = 10.28). The unclassified group (class 3) shows a flatter distribution than the other classes.
Number of InSiGHT Variants Reassigned to Alternative Classes According to the Cumulative Link Model Fitted on CADD Scores
| InSiGHT classification | |||||
|---|---|---|---|---|---|
| CADD model | Class 1 | Class 2 | Class 3 | Class 4 | Class 5 |
| Class 1 | 135 | 19 | |||
| Class 2 | 71 | ||||
| Class 3 | 4 | 1 | 704 | 3 | 3 |
| Class 4 | 171 | ||||
| Class 5 | 12 | 12 | 47 | 7 | 1021 |
The 24 Variants That Are Still Uncertain and Predicted by Bioinformatic Tools to Be Likely Pathogenic, According to the Probabilities of the MAPP + PolyPhen2 Calibrated Model [Thompson et al., 2013a] and the CADD Model
| Gene | Variant | AA ch. | Previous [Thompson et al., | Here |
|---|---|---|---|---|
| MLH1 | c.1037A>G | p.Q346R | 0.95 | 0.99 |
| MLH1 | c.109G>A | p.E37K | 0.87 | 0.99 |
| MLH1 | c.112A>G | p.N38D | 0.94 | 0.99 |
| MLH1 | c.125C>T | p.A42V | 0.96 | 0.99 |
| MLH1 | c.184C>A | p.Q62K | 0.88 | 0.99 |
| MLH1 | c.1918C>T | p.P640S | 0.82 | 0.99 |
| MLH1 | c.1919C>T | p.P640L | 0.93 | 0.99 |
| MLH1 | c.304G>A | p.E102K | 0.87 | 0.99 |
| MLH1 | c.307G>C | p.A103P | 0.97 | 0.99 |
| MLH1 | c.331G>C | p.A111P | 0.97 | 0.99 |
| MLH1 | c.347C>A | p.T116K | 0.93 | 0.99 |
| MLH1 | c.65G>C | p.G22A | 0.89 | 0.99 |
| MLH1 | c.67G>A | p.E23K | 0.86 | 0.99 |
| MLH1 | c.74T>C | p.I25T | 0.86 | 0.99 |
| MLH1 | c.80G>C | p.R27P | 0.97 | 0.99 |
| MLH1 | c.925C>T | p.P309S | 0.83 | 0.99 |
| MSH2 | c.1799C>T | p.A600V | 0.96 | 0.99 |
| MSH2 | c.1826C>T | p.A609V | 0.96 | 0.99 |
| MSH2 | c.2064G>A | p.M688I | 0.89 | 0.99 |
| MSH2 | c.2141C>T | p.A714V | 0.87 | 0.99 |
| MSH2 | c.2168C>T | p.S723F | 0.88 | 0.99 |
| MSH2 | c.2187G>T | p.M729I | 0.88 | 0.99 |
| MSH2 | c.529G>A | p.E177K | 0.86 | 0.99 |
| MSH6 | c.3682G>C | p.A1228P | 0.97 | 0.99 |