| Literature DB >> 28566687 |
Urszula Lechowicz1, Tomasz Gambin2,3, Agnieszka Pollak1, Anna Podgorska1, Piotr Stawinski1, Andre Franke4, Britt-Sabina Petersen4, Malgorzata Firczuk5, Monika Oldak1, Henryk Skarzynski6, Rafal Ploski7.
Abstract
Autosomal recessive diseases (ARD) are typically caused by a limited number of mutations whose identification is challenged by their low prevalence. Our purpose was to develop a novel approach allowing an efficient search for mutations causing ARD and evaluation of their pathogenicity without a control group. We developed Iterative Sequencing and Variant Screening (ISVS) approach based on iterative cycles of gene sequencing and mutation screening, and ISVS Simulator software ( http://zsibio.ii.pw.edu.pl/shiny/isvs/ ) for assessment of detected variants' significance. As shown by simulations, ISVS efficiently identifies and correctly classifies pathogenic mutations except for cases where the gene of interest has extremely high number of low frequency nonpathogenic variants. By applying ISVS, we found 4 known and 9 novel (p.C73Y, p.S124L, p.C194Mfs*17, c.782 + 2 T > A, c.953-5 A > G, p.L325Q, p.D334Mfs*24, p.R436G, p.M448T) TMPRSS3 variants among deaf patients. For 3 known and 5 novel variants the disease association was supported by ISVS Simulator odds >90:1. Pathogenicity of 6 novel mutations has been supported by in-silico predictions of variants' deleteriousness. By directly comparing variant prevalence in patients and controls, disease association was demonstrated only for two variants and it was relatively weak (P < 0.05). Summarizing, ISVS strategy and ISVS Simulator are useful for detection of genetic variants causing AR diseases.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28566687 PMCID: PMC5451398 DOI: 10.1038/s41598-017-02315-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Screen-shot from ISVS simulator showing an example of “Fraction bi-allelic plot” obtained using default settings. Note information about given (x,y) point provided in the tooltip. This particular point indicates that if in a real ISVS experiment given variant was found in 20 patients of whom 19 also had a rare variant in the other allele (intrans:all = 19:20) there is considerable evidence (#SVM LR = 107) that the variant is pathogenic.
Figure 2Summary of findings during cascade TMPRSS mutation screening by ISVS in a cohort of HI subjects. WES – whole exome sequencing; arrows – cycles of Real Time PCR screening; filled boxes - variants found by Sanger sequencing, (number of samples with each variant is given in parentheses); italics – variants observed at a previous stage.
Prevalence of rare variants in the TMPRSS3 gene detected among Polish HI patients and controls.
| Variant | het | hom | PATIENTS | CONTROLS | P value | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| allele mut | allele sum | freq. % | het | hom | allele mut | allele sum | freq. % | ||||
| p.H70Tfs*19 | 20 | 3 | 26 | 4376 | 0.59 | 0 | 0 | 0 | 1058 | 0 | 0.012 |
| p.A138E | 17 | 2 | 21 | 4604 | 0.46 | 2 | 0 | 2 | 1918 | 0.1 | 0.029 |
| p.M448T^ | 10 | 0 | 10 | 4596 | 0.22 | 0 | 0 | 0 | 1222 | 0 | 0.13 |
| p.S124L^ | 5 | 0 | 5 | 4524 | 0.11 | 0 | 0 | 0 | 1046 | 0 | 0.59 |
| p.C194Mfs*17^ | 4 | 0 | 4 | 4594 | 0.09 | 2 | 0 | 2 | 1932 | 0.1 | 1.0 |
| c.953-5 A > G^ | 4 | 0 | 4 | 4560 | 0.09 | 0 | 0 | 0 | 1058 | 0 | 1.0 |
| p.R109W | 2 | 0 | 2 | 4506 | 0.04 | 0 | 0 | 0 | 998 | 0 | 1.0 |
| p.D334Mfs*24^ | 2 | 0 | 2 | 4618 | 0.04 | 0 | 0 | 0 | 1080 | 0 | 1.0 |
| p.C73Y^ | 1 | 0 | 1 | 4562 | 0.02 | 0 | 0 | 0 | 1046 | 0 | 1.0 |
| c.782 + 2 T > A^ | 1 | 0 | 1 | 4206 | 0.02 | 0 | 0 | 0 | 1042 | 0 | 1.0 |
| p.L325Q^ | 1 | 0 | 1 | 4566 | 0.02 | 0 | 0 | 0 | 1018 | 0 | 1.0 |
| p.A426T | 1 | 0 | 1 | 4104 | 0.02 | 0 | 0 | 0 | 1112 | 0 | 1.0 |
| p.R436G^ | 1 | 0 | 1 | 4604 | 0.02 | 0 | 0 | 0 | 986 | 0 | 1.0 |
^Mutation not reported to date; het- heterozygous, hom- homozygous, allele mut- number of mutated alleles, allele sum - number of tested alleles.
Statistical assessment of the disease-association of the detected TMPRSS3 mutations using ISVS Simulator.
| Mutation | BAM/all | #pat/#non-pat | SVM LR | Second mutations found in patients | |
|---|---|---|---|---|---|
| Observed | Cumulat. | ||||
| p.H70Tfs*19 | 20/23 | 92/0 | 929/0 | 107 | p.A138E (7), p.M448T (4), p.H70Tfs*19 (3), p.C194Mfs*17 (3), wt (3), p.S124L (1), c.953-5 A > G (1), p.D334Mfs*24 (1) |
| p.A138E | 16/19 | 349/0 | 1076/0 | 107 | p.H70Tfs*19 (7), wt (3), p.A138E (2), p.C73Y (1), p.R109W (1), p.S124L (1), p.C194Mfs*17 (1), c.953-5 A > G (1), p.D334Mfs*24 (1), p.M448T (1) |
| p.M448T^ | 10/10 | 361/0 | 2599/0 | 107 | p.H70Tfs*19 (4), c.953-5 A > G (2), p.S124L (1), p.A138E (1), c.782 + 2 T > A (1), p.A426T (1), |
| p.S124L^ | 4/5 | 1689/0 | 1689/0 | 1390 | p.H70Tfs*19 (1), p.A138E (1), p. L325Q (1), p.M448T (1), wt (1) |
| p.C194Mfs*17^ | 4/4 | 2204/1 | 2204/1 | 7126 | p.H70Tfs*19 (3), p.A138E (1) |
| c.953-5 A > G^ | 4/4 | 2204/1 | 2204/1 | 7126 | p.M448T (2), p.H70Tfs*19 (1), p.A138E (1) |
| p.R109W | 2/2 | 4709/57 = 82 | 4709/57 = 82 | 90 | p.A138E (1), p.R436G (1) |
| p.D334Mfs*24^ | 2/2 | 4709/57 = 82 | 4709/57 = 82 | 90 | p.H70Tfs*19 (1), p.A138E (1) |
| p.L325Q^ | 1/1 | 7429/655 = 11 | 7429/655 = 11 | 10 | p.S124L (1) |
| p.R436G^ | 1/1 | 7429/655 = 11 | 7429/655 = 11 | 10 | p.R109W (1) |
| p.A426T | 1/1 | 7429/655 = 11 | 7429/655 = 11 | 10 | p.M448T (1) |
| c.782 + 2 T > A^ | 1/1 | 7429/655 = 11 | 7429/655 = 11 | 10 | p.M448T (1) |
| p.C73Y^ | 1/1 | 7429/655 = 11 | 7429/655 = 11 | 10 | p.A138E (1) |
BAM/all– number with bi-allelic mutations/total number with given mutation. in brackets - number of patients with a given mutation. The settings for ISVS Simulator: number of patients = 2 200; disease prevalence 1/1000; fraction of disease cases explained by mutation in the gene: 0.02; Number of iterations: 10 000; Number of pathogenic mutations: 10; Number of non-pathogenic mutations: 20; Cumulative frequency of non-pathogenic variants: 0.05.
Splice site mutation predictions with the usage of Alamut bioinformatics algorithms. SSF- SpliceSiteFinder-like; HSF- Human Splicing Finder; – no data.
| Variant | Alamut bioinformatics algorithms | Interpretation | ||||
|---|---|---|---|---|---|---|
| SSF | MaxEntScan | NNSPLICE | GeneSplicer | HSF | ||
| c.782 + 2 T > A | 100% | 100% | 100% | 100% | 100% | Broken WT Donor Site, most probably affecting splicing |
| c.953-5A > G | — | 61% | — | 46% | <1% | Broken WT Acceptor Site, most probably affecting splicing |
Bioinformatics analysis of novel missense variants in TMPRSS3 gene.
| TMPRSS3 mutation | SIFT | PolyPhen-2 | MutationTaster2 | MetaSVM | MetaLR |
|---|---|---|---|---|---|
| p.C73Y | damaging | probably damaging | disease causing | damaging | damaging |
| p.S124L | tolerated | benign | polymorphism | tolerated | tolerated |
| p.L325Q | damaging | probably damaging | disease causing | damaging | damaging |
| p.R436G | damaging | probably damaging | disease causing | damaging | damaging |
| p.M448T | tolerated | benign | disease causing | tolerated | tolerated |