| Literature DB >> 30587507 |
Jenny Lord1, Giuseppe Gallone1, Patrick J Short1, Jeremy F McRae1, Holly Ironfield1, Elizabeth H Wynn1, Sebastian S Gerety1, Liu He1, Bronwyn Kerr2,3, Diana S Johnson4, Emma McCann5, Esther Kinning6, Frances Flinter7, I Karen Temple8,9, Jill Clayton-Smith2,3, Meriel McEntagart10, Sally Ann Lynch11, Shelagh Joss12, Sofia Douzgou2,3, Tabib Dabir13, Virginia Clowes14, Vivienne P M McConnell13, Wayne Lam15, Caroline F Wright16, David R FitzPatrick1,15, Helen V Firth1,17, Jeffrey C Barrett1, Matthew E Hurles1.
Abstract
Mutations that perturb normal pre-mRNA splicing are significant contributors to human disease. We used exome sequencing data from 7833 probands with developmental disorders (DDs) and their unaffected parents, as well as more than 60,000 aggregated exomes from the Exome Aggregation Consortium, to investigate selection around the splice sites and quantify the contribution of splicing mutations to DDs. Patterns of purifying selection, a deficit of variants in highly constrained genes in healthy subjects, and excess de novo mutations in patients highlighted particular positions within and around the consensus splice site of greater functional relevance. By using mutational burden analyses in this large cohort of proband-parent trios, we could estimate in an unbiased manner the relative contributions of mutations at canonical dinucleotides (73%) and flanking noncanonical positions (27%), and calculate the positive predictive value of pathogenicity for different classes of mutations. We identified 18 patients with likely diagnostic de novo mutations in dominant DD-associated genes at noncanonical positions in splice sites. We estimate 35%-40% of pathogenic variants in noncanonical splice site positions are missing from public databases.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30587507 PMCID: PMC6360807 DOI: 10.1101/gr.238444.118
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Signals of purifying selection around splice sites. (A) Selective constraint across splicing region in 13,750 unaffected parents of DDD probands and more than 60,000 aggregated exomes from ExAC. Mutability-adjusted proportion of singletons (MAPS) with 95% confidence intervals (CIs) shown for Ensembl's Variant Effect Predictor (VEP) annotated exonic sites, extended splice acceptor and splice donor regions, the last base of the exon, split by reference nucleotide, and grouped sites in the polypyrimidine tract (PolyPy) region, split by changes from a pyrimidine to a purine (PyPu) versus all other changes. (B) Proportion of variants with 95% CI in 13,750 unaffected parents of DDD probands that fall within genes with high probability of loss-of-function intolerance (pLI > 0.9) across VEP annotated exonic sites, extended splice acceptor and splice donor regions, the last base of the exon, split by reference nucleotide, and grouped sites in the PolyPy region, split by changes from a pyrimidine to a purine (PyPu) versus all other changes. Lower panel shows splice acceptor and splice donor consensus sequences, based on our exons of interest.
Figure 2.De novo mutations (DNMs) around splice sites. Enrichment of DNMs across the splicing region in 7833 DDD probands. (A) Numbers of observed and expected DNMs across the splicing region in known dominant and recessive DD genes, as well as in non-DD–associated genes, with FDR-corrected Poisson P-values. Splice acceptor and splice donor consensus sequences are shown below, as in Figure 1. (B) Aggregation of observed and expected numbers of DNMs in the PolyPy region, with changes from a pyrimidine to a purine (PyPu) and all other changes shown separately for known dominant and recessive DD genes, as well as non-DD–associated genes. (C) Positive predictive values (PPVs) for DNMs in dominant DD-associated genes in positions across the splicing region, as well as VEP annotated stop gained and missense changes, calculated from observed and expected numbers of DNMs. (D) Enrichment (observed/expected) of DNMs by gene probability of pLI split into sextiles for donor+5, pyrimidine to purine PolyPy, and synonymous sites. pLI scores encompassed by each sextile: 1 = 5.36 × 10−91–0.000000605, 2 = 0.000000609–0.000558185, 3 = 0.000559475–0.027905143, 4 = 0.027908298–0.377456159, 5 = 0.377491926–0.919495985, 6 = 0.91955878–1.
Diagnostic de novo mutations in noncanonical dinucleotide near-splice positions
Figure 3.Clinical classifications of noncanonical near-splice DNMs. Relationship between clinical classifications of 38 splice region DNMs in undiagnosed DDD probands and PPVs calculated using observed and expected numbers of DNMs in 7833 probands.
Figure 4.Selective constraint and pathogenicity scores. MAPS, with 95% CI, calculated for pathogenicity score brackets (least to most severe) in 13,750 unaffected parents from the DDD project, with Spearman's rank correlation coefficient.
Figure 5.Pathogenicity scores for observed near-splice site DNMs. Cumulative percentage of DNMs in known dominant DD genes with decreasing pathogenicity score bracket, shown with canonical splice site positions included (left) and excluded (right). (AUC*) area under curve.