| Literature DB >> 31227601 |
Joanna Kaplanis1, Nadia Akawi2, Giuseppe Gallone1, Jeremy F McRae1, Elena Prigmore1, Caroline F Wright3, David R Fitzpatrick4, Helen V Firth1,5, Jeffrey C Barrett1, Matthew E Hurles1.
Abstract
Approximately 2% of de novo single-nucleotide variants (SNVs) appear as part of clustered mutations that create multinucleotide variants (MNVs). MNVs are an important source of genomic variability as they are more likely to alter an encoded protein than a SNV, which has important implications in disease as well as evolution. Previous studies of MNVs have focused on their mutational origins and have not systematically evaluated their functional impact and contribution to disease. We identified 69,940 MNVs and 91 de novo MNVs in 6688 exome-sequenced parent-offspring trios from the Deciphering Developmental Disorders Study comprising families with severe developmental disorders. We replicated the previously described MNV mutational signatures associated with DNA polymerase zeta, an error-prone translesion polymerase, and the APOBEC family of DNA deaminases. We estimate the simultaneous MNV germline mutation rate to be 1.78 × 10-10 mutations per base pair per generation. We found that most MNVs within a single codon create a missense change that could not have been created by a SNV. MNV-induced missense changes were, on average, more physicochemically divergent, were more depleted in highly constrained genes (pLI ≥ 0.9), and were under stronger purifying selection compared with SNV-induced missense changes. We found that de novo MNVs were significantly enriched in genes previously associated with developmental disorders in affected children. This shows that MNVs can be more damaging than SNVs even when both induce missense changes, and are an important variant type to consider in relation to human disease.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31227601 PMCID: PMC6633265 DOI: 10.1101/gr.239756.118
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Properties of MNVs. (A) Schematic showing how sim-MNVs, two variants that occur simultaneously, are defined as having two variants with identical allele frequencies, and con-MNVs, two variants that occur consecutively, as having different allele frequencies. (B) Proportion of pairs of heterozygous variants (possible MNVs) that phase to the same haplotype as a function of distance separated by sim and con. (C) The number of sim-MNVs and con-MNVs by distance between the two variants.
Numbers of MNVs in each category type
Numbers and proportions of consequence types for MNVs within the same codon
Figure 2.Classification of intra-codon MNV missense mutations. (A) Example of how one-step missense MNVs and two-step missense MNVs are classified using a single codon “CAC.” Venn diagram shows amino acids that can be created with either a single-base change or a two-base change in the codon “CAC.” (B–D) Across all codons, the distribution of physicochemical distances for the amino acid changes caused by different types of missense variants: (B) exclusive SNV missense; (C) one-step MNV missense; and (D) two-step MNV missense. Dashed line indicates the median of the distribution.
Figure 3.Quantifying the pathogenicity of MNVs. (A) Proportion of variants that fall in genes with pLI ≥ 0.9 over different classes of variants for both DDD and ExAC data sets. SNVs are green; MNVs, purple. Lines are 95% confidence intervals. (B) The median CADD score over different classes of variants identified from DDD data with bootstrapped 95% confidence intervals. (C) Singleton proportion for different classes of DDD variants. In yellow are SNVs stratified by binned CADD scores with their corresponding singleton proportions. Lines are 95% confidence intervals.
De novo MNVs that fall in genes associated with developmental disorders
(Chr) Chromosome; (Ref) reference allele; (Alt) alternate allele.
Figure 4.Enrichment of de novo MNVs in DDD study. Ratio of observed number of de novo MNVs versus the expected number of de novo MNVs based on the estimate of the MNV mutation rate. Compared with enrichment of SNVs in DD genes in consequence classes synonymous, missense, and stop-gain.