| Literature DB >> 28827725 |
Vagheesh M Narasimhan1, Raheleh Rahbari2, Aylwyn Scally3, Arthur Wuster2,4, Dan Mason5, Yali Xue2, John Wright5, Richard C Trembath6, Eamonn R Maher7,8, David A van Heel9, Adam Auton10, Matthew E Hurles2, Chris Tyler-Smith2, Richard Durbin2.
Abstract
Heterozygous mutations within homozygous sequences descended from a recent common ancestor offer a way to ascertain de novo mutations across multiple generations. Using exome sequences from 3222 British-Pakistani individuals with high parental relatedness, we estimate a mutation rate of 1.45 ± 0.05 × 10-8 per base pair per generation in autosomal coding sequence, with a corresponding non-crossover gene conversion rate of 8.75 ± 0.05 × 10-6 per base pair per generation. This is at the lower end of exome mutation rates previously estimated in parent-offspring trios, suggesting that post-zygotic mutations contribute little to the human germ-line mutation rate. We find frequent recurrence of mutations at polymorphic CpG sites, and an increase in C to T mutations in a 5' CCG 3' to 5' CTG 3' context in the Pakistani population compared to Europeans, suggesting that mutational processes have evolved rapidly between human populations.Estimates of human mutation rates differ substantially based on the approach. Here, the authors present a multi-generational estimate from the autozygous segment in a non-European population that gives insight into the contribution of post-zygotic mutations and population-specific mutational processes.Entities:
Mesh:
Year: 2017 PMID: 28827725 PMCID: PMC5566399 DOI: 10.1038/s41467-017-00323-y
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Study design: strategy to estimate the mutation rate. Bottom left: regions of the genome in an individual with first cousin parents are autozygous due to being inherited by two routes from a common founding chromosome. The X marks represent a DNM transmitted along the pedigree to the sequenced individual. Top: most sites in autozygous regions are homozygous, except for recent mutations, gene conversions and sequencing errors. Bottom right: the estimate depends on three factors: N, L and M, as described in the text
Fig. 2MAF-threshold regression to simultaneously obtain mutation rate and gene conversion rate. The mutation rate, is calculated by obtaining values of N at different thresholds of minor allele frequency. The intercept on the y axis of the regression provides an estimate of the mutation rate that is corrected for gene conversion and the slope is used to calculate the estimate of the gene conversion rate
Fig. 3Signatures of DNMs and overlap of mutations with ExAC. a The distribution of de novo mutational signatures across all 1152 singleton candidate de novo mutations and 350 that overlap with ExAC. b Differences in context-specific mutation rate. y-axis: significance of the difference in proportion of 5ʹ CCG → CTG 3ʹ DNMs in 1152 mutations from the autozygosity data set (AZ) and 850 DNMs from the 1000 Genomes Project Complete Genomics trio data set (PJL) in comparison with 6948 mutations from the meta-analysis data set (MDNM) and variants private to Europeans in the 1000 Genomes Project (EURpriv). The combined p-value shows the result of meta-analysis of the AZ/MDNM and PJL/EURpriv comparisons. A comparison between private mutations in PJL and EUR in the 1000 Genomes Project population data set (PJLpriv) and (EURpriv), respectively is also shown. Significance of the difference in 747 DNMs from the Scottish Family Health Study (SFHS) is shown as a control; the size of the disk indicates the fold difference of the test as in the legend. c Proportion of mutations that are 5ʹ CCG → CTG 3ʹ for various whole genome data sets. The AZ data were adjusted for trinucleotide composition differences between exomes and whole genomes. PJLpriv variants show this increased context-specific mutation rate compared to both East Asians (ASNpriv) and Europeans (EURpriv) indicating that the increase is specific to Pakistanis