| Literature DB >> 27760127 |
Priya Moorjani1, Ziyue Gao2, Molly Przeworski1,3.
Abstract
Our understanding of the chronology of human evolution relies on the "molecular clock" provided by the steady accumulation of substitutions on an evolutionary lineage. Recent analyses of human pedigrees have called this understanding into question by revealing unexpectedly low germline mutation rates, which imply that substitutions accrue more slowly than previously believed. Translating mutation rates estimated from pedigrees into substitution rates is not as straightforward as it may seem, however. We dissect the steps involved, emphasizing that dating evolutionary events requires not "a mutation rate" but a precise characterization of how mutations accumulate in development in males and females-knowledge that remains elusive.Entities:
Mesh:
Year: 2016 PMID: 27760127 PMCID: PMC5070741 DOI: 10.1371/journal.pbio.2000744
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Estimates of mutation rates from pedigree studies.
| Study | Reported mutation rate per bp per generation (x10-8) | Mean paternal age in study (in years) | Mutation rate at paternal age of 30 years (x10-8) | Paternal age effect reported as the increase in number of mutations for each year of father’s age | Callable genome in Gb (reported false negative rate [FNR] in %) | Sample size (number of trios) | Mean sequence coverage | Fraction of CpG transitions |
|---|---|---|---|---|---|---|---|---|
| Venn 2014 | 1.20 | 24.3 | 1.51 | 3.00 | 2.4 (13.4 | 6 | 34.4b | 0.239 (0.183–0.296) |
| Roach 2010 | 1.10 (0.68–1.70) | — | — | — | 1.8 (5.0) | 2 | 61.3b | 0.178 (0.037–0.320) |
| Conrad 2011 (CEU) [ | 1.17 (0.88–1.62) | — | — | — | 2.5 (5.0) | 1 | 29.3 | 0.146 (0.046–0.246) |
| Conrad 2011 (YRI) [ | 0.97 (0.67–1.34) | — | — | — | 2.5 (3.5) | 1 | 29.2 | 0.114 (0.009–0.220) |
| Campbell 2012 [ | 0.96 (0.82–1.09) | 26.3 | — | — | 2.2 (1.7) | 5 | 13.0 | 0.165 (0.110–0.220) |
| Kong 2012 [ | 1.20 | 29.7 | 1.21 | 2.01 | 2.6 (2.0) | 78 | 30.0 | 0.173 (0.163–0.184) |
| Michaelson 2012 [ | 1.00 | 33.6 | 0.93 | 1.02 | 2.8 | 10 | 30.0 | 0.128 (0.099–0.156) |
| Jiang 2013 [ | — | 34.4 | — | 1.50 | — | 32 | 30.0 | 0.162 (0.146–0.177) |
| Francioli 2015 [ | — | 29.4 | — | 1.20 | 2.1 (31.1) | 250 | 13.0 | 0.165 (0.158–0.172) |
| Besenbacher 2015 [ | 1.27 (1.16–1.38) | 28.4 | 1.3 | 2.00 | — | 10 | 50.0 | 0.201 (0.166–0.236) |
| Rahbari 2015 | 1.28 (1.13–1.43) | 29.8 | 1.29 | 2.87 (1.46–3.65) | 2.5 (—) | 12 | 24.7 | 0.210 (0.180–0.240) |
| Yuen 2015 | 1.18 | 34.1 | 1.08 | 1.19 | 2.5 (8.0 | 140 | 56.0 | 0.159 (0.151–0.167) |
| Wong 2016 | 1.05 | 33.4 | 0.95 | 0.92 | 1.6 (13.0) | 693 | 60.0 | 0.131 (0.127–0.135) |
| Goldmann 2016 | — | 33.7 | — | 0.91 | — | 816 | 60.0 | 0.179 (0.175–0.182) |
—Not available.
§ - Denotes studies that used Complete Genomics technology for sequencing, as most were based on Illumina sequencing.
† - Estimated assuming linearity and using the reported paternal age effect, accounting for the length of the callable genome. Specifically, we used: where is the estimated mutation rate per bp per generation reported in the study, is the estimated slope for the paternal age effect, P is the mean paternal age in the study, and N is the length of the callable genome in base pairs.
*—A paternal age effect is not reported in paper but estimated by Poisson regression on counts of autosomal de novo mutations.
¶ - CG dinucleotides (CpG) fraction based on autosomal mutations and binomial 95% CI shown. When possible, we relied on validated mutations. However, in some studies, only a small fraction of mutations were validated, and, hence, we used all putative de novo mutations.
#—This study includes one multigenerational pedigree.
‡ - These studies found a significant maternal age effect, which might lead to lower estimates of the paternal age effect (if parental ages are correlated).
a—This is the estimated mean age of reproduction of male chimpanzees in the wild, not the age of the actual individuals studied (18.9 for males and 15.0 for females).
b—Refers to average, the estimate varies across individuals or families in the study.
c—Includes siblings.
d—Based on visual inspection of slope reported in the study.
e—Not reported in the article, based on personal communication.
f—Based on validated de novo mutation counts and extrapolated to a surveyed genome size of 3 gigabases (Gb).
g—All estimates are based on the subset of the 140 non- lymphoblast-derived cell lines (LCL) samples.
h—Not reported in the article; based on the number of de novo mutations reported in the study and an estimated denominator (2.5 Gb, personal communication).
i—Average of the estimates based on two cohorts.
Fig 1The many steps involved in the conversion of mutation rate estimates from pedigree studies into yearly substitution rates.
Fig 2Schematic illustration of mutations occurring during embryonic development and gametogenesis.
For simplicity, we show only mutations that arose in the father and one offspring (child 1). Stars represent mutations that originate in different stages of embryogenesis and gametogenesis of the father and the offspring; solid stars are mutations that arise in the father, and hollow stars are those that occur in the offspring. Shown below each individual are the expected frequencies of the labeled mutations in his or her blood sample. Red, brown, and green stars are heritable and should be included in an estimate of germline mutation rates, whereas blue stars are somatic mutations present only in blood samples, which should be excluded. The detection of mutations that are mosaic in both soma and germline strongly suggests that, in the cell lineage tree of human development, soma and germline are not reciprocally monophyletic [46,64]. The standard pipelines require allelic balance in the child and no (or very low) read depths in the parents, leading to inclusion of some postzygotic mutations in the child and exclusion of a fraction of germline mutations in the parents. The two effects partially balance, so the overall mutation rate is unlikely to be greatly biased. However, there is a tendency to detect child-specific mutations and to miss ones shared among siblings. As a consequence, the mutation rates during early development are likely underestimated, with potentially important practical implications for predictions of recurrence risk of diseases caused by de novo mutations.
Fig 3Variation in the estimated paternal age effect for autosomes.
We plot the de novo mutation rate as a function of the paternal age at conception of the child. The rate was obtained from the reported counts of de novo mutations divided by the fraction of the genome assayed in each study (shown in the title of each subplot, along with the mean sequence coverage per individual). The solid line denotes the fitted slope (i.e., the increase in the mutation rate for each additional year of father’s age). Following the approach of Rahbari et al. 2015 [46], for their study, we used the corrected counts of de novo mutations, which are extrapolated to a genome length of 3 Gb (thereby assuming the mutation rate in the inaccessible regions of the genome is the same as that in surveyed regions). The three colors used in this plot denote the three different families that were studied: blue, family 244; green, family 603; and red, family 569.