| Literature DB >> 32425975 |
Jiayi Qu1, Stephen D Kachman2, Dorian Garrick3, Rohan L Fernando4, Hao Cheng1.
Abstract
Linkage disequilibrium (LD), often expressed in terms of the squared correlation (r 2) between allelic values at two loci, is an important concept in many branches of genetics and genomics. Genetic drift and recombination have opposite effects on LD, and thus r 2 will keep changing until the effects of these two forces are counterbalanced. Several approximations have been used to determine the expected value of r 2 at equilibrium in the presence or absence of mutation. In this paper, we propose a probability-based approach to compute the exact distribution of allele frequencies at two loci in a finite population at any generation t conditional on the distribution at generation t - 1. As r 2 is a function of this distribution of allele frequencies, this approach can be used to examine the distribution of r 2 over generations as it approaches equilibrium. The exact distribution of LD from our method is used to describe, quantify, and compare LD at different equilibria, including equilibrium in the absence or presence of mutation, selection, and filtering by minor allele frequency. We also propose a deterministic formula for expected LD in the presence of mutation at equilibrium based on the exact distribution of LD.Entities:
Keywords: effective population size; linkage disequilibrium; minor allele frequency filtering; mutation rate; selection
Year: 2020 PMID: 32425975 PMCID: PMC7212447 DOI: 10.3389/fgene.2020.00362
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Comparison of Sved's and Hill's approximation to exact distribution of (scatter points) derived from transition-matrix approach. Mean square errors are shown in the parentheses. “Calibrated” denotes the non-linear regression formula derived from the transition-matrix approach. (A) In the absence of mutation and selection. (B) In the presence of mutation but no selection.
Figure 2Expected value of LD E(r2) over generations under four different conditions at different recombination rates (c) for a population of effective size Ne = 50.
Figure 3Relationship between the expectation of LD at equilibrium () and recombination rate (c) in the absence or presence of selection or filtering by MAF at locus B for a population of effective size N = 50 with mutation rate u = 1.0 × 10−9. Only locus A is under selection with selection coefficient s = 0.1.
Figure 4Distribution of LD (r2) with recombination rate (c) of 6.25 × 10−5 at equilibrium in the presence of mutation but no selection or in the presence of mutation and selection in a population of effective size Ne = 50. Only locus A is under selection. (A) Presence of mutation but no selection (s = 0). (B) Presence of mutation and selection (s = 0.1).
Calibration (estimation of βs) of the non-linear regression model under different recombination rates (c).
| 6.25 × 10−5 | 2.26 | 1557.11 |
| 0.01 | 1.75 | 21.41 |
| 0.02 | 1.49 | 15.13 |
| 0.03 | 1.31 | 12.58 |
| 0.04 | 1.17 | 11.10 |
| 0.05 | 1.05 | 10.09 |
| 0.06 | 0.96 | 9.33 |
| 0.07 | 0.87 | 8.74 |
| 0.08 | 0.80 | 8.25 |
| 0.09 | 0.73 | 7.84 |
| 0.1 | 0.67 | 7.49 |
| 0.2 | 0.28 | 5.48 |
| 0.3 | 0.02 | 4.49 |
| 0.4 | −0.19 | 3.85 |
| 0.5 | −0.38 | 3.38 |