| Literature DB >> 34824382 |
Qian S Zhang1, Jérôme Goudet2, Bruce S Weir3.
Abstract
The two alleles an individual carries at a locus are identical by descent (ibd) if they have descended from a single ancestral allele in a reference population, and the probability of such identity is the inbreeding coefficient of the individual. Inbreeding coefficients can be predicted from pedigrees with founders constituting the reference population, but estimation from genetic data is not possible without data from the reference population. Most inbreeding estimators that make explicit use of sample allele frequencies as estimates of allele probabilities in the reference population are confounded by average kinships with other individuals. This means that the ranking of those estimates depends on the scope of the study sample and we show the variation in rankings for common estimators applied to different subdivisions of 1000 Genomes data. Allele-sharing estimators of within-population inbreeding relative to average kinship in a study sample, however, do have invariant rankings across all studies including those individuals. They are unbiased with a large number of SNPs. We discuss how allele sharing estimates are the relevant quantities for a range of empirical applications.Entities:
Mesh:
Year: 2021 PMID: 34824382 PMCID: PMC8733021 DOI: 10.1038/s41437-021-00471-4
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.821
Measures of inbreeding and coancestry.
| Measure | Description | Evaluation |
|---|---|---|
| Inbreeding coefficient for individual | ||
| ibd probability for homologous alleles | ||
| Coancestry for individuals | ||
| for random alleles from | ||
| The following hold for PED and Gold values. | ||
| Average inbreeding coefficient. | ||
| Ψ | Average coancestry coefficient for individual | |
| Average coancestry coefficient. | ||
| Within-population inbreeding coefficient | ||
| for individual | ||
| Average within-population inbreeding coefficient. | ||
| Within-population average kinship coefficient for | ||
| individual | ||
Estimators of inbreeding.
| Estimate | Calculationa | Expected Valueb |
|---|---|---|
| Proportion of homozygous blocks. | No explicit expression. | |
| Maximization of likelihood for | No explicit expression. | |
| No explicit expression. | ||
| No explicit expression. |
aX is the reference allele dosage for SNP l in individual j.
a is the sample allele frequency for SNP l.
bFor weighted averages over large numbers of loci.
Correlations among inbreeding measuresa for simulated data.
| 1.00 | 0.94 | 0.92 | 1.00 | 0.94 | 0.84 | 0.84 | 0.80 | 0.80 | 0.71 | 0.74 | 0.44 | 0.36 | −0.25 | |
| 0.94 | 1.00 | 0.99 | 0.94 | 1.00 | 0.90 | 0.90 | 0.88 | 0.86 | 0.78 | 0.80 | 0.48 | 0.41 | −0.24 | |
| 0.92 | 0.99 | 1.00 | 0.92 | 0.99 | 0.91 | 0.91 | 0.89 | 0.87 | 0.80 | 0.82 | 0.50 | 0.45 | −0.20 | |
| 1.00 | 0.94 | 0.92 | 1.00 | 0.94 | 0.84 | 0.84 | 0.80 | 0.80 | 0.71 | 0.74 | 0.44 | 0.36 | −0.25 | |
| 0.94 | 1.00 | 0.99 | 0.94 | 1.00 | 0.90 | 0.90 | 0.88 | 0.86 | 0.78 | 0.80 | 0.48 | 0.41 | −0.24 | |
| 0.84 | 0.90 | 0.91 | 0.84 | 0.90 | 1.00 | 1.00 | 0.99 | 0.77 | 0.86 | 0.86 | 0.42 | 0.44 | −0.22 | |
| 0.84 | 0.90 | 0.91 | 0.84 | 0.90 | 1.00 | 1.00 | 0.99 | 0.77 | 0.86 | 0.86 | 0.42 | 0.44 | −0.22 | |
| 0.80 | 0.88 | 0.89 | 0.80 | 0.88 | 0.99 | 0.99 | 1.00 | 0.82 | 0.92 | 0.91 | 0.53 | 0.57 | −0.10 | |
| 0.80 | 0.86 | 0.87 | 0.80 | 0.86 | 0.77 | 0.77 | 0.82 | 1.00 | 0.89 | 0.91 | 0.86 | 0.74 | 0.18 | |
| 0.71 | 0.78 | 0.80 | 0.71 | 0.78 | 0.86 | 0.86 | 0.92 | 0.89 | 1.00 | 0.98 | 0.75 | 0.84 | 0.17 | |
| 0.74 | 0.80 | 0.82 | 0.74 | 0.80 | 0.86 | 0.86 | 0.91 | 0.91 | 0.98 | 1.00 | 0.76 | 0.80 | 0.17 | |
| 0.44 | 0.48 | 0.50 | 0.44 | 0.48 | 0.42 | 0.42 | 0.53 | 0.86 | 0.75 | 0.76 | 1.00 | 0.87 | 0.55 | |
| 0.36 | 0.41 | 0.45 | 0.36 | 0.41 | 0.44 | 0.44 | 0.57 | 0.74 | 0.84 | 0.80 | 0.87 | 1.00 | 0.53 | |
| −0.25 | −0.24 | −0.20 | −0.25 | −0.24 | −0.22 | −0.22 | −0.10 | 0.18 | 0.17 | 0.17 | 0.55 | 0.53 | 1.00 |
aAs shown in Tables 1 and 2.
Fig. 1Allele sharing estimates for 283 non-founders in simulated pedigree.
Left: Pedigree f vs Pedigree F; Center: Gold f vs Pedigree f; Right: Pedigree coancestry vs Pedigree f.
Fig. 2Values of ROH estimates of F and allele-sharing estimates of f for 283 non-founders in simulated pedigree.
Left: vs FGold; Center: vs ; Right: vs fGold.
Fig. 3Values of UNI and STD estimates for 283 non-founders in simulated pedigree.
Top left: vs ; Top right: vs ; Bottom left: vs ; Bottom right: vs .
Fig. 4Individual inbreeding coefficient estimates for 1000 Genomes data.
Left panel: ; Right panel: . Green: Population as reference; Blue: Continental group as reference; Red: World as reference. Populations, left to right: (AFR) ACB, ASW, ESN, GWD, LWK, MSL, YRI; (AMR) CLM, PEL, PUR, MXL; (EAS) CHB, CHS, CDX, JPT, KHV; (EUR) CEU, FIN, GBR, IBS, TSI; (SAS) BEB, GIH, ITU, PJL, STU.
Fig. 5Estimates of within-population individual-specific average kinships vs estimates of within-population individual-specific inbreeding coefficients for 1000 Genomes data.
Y-axis: ; X-axis: . Top: Population as reference set; Center: Continent as reference set; Bottom: World as reference set. Left: All populations; Right: Excluding AMR populations in top and center rows. Excluding AMR and AFR in bottom row. Gold: AFR (not ACB or ASW); Orange: AFR (ACB and ASW); Red: AMR; Purple: SAS; Blue: EUR; Green: EAS.
Fig. 6ROH/PLINK estimates vs SNP by SNP estimates for 1000 Genomes data, with the World as a reference set.
Left: vs ; Right: FROH vs . Solid line X = Y. Gold: AFR (not ACB or ASW); Orange: AFR (ACB and ASW); Red: AMR; Purple: SAS; Blue: EUR; Green: EAS.