| Literature DB >> 35751014 |
Anthony F Herzig1, M Ciullo2,3, A-L Leutenegger4, H Perdry5.
Abstract
BACKGROUND: Estimating relatedness is an important step for many genetic study designs. A variety of methods for estimating coefficients of pairwise relatedness from genotype data have been proposed. Both the kinship coefficient [Formula: see text] and the fraternity coefficient [Formula: see text] for all pairs of individuals are of interest. However, when dealing with low-depth sequencing or imputation data, individual level genotypes cannot be confidently called. To ignore such uncertainty is known to result in biased estimates. Accordingly, methods have recently been developed to estimate kinship from uncertain genotypes.Entities:
Keywords: Fraternity coefficient; Genotype likelihoods; Kinship; Low-depth; Moment estimators; Sequencing data
Mesh:
Year: 2022 PMID: 35751014 PMCID: PMC9233360 DOI: 10.1186/s12859-022-04795-8
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.307
Fig. 1a LowKi estimates for kinship and fraternity for CilentoSim. Off-diagonal elements of the estimated kinship and fraternity matrices against the true simulated IBD sharing coefficient in CilentoSim at three different simulated mean read depths (2.5, 5, and 10). The number of variants (M) and the time (T) required for the calculation of both kinship and fraternity matrices are overlaid on the figure (for SEEKIN the time refers only to the calculation of kinship). The mean-squared error (MSE) between the estimators and their respective benchmarks are also given. b–c Corresponding estimates from SEEKIN (kinship only) and NGSRelateV2
Analysis of FranceGenRef data. is the estimate of the two siblings’ unobserved kinship coefficient
| WGS data at 30–40× | Down-sampled WGS data at ~ 2.5× | |||||||
|---|---|---|---|---|---|---|---|---|
| Gaston | NGS- | LowKi (Unadjusted)* | LowKi | BEAGLE | BEAGLE + | NGS- | ||
| Sibling Pair 1 | 0.258 | 0.268 | 0.122 | 0.278 | 0.265 | 0.268 | 0.256 | |
| 0.294 | 0.379 | 0.059 | 0.185 | 0.255 | † | 0.459 | ||
| Sibling Pair 2 | 0.216 | 0.228 | 0.100 | 0.223 | 0.225 | 0.226 | 0.216 | |
| 0.196 | 0.270 | 0.035 | 0.115 | 0.166 | † | 0.367 | ||
| M (number of variants) | 1,009,181 | 1,009,181 | 949,075 | 949,075 | 1,009,181 | |||
| Time | 4 min | 4 min | 15 h** | 15 h ** | 6 h | |||
is the estimate of the two siblings’ unobserved fraternity coefficient . (*) Unadjusted values from LowKi, similar to in Additional file 1: Fig. S2. (**) The time required for these estimators is almost entirely due to BEAGLE, both LowKi and SEEKIN require only a few minutes. (†) SEEKIN does not provide estimates of . (‡) NGSRelateV2 results on down-sampled data should be compared to results from the same software but applied to the original data that had not been down-sampled
Parameters of the Poisson distribution for reads carrying alleles A, a, and error reads, depending on the true genotypes and the error rate