| Literature DB >> 33452467 |
Iván Galván-Femenía1,2, Carles Barceló-Vidal1, Lauro Sumoy3, Victor Moreno4,5,6,7, Rafael de Cid8, Jan Graffelman9,10.
Abstract
The detection of family relationships in genetic databases is of interest in various scientific disciplines such as genetic epidemiology, population and conservation genetics, forensic science, and genealogical research. Nowadays, screening genetic databases for related individuals forms an important aspect of standard quality control procedures. Relatedness research is usually based on an allele sharing analysis of identity by state (IBS) or identity by descent (IBD) alleles. Existing IBS/IBD methods mainly aim to identify first-degree relationships (parent-offspring or full siblings) and second degree (half-siblings, avuncular, or grandparent-grandchild) pairs. Little attention has been paid to the detection of in-between first and second-degree relationships such as three-quarter siblings (3/4S) who share fewer alleles than first-degree relationships but more alleles than second-degree relationships. With the progressively increasing sample sizes used in genetic research, it becomes more likely that such relationships are present in the database under study. In this paper, we extend existing likelihood ratio (LR) methodology to accurately infer the existence of 3/4S, distinguishing them from full siblings and second-degree relatives. We use bootstrap confidence intervals to express uncertainty in the LRs. Our proposal accounts for linkage disequilibrium (LD) by using marker pruning, and we validate our methodology with a pedigree-based simulation study accounting for both LD and recombination. An empirical genome-wide array data set from the GCAT Genomes for Life cohort project is used to illustrate the method.Entities:
Mesh:
Year: 2021 PMID: 33452467 PMCID: PMC8027836 DOI: 10.1038/s41437-020-00392-8
Source DB: PubMed Journal: Heredity (Edinb) ISSN: 0018-067X Impact factor: 3.821
Degree of relationship (R), kinship coefficient (ϕ), and probability of sharing zero, one or two alleles identical by descent (k0, k1, k2).
| Probability of IBD sharing | |||||
|---|---|---|---|---|---|
| Type of relative | R | ||||
| Monozygotic twins (MZ) | 0 | 1/2 | 0 | 0 | 1 |
| Parent–offspring (PO) | 1 | 1/4 | 0 | 1 | 0 |
| Full siblings (FS) | 1 | 1/4 | 1/4 | 1/2 | 1/4 |
| Three-quarter siblings (3/4S) | – | 3/16 | 3/8 | 1/2 | 1/8 |
| Half-siblings/grandchild–grandparent/niece/nephew–uncle/aunt (2nd) | 2 | 1/8 | 1/2 | 1/2 | 0 |
| First cousins (FC) | 3 | 1/16 | 3/4 | 1/4 | 0 |
| Unrelated (UN) | 0 | 1 | 0 | 0 | |
Possible pairs of biallelic genotypes and the probability of each pair given the number of alleles identical by descent (m).
| 2 | 0 | ||
| 0 | 0 | ||
| 4 | 2 |
We assume that the order of the genotypes is irrelevant, i.e., the probabilities for G1/G2 and G2/G1 are the same.
Likelihood ratio (LR) for relatedness research for biallelic SNPs.
| LR | ||||
|---|---|---|---|---|
| PO ~ UN | ||||
| FS ~ UN | ||||
| 3/4S ~ UN | ||||
| 2nd ~ UN | ||||
| FC ~ UN |
The considered LR are PO, FS, 3/4S, 2nd, or FC relationships in the numerator and the UN relationship in the denominator. The LR values depend on the observed genotypes of a pair of individuals and the allele frequencies p and q of the population under study. The e parameter is used to account for genotype errors and de novo mutations if the genotype combination does not occur (Heinrich et al., 2016). We assume that the order of the genotypes is irrelevant, i.e., the LR for G1/G2 and G2/G1 is the same.
Fig. 1-plot of ~18 million pairs of simulated individuals using 27,087 SNPs.
UN: unrelated; 2nd: second-degree relationships; 3/4S: three-quarter siblings. FS: full siblings; PO: parent–offspring. Brown open dots represent theoretical IBD probabilities; brown + signs the average of the corresponding group.
Fig. 2Boxplot of kinship estimates of ~18 million pairs of simulated individuals using 27,087 SNPs.
Fig. 3Log10 likelihood ratio approach of the simulated 2nd, 3/4S, and FS pairs (500 for each relationship) using 27,087 SNPs.
Note the larger than sign shaped (“ > ”) pattern (gray dashed lines) for most 3/4S pairs.
Fig. 4-plot of the GCAT cohort for 5075 individuals and 26,006 SNPs (MAF > 0.40, LD-pruned, HWE exact mid p value > 0.05, and missing call rate 0).
3rd, 4th, 5th, or UN: third, fourth, fifth-degree relationships or unrelated; 2nd: second-degree relationships; 3/4S: three-quarter siblings; FS: full siblings; PO: parent–offspring.
Fig. 5Boxplot of kinship estimates of the GCAT cohort for 5,075 individuals and 26,006 SNPs (MAF > 0.40, LD-pruned, HWE exact mid p value > 0.05, and missing call rate 0).
Fig. 6Log10 likelihood ratio approach of the presumably 2nd, 3/4S, and FS pairs from the GCAT cohort using 26,006 SNPs (MAF > 0.40, LD-pruned, HWE exact mid p value > 0.05, and missing call rate 0).
Likelihood ratio inference (LR approach) for the presumably 2nd, 3/4S, and FS pairs from the GCAT cohort.
| Pair | IID | Sex | IID | Sex | LR-kinbiplot | FS~UN | 3/4S~UN | 2nd~UN | LR approach | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | REL_00178 | F | REL_01132 | F | 0.61 | 0.36 | 0.04 | 0.107 | 2nd | −0.0165 | 0.0027 | 2nd | |
| 2 | REL_02227 | F | REL_00865 | M | 0.57 | 0.43 | 0.00 | 0.109 | 2nd | −0.0164 | 0.0035 | 2nd | |
| 3 | REL_04137 | F | REL_03163 | M | 0.51 | 0.49 | 0.00 | 0.122 | 2nd | −0.0103 | 0.0082 | 2nd | |
| 4 | REL_04126 | F | REL_02089 | F | 0.50 | 0.50 | 0.00 | 0.126 | 2nd | −0.0106 | 0.0080 | 2nd | |
| 5 | REL_04141 | F | REL_02030 | M | 0.49 | 0.50 | 0.01 | 0.129 | 2nd | −0.0072 | 0.0101 | 2nd | |
| 6 | REL_02092 | M | REL_00587 | F | 0.48 | 0.52 | 0.00 | 0.129 | 2nd | −0.0073 | 0.0104 | 2nd | |
| 7 | REL_02212 | M | REL_04828 | F | 0.47 | 0.53 | 0.00 | 0.132 | 2nd | −0.0061 | 0.0111 | 2nd | |
| 8 | REL_00603 | F | REL_00189 | F | 0.47 | 0.53 | 0.00 | 0.134 | 2nd | −0.0076 | 0.0101 | 2nd | |
| 9 | REL_03666 | M | REL_02902 | M | 0.47 | 0.53 | 0.00 | 0.134 | 2nd | −0.0057 | 0.0112 | 2nd | |
| 10 | REL_00132 | F | REL_00707 | M | 0.45 | 0.55 | 0.00 | 0.137 | 2nd | −0.0059 | 0.0113 | 2nd | |
| 11 | REL_02058 | F | REL_03610 | F | 0.45 | 0.55 | 0.00 | 0.139 | 2nd | −0.0041 | 0.0125 | 2nd | |
| 12 | REL_01692 | F | REL_00010 | F | 0.44 | 0.56 | 0.00 | 0.139 | 2nd | −0.0041 | 0.0127 | 2nd | |
| 13 | REL_03969 | M | REL_00271 | M | 0.34 | 0.56 | 0.10 | 0.189 | 3/4S | 0.0260 | 0.0279 | 3/4S | |
| 14 | REL_03803 | F | REL_02343 | M | 0.35 | 0.51 | 0.14 | 0.198 | 3/4S | 0.0317 | 0.0287 | 3/4S | |
| 15 | REL_03924 | M | REL_03023 | F | 0.37 | 0.46 | 0.17 | 0.201 | 3/4S | 0.0365 | 0.0301 | 3/4S | |
| 16 | REL_00083 | M | REL_02333 | M | 0.33 | 0.52 | 0.15 | 0.207 | 3/4S | 0.0377 | 0.0313 | 3/4S | |
| 17 | REL_01344 | M | REL_02408 | F | 0.36 | 0.44 | 0.20 | 0.210 | 3/4S | 0.0402 | 0.0304 | 3/4S | |
| 18 | REL_04189 | M | REL_00775 | M | 0.36 | 0.44 | 0.20 | 0.210 | 3/4S | 0.0422 | 0.0314 | 3/4S | |
| 19 | REL_03150 | F | REL_01804 | F | 0.32 | 0.51 | 0.17 | 0.212 | 3/4S | 0.0411 | 0.0322 | 3/4S | |
| 20 | REL_02752 | F | REL_04859 | F | 0.34 | 0.46 | 0.20 | 0.215 | 3/4S | 0.0441 | 0.0325 | 3/4S | |
| 21 | REL_01502 | M | REL_03665 | M | 0.31 | 0.48 | 0.21 | 0.225 | FS | 0.0469 | 0.0339 | FS | |
| 22 | REL_04592 | F | REL_04600 | F | 0.30 | 0.48 | 0.21 | 0.226 | FS | 0.0493 | 0.0358 | FS | |
| 23 | REL_04693 | F | REL_00797 | F | 0.31 | 0.47 | 0.22 | 0.228 | FS | 0.0498 | 0.0357 | FS | |
| 24 | REL_03607 | M | REL_00319 | F | 0.30 | 0.49 | 0.21 | 0.228 | FS | 0.0484 | 0.0350 | FS | |
| 25 | REL_03220 | F | REL_04615 | F | 0.31 | 0.46 | 0.23 | 0.230 | FS | 0.0505 | 0.0360 | FS | |
| 26 | REL_03212 | M | REL_02516 | F | 0.28 | 0.53 | 0.20 | 0.231 | FS | 0.0526 | 0.0386 | FS | |
| 27 | REL_03310 | M | REL_03659 | F | 0.26 | 0.56 | 0.18 | 0.231 | FS | 0.0484 | 0.0358 | FS | |
| 28 | REL_04427 | F | REL_02635 | F | 0.26 | 0.54 | 0.19 | 0.232 | FS | 0.0487 | 0.0358 | FS | |
| 29 | REL_00122 | M | REL_01902 | F | 0.29 | 0.49 | 0.22 | 0.233 | FS | 0.0513 | 0.0368 | FS | |
| 30 | REL_00284 | M | REL_02444 | F | 0.28 | 0.51 | 0.21 | 0.233 | FS | 0.0494 | 0.0356 | FS | |
| 31 | REL_03838 | F | REL_02496 | F | 0.31 | 0.45 | 0.24 | 0.234 | FS | 0.0523 | 0.0367 | FS | |
| 32 | REL_01564 | F | REL_03827 | F | 0.32 | 0.43 | 0.26 | 0.236 | FS | 0.0528 | 0.0365 | FS | |
| 33 | REL_04529 | F | REL_04492 | F | 0.28 | 0.50 | 0.22 | 0.236 | FS | 0.0522 | 0.0373 | FS | |
| 34 | REL_04494 | M | REL_00931 | M | 0.28 | 0.49 | 0.23 | 0.237 | FS | 0.0525 | 0.0373 | FS | |
| 35 | REL_04466 | F | REL_02680 | F | 0.31 | 0.43 | 0.26 | 0.237 | FS | 0.0531 | 0.0367 | FS | |
| 36 | REL_04405 | M | REL_03949 | M | 0.26 | 0.52 | 0.22 | 0.238 | FS | 0.0525 | 0.0376 | FS | |
| 37 | REL_03880 | M | REL_04789 | F | 0.27 | 0.50 | 0.23 | 0.239 | FS | 0.0529 | 0.0376 | FS | |
| 38 | REL_00383 | F | REL_03293 | M | 0.25 | 0.53 | 0.22 | 0.241 | FS | 0.0538 | 0.0385 | FS | |
| 39 | REL_01888 | M | REL_04360 | M | 0.25 | 0.54 | 0.21 | 0.241 | FS | 0.0532 | 0.0383 | FS | |
| 40 | REL_00792 | F | REL_00954 | M | 0.26 | 0.51 | 0.23 | 0.242 | FS | 0.0543 | 0.0385 | FS | |
| 41 | REL_00872 | F | REL_01784 | F | 0.25 | 0.53 | 0.22 | 0.242 | FS | 0.0556 | 0.0398 | FS | |
| 42 | REL_01450 | M | REL_01960 | M | 0.26 | 0.51 | 0.23 | 0.242 | FS | 0.0544 | 0.0386 | FS | |
| 43 | REL_04616 | F | REL_02777 | F | 0.28 | 0.47 | 0.25 | 0.243 | FS | 0.0553 | 0.0386 | FS | |
| 44 | REL_02899 | M | REL_01707 | F | 0.28 | 0.45 | 0.26 | 0.244 | FS | 0.0562 | 0.0389 | FS | |
| 45 | REL_02905 | F | REL_02575 | F | 0.25 | 0.52 | 0.23 | 0.245 | FS | 0.0557 | 0.0394 | FS | |
| 46 | REL_00769 | M | REL_04746 | F | 0.23 | 0.57 | 0.21 | 0.246 | FS | 0.0564 | 0.0406 | FS | |
| 47 | REL_00009 | F | REL_02335 | F | 0.23 | 0.55 | 0.22 | 0.246 | FS | 0.0558 | 0.0399 | FS | |
| 48 | REL_04475 | F | REL_04218 | M | 0.25 | 0.51 | 0.24 | 0.247 | FS | 0.0564 | 0.0397 | FS | |
| 49 | REL_01150 | F | REL_04384 | F | 0.26 | 0.49 | 0.25 | 0.249 | FS | 0.0580 | 0.0403 | FS | |
| 50 | REL_03944 | M | REL_03475 | F | 0.23 | 0.54 | 0.23 | 0.249 | FS | 0.0568 | 0.0403 | FS | |
| 51 | REL_03904 | F | REL_04994 | F | 0.25 | 0.50 | 0.25 | 0.249 | FS | 0.0573 | 0.0400 | FS | |
| 52 | REL_01654 | M | REL_03485 | M | 0.28 | 0.43 | 0.29 | 0.251 | FS | 0.0588 | 0.0398 | FS | |
| 53 | REL_00504 | M | REL_04718 | F | 0.24 | 0.50 | 0.25 | 0.252 | FS | 0.0582 | 0.0404 | FS | |
| 54 | REL_00339 | F | REL_02473 | F | 0.25 | 0.48 | 0.27 | 0.253 | FS | 0.0584 | 0.0400 | FS | |
| 55 | REL_01016 | M | REL_00887 | M | 0.24 | 0.50 | 0.26 | 0.254 | FS | 0.0594 | 0.0411 | FS | |
| 56 | REL_03977 | M | REL_01080 | M | 0.22 | 0.54 | 0.24 | 0.255 | FS | 0.0583 | 0.0408 | FS | |
| 57 | REL_02339 | M | REL_02391 | M | 0.27 | 0.44 | 0.29 | 0.256 | FS | 0.0608 | 0.0411 | FS | |
| 58 | REL_01524 | F | REL_03272 | F | 0.23 | 0.51 | 0.26 | 0.256 | FS | 0.0604 | 0.0419 | FS | |
| 59 | REL_01285 | M | REL_03761 | F | 0.24 | 0.50 | 0.27 | 0.257 | FS | 0.0597 | 0.0410 | FS | |
| 60 | REL_03395 | F | REL_02694 | F | 0.22 | 0.52 | 0.25 | 0.257 | FS | 0.0609 | 0.0423 | FS | |
| 61 | REL_03151 | M | REL_02204 | F | 0.23 | 0.50 | 0.26 | 0.257 | FS | 0.0610 | 0.0421 | FS | |
| 62 | REL_00968 | M | REL_01577 | F | 0.26 | 0.45 | 0.29 | 0.259 | FS | 0.0654 | 0.0445 | FS | |
| 63 | REL_04439 | F | REL_01640 | F | 0.26 | 0.43 | 0.31 | 0.260 | FS | 0.0630 | 0.0421 | FS | |
| 64 | REL_01546 | M | REL_03566 | F | 0.21 | 0.53 | 0.26 | 0.263 | FS | 0.0621 | 0.0428 | FS | |
| 65 | REL_03442 | F | REL_04510 | F | 0.22 | 0.51 | 0.27 | 0.264 | FS | 0.0630 | 0.0431 | FS | |
| 66 | REL_00340 | F | REL_04294 | F | 0.21 | 0.53 | 0.26 | 0.264 | FS | 0.0628 | 0.0432 | FS | |
| 67 | REL_03001 | F | REL_04111 | F | 0.23 | 0.48 | 0.29 | 0.265 | FS | 0.0636 | 0.0430 | FS | |
| 68 | REL_00282 | F | REL_04918 | F | 0.25 | 0.44 | 0.31 | 0.267 | FS | 0.0648 | 0.0430 | FS | |
| 69 | REL_01083 | F | REL_01704 | F | 0.18 | 0.57 | 0.25 | 0.267 | FS | 0.0634 | 0.0439 | FS | |
| 70 | REL_03388 | F | REL_02608 | F | 0.22 | 0.50 | 0.29 | 0.268 | FS | 0.0645 | 0.0436 | FS | |
| 71 | REL_01924 | F | REL_00727 | M | 0.24 | 0.45 | 0.32 | 0.270 | FS | 0.0663 | 0.0440 | FS | |
| 72 | REL_02208 | F | REL_03486 | F | 0.23 | 0.46 | 0.31 | 0.270 | FS | 0.0665 | 0.0444 | FS | |
| 73 | REL_02718 | M | REL_02913 | M | 0.22 | 0.48 | 0.30 | 0.271 | FS | 0.0662 | 0.0443 | FS | |
| 74 | REL_00634 | M | REL_03507 | M | 0.20 | 0.51 | 0.29 | 0.272 | FS | 0.0656 | 0.0443 | FS | |
| 75 | REL_04741 | F | REL_02513 | F | 0.19 | 0.52 | 0.30 | 0.277 | FS | 0.0676 | 0.0455 | FS | |
| 76 | REL_00601 | M | REL_02989 | F | 0.19 | 0.51 | 0.30 | 0.278 | FS | 0.0689 | 0.0462 | FS | |
| 77 | REL_01624 | F | REL_00750 | F | 0.19 | 0.51 | 0.30 | 0.278 | FS | 0.0680 | 0.0456 | FS | |
| 78 | REL_00824 | F | REL_00213 | F | 0.22 | 0.45 | 0.33 | 0.278 | FS | 0.0693 | 0.0456 | FS | |
| 79 | REL_01264 | M | REL_04751 | F | 0.18 | 0.52 | 0.30 | 0.279 | FS | 0.0684 | 0.0459 | FS | |
| 80 | REL_02208 | F | REL_01630 | F | 0.18 | 0.52 | 0.31 | 0.283 | FS | 0.0706 | 0.0473 | FS | |
| 81 | REL_04704 | F | REL_00804 | M | 0.17 | 0.52 | 0.31 | 0.285 | FS | 0.0707 | 0.0472 | FS | |
| 82 | REL_03627 | F | REL_03315 | F | 0.15 | 0.55 | 0.30 | 0.288 | FS | 0.0714 | 0.0478 | FS | |
| 83 | REL_03486 | F | REL_01630 | F | 0.17 | 0.50 | 0.33 | 0.289 | FS | 0.0738 | 0.0488 | FS |
FS~UN, 3/4S~UN and 2nd~UN are the LR values for each pair. LR-kinbiplot is the inferred relationship from Graffelman et al. (2019). : estimated kinship coefficient. , , and : estimated IBD probabilities.
Maximum values of the likelihood ratios of each pair are marked in bold.
Fig. 7Bootstrap distribution of the LR for eight presumably 3/4S pairs of the GCAT project.
Vertical dashed lines indicate the average LR values and the 95% bootstrap confidence interval limits.
Number of IBD alleles for all possible pairs of 3/4S where their unshared parents are FS.
| 2 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | |
| 1 | 2 | 0 | 1 | 0 | 1 | 0 | 1 | |
| 1 | 0 | 2 | 1 | 1 | 0 | 1 | 0 | |
| 0 | 1 | 1 | 2 | 0 | 1 | 0 | 1 | |
| 1 | 0 | 1 | 0 | 2 | 1 | 1 | 0 | |
| 0 | 1 | 0 | 1 | 1 | 2 | 0 | 1 | |
| 1 | 0 | 1 | 0 | 1 | 0 | 2 | 1 | |
| 0 | 1 | 0 | 1 | 0 | 1 | 1 | 2 |