| Literature DB >> 27472558 |
Chie Morimoto1, Sho Manabe1, Takahisa Kawaguchi2, Chihiro Kawai1, Shuntaro Fujimoto1, Yuya Hamano1,3, Ryo Yamada4, Fumihiko Matsuda2, Keiji Tamaki1.
Abstract
We developed a new approach for pairwise kinship analysis in forensic genetics based on chromosomal sharing between two individuals. Here, we defined "index of chromosome sharing" (ICS) calculated using 174,254 single nucleotide polymorphism (SNP) loci typed by SNP microarray and genetic length of the shared segments from the genotypes of two individuals. To investigate the expected ICS distributions from first- to fifth-degree relatives and unrelated pairs, we used computationally generated genotypes to consider the effect of linkage disequilibrium and recombination. The distributions were used for probabilistic evaluation of the pairwise kinship analysis, such as likelihood ratio (LR) or posterior probability, without allele frequencies and haplotype frequencies. Using our method, all actual sample pairs from volunteers showed significantly high LR values (i.e., ≥ 108); therefore, we can distinguish distant relationships (up to the fifth-degree) from unrelated pairs based on LR. Moreover, we can determine accurate degrees of kinship in up to third-degree relationships with a probability of > 80% using the criterion of posterior probability ≥ 0.90, even if the kinship of the pair is totally unpredictable. This approach greatly improves pairwise kinship analysis of distant relationships, specifically in cases involving identification of disaster victims or missing persons.Entities:
Mesh:
Year: 2016 PMID: 27472558 PMCID: PMC4966930 DOI: 10.1371/journal.pone.0160287
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Overview of the method for pairwise kinship analysis proposed in this study.
ICS is the index of chromosome sharing. C-Th and L-Th indicate the most appropriate thresholds for collateral relatives and for lineal relatives, respectively.
Fig 2The expected ICS variability for each relationship.
Boxplots indicate the variability from the computationally generated genotypes. Triangular plots indicate the values from half of the actual samples. (A) ICS variability for collateral relatives using only IBS segments longer than 4 cM and (B) ICS variability for lineal relatives using only IBS segments longer than 3 cM.
Fig 3Q-Q plots for the ICS values calculated by the genotypes of computationally generated C-5.
These plots are assumed as (A) the normal distribution, (B) the truncated-normal distribution, and (C) the log-normal distribution.
Three quantiles of LR values for the actual sample pairs obtained by comparing the true relationship with the UN.
| C-1 | 10 | 1.76 × 1052 | 6.62 × 1055 | 7.16 × 1057 |
| C-2 | 8 | 2.66 × 1039 | 2.87 × 1043 | 1.38 × 1047 |
| C-3 | 7 | 1.37 × 1024 | 1.93 × 1027 | 6.82 × 1033 |
| C-4 | 5 | 2.76 × 109 | 2.28 × 1016 | 1.05 × 1018 |
| C-5 | 3 | 1.63 × 108 | 2.74 × 1010 | 6.86 × 1018 |
| L-2 | 8 | 6.52 × 1069 | 7.23 × 1088 | 2.60 × 1094 |
| L-3 | 1 | 1.16 × 1045 | ||
a N indicates the number of pairs.
Fig 4The range of ICS values in each Hummel’s predicate for kinship determination.
ICS values for (A) collateral relatives and (B) lineal relatives. Solid lines indicate the estimated log-normal distributions of the ICS and ICS values. The colors in the bar under the distributions indicate the range of the ICS and ICS values that can be obtained from the posterior probabilities corresponding to each Hummel’s predicate for each relationship. The plots indicate the ICS and ICS values calculated by the rest of the actual sample pairs. All actual L-1 pairs were almost equal to the maximum value of ICS (i.e., 3662.522) and are indicated in red. * The distribution of UN is partially displayed because the density of the estimated distribution of UN is high.
Percentages of the posterior probabilities ≥ 0.998, 0.99, 0.95, and 0.90 for the distributions of each relationship.
| C-1 | 83.0% | 92.6% | 97.2% | 98.3% |
| C-2 | 80.6% | 90.5% | 95.7% | 97.0% |
| C-3 | 0.0% | 38.0% | 71.6% | 81.6% |
| C-4 | 0.0% | 0.0% | 11.5% | 37.4% |
| C-5 | 6.66% | 24.1% | 46.4% | 56.9% |
| UN | 89.2% | 94.8% | 97.6% | 98.4% |
| L-2 | 51.6% | 73.2% | 87.7% | 91.9% |
| L-3 | 75.4% | 83.7% | 90.4% | 92.7% |
| UN | 99.99997% | > 99.99999% | ≈ 100% | ≈ 100% |
UN, unrelated.
* The flat values of the prior probabilities were used.
The number of samples categorized by the range of posterior probabilities based on Hummel’s predicates.
| < | ||||||
| C-1 | 10 | 7 | 2 | 1 | ||
| C-2 | 8 | 8 | ||||
| C-3 | 7 | 3 | 1 | 2 | 1 | |
| C-4 | 5 | 1 | 3 | 1 | ||
| C-5 | 3 | 3 | ||||
| UN | 7 | 7 | ||||
| < | ||||||
| L-2 | 8 | 6 | 1 | 1 | ||
| L-3 | 1 | 1 | ||||
| UN | 7 | 7 | ||||
UN, unrelated.
* The flat values of the prior probabilities were used.
a N indicates the number of pairs.
Fig 5Pedigree consisted of twelve people.