| Literature DB >> 20878401 |
Pierre-Antoine Gourraud1, Ashley Meenagh, Anne Cambon-Thomsen, Derek Middleton.
Abstract
An extensive family-based study of linkage disequilibrium (LD) in the killer cell immunoglobulin-like receptors (KIR) cluster was performed. We aimed to describe the LD structure in the KIR gene cluster using a sample of 418 founder haplotypes identified by segregation in a group of 106 families from Northern Ireland. The LD was studied at two levels of polymorphism: the structural level (presence or absence of KIR genes) and the allelic level (between alleles of KIR genes). LD was further assessed using the predictive value of one KIR polymorphism for another one in order to provide an interpretative framework for the LD effect in association studies. In line with previous research, distinct patterns of KIR genetic diversity within the genomic region centromeric to KIR2DL4 (excluding KIR2DL4) and within the telomeric region including KIR2DL4 were found. In a comprehensive PPV/NPV-based LD analysis within the KIR cluster, robust tag markers were found that can be used to identify which genes are concomitantly present or absent and to further identify groups of associated KIR alleles. Several extended KIR haplotypes in the study population were identified (KIR2DS2*POS-KIR2DL2*001-KIR2DL5B*002-KIR2DS3*00103-KIR2DL1*00401; KIR2DL4*011-KIR3DL1/S1*005-KIR2DS4*003-KIR3DL2*003; KIR2DL4*00802-KIR3DL1/S1*004-KIR2DS4*006-KIR3DL2*005; KIR2DL4*00801-KIR3DL1/S1*00101-KIR2DS4*003-KIR3DL2*001; KIR2DL4*00103-KIR3DL1/S1*008-KIR2DS4*003-KIR3DL2*009; KIR2DL4*00102-KIR3DL1/S1*01502/*002-KIR2DS4*00101-KIR3DL2*002; KIR2DL4*00501-KIR3DL1/S1*013-KIR2DL5A*001-KIR2DS5*002-KIR2DS1*002-KIR3DL2*007). The present study provides a rationale for analyzing associations of KIR polymorphisms by taking into account the complex LD structure of the KIR region.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20878401 PMCID: PMC2978314 DOI: 10.1007/s00251-010-0478-4
Source DB: PubMed Journal: Immunogenetics ISSN: 0093-7711 Impact factor: 2.846
Fig. 1Pairwise linkage disequilibrium between the presence or absence of 13 KIR genes. Lewontin’s pairwise standardized metrics describe linkage disequilibrium (LD) between all KIR genes but KIR3DL2 and KIR3DL3 which were thought to be always present. KIR genes are considered at the structural variation level (the presence or absence level; copy number variant). Computations were double-checked using Haploview software (a SNP-oriented software) as well as the combined use of Gold software for graphical display and Stata ver. 10.0 SE for computing the LD metrics. KIR3DL1 and KIR3DS1 are considered as two different genes. The physical gene position is a consensus map, and another order may better reflect the reality of a given haplotype. Although LD computations do not use the physical positions of the genes and their alleles, LD measures are not necessarily independent of gene order. For example, this is the case when one considers the possibility of 2DS3 and 2DS5 being present on either side or both sides of 2DL4
Fig. 2Compact illustration of the KIR genes framework haplotype using centromeric and telomeric regions. Lines indicate associations of telomeric and centromeric regions. Thin lines connect haplotypes whose frequency is between 0.3% and 10%. Thick lines connect haplotype whose frequency is greater than 10%. Rectangles indicate presence of the KIR gene, and lines represent absence of the KIR gene. All haplotype structures within the sample are displayed. This figure also presents a specific D′: the Hedrick’s multiallelic D′, which represents the degree of LD between two regions, treating each haplotype within a region as an allele. The display is adapted from the Haploview software haplotype blocks display. KIR3DL2 is always present at the telomeric end. (*) KIR3DL3 is assumed to be always present but was not typed at the allelic level in the families. All founders and offspring were positive for KIR3DL3. Because KIR3DL3 and KIR3DL2 appear to be always present, it is not possible to study LD for these genes in terms of their presence or absence. The physical gene position is a consensus map, and another order may better reflect the reality of a given haplotype. Although LD computations do not use the physical positions of the genes and their alleles, LD measures are not necessarily independent of gene order. For example, this is the case when one considers the possibility of 2DS3 and 2DS5 being present on either side or both sides of 2DL4. Frequencies and membership in the A or B haplotype are displayed on the side of the haplotypes. For example, in the centromeric region, the haplotype KIR3DL3–KIR2DL3–KIR2DL1 can belong to both A and B haplotypes and its frequency in our data is 0.648 (64.8%), it combines in more than 10% of the cases with KIR2DL4–KIR3DS1–KIR2DL5A–KIR2DS1–KIR3DL2 telomeric haplotype, which is found in B haplotypes and whose frequency is 15.8%
Tagging strategy of KIR genes using LD between genes to minimize the number of KIR genes to study
| Test | KIR genes | KIR genes captured through LD |
|---|---|---|
| 1 | Absence of KIR3DL1 | KIR2DL5A; KIR3DS1; KIR2DS1; absence of KIR2DS4 |
| 2 | KIR2DL2 | KIR2DS2; absence of KIR2DL3 |
| 3 | KIR2DS5 | |
| 4 | KIR2DS3 | |
| 5 | KIR2DL4 | |
| 6 | KIR2DL1 | |
| 7 | KIR2DL2–KIRD2S3 | KIR2DL5B |
According to the haplotype display in Fig. 2, a parsimonious strategy of testing for the presence or absence of KIR genes can be drawn. The 13 genes can be tested using six genes and seven tests. In the tagging approach, KIR2DL3, KIR3DL1, and KIR2DS4 differ from the other genes because it is not the genes themselves that are taken into account but their absence. Test 7 corresponds to the simultaneous presence of KIR2DL2 and KIR2DS3. Test 7 cannot be directly inferred from the data in Table 2 and must consider KIR2DL2–KIR2DS3 or KIR2DS2–KIR2DS3 structural haplotypes to capture the KIR2DL5B gene (Fig. 2)
KIR gene alleles of the centromeric LD region tagging other KIR gene alleles
| KIR tagging allele | KIR tagged allele | PPV | NPV | Haplotype frequency | D' (*) | |
|---|---|---|---|---|---|---|
| KIR2DL3 | *002 | KIR2DL1*002 | 94.30% | 99.70% | 23.90% | 0.45 |
| *001 | KIR2DL1*00302 | 94.80% | 96.60% | 35.20% | 0.45 | |
| KIR2DL2 | *003 | KIR2DS2*POS | 100.00% | 77.60% | 12.40% | 0.79 |
| *001 | KIR2DS2*POS | 100.00% | 83.30% | 18.40% | 0.79 | |
| *001 | KIR2DL5B*002 | 71.40% | 99.40% | 13.20% | 0.95 | |
| KIR2DL5B | *002 | KIR2DS2*POS | 98.20% | 78.40% | 13.40% | 0.97 |
| *002 | KIR2DL1*00401 | 78.90% | 98.10% | 10.80% | 0.82 | |
| *002 | KIR2DS3*00103 | 84.20% | 98.60% | 11.50% | 0.84 | |
| *002 | KIR2DL2*001 | 96.50% | 93.90% | 13.20% | 0.95 | |
| KIR2DS3 | *00103 | KIR2DS2*POS | 90.60% | 76.40% | 11.50% | 0.79 |
| *00103 | KIR2DL1*00401 | 69.80% | 95.90% | 8.90% | 0.77 | |
| *00103 | KIR2DL5B*002 | 90.60% | 97.50% | 11.50% | 0.84 | |
| *00103 | KIR2DL2*001 | 86.80% | 91.50% | 11.00% | 0.79 | |
| KIR2DL1 | *00401 | KIR2DS2*POS | 100.00% | 77.60% | 12.40% | 0.75 |
| *00401 | KIR2DL5B*002 | 86.50% | 96.70% | 10.80% | 0.82 | |
| *00401 | KIR2DS3*00103 | 71.20% | 95.60% | 8.90% | 0.77 | |
| *00401 | KIR2DL2*001 | 84.60% | 91.00% | 10.50% | 0.74 | |
| *00302 | KIR2DL3*001 | 94.20% | 96.90% | 35.20% | 0.45 | |
| *002 | KIR2DL3*002 | 99.00% | 98.10% | 23.90% | 0.45 | |
Only alleles of the previously defined centromeric region are displayed. Other constraints were applied: tagging must have a minimal PPV of 65%. For robustness of the estimation, only haplotypes of more than 5% were studied; haplotypes without KIR gene at another locus were not studied. (*) PPV and NPV drastically differ from D′. PPV and NPV are measures of association at the allelic level, whereas D′ is a locus-level average measure of LD. D′ stands for the pairwise standardized Hedrick’s measure of LD between multiallelic genes. Confidence intervals can be computed because frequencies were estimated by gene counting. The total number of haplotypes is 418 (~209 founders of the families). For example (2nd row), the probability of having KIR2DL1*00302 given the presence of KIR2DL3*001 is 94.8%, the probability of not having KIR2DL1*00302 given the absence of KIR2DL3*001 is 96.6%, the KIR2DL3*001–KIR2DL1*00302 haplotype frequency is 35.2%, and D′ between KIR2DL3 and KIR2DL1 is 0.45
PPV positive predictive value, the probability of the tagged KIR allele given the presence of the tagging KIR allele; NPV negative predictive value, the probability of the absence of the tagged KIR allele given the absence of the tagging KIR allele
KIR gene alleles of the telomeric LD region tagging other KIR genes alleles
| KIR tagging allele | KIR tagged allele | PPV | NPV | Haplotype Frequency | D′ (*) | |
|---|---|---|---|---|---|---|
| KIR2DL4 | *011 | KIR3DL1/S1*005 | 100.00% | 99.70% | 13.60% | 0.21 |
| *011 | KIR2DS4*003 | 100.00% | 72.30% | 13.60% | 0.49 | |
| *011 | KIR3DL2*001 | 77.20% | 85.90% | 10.50% | 0.27 | |
| *00802 | KIR2DS4*006 | 98.60% | 99.40% | 16.70% | 0.49 | |
| *00802 | KIR3DL1/S1*004 | 95.80% | 99.70% | 16.30% | 0.21 | |
| *00801 | KIR2DS4*003 | 94.90% | 75.60% | 17.70% | 0.49 | |
| *00801 | KIR3DL1/S1*00101 | 89.70% | 99.70% | 16.70% | 0.21 | |
| *00501 | KIR3DL1/S1*013 | 93.80% | 99.70% | 18.20% | 0.21 | |
| *00501 | KIR3DL2*007 | 95.10% | 97.60% | 18.40% | 0.27 | |
| *00501 | KIR2DS1*002 | 100.00% | 97.60% | 19.40% | 0.71 | |
| *00501 | KIR2DS5*002 | 82.70% | 98.80% | 16.00% | 0.86 | |
| *00501 | KIR2DL5A*001 | 79.00% | 99.40% | 15.30% | 0.69 | |
| *00103 | KIR3DL2*009 | 81.30% | 99.20% | 6.20% | 0.27 | |
| *00103 | KIR3DL1/S1*008 | 75.00% | 100.00% | 5.70% | 0.21 | |
| *00103 | KIR2DS4*003 | 71.90% | 65.30% | 5.50% | 0.49 | |
| *00102 | KIR3DL2*002 | 100.00% | 99.10% | 17.50% | 0.27 | |
| *00102 | KIR2DS4*00101 | 98.60% | 96.80% | 17.20% | 0.49 | |
| KIR3DL1/S1 | *01502 | KIR3DL2*002 | 100.00% | 89.30% | 8.40% | 0.25 |
| *01502 | KIR2DL4*00102 | 100.00% | 90.10% | 8.40% | 0.21 | |
| *01502 | KIR2DS4*00101 | 97.10% | 87.20% | 8.10% | 0.47 | |
| *013 | KIR3DL2*007 | 94.80% | 96.50% | 17.50% | 0.25 | |
| *013 | KIR2DL4*00501 | 98.70% | 98.50% | 18.20% | 0.21 | |
| *013 | KIR2DS1*002 | 100.00% | 96.50% | 18.40% | 0.71 | |
| *013 | KIR2DS5*002 | 81.80% | 97.70% | 15.10% | 0.84 | |
| *013 | KIR2DL5A*001 | 77.90% | 98.20% | 14.40% | 0.69 | |
| *008 | KIR2DS4*003 | 95.80% | 66.00% | 5.50% | 0.47 | |
| *008 | KIR2DL4*00103 | 100.00% | 98.00% | 5.70% | 0.21 | |
| *005 | KIR2DL4*011 | 98.30% | 100.00% | 13.60% | 0.21 | |
| *005 | KIR2DS4*003 | 98.30% | 72.20% | 13.60% | 0.47 | |
| *005 | KIR3DL2*001 | 75.90% | 85.80% | 10.50% | 0.25 | |
| *004 | KIR2DL4*00802 | 98.60% | 99.10% | 16.30% | 0.21 | |
| *004 | KIR2DS4*006 | 98.60% | 98.90% | 16.30% | 0.47 | |
| *002 | KIR3DL2*002 | 100.00% | 89.50% | 8.60% | 0.25 | |
| *002 | KIR2DL4*00102 | 100.00% | 90.30% | 8.60% | 0.21 | |
| *002 | KIR2DS4*00101 | 100.00% | 87.70% | 8.60% | 0.47 | |
| *00101 | KIR2DL4*00801 | 98.60% | 97.70% | 16.70% | 0.21 | |
| *00101 | KIR2DS4*003 | 95.80% | 74.40% | 16.30% | 0.47 | |
| KIR2DL5A | *001 | KIR3DL1/S1*013 | 90.90% | 95.20% | 14.40% | 0.69 |
| *001 | KIR3DL2*007 | 95.50% | 93.80% | 15.10% | 0.8 | |
| *001 | KIR2DL4*00501 | 97.00% | 95.20% | 15.30% | 0.69 | |
| *001 | KIR2DS1*002 | 100.00% | 93.50% | 15.80% | 0.85 | |
| *001 | KIR2DS5*002 | 100.00% | 98.60% | 15.80% | 0.83 | |
| KIR2DS5 | *002 | KIR3DL1/S1*013 | 88.70% | 96.00% | 15.10% | 0.84 |
| *002 | KIR3DL2*007 | 95.80% | 95.10% | 16.30% | 0.84 | |
| *002 | KIR2DL4*00501 | 94.40% | 96.00% | 16.00% | 0.86 | |
| *002 | KIR2DS1*002 | 100.00% | 94.80% | 17.00% | 0.87 | |
| *002 | KIR2DL5A*001 | 93.00% | 100.00% | 15.80% | 0.83 | |
| KIR2DS1 | *002 | KIR3DL1/S1*013 | 86.50% | 100.00% | 18.40% | 0.71 |
| *002 | KIR3DL2*007 | 92.10% | 99.10% | 19.60% | 0.86 | |
| *002 | KIR2DL4*00501 | 91.00% | 100.00% | 19.40% | 0.71 | |
| *002 | KIR2DS5*002 | 79.80% | 100.00% | 17.00% | 0.87 | |
| *002 | KIR2DL5A*001 | 74.20% | 100.00% | 15.80% | 0.85 | |
| KIR2DS4 | *006 | KIR2DL4*00802 | 97.20% | 99.70% | 16.70% | 0.49 |
| *006 | KIR3DL1/S1*004 | 94.40% | 99.70% | 16.30% | 0.47 | |
| *00101 | KIR3DL2*002 | 89.20% | 99.40% | 17.70% | 0.62 | |
| *00101 | KIR2DL4*00102 | 86.70% | 99.70% | 17.20% | 0.49 | |
| KIR3DL2 | *011 | KIR2DL4*00801 | 95.70% | 85.80% | 5.30% | 0.27 |
| *011 | KIR2DS4*003 | 91.30% | 65.60% | 5.00% | 0.62 | |
| *009 | KIR2DL4*00103 | 89.70% | 98.50% | 6.20% | 0.27 | |
| *007 | KIR3DL1/S1*013 | 85.90% | 98.80% | 17.50% | 0.25 | |
| *007 | KIR2DL4*00501 | 90.60% | 98.80% | 18.40% | 0.27 | |
| *007 | KIR2DS1*002 | 96.50% | 97.90% | 19.60% | 0.86 | |
| *007 | KIR2DS5*002 | 80.00% | 99.10% | 16.30% | 0.84 | |
| *007 | KIR2DL5A*001 | 74.10% | 99.10% | 15.10% | 0.8 | |
| *005 | KIR2DL4*00802 | 92.10% | 90.50% | 8.40% | 0.27 | |
| *005 | KIR2DS4*006 | 94.70% | 90.50% | 8.60% | 0.62 | |
| *005 | KIR3DL1/S1*004 | 86.80% | 90.50% | 7.90% | 0.25 | |
| *003 | KIR2DL4*00802 | 93.90% | 89.60% | 7.40% | 0.27 | |
| *003 | KIR2DS4*006 | 97.00% | 89.60% | 7.70% | 0.62 | |
| *003 | KIR3DL1/S1*004 | 93.90% | 90.10% | 7.40% | 0.25 | |
| *002 | KIR2DL4*00102 | 96.10% | 100.00% | 17.50% | 0.27 | |
| *002 | KIR2DS4*00101 | 97.40% | 97.40% | 17.70% | 0.62 | |
| *001 | KIR2DS4*003 | 95.80% | 79.60% | 21.80% | 0.62 | |
Only alleles of the previously defined central telomeric region are displayed. Other constraints were applied: Tagging must have a minimal PPV of 65%. For robustness of the estimation, only haplotypes of more than 5% were studied; haplotypewithout the KIR gene at another locus were not studied. (*) PPV and NPV drastically differ from D′. PPV and NPV are measures of association at the allelic level, whereas D′ is a locus-level average measure of LD. Confidence intervals can be computed because frequencies were estimated by gene counting. The total number of haplotype is 418 (~209 founders of the families). For example, (1st row) the probability of having KIR3DL1/S1*005 given the presence of KIR2DL4*011 is 100.0%, the probability of not having KIR3DL1/S1*005 given that KIR2DL4*011 is absent in 99.7%, the KIR3DL1/S1*005–KIR2DL4*0011 haplotype frequency is 13.6%, and D′ between KIR2DL4 and KIR3DL1/S1 is 0.21
PPV positive predictive value, the probability of the tagged KIR allele given the presence of the tagging KIR allele; NPV negative predictive value, the probability of the absence of the tagged KIR allele given the absence of the tagging KIR allele
Haplotypes exhibiting the strongest KIR allele associations in the KIR gene cluster
| Centromeric haplotype |
| KIR2DS2*POS–KIR2DL2*001–KIR2DL5B*002–KIR2DS3*00103–KIR2DL1*00401 |
| Telomeric haplotypes |
| KIR2DL4*011–KIR3DL1*005–KIR2DS4*003–KIR3DL2*003 |
| KIR2DL4*00802–KIR3DL1*004–KIR2DS4*006– KIR3DL2*005 |
| KIR2DL4*00801–KIR3DL1*00101–KIR2DS4*003–KIR3DL2*001 |
| KIR2DL4*00103–KIR3DL1*008–KIR2DS4*003–KIR3DL2*009 |
| KIR2DL4*00102–KIR3DL1*01502/*002–KIR2DS4*00101–KIR3DL2*002 |
| KIR2DL4*00801–KIR2DS4*003–KIR3DL2*011 |
| KIR2DL4*00501–KIR3DS1*013–KIR2DL5A*001–KIR2DS5*002–KIR2DS1*002–KIR3DL2*007 |