| Literature DB >> 15588486 |
Vanessa J Clark1, Michael Dean.
Abstract
To dissect the haplotype structure of candidate genes for disease association studies, it is important to understand the nature of genetic variation at these loci in different populations. We present a survey of haplotype structure and linkage disequilibrium of chemokine and chemokine receptor genes in 11 geographically-distinct population samples (n=728). Chemokine proteins are involved in intercellular signalling and the immune response. These molecules are important modulators of human immunodeficiency virus (HIV)-1 infection and the progression of the acquired immune deficiency syndrome, tumour development and the metastatic process of cancer. To study the extent of genetic variation in this gene family, single nucleotide polymorphisms (SNPs) from 13 chemokine and chemokine receptor genes were genotyped using the 5' nuclease assay (TaqMan). SNP haplotypes, estimated from unphased genotypes using the Expectation-Maximization-algorithm, are described in a cluster of four CC-chemokine receptor genes (CCR3, CCR2, CCR5 and CCRL2) on chromosome 3p21, and a cluster of three CC-chemokine genes [MPIF-1 (CCL23), PARC (CCL18) and MIP-1alpha (CCL3)] on chromosome 17q11-12. The 32 base pair (bp) deletion in exon 4 of CCR5 was also included in the haplotype analysis of 3p21. A total of 87.5 per cent of the variation of 14 biallelic loci scattered over 150 kilobases of 3p21 is explained by 11 haplotypes which have a frequency of at least 1 per cent in the total sample. An analysis of haplotype blocks in this region indicates recombination between CCR2 and CCR5, although long-range pairwise linkage disequilibrium across the region appears to remain intact on two common haplotypes. A reduced-median network demonstrates a clear relationship between 3p21 haplotypes, rooted by the putative ancestral haplotype determined by direct sequencing of four primate species. Analysis of six SNPs on 17q11-12 indicates that 97.5 per cent of the variation is explained by 15 haplotypes, representing at least 1 per cent of the total sample. Additionally, a possible signature of selection at a non-synonymous coding SNP (M106V) in the MPIF-1 (CCL23) gene warrants further study. We anticipate that the results of this study of chemokine and chemokine receptor variation will be applicable to more extensive surveys of long-range haplotype structure in these gene regions and to association studies of HIV-1 disease and cancer.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15588486 PMCID: PMC3525265 DOI: 10.1186/1479-7364-1-4-255
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Biallelic loci typed in total population sample
| Chromosome location | Haplotype position | Gene | NCBI Locus Link | Nuclotide position | NCBI Genbank | NCBI contig | Contig position | Ancestral allele | NCBI dbSNP# |
|---|---|---|---|---|---|---|---|---|---|
| 3p21 | |||||||||
| CCR3 | 1232 | Y17YA/G | NM_001837 | NT_05827 | 3997337 | 4987053 | |||
| CCR2 | 1231 | -5983 G/A | U95626 | NT_05827 | 4083672 | ||||
| CCR2 | 1231 | -5048 G/T | U95626 | NT_05827 | 4084607 | 3918357 | |||
| CCR2 | 1231 | -4866 G/C | U95626 | NT_05827 | 4084789 | 3918370 | |||
| CCR2 | 1231 | -3433 T/C | U95626 | NT_05827 | 4086222 | 3092964 | |||
| CCR2 | 1231 | V64I C/T | NM_000647 | NT_05827 | 4089845 | 1799864 | |||
| CCR2 | 1231 | N260N A/G | NM_000647 | NT_05827 | 4090435 | 1799865 | |||
| CCR5 | 1234 | 208 C/A | NM_000579 | NT_05827 | 4102477 | 2734648 | |||
| CCR5 | 1234 | 303 C/T | NM_000579 | NT_05827 | 4102572 | 1799987 | |||
| CCR5 | 1234 | 676 T/C | NM_000579 | NT_05827 | 4102945 | 1800023 | |||
| CCR5 | 1234 | L55Q T/A | NM_000579 | NT_05827 | 4015194 | 1799863 | |||
| CCR5 | 1234 | D32 | NM_000579 | NT_05827 | |||||
| CCRL2 | 9034 | 1243V C/T | NM_003965 | NT_05827 | 4140934 | 3204850 | |||
| CCRL2 | 9034 | 1137 C/G | NM_003965 | NT_05827 | 4141344 | ||||
| CX3CRI | 1524 | V2491 G/A | NM_001337 | NT_037565 | 1246118 | 3732379 | |||
| CX3CRI | 1524 | T280M C/T | NM_001337 | NT_037565 | 1246212 | 3732378 | |||
| ENA-78 (CXCI.5) | 6374 | Q56Q G/A | NM_002997 | NT_006216 | 3371007 | 425535 | |||
| GRO1 (CXCL1) | 2919 | 1086 G/A | X54489 | NT_006216 | 3243154 | 4074 | |||
| GRO1 (CXCL1) | 2919 | 1176 C/T | X54489 | NT_006216 | 3243245 | 1814092 | |||
| IPIO(CXCLIO) | 3627 | 503 C/G | NM_001565 | NT_016354 | 1437648 | 3921 | |||
| MCPI (CCL2) | 6347 | -362 C/G | M37719 | NT_010799 | 7315787 | 2857656 | |||
| EOTAXIN (CCL11) | 6356 | -1382 C/T | Z92709 | NT_010799 | 7345226 | 4795895 | |||
| MPIF-1 (CCL23) | 6368 | MI06VG/A | U85767 | NT_010799 | 9074064 | 1003645 | |||
| PARC (CCL18) | 6362 | -1 16 C/T | AB0121 13 | NT_010799 | 9125397 | 2015086 | |||
| PARC (CCL18) | 6362 | 81 G/A | AB0121 13 | NT_010799 | 9125563 | 2015070 | |||
| PARC (CCL18) | 6362 | 311 C/A | AB0121 13 | NT_010799 | 9125793 | 2015052 | |||
| PARC (CCL18) | 6362 | 6793 A/G | AB0121 13 | NT_010799 | 9132275 | 14304 | |||
| MIP-IA(CCL3) | 6348 | - 1541 T/C | M23178 | NT_010799 | 9152727 | 1634497 |
Abbreviations: NCBI = National Center for Biotechnology Information; dbSNP = Database for single nucleotide polymorphisms.
Allele frequencies of all loci typed in I I population samples
| Mende ( | Temne ( | Afr-Amer-( | Euro-Amer-( | Hispanic ( | Mixteca ( | Mixe ( | Zapotec ( | Chinese ( | Indonesians ( | PNG ( | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| CCR3(YI7Y_A) | 0.865 | 0.872 | 0.924 | 0.904 | 0.938 | 0.91 1 | 1.000 | 0.953 | 0.967 | 0.978 | 1.000 |
| CCR2(-5983 G) | 0.716 | 0.683 | 0.807 | 1.000 | 0.957 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| CCR2(-5048 G) | 0.858 | 0.901 | 0.792 | 0.746 | 0.649 | 0.71 1 | 0.783 | 0.932 | 0.847 | ||
| CCR2(-4866 C) | 0.699 | 0.621 | 0.745 | 0.996 | 1.000 | 0.974 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| CCR2(-3433 T) | 0.927 | 0.916 | 0.785 | 0.708 | 0.800 | 0.733 | 0.939 | 0.781 | |||
| CCR2(64V_C) | 0.941 | 0.922 | 0.840 | 0.904 | 0.826 | 0.750 | 0.649 | 0.71 1 | 0.808 | 0.955 | 0.826 |
| CCR2(N260N_T) | 0.637 | 0.580 | 0.646 | 0.682 | 0.667 | 0.757 | 0.716 | 0.710 | 0.755 | ||
| CCR5(208 G) | 0.632 | 0.676 | 0.639 | 0.625 | 0.640 | 0.797 | 0.778 | 0.484 | 0.538 | ||
| CCR5(303 G) | 0.321 | 0.245 | 0.412 | 0.533 | 0.583 | 0.500 | 0.797 | 0.733 | 0.525 | ||
| CCR5(676 A) | 0.941 | 0.883 | 0.643 | 0.667 | 0.643 | 0.797 | 0.767 | 0.516 | |||
| CCR5(55L_T) | 1.000 | 1.000 | 0.995 | 0.978 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
| CCR5(+32) | 1.000 | 0.995 | 0.962 | 0.891 | 0.979 | 1.000 | 1.000 | 0.978 | 1.000 | 1.000 | 1.000 |
| CCRL2(243I_C) | 1.000 | 1.000 | 0.986 | 0.907 | 0.957 | 0.904 | 1.000 | 0.933 | 0.952 | 0.979 | 1.000 |
| CCRL2(II37C) | 0.658 | 0.794 | 0.996 | 0.979 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | |
| MCP-I(-362C) | 0.521 | 0.532 | 0.570 | 0.713 | 0.708 | 0.300 | 0.108 | 0.227 | 0.433 | 0.710 | |
| EOTAXIN(-l382C) | 0.995 | 0.994 | 0.949 | 0.776 | 0.848 | 0.991 | 0.986 | 0.988 | 0.919 | 0.904 | 0.860 |
| MIP-la(--1541 A) | 0.816 | 0.830 | 0.801 | 0.728 | 0.771 | 0.792 | 0.757 | 0.739 | 0.638 | 0.633 | 0.826 |
| PARC(6793 G) | 0.984 | 0.989 | 0.953 | 0.761 | 0.789 | 0.608 | 0.716 | 0.650 | 0.670 | 0.820 | |
| PARC(3 1 1 A) | 0.696 | 0.676 | 0.745 | 0.970 | 0.938 | 1.000 | 1.000 | 1.000 | 0.950 | 0.938 | 0.833 |
| PARC(81 G) | 0.968 | 0.947 | 0.922 | 0.792 | 0.825 | 0.736 | 0.644 | 0.919 | 0.867 | 0.419 | |
| PARC(-116C) | 0.615 | 0.606 | 0.607 | 0.906 | 0.729 | 0.825 | 0.730 | 0.644 | 0.897 | 0.798 | |
| MPIF-1(106M_A) | 0.047 | 0.064 | 0.260 | 0.800 | 0.739 | 0.769 | 0.770 | 0.889 | 0.633 | 0.567 | 0.276 |
| CX3CR1(249V_G) | 0.891 | 0.921 | 0.868 | 0.724 | 0.761 | 0.750 | 0.750 | 0.800 | 0.984 | 0.919 | 0.784 |
| CX3CR1(280T_G) | 0.990 | 1.000 | 0.958 | 0.842 | 0.875 | 0.804 | 0.757 | 0.81 1 | 0.983 | 0.796 | |
| ENA(Q56Q_A) | 0.521 | 0.500 | 0.605 | 0.890 | 0.938 | 0.873 | 0.905 | 0.898 | 0.962 | 0.830 | 0.631 |
| GROI(I086G) | 0.995 | 1.000 | 0.313 | 0.521 | 0.632 | 0.527 | 0.636 | 0.532 | 0.378 | 0.510 | |
| GRO1(1176 C) | 0.737 | 0.720 | 0.778 | 0.996 | 1.000 | 1.000 | 1.000 | 0.989 | 1.000 | 1.000 | 0.990 |
| IP 10 (503 C) | 0.734 | 0.814 | 0.678 | 0.553 | 0.771 | 0.945 | 0.971 | 0.942 | 0.967 | 0.875 | 0.792 |
Those genotype distributions that demonstrate a distortion from the Hardy-Weinberg equilibrium are indicated with an underline.
Abbreviations: Afr-Amer = African-Americans; Euro-Amer = European-Americans; PNG = Papua New Guineans.
3p21 haplotype frequencies in 11 population samples
| Haplotype | Mende | Temne | Afr-Amer | Euro-Amer | Hispanic | Mixteca | Mixe | Zapotec | Chinese | Indonesian | PNG | Global | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11111111111111 | 0.143 | 0.100 | 0.134 | 0.294 | 0.308 | 0.245 | 0.391 | 0.422 | 0.062 | 0.415 | 0.546 | ||
| 11211211111111 | 0.029 | 0.056 | 0.085 | 0.089 | 0.187 | 0.255 | 0.351 | 0.254 | 0.158 | 0.053 | 0.100 | ||
| 11112122221111 | 0.059 | 0.033 | 0.062 | 0.175 | 0.204 | 0.282 | 0.162 | 0.189 | 0.021 | ||||
| 11121121211111 | 0.273 | 0.283 | 0.174 | 0.027 | |||||||||
| 11111112221111 | 0.029 | 0.173 | 0.109 | 0.027 | 0.014 | 0.01 1 | 0.165 | 0.158 | 0.035 | ||||
| 12111112211112 | 0.274 | 0.293 | 0.127 | ||||||||||
| 11111122221111 | 0.006 | 0.064 | 0.027 | 0.022 | 0.288 | 0.163 | 0.094 | ||||||
| 11112111111111 | 0.169 | 0.033 | 0.133 | ||||||||||
| 21111121211121 | 0.007 | 0.078 | 0.091 | 0.044 | 0.038 | 0.013 | |||||||
| 11111111111211 | 0.006 | 0.024 | 0.105 | 0.028 | |||||||||
| 11112121111111 | 0.023 | 0.046 | 0.055 | ||||||||||
| 21121121211111 | 0.028 | 0.075 | |||||||||||
| 11212211111111 | 0.034 | 0.065 | |||||||||||
| 11112112221111 | 0.007 | 0.066 | 0.026 | ||||||||||
| 21211211111111 | 0.030 | 0.027 | 0.022 | 0.012 | |||||||||
| 11111112211111 | 0.032 | 0.011 | 0.041 | ||||||||||
| 11211111111111 | 0.040 | 0.012 | 0.026 | ||||||||||
| 11111121111111 | 0.065 | ||||||||||||
| 11111121211121 | 0.042 | 0.009 | 0.013 | ||||||||||
| 11111122211111 | 0.005 | 0.053 | |||||||||||
| 11121111211111 | 0.055 | ||||||||||||
| 21111111111111 | 0.034 | 0.008 | |||||||||||
| 12111122211112 | 0.036 | ||||||||||||
| 11111111112111 | 0.022 | ||||||||||||
| 11111121211111 | 0.004 | ||||||||||||
Haplotype diversity values are indicated at the bottom of the table. Haplotype numbers describe haplotypes defined in Clark et al. [43].
Only estimated haplotypes above 3 per cent frequency in any population are listed. The ancestral haplotype is denoted by the box.
Abbreviations: Afr-Amer = African-Americans; Euro-Amer = European-Americans; PNG = Papua New Guineans.
Figure 1(a-k) The D' statistic for pairwise comparisons of the 14 SNPs genotyped in 3p21 in each population sample. The axes of each plot indicate those polymorphic SNPs included in each analysis. SNP labels refer to numbering system described in Table 1. Abbreviations: Euro-Americans = European-Americans; PNG = Papua New Guineans.
Figure 2Reduced median (R-M) Network of haplotypes of 14 SNPs in 3p21. Haplotypes indicated comprise nearly 80 per cent of the variation in the total population sample (N = 690). The size of the circle for each haplotype indicates the frequency of the haplotype in the pooled sample.
Haplotype frequencies of six single nucleotide polymorphisms in the 17q12 chemokine gene cluster in 11 population samples
| Haplotype | Mende | Temne | Afr-Amer | Euro-Amer | Hispanic | Mixteca | Mixe | Zapotec | Chinese | Indonesian | PNG | Global freq | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 111111 | 0.02 | 0.03 | 0.12 | 0.46 | 0.41 | 0.47 | 0.36 | 0.29 | 0.26 | 0.30 | 0.11 | ||
| 211111 | 0.49 | 0.51 | 0.35 | 0.11 | 0.07 | 0.14 | 0.03 | 0.09 | 0.22 | 0.18 | 0.05 | ||
| 111122 | 0.03 | 0.21 | 0.12 | 0.21 | 0.20 | 0.23 | 0.11 | 0.18 | 0.04 | ||||
| 222111 | 0.01 | 0.01 | 0.02 | 0.14 | 0.09 | 0.16 | 0.02 | 0.09 | 0.54 | ||||
| 122111 | 0.04 | 0.05 | 0.08 | 0.09 | 0.06 | 0.32 | 0.08 | 0.05 | 0.03 | ||||
| 221211 | 0.21 | 0.23 | 0.16 | 0.03 | 0.02 | 0.02 | |||||||
| 211122 | 0.03 | 0.05 | 0.15 | 0.09 | 0.03 | ||||||||
| 111121 | 0.06 | 0.05 | 0.14 | 0.02 | 0.08 | ||||||||
| 211112 | 0.07 | 0.07 | 0.06 | 0.01 | |||||||||
| 221212 | 0.07 | 0.05 | 0.07 | 0.01 | 0.01 | ||||||||
| 121211 | 0.02 | 0.02 | 0.03 | 0.04 | 0.07 | ||||||||
| 111112 | 0.03 | 0.02 | 0.02 | 0.01 | 0.06 | 0.01 | |||||||
| 221111 | 0.06 | 0.03 | 0.08 | ||||||||||
| 221222 | 0.01 | 0.03 | 0.06 | ||||||||||
| 121212 | 0.01 | 0.02 | 0.03 | ||||||||||
| 222112 | 0.02 | 0.01 | |||||||||||
| 211121 | 0.02 | 0.01 | 0.01 | 0.01 | |||||||||
| 221112 | 0.02 | ||||||||||||
| 222122 | 0.03 | ||||||||||||
| 121112 | 0.02 | ||||||||||||
| 122122 | 0.02 | 0.01 | |||||||||||
| 122112 | 0.02 | ||||||||||||
Only those estimated haplotypes above 2 per cent in any one population are listed. The inferred ancestral haplotype is denoted by the box.
Abbreviations: Afr-Amer = African-Americans; Euro-Amer = European-Americans; PNG = Papua New Guineans.
Figure 3(a-k). The D' statistic for pairwise comparisons of the 6 SNPs genotyped in . The axes of each plot indicate those polymorphic SNPs included in each analysis. SNP labels refer to numbering system described in Table 1. Abbreviations: Euro-Americans = European-Americans; PNG = Papua New Guineans.
FST values calculated from 3p21 haplotype frequencies
| Mende | Temne | Afr-Amer | Euro-Amer | Hispanics | Mixteca | Mixe | Zapotecs | Chinese | Indonesians | PNG | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Mende | 0.000 | ||||||||||
| Temne | 0.000 | ||||||||||
| Afr-Amer | 0.023 | 0.029 | 0.000 | ||||||||
| Euro-Amer | 0.126 | 0.144 | 0.067 | 0.000 | |||||||
| Hispanics | 0.122 | 0.141 | 0.059 | 0.000 | |||||||
| Mixteca | 0.137 | 0.151 | 0.073 | 0.041 | 0.000 | ||||||
| Mixe | 0.169 | 0.185 | 0.101 | 0.063 | 0.027 | 0.000 | |||||
| Zapotecs | 0.156 | 0.176 | 0.092 | 0.042 | 0.020 | -0.001 | 0.000 | ||||
| Chinese | 0.159 | 0.163 | 0.095 | 0.111 | 0.115 | 0.117 | 0.158 | 0.158 | 0.000 | ||
| Indonesians | 0.146 | 0.171 | 0.095 | 0.044 | 0.046 | 0.100 | 0.094 | 0.068 | 0.101 | 0.000 | |
| PNG | 0.183 | 0.210 | 0.133 | 0.088 | 0.076 | 0.130 | 0.089 | 0.064 | 0.167 | 0.028 | 0.000 |
Those FST values that did achieve significance with the permutation test [50] are indicated in italics. Those comparisons between populations that did not yield significant differences with the exact test of population differentiation [52] are indicated with a box.
Abbreviations: Afr-Amer = African-Americans; Euro-Amer = European-Americans; PNG = Papua New Guineans.
FST values calculated from 17q11-12 Those FST values that did achieve significance with the permutation test [50]
| Mende | Temne | Afr-Amer | Euro-Amer | Hispanics | Mixteca | Mixe | Zapotecs | Chinese | Indonesians | PNG | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.000 | |||||||||||
| 0.000 | |||||||||||
| 0.017 | 0.021 | 0.000 | |||||||||
| 0.242 | 0.248 | 0.141 | 0.000 | ||||||||
| 0.225 | 0.233 | 0.120 | 0.018 | 0.000 | |||||||
| 0.232 | 0.238 | 0.128 | 0.006 | 0.000 | |||||||
| 0.248 | 0.256 | 0.144 | 0.028 | 0.001 | 0.021 | 0.000 | |||||
| 0.233 | 0.241 | 0.133 | 0.065 | 0.049 | 0.053 | 0.060 | 0.000 | ||||
| 0.140 | 0.147 | 0.061 | 0.043 | 0.026 | 0.043 | 0.051 | 0.054 | 0.000 | |||
| 0.149 | 0.156 | 0.066 | 0.028 | 0.017 | 0.027 | 0.049 | 0.000 | ||||
| PNG | 0.297 | 0.303 | 0.222 | 0.258 | 0.154 | 0.216 | 0.153 | 0.235 | 0.208 | 0.163 | 0.000 |
Those population comparisons that did not yield significant differences with the exact test of population differentiation [52] are indicated with a box.
Abbreviations: Afr-Amer = African-Americans; Euro-Amer = European-Americans; PNG = Papua New Guineans.