| Literature DB >> 20041166 |
Jacques Fellay1, Dongliang Ge, Kevin V Shianna, Sara Colombo, Bruno Ledergerber, Elizabeth T Cirulli, Thomas J Urban, Kunlin Zhang, Curtis E Gumbs, Jason P Smith, Antonella Castagna, Alessandro Cozzi-Lepri, Andrea De Luca, Philippa Easterbrook, Huldrych F Günthard, Simon Mallal, Cristina Mussini, Judith Dalmau, Javier Martinez-Picado, José M Miro, Niels Obel, Steven M Wolinsky, Jeremy J Martinson, Roger Detels, Joseph B Margolick, Lisa P Jacobson, Patrick Descombes, Stylianos E Antonarakis, Jacques S Beckmann, Stephen J O'Brien, Norman L Letvin, Andrew J McMichael, Barton F Haynes, Mary Carrington, Sheng Feng, Amalio Telenti, David B Goldstein.
Abstract
To extend the understanding of host genetic determinants of HIV-1 control, we performed a genome-wide association study in a cohort of 2,554 infected Caucasian subjects. The study was powered to detect common genetic variants explaining down to 1.3% of the variability in viral load at set point. We provide overwhelming confirmation of three associations previously reported in a genome-wide study and show further independent effects of both common and rare variants in the Major Histocompatibility Complex region (MHC). We also examined the polymorphisms reported in previous candidate gene studies and fail to support a role for any variant outside of the MHC or the chemokine receptor cluster on chromosome 3. In addition, we evaluated functional variants, copy-number polymorphisms, epistatic interactions, and biological pathways. This study thus represents a comprehensive assessment of common human genetic variation in HIV-1 control in Caucasians.Entities:
Mesh:
Year: 2009 PMID: 20041166 PMCID: PMC2791220 DOI: 10.1371/journal.pgen.1000791
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Significant hits in the MHC region.
Representation of a 3 Mb stretch in the MHC region, encompassing the HLA Class I gene loci and the genome-wide significant SNPs identified in the study (red dots represents SNPs with p-value<5E–08). Results are shown for set point (upper plot) and for progression (lower plot). The figure was created with WGAViewer [40].
Figure 2Correlation between HIV-1 set point and the genotypes of the top associated SNPs.
HIV-1 viremia at set point strongly associates with rs2395029 (upper panel) and rs9264942 (lower panel) genotypes. The rs2395029 minor allele G has a frequency of 4.8% and each copy of this allele associates with a 0.7 log lower set point. The rs9264942 minor allele C has a frequency of 41.2%, and each copy of this allele associates with a 0.3 log lower set point. Mean and Standard Deviation (error bars) are represented for the respective genotypes.
P-values and population effect sizes (or explained fraction of the inter-individual variability) for the strongest determinants of HIV-1 viremia at set point.
| SNP | MAF | Gene | P-value (initial, N = 486) | Effect Size (initial) | P-value (replication, N = 1896) | Effect Size (replication) | P-value (all, N = 2362) | Effect Size (all) |
| rs2395029 | 4.8% | HCP5/B*5701 | 9.4E–12 | 9.6% | 4.2E–23 | 4.9% | 4.5E–35 | 5.8% |
| rs9264942 | 41.2% | HLA-C | 3.8E–09 | 6.5% | 1.6E–23 | 4.8% | 5.9E–32 | 5.3% |
Results are shown for the subjects that were included in our initial study [3] (initial), for the subset of subjects that are new to the present study (replication), and for the global study population (all). MAF: Minor allele frequency.
Associations between 4-digit HLA Class I alleles and HIV-1 set point in a subset of 1,204 subjects with full results.
| HLA Allele | Model with HLA allele only | Model with HLA allele, rs2395029 | Model with HLA allele, rs2395029, rs9264942 | Model with HLA allele, rs2395029, rs9264942, rs259919, rs9468692, rs9266409, rs8192591 | Effect |
| A*2402 | 3.4E–02 | 6.9E–02 | 1.1E–01 | 9.9E–01 | |
| A*2501 | 2.8E–02 | 2.7E–02 | 1.2E–01 | 3.2E–01 | |
| A*3201 | 5.0E–03 | 1.0E–03 | 6.0E–03 | 2.9E–02 | protective |
| B*0702 | 7.0E–03 | 1.5E–02 | 4.3E–01 | 3.8E–01 | |
| B*0801 | 7.0E–03 | 3.3E–02 | 5.8E–01 | 2.4E–01 | |
| B*1302 | 2.0E–03 | 2.1E–04 | 1.2E–02 | 1.6E–02 | protective |
| B*2705 | 5.2E–05 | 4.9E–06 | 2.0E–03 | 3.0E–03 | protective |
| B*3502 | 1.9E–05 | 3.9E–05 | 6.2E–04 | 2.0E–03 | deleterious |
| B*5601 | 4.2E–02 | 7.8E–02 | 1.9E–01 | 1.2E–01 | |
| C*0202 | 1.0E–03 | 2.5E–04 | 6.0E–02 | 5.0E–02 | |
| C*0401 | 5.0E–03 | 3.0E–02 | 9.3E–01 | 8.5E–01 | |
| C*0602 | 1.2E–11 | 4.9E–02 | 7.1E–01 | 7.8E–01 | |
| C*0701 | 5.4E–04 | 3.0E–03 | 1.8E–01 | 7.8E–01 | |
| C*0702 | 3.4E–02 | 6.3E–02 | 9.9E–01 | 9.5E–01 | |
| C*0802 | 1.2E–02 | 2.0E–03 | 1.6E–01 | 2.5E–01 |
P-values are shown for all Class I alleles that have a nominally significant association with HIV-1 viral load at set point (with the exception of B*5701, discussed in the text). All linear regression models include gender, age and the 12 Eigenstrat axes as covariates. Most of the association signals disappear once the top associated SNPs are added. However, A*3201, B*1302, B*2705 and B*3502 still have an independent effect. See Table S4 for a complete list of all HLA Class I allele results. In addition, Table S5 lists all pairs of HLA-B and HLA-C alleles that are in LD (with an r2>0.1) and therefore can represent the same association signal (as for example in the case of HLA-C*0602, which is often on the same haplotype as HLA-B*5701).
Figure 3Kaplan-Meier survival estimates for the top associated variants.
Results are shown for the 3 most associated SNPs indentified in the genome-wide progression scan and for CCR5-Δ32. The survival curves show, for each genotype, the proportion of the individuals that do not reach a progression outcome over the first 10 years after seroconversion.
Variants in HIV-1 candidate genes, previously reported to associate with viral control or disease progression.
| group | SNP | gene | variant | genotyping | proxy | r2 | model | p setpoint |
| p progression |
| effect |
| Chemokine receptors | rs333 | CCR5 | delta 32 | TaqMan | no | dominant | 1.7E–10 |
| 3.5E–07 |
|
| |
| rs1799988 | CCR5 | P1 - 627T>C | TaqMan | no | recessive | 7.5E–05 |
| 3.8E–04 |
|
| ||
| rs1799864 | CCR2 | V64I | TaqMan | no | dominant | 8.1E–03 |
| 1.5E–01 |
|
| ||
| rs3732378 | CX3CR1 | T280M | present on all chips | - | recessive | 1.5E–01 |
| 6.0E–01 | 1071 | |||
| Chemokines | rs1719134 | CCL3/MIP1a | intron 459C>T | proxy on all chips | rs1634508 | 1 | dominant | 6.0E–01 |
| 8.2E–01 |
| |
| rs2107538 | CCL5/RANTES | promoter -403G>A | proxy on all chips | rs2291299 | 1 | haplotypes R1-R5: additive | 2.9E–01 |
| 2.3E–01 |
| ||
| rs2280788 | promoter -28C>G | proxy on all chips | rs4251739 | 1 | ||||||||
| rs2280789 | In1.1 T>C | proxy on all chips | rs2306630 | 1 | ||||||||
| rs1801157 | SDF-1/CXCL12 | SDF-1 3′A | proxy on all chips | rs10900029 | 1 | recessive | 8.6E–01 |
| 9.2E–01 |
| ||
| Cytokines | rs2243250 | IL-4 | promoter -589C>T | proxy on all chips | rs2243290 | 1 | additive | 3.4E–01 |
| 6.3E–01 |
| |
| rs1800872 | IL-10 | promoter -592C>A | proxy on all chips | rs3024490 | 1 | dominant | 6.5E–02 |
| 1.4E–01 |
| ||
| rs1799946 | DEFB1 | promoter -52G>A | proxy on all chips | rs2741127 | 0.9 | recessive | 1.9E–01 |
| 2.5E–01 |
| ||
| Intracellular life cycle | rs8177826 | PPIA | promoter 1604C>G | TaqMan | no | dominant | 6.0E–01 |
| 6.3E–01 |
| ||
| rs6850 | promoter 1650A>G | TaqMan | no | dominant | 7.7E–01 |
| 1.4E–01 |
| ||||
| rs2292179 | TSG101 | promoter -183T>C | proxy on all chips | rs3781640 | 1 | haplotypes | 7.9E–01 |
| 1.9E–01 |
| ||
| rs1395319 | intron 181A>C | TaqMan | no | |||||||||
| Intrisic immunity | rs8177832 | APOBEC3G | NS coding H186R | present on all chips | - | additive | 3.2E–01 |
| 5.2E–01 |
| ||
| rs3740996 | TRIM5a | NS coding H43Y | present on all chips | - | recessive | 7.3E–01 |
| 9.3E–01 |
| |||
| rs10838525 | NS coding R136Q | present on all chips | - | additive | 9.0E–01 |
| 8.3E–01 |
| ||||
| Innate immunity | rs2287886 | DC-SIGN/CD209 | promoter -139T>C | present on all chips | - | additive | 9.2E–01 |
| 1.2E–01 |
| ||
| rs5030737 | MBL2 | NS coding R52C | 1M chip + TaqMan | no | recessive | 4.5E–01 |
| 2.9E–01 |
| |||
| rs1800450 | NS coding G54D | present on all chips | - | recessive | 4.3E–01 |
| 9.0E–01 |
| ||||
| rs1800451 | NS coding G57E | 1M chip + TaqMan | no | recessive | 5.5E–01 |
| 8.5E–01 |
| ||||
| rs352139 | TLR9 | intron 1174G>A | proxy on all chips | rs352163 | 0.9 | additive | 6.5E–01 |
| 7.9E–01 |
| ||
| rs352140 | syn coding P545P | proxy on all chips | 0.9 | |||||||||
| rs3764880 | TLR8 | NS coding V1M | present on all chips | - | additive | 5.0E–01 |
| 9.3E–01 |
| |||
| Others | rs601338 | FUT2 | W154stop | proxy on all chips | rs504963 | 0.8 | dominant | 1.1E–01 |
| 7.0E–02 |
| |
| rs1801274 | FCGR2A | NS coding H131R | present on all chips | - | recessive | 7.9E–01 |
| 2.0E–01 |
| |||
| rs1544410 | VDR | intron 8 variant | present on all chips | - | recessive | 5.1E–01 |
| 6.0E–01 |
|
See http://www.hiv-pharmacogenomics.org/pdf/ref_tbl_nat_history/The_complete_reference_table_for_HIV_natural_history_modifiers.pdf for references. Dominant or additive genetic models were used in the analyses for individual SNPs on the basis of their described effect and/or their minor allele frequencies. The P1 variant in the CCR5 promoter region is defined by the SNP rs1799988 (627 C>T). Haplotypes R1 to R5 in the (RANTES) gene were defined using 2 promoter variants (−403C>G, defining haplotype R1, and −28C>G, defining haplotype R5), and 1 intronic variant (375T>C, or In1.1, present in haplotypes R3, R4 and R5). The haplotype R4 is defined by a −222T>C SNP that is monomorphic in Caucasians and was therefore absent in our study population. A combined variable was then defined and tested in additive models: 0 = putatively deleterious haplotypes (presence of an R3 haplotype in the absence of R1 and R5), 1 = neutral haplotypes (all other) and 2 = haplotypes putatively protective (presence of an R1 or R5 haplotype in the absence of R3). For the gene, 2 SNPs defined haplotype B (−183T/181C), haplotype C (−183C/181C) and haplotype A (−183T/181A). Again, a combined variable was defined and tested in additive models: 0 = haplotypes putatively deleterious (AC or CC), 1 = neutral haplotypes (AA or BC) and 2 = haplotypes putatively protective (AB or BB). Only variants from the chromosome 3 CCR5-CCR2 genomic region showed nominally significant association with the HIV-1-related outcomes under study. SNP: single nucleotide polymorphism. proxy: high-LD SNP (r2>0.8) that can be used as a tag for the original variant. r2: r-squared. p: p-value.
Figure 4Allelic distribution of the significant variants in subsets of the study population.
The bar graphs show the allelic distribution of the 4 variants that have a genome-wide significant association with HIV-1 set point and/or disease progression in subsets of the study population. Groups were defined according to HIV-1 set point (left-hand side graphs) and to progression time (right-hand side graphs).