Literature DB >> 24784026

Host genetics and immune control of HIV-1 infection: fine mapping for the extended human MHC region in an African cohort.

H A Prentice1, N M Pajewski2, D He1, K Zhang3, E E Brown1, W Kilembe4, S Allen5, E Hunter6, R A Kaslow1, J Tang7.   

Abstract

Multiple major histocompatibility complex (MHC) loci encoding human leukocyte antigens (HLA) have allelic variants unequivocally associated with differential immune control of HIV-1 infection. Fine mapping based on single nucleotide polymorphisms (SNPs) in the extended MHC (xMHC) region is expected to reveal causal or novel factors and to justify a search for functional mechanisms. We have tested the utility of a custom fine-mapping platform (the ImmunoChip) for 172 HIV-1 seroconverters (SCs) and 449 seroprevalent individuals (SPs) from Lusaka, Zambia, with a focus on more than 6400 informative xMHC SNPs. When conditioned on HLA and nongenetic factors previously associated with HIV-1 viral load (VL) in the study cohort, penalized approaches (HyperLasso models) identified an intergenic SNP (rs3094626 between RPP21 and HLA-E) and an intronic SNP (rs3134931 in NOTCH4) as novel correlates of early set-point VL in SCs. The minor allele of rs2857114 (downstream from HLA-DOB) was an unfavorable factor in SPs. Joint models based on demographic features, HLA alleles and the newly identified SNP variants could explain 29% and 15% of VL variance in SCs and SPs, respectively. These findings and bioinformatics strongly suggest that both classic and nonclassic MHC genes deserve further investigation, especially in Africans with relatively short haplotype blocks.

Entities:  

Mesh:

Year:  2014        PMID: 24784026      PMCID: PMC4111776          DOI: 10.1038/gene.2014.16

Source DB:  PubMed          Journal:  Genes Immun        ISSN: 1466-4879            Impact factor:   2.676


Introduction

In the era of genome-wide association studies (GWAS), the search for definitive host genetic factors underlying effective immune control of HIV-1 infection has yet to yield convincing results beyond those already obtained through hypothesis-driven approaches.[1] To date, the most consistent observations center on the importance of human leukocyte antigen class I (HLA-I) genes within the human major histocompatibility complex (MHC) region on chromosome 6p21, although other candidates in the extended MHC (xMHC) may also play some role.[2, 3] From the nine HIV-1-related GWAS[4-12], only two single nucleotide polymorphisms (SNPs) within the xMHC have consistently shown favorable impact on set-point viral load (VL): rs2395029 within the HCP5 gene and rs9264942 located 35 kb upstream of HLA-C. Follow-up studies have noted strong linkage disequilibrium (LD) between rs2395029 and HLA-B*57:01, an established allele for favorable outcomes after HIV-1 infection.[10, 13, 14] The majority of reported GWAS have been performed in populations of European ancestry, while two have scanned African-American populations, without detecting any robust novel associations with disease progression.[10, 11] The overall lack of consensus in various findings may reflect the evolving viral diversity, as well as the complexity of human population heterogeneity in genetic architecture.[15, 16] Fine-mapping of implicated genetic regions in diverse racial groups is an important step in follow-up studies, as variable LD patterns can help resolve specific haplotype blocks for in-depth interrogation of causal variants or primary association signals that are not fully captured by genome-wide scans.[17] In sub-Saharan African populations with the greatest burden of HIV/AIDS and profound genetic diversity, fine-mapping of genetic factors related to immune control of HIV-1 infection may help steer future translational research, especially if dense coverage of candidate loci can effectively rule out the uncertainty with local and extended haplotypes. Previous studies based on a large Zambian cohort with longitudinal data have identified several host genetic factors that have a cumulative impact on VL.[18-22] These studies have also highlighted the importance of viral evolution and adaptation.[21-24] With a focus on both early VL in recent HIV-1 seroconverters (SCs) and chronic VL in seroprevalent individuals (SPs), we have tested the utility of a custom genotyping platform (the ImmunoChip) for fine-mapping of host genetic factors in the xMHC region.

RESULTS

Characteristics of seroconverted and seroprevalent individuals included for final analysis

A total of 172 eligible SCs and 449 SPs had successful HLA and ImmunoChip genotyping results that met all inclusion criteria and passed several layers of quality control procedures. In comparisons of overall characteristics of individuals within the two subgroups (Table 1), there were more women in the SC group than in the SP group, with a male-to-female sex ratio of 0.6 and 1.1, respectively. On average, SCs were younger than SPs. Previously established HLA factors had similar distribution in SCs and SPs, except that HLA-B*57 (a favorable factor) was more common in SPs (12.9%) than in SCs (7.6%). Mean log10-transformed VL was slightly lower in SCs (4.6 ± 0.7) than in SPs (4.8 ± 0.7).
Table 1

Overall characteristics of 172 HIV-1 seroconverters and 449 seroprevalent subjects with HLA and SNP genotyping results.

CharacteristicaSCs (N = 172)SPs (N = 449)P
Sex ratio, M/F (no. of individuals)0.6 (66/106)1.1 (237/212)0.001
Age: mean ± SD (year)b30.5 ± 7.532.7 ± 7.90.002
Enrollment dates0.387
  EarliestApr 1995Mar 1995
  LatestFeb 2008Sept 2008
Estimated dates of infection (EDI)
  EarliestDec 1995-NA
  LatestMay 2009-NA
Duration of follow-up (month): median (IQR)-17.9 (8.0–31.6)NA
Viral load (VL) outcome
  Mean ± SD (log10)c4.6 ± 0.74.8 ± 0.7<0.001
  Categories: n (%)0.004
    <10,000 copies/mL38 (22.1)76 (16.9)0.137
    10,000–100,000 copies/mL83 (48.3)174 (38.8)0.031
    >100,000 copies/mL51 (29.6)199 (44.3)0.001
Key HLA variants of interest: n (%)
  A*3627 (15.7)47 (10.5)0.072
  A*7422 (12.8)62 (13.8)0.740
  B*134 (2.3)17 (3.8)0.368
  B*5713 (7.6)58 (12.9)0.060
  B*8111 (6.4)39 (8.7)0.348
  DRB1*01:0222 (12.8)34 (7.6)0.042d

SCs, HIV-1 seroconverters; SPs, seroprevalent individuals; M, male; F, female; SD, standard deviation; IQR, interquartile range; NA, not applicable.

At EDI for SCs; at viral load sampling date for SPs.

Geometric mean of all VL measurements in the 3–12 months interval after the EDI in SCs versus first available VL in the SPs.

q = 0.144 (for comparing six HLA variants of interest).

Haplotype blocks defined by informative xMHC SNPs

Following the removal of 591 MHC SNPs (~7% of the total) known to be duplicates on the ImmunoChip (Table S1 in Supplemental Materials), 6,417 SNPs in SCs and 6,708 in SPs passed several genotyping quality control procedures and showed minor allele frequencies (MAF) over the analysis thresholds (≥0.025 in SCs and ≥0.015 in SPs, for a minimum of 10 observations in each group) (Tables S2 and S3 in Supplemental Materials). These informative SNPs divided the xMHC region into 410 haplotype blocks of various sizes (Tables S2 and S3) in Supplemental Materials), but they rarely tagged HLA class I or class II alleles (i.e., pairwise r2 >0.80). Accordingly, major HLA factors pertinent to the study population (Figure 1 and Figure 2) were mostly independent of the SNP genotypes.
Figure 1

Patterns of linkage disequilibrium between individual MHC SNPs and HLA-B*57. The r2 values for B*57:03 and B*57 are shown separately. The relative location of major HLA loci of interest is indicated, so is the density of SNP coverage along the MHC region.

Figure 2

Patterns of linkage disequilibrium between MHC SNPs and two HLA-DRB1 alleles of interest. The r2 values for DRB1*01:02 and DRB1*15:03 are plotted using a strategy described in Figure 1.

xMHC SNP variants and set-point VL in HIV-1 seroconverters

Conventional approach to association analyses did not identify any xMHC SNP with an adjusted p value less than 2.8×10−5, the threshold for xMHC-wide statistical significance after correction for 1,800 independent tests. Several trends (p <0.001) were noted for multiple SNPs mapped to NOTCH4 and to an intergenic region between RPP21 and HLA-E (Table S4 in Supplemental Materials). On the other hand, high-dimensional HyperLasso model (penalized regression) indicated that three SNPs in the MHC class I region and two others in class II and class III might contribute to VL variability in SCs (Figure 3A). These five SNPs, along with sex and age, explained 34.4% of the overall VL variance (p <0.0001) (Table 2). When conditioned further on three prominent HLA factors (A*74, B*13 and B*57), only rs3094626 in the MHC class I region (intergenic between RPP21 and HLA-E) and rs3134931 in class III (intronic within NOTCH4) showed independent contributions to VL variability (Figure 3B). In the reduced multivariable model (Table 2), sex, age, rs3094626 (allele C), rs3134931 (allele G) and the three known HLA factors collectively explained 28.9% of the overall VL variance (p <0.0001). The two MHC SNPs were also among the top hits in single-variant models, with the 11th and 16th ranked p-values, respectively, after statistical adjustment for demographic features and HLA factors (Table S4 in Supplemental Materials).
Figure 3

Associations of SNPs within the extended MHC with Box-Cox transformed log10 viral load in 172 seroconverters. A) Results adjusted for sex and age at time of HIV-1 infection; B) Results adjusted for sex, age at time of infection and three prominent HLA factors (HLA-A*74, B*13 and B*57). As in Figure 1, the density of SNP coverage along the MHC region is also shown. One of the SNPs, rs114500036, is also known as rs2394250 (see Table 2)

Table 2

Summary of HyperLasso results from analyses of Box-Cox-transformed log10 viral load in HIV-1 seroconverters.a

SNP IDChr 6 positionGeneClassMAFAlleleHaplotype blocknumberbRelation to VL
1st analysisc
  rs173700429,869,190HCG4|HLA-Gupstream0.439C93Low
  rs239425030,051,635HCG9ncRNA0.282TNALow
  rs312981330,446,226RPP21|HLA-Eintergenic0.436GNAHigh
  rs40486032,292,323NOTCH4intron0.334G285High
  rs381971432,912,195TAP2intron0.422G351Low
2nd analysisd
  rs309462630,431,602RPP21|HLA-Eintergenic0.430C133Low
  rs313493132,298,598NOTCH4intron0.517GNAHigh

Testing additive effects for the minor alleles (nucleotides) of single nucleotide polymorphism (SNPs); Chr, chromosome; MAF, minor allele frequency; VL, viral load

Not all eligible SNPs can be assigned to haplotype blocks (details are shown in Supplemental Materials). NA, not applicable.

First analysis has age and sex as covariates; overall R2=34.4% for the model.

Second analysis has age, sex, HLA-A*74, B*13 and B*57 as covariates; overall R2=28.9 for the model.

xMHC SNP variants and chronic VL in seroprevalent individuals

When tested individually, no single SNPs reached the Bonferroni-corrected, statistical significance threshold for potential association with chronic VL in SPs (Table S5 in Supplemental Materials). HyperLasso model initially identified two SNPs in the MHC class I region (rs3823376 and rs2517425) and another in class II (rs2875114) as apparent contributors to VL variability in SPs (Figure 4A). These three SNPs, along with sex and age, could explain 14.8% of the overall VL variance (p <0.0001) (Table 3). Indeed, the allele C for rs2857114, a SNP downstream from HLA-DOB in the MHC class II region, remained as an independent marker associated with unfavorable VL outcome after additional adjustment for previously identified HLA alleles (Figure 4B and Table 3). The final model explained 14.9% of the overall VL variance (p <0.0001) (Table 3).
Figure 4

Associations of SNPs within the extended MHC with Box-Cox transformed log10 viral load (VL) in 449 Zambians with seroprevalent HIV-1 infection. A) Results adjusted for sex and age at the time of VL measurement; B) Results adjusted for sex, age at the time of VL measurement and five known HLA factors (HLA-A*36, A*74, B*57, B*81 and DRB1*01:02). As in Figure 1, the density of SNP coverage along the MHC region is also shown.

Table 3

Summary of HyperLasso results from analysis of Box-Cox transformed log10 viral load (VL) in seroprevalent individuals.a

SNP IDChr 6 positionGeneClassMAFAlleleHaplotype blocknumberbRelation to VL
1st Analysisc
  rs382337630,052,163HCG9ncRNA0.404GNALow
  rs251742531,057,649MUC21intergenic0.280C170Low
  rs285711432,887,974HLA-DQB2|HLA-DOBdownstream0.457C346High
2nd analysisd
  rs285711432,887,974HLA-DQB2|HLA-DOBdownstream0.457C346High

Testing additive effects for the minor alleles of single nucleotide polymorphisms (SNPs); MAF, minor allele frequency.

Not all eligible SNPs can be assigned to haplotype blocks (details are shown in Supplemental Materials). NA, not applicable.

First analysis has adjusted for sex and age at VL measurement; overall R2=14.8% for the model.

Second analysis has adjusted for sex, age at VL measurement, HLA-A*36, A*74, B*57, B*81 and DRB1*01:02; Overall R2=14.9% for the model.

Findings from bioinformatics

A search in the HaploReg database (http://www.broadinstitute.org/mammals/haploreg/haploreg.php) revealed that the three independent associations (rs3094626, rs3134931, and rs2857114) identified by HyperLasso models were not complicated by tagging SNPs (r2 ≥0.80) reported in African populations (Figure S1 in Supplemental Materials). First, neither rs3094626 nor rs3134931 can tag other SNPs. Second, the only SNP (rs2621331 in HLA-DOB) tagged by rs2857114 (r2 = 1.0) has less functional attributes than rs2857114. In terms of previously reported associations captured by the NCBI Global Cross-database, rs2857114 has been implicated as an expression QTL for TAP2 and HLA-DOB in B-cells and monocytes.[25] In populations of European ancestry, however, two of the three SNPs of major interest have quite different LD patterns (Figure S2). In particular, rs2857114 can effectively tag 70 other SNPs at the HLA-DOB locus, including two (rs2187684 and rs2257789) that may govern promoter functions; one of these (rs2187684) has been associated with Juvenile idiopathic arthritis (JIA).[26] No other clues can point to the causal variants.

DISCUSSION

Guided by strong epidemiologic evidence from hypothesis-driven approaches,[21, 22] our study here revealed several xMHC SNPs as novel correlates of differential immune control of HIV-1 infection in an African cohort. Of note, when results obtained from three analytical approaches (linear regression, penalized regression and ordinal regression) are compared side by side for seroconverters (recent HIV-1 infection) and seroprevalent subjects (unknown but longer duration of infection), there is a striking contrast for the list of independent markers at the gene and SNP levels (e.g., Figure 3 versus Figure 4). In light of the various pathways for viral adaptation and immune escape mutations in individuals and populations,[24, 27, 28] it is now evident that seroconverters and seroprevalent subjects should be treated as distinct subgroups for certain epidemiologic analysis even when they are derived from the same location, because lumping them together (for the sake of statistical power) can obscure the search for early correlates and mechanisms of durable immune control.[21, 29] Our work has also demonstrated that penalized regression can be advantageous when a large number of individuals SNPs with modest effects are analyzed. Penalized regression, such as HyperLasso, focuses on prediction and not on hypothesis testing, which requires moving away from the usual and overwhelming reliance on p values in genetic studies.[30] Although the models can be calibrated to offer some control of overall false discovery rates, regularization also induces biased coefficient estimates and complicates the calculations of standard errors for variables selected into the model. Methods such as HyperLasso typically include the best SNPs out of a pool of closely related variants for characterizing the association.[31] Thus, the resulting models are not necessarily unique, i.e., multiple competing models may provide virtually identical predictive accuracy. To this end, it is often necessary to emphasize the specific genes or haplotype blocks rather than the individual SNPs. Surveys based on various databases, including The 1000 Genomes Project,[32] ENCODE[33, 34], and the NCBI Global Cross-database, can help narrow the search for consensus findings or causal variants, especially when functional attributes in various cell lineages are known.[25] The novel MHC SNPs identified by two series of HyperLasso models can explain a good proportion of the variability in both early and chronic VL (Tables 2 and 3), clearly beyond those already attributed to classical HLA alleles in the Zambian cohort.[22] In particular, the association of allele G for rs3134931 (NOTCH4) with set-point VL has some consistency with reported literature, as two previous studies on HIV-1 have highlighted a nonsynonymous SNP (rs8192591) within exon 9 of NOTCH4, corresponding to Gly534Ser.[8, 35] Similar to our findings on rs3134931-G and high set-point VL, an unfavorable relationship has been reported for rs8192591 and disease progression.[35] The conservative amino acid substitution introduced by rs8192591 is not expected to alter the protein function.[8] The low minor allele frequency for rs8192591 in our study cohort also rules out the possibility of a major impact. In contrast, the allelic variants of rs3134931 (a SNP mapped to an enhancer region) can have distinct binding affinity for PU1 in fibroblasts and TH1 cells (Figure S1). While rs3134931-G may well be a causal variant of immunological importance, recognition of NOTCH4 (http://www.ncbi.nlm.nih.gov/gene/4855) as a hot spot for autoimmune disorders and infectious diseases[8, 35–38] still justifies an in-depth evaluation of neighboring variants in follow-up studies. The biological relevance of other xMHC SNPs associated with set-point and chronic VL is less obvious, even when LD patterns and other datasets are considered (Figure S1). The association signal for rs2857114 may point to the potential role of HLA-DOB (http://www.ncbi.nlm.nih.gov/gene/3112), a non-classic HLA gene encoding the DOβ chain that forms a heterodimer with HLA-DOα. The DOαβ complex is typically stored in lysosomes to regulate HLA-DM-mediated loading of antigenic peptides to MHC class II molecules.[39-41] In the absence of strong LD with known SNPs beyond the HLA-DOB locus, further analysis of rs2857114 in African populations can be highly advantageous because there is little complication by neighboring variants (Figure S1). Conversely, presence of conserved extended haplotypes in populations of European ancestry[42] can obscure the search for causal variants within cis- and trans-acting elements.[25] Several studies have noted the benefits of the popular ImmunoChip for detecting independent association signals for coding variants at many loci.[43, 44] By design, the ImmunoChip may allow for fine-mapping in populations of European ancestry, with clear improvement for SNP coverage when compared with existing GWAS arrays.[45, 46] However, as far as the xMHC region is concerned, the first-generation ImmunoChip still has a major caveat in its lack of dense coverage (e.g., Figure 1), especially for coding SNPs (Table S2 and Table S3). It is also apparent that multiple MHC SNPs (i.e. rs2395029 and rs9264942) highly relevant to HIV/AIDS and other infectious diseases are not covered at all by the ImmunoChip. Our statistical models should continue to improve when fine-mapping based on sequencing can adequately capture variants in both classic and non-classic MHC loci. Further analyses of rare variants in large cohort should be informative as well. Future generations of the ImmunoChip may need to meet these needs and minimize uncertainties about coverage for the xMHC region.

PATIENTS AND METHODS

Study population

Subjects available for this study came from the Zambia-Emory HIV-1 Research Project (ZEHRP). Based on HIV-1 infection status at the time of enrollment, subjects were classified into two subgroups: 242 seroconverters (SCs) who became infected during quarterly follow-up visits and 482 seroprevalent individuals (SPs) who were already infected at baseline. The overall study design and follow-up strategies have been described in detail elsewhere for this cohort.[47-49]

Ethical aspects

The research outlined in this study was approved by the institutional review boards at clinical sites in Lusaka, Zambia and at two collaborating institutions (Emory University and University of Alabama at Birmingham) in the United States. All subjects gave written informed consent for screening and participation.

Assessment of virologic outcomes

Plasma VL was measured as HIV-1 RNA concentration (copies/mL) using the Roche Amplicor 1.0 assay (Roche Diagnostics Systems Inc., Branchburg, NJ) in a laboratory certified by the virology quality assurance program of the AIDS Clinical Trial Group. Results after the initiation of antiretroviral therapy (ART) were censored. For SCs, set-point VL corresponded to the log10-transformed geometric mean of all available VL measurements during the 3–12 months interval after the estimated date of infection (EDI). For SPs, the first log10-transformed VL measurement available was used for analysis. Subjects (49 SCs and 5 SPs) who had missing VL data were excluded first, so were those (5 SCs and 13 SPs) who had VL below the lower limit of detection (i.e., <400 copies/ml). The remaining SCs (n=188) and SPs (n=464) were retained for further analyses.

Genotyping

Allelic variants for three classic HLA class I genes (HLA-A, HLA-B and HLA-C) and two HLA class II loci (HLA-DRB1 and HLA-DQB1) were resolved to the first four digits (distinct protein sequences) using a combination of PCR-based methods, including PCR with sequence-specific primers (SSP) (Dynal/Invirtrogen, Brown Deer, WI), automated sequence-specific oligonucleotide (SSO) probe hybridization (Innogenetics, Alpharetta, GA), and sequencing-based typing (SBT) (Abbott Molecular, Inc., Des Plaines, IL). SNP genotyping with the Illumina ImmunoChip[46] was processed at a genomics core facility (University of Alabama at Birmingham); SNP alleles were inferred using the joint calling and haplotype phasing algorithm implemented in BEAGLECALL.[50] We completed a series of data cleaning and quality control procedures for SNPs in the xMHC region, excluding SNPs based on the following criteria: (i) duplication (Table S1), (ii) missingness (call rate <98.5%), (iii) minor allele frequency (MAF) <0.025 in SCs and <0.015 in SPs, and (iv) deviation from Hardy-Weinberg equilibrium (HWE) (p <10−6). In the end, 6,417 informative SNPs in SCs (Tables S3) and 6,708 SNPs in SPs (Table S3) were mapped to the xMHC region (~7.5 Mb between rs498548 and rs2772390).[51]

Data cleaning based on ImmunoChip results and population stratification

Subjects with ImmunoChip data were further evaluated against three additional (and occasionally overlapping) quality control criteria for final exclusion: (i) overall call rates <95% (8 SCs and 3 SPs), (ii) failing sex determination (n=0); (iii) up to third degree familial relationships (4 SCs and 5 SPs) according to kinship coefficients (the KING software package).[52, 53] Population stratification was assessed using multidimensional scaling (MDS) implemented in KING.[53, 54] We gathered SNP data for 1,184 unrelated individuals from the eleven study populations in Phase 3 of the International HapMap Project, defined by the Illumina 1M and Affymetrix 6.0 genome-wide SNP arrays. For the MDS analysis, we used a subset of SNPs (~30,000) that were (i) annotated with reference sequence (rs) IDs on the ImmunoChip, (ii) outside of regions of known extended LD in European populations, and (iii) reliably aligned with the HapMap 3 data (i.e., removing SNPs with ambiguous A/T and C/G variants).[55] The MDS data revealed four outliers (1 SC and 3 SPs, possibly immigrants) that were not suitable for final analysis.

Analysis of general features and population genetics

The overall characteristics of SCs and SPs were compared using Student’s t test (for continuous variables), χ2 test (for categorical variables), and the Wilcoxon-Mann-Whitney test (for non-parametric variables) (Table 1). HLA alleles and xMHC SNP genotypes in SCs and SPs were also analyzed separately for patterns of LD, with SNPs assigned to individual haplotype blocks using HaploView[56] whenever possible (Tables S2 and S3). SNPs that can effectively tag individual HLA alleles (pairwise r2 >0.80) received special attention, especially in terms of their physical location (relative to the HLA alleles and other MHC genes) and potential function (transcription and translation).

Strategies for association analysis

To effectively identify novel SNPs that are independently associated with VL outcomes in SCs and SPs, we adopted a high-dimension method known as penalized regression (HyperLasso),[31] which tests all eligible xMHC SNPs at once and minimizes the risk of false discovery from random testing. The HyperLasso model uses a hierarchical Normal-Exponential-Gamma (NEG) regression coefficient, which is parameterized in terms of shape and scale. Based on the method reported by Vignal et al.,[57] the shape parameter was set to 1, and 100 null permutations were analyzed for both the SC and SP datasets (for a given set of non-SNP covariates, i.e. non-genetic factors and previously identified HLA alleles) over a grid of scale parameter values. The null datasets were created by randomly pairing phenotypes/non-genetic covariates with genotypes (thus maintaining the relationship between the non-genetic covariates and VLs). We then selected a value for the scale parameter that allowed no more than one spurious SNP signal in the final regression model. The minor alleles of individual SNPs were tested for additive effect, and they were included only if the minor allele was observed in at least 10 subjects (either heterozygote or homozygote) in a given dataset. For analyses of set-point VL in SCs, the HyperLasso model was adjusted for potential effects of sex, age at EDI and three known HLA factors (A*74, B*13, and B*57).[21] Analyses of chronic VL in SPs were conditioned on sex, age at VL measurement and five known HLA factors (HLA-A*36, A*74, B*57, B*81, and DRB1*01:02) as reported earlier.[21] To facilitate comparison with findings from multiple GWAS, we also applied a standard approach to testing individual candidate SNPs and ranked them based on their adjusted p values in generalized linear models (GLMs) (SAS version 9.2). To account for the widespread LD within the xMHC region, SimpleM was used to calculate the number of independent tests for Bonferroni correction of p values.[58, 59] For both SCs and SPs, the SimpleM estimate for the number of independent tests was nearly 1,800. Thus, a p value of 2.8×10−5 was considered statistically significant for the MHC-wide random testing for individual SNPs. For all linear regression models, the assumption of a normal VL data distribution was assessed using the Kolmogorov-Smirnov test. The standard log10-transformation of VL helped with data normalization, but Box-Cox transformation of log10 VL was generally applicable to separate analyses of SCs and SPs. We further obtained summary statistics (proportion odds ratios, 95% confidence intervals and p values) from ordinal logistic regression models for three VL categories: (i) <10,000 copies/mL (low), (ii) 10,000–100,000 copies/mL (medium), and (iii) >100,000 copies/mL (high). This alternative strategy is expected to alleviate concerns about minor VL fluctuations within individuals and inter-assay variability in VL data. More importantly, these VL categories have strong and persistent implications for rates of heterosexual HIV-1 transmission.[47, 60]

Bioinformatics for individual SNPs of interest

To gather evidence for functional relevance, SNPs associated with HIV-1 VL in HyperLasso models were first queried in HaploReg[61] for known LD with SNPs documented by the 1000 Genomes Project (not necessary covered by the ImmunoChip) or annotated by the ENCODE project.[33, 34] MHC SNPs implicated for other diseases (dbGaP) and/or gene expression QTL (eQTL) in various cell lines[25] were also surveyed (NCBI Global Cross-database, http://www.ncbi.nlm.nih.gov/), with a focus on functional attributes in cells that mediate immune pathways. These procedures further took into account of (i) cis- and trans-acting elements, (ii) local haplotypes resolved by HaploView,[56] (iii) conserved extended haplotypes known for various HLA alleles.[42]
  61 in total

1.  Conserved extended haplotypes of the major histocompatibility complex: further characterization.

Authors:  M T Dorak; W Shao; H K G Machulla; E S Lobashevsky; J Tang; M H Park; R A Kaslow
Journal:  Genes Immun       Date:  2006-06-22       Impact factor: 2.676

2.  Human susceptibility to viral infection: the search for HIV-protective alleles among Africans by means of genome-wide studies.

Authors:  Joseph Donfack; Farrel J Buchinsky; J Christopher Post; Garth D Ehrlich
Journal:  AIDS Res Hum Retroviruses       Date:  2006-10       Impact factor: 2.205

3.  MHC fine mapping of human type 1 diabetes using the T1DGC data.

Authors:  C He; S Hamon; D Li; S Barral-Rodriguez; J Ott
Journal:  Diabetes Obes Metab       Date:  2009-02       Impact factor: 6.577

4.  Avoiding the high Bonferroni penalty in genome-wide association studies.

Authors:  Xiaoyi Gao; Lewis C Becker; Diane M Becker; Joshua D Starmer; Michael A Province
Journal:  Genet Epidemiol       Date:  2010-01       Impact factor: 2.135

Review 5.  Human genetic susceptibility to infectious disease.

Authors:  Stephen J Chapman; Adrian V S Hill
Journal:  Nat Rev Genet       Date:  2012-02-07       Impact factor: 53.242

6.  Screening low-frequency SNPS from genome-wide association study reveals a new risk allele for progression to AIDS.

Authors:  Sigrid Le Clerc; Cédric Coulonges; Olivier Delaneau; Danielle Van Manen; Joshua T Herbeck; Sophie Limou; Ping An; Jeremy J Martinson; Jean-Louis Spadoni; Amu Therwath; Jan H Veldink; Leonard H van den Berg; Lieng Taing; Taoufik Labib; Safa Mellak; Matthieu Montes; Jean-François Delfraissy; François Schächter; Cheryl Winkler; Philippe Froguel; James I Mullins; Hanneke Schuitemaker; Jean-François Zagury
Journal:  J Acquir Immune Defic Syndr       Date:  2011-03-01       Impact factor: 3.731

7.  ENCODE whole-genome data in the UCSC Genome Browser.

Authors:  Kate R Rosenbloom; Timothy R Dreszer; Michael Pheasant; Galt P Barber; Laurence R Meyer; Andy Pohl; Brian J Raney; Ting Wang; Angie S Hinrichs; Ann S Zweig; Pauline A Fujita; Katrina Learned; Brooke Rhead; Kayla E Smith; Robert M Kuhn; Donna Karolchik; David Haussler; W James Kent
Journal:  Nucleic Acids Res       Date:  2009-11-17       Impact factor: 16.971

8.  Common genetic variation and the control of HIV-1 in humans.

Authors:  Jacques Fellay; Dongliang Ge; Kevin V Shianna; Sara Colombo; Bruno Ledergerber; Elizabeth T Cirulli; Thomas J Urban; Kunlin Zhang; Curtis E Gumbs; Jason P Smith; Antonella Castagna; Alessandro Cozzi-Lepri; Andrea De Luca; Philippa Easterbrook; Huldrych F Günthard; Simon Mallal; Cristina Mussini; Judith Dalmau; Javier Martinez-Picado; José M Miro; Niels Obel; Steven M Wolinsky; Jeremy J Martinson; Roger Detels; Joseph B Margolick; Lisa P Jacobson; Patrick Descombes; Stylianos E Antonarakis; Jacques S Beckmann; Stephen J O'Brien; Norman L Letvin; Andrew J McMichael; Barton F Haynes; Mary Carrington; Sheng Feng; Amalio Telenti; David B Goldstein
Journal:  PLoS Genet       Date:  2009-12-24       Impact factor: 5.917

9.  Distinct genetic loci control plasma HIV-RNA and cellular HIV-DNA levels in HIV-1 infection: the ANRS Genome Wide Association 01 study.

Authors:  Cyril Dalmasso; Wassila Carpentier; Laurence Meyer; Christine Rouzioux; Cécile Goujard; Marie-Laure Chaix; Olivier Lambotte; Véronique Avettand-Fenoel; Sigrid Le Clerc; Laure Denis de Senneville; Christiane Deveau; Faroudy Boufassa; Patrice Debré; Jean-François Delfraissy; Philippe Broet; Ioannis Theodorou
Journal:  PLoS One       Date:  2008-12-24       Impact factor: 3.240

10.  Simultaneous analysis of all SNPs in genome-wide and re-sequencing association studies.

Authors:  Clive J Hoggart; John C Whittaker; Maria De Iorio; David J Balding
Journal:  PLoS Genet       Date:  2008-07-25       Impact factor: 5.917

View more
  9 in total

1.  Dynamics and Correlates of CD8 T-Cell Counts in Africans with Primary Human Immunodeficiency Virus Type 1 Infection.

Authors:  Heather A Prentice; Hailin Lu; Matthew A Price; Anatoli Kamali; Etienne Karita; Shabir Lakhi; Eduard J Sanders; Omu Anzala; Susan Allen; Paul A Goepfert; Eric Hunter; Jill Gilmour; Jianming Tang
Journal:  J Virol       Date:  2016-10-28       Impact factor: 5.103

2.  Immunogenetic factors in early immune control of human immunodeficiency virus type 1 (HIV-1) infection: Evaluation of HLA class I amino acid variants in two African populations.

Authors:  Howard W Wiener; Sadeep Shrestha; Hailin Lu; Etienne Karita; William Kilembe; Susan Allen; Eric Hunter; Paul A Goepfert; Jianming Tang
Journal:  Hum Immunol       Date:  2017-12-28       Impact factor: 2.850

3.  Differences in T cell distribution and CCR5 expression in HIV-positive and HIV-exposed seronegative persons who inject drugs.

Authors:  Eveli Kallas; Kristi Huik; Silver Türk; Merit Pauskar; Ene-Ly Jõgeda; Marina Šunina; Tõnis Karki; Don Des Jarlais; Anneli Uusküla; Radko Avi; Irja Lutsar
Journal:  Med Microbiol Immunol       Date:  2015-12-22       Impact factor: 3.402

4.  CD4:CD8 lymphocyte ratio as a quantitative measure of immunologic health in HIV-1 infection: findings from an African cohort with prospective data.

Authors:  Jianming Tang; Xuelin Li; Matthew A Price; Eduard J Sanders; Omu Anzala; Etienne Karita; Anatoli Kamali; Shabir Lakhi; Susan Allen; Eric Hunter; Richard A Kaslow; Jill Gilmour
Journal:  Front Microbiol       Date:  2015-07-01       Impact factor: 5.640

5.  Immunogenetic influences on acquisition of HIV-1 infection: consensus findings from two African cohorts point to an enhancer element in IL19 (1q32.2).

Authors:  X Li; K Zhang; N M Pajewski; I Brill; H A Prentice; S Shrestha; W Kilembe; E Karita; S Allen; E Hunter; R A Kaslow; J Tang
Journal:  Genes Immun       Date:  2015-01-29       Impact factor: 2.676

6.  Protective HLA alleles are associated with reduced LPS levels in acute HIV infection with implications for immune activation and pathogenesis.

Authors:  Daniel T Claiborne; Eileen P Scully; Christine D Palmer; Jessica L Prince; Gladys N Macharia; Jakub Kopycinski; Clive M Michelo; Howard W Wiener; Rachel Parker; Krystelle Nganou-Makamdop; Daniel Douek; Marcus Altfeld; Jill Gilmour; Matt A Price; Jianming Tang; William Kilembe; Susan A Allen; Eric Hunter
Journal:  PLoS Pathog       Date:  2019-08-26       Impact factor: 6.823

Review 7.  Immunogenetic determinants of heterosexual HIV-1 transmission: key findings and lessons from two distinct African cohorts.

Authors:  Jianming Tang
Journal:  Genes Immun       Date:  2021-05-01       Impact factor: 2.676

8.  Host genetics and viral load in primary HIV-1 infection: clear evidence for gene by sex interactions.

Authors:  Xuelin Li; Matthew A Price; Dongning He; Anatoli Kamali; Etienne Karita; Shabir Lakhi; Eduard J Sanders; Omu Anzala; Pauli N Amornkul; Susan Allen; Eric Hunter; Richard A Kaslow; Jill Gilmour; Jianming Tang
Journal:  Hum Genet       Date:  2014-06-27       Impact factor: 4.132

9.  Clinical significance of PI3 and HLA-DOB as potential prognostic predicators for ovarian cancer.

Authors:  Yanrong Li; Haixia Li; Baojun Yang; Jing Wei; Cheng Zhen; Limin Feng
Journal:  Transl Cancer Res       Date:  2020-02       Impact factor: 1.241

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.