Literature DB >> 28182649

Estimating the Respective Contributions of Human and Viral Genetic Variation to HIV Control.

István Bartha1,2, Paul J McLaren3,4, Chanson Brumme5, Richard Harrigan5, Amalio Telenti6, Jacques Fellay1,2.   

Abstract

We evaluated the fraction of variation in HIV-1 set point viral load attributable to viral or human genetic factors by using joint host/pathogen genetic data from 541 HIV infected individuals. We show that viral genetic diversity explains 29% of the variation in viral load while host factors explain 8.4%. Using a joint model including both host and viral effects, we estimate a total of 30% heritability, indicating that most of the host effects are reflected in viral sequence variation.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28182649      PMCID: PMC5300119          DOI: 10.1371/journal.pcbi.1005339

Source DB:  PubMed          Journal:  PLoS Comput Biol        ISSN: 1553-734X            Impact factor:   4.475


Introduction

There are differences in the rate of disease progression among individuals infected with HIV. An easy to measure and reliable correlate of disease progression is the mean log viral load (HIV RNA copies per ml of plasma). The viral load measured during the chronic phase of infection (referred to as setpoint viral load, spVL) exhibits large variation in a population. Several studies have been carried out to elucidate whether this variation is primarily driven by host genetics [1-4], viral genetics [5-9], or environmental effects [7]. Genome-wide association studies consistently show that amino acid polymorphisms in the peptide binding groove of the HLA-A and HLA–B proteins are associated with the viral load of an individual. Furthermore, variants in the HLA-C and CCR5 genes have also been shown to impact spVL. However, those host factors explain less than 15% of the observed phenotypic variance [4]. In contrast, viral genetic studies and studies of donor-recipient transmission pairs established that 33% of the phenotypic variance is attributable to the transmitted virus itself [5, 10–13]. HIV is an extremely variable and adaptive organism with a rapid replication time, and high rates of mutation. Within-host evolution of the viral population occurs during the chronic phase of infection in which the pathogen adapts to its host environment. Several studies showed that a major proportion of the viral sequence is under selective pressure in the host environment, and several viral amino acid changes are associated with host genetic variants in the Human Leukocyte Antigen (HLA) genes [14, 15]. Viral strains harbor epitope sequences that can be presented by HLA class I proteins of the infected host, which allows the detection and killing of infected cells. The viral population evades detection through escape mutations that modify the epitope sequence but may incur a fitness cost. Compensatory mutations may follow until the viral population reaches its optimal place in a sequence space constrained by the host immune system [16]. There are two main different approaches to viral heritability estimation in the literature. The first one is based on the regression of phenotypic values in donor-recipient transmission pairs, while the other quantifies the difference between the observed phenotypic variance-covariance structure and the phylogenetic variance-covariance structure. Because our study population did not include donor-recipient data, we used the latter strategy. In particular we used linear mixed models (LMMs) to explain inter-patient differences in spVL while taking into account host and viral genetic relatedness. LMMs use the pairwise relatedness of individuals with respect to a large set of features (rather than the individual data points) to estimate the fraction of phenotypic variance attributable to those features. Such models have been successfully applied to estimate narrow-sense heritability from genome-wide genotype data [17]. Concurrently, LMMs were proposed to incorporate phylogenetic relatedness between samples in comparative analyses [18], a technique that was further developed to estimate the viral genetic contribution to spVL [6, 8].

Results

To estimate the respective contribution of host and viral genetics to the variation in spontaneous HIV control, we collected paired viral/host genotypes along with spVL measurements from 541 chronically infected individuals enrolled in two prospective cohort studies in Switzerland and in Canada. We estimated the respective contributions of host and viral genetics to spVL by defining two relatedness measures, one with respect to the host genotypes, the other with respect to the viral genotypes, and used these jointly in a linear mixed model. On the host side, we focused on amino acid variations in the HLA-A, B and C genes due to their established associations with HIV control [1]. In particular, we used 33 amino acid polymorphisms selected by L1 regularized regression [19] to represent the genetic relatedness of the host (). Principal component analysis based on host genome-wide genotype data confirmed the lack of major population stratification in the host sample. We built three LMMs, one containing human variants, one derived from phylogenetic trees, and one including both host and virus information (). The genetic relatedness matrix created from 33 amino acid polymorphisms of the human class I HLA genes explained 8.4% (SD = 4%) of the observed variance in spVL. In contrast, 28.8% (SD = 11%) of phenotypic variation was attributable to the viral phylogenetic tree. Combining the two relatedness matrices in one model yielded a total variance explained of 29.9% (SD = 12%), less than the sum of the latter two models. Thus, we show that HLA polymorphisms do not explain additional phenotypic variance beyond viral sequence variation. We next assessed the contribution of viral variants most likely to have an impact on spVL. These included amino acids in known CTL epitopes [20] and those positions whose variation is associated with host polymorphisms [14] (82%, 60% and 84% of gag, pol, nef codons respectively, ). We used phylogenetic trees built from those codons to show that viral variation in epitopes or other HLA-associated positions explain 23.6% (SD = 11%) of phenotypic variance. However, this explained fraction might be overestimated due to linkage disequilibrium on the viral haplotype. We therefore repeated the analysis after randomly picking 70% of variable viral positions, and obtained very similar results. We thus cannot conclude that viral variants in known epitopes contribute disproportionately to variance in spVL. Additional evidence for the existence of substantial linkage disequilibrium on the viral haplotype comes from the analysis of the smaller, complementary set of variable viral positions (located in non-epitope regions), which explained 18.5% (SD = 10%) of the phenotypic variance. This leads to lower bounds of 11.4% and 6.3% of variance in spVL explained by variation in epitope and non-epitope regions, respectively, leaving 12.2% of variance unresolved due to linkage disequilibrium.

Discussion

By jointly analyzing host and viral genetic relatedness, we here provide estimates of the total and respective contributions of human and viral genetic variation to HIV control. Our results do not challenge the current consensus estimates of the host or viral contributions to spVL. Nevertheless, our combined analysis demonstrates that human HLA polymorphisms do not explain additional variance in spVL once viral genetic diversity is taken into account. The difference between the variance explained by viral phylogeny and the variance explained by HLA polymorphisms may be attributed to two effects. First, selected viral variants might provide a better surrogate of the impact of the host genotype than the imputed host amino acid variants we used. Rare host genetic factors outside of the major histocompatibility complex region (e.g. the CCR5 deletion), as well as environmental interactions may influence viral fitness, and these effects are not accounted for in our estimate of host heritability. Thus some host effects might be missed from the host partition, while their footprint in the virus is still detected in the viral partition. Second, the difference could partly be due to the effect of viral variation independent of the current host, including transmitted escape mutations, i.e. viral sequence variation carried over from the previous host, rather than induced by the current host. Indeed, a recent study showed that spVL is dependent on the degree of pre-adaptation of the viral strain to the HLA class I genotype of the current host [21]. In particular, an increase in the frequency of pre-existing escape mutations, at the population level, led to higher viral heritability estimates. This indicates that both host and viral estimates of heritability depend on the amount of pre-adaptation in the sample population, which varies based on the level of HLA diversity. It has also been shown that reversion of some fitness reducing escape variants is very slow, potentially allowing for a transitory but measurable effect on viral load at the population level [15, 22]. A limitation of our study is the fact that study participants were collected from two cohorts. To reduce batch effect, we included a cohort-specific variable in all our models. Still, differences in inclusion criteria, health system, geographical exposure and other factors are very likely to increase environmental variance, thus negatively impacting our heritability estimates. Another potential shortcoming is our implicit assumption of the absence of selection on spVL, which might be incorrect, as suggested by recent studies [23, 24], and might thus lead to over- or under-estimation of heritability due to model misspecification. Still, because our estimates are comparable to results obtained in donor-recipient transmission studies and in host-genetic studies, we conclude that they are useful for the purpose of delineating the respective amounts of host and viral contributions to phenotypic variation of HIV spVL. In conclusion, our results suggest that host genetic association studies not taking the virus into account underestimate the population level effect of host genetic variation. Combining host and pathogen data provides additional insight into the genetic determinants of the clinical outcome of HIV infection, which can serve as a model for other chronic infectious diseases.

Materials and Methods

Ethics statement

All participants were HIV-1-infected adults, and written informed consent for genetic testing was obtained from all individuals as part of the original study in which they were enrolled. Ethical approval was obtained from institutional review boards for each of the respective contributing centers.

Data collection

Bulk sequences of the HIV-1 gag, pol and nef genes, human genome-wide genotyping data and viral load measurements were obtained for 541 individuals of Western European ancestry infected with HIV-1 Subtype B, and followed in the Swiss HIV Cohort Study (SHCS, www.shcs.ch) or in the HAART Observational Medical Evaluation and Research study in Canada (HOMER, www.cfenet.ubc.ca/our-work/initiatives/homer) [14]. Viral sequences data were generated from samples collected two to five years after infection (for SHCS) or during chronic infection (for HOMER) but prior to the initiation of antiretroviral therapy. Thus, the viral genotypes reflect the result of natural adaptation of the pathogen to the host environment. The viral sequences for 1262, 2187 and 548 nucleotides of the gag, pol and nef genes were available for at least 80% of samples studied. The analysis was limited to these three genes because sequences of the rest of the retroviral genome were not available for the majority of study samples. Overlapping viral genomic regions were excluded from gag, to avoid duplicated sequences in the analysis. Human DNA samples were genotyped in the context of previous genome-wide association studies. High-resolution HLA class I typing (4 digits; HLA-A, HLA-B, and HLA-C) was imputed from the genome-wide genotyping data as described previously [14]. Set point viral load (spVL) was defined as the average of the log10-transformed numbers of HIV-1 RNA copies per ml of plasma obtained in the absence of antiretroviral therapy, excluding VL measured in the first 6 months after seroconversion and during periods of advanced immunosuppression (i.e., with <100 CD4+ T cells per ul of blood). The distributions of spVL in the two cohorts are shown in .

Viral genetic relatedness

The pairwise genetic relatedness of the dominant viral strains observed in the samples was calculated from phylogenetic trees similarly to [6]. Nucleotide sequences were translated to amino acid sequences, which were in turn aligned with MUSCLE [25] and used to derive the correct codon-aware nucleotide alignment. The phylogenetic tree was built from the aligned nucleotide sequences using RAxML [26] with the following command line: “raxml -w {PATH} -s {PATH} -m GTRCAT -f a -N 30 -k -n {NAME} -T {NUMBER} -x 1234 -p 1234”. Individual sequences were then rooted to the HIV-1 group M ancestral sequence, downloaded from the Los Alamos sequence database. Using an HIV-1 subtype C sequence as outgroup led to similar results. The whole tree was scaled with the inverse of the median height of the branches. We followed the method of Hodcroft et al, to create a relatedness matrix from a phylogenetic tree [6]. The genetic relatedness of two samples in a given phylogenetic tree is the amount of shared ancestry, i.e. the distance from the root of the tree (excluding the outgroup) to their most recent common ancestor [27].

Host genetic relatedness

We selected 33 amino acid variants with L1-regularized regression (LASSO) out of all polymorphisms in the HLA-A, B and C genes and used them to generate a genetic relatedness matrix as described in [17]. Our relatively small sample size made it necessary to use a small subset of selected markers rather than genome-wide variant information to create the genetic relatedness matrix. Doing otherwise would have resulted in very large errors of the estimates.

Heritability estimations

To estimate heritability, we used the gcta software as a generic implementation of the linear mixed model [17]. In such a framework, a multivariate Gaussian distribution models HIV viral load with a variance-covariance matrix consisting of the linear combination of the sample-sample genetic relatedness matrices (one for the host and one for the virus) and the identity matrix (representing sample-specific noise). The total heritability estimate is the fraction of variance explained by the genetic relatedness matrices over the total variance. All models included a binary variable indicating cohort as a fixed effect. Variance components were estimated by restricted maximum likelihood.

Distribution of HIV setpoint viral load values in the Swiss (SHCS) and Canadian (HOMER) cohorts.

(PNG) Click here for additional data file.

List of human amino acid variants in HLA-I genes selected by L1 regularized regression and used throughout the paper.

(XLSX) Click here for additional data file.

List of MHC-associated HIV amino acid positions based on epitope maps (20) and previous association studies (14).

(TXT) Click here for additional data file.
  23 in total

1.  The phylogenetic mixed model.

Authors:  Elizabeth A Housworth; Emília P Martins; Michael Lynch
Journal:  Am Nat       Date:  2004-01-28       Impact factor: 3.926

2.  HIV-1 transmitting couples have similar viral load set-points in Rakai, Uganda.

Authors:  T Déirdre Hollingsworth; Oliver Laeyendecker; George Shirreff; Christl A Donnelly; David Serwadda; Maria J Wawer; Noah Kiwanuka; Fred Nalugoda; Aleisha Collinson-Streng; Victor Ssempijja; William P Hanage; Thomas C Quinn; Ronald H Gray; Christophe Fraser
Journal:  PLoS Pathog       Date:  2010-05-06       Impact factor: 6.823

3.  Phylogenetic approach reveals that virus genotype largely determines HIV set-point viral load.

Authors:  Samuel Alizon; Viktor von Wyl; Tanja Stadler; Roger D Kouyos; Sabine Yerly; Bernard Hirschel; Jürg Böni; Cyril Shah; Thomas Klimkait; Hansjakob Furrer; Andri Rauch; Pietro L Vernazza; Enos Bernasconi; Manuel Battegay; Philippe Bürgisser; Amalio Telenti; Huldrych F Günthard; Sebastian Bonhoeffer
Journal:  PLoS Pathog       Date:  2010-09-30       Impact factor: 6.823

4.  Polymorphisms of large effect explain the majority of the host genetic contribution to variation of HIV-1 virus load.

Authors:  Paul J McLaren; Cedric Coulonges; István Bartha; Tobias L Lenz; Aaron J Deutsch; Arman Bashirova; Susan Buchbinder; Mary N Carrington; Andrea Cossarizza; Judith Dalmau; Andrea De Luca; James J Goedert; Deepti Gurdasani; David W Haas; Joshua T Herbeck; Eric O Johnson; Gregory D Kirk; Olivier Lambotte; Ma Luo; Simon Mallal; Daniëlle van Manen; Javier Martinez-Picado; Laurence Meyer; José M Miro; James I Mullins; Niels Obel; Guido Poli; Manjinder S Sandhu; Hanneke Schuitemaker; Patrick R Shea; Ioannis Theodorou; Bruce D Walker; Amy C Weintrob; Cheryl A Winkler; Steven M Wolinsky; Soumya Raychaudhuri; David B Goldstein; Amalio Telenti; Paul I W de Bakker; Jean-François Zagury; Jacques Fellay
Journal:  Proc Natl Acad Sci U S A       Date:  2015-11-09       Impact factor: 11.205

5.  Host genetic and viral determinants of HIV-1 RNA set point among HIV-1 seroconverters from sub-saharan Africa.

Authors:  Romel D Mackelprang; Mary Carrington; Katherine K Thomas; James P Hughes; Jared M Baeten; Anna Wald; Carey Farquhar; Kenneth Fife; Mary S Campbell; Saida Kapiga; Xiaojiang Gao; James I Mullins; Jairam R Lingappa
Journal:  J Virol       Date:  2014-12-03       Impact factor: 5.103

6.  A strong case for viral genetic factors in HIV virulence.

Authors:  Viktor Müller; Christophe Fraser; Joshua T Herbeck
Journal:  Viruses       Date:  2011-03-08       Impact factor: 5.818

7.  How effectively can HIV phylogenies be used to measure heritability?

Authors:  George Shirreff; Samuel Alizon; Anne Cori; Huldrych F Günthard; Oliver Laeyendecker; Ard van Sighem; Daniela Bezemer; Christophe Fraser
Journal:  Evol Med Public Health       Date:  2013-09-13

8.  A genome-to-genome analysis of associations between human genetic variation, HIV-1 sequence diversity, and viral control.

Authors:  István Bartha; Jonathan M Carlson; Chanson J Brumme; Paul J McLaren; Zabrina L Brumme; Mina John; David W Haas; Javier Martinez-Picado; Judith Dalmau; Cecilio López-Galíndez; Concepción Casado; Andri Rauch; Huldrych F Günthard; Enos Bernasconi; Pietro Vernazza; Thomas Klimkait; Sabine Yerly; Stephen J O'Brien; Jennifer Listgarten; Nico Pfeifer; Christoph Lippert; Nicolo Fusi; Zoltán Kutalik; Todd M Allen; Viktor Müller; P Richard Harrigan; David Heckerman; Amalio Telenti; Jacques Fellay
Journal:  Elife       Date:  2013-10-29       Impact factor: 8.140

9.  The contribution of viral genotype to plasma viral set-point in HIV infection.

Authors:  Emma Hodcroft; Jarrod D Hadfield; Esther Fearnhill; Andrew Phillips; David Dunn; Siobhan O'Shea; Deenan Pillay; Andrew J Leigh Brown
Journal:  PLoS Pathog       Date:  2014-05-01       Impact factor: 6.823

10.  Common genetic variation and the control of HIV-1 in humans.

Authors:  Jacques Fellay; Dongliang Ge; Kevin V Shianna; Sara Colombo; Bruno Ledergerber; Elizabeth T Cirulli; Thomas J Urban; Kunlin Zhang; Curtis E Gumbs; Jason P Smith; Antonella Castagna; Alessandro Cozzi-Lepri; Andrea De Luca; Philippa Easterbrook; Huldrych F Günthard; Simon Mallal; Cristina Mussini; Judith Dalmau; Javier Martinez-Picado; José M Miro; Niels Obel; Steven M Wolinsky; Jeremy J Martinson; Roger Detels; Joseph B Margolick; Lisa P Jacobson; Patrick Descombes; Stylianos E Antonarakis; Jacques S Beckmann; Stephen J O'Brien; Norman L Letvin; Andrew J McMichael; Barton F Haynes; Mary Carrington; Sheng Feng; Amalio Telenti; David B Goldstein
Journal:  PLoS Genet       Date:  2009-12-24       Impact factor: 5.917

View more
  12 in total

1.  Pathogen Genetic Control of Transcriptome Variation in the Arabidopsis thaliana - Botrytis cinerea Pathosystem.

Authors:  Nicole E Soltis; Celine Caseys; Wei Zhang; Jason A Corwin; Susanna Atwell; Daniel J Kliebenstein
Journal:  Genetics       Date:  2020-03-12       Impact factor: 4.562

2.  Two-way mixed-effects methods for joint association analysis using both host and pathogen genomes.

Authors:  Miaoyan Wang; Fabrice Roux; Claudia Bartoli; Carine Huard-Chauveau; Christopher Meyer; Hana Lee; Dominique Roby; Mary Sara McPeek; Joy Bergelson
Journal:  Proc Natl Acad Sci U S A       Date:  2018-05-30       Impact factor: 11.205

3.  Immune Control of HIV.

Authors:  Muthukumar Balasubramaniam; Jui Pandhare; Chandravanu Dash
Journal:  J Life Sci (Westlake Village)       Date:  2019-06

4.  Association Between Single-Nucleotide Polymorphisms in HLA Alleles and Human Immunodeficiency Virus Type 1 Viral Load in Demographically Diverse, Antiretroviral Therapy-Naive Participants From the Strategic Timing of AntiRetroviral Treatment Trial.

Authors:  Christina Ekenberg; Man-Hung Tang; Adrian G Zucco; Daniel D Murray; Cameron Ross MacPherson; Xiaojun Hu; Brad T Sherman; Marcelo H Losso; Robin Wood; Roger Paredes; Jean-Michel Molina; Marie Helleberg; Nureen Jina; Cissy M Kityo; Eric Florence; Mark N Polizzotto; James D Neaton; H Clifford Lane; Jens D Lundgren
Journal:  J Infect Dis       Date:  2019-09-13       Impact factor: 5.226

5.  Human Immunotypes Impose Selection on Viral Genotypes Through Viral Epitope Specificity.

Authors:  Migle Gabrielaite; Marc Bennedbæk; Adrian G Zucco; Christina Ekenberg; Daniel D Murray; Virginia L Kan; Giota Touloumi; Linos Vandekerckhove; Dan Turner; James Neaton; H Clifford Lane; Sandra Safo; Alejandro Arenas-Pinto; Mark N Polizzotto; Huldrych F Günthard; Jens D Lundgren; Rasmus L Marvig
Journal:  J Infect Dis       Date:  2021-12-15       Impact factor: 5.226

Review 6.  Host genetic variation and HIV disease: from mapping to mechanism.

Authors:  Vivek Naranbhai; Mary Carrington
Journal:  Immunogenetics       Date:  2017-07-10       Impact factor: 2.846

7.  Viral genetic variation accounts for a third of variability in HIV-1 set-point viral load in Europe.

Authors:  François Blanquart; Chris Wymant; Marion Cornelissen; Astrid Gall; Margreet Bakker; Daniela Bezemer; Matthew Hall; Mariska Hillebregt; Swee Hoe Ong; Jan Albert; Norbert Bannert; Jacques Fellay; Katrien Fransen; Annabelle J Gourlay; M Kate Grabowski; Barbara Gunsenheimer-Bartmeyer; Huldrych F Günthard; Pia Kivelä; Roger Kouyos; Oliver Laeyendecker; Kirsi Liitsola; Laurence Meyer; Kholoud Porter; Matti Ristola; Ard van Sighem; Guido Vanham; Ben Berkhout; Paul Kellam; Peter Reiss; Christophe Fraser
Journal:  PLoS Biol       Date:  2017-06-12       Impact factor: 8.029

Review 8.  Interaction of the Host and Viral Genome and Their Influence on HIV Disease.

Authors:  Riley H Tough; Paul J McLaren
Journal:  Front Genet       Date:  2019-01-23       Impact factor: 4.599

9.  Dissecting HIV Virulence: Heritability of Setpoint Viral Load, CD4+ T-Cell Decline, and Per-Parasite Pathogenicity.

Authors:  Frederic Bertels; Alex Marzel; Gabriel Leventhal; Venelin Mitov; Jacques Fellay; Huldrych F Günthard; Jürg Böni; Sabine Yerly; Thomas Klimkait; Vincent Aubert; Manuel Battegay; Andri Rauch; Matthias Cavassini; Alexandra Calmy; Enos Bernasconi; Patrick Schmid; Alexandra U Scherrer; Viktor Müller; Sebastian Bonhoeffer; Roger Kouyos; Roland R Regoes
Journal:  Mol Biol Evol       Date:  2018-01-01       Impact factor: 16.240

Review 10.  HIV and the tuberculosis "set point": how HIV impairs alveolar macrophage responses to tuberculosis and sets the stage for progressive disease.

Authors:  Sara C Auld; Bashar S Staitieh
Journal:  Retrovirology       Date:  2020-09-23       Impact factor: 4.602

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.