| Literature DB >> 28604782 |
François Blanquart1, Chris Wymant1,2, Marion Cornelissen3, Astrid Gall4, Margreet Bakker3, Daniela Bezemer5, Matthew Hall2, Mariska Hillebregt5, Swee Hoe Ong4, Jan Albert6,7, Norbert Bannert8, Jacques Fellay9,10, Katrien Fransen11, Annabelle J Gourlay12, M Kate Grabowski13, Barbara Gunsenheimer-Bartmeyer14, Huldrych F Günthard15,16, Pia Kivelä17, Roger Kouyos15,16, Oliver Laeyendecker18, Kirsi Liitsola19, Laurence Meyer20, Kholoud Porter12, Matti Ristola19, Ard van Sighem5, Guido Vanham21, Ben Berkhout3, Paul Kellam22,23, Peter Reiss5,24, Christophe Fraser1,2.
Abstract
HIV-1 set-point viral load-the approximately stable value of viraemia in the first years of chronic infection-is a strong predictor of clinical outcome and is highly variable across infected individuals. To better understand HIV-1 pathogenesis and the evolution of the viral population, we must quantify the heritability of set-point viral load, which is the fraction of variation in this phenotype attributable to viral genetic variation. However, current estimates of heritability vary widely, from 6% to 59%. Here we used a dataset of 2,028 seroconverters infected between 1985 and 2013 from 5 European countries (Belgium, Switzerland, France, the Netherlands and the United Kingdom) and estimated the heritability of set-point viral load at 31% (CI 15%-43%). Specifically, heritability was measured using models of character evolution describing how viral load evolves on the phylogeny of whole-genome viral sequences. In contrast to previous studies, (i) we measured viral loads using standardized assays on a sample collected in a strict time window of 6 to 24 months after infection, from which the viral genome was also sequenced; (ii) we compared 2 models of character evolution, the classical "Brownian motion" model and another model ("Ornstein-Uhlenbeck") that includes stabilising selection on viral load; (iii) we controlled for covariates, including age and sex, which may inflate estimates of heritability; and (iv) we developed a goodness of fit test based on the correlation of viral loads in cherries of the phylogenetic tree, showing that both models of character evolution fit the data well. An overall heritability of 31% (CI 15%-43%) is consistent with other studies based on regression of viral load in donor-recipient pairs. Thus, about a third of variation in HIV-1 virulence is attributable to viral genetic variation.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28604782 PMCID: PMC5467800 DOI: 10.1371/journal.pbio.2001855
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Fig 1Predictions from the Brownian motion (BM) and Ornstein-Uhlenbeck (OU) models of evolution.
(A) Illustration of the models of character evolution on a phylogeny (top panel), showing unconstrained neutral evolution leading to increasing genetic variance under the BM model (middle panel) versus stabilizing selection around an optimum θ under the OU model, which results in stable variance over time (bottom panel). Edges of the phylogeny were arbitrarily colored for illustrative purposes. (B) The distribution of gold standard viral load (GSVL) over evolutionary time (as quantified by root-to-tip distance [i.e., distance from the common ancestor as assessed by the phylogeny]). Points are the data; boxplots show the median, lower, and upper quartiles, and the whiskers are the lower and upper quartile minus or plus 1.5 times the interquartile range for 8 bins of equal size. (C) The correlation coefficient of GSVL across 511 phylogenetic cherries in the subtype B phylogeny as a function of the patristic distance between cherries. Phylogenetic cherries were grouped by patristic distance in 10 bins of equal size. Points are the data, the dashed line is a decreasing exponential fit on the data, and thick lines show predictions from the maximum likelihood (ML) BM and OU models. The large points at patristic distance 0 show the population-level heritability estimated under the BM (blue) and OU (red) model. The data used in the figure are provided as S1 Data.
Summary of model fit for the subset of subtype B viruses and for all subtypes.
| Subtype | Model | Measure | AIC | AIC weight | VE | σ2 | α | Optimum θ | h2 | |
|---|---|---|---|---|---|---|---|---|---|---|
| B | NULL | GSVL | 1,581 | 3,461.7 | 0 | 0.52 (0.47–0.55) | - | - | - | 0 (0–0) |
| BM | GSVL | 1,581 | 3,442.1 | 0.29 | 0.43 (0.39–0.47) | 0.5 (0.22–0.76) | - | - | 0.17 (0.08–0.26) | |
| OU | GSVL | 1,581 | 3,440.3 | 0.71 | 0.35 (0.29–0.43) | 3 (0.96–4.8) | 8.5 (2.4–10) | 4.4 (3.7–5.1) | 0.31 (0.15–0.43) | |
| NULL | SPVL | 1,581 | 3,355.6 | 0 | 0.48 (0.44–0.51) | - | - | - | 0 (0–0) | |
| BM | SPVL | 1,581 | 3,343.2 | 0.66 | 0.42 (0.37–0.46) | 0.35 (0.14–0.54) | - | - | 0.13 (0.05–0.2) | |
| OU | SPVL | 1,581 | 3,344.5 | 0.34 | 0.37 (0.29–0.42) | 1.7 (0.44–3.5) | 7.6 (1.2–10) | 4.1 (3.5–4.9) | 0.21 (0.1–0.36) | |
| NULL | CD4 slope | 1,170 | 377 | 0.02 | 0.08 (0.067–0.096) | - | - | - | 0 (0–0) | |
| BM | CD4 slope | 1,170 | 370.8 | 0.55 | 0.07 (0.063–0.08) | 0.048 (1e-06–0.088) | - | - | 0.11 (0–0.19) | |
| OU | CD4 slope | 1,170 | 371.3 | 0.43 | 0.071 (0.056–0.079) | 0.043 (1e-06–0.37) | 0.095 (1.1e-06–10) | –4.9 (–520000 to 0.041) | 0.1 (0.01–0.27) | |
| All | NULL | GSVL | 2,028 | 4,474.8 | 0 | 0.53 (0.49–0.55) | - | - | - | 0 (0–0) |
| BM | GSVL | 2,028 | 4,463.3 | 0 | 0.45 (0.41–0.49) | 0.38 (0.19–0.62) | - | - | 0.17 (0.09–0.29) | |
| OU | GSVL | 2,028 | 4,451.6 | 1 | 0.35 (0.29–0.43) | 3.6 (1.6–5.1) | 10 (6.9–10) | 4.2 (3.9–4.6) | 0.32 (0.18–0.44) | |
| NULL | SPVL | 2,028 | 4,395.4 | 0 | 0.51 (0.47–0.53) | - | - | - | 0 (0–0) | |
| BM | SPVL | 2,028 | 4,388.4 | 0.08 | 0.45 (0.42–0.48) | 0.3 (0.11–0.44) | - | - | 0.14 (0.05–0.22) | |
| OU | SPVL | 2,028 | 4,383.5 | 0.92 | 0.38 (0.32–0.47) | 2.6 (0.32–3.6) | 10 (3.5–10) | 4.2 (3.5–4.7) | 0.24 (0.06–0.34) | |
| NULL | CD4 slope | 1,476 | 471.7 | 0.01 | 0.08 (0.069–0.091) | - | - | - | 0 (0–0) | |
| BM | CD4 slope | 1,476 | 464.3 | 0.45 | 0.072 (0.065–0.08) | 0.033 (0.0016–0.058) | - | - | 0.1 (0.01–0.18) | |
| OU | CD4 slope | 1,476 | 463.9 | 0.54 | 0.069 (0.058–0.077) | 0.081 (1e-06–0.38) | 2.9 (1.6e-06–10) | –0.3 (–170000 to 0.051) | 0.13 (0.01–0.25) |
Maximum likelihood estimates are given with 95% parametric bootstrap confidence intervals, accounting for uncertainty because of finite sample size and uncertainty in phylogenetic reconstruction (N = 100 bootstraps, see text for details). Three phylogenetic models (NULL, BM, and OU) are presented for 3 viral phenotypes: GSVL and SPVL are in units of log10 viral copies/mL of blood and CD4 slope is in units of cells/mm3/day. N is the sample size. AIC is the Akaike Information Criterion; AIC weight for model i (NULL, BM, or OU) is defined as . VE is the environmental variance, including host factors and error variance (but not variability in assay, as the linear regression adjusts for assay). σ2 is the stochastic variance governing evolution of the viral genetic factors along the phylogeny. α is the strength of stabilising selection in the OU model. The optimum θ is the optimal trait in the OU model. h2 is the heritability. BM, Brownian motion; GSVL, gold standard viral load; OU, Ornstein–Uhlenbeck; SPVL, set-point viral load.
Fig 2Maximum likelihood estimates of heritability (points) and bootstrap confidence intervals (segments) for subtype B sequence.
This is shown for the BM and OU model and the GSVL and SPVL measures of viral load for the whole-genome phylogeny; for the OU model and GSVL on the phylogeny inferred only from gag, pol, and env genes (light grey box); and for several stratifications of the data, only when the size of the subset was greater than 400 (dark grey box). CH: Switzerland, NL: the Netherlands; male sex; MSM. The data used in the figure are provided as S1 Data.
Summary of model fit for GSVL, for subtype B, for each gene.
Parameters as in Table 1.
| Model | Gene | AIC | AIC weight | VE | σ2 | α | Optimum θ | h2 | |
|---|---|---|---|---|---|---|---|---|---|
| NULL | 1,569 | 3,378.7 | 0 | 0.5 (0.46–0.53) | - | - | - | 0 (0–0) | |
| BM | 1,569 | 3,351.5 | 0.14 | 0.41 (0.37–0.44) | 0.9 (0.48–1.3) | - | - | 0.17 (0.09–0.26) | |
| OU | 1,569 | 3,347.8 | 0.86 | 0.36 (0.31–0.42) | 3.2 (1.4–4.5) | 10 (3.9–10) | 4.3 (3.7–5) | 0.27 (0.14–0.36) | |
| NULL | 1,377 | 2,814.2 | 0 | 0.45 (0.41–0.48) | - | - | - | 0 (0–0) | |
| BM | 1,377 | 2,790.5 | 0.34 | 0.36 (0.32–0.42) | 0.72 (0.29–1.1) | - | - | 0.19 (0.08–0.28) | |
| OU | 1,377 | 2,789.2 | 0.66 | 0.32 (0.26–0.36) | 2.2 (0.83–3.8) | 7.2 (1.4–10) | 4.3 (3.6–5.1) | 0.28 (0.16–0.39) | |
| NULL | 1,487 | 3,016.3 | 0 | 0.44 (0.39–0.47) | - | - | - | 0 (0–0) | |
| BM | 1,487 | 2,996 | 0.56 | 0.38 (0.32–0.42) | 0.24 (0.085–0.42) | - | - | 0.13 (0.04–0.23) | |
| OU | 1,487 | 2,996.5 | 0.44 | 0.34 (0.25–0.39) | 0.87 (0.3–3.4) | 3.8 (0.87–10) | 4.3 (3.5–5) | 0.21 (0.11–0.39) |
AIC, Akaike Information Criterion; BM, Brownian motion; OU, Ornstein–Uhlenbeck.
Fig 3Maximum likelihood estimates of heritability across the genome.
Heritability was inferred for overlapping windows of 1,000 bp separated by 500 bp for the Ornstein–Uhlenbeck (OU) model (black bullets) and the Brownian motion (BM) model (grey bullets). The horizontal dashed lines are the whole-genome heritability estimates. The 3 colored segments show heritability for gag, pol, and env genes in blue, green, red (for OU only). Confidence intervals (grey and colored regions) reflect phylogenetic uncertainty. The largest heritability is in the region where gag and pol overlap. We also show entropy—a measure of genetic diversity—along the genome (dashed curve and right axis). Entropy at a position was calculated as −Σp log(p), and we show the average entropy over 200-bp windows. The data used in the figure are provided as S1 Data.
Summary of the main characteristics of the cohort.
| GSVL | Year sampled | Country sampled | Transmission mode | Sex | Age at infection | Ethnicity | Subtype | Sequence length (bp) |
|---|---|---|---|---|---|---|---|---|
| Min.: 2.000 | Min.: 1985 | BE: 52 | HEAM/TRANSF: 3 | Female: 295 | Min.: 17 | Black: 36 | B: 1,581 | Min.: 601 |
| 1st Qu.: 4.167 | 1st Qu.: 2004 | CH: 1,005 | HET: 475 | Male: 1,733 | 1st Qu.: 29 | White: 421 | A1: 113 | 1st Qu.: 7,669 |
| Median: 4.675 | Median: 2007 | FR: 378 | HET/IDU: 44 | Median: 35 | Other/Unknown: 1,571 | 02_AG: 107 | Median: 9,018 | |
| Mean: 4.573 | Mean: 2006 | NL: 480 | IDU: 90 | Mean: 36 | 01_AE: 68 | Mean: 8,168 | ||
| 3rd Qu.: 5.066 | 3rd Qu.: 2009 | UK: 113 | MSM: 1343 | 3rd Qu.: 42 | C: 41 | 3rd Qu.: 9,068 | ||
| Max.: 7.002 | Max.: 2015 | MSM/IDU: 1 | Max.: 78 | D: 25 | Max.: 9,639 | |||
| Other/Unknown: 72 | Unknown: 445 | (Other): 93 |
BE, Belgium; CH, Switzerland; FR, France; GSVL, gold standard viral load; HEAM/TRANSF, haemophiliac or blood transfusion; HET, heterosexual; IDU, injecting drug user; MSM, men having sex with men; NL, the Netherlands; UK, United Kingdom.