| Literature DB >> 27642070 |
Manon L Ragonnet-Cronin1, Mohaned Shilaih2,3, Huldrych F Günthard2,3, Emma B Hodcroft1, Jürg Böni3, Esther Fearnhill4, David Dunn4, Sabine Yerly5, Thomas Klimkait6, Vincent Aubert7, Wan-Lin Yang2, Alison E Brown8, Samantha J Lycett1, Roger Kouyos2,3, Andrew J Leigh Brown1.
Abstract
Phylogenetic clustering approaches can elucidate HIV transmission dynamics. Comparisons across countries are essential for evaluating public health policies. Here, we used a standardised approach to compare the UK HIV Drug Resistance Database and the Swiss HIV Cohort Study while maintaining data-protection requirements. Clusters were identified in subtype A1, B and C pol phylogenies. We generated degree distributions for each risk group and compared distributions between countries using Kolmogorov-Smirnov (KS) tests, Degree Distribution Quantification and Comparison (DDQC) and bootstrapping. We used logistic regression to predict cluster membership based on country, sampling date, risk group, ethnicity and sex. We analysed >8,000 Swiss and >30,000 UK subtype B sequences. At 4.5% genetic distance, the UK was more clustered and MSM and heterosexual degree distributions differed significantly by the KS test. The KS test is sensitive to variation in network scale, and jackknifing the UK MSM dataset to the size of the Swiss dataset removed the difference. Only heterosexuals varied based on the DDQC, due to UK male heterosexuals who clustered exclusively with MSM. Their removal eliminated this difference. In conclusion, the UK and Swiss HIV epidemics have similar underlying dynamics and observed differences in clustering are mainly due to different population sizes.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27642070 PMCID: PMC5027562 DOI: 10.1038/srep32251
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Baseline demographics of the two datasets.
| Sex | Female | UK | CH | ||||
|---|---|---|---|---|---|---|---|
| A1 | B | C | A1 | B | C | ||
| 1442 (57.5%) | 2327 (7.4%) | 3869 (62.2%) | 269 (62%) | 1812 (22%) | 237 (57%) | ||
| Risk | MSM | 170 (6.8%) | 22157 (70.4%) | 660 (4.2%) | 21 (5%) | 3914 (47%) | 25 (6%) |
| HET | 1718 (68.5%) | 3399 (10.8%) | 11893 (75.2%) | 366 (84%) | 2626 (31%) | 357 (85%) | |
| PWID | 92 (3.7%) | 873 (2.8%) | 128 (0.8%) | 12 (3%) | 1553 (19%) | 9 (2%) | |
| Other/NA | 527 (21%) | 5021 (16%) | 3134 (19.8%) | 36 (8%) | 287 (3%) | 28 (7%) | |
| Ethnicity | White | 494 (19.7%) | 22724 (72.3%) | 1724 (10.9%) | 194 (45%) | 7333 (87%) | 118 (28%) |
| Total | 2507 | 31450 | 15815 | 435 | 8390 | 419 | |
Figure 1Proportion of UK (pink) and Swiss (blue) sequences in clusters at different genetic distance (1.5% and 4.5%) and bootstrap (70%, 80%, 90%, 95%) thresholds.
CH - Switzerland.
Unadjusted logistic regression for the comparison of the degree of patients clustering between the UK and Switzerland (Subtype B).
| Bootstrap | Genetic Distance | Covariates | OR | 2.5% | 97.5% |
|---|---|---|---|---|---|
| 0.7 | 0.015 | UK | 1.316 | 1.243 | 1.394 |
| 0.8 | 0.015 | UK | 1.335 | 1.258 | 1.418 |
| 0.9 | 0.015 | UK | 1.396 | 1.308 | 1.492 |
| 0.95 | 0.015 | UK | 1.444 | 1.345 | 1.551 |
| 0.7 | 0.045 | UK | 5.001 | 4.745 | 5.272 |
| 0.8 | 0.045 | UK | 4.119 | 3.904 | 4.347 |
| 0.9 | 0.045 | UK | 3.942 | 3.722 | 4.176 |
| 0.95 | 0.045 | UK | 3.176 | 2.986 | 3.381 |
OR: odds-ratio.
Logistic regression for the comparison of the degree of patients clustering between the UK and Switzerland, corrected for sample year (Subtype B).
| Bootstrap | Genetic Distance | Covariates | OR | 2.5% | 97.5% |
|---|---|---|---|---|---|
| 0.7 | 0.015 | UK | 0.66 | 0.617 | 0.705 |
| Sample year | 1.138 | 1.131 | 1.145 | ||
| 0.8 | 0.015 | UK | 0.665 | 0.621 | 0.713 |
| Sample year | 1.142 | 1.134 | 1.149 | ||
| 0.9 | 0.015 | UK | 0.694 | 0.645 | 0.749 |
| Sample year | 1.146 | 1.138 | 1.154 | ||
| 0.95 | 0.015 | UK | 0.706 | 0.651 | 0.765 |
| Sample year | 1.153 | 1.144 | 1.162 | ||
| 0.7 | 0.045 | UK | 2.692 | 2.538 | 2.856 |
| Sample year | 1.134 | 1.128 | 1.14 | ||
| 0.8 | 0.045 | UK | 2.12 | 1.996 | 2.252 |
| Sample year | 1.145 | 1.139 | 1.151 | ||
| 0.9 | 0.045 | UK | 1.968 | 1.846 | 2.099 |
| Sample year | 1.156 | 1.15 | 1.163 | ||
| 0.95 | 0.045 | UK | 1.551 | 1.448 | 1.662 |
| Sample year | 1.161 | 1.154 | 1.168 |
OR: odds-ratio.
Figure 2Degree distributions of the UK (pink) and Swiss (blue) subtype B epidemics.
MSM - men who have sex with men, HET - heterosexuals. Cluster definition was: genetic distance = 4.5% and bootstrap = 90%. Note that the proportion of individuals of each degree is shown rather than the absolute number of individuals of each degree. The number of clustered individuals was much larger in the UK than in Switzerland.
Figure 3DDQC distances within and between countries.
DDQC – Degree Distribution Quantification and Comparison, CH - Switzerland, MSM - men who have sex with men, HET - heterosexuals. PWID – people who inject drugs. Cluster definition was: genetic distance = 4.5% and bootstrap = 90%. In order to generate null distributions of the expected values for the DDQC, we bootstrapped the Swiss and UK distributions and calculated DDQC values comparing the original datasets to their bootstrap samples (in blue and pink). The top of the coloured bars represent the mean distance of within country comparisons and the whiskers represent the 95% percentiles. The DDQC distance was then calculated between the UK and Swiss degree distributions (black triangles). The distance between countries was considered significant if it exceeded the 95% percentile from the simulated values, which was the case only for HET at 4.5% GD (indicated by *). When we removed heterosexuals who were likely to have been infected through sex with men from the UK dataset, the DDQC distance between the UK and Swiss HET degree distributions fell within the simulated null distribution (orange triangle).
Figure 4Jack-knife and bootstrap sampled degree distributions of the UK (pink) and Swiss (blue) epidemics.
UK subtype B degree distribution for men who have sex with men (MSM), heterosexuals (HET) and the population as a whole (all) were jack-knife sampled 100 times to match the size of the Swiss epidemics (in light pink). The Swiss epidemic was bootstrapped 100 times to its full size (in light blue). Degree distributions are shown on a double-logged scale. Samples overlapped for MSM and the dataset as a whole, but not for HET. Where Swiss replicates cannot be seen they are covered by the UK replicates.