| Literature DB >> 28353537 |
Amin S Hassan1, Oliver G Pybus, Eduard J Sanders, Jan Albert, Joakim Esbjörnsson.
Abstract
: Understanding HIV-1 transmission dynamics is relevant to both screening and intervention strategies of HIV-1 infection. Commonly, HIV-1 transmission chains are determined based on sequence similarity assessed either directly from a sequence alignment or by inferring a phylogenetic tree. This review is aimed at both nonexperts interested in understanding and interpreting studies of HIV-1 transmission, and experts interested in finding the most appropriate cluster definition for a specific dataset and research question. We start by introducing the concepts and methodologies of how HIV-1 transmission clusters usually have been defined. We then present the results of a systematic review of 105 HIV-1 molecular epidemiology studies summarizing the most common methods and definitions in the literature. Finally, we offer our perspectives on how HIV-1 transmission clusters can be defined and provide some guidance based on examples from real life datasets.Entities:
Mesh:
Year: 2017 PMID: 28353537 PMCID: PMC5482559 DOI: 10.1097/QAD.0000000000001470
Source DB: PubMed Journal: AIDS ISSN: 0269-9370 Impact factor: 4.177
Components of HIV-1 transmission cluster definitions based on phylogenetic node support.
| Neighbour-joining |
| Maximum-likelihood |
| Bayesian |
| Jukes–Cantor (JC) |
| Tamura–Nei (TN) |
| General time reversible (GTR) |
| Bootstrap test |
| Approximate likelihood-ratio test (aLRT) |
| Zero-branch length test |
Fig. 1Flow chart showing results from the literature search and inclusion of articles considered in the review.
Three main types of cluster definition.
| Pure phylogenetic transmission cluster definitions based solely on phylogenetic node support |
| Pure distance-based transmission cluster definitions based solely on pairwise genetic distances |
| Combined transmission cluster definitions based on both phylogenetic node support and pairwise genetic distances |
Results of the systematic review of 105 articles employing different strategies to define HIV-1 transmission clusters.
| Categories of cluster definition | Number of articles | Median | Median study period in years (IQR | Median sequence length (IQR | Most analysed genetic region (proportion of articles) | Tree building model used (proportion of articles) | Substitution model used (proportion of articles) | Branch support approach (proportion of articles) | Median cut-off for determining clusters (IQR |
| Phylogenetic | 53 | 219 (96–562) | 6 (2–12) | 1100 (895–1497) | ML (60%) | GTR (73%) | Bootstrap (71%) | 90% (75–90%) | |
| Distance-based | 7 | 2747 (179–40950) | 11 (6–16) | 900 (500–1800) | NA | TN (100%) | NA | 0.015 Substitutions/site (0.014–0.019) | |
| Distance-based & phylogenetic | 45 | 534 (131–1413) | 7 (2–12) | 1150 (915–1308) | ML (76%) | GTR (74%) | Bootstrap (86%) | 90% (90–98%), 0.015 Substitutions/site (0.015–0.038) |
aPairwise comparisons using the Mann–Whitney U test: phylogenetic vs. distance-based, P = 0.036; phylogenetic vs. distance-based & phylogenetic, P = 0.030; distance-based vs. distance-based & phylogenetic, P = 0.13.
bInterquartile range.
cThe most commonly used methodology are presented with the proportion of articles in which this methodology was used.
dML, maximum likelihood; NA, not applicable.
eGTR, general time reversible; TN, Tamura–Nei.
Fig. 2Number of articles stratified by strategy of HIV-1 transmission cluster determination in relation to availability and number of analysed sequences during the study period.
Fig. 3Comparison between phylogenetic cluster definitions employing a branch support criteria with or without different distance thresholds.