| Literature DB >> 28642469 |
Xingguang Li1,2, Haizhou Liu3, Lu Liu4,5, Yi Feng1,2, Marcia L Kalish6, Simon Y W Ho7, Yiming Shao8,9.
Abstract
Human immunodeficiency virus (HIV) has a number of circulating recombinant forms that are the product of recombination between different HIV subtypes. The first circulating recombinant form of HIV-1 to be identified was CRF01_AE, which originated in Central Africa and is now most prevalent in Southeast and East Asia. In this study, we investigated the timescale, evolutionary history, and population genetics of the HIV-1 CRF01_AE strains primarily responsible for the epidemic in Asia. A further aim of our study was to define and standardize the nomenclature and provide well-characterized reference sequences for the phylogenetic transmission clusters of CRF01_AE. We analysed a data set of 334 near-complete genome sequences from various risk groups, sampled between 1990 and 2011 from nine countries. Phylogenetic analyses of these sequences were performed using maximum likelihood and Bayesian methods. Our study confirms that the diversity of HIV-1 CRF01_AE originated in Central Africa in the mid-1970s, was introduced into Thailand between 1979 and 1982, and began expanding there shortly afterwards (1982-1984). Subsequently, multiple clusters significantly contributed to China's HIV epidemic. A Bayesian skyline plot revealed the rapid expansion of CRF01_AE in China around 1999-2000. We identified at least eight different clusters of HIV-1 CRF01_AE formed by rapid expansion into different risk groups and geographic regions in China since the late 1980s.Entities:
Mesh:
Year: 2017 PMID: 28642469 PMCID: PMC5481428 DOI: 10.1038/s41598-017-03820-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Geographic source, sampling year, and risk factor for HIV-1 CRF01_AE strains analysed in the present study.
| Geographic source | Sampling year | na | Risk factorb | |||||
|---|---|---|---|---|---|---|---|---|
| Hetero | IDU | MSM | MTCT | ST | n/a | |||
| China | 1997–2011 | 154 (4) | 56 (1) | 21 | 56 (3) | 3 | 9 | 9 |
| Vietnam | 1997–1998 | 33 | 17 | 16 | ||||
| Afghanistan | 2007 | 1 | 1 | |||||
| Central African Republic | 1990 | 3 | 1 | 2 | ||||
| Hong Kong | 2004 | 1 | 1 | |||||
| Indonesia | 1993 | 1 | 1 | |||||
| Japan | 1993–2000 | 2 | 2 | |||||
| Thailand | 1990–2009 | 134 | 14 | 30 | 1 | 89 | ||
| United States | 1998–2005 | 5 | 4 | 1 | ||||
|
|
|
|
|
|
|
|
| |
aThe numbers of HIV-1 CRF01_AE sequences newly reported in this study are shown in parentheses.
bRisk group: Hetero, heterosexual; IDU, injecting drug user; MSM, men who have sex with men; MTCT, mother-to-child transmission; ST, sexual transmission, unspecified type; n/a, not available.
Classification and risk factor of distinct HIV-1 CRF01_AE clusters analysed in the present study.
| CRF01_AE cluster | na | Risk factorb | |||||
|---|---|---|---|---|---|---|---|
| Hetero | IDU | MSM | MTCT | ST | n/a | ||
| CRF01_1AE | 40 (1) | 24 (1) | 14 | 2 | |||
| CRF01_2AE | 26 | 8 | 18 | ||||
| CRF01_3AE | 3 | 1 | 1 | 1 | |||
| CRF01_4AE | 25 | 1 | 1 | 21 | 1 | 1 | |
| CRF01_5AE | 37 (3) | 2 | 34 (3) | 1 | |||
| CRF01_6AE | 4 | 4 | |||||
| CRF01_7AE | 3 | 3 | |||||
| CRF01_8AE | 5 | 3 | 1 | 1 | |||
| CRF01_9AE | 3 | 3 | |||||
| CRF01_10AE | 5 | 5 | |||||
| Ungrouped | 183 | 49 | 25 | 1 | 4 | 7 | 97 |
|
|
|
|
|
|
|
|
|
aNumbers of HIV-1 CRF01_AE sequences newly reported in our study are shown in parentheses.
bRisk group: Hetero, heterosexual; IDU, injecting drug user; MSM, men who have sex with men; MTCT, mother-to-child transmission; ST, sexual transmission, unspecified type; n/a, not available.
Classification and sampling year of distinct HIV-1 CRF01_AE clusters analysed in the present study.
| CRF01_AE cluster | na | Sampling year | ||||
|---|---|---|---|---|---|---|
| 1990–1994 | 1995–1999 | 2000–2004 | 2005–2009 | 2010–2011 | ||
| CRF01_1AE | 40 (1) | 38 | 2 (1) | |||
| CRF01_2AE | 26 | 19 | 7 | |||
| CRF01_3AE | 3 | 3 | ||||
| CRF01_4AE | 25 | 9 | 16 | |||
| CRF01_5AE | 37 (3) | 21 | 16 (3) | |||
| CRF01_6AE | 4 | 4 | ||||
| CRF01_7AE | 3 | 3 | ||||
| CRF01_8AE | 5 | 5 | ||||
| CRF01_9AE | 3 | 2 | 1 | |||
| CRF01_10AE | 5 | 3 | 2 | |||
| Ungrouped | 183 | |||||
|
|
|
|
|
|
| |
aNumbers of HIV-1 CRF01_AE sequences newly reported in our study are shown in parentheses.
Figure 1Maximum-likelihood phylogeny of HIV-1 CRF01_AE strains. Maximum-likelihood phylogeny of near-complete genome sequences of HIV-1 CRF01_AE. The two circles of colored cells show geographic location (inner circle, A) and risk group (outer circle, B).
Figure. 2Geographic distribution of HIV-1 CRF01_AE clusters identified in the present study. The geographic distribution of HIV-1 CRF01_AE clusters is shown at the (A) country level, and at the provincial level for (B) China and (C) Vietnam. Each CRF01_AE cluster identified in this study is color-coded, as shown on the left. Maps were obtained from Craft MAP website (http://www.craftmap.box-i.net/).
Figure 3Regression of the root-to-tip genetic distance against year of sampling for 334 HIV-1 CRF01_AE sequences. Genetic distances are based on the tree in Supplementary Figure S2. Colors indicate different sampling locations.
Figure 4Bayesian skyline demographic reconstruction of HIV-1 CRF01_AE. The vertical axis shows the effective number of infections (N ) multiplied by mean viral generation time (τ). The solid line and shaded region represent the median and 95% credibility interval of N τ through time.
Figure 5Maximum-clade-credibility tree estimated from near-complete genome sequences of HIV-1 CRF01_AE. Sequence names include accession number, geographic location, and year of sampling. Only internal nodes with posterior probability support >0.5 are shown with white, grey, and black circles.
Statistical analysis of geographic location of CRF01_AE sequences.
| Statistic | No. of sequences | Observed mean (95% CI) | Null mean (95% CI) |
|
|---|---|---|---|---|
| AI | 10.9 (10.0, 11.8) | 23.9 (21.7, 26.1) | <0.001* | |
| PS | 82.5 (80.0, 85.0) | 139.7 (132.3, 147.1) | <0.001* | |
| MC (AF) | 1 | 1.0 (1.0, 1.0) | 1.0 (1.0, 1.0) | N/A |
| MC (CF) | 3 | 1.8 (1.0, 2.0) | 1.0 (1.0, 1.0) | 0.005* |
| MC (CN) | 154 | 10.0 (10.0, 10.0) | 4.5 (3.1, 6.1) | 0.001* |
| MC (HK) | 1 | 1.0 (1.0, 1.0) | 1.0 (1.0, 1.0) | N/A |
| MC (ID) | 1 | 1.0 (1.0, 1.0) | 1.0 (1.0, 1.0) | N/A |
| MC (JP) | 2 | 2.0 (2.0, 2.0) | 1.0 (1.0, 1.0) | 0.002* |
| MC (TH) | 134 | 40.0 (40.0, 40.0) | 3.9 (3.0, 5.5) | 0.001* |
| MC (US) | 5 | 2.0 (2.0, 2.0) | 1.0 (1.0, 1.0) | 0.008* |
| MC (VN) | 33 | 5.0 (5.0, 5.0) | 1.7 (1.0, 2.3) | 0.001* |
AI, association index.
PS, parsimony score.
MC, monophyletic clade statistic.
95% CI, 95% credbility interval.
*Statistically significant (P < 0.05).
N/A, not available because of the observed 95% CI contains the null 95% CI.
Statistical analysis of risk group for CRF01_AE sequences.
| Statistic | No. of sequences | Observed mean (95% CI) | Null mean (95% CI) |
|
|---|---|---|---|---|
| AI | 20.1 (19.2, 21.1) | 29.7 (27.7, 31.7) | <0.001* | |
| PS | 133.2 (130.0, 136.0) | 184.4 (176.8, 191.7) | <0.001* | |
| MC (Hetero) | 56 | 5.0 (5.0, 5.0) | 3.0 (2.0, 4.1) | 0.017* |
| MC (IDU) | 21 | 3.0 (3.0, 3.0) | 2.4 (2.0, 3.1) | 0.192 |
| MC (MSM) | 56 | 5.0 (5.0, 5.0) | 2.2 (1.8, 3.0) | 0.002* |
| MC (MTCT) | 3 | 1.0 (1.0, 1.0) | 1.0 (1.0, 1.0) | N/A |
| MC (ST) | 9 | 1.0 (1.0, 1.0) | 1.0 (1.0, 1.0) | N/A |
| MC (n/a) | 9 | 40.0 (40.0, 40.0) | 3.1 (2.1, 4.5) | 0.001* |
AI, association index.
PS, parsimony score.
MC, monophyletic clade statistic.
95% CI, 95% credibility interval.
*Statistically significant (P < 0.05).
Hetero, heterosexual.
IDU, injecting drug user.
MSM, men who have sex with men.
MTCT, mother-to-child transmission.
ST, sexual transmission, unspecified type.
n/a, not available.
N/A, not available because of the observed 95% CI contains the null 95% CI.