| Literature DB >> 22574228 |
Chiyu Zhang1, Suresh de Silva, Jian-Hua Wang, Li Wu.
Abstract
Cross-species transmission and adaptation of simian immunodeficiency viruses (SIVs) to humans have given rise to human immunodeficiency viruses (HIVs). HIV type 1 (HIV-1) and type 2 (HIV-2) were derived from SIVs that infected chimpanzee (SIVcpz) and sooty mangabey (SIVsm), respectively. The HIV-1 restriction factor SAMHD1 inhibits HIV-1 infection in human myeloid cells and can be counteracted by the Vpx protein of HIV-2 and the SIVsm lineage. However, HIV-1 and its ancestor SIVcpz do not encode a Vpx protein and HIV-1 has not evolved a mechanism to overcome SAMHD1-mediated restriction. Here we show that the co-evolution of primate SAMHD1 and lentivirus Vpx leads to the loss of the vpx gene in SIVcpz and HIV-1. We found evidence for positive selection of SAMHD1 in orangutan, gibbon, rhesus macaque, and marmoset, but not in human, chimpanzee and gorilla that are natural hosts of Vpx-negative HIV-1, SIVcpz and SIVgor, respectively, indicating that vpx drives the evolution of primate SAMHD1. Ancestral host state reconstruction and temporal dynamic analyses suggest that the most recent common ancestor of SIVrcm, SIVmnd, SIVcpz, SIVgor and HIV-1 was a SIV that had a vpx gene; however, the vpx gene of SIVcpz was lost approximately 3643 to 2969 years ago during the infection of chimpanzees. Thus, HIV-1 could not inherit the lost vpx gene from its ancestor SIVcpz. The lack of Vpx in HIV-1 results in restricted infection in myeloid cells that are important for antiviral immunity, which could contribute to the AIDS pandemic by escaping the immune responses.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22574228 PMCID: PMC3345027 DOI: 10.1371/journal.pone.0037477
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Phylogeny and positive selection of primate SAMHD1 genes.
The actual numbers of n/s changes and ω values (dN/dS, in parentheses) are shown above each branch. N and S are the potential numbers of non-synonymous and synonymous sites, respectively. The thick red lines represent the branches under positive selection.
Phylogenetic analysis by maximum likelihood estimation of the primate SAMHD1 genes.
| Dataset | Model code | lnL | Estimates of parameters | 2Δl | PSS |
|
| M1a (Nearly Neutral) | −3777.65 | p0 = 0.57163 p1 = 0.42837 ω0 = 0.00 ω1 = 1.00 | 22.21 P<0.01 | Not allowed |
| M2a (Positive Selection) | −3766.54 | p0 = 0.61742 p1 = 0.27008 p2 = 0.11250 ω0 = 0.00000 ω1 = 1.00 ω2 = 4.00 | 34P, 63V*, 76A, 283S*, 293N*, 447Q, 482S*, 626M | ||
| M7 (beta) | −3778.69 | p = 0.00503 q = 0.00500 | 24.30 P<0.01 | ||
| M8 (beta&w>1) | −3766.54 | p0 = 0.88519 p = 0.00500 q = 0.01092 (p1 = 0.11481) ω = 3.95 | 32W, 34P*, 63V* 76A*, 107L, 283S**, 293N**, 408R, 447Q*, 482S*, 557V*, 559R, 604V, 618V, 626M* | ||
|
| M1a (Nearly Neutral) | −2680.82 | p0 = 0.99999 p1 = 0.00001 ω0 = 0.38 ω1 = 1.00 | 0 P>0.05 | Not allowed |
| M2a (Positive Selection) | −2680.82 | p0 = 1.00000 p1 = 0.00000 p2 = 0.00000 ω0 = 0.38 ω1 = 1.00 ω2 = 1.00 | None | ||
| M7 (beta) | −2680.82 | p = 61.12811 q = 99.00000 | 0 P>0.05 | Not allowed | |
| M8 (beta&w>1) | −2680.82 | p0 = 0.99999 p = 61.12626 q = 99.00000 (p1 = 0.00001) ω = 1.00 | None | ||
|
| M1a (Nearly Neutral) | −3523.48 | p0 = 0.55026 p1 = 0.44974 ω0 = 0.00 ω1 = 1.00 | 19.57 P<0.01 | Not allowed |
| M2a (Positive Selection) | −3513.69 | p0 = 0.83059 p1 = 0.00000 p2 = 0.16941 ω0 = 0.16398 ω1 = 1.00 ω2 = 3.72 | 34P, 283S*, 293N, 447Q, 626M | ||
| M7 (beta) | −3523.97 | p = 0.00500 q = 0.00746 | 20.55 P<0.01 | Not allowed | |
| M8 (beta&w>1) | −3513.69 | p0 = 0.83094 p = 19.56472 q = 98.97297 (p1 = 0.16906) ω = 3.72 | 32W, 34P*, 63V, 283S**, 293N*, 447Q*, 482S, 557V, 626M* |
lnL, log-likelihood value. 2Δl, the likelihood ratio test statistics (2 delta lambda statistics). The P values represent a level of significance with a χ2 distribution and degrees of freedom = 2 (M1a vs. M2a and M7 vs. M8). Positively selected sites (PSS) were identified with posterior probability (p)≥0.90. One asterisk indicates posterior probability p≥0.95 and two asterisks p≥0.99. The null models do not allow for a site with ω>1. The amino acid sequence of human SAMHD1 was used as the reference standard. The results from M3 (discrete) are not shown since M3 can overestimate the number of positively selected sites.
Figure 2Positively selected residues in primate SAMHD1 proteins.
The black small dots indicate identical residues compared to the human SAMHD1 sequence. Positive selected sites were identified with posterior probabilities >0.90. One and two asterisks indicate posterior probabilities ≥0.95 and 0.99, respectively. The green and pink shadows indicate the SAM and HD domains, respectively.
Numbers of conservative and radical non-synonymous substitutions on the primate branches.
| R | C | Σr | Σc | Σr/R | Σc/C | |
|
| 330.96 | 453.71 | 658.84 | 344 | 1.99 | 0.76 |
|
| 225.76 | 558.90 | 687.84 | 315 | 3.05 | 0.56 |
|
| 541.29 | 243.37 | 358.84 | 644 | 0.67 | 2.65 |
The potential numbers of radical non-synonymous substitutions and conservative non-synonymous substitutions.
The total numbers of radical and conservative non-synonymous substitutions on all branches.
The total radical and conservative non-synonymous substitution ratios of all branches.
Figure 3Phylogenetic tree of pol genes from primate lentiviruses.
Because the Vpx of two SIVrcm isolates from Nigeria and Gabon cannot degrade human SAMHD1 [3], we predicted that Vpx from other SIVs in SIV(Vpx+) sub-clade III may not be able to degrade human SAMHD1. The red solid nodes on the trees represent the most recent common ancestors (MRCA) of corresponding virus strains. The thick pink branch indicates the occurrence of vpx gene loss. Only bootstrap values of >75 are shown at the corresponding nodes. This tree was derived from the sequence analysis of 182 primate lentivirus strains/isolates.
Figure 4Maximum clade credibility tree of HIV-1 and different SIVs based on their RT genes.
Ancestral host states were reconstructed using Bayesian phylogeographic inference framework implemented in the BEAST v1.6.2 package. The host state posterior probabilities of the most recent common ancestors (MRCAs) are shown on the left upper panel. The tree branches are colored according to their respective host species. The red solid nodes on the trees represent the MRCAs of corresponding virus strains. Estimated times of the MRCA are shown at corresponding nodes. BC: before Christ.
Figure 5Schematic representation of evolutionary history and loss of the vpx gene in certain SIVs and HIV-1.
The ancestor virus might have encoded functional Vpx proteins that could degrade SAMHD1. It was likely transmitted to red-capped mangabeys and mandrills prior to the loss of vpx gene and evolved into SIVrcm/mnd among the two primate species. Because the ancestral host state of SIVcpz, SIVrcm, SIBmnd, SIVgor and HIV-1 most likely is SIVcpz in the gag and pol gene regions (Fig. 4, and Fig. S5A), but more likely is SIVrcm or SIVmnd in the env region (Fig. S5B), implying that the loss of vpx in SIVcpz was a result from positive selection of SIV recombination and viral fitness. The vpx gene of SIVcpz was likely lost around 3643 to 2969 years ago, resulting in the emergence of SIVcpz(Vpx−). SIVcpz(Vpx−) was later transmitted to humans and gorillas, evolving into HIV-1 and SIVgor(Vpx−), respectively. The pink arrow indicates the occurrence of vpx gene loss.
Site-model (M7 vs. M8) test for vpx genes from two HIV-2 sub-clades and three different SIV(Vpx+) sub-clades.
| Sub-clade | dN/dS (M0) | Estimates of parameters | lnL | 2Δl | P-value | PSS | ||
| M7 | M8 | M7 | M8 | |||||
|
| 0.32 | p = 0.16255 q = 0.41963 | p0 = 0.98103 p = 0.17910 q = 0.48719 (p1 = 0.01897) ω = 4.73 | −1925.03 | −1915.79 | 18.49 | <0.0001 | Yes |
|
| 0.26 | p = 0.11279 q = 0.30495 | p0 = 0.94395 p = 0.16311 q = 0.61416 (p1 = 0.05605) ω = 2.22 | −1289.45 | −1287.91 | 3.09 | 0.2133 | Yes |
|
| 0.17 | p = 0.30435 q = 1.41758 | p0 = 0.97628 p = 14.36000 q = 99.00000 (p1 = 0.02372) ω = 2.17 | −700.66 | −700.89 | 0.47 | 0.7906 | Yes |
|
| 0.44 | p = 0.13092 q = 0.18331 | p0 = 0.94523 p = 40.73726 q = 99.00000 (p1 = 0.05477) ω = 3.79 | −700.28 | −698.85 | 2.86 | 0.2393 | Yes |
|
| 0.19 | p = 0.42080 q = 1.38006 | p0 = 0.99999 p = 0.42080 q = 1.38012 (p1 = 0.00001) ω = 1.00 | −1666.94 | −1666.94 | 0 | 1.0000 | No |
Positively selected sites (PSS) were identified with posterior probability ≥0.90 using Bayes Empirical Bayes (BEB) analysis. M0: one-ratio model.
One PSS was identified with posterior probability = 0.774 using BEB analysis. When Naive Empirical Bayes analysis was used, the posterior probability = 0.946.
For other details, see Table 1.