| Literature DB >> 19487424 |
Jesus F Salazar-Gonzalez1, Maria G Salazar, Brandon F Keele, Gerald H Learn, Elena E Giorgi, Hui Li, Julie M Decker, Shuyi Wang, Joshua Baalwa, Matthias H Kraus, Nicholas F Parrish, Katharina S Shaw, M Brad Guffey, Katharine J Bar, Katie L Davis, Christina Ochsenbauer-Jambor, John C Kappes, Michael S Saag, Myron S Cohen, Joseph Mulenga, Cynthia A Derdeyn, Susan Allen, Eric Hunter, Martin Markowitz, Peter Hraber, Alan S Perelson, Tanmoy Bhattacharya, Barton F Haynes, Bette T Korber, Beatrice H Hahn, George M Shaw.
Abstract
Identification of full-length transmitted HIV-1 genomes could be instrumental in HIV-1 pathogenesis, microbicide, and vaccine research by enabling the direct analysis of those viruses actually responsible for productive clinical infection. We show in 12 acutely infected subjects (9 clade B and 3 clade C) that complete HIV-1 genomes of transmitted/founder viruses can be inferred by single genome amplification and sequencing of plasma virion RNA. This allowed for the molecular cloning and biological analysis of transmitted/founder viruses and a comprehensive genome-wide assessment of the genetic imprint left on the evolving virus quasispecies by a composite of host selection pressures. Transmitted viruses encoded intact canonical genes (gag-pol-vif-vpr-tat-rev-vpu-env-nef) and replicated efficiently in primary human CD4(+) T lymphocytes but much less so in monocyte-derived macrophages. Transmitted viruses were CD4 and CCR5 tropic and demonstrated concealment of coreceptor binding surfaces of the envelope bridging sheet and variable loop 3. 2 mo after infection, transmitted/founder viruses in three subjects were nearly completely replaced by viruses differing at two to five highly selected genomic loci; by 12-20 mo, viruses exhibited concentrated mutations at 17-34 discrete locations. These findings reveal viral properties associated with mucosal HIV-1 transmission and a limited set of rapidly evolving adaptive mutations driven primarily, but not exclusively, by early cytotoxic T cell responses.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19487424 PMCID: PMC2715054 DOI: 10.1084/jem.20090378
Source DB: PubMed Journal: J Exp Med ISSN: 0022-1007 Impact factor: 14.307
Subject demographics, risk group, and baseline laboratory data
| Subject | HIV-1 subtype | Geographic location | Gender | Risk group | Sampling date | VL | Clinical specimen | EIA | WB | Fiebig stage |
| WITO4160 | B | Alabama | M | Heterosexual | 08.04.00 | 325,064 | Plasma | neg | neg | II |
| SUMA0874 | B | Alabama | M | MSM | 05.13.91 | 939,260 | Plasma | neg | neg | II |
| WEAU0575 | B | Alabama | M | MSM | 05.30.90 | 216,415 | Plasma | neg | neg | II |
| TRJO4551 | B | Alabama | M | MSM | 10.10.01 | 8,121,951 | Plasma | neg | neg | II |
| 04013226-2 | B | New York | M | MSM | 11.20.02 | 26,700,000 | Plasma | neg | neg | II |
| 04013396-0 | B | New York | M | MSM | 08.16.05 | 1,600,000 | Plasma | pos | ind | IV |
| CH40 (700010040) | B | North Carolina | M | MSM | 07.11.06 | 2,197,248 | Serum | neg | neg | II |
| CH40 (700010040) | 07.27.06 | 298,026 | Plasma | pos | pos (p31−) | V | ||||
| CH58 (700010058) | B | North Carolina | M | MSM | 08.22.06 | 92,581 | Serum | neg | neg | II |
| CH58 (700010058) | 08.31.06 | 394,649 | Plasma | pos | neg | III | ||||
| CH77 (700010077) | B | North Carolina | M | MSM | 08.25.06 | 3,565,728 | Serum | neg | neg | II |
| CH77 (700010077) | 09.08.06 | 144,145 | Plasma | pos | pos (p31−) | V | ||||
| ZM246F | C | Zambia | F | Heterosexual | 01.14.03 | 10,013,800 | Plasma | neg | neg | II |
| ZM249M | C | Zambia | M | Heterosexual | 08.05.03 | >2,000,000 | Plasma | pos | ind | IV |
| ZM247F | C | Zambia | F | Heterosexual | 10.28.03 | 10,823,500 | Plasma | neg | neg | II |
EIA, enzyme-linked immunosorbant assay for HIV-1; ind, indeterminant; MSM, men who have sex with mena; VL, viral load (RNA molecules/ml); WB, Western blot for HIV-1.
Fiebig clinical serological staging system (17).
Diversity analysis of full-length HIV-1 genomes derived from patients with primary infection
| Number of transmitted viruses | ||||||||||||||||
| Subject | Fiebig stage | Total number of HIV-1 genomes | Maximum nt length of viral genome analyzed | Minimum nt length of viral genome analyzed | Number of genomes analyzed | Nucleotide sequence diversity (mean %) | Nucleotide sequence diversity (range %) | Max HD | Poisson estimated days since MRCA | Lambda | Goodness of fit p-value | HD fit to Poisson | Star phylogeny | Deviation from star phylogeny | env-only analysis | Full-length genome analysis |
| WITO4160 | II | 18 | 9,027 | 7,698 | 14 | 0.04 | 0–0.09 | 8 | 17 (12, 22) | 3.57 | 0.923 | Yes | Yes | 1 | 1 | |
| SUMA0874 | II | 6 | 9,033 | 9,020 | 6 | 0.03 | 0.01–0.04 | 4 | 11 (7, 15) | 2.33 | 0.546 | Yes | Yes | 1 | 1 | |
| WEAU0575 | II | 9 | 9,028 | 9,022 | 9 | 0.03 | 0–0.08 | 7 | 14 (8, 21) | 3.06 | 0.49 | Yes | No | Early stochastic | 1 | 1 |
| TRJO4551 | II | 7 | 9,051 | 9,046 | 7 | 0.06 | 0.03–0.07 | 7 | 20 (14, 26) | 4.29 | 0.836 | Yes | Yes | 1 | 1 | |
| 04013226-2 | II | 9 | 9,068 | 9,038 | 8 | 0.08 | 0.04–0.11 | 10 | 33 (28, 38) | 7.04 | 0.269 | Yes | No | Early stochastic | 1 | 1 |
| 04013396-0 | IV | 5 | 9,049 | 9,048 | 4 | 0.03 | 0–0.06 | 5 | 12 (7, 17) | 2.5 | 0.704 | Yes | Yes | 1 | 1 | |
| CH40 (700010040) | II | 26 | 3,593 | 2,977 | 26 | 0.04 | 0–0.11 | 4 | 16 (12, 21) | 1.372 | 0.251 | Yes | Yes | 1 | 1 | |
| CH40 (700010040) | V | 9 | 8,990 | 8,989 | 9 | 0.05 | 0.01–0.1 | 9 | 21 (14, 28) | 4.44 | 0.46 | Yes | No | CTL selection | 1 | 1 |
| CH58 (700010058) | III | 7 | 9,027 | 8,717 | 7 | 0.06 | 0.02–0.1 | 9 | 24 (14, 33) | 5.05 | 0.941 | Yes | No | Early stochastic | 1 | 1 |
| CH77 (700010077) | II | 23 | 2,785 | 2,785 | 23 | 0.05 | 0–0.14 | 4 | 20 (13,26) | 1.3 | 0.449 | Yes | Yes | 1 | 1 | |
| CH77 (700010077) | V | 8 | 9,048 | 9,037 | 8 | 0.06 | 0.02–0.13 | 12 | 27 (16, 37) | 5.75 | 0.46 | Yes | No | CTL selection | 1 | 1 |
| ZM246F | II | 10 | 8,978 | 8,977 | 10 | 0.04 | 0–0.09 | 8 | 17 (10, 24) | 3.56 | 0.88 | Yes | No | Early stochastic | 1 | 1 |
| ZM249M | IV | 7 | 9,054 | 8,735 | 7 | 0.03 | 0.01–0.06 | 5 | 15 (9, 20) | 3.1 | 0.671 | Yes | Yes | 1 | 1 | |
| ZM247F | II | 13 | 9,040 | 7,957 | 12 | 1.19 | 0.01–2.46 | 220 | 493 (314, 673) | 106.197 | 0 | No | No | Two strains | 2 | 2 |
| ZM247F lng 1 | II | 9 | 9,040 | 7,957 | 8 | 0.05 | 0.01–0.09 | 8 | 21 (14, 28) | 4.5 | 0.967 | Yes | Yes | 1 | 1 | |
| ZM247F lng 2 | II | 4 | 9,034 | 9,033 | 4 | 0.05 | 0.02–0.08 | 7 | 21 (14, 28) | 4.5 | 0.939 | Yes | Yes | 1 | 1 | |
Genomes with multiple APOBEC G-to-A hypermutations or large deletions were excluded from model analysis.
Predicted minimum number of days needed to achieve observed within-patient diversity.
Mean of best fitting Poisson distribution. Lambda is the free parameter that defines a Poisson.
Env-only analyses were previously reported (unpublished data) (11, 12).
SGA sequences representing 3′ half subgenomic fragments from the screening sample.
lng 1= lineage 1; lng 2= lineage 2.
Figure 1.Maximum likelihood phylogenetic tree of full-length HIV-1 genome sequences from 12 subjects. Individual sequences are shown as filled ovals. HIV-1 clade B and C reference sequences from the LANL database (http://www.hiv.lanl.gov/content/index) are shown in gray. Numerals at nodes indicate phylogenetic support for sequences included at that node; maximum likelihood bootstrap support ≥ 70% is shown in italics. Bayesian posterior probability values ≥ 0.9 are shown in bold. The scale bar indicates 5% genetic distance (5 nt differences per 100 sites).
Figure 2.Phylogenetic and WITO_fl.CON is the inferred transmitted/founder sequence. In the phylogram (left), asterisks indicate sequences that had International Union of Pure and Applied Chemistry ambiguous state assignments caused by mixed bases. Mixed bases were attributed to Taq polymerase errors and, thus, are not reflected in branch lengths of phylograms in this or other figures. The scale bar indicates a genetic distance of 0.0001 (0.01%, or 1 nt difference per genome). In the Highlighter plots (middle and right), nucleotide differences from the transmitted/founder consensus sequence are indicated by tic marks. Brackets denote insertions of single bases. Gray tics and boxes indicate a deleted sequence. The consensus (CON) is the same whether or not mixed bases are included. The middle Highlighter plot shows sequences without ambiguous (mixed) base assignments. The Highlighter plot on the right includes sequences with one or more ambiguous bases (shown as dark blue tic marks). Sequences H3 and C3 contained APOBEC-related G-to-A hypermutation. The horizontal axis indicates nucleotide positions in the HIV-1 genome. Sequences begin at nucleotide position 582 in the 5′ LTR (U5) and extend to position 9,606 in the 3′ LTR (R) based on HXB2 reference sequence numbering (http://www.hiv.lanl.gov/content/sequence/HIV/REVIEWS/HXB2.html).
Figure 3.Phylogenetic and Subject ZM247F had sequences comprising two distinct lineages that were 2.4% different from each other, as shown by the phylogram (left) and Highlighter plot (right). ZM247F_fl.CON1 is the consensus of lineage 1. The second lineage (sequences B8, E10, E11, and G11) differs from ZM247F_fl.CON1 by 213–217 nt differences, as indicated by tic marks.
Figure 4.Phylogenetic and (A and B) Analyses are from subjects CH40 (A) and CH77 (B). In Highlighter plots, the boxed tic marks indicate nucleotide polymorphisms shared between two or more sequences. Such shared mutations are also reflected by branches emanating from common nodes in the phylograms. In B, sequences are rooted on sequence A2, not on a consensus of the sequences. This is because sequence A2 was most similar to the transmitted/founder sequence, differing from it by only a single nucleotide at position 6,021 (see Results for explanation).
Figure 5.(A–C) Samples were from subjects CH40 (A), CH58 (B), and CH77 (C). The dark line represents the transmitted/founder sequence (T/F) for each subject. Sequences from different sample dates are indicated by the grouped horizontal lines, with tic marks indicating differences from the T/F sequence. The screening sample (S) was in each case obtained from a preseroconversion Fiebig stage II sample near peak viremia. The sampling times in days after screening and corresponding viral loads are indicated (left). Light blue tic marks indicate differences in noncoding regions of the genomes. Gray tic marks indicate deletions. Green tic marks indicate differences that are synonymous in the reading frames indicated at the bottom of the figure. Red tic marks indicate differences that are nonsynonymous in a coding gene. Short horizontal lines (labeled A, B, and C) indicate short (∼1-kb) sequences from a screening sample used to verify the presence or absence of a shared polymorphism (Figs. S3–S6). The horizontal axis indicates nucleotide positions in the alignment beginning at position 582 in the 5′ LTR (U5) and extending to position 9,606 in the 3′ LTR (R) based on HXB2 reference sequence numbering (http://www.hiv.lanl.gov/content/sequence/HIV/REVIEWS/HXB2.html).
Figure 6.Cloning strategy and functional analysis of three subtype C full-length molecular clones from subjects ZM246F and ZM247F. (A) The full-length transmitted/founder proviral sequence for ZM246F was chemically synthesized in three overlapping fragments. The two transmitted/founder proviruses in subject ZM247F were PCR amplified from PBMC DNA in two overlapping halves. Proviral fragments were cloned into the indicated plasmid vectors. (B) Plasmid DNAs were transfected into 293T cells, and progeny viruses were examined for their ability to replicate in activated primary human CD4+ lymphocytes (top) and monocyte-derived macrophages (bottom) from the same donor. Culture supernatants were analyzed at days 1–16 for p24 production. The results shown were replicated four times in lymphocytes and macrophages from four different normal blood donors.
Phenotype of transmitted HIV-1 clade C viruses compared with prototype clade B viruses
| Virus | sCD4 | HIVIG | Clade B plasma | Clade C plasma | 17b | 21c | b12 | 2G12 | 2F5 | 4E10 | Z13e1 | 447-52D | 447-52D+ sCD4 | F425-B4e8 | F425-B4e8+ sCD4 | 3074 | 3074+ sCD4 | T-20 | T-1249 | Anti-CD4 mab | TAK-779 | Co-receptor usage |
| YU-2 | 13 | 619 | 20 | 20 | >10 | >10 | 7.16 | >50 | >50 | >50 | >50 | 13.36 | 1.48 | >25 | 1.85 | >25 | >25 | 0.49 | 0.094 | 0.18 | 0.047 | R5 |
| NL4.3 | 1 | 60 | 92 | 31 | 0.21 | 0.59 | 0.01 | 1.08 | 0.24 | 1.05 | >50 | 0.03 | 0.01 | >25 | >25 | >25 | >25 | 0.47 | 0.003 | 0.63 | >10 | X4 |
| ADA | 105 | 1,000 | 19 | 17 | >10 | >10 | 1.31 | >50 | 33.2 | >50 | >50 | >25 | 0.03 | >25 | 0.11 | >25 | 6.81 | 0.15 | 0.094 | 0.31 | 0.307 | R5 |
| NL.Bal. ecto | 8 | 113 | 50 | 32 | >10 | >10 | 0.11 | 1.39 | 12.3 | 15.1 | >50 | 0.14 | 0.04 | 0.54 | 0.05 | >25 | 1.47 | 0.09 | 0.015 | 0.44 | 1.04 | R5 |
| ZM246F | 125 | 748 | 13 | 31 | >10 | >10 | 21.35 | >50 | >50 | 5.50 | >50 | >25 | >25 | >25 | >25 | >25 | 3.35 | 0.07 | 0.012 | 0.17 | 0.003 | R5 |
| ZM247Fv1 | 155 | >1,000 | 13 | 18 | >10 | >10 | >50 | >50 | >50 | 11.33 | >50 | >25 | >25 | >25 | >25 | >25 | >25 | 0.05 | 0.011 | 0.11 | 0.032 | R5 |
| ZM247Fv2 | 125 | >1,000 | 12 | 20 | >10 | >10 | >50 | >50 | >50 | 47.11 | >50 | >25 | >25 | >25 | >25 | >25 | >25 | 0.08 | 0.027 | 0.16 | 0.034 | R5 |
Viruses derived by transfection of 293T cells with replication-competent molecularly cloned proviral DNA.
Reciprocal geometric mean neutralization titers (IC50) of heterologous plasma specimens from 11 HIV-1 clade B chronically infected subjects.
Reciprocal geometric mean neutralization titers (IC50) of heterologous plasma specimens from 22 HIV-1 clade C chronically infected subjects.