| Literature DB >> 22927816 |
Hui Li1, Mark B Stoddard, Shuyi Wang, Lily M Blair, Elena E Giorgi, Erica H Parrish, Gerald H Learn, Peter Hraber, Paul A Goepfert, Michael S Saag, Thomas N Denny, Barton F Haynes, Beatrice H Hahn, Ruy M Ribeiro, Alan S Perelson, Bette T Korber, Tanmoy Bhattacharya, George M Shaw.
Abstract
A precise molecular identification of transmitted hepatitis C virus (HCV) genomes could illuminate key aspects of transmission biology, immunopathogenesis and natural history. We used single genome sequencing of 2,922 half or quarter genomes from plasma viral RNA to identify transmitted/founder (T/F) viruses in 17 subjects with acute community-acquired HCV infection. Sequences from 13 of 17 acute subjects, but none of 14 chronic controls, exhibited one or more discrete low diversity viral lineages. Sequences within each lineage generally revealed a star-like phylogeny of mutations that coalesced to unambiguous T/F viral genomes. Numbers of transmitted viruses leading to productive clinical infection were estimated to range from 1 to 37 or more (median = 4). Four acutely infected subjects showed a distinctly different pattern of virus diversity that deviated from a star-like phylogeny. In these cases, empirical analysis and mathematical modeling suggested high multiplicity virus transmission from individuals who themselves were acutely infected or had experienced a virus population bottleneck due to antiviral drug therapy. These results provide new quantitative and qualitative insights into HCV transmission, revealing for the first time virus-host interactions that successful vaccines or treatment interventions will need to overcome. Our findings further suggest a novel experimental strategy for identifying full-length T/F genomes for proteome-wide analyses of HCV biology and adaptation to antiviral drug or immune pressures.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22927816 PMCID: PMC3426529 DOI: 10.1371/journal.ppat.1002880
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Figure 1HCV plasma viral RNA kinetics in acute infection subjects.
PlasmavRNA was quantified by Roche COBAS AmpliPrep/Taqman HCV assay. Circled values indicate samples subjected to vRNA sequencing and plus signs denote HCV antibody positivity.
Figure 2Maximum-likelihood tree (ML) of HCV sequences from acute and chronic infection subjects.
5′ quarter 1 genomesequences (core, E1, E2) from acute (red) and chronic (blue) subjects are shown along with HCV genotype 1 to 7 reference sequences(gray). Bootstrap values represent 100 repetitions in this and subsequent figures. The horizontal scale bar indicates 5% diversity.
Diversity and mutation analyses of HCV sequences in acute infection.
| Sample ID | Sample date | Genotype | Genome | Maximum nt length of viral genome analyzed | Total number of sequences analyzed | Number of transmitted/founder viruses | Maximum nucleotide sequence diversity (%) | Transmitted/founder lineage analyzed | Total number of sequences within lineage | Maximum intra-lineage diversity (%) |
| 10021 | 1998 | 1a | 5q1, 5q2, 5h | 4878 | 151 | 1 | 0.14 | v1 | 151 | 0.14 |
| 10051 | 1998 | 1b | 5q1, 5q2, 5h | 4863 | 303 | 1 | 0.19 | v1 | 303 | 0.19 |
| 10025 | 1994 | 1a | 5q1, 5q2, 5h | 4881 | 175 | 1 | 0.16 | v1 | 175 | 0.16 |
| 10024 | 1994 | 1a | 5q1, 5q2, 5h | 4881 | 222 | 6 | 1.35 | v1 | 212 | 0.12 |
| 10012 | 1998 | 1a | 5q1, 5q2, 5h | 4881 | 230 | 3 | 3.34 | v1 | 96 | 0.12 |
| v2 | 96 | 0.14 | ||||||||
| v3 | 38 | 0.12 | ||||||||
| 10029 | 1998 | 1a | 5q1, 5q2, 5h | 4875 | 322 | 9 | 3.08 | v1 | 241 | 0.18 |
| v2 | 19 | 0.10 | ||||||||
| v5 | 18 | 0.06 | ||||||||
| v6 | 11 | 0.04 | ||||||||
| v8 | 13 | 0.12 | ||||||||
| 10062 | 1996 | 1a | 5q1, 5q2, 5h | 4857 | 188 | 3 | 0.80 | v1 | 163 | 0.16 |
| v2 | 20 | 0.06 | ||||||||
| 10017 | 1993 | 1a | 5q1, 5q2, 5h | 4887 | 249 | 4 | 1.39 | v1 | 220 | 0.16 |
| v2 | 6 | 0.06 | ||||||||
| 6213 | 1992 | 1a | 5h | 4905 | 41 | 3 | 6.40 | v1 | 31 | 0.14 |
| v3 | 9 | 0.08 | ||||||||
| 6222 | 1992 | 1a | 5h | 4902 | 17 | 4 | 0.59 | v1 | 7 | 0.10 |
| 10020 | 1998 | 1a | 5q1, 5h | 4905 | 122 | 10 | 0.27 | all | 122 | N/A |
| 06.QB18 | 27 | 0.12 | ||||||||
| 10002 | 1994 | 1a | 5h | 4905 | 31 | 13 | 3.12 | v5 | 5 | 0.08 |
| 10004 | 1998 | 2b | 5q1 | 2802 | 36 | 3 | 4.39 | v1 | 16 | 0.11 |
| v2 | 11 | 0.11 | ||||||||
| v3 | 9 | 0.14 | ||||||||
| 10016 | 1998 | 2b | 5q1 | 2804 | 72 | 15 | 0.36 | all | 72 | N/A |
| 10.QE4 | 10 | 0.18 | ||||||||
| 9055 | 1998 | 3a | 5h | 4911 | 157 | 1 | 0.18 | v1 | 157 | 0.18 |
| 10003 | 1998 | 3a | 5h | 4911 | 133 | 37 | 0.41 | all | 134 | N/A |
| 07NA5 | 10 | 0.14 | ||||||||
| 07B13 | 11 | 0.10 | ||||||||
| 106889 | 2008 | 1a | 5h | 4902 | 87 | >30 | 1.02 | all | 87 | N/A |
| 5.B.F9 | 10 | 0.06 | ||||||||
| 5.02C22 | 7 | 0.10 | ||||||||
| Average | ||||||||||
| Median | 151 | 4 | 0.80 | 0.12 |
5h - 5′ half genome contains core, E1, E2, p7, NS2, and NS3; 5q1- 5′ quarter 1 genome contains core, E1, E2, p7 and partial NS2; 5q2 - 5′ quarter 2 genome contains partial NS2 and NS3.
Rate calculations were derived from sequences from each discrete transmitted/founder lineage.
Sequences from 1st sampled time point with ≥4 sequences per lineage were analyzed. If 5′ half genomes were not available, quarter genomes 1 and 2 were analyzed with each result shown.
Sequences from 2nd sampled time point were analyzed due to insufficient number of sequences or sequence diversity from 1st sampled time point.
Insufficient numbers of sequences from each time point to calculate fit to Poisson or star-like phylogeny.
Transmitted/founder lineage identified by this sequence in respective ML tree and Highlighter plot.
N/A, not applicable due to multiple transmitted/founder virus genomes.
Averages were calculated from total mutations in all transmitted/founder lineages from all subjects combined. Because of low numbers of sequences and mutations in some lineages, certain values (e.g. dN/dS for subjects 10062 v2 and 10004 v3) vary substantially from the mean.
Figure 3Models of HCV and HIV-1 diversification in acute infection.
Panels A and C depict patterns of HIV-1 replication and diversification compared with HCV in panels B and D. The various generations of HIV-1 (panel C) and HCV (panel D) are indicated as filled dots to represent HIV-1 proviruses or HCV replication complexes. The symbol t indicates the time to sampling in units of provirus generations for HIV-1 (∼2 days) and of replication complex generations for HCV (∼1 day). Whereas HIV-1 is produced only from the latest generation of provirus, HCV is produced from replication complexes of all generations. In this model, the number of HCV replication complexes of generation g at time t is given by . For HIV-1 each provirus of the same generation produces roughly equal numbers of descendant HIV viruses. The replication complexes of HCV produce widely different number of descendant viruses. Panels A and B show the impact of this difference on the expected fraction of pairs of viruses, sampled at various times, that share a given number of ancestors. For HCV, the number of pairs of viruses sampled at time t that share at least g generations is given by . One sees that HCV viruses typically share 50% more generations of ancestors than HIV, and with the estimated mutation rates, this means that they share about 3 times the stochastic mutation events expected in HIV.
Figure 4HCV diversity in chronic (WIMI4025)and acute (10051) subjects.
5′ half genome sequences (core, E1, E2, p7, NS2 and NS3) from WIMI4025 (A) and 10051 (B) are depicted by mid-point rooted ML phylogenies and Highlighter plots.
Figure 5HCV diversity in acute infection.
ML trees and Highlighter plots of 5′ quarter 2 genomesequences from four acutely infected subjects. Sequences are color coded to reflect sampling time points indicated in Figure 1. Subjects 10021 (A) and 10025 (B) revealed productive clinical infection by a single viruses whereas subjects 10012 (C) and 10062 (D) each showed infection by three viruses (v1–3).
Figure 6HCV diversity in acute subject 10029.
MLtree and Highlighter plot of 5′ quarter 1 genomesequences,color coded to reflect sampling time points in Figure 1, reveal productive infection by 9genetically-distinct viruses.
Figure 7HCV diversity in subject 10003.
ML tree and Highlighter plot of 5′ halfgenome sequences reveal many sets of closely related sequences distinguished by unique shared mutations.
Figure 8HCV diversity analysis in subject 10016 suggests acute-to-acute transmission.
Highlighter plot and neighbor-joining tree of 5′ quarter 1 genome sequences. Visualization of 15 potential T/F viral lineages distinguished by unique shared mutations is indicated by lower case (blue) letters. Model estimates of T/F virus lineages using maximum (red) and average (green) cut-offs reveal 10 and 4 potential T/F virus lineages, respectively, based on increasingly stringent model assumptions (see text).
Figure 9HCV plasma vRNA kinectics and diversity in subject 106889 reveals multivariant transmission of drug resistant mutants.
vRNA kinetics (inset), ML phylogenetic tree, and Highlighter analysis of 5′ half genome sequences from the initial vRNA positive sample (circled). Sequences fall into distinct lineages (color-coded) with high statistical bootstrap support. All inferred T/F genomes contained V36M and R155K NS3 protease drug resistance mutations.