| Literature DB >> 23112852 |
Austin L Hughes1, Ericka A Becker, Michael Lauck, Julie A Karl, Andrew T Braasch, David H O'Connor, Shelby L O'Connor.
Abstract
Deep sequencing technology is revolutionizing our understanding of HIV/SIV evolution. It is known that acute SIV sequence variation within CD8 T lymphocyte (CD8-TL) epitopes is similar among MHC-identical animals, but we do not know whether this persists into the chronic phase. We now determine whether chronic viral variation in MHC-identical animals infected with clonal SIV is similar throughout the entire coding sequence when using a sensitive deep sequencing approach. We pyrosequenced the entire coding sequence of the SIV genome isolated from a unique cohort of four SIVmac239-infected, MHC-identical Mauritian cynomolgus macaques (MCM) 48 weeks after infection; one MCM in the cohort became an elite controller. Among the three non-controllers, we found that genome-wide sequences were similar between animals and we detected increased sequence complexity within 64% of CD8-TL epitopes when compared to Sanger sequencing methods. When we compared sequences between the MHC-matched controller and the three non-controllers, we found the viral population in the controller was less diverse and accumulated different variants than the viral populations in the non-controllers. Importantly, we found that initial PCR amplification of viral cDNA did not significantly affect the sequences detected, suggesting that data obtained by pyrosequencing PCR-amplified viral cDNA accurately represents the diversity of sequences replicating within an animal. This demonstrates that chronic sequence diversity across the entire SIV coding sequence is similar among MHC-identical animals with comparable viral loads when infected with the same clonal virus stock. Additionally, our approach to genome-wide SIV sequencing accurately reflects the diversity of sequences present in the replicating viral population. In sum, our study suggests that genome-wide pyrosequencing of immunodeficiency viruses captures a thorough and unbiased picture of sequence diversity, and may be a useful approach to employ when evaluating which sequences to include as part of a vaccine immunogen.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23112852 PMCID: PMC3480401 DOI: 10.1371/journal.pone.0047818
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Synonymous (π) and nonsynonymous (π) nucleotide diversity in non-overlapping coding regions of SIV populations.
| Host |
|
|
| CY0165 | 0.0011±0.0008 | 0.0011±0.0005 |
| CY0163 | 0.0082±0.0022 | 0.0036±0.0009 |
| CY0164 | 0.0101±0.0025 | 0.0039±0.0009 |
| CY0166 | 0.0072±0.0021 | 0.0038±0.0009 |
| Mean (all except CY0165) | 0.0085±0.0009 | 0.0038±0.0001 |
π or π significantly different from corresponding value for CY0165 (P<0.05; Z-test).
π or π significantly different from corresponding value for CY0165 (P<0.01; Z-test).
π or π significantly different from corresponding value for CY0165 (P<0.001; Z-test).
Figure 1Variation in CD8-TL epitopes in virus populations from non-controller M3/M3 MCM and an SIVmac239 stock.
Amino acid variation detectable by pyrosequencing in 12 CD8-TL epitopes in viruses isolated from three M3/M3 MCM (CY0163, CY0164, and CY0166) is compared to variation previously reported by Sanger sequencing bulk PCR amplicons [18]. Amino acid variants detectable within an SIVmac239 stock were characterized here and elsewhere [34]. Sequences that match the inoculum are represented with a “.”. Mixed populations are represented with an “X”. The number of high quality reads with each specific sequence is shown. The total number of reads for each epitope can be found in Table S2. The frequency of reads with a specific amino acid sequence is shown as “% Var.” The sequences labeled as “other” include variants that were individually present at less than 1% and variants that were also detected in the inoculum, which included the A2V mutation in Gag221–229PR9. The relative shading of each box reflects the frequency of reads, as indicated in the legend.
Synonymous (πS) and nonsynonymous (πN) nucleotide diversity in variant epitopes from CY0163, CY0164, and CY0166.
| Region | πS ± S.E. | πN ± S.E. |
| Epitopes with variation detected only by pyrosequencing | 0.0043±0.0020 | 0.0188±0.0048a |
| Epitopes with variation detected by Sanger & pyrosequencing | 0.0022±0.0009 | 0.0288±0.0059b |
| Remainders | 0.0080±0.0009 | 0.0045±0.0007a |
Paired t-tests of the hypothesis that πS = πN: aP<0.05; bP<0.01.
See Table S3 for the value of nucleotide diversity for individual epitopes and remainders.
Figure 2Sequence variants in CD8-TL epitopes are detectable by directly pyrosequencing virus populations isolated from CY0166.
Amino acid variation detected by directly pyrosequencing the virus population in CY0166 is shown for all 12 CD8-TL epitopes. Sequences that match the inoculum are represented with a “.”. The number of high quality reads with each specific sequence is shown. The total number of reads for each epitope can be found in Table S2. The frequency of reads with a specific amino acid sequence is shown as “% Var.” The sequences labeled as “other” include variants that were individually present at less than 1% and variants that were also detected in the inoculum. The relative shading of each box reflects the frequency of reads, as indicated in the legend.
Figure 3Variation in CD8-TL epitopes in a virus population from an M3/M3 elite controller.
Amino acid variation detected by pyrosequencing in 12 CD8-TL epitopes in viruses isolated from CY0165 is compared to variation previously reported by Sanger sequencing bulk PCR amplicons [18]. Sequences that match the inoculum are represented with a “.”. The number of high quality reads with each specific sequence is shown. The total number of reads for each epitope can be found in Table S2. The frequency of reads with a specific amino acid sequence is shown as “% Var.” The sequences labeled as “other” include variants that were individually present at less than 1% and variants that were also detected in the inoculum. The relative shading of each box reflects the frequency of reads, as indicated in the legend.
Synonymous (πS) and nonsynonymous (πN) nucleotide diversity in epitope and non-epitope regions from all animals.
| Host | Epitope | Non-Epitope | ||
| πS ± S.E. | πN ± S.E. | πS ± S.E. | πN ± S.E. | |
| CY0165 | 0.0011±0.0006 | 0.0014±0.0005c | 0.0006±0.0003d | 0.0006±0.0002c |
| Others | 0.0035±0.0012 | 0.0206±0.0037b | 0.0069±0.0009 | 0.0044±0.0008 |
| All | 0.0029±0.0009 | 0.0158±0.0033a | 0.0053±0.0009 | 0.0035±0.0007 |
Paired t-tests of the hypothesis that πS = πN; a P<0.05; b P<0.01.
Nucleotide diversity is analyzed in all 12 epitope and non-epitope regions of the SIV genomes isolated from all four animals (mean values by gene). T-tests of the hypothesis that πS or πN for CY0165 equals the corresponding value for the other animals: c P<0.01; d P<0.001.
Correlation coefficients between proportion variant at SNP sites in SIV from different hosts.
| CY0163 | CY0164 | CY0166 | ||
| Synonymous | CY0165 | −0.041 | −0.037 | −0.034 |
| CY0163 | 0.140 | 0.307 | ||
| CY0164 | 0.130 | |||
| Nonsynonymous | ||||
| CY0165 | 0.152 | 0.120 | 0.079 | |
| CY0163 | 0.617 | 0.653 | ||
| CY0164 | 0.567 |
Only sites synonymous in all reading frames and showing one or more differences from the inoculum in at least one host (N = 451 sites) were included.
Only sites nonsynonymous in all reading frames and showing one or more differences from the inoculum in at least one host (N = 420 sites) were included.
Significance of correlation coefficient: P<0.05 (Bonferroni-correction for multiple testing).
Significance of correlation coefficient: P<0.01 (Bonferroni-correction for multiple testing).
Significance of correlation coefficient: P<0.001 (Bonferroni-correction for multiple testing).
The correlation for nonsynonymous sites was significantly different (P<0.001 in each case; Bonferroni-correction for multiple testing) from the corresponding correlation for synonymous sites.
Median proportion variant at sites shared or unique to the SIV from each host.
| Host | Shared | Unique | |||
| N (%) | Prop. variant | N (%) | Prop. variant | ||
| Synonymous | CY0165 | 9 (36.0%) | 0.1069 | 16 (64.0%) | 0.0478 |
| CY0163 | 88 (43.1%) | 0.0264 | 116 (56.9%) | 0.0155 | |
| CY0164 | 100 (44.8%) | 0.0193 | 123 (55.2%) | 0.0145 | |
| CY0166 | 78 (49.4%) | 0.0196 | 80 (50.6%) | 0.0164 | |
| Non-synon. | CY0165 | 30 (34.5%) | 0.0288 | 57 (65.5%) | 0.0261 |
| CY0163 | 112 (58.9%) | 0.0473 | 78 (41.1%) | 0.0159 | |
| CY0164 | 102 (52.3%) | 0.0578 | 93 (47.7%) | 0.0174 | |
| CY0166 | 89 (59.7%) | 0.0728 | 60 (40.3%) | 0.0187 | |
The proportions of shared and unique synonymous SNPs did not differ significantly among hosts; χ2 = 2.32; 3 d.f.; N.S.
The proportions of shared and unique nonsynonymous SNPs differed significantly among hosts; χ2 = 15.71; 3 d.f.; P = 0.001.
Mann-Whitney test of the hypothesis that the median proportion variant of shared SNPs equaled that of unique SNPs: P<0.01.
Mann-Whitney test of the hypothesis that the median proportion variant of shared SNPs equaled that of unique SNPs: P<0.001.