| Literature DB >> 23402264 |
Wei Shao1, Valerie F Boltz, Jonathan E Spindler, Mary F Kearney, Frank Maldarelli, John W Mellors, Claudia Stewart, Natalia Volfovsky, Alexander Levitsky, Robert M Stephens, John M Coffin.
Abstract
BACKGROUND: 454 sequencing technology is a promising approach for characterizing HIV-1 populations and for identifying low frequency mutations. The utility of 454 technology for determining allele frequencies and linkage associations in HIV infected individuals has not been extensively investigated. We evaluated the performance of 454 sequencing for characterizing HIV populations with defined allele frequencies.Entities:
Mesh:
Year: 2013 PMID: 23402264 PMCID: PMC3599717 DOI: 10.1186/1742-4690-10-18
Source DB: PubMed Journal: Retrovirology ISSN: 1742-4690 Impact factor: 4.602
Detection of recombination during PCR
| | | |||||||
|---|---|---|---|---|---|---|---|---|
| Run1 | ||||||||
| MID1 | 1 | 100% wild type | 8503 | 8483 | 0 | 20 | 0.24% | 0.00% |
| MID2 | 1 | 100% wild type | 43699 | 43480 | 2 | 217 | 0.50% | 0.00% |
| MID5 | 1 | 10% mutant | 15923 | 12485 | 2039 | 1399 | 8.79% | 8.42% |
| MID7 | 1 | 1% mutant | 9548 | 9329 | 98 | 121 | 1.27% | 0.90% |
| MID3 | 2 | 100% mutant | 24210 | 1 | 24127 | 82 | 0.34% | 0.00% |
| MID4 | 2 | 100% mutant | 13466 | 0 | 13432 | 34 | 0.25% | 0.00% |
| MID10 | 2 | 10% mutant | 49169 | 35477 | 10156 | 3536 | 7.19% | 6.90% |
| MID11 | 2 | 1% mutant | 12070 | 11681 | 171 | 218 | 1.81% | 1.51% |
| Run2 | ||||||||
| MID1 | 1 | 100% wild type | 48828 | 48735 | 0 | 90 | 0.18% | 0.00% |
| MID2a | 1 | 100% wild type; Clone DNA/no PCR | 98785 | 98670 | 2 | 113 | 0.11% | 0.00% |
| MID5 | 1 | 100% mutant | 52287 | 0 | 51903 | 384 | 0.73% | 0.00% |
| MID7 | 1 | 50% mutant | 58882 | 18565 | 31388 | 8929 | 15.16% | 14.82% |
| MID9 | 1 | 50/50 mix RNA | 54744 | 14151 | 27648 | 12945 | 23.65% | 23.30% |
| MID3 | 2 | 100% mutant | 60477 | 6 | 60297 | 174 | 0.29% | 0.00% |
| MID4 | 2 | 100% wild type | 38970 | 38705 | 0 | 265 | 0.68% | 0.00% |
| Run 3 | ||||||||
| MID11 | 1 | 50% mutant | 62437 | 21163 | 40777 | 497 | 0.78% | 0.43% |
| MID12 | 1 | 50% mutant | 122327 | 23589 | 83134 | 15604 | 12.00% | 11.65% |
a Run2MID2: Cloned DNA without PCR amplification.
Figure 1Recombinant crossover rate estimation. (A) Observed crossover rates at each interval of recombinant sequences in Run3. The Y axis is the crossover rate in percentage. The X axis shows the interval sites between each drug resistance site and the length of the intervals. (B) Normalized average crossover rates. AXBR was obtained by dividing average crossover/base by total number of recombinant sequences. AXBS was obtained by dividing average crossover/base by the total number of all sequences.
Summary of recombination patterns
| | ||||
|---|---|---|---|---|
| 1 Crossover | 12 | 12378 | 9 | 311 |
| 2 Crossover | 28 | 1710 | 8 | 161 |
| 3 Crossover | 28 | 188 | 3 | 4 |
| 4 Crossover | 6 | 20 | 0 | 0 |
| 5 Crossover | 1 | 1 | 0 | 0 |
| Sum | 75 | 14297 | 20 | 476 |
Figure 2Crossover patterns of recombinant sequences from Run3 MID12 analyzed using standard PCR conditions. The second row shows the drug resistance codon positions. The 1st to the 7th columns show the nucleotides at each of the 7 sites with WT marked in white and mutant marked in gray. Detectable crossover events are shown as color changes (from white to gray or from gray to white). The last row represents recombinants that have one or more nucleotides in the drug resistance codons missing from sequencing.
Figure 3Crossover patterns of recombinant sequences from Run3 MID11 obtained using low recombination PCR conditions. The second row shows the drug resistance codon positions. The 1st to the 7th columns show the nucleotides at each of the 7 sites with WT marked in white and mutant marked in gray. Detectable crossover events are shown as color changes (from white to gray or from gray to white). The last row represents recombinants that have one or more nucleotides in the drug resistance codons missing from sequencing.
Distribution of errors per sequence
| Run1MID1 (8305) | 24.23% | 44.32% | 18.62% | 8.04% | 3.05% | 1.01% | .067% | 0.06% | 6293 (75.77%) |
| Run1MID2 (43699) | 25.50% | 14.47% | 36.56% | 15.79% | 4.68% | 1.53% | 1.42% | 0.05% | 32556 (74.50%) |
| Run1MID3 (24210) | 67.16% | 24.51% | 5.03% | 1.62% | 0.85% | 0.43% | 0.40% | 0.02% | 7951 (32.84%0 |
| Run1MID4 (13466) | 69.69% | 22.70% | 4.60% | 1.45% | 0.63% | 0.40% | 0.52% | 0.00% | 4081 (30.31%) |
| Run2MID1 (48825) | 23.23% | 43.26% | 20.37% | 7.09% | 2.59% | 1.27% | 1.80% | 0.40% | 37483 (76.77%) |
| Run2MID2 (98785) | 0.12% | 65.26% | 23.88% | 6.17% | 2.33% | 0.96% | 1.08% | 0.20% | 95665 (99.88%) |
| Run2MID3 (60477) | 66.18% | 23.57% | 5.90% | 1.96% | 0.79% | 0.51% | 0.98% | 0.09% | 20451 (33.82%) |
| Run2MID4 (38970) | 60.18% | 26.66% | 7.64% | 2.42% | 1.03% | 0.70% | 1.27% | 0.12% | 15517 (39.82%) |
| Number of sequences (336737) | 113740 | 129224 | 59438 | 19843 | 7031 | 3056 | 3875 | 526 | 222997 (66.22%) |
| Percentage in all sequences | 33.78% | 38.38% | 17.65% | 5.89% | 2.09% | 0.91% | 1.15% | 0.16% |
Figure 4Point error and indel error estimation. (A). Point error distribution in Run1 fragment 1. The data are from Run1 MID1, 2, 5, 7, 8, and 9. The X axis is the nucleotide position of sequences, with position 1 corresponding to nucleotide 101 in BH10 RT [29]. The Y axis is the mean error rate of all samples at each position (blue diamonds). The error bars correspond to 1 standard deviation. The orange diamonds at the top show the locations of drug resistance sites. The numbers with arrows indicate nucleotide positions of sites with high error rates. The gray boxes show the 20 nucleotide PCR primer regions. (B) Indel error distribution in Run1 fragment 1, combined MID1, 2, 5, 7, 8, and 9. The Y axis is the mean indel error rate of all samples at each position (blue squares). The error bars correspond to 1 standard deviation of the error rate of each position. Other symbols are as in panel A. The numbers in parentheses indicate the number of As in homopolymer regions. (C) Point error distribution in Run2 fragment 1. The data are from Run2 MID1, 5, 7, and 9. Conventions are as in panel A. The pink squares show point errors from cloned sample DNA (MID2). Blue diamonds show the mean error rate of all other samples at each position. (D) Indel error distribution from Run2 MID1, 5, 7, and 9. Conventions as in panel B Pink squares show the indel errors of cloned sample DNA. Blue squares are the mean indel error rate of all other samples at each position. The colored symbols and vertical lines after the second gray box show the average errors and standard deviations.
Summary of point and indel errors arising during PCR and 454 sequencing
| | ||||||||
|---|---|---|---|---|---|---|---|---|
| Run1 | 0.02 | 1.36 | 0.15 | 0.14 | 0.002 | 49.99 | 0.51 | 4.16 |
| Run2 amplified | 0.008 | 1.06 | 0.12 | 0.16 | 0.002 | 12.18 | 0.25 | 1.10 |
| Run2 cloned (MID2) | 0.001 | 0.72 | 0.02 | 0.06 | 0.001 | 20.84 | 0.25 | 1.14 |
Base-specific error rate
| Run1 | A | C | G | T | Total |
| A | | 0.02% | 0.02% | 0.13% | |
| C | 0.01% | | 0.01% | 0.05% | |
| G | 0.01% | | 0.01% | 0.10% | |
| T | 0.03% | 0.02% | | 0.13% | |
| Run2 amplified b | A | C | G | T | Total |
| A | | 0.00% | 0.02% | 0.09% | |
| C | 0.00% | | 0.00% | 0.03% | |
| G | 0.00% | | 0.01% | 0.06% | |
| T | 0.01% | 0.01% | | 0.07% | |
| Run2 clone (MID2) | A | C | G | T | Total |
| A | | 0.00% | 0.01% | 0.05% | |
| C | 0.01% | | 0.00% | 0.07% | |
| G | 0.00% | | 0.01% | 0.05% | |
| T | 0.01% | 0.00% | 0.02% | ||
a Transition errors are marked in bold.
b Run2 amplified samples include MID1 (100% wt), 2 (100% wt, no PCR), 5 (100% mutant), 7 (50% mutant), and 9 (50% mutant); Run2 clone is Run2 MID2.
Point errors at drug resistance mutation sites
| 21 | M41L(A->C) | 0.02% | 0.01% | 0.02% (0.01 to 0.03) |
| 94 | 0.31% | 0.09% | 0.31(0.19 to 0.42) | |
| 99 | 0.07% | 0.02% | 0.07% (0.01 to 0.13) | |
| 109 | 0.09% | 0.06% | 0.09% (0.03 to 0.16) | |
| 120 | L74V(T->G) | 0.02% | 0.00% | 0.02% (0.00 to 0.01) |
| 198 | L90I(T->A) | 0.00% | 0.00% | 0.00% (0.00 to 0.03) |
| 209 | K103N(A->C) | 0.02% | 0.01% | 0.02% (0.01 to 0.03) |
| 210 | K103N(A->T) | 0.02% | 0.02% | 0.02% (0.01 to 0.03) |
a Transition errors are in bold.
Run1 includes MID1 (100% wt), 2 (100% wt), 5 (10% mutant), 7 (1% mutant); Totally 77,673 sequences analyzed . Run2 includes MID1 (100% wt), 2 (100% wt), 5 (100% mutant), 7 (50% mutant), 9 (50% mutant); Totally 313,523 sequences analyzed.
Drug resistance mutation detection in mixed samples
| | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Fragment 1 mutations | | | M41 | K65 | D67 | K70 | L74 | L160 | K103 | | Mean | 95% CI |
| Run1MID5 | 10 | 16.9 | 16.21 | 16.94 | 15.42 | 17.01 | 16.79 | 16.4 | 16.67 | | 16.49 | 15.99-17.0 |
| Run1MID7 | 1 | 1.50 | 1.48 | 1.63 | 1.32 | 1.57 | 1.51 | 1.45 | 1.33 | | 1.47 | 1.37-1.55 |
| Run2MID7 | 50 | 61.58 | 60.26 | 60.91 | 60.08 | 61.53 | 61.38 | 61.64 | 62.24 | | 61.15 | 60.43-61.87 |
| Run2MID9 | 50 | 63.24 | 61.56 | 62.60 | 61.96 | 630 | 62.99 | 62.85 | 64.10 | | 62.72 | 61.97-63.48 |
| Run3MID11 | 50 | 65.69 | 65.71 | 65.67 | 650 | 65.69 | 65.68 | 65.71 | 65.72 | | 65.60 | 65.35-65.84 |
| Run3MID12 | 50 | 74.63 | 74.41 | 74.48 | 73.92 | 74.43 | 74.32 | 74.59 | 75.29 | | 74.49 | 74.11-74.87 |
| Fragment 2 mutations | | | Y181 | M184 | Y188 | G190 | L205 | T215 | T215 | K219 | | |
| Run1MID10 | 10 | 22.26 | 20.35 | 20.77 | 21.05 | 21.07 | 21.38 | 21.35 | 21.37 | 22.15 | 21.19 | 20.75-21.63 |
| Run1MID11 | 1 | 2.24 | 2.18 | 2.43 | 2.40 | 2.21 | 1.94 | 1.76 | 1.73 | 1.84 | 2.06 | 1.83-2.30 |
Mean: Mean percentage of all drug resistance codons;
All samples were prepared with standard PCR except Run3 MID11 which was prepared with low recombination PCR.