| Literature DB >> 21042592 |
Stephanie M Willerth1, Hélder A M Pedro, Lior Pachter, Laurent M Humeau, Adam P Arkin, David V Schaffer.
Abstract
BACKGROUND: With an estimated 38 million people worldwide currently infected with human immunodeficiency virus (HIV), and an additional 4.1 million people becoming infected each year, it is important to understand how this virus mutates and develops resistance in order to design successful therapies. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2010 PMID: 21042592 PMCID: PMC2962647 DOI: 10.1371/journal.pone.0013564
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flowchart of experimental methodology.
Cell culture supernatants containing viral particles was collected from HIV infected cells. The RNA was extracted using the QIAamp Viral RNA Mini Kit and coverted into large quantities of single stranded DNA using the WT-Ovation Pico RNA Amplification System. The complementary strand for the ssDNA was then synthesized using the WT-Ovation Exon Module. The final step in the process involved using the Genomic DNA Sample Prep Kit to produce an Illumina library.
Figure 2Flowchart of computational methodology.
Once the millions of Illumina reads were obtained, they were then aligned against the 144 HIV-1 B genomes taken from the Los Alamos National Lab database. These alignments produced regions of consensus for each of these individual genomes which were then combined for the final alignment using MAFFT, which is a multiple sequence alignment program for nucleotide sequences.
Composition of Illumina Reads Obtained.
| Sample | Total Number of Reads | Number of reads mapping to the reference HIV genome | Number of reads mapping to the reference HIV genome in pairs | Number of reads with possible adapter contamination | Number of reads mapping to the human genome |
|
| 21,655,108 | 15,536,824 (71.7%) | 12,808,867 (59.1%) | 1,191,975 (5.5%) | 371,752 (1.72%) |
|
| 14,632,646 | 6,869,780 (46.9%) | 5,323,304 (36.4%) | 1,052,221 (7.2%) | 203,690 (1.39%) |
Figure 3A) Coverage (log of read depth) of the HIV genome for both clonal and clinical samples. B) Base substitution rates plots for the same two samples. C) Magnified region of base substitution plot showing the location of the drug resistant mutations present in the clinical sample. The blue line corresponds to the coverage for the homogenous NL4-3 sample and the red line corresponds to the coverage of the heterogeneous clinically relevant HIV population. The average coverage was higher for the clonal sample than the clinical sample while the average base substitution rate was higher for the clinical sample. Key for drug resistant mutations in figure 3C (number- gene:mutation (basepair location)): 1 – PR:L10F (2298), 2 – PR:M46I (2411), 3- PR:I54V (2436), 4- PR:G73T (2487), 5- PR:I84V (2520), 6- PR:L90M (2548), 7- RT:M41L (2688), 8- RT:D67N (2766), 9- RT:K101H (2868), 10- RT:V106I (2883), 11- RT:V118I (2919), 12- RT:G190A (3135), 13- RT:L210W (3196), 14- RT:T215Y (3210), 15- RT:K219N (3222).
Mean base substitution rate for genes.
| Sample | Mean base substitution rate for | Mean base substitution rate for | Mean base substitution rate for |
|
| 0.902% | 0.961% | 0.853% |
|
| 1.459% | 1.597% | 1.707% |
Statistical Comparison of Base Substitution Rates in Different Genes**.
|
|
|
|
|
| |
|
| 0.170 | ||||
|
| 1.29E-14 | 2.20E-16 | |||
|
| 3.75E-3 | ND | ND | ||
|
| ND | 0.0324 | ND | 0.0158 | |
|
| ND | ND | 2.15E-4 | 5.01E-7 | 2.20E-16 |
**p values are given in this table.
***ND - Not determined, as the mean base substitution rates were statistically compared only for relevant pairs of samples.
Location and analysis of indels from clinical sample.
| Location in genome | Length and indel sequence | Frequency (occurs in % of reads) |
| 6675 | -6: | 2.44% |
| 7321 | -3: | 36.1% |
List of resistance mutation for the clinical sample*.
| Known PR resistance mutations | M46I (2411), I54V (2436), I84V (2520), L90M (2548), L10F (2298), G73T (2487) |
| Known RT resistance mutations | M41L (2688), D67N (2766), V118I (2919), L210W (3196), T215Y (3210), K219N (3222), K101H (2868), V106I (2883), G190A (3135) |
| Known IN resistance mutations | none |
Mutations are listed by amino acid mutation within the specific gene with the basepair position being given in parentheses.