| Literature DB >> 28918033 |
Olga V Kretova1, Daria M Fedoseeva1, Maria A Gorbacheva1, Natalya M Gashnikova2, Maria P Gashnikova2, Nataliya V Melnikova3, Vladimir R Chechetkin1, Yuri V Kravatsky1, Nickolai A Tchurikov4.
Abstract
RNAi has been suggested for use in gene therapy of HIV/AIDS, but the main problem is that HIV-1 is highly variable and could escape attack from the small interfering RNAs (siRNAs) due to even single nucleotide substitutions in the potential targets. To exhaustively check the variability in selected RNA targets of HIV-1, we used ultra-deep sequencing of six regions of HIV-1 from the plasma of two independent cohorts of patients from Russia. Six RNAi targets were found that are invariable in 82%-97% of viruses in both cohorts and are located inside the domains specifying reverse transcriptase (RT), integrase, vpu, gp120, and p17. The analysis of mutation frequencies and their characteristics inside the targets suggests a likely role for APOBEC3G (apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like 3G, A3G) in G-to-A mutations and a predominant effect of RT biases in the detected variability of the virus. The lowest frequency of mutations was detected in the central part of all six targets. We also discovered that the identical RNAi targets are present in many HIV-1 strains from many countries and from all continents. The data are important for both the understanding of the patterns of HIV-1 mutability and properties of RT and for the development of gene therapy approaches using RNAi for the treatment of HIV/AIDS.Entities:
Keywords: HIV-1; RNAi targets; conserved HIV-1 sequences; gene therapy; ultra-deep sequencing
Year: 2017 PMID: 28918033 PMCID: PMC5537207 DOI: 10.1016/j.omtn.2017.07.010
Source DB: PubMed Journal: Mol Ther Nucleic Acids ISSN: 2162-2531 Impact factor: 8.886
Figure 1Schematic Presentation of RNAi Targets A1–A6 within the HIV-1 Gene Map
The positions of 19-bp targets are indicated. The values shown above the targets indicate the 5′ numbering of a target in the reference sequence (GenBank: AF316544).
Figure 2Alignments of Deep Sequencing Reads from Both Cohorts
The top ten alignments are shown. The complete alignments are shown in Figures S1–S12. The reference sequence is shown on the top (GenBank: AF316544). The second line represents the sequences observed in the majority of reads. The number and percentage of reads are indicated. The 19-nt core RNAi sequences are not shaded.
Sequences of the 27- to 30-nt Regions Possessing RNAi Targets that Are Shared by the Majority of Viruses in Both Cohorts
Percentage of viruses possessing particular 27- to 30-nt regions in the deep sequencing data is shown after the corresponding sequence. The 19-nt core sequences corresponding to overlapping regions in double-stranded siRNAs are underlined. Percentage of viruses containing identical core sequence is shown in parentheses. The changes in nucleotide sequences from the reference sequence (isolate 97CDKP58e, 2001, from the Republic of the Congo, GenBank: AF316544) are highlighted in yellow. The changes between sequences in the same cohort are highlighted in green. The number of high-quality reads possessing the indicated target is shown in the rightmost column. C1, cohort 1; C2, cohort 2. The data on cloned sequences were deposited in GenBank: KC681847–KC681888. The deep sequencing data for the two cohorts were deposited in NCBI (Bioproject: PRJNA344431). seq., sequence.
Figure 3Curve Showing the Frequencies of Nucleotide Substitutions along the 27- to 30-nt Targets for Cohort 1
The horizontal red line corresponds to the threshold of reliable mutation detection (Equation 5 in Materials and Methods). The frequencies were determined against the most invariable RNAi target and were calculated by Equation 1 (Materials and Methods).
Figure 4Curve Showing the Frequencies of Nucleotide Substitutions along the 27- to 30-nt Targets for Cohort 2
The horizontal red line corresponds to the threshold of reliable mutation detection (Equation 5 in Materials and Methods). The frequencies were determined against the most invariable RNAi target and were calculated by Equation 1 (Materials and Methods).
Figure 5The Most Conserved Positions inside 19-nt RNAi Core Sequences
The regions possessing the mutation frequencies below 10−4 (see Figures 3 and 4) are shown between the blue lines. The values indicate the number of the nucleotide in a target. Cohort 1 is shown in red. Cohort 2 is shown in black.
Figure 6Frequencies of All Possible Nucleotide Substitutions along the 27- to 30-nt Targets for Cohort 1
The frequencies were determined against the most invariable RNAi target and were calculated by Equation 1 (Materials and Methods).
Figure 7Frequencies of All Possible Nucleotide Substitutions along the 27- to 30-nt Targets for Cohort 2
The frequencies were determined against the most invariable RNAi target and were calculated by Equation 1 (Materials and Methods).
Characteristics of the Mutations in Target A2 that Affect Y183 and M184 in RT
| Amino Acid, Codon | First Base Mutations, Codons, Amino Acids/Frequencies | Second Base Mutations, Codons, Amino Acids/Frequencies | Third Base Mutations, Codons, Amino Acids/Frequencies |
|---|---|---|---|
| Y183, TAC | T to C, CAC, H/7·10−4 | A to G, TGC, C/5·10−2 | C to T, TAT, Y/1.5·10−3 |
| T to A, AAC, N/4.5·10−5 | A to T, TTC, F/6·10−4 | C to A, TAA, Stop/4.5·10−4 | |
| T to G, GAC, D/1.8·10−5 | A to C, TCC, S/15.5·10−5 | C to G, TAG, Stop/8.5·10−5 | |
| M184, ATG | A to G, GTG, V/9·10−2 | T to C, ACG, T/6·10−3 | G to A, ATA, I/2.5·10−3 |
| A to T, TTG, L/1.8·10−3 | T to G, AGG, R/1.7·10−3 | G to C, ATC, I/3.7·10−4 | |
| A to C, CTG, L/1.7·10−3 | T to A, AAG, K/1.5·10−3 | G to T, ATT, I/2·10−4 | |
| Y183, TAC | T to C, CAC, H/7·10−5 | A to G, TGC, C/1.5·10−4 | C to T, TAT, Y/6·10−5 |
| T to A, AAC, N/6·10−5 | A to T, TTC, F/2·10−5 | C to A, TAA, Stop/2·10−5 | |
| T to G, GAC, D/3·10−6 | A to C, TCC, S/9·10−6 | C to G, TAG, Stop/5.5·10−6 | |
| M184, ATG | A to G, GTG, V/3·10−4 | T to C, ACG, T/1.0·10−4 | G to A, ATA, I/2.3·10−3 |
| A to T, TTG, L/3·10−5 | T to G, AGG, R/3·10−5 | G to C, ATC, I/5.2·10−4 | |
| A to C, CTG, L/2·10−5 | T to A, AAG, K/5·10−5 | G to T, ATT, I/5·10−5 | |
RNA Sequences that Were Used for the Preparation of Dicer Substrates and the DNA Sequence Containing Six RNAi Targets
| Target | Domain | Target Sequence, 5′-3′ | RNAs, 5′-3′ |
|---|---|---|---|
| A1 | RT | AAAAA | AAAAAGCAUCAGAAAGAACCUCCAUUU |
| AAAUGGAGGUUCUUUCUGAUGCUUUUU | |||
| A2 | RT | AGAC | AGACAUAGUUAUCUAUCAAUACAUGGA |
| UCCAUGUAUUGAUAGAUAACUAUGUCU | |||
| A3 | Int | AGGA | AGGAGUAGUGGAGUCUAUGAAUAAGGA |
| UCCUUAUUCAUAGACUCCACUACUCCU | |||
| A4 | vpu | GTGT | GUGUGGACUAUAGUAGGUAUAGAAUAU |
| AUAUUCUAUACCUACUAUAGUCCACAC | |||
| A5 | gp120 | ACCA | ACCAGGACAGACAUGGUAUGGAACAGG |
| CCUGUUCCAUACCAUGUCUGUCCUGGU | |||
| A6 | p17 | GT | GUGCGAGAGCGUCAGUAUUAAGUGGGG |
| CCCCACUUAAUACUGACGCUCUCGCAC |
The 19-nt core sequences are underlined. The pairs of RNA molecules are shown in the 5′-3′ direction. These pairs could form 27-bp double-stranded RNAs (Dicer substrates).
Figure 8Efficiency of RNAi Initiated by the Dicer Substrates
Results of co-transfection experiments are shown (see Materials and Methods). Luminescence of the Renilla luc gene (RLU) was normalized to that of the firefly luc gene. *p < 0.005.
Spread of the Identical Targets in HIV-1 Isolates from Different Countries
| Target/Domain | 19-nt Core Sequence | Number of Identical BLAST Hits | Country |
|---|---|---|---|
| A1/RT | GCATCAGAAAGAACCTCCA | 4,651 | Russia, USA, Uzbekistan, Spain, Malaysia, Kenya, Rwanda, Ukraine, China, Uganda, Cameroon, Nigeria, South Africa, South Korea, Hungary, Iran, Brazil, UK, Germany, Morocco, Ethiopia, Senegal, Mexico, Togo, Panama, French Guiana, Belgium, Angola, Sierra Leone, Ghana, Thailand, D.R. Congo, Burundi, Puerto Rico, Australia, Poland, Algeria, Israel, Cuba, India, Botswana, Jamaica, Turkey, Tanzania, Bulgaria, Cyprus, Portugal |
| A2/RT | ATAGTTATCTATCAATACA | 4,837 | Russia, USA, South Africa, France, Uzbekistan, Spain, Malaysia, Kenya, South Korea, UK, Peru, China, Switzerland, Germany, Uganda, Cameroon, Surinam, Brazil, Mexico, Kuwait |
| A3/Int | GTAGTGGAGTCTATGAATA | 1,395 | Russia, Uzbekistan, Rwanda, Ukraine, Kenya, Spain, Zambia, Armenia, Kirgizstan, Kazakhstan, France, Uganda, Canada, South Africa, UK, Cyprus, Cameroon, Germany, Belgium, Cuba, Georgia |
| A4/Vpu | GGACTATAGTAGGTATAGA | 768 | Russia, Rwanda, Ukraine, Iran, Nigeria, Uganda, D.R. Congo, Kenya, Spain, Cameroon, India, China, Myanmar, Thailand, Cyprus, Afghanistan, USA, Viet Nam, South Africa, Denmark, Tanzania, Kazakhstan, Uzbekistan, Georgia, Botswana |
| A5/Gp120 | GGACAGACATGGTATGGAA | – | Russia |
| A6/P17 | GCGAGAGCGTCAGTATTAA | 4,956 | USA, Thailand, Russia, Kenya, Rwanda, Cameroon, Pakistan, Ukraine, Brazil, China, India, Nigeria, Cameroon, Greece, Tanzania, Spain, Uganda, Senegal, UK, South Africa, Germany, Botswana, Portugal, D.R. Congo, Burkina Faso, France, Zambia, Cuba, India, Guinea-Bissau, Sweden, Estonia, Malaysia, Iran, Canada, Myanmar, Switzerland, Netherlands, South Korea, Malawi, Poland |
The search using nucleotide BLAST (Nucleotide collection [nr/nt]) was performed using the 19-nt core sequences. The number of identical hits is indicated. The list of countries is shown in the order of the number of corresponding hits that appeared in the list. The search was performed for the maximum of 5,000 target sequences with automatically adjusted parameters for short input sequences. The target A5 was detected only in Russia.