| Literature DB >> 22363449 |
Elena Delgado1, Cristina Carrera, Paloma Nebreda, Aurora Fernández-García, Milagros Pinilla, Valentina García, Lucía Pérez-Álvarez, Michael M Thomson.
Abstract
The HIV-1 primary transcript undergoes a complex splicing process by which more than 40 different spliced RNAs are generated. One of the factors contributing to HIV-1 splicing complexity is the multiplicity of 3' splice sites (3'ss) used for generation of rev RNAs, with two 3'ss, A4a and A4b, being most commonly used, a third site, A4c, used less frequently, and two additional sites, A4d and A4e, reported in only two and one isolates, respectively. HIV-1 splicing has been analyzed mostly in subtype B isolates, and data on other group M clades are lacking. Here we examine splice site usage in three primary isolates of subtype C, the most prevalent clade in the HIV-1 pandemic, by using an in vitro infection assay of peripheral blood mononuclear cells. Viral spliced RNAs were identified by RT-PCR amplification using a fluorescently-labeled primer and software analyses and by cloning and sequencing the amplified products. The results revealed that splice site usage for generation of rev transcripts in subtype C differs from that reported for subtype B, with most rev RNAs using two previously unreported 3'ss, one located 7 nucleotides upstream of 3'ss A4a, designated A4f, preferentially used by two isolates, and another located 14 nucleotides upstream of 3'ss A4c, designated A4g, preferentially used by the third isolate. A new 5' splice site, designated D2a, was also identified in one virus. Usage of the newly identified splice sites is consistent with sequence features commonly found in subtype C viruses. These results show that splice site usage may differ between HIV-1 subtypes.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22363449 PMCID: PMC3281843 DOI: 10.1371/journal.pone.0030574
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Schematic representation of HIV-1 splicing.
Open reading frames are shown as open boxes and exons as black bars. Exons are named as previously [1], [2]. All spliced transcripts incorporate exon 1. Optionally, noncoding exons 2 or 3 or both can be incorporated into tat, rev, nef, or env transcripts, and exon 2 into vpr transcripts. Proteins encoded in spliced RNAs are indicated on the right of the 3′ exon.
Figure 2GeneMapper analyses of DS RNAs expressed by three HIV-1 subtype C primary isolates in PBMCs.
Green peaks represent PCR products and orange peaks represent size standards. Size of PCR product, encoded gene, and exon composition (named as in previous studies [1], [2]) predicted according to the size of the PCR product are shown on top or on the side of each peak. Peaks whose sizes do not match HIV-1 transcripts using previously reported splice sites are marked with interrogation signs. For each subtype C virus, three GeneMapper analyses are shown, corresponding to infections using PBMCs from three different donors.
Figure 3Sequence electropherograms of splice junctions newly identified in subtype C isolates.
Splice junctions are shown as vertical lines. 5′ and 3′ splice sites involved in splicing, named as in previous studies [1], [2] and in this study (see main text), are signaled, with nucleotide positions in the HXB2 genome in parentheses. Nearby splice sites are also indicated.
Exon composition of clones derived from DS rev and tat RNAs expressed by three subtype C isolates*.
|
|
| |||||||||
| Isolate | 1.4f | 1.3.4f | 1.2.3.4f | 1.4c | 1.4g | 1.2.4g | 1.3.4g | 1.2a.3.4g | 1.4 | Total |
| X1702-3 | 15 | 5 | 2 | 1 | 23 | |||||
| X1936 | 11 | 5 | 2 | 1 | 2 | 21 | ||||
| X2363-2 | 9 | 1 | 1 | 1 | 12 | |||||
Exons 1, 2, 3, 4c, and 4 are named as in previous studies [1], [2]. Exons 4f and 4g designate those using newly identified 3'ss A4f and A4g, respectively, and exon 2a designates that using 3'ss A1 and newly identified 5'ss D2a (see main text). All transcripts are assumed to include exon 7 through splicing from 5'ss D4 to 3'ss A7. However this splice junction was not sequenced, since the antisense PCR primer anneals upstream of D4. nef RNAs were not amplified because the antisense PCR primer recognizes a sequence upstream of 3'ss A5.
Relative expression of rev RNAs in subtype C viruses according to peak areas in GeneMapper analyses.
| Isolate | PCR product size | Exon composition | % total | % |
| X1702-3 | 305 | 1.4f.7 | 70.3 | 68.2 |
| 316 | 1.4c.7 | 7.1 | 9.1 | |
| 379 | 1.3.4f.7 | 18.0 | 22.7 | |
| 390 | 1.2.4c.7 | 3.9 | - | |
| 428 | 1.2.3.4f.7 | 0.8 | - | |
| X1936 | 305 | 1.4f.7 | 52.2 | 57.9 |
| 316 | 1.4c.7 | 8.6 | 5.3 | |
| 379 | 1.3.4f.7 | 27.4 | 26.3 | |
| 390 | 1.2.4c.7 | 9.0 | - | |
| 428 | 1.2.3.4f.7 | 2.7 | 10.5 | |
| X2363-2 | 299 | 1.4a.7 | 2.6 | - |
| 305 | 1.4f.7 | 0.8 | - | |
| 331 | 1.4g.7 | 50.1 | 75.0 | |
| 367 | 1.2.4c.7 | 9.1 | - | |
| 380 | 1.2.4g.7 | 6.7 | 8.3 | |
| 390 | 1.3.4c.7 | 0.7 | - | |
| 404 | 1.3.4g.7 | 19.5 | 8.3 | |
| 421 | 1.2a.4g.7 | 5.8 | - | |
| 440 | 1.2.3.4c.7 | 1.6 | - | |
| 454 | 1.2.3.4g.7 | 2.0 | - | |
| 494 | 1.2a.3.4g.7 | 1.1 | 8.3 |
Results correspond to peaks shown in Fig. 2, and are shown as % of individual peak areas relative to the sum of peak areas of all rev RNA-derived products. Percentages at the column on the right correspond to the cloned and sequenced rev RNA-derived amplicons (Table 1).
A small 331 nt peak, coincident with that of 1.4g.7 rev RNA, was seen in X1702-3 and X1936 (Fig. 2). However, nested PCR using an antisense primer specific for rev, tat and vpr RNAs failed to detect 1.4g.7 rev RNA in these isolates.
Nested PCR with primers recognizing exons 2 and 3 allowed to confirm that these products, only 1 nt longer in X1702-3 and X1936 than in X2363-2, correspond to 1.3.4f.7 in the first two viruses and to 1.2.4g.7 in the third one.
The 367 nt peak seen in X2363-2 may correspond to both 1.2.4c.7 and 1.3.4b.7 rev RNAs. Nested PCR using primers recognizing exons 2 and 3 allowed to determine that this peak corresponds to 1.2.4c.7.
Figure 4Intronic and exonic sequences surrounding newly identified splice sites in three subtype C isolates.
Sequences are aligned with consensuses of subtypes B and C. (a) Sequences surrounding 3'ss A4f and A4g. AG dinucleotides in the intron ends adjacent to splice sites are in bold type. Polypyrimidine tracts potentially used for splicing at A4f and A4g are boxed. The sequences of subtype B NL4-3 and SF2 isolates are on bottom with branch sites previously identified for rev RNA splicing [8], [14] underlined. Nucleotides in the subtype C isolates and in the consensus subtype C sequence potentially used as branch points for splicing at A4f and A4g (see main text) are indicated with arrows. (b) Sequences surrounding 5'ss D2 and D2a. Exon-intron borders are signaled with vertical lines. Highly conserved GU dinucleotides at intron ends adjacent to the 5'ss are in bold type. Nucleotides at splice sites potentially pairing with U1 snRNA are underlined.