| Literature DB >> 23235488 |
Formijn J van Hemert1, Antoinette C van der Kuyl, Ben Berkhout.
Abstract
A bipartition of HIV-1 RNA genome sequences into single- and double-stranded nucleotides is possible based on the secondary structure model of a complete 9 kb genome. Subsequent analysis revealed that the well-known lentiviral property of A-accumulation is profoundly present in single-stranded domains, yet absent in double-stranded domains. Mutational rate analysis by means of an unrestricted model of nucleotide substitution suggests the presence of an evolutionary equilibrium to preserve this biased nucleotide distribution.Entities:
Keywords: HIV; RNA structure; evolution; lentiviruses; mutational pattern; nucleotide composition
Mesh:
Substances:
Year: 2012 PMID: 23235488 PMCID: PMC3594280 DOI: 10.4161/rna.22896
Source DB: PubMed Journal: RNA Biol ISSN: 1547-6286 Impact factor: 4.652
Table 1. Biased nucleotide composition of single- and double-stranded regions in HIV-1 NL4-3 RNA
| Nucl | ss ds | Number | ss/ds | |
|---|---|---|---|---|
| (proportion) | ||||
| All | ss | 5377 | (0.59) | 1.416 |
| ds | 3796 | (0.41) | ||
| A | ss | 2596 | (0.79) | 3.779 |
| ds | 687 | (0.21) | ||
| U | ss | 1155 | (0.57) | 1.304 |
| ds | 886 | (0.43) | ||
| C | ss | 623 | (0.38) | 0.616 |
| ds | 1012 | (0.62) | ||
| G | ss | 1003 | (0.45) | 0.828 |
| ds | 1211 | (0.55) | ||

Figure 1. Base pair composition of the double-stranded portion of the HIV-1 NL4-3 RNA structure.

Figure 2. SHAPE reactivity of each nucleotide (A, U, C and G) in HIV-1 NL4-3 RNA. The histograms show increasing SHAPE reactivity in windows of 0.2 (X-axis, relative units). Frequency refers to the number of nucleotides per SHAPE window. Note the deviant SHAPE reactivity of the A-nucleotide.
Table 2. Nucleotide frequencies in single- and double-stranded regions of HIV-1 subtype B RNA genomes
| 448 HIV-1 isolates | A | U | C | G | |
|---|---|---|---|---|---|
| AVG | 36.20 | 22.23 | 17.64 | 23.94 | |
| StD | 0.55 | 0.17 | 0.31 | 0.33 | |
| AVG | 47.50 | 21.30 | 11.90 | 19.20 | |
| StD | 0.40 | 0.21 | 0.26 | 0.30 | |
| AVG | 19.90 | 23.60 | 25.80 | 30.70 | |
| StD | 0.44 | 0.23 | 0.29 | 0.30 | |
The alignment of 448 sequences (All) was divided in two parts by the ss or ds designation of bases in the NL4-3 RNA structure. The individual nucleotide compositions were used for the calculation of average and standard deviation.
Table 3. Different patterns of nucleotide substitution for ss and ds nucleotides in HIV-1 subtype B RNA genomes
| Qss | A | U | C | G | Qds | A | U | C | G |
|---|---|---|---|---|---|---|---|---|---|
| A | -0.699428 | 0.081518 | 0.156830 | 0.461079 | A | -1.499043 | 0.254533 | 0.339833 | 0.904678 |
| U | 0.138854 | -0.899975 | 0.568940 | 0.192180 | U | 0.175181 | -0.868417 | 0.532371 | 0.160865 |
| C | 0.574634 | 1.007012 | -1.776556 | 0.194910 | C | 0.300107 | 0.471187 | -0.837239 | 0.065945 |
| G | 1.148242 | 0.084045 | 0.114724 | -1.347011 | G | 0.736551 | 0.068547 | 0.064782 | -0.869880 |
Patterns of nucleotide substitution are presented as rate matrices (Qss and Qds). A positive value of a row represents the rate of substitution of the row nucleotide into one of the column nucleotides. A negative value on the matrix diagonal is the quantity by which the sum of the positive row becomes reduced to zero (meaning a zero rate of substitution). An unrestricted model of nucleotide substitution was used. The two alignments of ss and ds nucleotides were analyzed in five batches of 80 sequences. The resulting matrices () were arithmetically averaged to obtain the two “consensus” matrices (Qss and Qds).