| Literature DB >> 31081611 |
Emanuela Ruggiero1, Martina Tassinari1, Rosalba Perrone2, Matteo Nadai1, Sara N Richter1.
Abstract
Retroviruses infect almost all vertebrates, from humans to domestic and farm animals, from primates to wild animals, where they cause severe diseases, including immunodeficiencies, neurological disorders, and cancer. Nonhuman retroviruses have also been recently associated with human diseases. To date, no effective treatments are available; therefore, finding retrovirus-specific therapeutic targets is becoming an impelling issue. G-Quadruplexes are four-stranded nucleic acid structures that form in guanine-rich regions. Highly conserved G-quadruplexes located in the long-terminal-repeat (LTR) promoter of HIV-1 were shown to modulate the virus transcription machinery; moreover, the astonishingly high degree of conservation of G-quadruplex sequences in all primate lentiviruses corroborates the idea that these noncanonical nucleic acid structures are crucial elements in the lentiviral biology and thus have been selected for during evolution. In this work, we aimed at investigating the presence and conservation of G-quadruplexes in the Retroviridae family. Genomewide bioinformatics analysis showed that, despite their documented high genetic variability, most retroviruses contain highly conserved putative G-quadruplex-forming sequences in their promoter regions. Biophysical and biomolecular assays proved that these sequences actually fold into G-quadruplexes in physiological concentrations of relevant cations and that they are further stabilized by ligands. These results validate the relevance of G-quadruplexes in retroviruses and endorse the employment of G-quadruplex ligands as innovative antiretroviral drugs. This study indicates new possible pathways in the management of retroviral infections in humans and animal species. Moreover, it may shed light on the mechanism and functions of retrovirus genomes and derived transposable elements in the human genome.Entities:
Keywords: G-quadruplex; LTR promoter; conservation; genome structure; retroviruses
Mesh:
Substances:
Year: 2019 PMID: 31081611 PMCID: PMC6630527 DOI: 10.1021/acsinfecdis.9b00011
Source DB: PubMed Journal: ACS Infect Dis ISSN: 2373-8227 Impact factor: 5.084
Figure 1RV structure and genome organization. (A) Simplified model of an RV virion (left) and of the integrated provirus (right). (B) RV-provirus organization. (C) Regions of the 5′-LTR promoter.
Figure 2Box plots showing average PQS densities (PQS/Kb) in full-length genomes (A) and LTR regions (B) of RVs.
Figure 3PQS distribution along the LTR regions of RVs. Each red circle indicates one PQS.
PQS Analysis Performed with QuadBase2 within the LTR Regions of RVsa
G3 tracts are shown in red and bold, nonoverlapping bulged G3 tracts (e.g., GGXG) are shown in blue and bold, and overlapping bulged G3 tracts (e.g., GXGGG) are underlined.
PQS location: “+” indicates the forward strand, and “–” indicates the reverse strand.
Figure 4Base conservation of putative G4-forming sequences within strains of each RV species. Consensus sequences were obtained by alignment of at least five sequences.
Figure 5Representative CD spectra of RV G4 sequences in the absence (black line) or presence of G4 ligands 1 (red line) and 2 (blue line). (A) G4 CD spectrum, characterized by a maximum peak at λ = 260 nm and a minimum one at λ = 240 nm, which define a parallel conformation. (B) Non-G4 CD spectrum, characterized by a broad signal at 260 < λ < 280 nm. (C–D) Two different mixed-G4 CD profiles.
Figure 6Chemical structures of the G4 ligands B19 (1) and c-exNDI (2) employed in this study.
CD Tm Values of RV G4s in the Absence and Presence of G4 Ligands 1 and 2a
| Δ | ||||||
|---|---|---|---|---|---|---|
| — | ||||||
| beta-RVs | RV1 | 48.1 ± 0.9 | 68.9 ± 0.2 | 60.6 ± 0.8 | 20.8 | 12.5 |
| RV2 | ND | ND | ND | |||
| RV3 | ND | ND | ND | |||
| RV4 | 67.1 ± 1.2 | >90 | >90 | >22.9 | >22.9 | |
| RV5 | 48.0 ± 1.9 | 68.9 ± 3.1 | 85.9 ± 1.2 | 20.9 | 37.9 | |
| ND | ND | 62.3 ± 1.1 | ND | ND | ||
| RV6 | ND | ND | ND | |||
| RV7 | 63.9 ± 0.8 | 75.8 ± 0.9 | >90 | 11.9 | >26.1 | |
| RV8 | ND | ND | ND | |||
| delta-RVs | RV9 | 65.1 ± 0.3 | >90 | >90 | >24.9 | >24.9 |
| 64.9 ± 0.3 | >90 | >90 | >25.1 | >25.1 | ||
| RV10 | 66.4 ± 1.3 | 83.8 ± 2.1 | >90 | 17.4 | >20.6 | |
| 48.9 ± 0.8 | 72.1 ± 0.9 | 70.3 ± 2.5 | 23.2 | 24.4 | ||
| RV11 | 61.4 ± 0.3 | 79.2 ± 0.7 | ND | 14.6 | ND | |
| 56.6 ± 2.1 | 69.0 ± 3.8 | 63.4 ± 0.3 | 12.4 | 6.8 | ||
| RV12 | 63.1 ± 0.4 | ND | ND | ND | ND | |
| 63.3 ± 0.4 | 66.3 ± 0.1 | 66.9 ± 0.8 | 3.2 | 3.8 | ||
| RV13 | ND | ND | ND | |||
| RV14 | 65.5 ± 0.8 | >90 | >90 | >24.5 | >24.5 | |
| gamma-RVs | RV15 | 55.4 ± 0.1 | >90 | >90 | >34.6 | >34.6 |
| ND | 67.0 ± 0.1 | 62.1 ± 2.6 | ND | ND | ||
| RV16 | ND | ND | ND | ND | ND | |
| 53.3 ± 1.4 | 63.4 ± 1.0 | 68.5 ± 2.3 | 10.1 | 15.2 | ||
| RV17 | 52.3 ± 0.8 | 86.7 ± 1.0 | 57.0 ± 3.4 | 33.7 | 4.7 | |
| RV18 | 59.9 ± 0.4 | 76.5 ± 0.1 | 70.6 ± 1.0 | 16.6 | 10.7 | |
| RV19 | 66.8 ± 0.1 | 77.6 ± 0.6 | 85.1 ± 0.1 | 10.8 | 18.3 | |
| RV20 | ND | ND | ND | |||
| RV21 | 56.8 ± 0.1 | ND | 65.6 ± 1.8 | ND | 8.8 | |
| 56.1 ± 0.1 | 60.0 ± 2.4 | 69.6 ± 2.0 | 3.9 | 13.5 | ||
| RV22 | 54.2 ± 0.8 | ND | ND | ND | ND | |
| 54.7 ± 0.6 | 64.9 ± 2.9 | 75.9 ± 3.9 | 10.2 | 21.2 | ||
| RV23 | 56.9 ± 2.7 | 83.8 ± 3.9 | 81.4 ± 2.2 | 25 | 22.6 | |
| RV24 | 41.2 ± 0.2 | 50.8 ± 0.1 | 43.9 ± 3.0 | 9.6 | 2.7 | |
| RV25 | 53.4 ± 0.1 | >90 | >90 | >35.6 | >35.6 | |
| RV26 | 73.6 ± 0.6 | >90 | >90 | >16.4 | >16.4 | |
| RV27 | ND | ND | ND | |||
| RV28 | >90 | >90 | >90 | ND | ND | |
| ND | 58.5 ± 4.5 | 71.0 ± 0.5 | ||||
| RV29 | ND | ND | ND | ND | ND | |
| 53.3 ± 1.4 | 63.4 ± 1.0 | 68.5 ± 2.3 | 10.1 | 15.2 | ||
Data are reported as mean values ± SD from at least two independent experiments. In cases of double transitions, Tm values calculated at λ = 260 nm (first value) and 290 nm (second value) are shown.
Figure 7Representative Taq-polymerase stop assay of RV G4 sequences. (A) Templates amplified by Taq polymerase at the indicated temperature in the absence (lane 1) or presence of 100 mM K+ alone (lane 2) or with G4 ligand 1 (lane 3) or 2 (lane 4). A template sequence (non-G4 cnt) made of a scrambled sequence unable to fold into a G4 was also used as an internal control. Lane P: unreacted labeled primer. Lane M: ladder of markers obtained by the Maxam and Gilbert sequencing protocol carried out on the amplified strand complementary to the template strand. Vertical bars indicate G4-specific Taq-polymerase stop sites. (B) Quantification of lanes shown in panel (A). Quantification of stop bands corresponding to G4 and of the full-length amplification product (FL) is shown.