| Literature DB >> 29630682 |
Maryam Zahin1, William L Dean1, Shin-Je Ghim1, Joongho Joh1,2, Robert D Gray1, Sujita Khanal1,3, Gregory D Bossart4,5, Antonio A Mignucci-Giannoni6, Eric C Rouchka7,8, Alfred B Jenson1, John O Trent1,2,3, Jonathan B Chaires1,2,3, Julia H Chariker8,9.
Abstract
The Florida manatee (Trichechus manatus latirotris) is a threatened aquatic mammal in United States coastal waters. Over the past decade, the appearance of papillomavirus-induced lesions and viral papillomatosis in manatees has been a concern for those involved in the management and rehabilitation of this species. To date, three manatee papillomaviruses (TmPVs) have been identified in Florida manatees, one forming cutaneous lesions (TmPV1) and two forming genital lesions (TmPV3 and TmPV4). We identified DNA sequences with the potential to form G-quadruplex structures (G4) across the three genomes. G4 were located on both DNA strands and across coding and non-coding regions on all TmPVs, offering multiple targets for viral control. Although G4 have been identified in several viral genomes, including human PVs, most research has focused on canonical structures comprised of three G-tetrads. In contrast, the vast majority of sequences we identified would allow the formation of non-canonical structures with only two G-tetrads. Our biophysical analysis confirmed the formation of G4 with parallel topology in three such sequences from the E2 region. Two of the structures appear comprised of multiple stacked two G-tetrad structures, perhaps serving to increase structural stability. Computational analysis demonstrated enrichment of G4 sequences on all TmPVs on the reverse strand in the E2/E4 region and on both strands in the L2 region. Several G4 sequences occurred at similar regional locations on all PVs, most notably on the reverse strand in the E2 region. In other cases, G4 were identified at similar regional locations only on PVs forming genital lesions. On all TmPVs, G4 sequences were located in the non-coding region near putative E2 binding sites. Together, these findings suggest that G4 are possible regulatory elements in TmPVs.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29630682 PMCID: PMC5891072 DOI: 10.1371/journal.pone.0195625
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Secondary intramolecular G4 structures (top) and corresponding DNA sequences (bottom) with varying numbers of G-tetrads. This Figure has been modified from [87].
The number of putative G4 sequences identified and the number of structures possible across different regions on forward and reverse DNA strands for each TmPV genome.
| Forward DNA Strand | Reverse DNA Strand | |||||
|---|---|---|---|---|---|---|
| Region | TmPV1 | TmPV3 | TmPV4 | TmPV1 | TmPV3 | TmPV4 |
| E6 | - | 1 (1) | - | - | - | - |
| E7 | 2 (2) | 1 (1) | - | - | - | - |
| E1 | 5 (5) | 3 (3) | 4 (5) | - | - | - |
| E2 | 1 (1) | 1 (1) | 2 (2) | 1 (1) | - | - |
| E1/E2 | 1 (1) | 1 (1) | 1 (1) | - | - | - |
| E2/E4 | 1 (1) | - | 7 (12) | 3 (5) | 3 (6) | 6 (9) |
| Total | 10 (10) | 7 (7) | 14 (20) | 4 (6) | 3 (6) | 6 (9) |
| L2 | 2 (4) | 2 (4) | 3 (3) | 5 (6) | 5 (6) | 7 (7) |
| L1 | 2 (2) | 3 (3) | 2 (2) | 4 (5) | 3 (3) | 3 (3) |
| Total | 4 (6) | 5 (7) | 5 (5) | 9 (11) | 8 (9) | 10 (10) |
| NCR | - | 1 (1) | 1 (3) | 2 (2) | - | 1 (1) |
| Total Genome | 14 (16) | 13 (15) | 20 (28) | 15 (19) | 11 (15) | 17 (20) |
Fig 2Coding and non-coding regions aligned across TmPV1 (outer), TmPV3 (middle), and TmPV4 (inner) to illustrate G4 sequences occurring at the same location within regions on the forward (left) and reverse (right) DNA strands. Blue arrows indicate G4 sequences at the same location on TmPV3 and TmPV4. Red arrows indicate G4 sequences at the same location on all three genomes.
Fig 3Location of G4 sequences in relation to putative E2 binding sites in the NCR region of each manatee papillomavirus.
E2 binding sites are in green. G4 located on the forward DNA strand are in blue. G4 on the reverse DNA strand are in red.
Fig 4Analytical centrifugation of TmPV4 oligonucleotides (A, B, and C), oligonucleotide sequences (D), and CD spectra of TmPV4 oligonucleotides (E).
Hydrodynamic properties of the major TmPV sequences*.
| Sequence | MW (kDa) | s20,w | f/fo | Molecularity |
|---|---|---|---|---|
| TmPV4-1 | 12.1 | 1.96 | 1.36 | monomer |
| TmPV4-2 | 22.5 | 2.79 | 1.58 | monomer |
| TmPV4-3 | 7.18 | 1.8 | 1.09 | dimer |
*Determined using the non-interacting discrete species model in Sedfit.
Fig 5Proposed structural conformation of TmPV4-1 (left), TmPV4-2 (middle), and TmPV4-3 (right). Spheres represent guanine (yellow), adenine (purple), and cytosine (green) nucleotides.
Fig 6Molecular dynamics-derived model of the four stacked contiguous G-quadruplexes for sequence TmPV4-2.
Individual G-quadruplex structures appear from top to bottom in orange, magenta, yellow, and red. G-tetrads are identified in elemental colors in Chimera [.
GenBank sequence information for TmPV genomes.
| Accession no. | Description | Size |
|---|---|---|
| NC_006563.1 | TmPV1 | 7722 bp |
| KP205502.1 | TmPV3 | 7622 bp |
| KP205503.2 | TmPV4 | 7771 bp |
Fig 7A description of the sequence codes provided by Quadparser for each putative G4 sequence.
This Figure has been modified from [87].
Oligodeoxynucleotides used in this study.
| Oligonucleotide | Sequence (5’ to 3’) | ε (mM-1 cm-1) | MW |
|---|---|---|---|
| TmPV4-1 | d[ | 397.5 | 11453.4 |
| TmPV4-2 | d[ | 774.7 | 22045.3 |
| TmPV4-3 | d[ | 125.1 | 3511.3 |
| 1XAV | d[ | 228.7 | 6991.6 |
| 1XAV-complement | d[ | 191.5 | 6480.3 |
Thermodynamic parameters for thermal unfolding of TmPV4 oligonucleotides.
| Oligonucleotide | ||
|---|---|---|
| TmPV4-1 | 49.8 ± 0.2 | 54.7 ± 0.2 |
| TmPV4-2 | 52.1 ± 0.2 | 63.5 ± 0.3 |
| TmPV4-3 | 82.7 ± 0.4 | 63.1 ± 0.2 |