| Literature DB >> 27867372 |
Deeya Saha1, Soumita Podder2, Tapash C Ghosh1.
Abstract
More than a decade, overlapping genes in RNA viruses became a subject of research which has explored various effect of gene overlapping on the evolution and function of viral genomes like genome size compaction. Additionally, overlapping regions (OVRs) are also reported to encode elevated degree of protein intrinsic disorder (PID) in unspliced RNA viruses. With the aim to explore the roles of OVRs in HIV-1 pathogenesis, we have carried out an in-depth analysis on the association of gene overlapping with PID in 35 HIV1- M subtypes. Our study reveals an over representation of PID in OVR of HIV-1 genomes. These disordered residues endure several vital, structural features like short linear motifs (SLiMs) and protein phosphorylation (PP) sites which are previously shown to be involved in massive host-virus interaction. Moreover, SLiMs in OVRs are noticed to be more functionally potential as compared to that of non-overlapping region. Although, density of experimentally verified SLiMs, resided in 9 HIV-1 genes, involved in host-virus interaction do not show any bias toward clustering into OVR, tat and rev two important proteins mediates host-pathogen interaction by their experimentally verified SLiMs, which are mostly localized in OVR. Finally, our analysis suggests that the acquisition of SLiMs in OVR is mutually exclusive of the occurrence of disordered residues, while the enrichment of PPs in OVR is solely dependent on PID and not on overlapping coding frames. Thus, OVRs of HIV-1 genomes could be demarcated as potential molecular recognition sites during host-virus interaction.Entities:
Keywords: HIV-1; gene overlapping; host–pathogen interaction; protein phosphorylation; short linear motifs; structural disorder
Year: 2016 PMID: 27867372 PMCID: PMC5095123 DOI: 10.3389/fmicb.2016.01735
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Multivariate linear regression analysis between length of overlapping regions (OVRs) (independent variable) and disordered residues in OVR, total protein length.
| Covariates | β | |
|---|---|---|
| Number of disordered residues | 3.82 | 1.6 × 10-4 |
| Total protein length | 4.99 | 10-6 |