Literature DB >> 19238240

Identification and analysis of putative promoter motifs in Flavivirus genome.

Pallavi Somvanshi1, Vijai Singh, Prahlad Kishore Seth.   

Abstract

The genus Flavivirus comprises medically significant pathogenic virus; causing several infections in humans worldwide. Flavivirus genomes are 10-11 kb approximately and encode both structural and non structural region. The non structural region plays fundamental role in the stability, regulation and cell cycle of virus. The complete genomes of 26 Flavivirus were used for identification of promoter motifs through in silico approaches. The promoter sequences were encoded in merely 16 viruses and 10 viruses could not encode it. All these in silico identified promoter motifs was confirmed and verified with the known experimental data. This analysis suggests that presence of promoter may play a crucial role in the pattern of gene expression, regulation networks, cell specificity and development. It may also be useful for designing efficient expression vector and target specific delivery system in the gene therapy.

Entities:  

Keywords:  Flavivirus; cell cycle; disease; promoter; regulation

Year:  2008        PMID: 19238240      PMCID: PMC2637965          DOI: 10.6026/97320630003162

Source DB:  PubMed          Journal:  Bioinformation        ISSN: 0973-2063


Background

Flavivirus consist of globally distributed group of arbovirus transmitted mainly by tick or mosquito vectors. The most significant Flavivirus are mosquito transmitted dengue virus which causes hemorrhagic fever (HF); encephalitis caused by Japanese encephalitis (JE) in tropical and subtropical region of the world. Around 50 - 100 million case of dengue fever are annually reported in more than 80 countries where the mosquito vector Aedes aegypti is endemic and approximately 500,000 patients suffer from dengue hemorrhagic fever and dengue shock syndrome. JE virus is the leading cause of arboviral encephalitis in Asia, accounting for 30,000 to 50,000 cases annually. St. Louis encephalitis virus causes sporadic epidemic encephalitis in the Americas. West Nile virus (WNV) has caused more than 9,000 cases in North America since 1999 [1]. Flavivirus genome is a single-stranded, positive sense RNA of 10-11 kb containing a single ORF and is the only viral mRNA produced during the virus replication cycle. The replication takes place in the perinuclear region of cytoplasm in the infected cells. Three structural (capsid, premembrane and envelope) and seven nonstructural (NS1, NS2a, NS2b, NS3, NS4a, NS4b, and NS5) viral proteins are produced by proteolytic processing of the single polyprotein by viral and cellular proteases. The genome of Flavivirus open reading frame is flanked by the untranscribed region (UTR) in 5` and 3`. The UTR forms a complex RNA structure containing functional domain that are believed to play a role in virus translation, replication or assembly. These generates lots of scientific interest since, genetic modification within these region are known to attenuate Flavivirus without altering their antigenic specificity making them potential vaccine candidate for live attenuated vaccine [2]. A wide range of algorithms has been developed to assist the identification of promoters in genomic sequence of many gene prediction methods. The regulatory element utilizing the TATT-box has been reported in the genome of Epstein ‐ Barr virus (EBV). The motif was present in the promoters of lytic cycle genes and resembles a crucial host genome motif (TATA-box). Since the binding specificity of eukaryotic proteins recognizing TATA-box (TBP) was determined and no specific preference for interaction with TATT motif was found [3]. Consensus patterns of baculovirus sequences upstream from the translational initiation sites have been analyzed and a web tool. Potential consensus sequences such as TCATTGT, TCTTGTA, CTCGTAA, TCCATTT and TCATT plus TCGT in approximately 30 bp spacing context have been obtained [4]. EIA and blCP0 are the activators for gene expression in adenovirus. It does not specifically bind both DNA and proteins interacting with chromatin remodeling enzymes. The capability of E1A to bind with Rb family members plays role in stimulating productive infection suggesting that E2F family members activate productive infection [5]. Identification of transcription factor sites require for the function and regulation of genes. The evaluation of sequences specific DNA binding transcription factors belong to CTF/NF1 family has been reported [6]. Promoter is a most important regulatory region that controls and regulates gene expression at the transcription level. It contains specific DNA sequences, response elements that are recognized by proteins known as transcription factors. These factors bind to the promoter sequences, recruiting RNA polymerase, the enzyme that synthesizes the RNA from coding region of the gene. However, a limited number of data is available on the promoter motifs in the genus Flavivirus. The present study was carried out to identify and analyze the putative promoter region present in Flavivirus.

Methodology

Retrieval of genome sequence

The complete genome sequences of Flavivirus were retrieved from biological databases such as National Centre for Biotechnology Information cited at http://www.ncbi.nlm.nih.gov/genomes/VIRUSES/viruses.html and the Universal Virus Database of the International Committee on Taxonomy of Viruses (ICTVdB) genome database cited at http://www.ncbi.nlm.nih.gov/ICTVdb/.

BLAST

The relatedness of sequences deposited in databases was evaluated by BLAST (Basic Local Alignment Search Tool) implemented via the NCBI website (www.ncbi.nlm.nih.gov/blast) against the complete training dataset which is extracted from Genbank database. All these identified promoters were verified and searched for homology in the database.

Analysis of Flavivirus genome

The size of Flavivirus genome was analyzed with the aid of Generunner, DNAstar and ExPaSy tools. The G+ C content of each genome was also predicted (Table 1 under supplementary material).

Transcription start site

The transcription factor sites are over represented in the promoter region. It is natural to seek a prediction program based on putative TF site density. The PROMOTERSCAN program was used to identify the putative promoter in the genome of Flavivirus. This program comprises three database such as TF database, promoter database and non promoter set constructed from protein and RNA gene sequences. The density of all putative TF site is calculated separately for promoter and non promoter sequences scoring functions supplemented with a TATA matrix score [7].

Results and discussion

In the present study, complete genome sequences of 26 Flavivirus was analyzed. Within studied Flavivirus, the size of 24 genomes was approximately 10 kb. Merely, two genome sizes of Kamati River and Tick borne encephalitis virus was 11 kb. The smallest genome size is 10,053 bases of Tamana bat virus and the highest genome size (11,375 bases) is of Kamati river virus. The lower G+ C percentage of Tamana bat virus was 38.43 and the highest G+C content of Louping ill virus was 54.85. The genome size and G+C contents of Flavivirus were given (Table 1 under supplementary material). The establishment of persistent noncytopathic replication by replicon RNAs of a number of positive-strand RNA viruses usually leads to generation of adaptive mutations in nonstructural genes. Some of these adaptive mutations in hepatitis C virus increase the ability of RNA replication to resist the antiviral action of alpha/beta interferon (IFN-alpha/beta) and Sindbis virus may also lead to more efficient IFN production [8]. Identification of important putative promoter in the complete genome of 26 medically significant Flavivirus was done. Total 22 different types of promoter were identified in the genomes and given (Table 2, see supplementary material). All these identified promoter motifs were confirmed with experimental obtained existing data. Sixteen viruses have the putative promoter sequences while ten Flavivirus do not have the promoter motifs in their genome. The identified putative promoter of Flavivirus was given (Table 3 under supplementary material). During the time of replication and multiplication all the host machinery was utilized by Flavivirus which did not have the promoter sequences. The numbers of experimentally confirmed reports are available on the identification and characterization of promoter in the virus genome. The K1 gene of Kaposi's sarcoma-associated herpesvirus (KSHV) encodes a 46-kDa transmembrane glycoprotein that possesses transforming properties initiates signaling pathways in B cells and prevents apoptosis. K1 promoter demonstrated that purified Rta protein bound to the K1 at three locations independent of other DNA-binding factors [9]. Kaposi sarcoma-associated herpesvirus vIRF is a viral transcription factor that inhibits interferon signaling and transforms NIH 3T3 cells but does not bind interferon-stimulated response element (ISRE) DNA sequences [10]. A eukaryotic promoter-specific activator protein (activators) stimulates the transcription. An acidic activator can directly interact with the transcription factor TFIIB and increase the stable assembly into a preinitiation complex [11]. Adult T-cell leukemia (ATL) is a complex and multifaceted disease associated with human T-cell leukemia virus type 1 (HTLV-I) infection. Viral oncoprotein is considered a major contributor to cell cycle deregulation in HTLV-I transformed cells by either directly disrupting cellular factors or altering their transcription profile. Tax transactivates these cellular promoters by interacting with transcription factors such as CREB/ATF, NF-kappaB, and SRF [12]. The transcription factor TFIID consisting of TATA-binding protein (TBP) and TBP-associated factors (TAFs) plays a central role in both positive and negative regulation of transcription. The TAF N-terminal domain (TAND) of TAF1 has been shown to interact with TBP and to modulate the interaction of TBP with the TATA box, which is required for transcriptional initiation and activation of TATA-promoter operated genes [13]. The regulatory element utilizing TATT box has been reported in the genome of Epstein ‐ Barr virus (EBV). The motif is present in promoters of lytic cycle genes and resembled a crucial host genome motif (TATA-box). Since the binding specificity of eukaryotic proteins recognizing TATA-box (TBP) was determined and no specific preference for interaction with TATT motif was found [3]. Consensus patterns of baculovirus sequences upstream from the translational initiation sites have been analyzed and a web tool Local Alignment Promoter Predictor (LAPP) for the prediction of baculovirus promoter sequences has also been developed. Potential consensus sequences, i.e., TCATTGT, TCTTGTA, CTCGTAA, TCCATTT and TCATT plus TCGT in approximately 30 bp spacing context, have been found in baculovirus promoter regions, in addition to well characterized late and early promoter elements G/T/ATAAG and TATAA, which is accompanied about 30-bp downstream by a transcriptional initiation sequence CAGT or CATT [4]. The adenovirus E1A gene and bICP0 encode proteins that are potent activators of viral gene expression. They do not specifically bind DNA and both proteins interact with chromatin-remodeling enzymes. A functional similarity of E1A was tested initially to see if it could stimulate BHV-1 productive infection. E1A consistently stimulates BHV-1 productive infection, but not as efficiently as bICP0. The ability of E1A to bind Rb family members plays a role in stimulating productive infection, suggesting that E2F family members activate productive infection. E2F-4, but not E2F-1, E2F-2 or E2F-5, activates productive infection with similar efficiency as E1A [5]. Accurate prediction of transcription factor binding sites is needed to unravel the function and regulation of genes discovered in genome sequencing projects. To evaluate current computer prediction tools, we have begun a systematic study of the sequence-specific DNA-binding of a transcription factor belonging to the CTF/NFI family [6]. White Spot Syndrome Virus is a member of the virus family Nimaviridae and infecting shrimp and other crustacean species. The complete genome was analyzed to in silico identify the conserved promoter motifs. In the 5′ upstream region contained the TATA box element is similar to the Drosophila RNA polymerase II core promoter sequences and utilization of the cellular transcription machinery for generating early transcripts [10].

Conclusions

The in silico identification of promoter motifs in the genome of Flavivirus was done. These promoters play vital role in the regulation of gene expression. Delineation of the promoter is fundamental for understanding gene expression patterns, regulation networks, cell specificity and development. It is also important for designing efficient expression vector or to target specific delivery system in the gene therapy. These results might help in designing the live attenuated vaccine candidate through the site directed mutagenesis in the promoter region. In the large scale genomic era promoter prediction is crucial for gene discovery and annotation.
  12 in total

1.  Inhibition of alpha/beta interferon signaling by the NS4B protein of flaviviruses.

Authors:  Jorge L Muñoz-Jordán; Maudry Laurent-Rolle; Joseph Ashour; Luis Martínez-Sobrido; Mundrigi Ashok; W Ian Lipkin; Adolfo García-Sastre
Journal:  J Virol       Date:  2005-07       Impact factor: 5.103

2.  Analysis and prediction of baculovirus promoter sequences.

Authors:  Ke Xing; Riqiang Deng; Jinwen Wang; Jinghua Feng; Mingsong Huang; Xunzhang Wang
Journal:  Virus Res       Date:  2005-10       Impact factor: 3.303

3.  Identification of Herpes TATT-binding protein.

Authors:  Lucjan S Wyrwicz; Leszek Rychlewski
Journal:  Antiviral Res       Date:  2007-03-28       Impact factor: 5.970

4.  Characterization of Kaposi's sarcoma-associated herpesvirus (KSHV) K1 promoter activation by Rta.

Authors:  Brian S Bowser; Stephanie Morris; Moon Jung Song; Ren Sun; Blossom Damania
Journal:  Virology       Date:  2006-03-20       Impact factor: 3.616

5.  Three unrelated viral transforming proteins (vIRF, EBNA2, and E1A) induce the MYC oncogene through the interferon-responsive PRF element by using different transcription coadaptors.

Authors:  S Jayachandra; K G Low; A E Thlick; J Yu; P D Ling; Y Chang; P S Moore
Journal:  Proc Natl Acad Sci U S A       Date:  1999-09-28       Impact factor: 11.205

6.  Predicting Pol II promoter sequences using transcription factor binding sites.

Authors:  D S Prestridge
Journal:  J Mol Biol       Date:  1995-06-23       Impact factor: 5.469

7.  Interaction between an acidic activator and transcription factor TFIIB is required for transcriptional activation.

Authors:  S G Roberts; I Ha; E Maldonado; D Reinberg; M R Green
Journal:  Nature       Date:  1993-06-24       Impact factor: 49.962

8.  Analysis of adaptive mutations in Kunjin virus replicon RNA reveals a novel role for the flavivirus nonstructural protein NS2A in inhibition of beta interferon promoter-driven transcription.

Authors:  Wen Jun Liu; Hua Bo Chen; Xiang Ju Wang; Hester Huang; Alexander A Khromykh
Journal:  J Virol       Date:  2004-11       Impact factor: 5.103

9.  Stimulation of bovine herpesvirus-1 productive infection by the adenovirus E1A gene and a cell cycle regulatory gene, E2F-4.

Authors:  Vicki Geiser; Clinton Jones
Journal:  J Gen Virol       Date:  2003-04       Impact factor: 3.891

10.  Structural and functional characterization on the interaction of yeast TFIID subunit TAF1 with TATA-binding protein.

Authors:  Tapas K Mal; Yutaka Masutomi; Le Zheng; Yasuto Nakata; Hiroshi Ohta; Yoshihiro Nakatani; Tetsuro Kokubo; Mitsuhiko Ikura
Journal:  J Mol Biol       Date:  2004-06-11       Impact factor: 5.469

View more
  1 in total

1.  Identification and analysis of putative promoter motifs in bovine herpes virus.

Authors:  Mahantesh Mallikrjun Kurjogi; Rajeshwari Danappa Sanakal; Basappa Basaveneppa Kaliwal
Journal:  Bioinformation       Date:  2012-11-23
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.