| Literature DB >> 15980502 |
Oleg V Vishnevsky1, Nikolay A Kolchanov.
Abstract
Reliable recognition of the promoters in eukaryotic genomes remains an open issue. This is largely owing to the poor understanding of the features of the structural-functional organization of the eukaryotic promoters essential for their function and recognition. However, it was demonstrated that detection of ensembles of regulatory signals characteristic of specific promoter groups increases the accuracy of promoter recognition and prediction of specific expression features of the queried genes. The ARGO_Motifs package was developed for the detection of sets of region-specific degenerate oligonucleotide motifs in the regulatory regions of the eukaryotic genes. The ARGO_Viewer package was developed for the recognition of tissue-specific gene promoters based on the presence and distribution of oligonucleotide motifs obtained by the ARGO_Motifs program. Analysis and recognition of tissue-specific promoters in five gene samples demonstrated high quality of promoter recognition. The public version of the ARGO system is available at http://wwwmgs2.bionet.nsc.ru/argo/ and http://emj-pc.ics.uci.edu/argo/.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15980502 PMCID: PMC1160220 DOI: 10.1093/nar/gki459
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Layout of the algorithm for the recognition of degenerate oligonucleotide motifs in a promoter sample.
Figure 2Example of determination of the set of permissible nucleotides for each position of the [−50; −10] region of an erythroid-specific promoter.
Figure 3Example of ARGO_Motifs input and output windows. (A) Input window. The region [−50; +1] of promoters of erythroid-specific genes is analyzed. (B) A table containing the motifs detected and their characteristics. (C) A distribution pattern of the motifs found.
Figure 4Profile of the promoter recognition function for the sequence of the human β-globin gene clusters (EMBL ID: HSHBB). Values of the recognition function (ordinate) are plotted versus positions of the sequence (abscissa). Arrows indicate the positions of the transcription starts of the genes in this cluster. The triangle shows the position of the 5′-terminal region of the pseudogene corresponding to the transcription start point.