| Literature DB >> 19036138 |
Wen-Chi Chang1, Tzong-Yi Lee, Hsien-Da Huang, His-Yuan Huang, Rong-Long Pan.
Abstract
BACKGROUND: The elucidation of transcriptional regulation in plant genes is important area of research for plant scientists, following the mapping of various plant genomes, such as A. thaliana, O. sativa and Z. mays. A variety of bioinformatic servers or databases of plant promoters have been established, although most have been focused only on annotating transcription factor binding sites in a single gene and have neglected some important regulatory elements (tandem repeats and CpG/CpNpG islands) in promoter regions. Additionally, the combinatorial interaction of transcription factors (TFs) is important in regulating the gene group that is associated with the same expression pattern. Therefore, a tool for detecting the co-regulation of transcription factors in a group of gene promoters is required.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19036138 PMCID: PMC2633311 DOI: 10.1186/1471-2164-9-561
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1System flow of PlantPAN. PlantPAN has two query interfaces. "Gene group analysis" discovers the co-occurrence of TFBSs in a group of gene promoters; "Promoter analysys" contains three subfunctions: "Search" and "Novel promoter sequence" search TFBSs, CpG/CpNpG islands and tandem repeats in a single input gene ID or a novel input promoter sequence; "Cross-Species" identifies TFBSs in conserved regions between homologous or two promoters.
Data statistics of PlantPAN.
| No. of gene transcripts | 35,351 | 62,827 | 29,759 | - |
| No. of promoter sequences | 35,351 | 62,827 | 29,759 | - |
| No. of experimental promoter sequences | 13 | 6 | 16 | 167a |
| No. of transcripts containing putative CpG/CpNpG Islands (predicted by CpGProD) | 6,912 | 60,470 | 16,110 | - |
| No. of transcripts containing putative tandem repeats (predicted by TRF) | 18,080 | 45,409 | 11,535 | - |
| No. of plant transcription factors used in PlantPANb | 197 | 75 | 59 | 260c |
a Including medicago, barley, Catharanthus roseus, cider tree, french bean, snapdragon, horseradish, maize, parsley, pea, petunia, Brassica napus, soybean, potato, tobacco, tomato, and wheat.
b Collecting 591 plant transcription factors from PLACE [16], TRANSFAC public release 7.0 [12], AGRIS [17], and JASPER [18].
c Including bean, barley, carrot, cotton, medicago, parsley, pea, petunia, potato, rape, snapdragon, soybean, sweet potato, tobacco, tomato, wheat et al.
Supported regulatory features in PlantPAN.
| Promoter sequences and location sites | TAIR [ | Containing the information on the TSS and sequence location sites of |
| TIGR [ | Containing the information on the TSS and sequence location sites of | |
| ZmGDB [ | Containing the information on the 2 kb upstream location sites of | |
| Transcription factor binding sites | TRANSFAC public release 7.0 [ | Collecting experimentally verified transcription factors, their genomic binding sites and DNA-binding profiles. |
| PLACE [ | A database of nucleotide sequence motifs found in plant | |
| AGRIS [ | Collecting approximately 1,770 | |
| JASPER [ | A popular open-access database for matrix models describing DNA-binding preferences for transcription factors and other DNA patterns. | |
| MATCH [ | Scanning transcription factor binding sites using transcription factor binding profiles from TRANSFAC and PLACE. | |
| CpG/CpNpG islands | CpGProD [ | Detecting CpG/CpNpG islands. |
| Tandem repeats | TRF [ | Finding the tandem repeat. |
| Conservation of homologous gene promoter sequences | BLAST [ | Searching sequence similarity; it is also used for discovering similar gene promoters and identifying conserved regions in the PlantPAN assistant promoter database. |
| BL2SEQ [ | Utilizing the BLAST algorithm for identifying conserved regions in two sequences. | |
| Co-occurrence transcription factor binding sites in a gene group of gene promoters | Mining the co-occurrence of transcription factor binding sites in a group of gene promoters. | |
Figure 2Gene group analysis in PlantPAN. The "Gene group analysis" process has seven steps. Following GO function analysis, promoter extraction and TFBS scanning, the co-occurrence of TFBSs and combinatorial TFBSs in a group of gene promoters is tabulated and presented in two figures (with and without distance constraint).
Figure 3Results of case study I in "Gene group analysis". (A) Reference case taken from Chawade et al., 2007 [10]. The genes used in the case study are At4g17550.1, At1g20450.1, At5g52310.1, At4g37150.1, and At1g20440.1. The origin of the arrow indicates the regulating TF family and the endpoint of the arrow indicates the target gene. The time scale shown on the vertical axis is cold treatment of plant. (B) CBFHV (AP2) displayed co-occurrences in At4g17550.1, At1g20450.1, At5g52310.1, At4g37150.1, and At1g20440.1 (C) CBFHV (AP2) and DOF represented combinatorial co-occurrences in At5g52310.1, At4g17550.1, At4g37150.1, and At1g20440.1 with 100 bp distance constraint between CBFHV and DOF.
Figure 4Web interface for a search for a single gene in PlantPAN. The "Search" web tool can be used to search for general gene information and gene regulatory features; furthermore, (B) tabulated results contain general gene information and "Promoter Analysis" functions. The "Promoter analysis" functions can be used to identify regulatory elements in the promoter sequence.
Figure 5Graphical view of a case (AT1G48990) of "Cross-Species" analysis. The conserved regions and TFBSs in the conserved regions are shown in a figure significantly. Each conserved site or TFBS can be further clicked for more detailed information.