| Literature DB >> 17764576 |
Aakash Chawade1, Marcus Bräutigam, Angelica Lindlöf, Olof Olsson, Björn Olsson.
Abstract
BACKGROUND: With the advent of microarray technology, it has become feasible to identify virtually all genes in an organism that are induced by developmental or environmental changes. However, relying solely on gene expression data may be of limited value if the aim is to infer the underlying genetic networks. Development of computational methods to combine microarray data with other information sources is therefore necessary. Here we describe one such method.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17764576 PMCID: PMC2001198 DOI: 10.1186/1471-2164-8-304
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Transcription factors and their binding site motifs
| WAAAG | 9.8 | At3g50410 | OBP1 | [55] | |
| At5g60850 | OBP4 | ||||
| At1g26790 | - | ||||
| At1g69570 | DOF1.1 | ||||
| At5g39660 | DOF5.2 | ||||
| At3g47500 | DOF3.3 | ||||
| TTTGACY | 0.7 | At4g01250 | WRKY22 | [49] | |
| At2g38470 | WRKY33 | ||||
| At1g80840 | WRKY40 | ||||
| At2g46400 | WRKY46 | ||||
| At4g23810 | WRKY53 | ||||
| At4g24240 | WRKY7 | ||||
| At3g56400 | WRKY70 | ||||
| AAMAATCT | 0.3 | At2g46830 | CCA1 | [56] | |
| AGCCGCC | 10-2 | At4g17500 | ERF1 | [57] | |
| At5g47230 | ERF5 | ||||
| CACGTGG ( | At1g49720 | ABRE | |||
| 10-2 | At3g17609 | - | |||
| 10-2 | At2g31370 | BZIP59 | [58] | ||
| 10-3 | At4g01120 | GBF2 | [59] | ||
| At2g46270 | BZIP55 | ||||
| At4g34590 | ATB2 | ||||
| CANNTG | 6.8 | At3g05800 | - | [60] | |
| CATGTG | 0.5 | At1g01720 | ATAF1 | [48] | |
| At3g49530 | - | ||||
| At4g27410 | RD26 | ||||
| CCWWWWWWGG | 0.1 | At2g45660 | AGL20 | [61] | |
| CCGAC | 0.5 | At4g25480 | CBF3 | [62] | |
| At4g25470 | CBF1 | ||||
| GTGGNCCC | 0.2 | At4g18390 | - | [63] | |
| NGAANNTTCN ( | 0.8 | At3g24520 | - | [64] | |
| 0.7 | At4g18880 | HSF21 | |||
| At1g67970 | HSF5 |
Transcription factors selected for further analysis. a: Transcription factor family. b: Binding site or motif to which the TF binds, where 'W' = 'A'/'T'; 'M' = 'A'/'C'; 'Y' = 'C'/'T'; 'N' = 'A'/'T'/'G'/'C'. c: Minimum number of copies required to be found within 1 kb upstream of a gene in order for the motif to be considered as over-represented according to the confidence interval threshold. d:Locus ID of the gene. e: Common name of the TF. e: Literature reference.
Motif synergy groups from up-regulated genes
| WRKY | 0.22 | 0.18 | 9 |
| TCP2 | 1 | 0.16 | 3 |
| NAC, WRKY, DOF | 0.27 | 0.2 | 6 |
| NAC, HSF, WRKY | 0.22 | 0.17 | 10 |
| NAC, HSF, DOF | 0.23 | 0.22 | 20 |
| NAC, AP2(CBF), WRKY, DOF | 0.25 | 0.17 | 8 |
| NAC, AP2(CBF), WRKY | 0.27 | 0.2 | 6 |
| NAC, AP2(CBF), HSF, WRKY | 0.24 | 0.17 | 13 |
| NAC, AP2(CBF), HSF, DOF | 0.18 | 0.17 | 11 |
| NAC, AP2(CBF), HSF | 0.26 | 0.18 | 25 |
| MYB, NAC | 0.56 | 0.17 | 10 |
| MYB, MADS | 1 | 0.16 | 3 |
| MYB, DOF | 0.52 | 0.21 | 34 |
| MYB, BZIP, DOF | 0.33 | 0.16 | 3 |
| MYB, BZIP, BHLH, NAC, DOF | 0.33 | 0.16 | 3 |
| MYB, BZIP | 1 | 0.17 | 4 |
| MYB, BHLH, NAC, DOF | 0.47 | 0.2 | 6 |
| MYB, BHLH, NAC | 0.36 | 0.17 | 11 |
| MYB, BHLH, MADS | 1 | 0.16 | 3 |
| MYB, BHLH, DOF | 0.33 | 0.18 | 14 |
| MYB, BHLH | 0.69 | 0.17 | 10 |
| MYB | 0.59 | 0.2 | 19 |
| MADS, DOF | 0.8 | 0.2 | 5 |
| MADS | 1 | 0.17 | 8 |
| HSF, WRKY, DOF | 0.36 | 0.17 | 8 |
| HSF, WRKY | 0.22 | 0.18 | 9 |
| HSF | 0.21 | 0.2 | 28 |
| DOF | 0.23 | 0.19 | 122 |
| BZIP, WRKY, DOF | 0.33 | 0.17 | 4 |
| BZIP, NAC, HSF | 0.33 | 0.16 | 3 |
| BZIP, NAC, AP2(CBF), HSF, WRKY, DOF | 0.33 | 0.16 | 3 |
| BZIP, NAC, AP2(CBF), HSF | 0.33 | 0.16 | 3 |
| BZIP, NAC, AP2(CBF), DOF | 0.4 | 0.2 | 5 |
| BZIP, NAC, AP2(CBF) | 0.67 | 0.16 | 3 |
| BZIP, DOF | 0.22 | 0.18 | 9 |
| BZIP, BHLH, NAC, DOF | 0.67 | 0.16 | 3 |
| BZIP, BHLH | 0.44 | 0.18 | 9 |
| BZIP | 0.52 | 0.21 | 33 |
| BHLH, NAC, DOF | 0.27 | 0.22 | 21 |
| BHLH, NAC | 0.38 | 0.2 | 19 |
| BHLH, MADS, DOF | 1 | 0.16 | 3 |
| BHLH, DOF | 0.49 | 0.21 | 34 |
| BHLH | 0.45 | 0.21 | 35 |
| AP2(CBF), HSF, DOF | 0.19 | 0.18 | 16 |
| AP2(CBF), HSF | 0.23 | 0.2 | 32 |
| AP2(CBF), DOF | 0.27 | 0.17 | 11 |
| AP2(CBF) | 0.27 | 0.2 | 6 |
| AP2(AtERF), HSF, WRKY | 0.33 | 0.16 | 3 |
| AP2(AtERF), DOF | 0.5 | 0.17 | 4 |
Results obtained by applying the method using the confidence interval criterion on the genes filtered from the 24 k gene array data set. a: TF families. b: EC score obtained from the cluster of putative target genes predicted by this method. c: Threshold above which the EC score is acceptable. d: Number of putative target genes that are regulated by the TF families that are indicated in the first column.
Motif synergy groups from down-regulated genes
| AP2(CBF) | 0.34 | 0.22 | 55 |
| AP2(CBF), HSF | 0.42 | 0.21 | 34 |
| BHLH | 0.58 | 0.20 | 214 |
| BHLH, AP2(CBF) | 0.51 | 0.21 | 30 |
| BHLH, AP2(CBF), HSF | 0.23 | 0.20 | 19 |
| BHLH, HSF | 0.27 | 0.21 | 33 |
| BHLH, MADS | 0.50 | 0.20 | 5 |
| BHLH, MADS, AP2(CBF) | 1.00 | 0.16 | 3 |
| BHLH, TCP2 | 0.85 | 0.16 | 8 |
| BZIP | 0.36 | 0.18 | 9 |
| BZIP, BHLH, AP2(CBF) | 0.53 | 0.20 | 6 |
| BZIP, HSF | 0.20 | 0.20 | 6 |
| DOF | 0.38 | 0.20 | 154 |
| DOF, AP2(CBF) | 0.28 | 0.22 | 65 |
| DOF, BHLH | 0.37 | 0.19 | 119 |
| DOF, BHLH, AP2(CBF) | 0.25 | 0.20 | 27 |
| DOF, BHLH, AP2(CBF), HSF | 0.28 | 0.22 | 20 |
| DOF, BHLH, HSF | 0.31 | 0.22 | 43 |
| DOF, BHLH, MADS | 0.39 | 0.17 | 8 |
| DOF, BZIP | 0.33 | 0.16 | 3 |
| DOF, BZIP, AP2(CBF) | 0.40 | 0.20 | 6 |
| DOF, BZIP, BHLH, HSF | 0.53 | 0.20 | 6 |
| DOF, BZIP, HSF | 0.52 | 0.20 | 7 |
| DOF, HSF | 0.21 | 0.21 | 60 |
| DOF, MADS | 1.00 | 0.16 | 3 |
| DOF, MYB | 0.54 | 0.20 | 35 |
| DOF, MYB, AP2(CBF) | 0.30 | 0.20 | 5 |
| DOF, MYB, BHLH | 0.47 | 0.18 | 20 |
| DOF, MYB, MADS | 0.33 | 0.17 | 4 |
| DOF, WRKY | 0.26 | 0.21 | 32 |
| DOF, WRKY, AP2(CBF) | 0.25 | 0.20 | 18 |
| DOF, WRKY, BHLH | 0.24 | 0.20 | 25 |
| DOF, WRKY, BHLH, AP2(CBF), HSF | 0.33 | 0.18 | 15 |
| DOF, WRKY, BHLH, HSF | 0.21 | 0.20 | 27 |
| DOF, WRKY, HSF | 0.26 | 0.22 | 45 |
| MADS | 0.80 | 0.20 | 6 |
| MYB | 0.57 | 0.18 | 20 |
| MYB, AP2(CBF) | 0.57 | 0.17 | 8 |
| MYB, BHLH | 0.63 | 0.18 | 14 |
| WRKY | 0.26 | 0.20 | 28 |
| WRKY, AP2(CBF), HSF | 0.21 | 0.18 | 17 |
| WRKY, BHLH | 0.40 | 0.20 | 27 |
| WRKY, BHLH, AP2(CBF) | 0.25 | 0.18 | 17 |
| WRKY, BHLH, HSF | 0.20 | 0.20 | 25 |
| WRKY, BZIP, HSF | 0.33 | 0.17 | 4 |
| WRKY, HSF | 0.25 | 0.20 | 23 |
Motif synergy groups for down-regulated genes. For explanations see legend in table 2.
Figure 1Part of the predicted regulatory network. Regulatory network at the gene family level. The origin of the arrow indicates the regulating TF family and the endpoint of the arrow indicates the target gene. The time scale is shown on the vertical axis.
Figure 2Large scale regulatory network. Overview of the regulatory network. 2a. Up-regulated gene network. 2b. Down-regulated gene network. The entries in red ovals represent the TFs and the black dots represent target genes. The networks were generated using Graphviz version 1.13–16 [65].
Functional annotations of the TF targets
| 10-1 | 10-1 | 101 | 101 | 10-1 | 10-1 | 10-1 | 10-1 | 10-1 | 101 | |||
| 01.01.03 | Assimilation of ammonia, metabolism of the glutamate group | N/A | N/A | N/A | N/A | N/A | 10-1 | 10-1 | N/A | N/A | N/A | |
| 01.03.04.03 | Pyrimidine nucleotide anabolism | N/A | N/A | N/A | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | |
| 01.05 | C-compound and carbohydrate metabolism | 10-1 | 10-1 | 101 | 10-1 | 101 | 10-1 | 10-1 | N/A | N/A | ||
| 01.05.01.03 | C-compound, carbohydrate anabolism | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |
| 01.05.01.03.02 | Polysaccharide biosynthesis | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |
| 01.06 | Lipid, fatty acid and isoprenoid metabolism | 10-1 | 101 | N/A | 10-1 | N/A | 10-1 | 10-1 | 101 | N/A | N/A | |
| 01.06.01.01 | Phospholipid biosynthesis | N/A | N/A | N/A | 101 | N/A | N/A | N/A | N/A | |||
| 01.06.04 | Degradation of lipids, fatty acids and isoprenoids | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | N/A | |||
| 01.20.17 | Biosynthesis of secondary products derived from primary amino acids | N/A | 10-1 | 10-1 | N/A | N/A | 10-1 | N/A | N/A | N/A | ||
| 01.20.17.01 | Biosynthesis of nonprotein amino acids | 10-1 | N/A | N/A | N/A | N/A | N/A | 10-1 | N/A | N/A | N/A | |
| 01.20.17.09 | Biosynthesis of alkaloids | 10-1 | N/A | N/A | 10-1 | N/A | N/A | 10-1 | N/A | N/A | N/A | |
| 01.20.35 | Biosynthesis of secondary products derived from L-phenylalanine and L-tyrosine | N/A | N/A | 10-1 | N/A | 10-1 | 10-1 | 101 | N/A | N/A | N/A | |
| 01.20.35.01 | Biosynthesis of phenylpropanoids | N/A | N/A | 10-1 | N/A | 10-1 | 10-1 | 101 | N/A | N/A | N/A | |
| 10-1 | 101 | 101 | 10-1 | 10-1 | N/A | N/A | N/A | |||||
| 02.13 | Respiration | 10-1 | N/A | 10-1 | N/A | N/A | 10-1 | 10-1 | N/A | N/A | N/A | |
| 02.13.03 | Aerobic respiration | 10-1 | 10-1 | N/A | 10-1 | N/A | N/A | N/A | N/A | N/A | N/A | |
| 02.19 | Metabolism of energy reserves (e.g. glycogen, trehalose) | 10-1 | 10-1 | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | ||
| N/A | N/A | N/A | N/A | N/A | N/A | 10-1 | N/A | N/A | N/A | |||
| 10-1 | 101 | N/A | 10-1 | N/A | 10-1 | N/A | N/A | N/A | N/A | |||
| 10.03.04.03 | Chromosome condensation | 10-1 | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |
| 10-1 | 10-1 | 101 | 10-1 | N/A | 10-1 | |||||||
| 11.02 | RNA synthesis | 10-1 | 10-1 | 10-1 | 10-1 | N/A | 10-1 | |||||
| 11.02.01 | rRNA synthesis | N/A | N/A | 10-1 | 10-1 | N/A | N/A | N/A | N/A | N/A | ||
| 11.02.03 | mRNA synthesis | 10-1 | 10-1 | 10-1 | 10-1 | N/A | 10-1 | |||||
| 11.02.03.04 | Transcriptional control | 10-1 | 10-1 | 10-1 | 101 | 10-1 | N/A | 10-1 | ||||
| 11.04 | RNA processing | 101 | 10-1 | 10-1 | 10-1 | N/A | N/A | N/A | ||||
| 11.04.01 | rRNA processing | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | N/A | |||
| 11.04.02 | tRNA processing | N/A | N/A | N/A | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | |
| 11.04.03 | mRNA processing (splicing, 5'-, 3'-end processing) | 10-1 | N/A | 10-1 | 10-1 | 10-1 | 10-1 | N/A | N/A | N/A | ||
| 11.04.03.01 | Splicing | 10-1 | N/A | 10-1 | 10-1 | 10-1 | 10-1 | N/A | N/A | N/A | ||
| N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | ||||
| 10-1 | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |||
| 16.13 | C-compound binding | N/A | N/A | 10-1 | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | |
| N/A | 10-1 | 10-1 | 10-1 | 10-1 | 10-1 | N/A | N/A | N/A | N/A | |||
| 18.01.01 | Modification | N/A | N/A | 10-1 | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | |
| 18.02.05 | Regulator of G-protein signalling | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | ||||
| 101 | 101 | 10-1 | N/A | 101 | 10-1 | 10-1 | 10-1 | N/A | N/A | |||
| 20.01 | Transported compounds (substrates) | 101 | 101 | 10-1 | N/A | 10-1 | 10-1 | N/A | N/A | N/A | ||
| 20.01.01 | Ion transport | 10-1 | N/A | 101 | 101 | N/A | 10-1 | 10-1 | N/A | N/A | N/A | |
| 20.01.03 | C-compound and carbohydrate transport | 10-1 | N/A | 10-1 | N/A | N/A | 10-1 | N/A | N/A | N/A | ||
| 20.09.07 | Vesicular transport (Golgi network, etc.) | 10-1 | N/A | 10-1 | N/A | N/A | 10-1 | N/A | 10-1 | N/A | N/A | |
| N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |||
| 10-1 | 101 | 10-1 | N/A | N/A | 10-1 | |||||||
| 32.01 | Stress response | 101 | 101 | 10-1 | N/A | N/A | 10-1 | |||||
| 32.05 | disease, virulence and defense | 101 | 101 | 101 | 10-1 | 10-1 | 10-1 | 10-1 | N/A | N/A | N/A | |
| 32.05.03 | defense related proteins | N/A | 10-1 | 10-1 | 10-1 | 10-1 | 10-1 | N/A | N/A | N/A | ||
| N/A | N/A | 10-1 | N/A | N/A | 10-1 | N/A | N/A | N/A | N/A | |||
| 101 | N/A | 101 | N/A | 10-1 | 10-1 | 101 | N/A | N/A | N/A | |||
| 40.20 | Cell aging | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |
| N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | ||||
| N/A | N/A | 10-1 | 10-1 | N/A | N/A | N/A | N/A | N/A | ||||
| 42.10 | Nucleus | 10-1 | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |
| 42.10.03 | Organization of chromosome structure | 10-1 | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |
| 10-1 | 101 | 10-1 | 101 | 10-1 | 101 | 101 | N/A | 10-1 | ||||
| 70.01 | Cell wall | 10-1 | N/A | N/A | 10-1 | 10-1 | 10-1 | N/A | N/A | N/A | N/A | |
| 70.08 | Golgi | 10-1 | N/A | N/A | N/A | N/A | N/A | 10-1 | N/A | N/A | N/A | |
| 70.09 | Intracellular transport vesicles | 10-1 | N/A | 10-1 | N/A | N/A | N/A | N/A | N/A | N/A | ||
| 70.10 | Nucleus | 101 | 10-1 | 101 | 10-1 | 101 | N/A | N/A | N/A | 10-1 | ||
| 70.10.03 | Chromosome | 10-1 | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |
| N/A | N/A | N/A | N/A | N/A | N/A | N/A | N/A | |||||
| 10-1 | 10-1 | 10-1 | 10-1 | 10-1 |
Significant functional annotations of the cluster of genes putatively regulated by the TF-encoding genes. a and b: annotation categories in the MIPS functional annotation database. c to m: p-values obtained for all the genes putatively regulated by the corresponding TF. The significance of the annotation (p) for the list of all target genes putatively regulated by a TF was calculated using the MIPS online interface. Entries in bold denote significant annotations (p < 0.05).
Figure 3Regulation pattern in up-regulated and down-regulated genes. Percentage of up-regulated and down-regulated genes that are predicted to be regulated by different TF families.