| Literature DB >> 23314754 |
Mahantesha Naika1, Khader Shameer, Oommen K Mathew, Ramanjini Gowda, Ramanathan Sowdhamini.
Abstract
Understanding the principles of abiotic and biotic stress responses, tolerance and adaptation remains important in plant physiology research to develop better varieties of crop plants. Better understanding of plant stress response mechanisms and application of knowledge derived from integrated experimental and bioinformatics approaches are gaining importance. Earlier, we showed that compiling a database of stress-responsive transcription factors and their corresponding target binding sites in the form of Hidden Markov models at promoter, untranslated and upstream regions of stress-up-regulated genes from expression analysis can help in elucidating various aspects of the stress response in Arabidopsis. In addition to the extensive content in the first version, STIFDB2 is now updated with 15 stress signals, 31 transcription factors and 5,984 stress-responsive genes from three species (Arabidopsis thaliana, Oryza sativa subsp. japonica and Oryza sativa subsp. indica). We have employed an integrated biocuration and genomic data mining approach to characterize the data set of transcription factors and consensus binding sites from literature mining and stress-responsive genes from the Gene Expression Omnibus. STIFDB2 currently has 38,798 associations of stress signals, stress-responsive genes and transcription factor binding sites predicted using the Stress-responsive Transcription Factor (STIF) algorithm, along with various functional annotation data. As a unique plant stress regulatory genomics data platform, STIFDB2 can be utilized for targeted as well as high-throughput experimental and computational studies to unravel principles of the stress regulome in dicots and gramineae. STIFDB2 is available from the URL: http://caps.ncbs.res.in/stifdb2.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23314754 PMCID: PMC3583027 DOI: 10.1093/pcp/pcs185
Source DB: PubMed Journal: Plant Cell Physiol ISSN: 0032-0781 Impact factor: 4.927
Summary of species, stress-responsive genes, stress-responsive transcription factors and stress signals in STIFDB2
| Species | Stress-responsive genes in STIFDB2 | Stress-responsive transcription factors | Stress signals |
|---|---|---|---|
| 3,150 | ABRE_ABI3_VP1 | ABA, aluminum, cold, cold–drought–salt, dehydration, drought, high light, iron, NaCl, osmotic stress, oxidative stress, UV-B and wounding | |
| AuxRE_ARF | |||
| C_ABRE_bZIP | |||
| DREB_AP2_EREBP | |||
| GCC_box_AP2_EREBP | |||
| G_ABRE_bZIP | |||
| G_box1_bZIP | |||
| G_box2_bZIP | |||
| G_box_bHLH | |||
| HBE_HB | |||
| HSE1_HSF | |||
| Myb_box1_MYB | |||
| Myb_box2_MYB | |||
| Myb_box3_MYB | |||
| Myb_box4_MYB | |||
| Myb_box5_MYB | |||
| N_box_bHLH | |||
| Nac_box_NAC | |||
| W_box_WRKY | |||
| 1,118 | ABRE_ABI3_VP1 | ABA, bacterial blight, cold, drought, heat, iron and NaCl | |
| ABRE_bZIP | |||
| DREB_AP2_EREBP | |||
| G_box1_bZIP | |||
| Nac_box_NAC | |||
| OsIRO2_bHLH | |||
| PRE2_WRKY | |||
| PRE4_WRKY | |||
| 1,716 | DREB_AP2_EREBP | Cold, drought and NaCl | |
| ABRE_bZIP | |||
| G_box1_bZIP | |||
| Nac_box_NAC |
Transcription factors are named using the name of the cis-element followed by the transcription family name or subfamily name.
Transcription factor family/subfamily, cis-element, consensus binding site data and corresponding references (PubMed identifiers) used to perform transcription factor binding site prediction using stress-responsive genes curated from the literature (PubMed identifiers of studies are provided) in Oryza sativa subsp. japonica and Oryza sativa subsp. indica
| Transcription factor family name | Stress signal | Reference (Stress signal) | Name of the | Reference ( | |
|---|---|---|---|---|---|
| ABI3/VP1 | Iron toxicity | 19737364 | ABRE | CATGC | 19737364 |
| AP2/EREBP | Cold, drought, NaCl | 12609047 | CRT/DRE | RCCGAC | 12609047 |
| 15834008 | (ACCGAC | ||||
| 18470484 | GCCGAC) | ||||
| bHLH | Iron toxicity | 16887895 | OsIRO2 | CACGTGG | 16887895 |
| bZIP | ABA, NaCl, drought, heat | 18236009 | G-box1 | CCACGTGTC | 18236009 |
| (C/T)ACGTGGC | 11828032 | ||||
| 11828032 | ABRE | ||||
| NAC | Cold, drought,NaCl, ABA | 20632034 | CATGTG | 17587305 | |
| 19135985 | |||||
| 16924117 | |||||
| WRKY | Bacterial blight | 17986178 | PRE2 | ACGCTGCCG | 17986178 |
| PRE4 | TGCGCTT | ||||
| AP2/EREBP | Drought, cold, salinity | 12609047 | CRT/DRE | (G/A)CCGAC | 12609047 |
| 18470484 | |||||
| bZIP | ABA, NaCl, drought | 10636868 | G-box1 | CCACGTGTC | 18236009 |
| 19048288 | ABRE | (C/T)ACGTGGC | 11828032 | ||
| NAC | Cold, drought, NaCl | 17587305 | CATGTG | 17587305 | |
Orthologs of stress-responsive genes compiled in STIFDB2 from Ensembl Plant BioMart using the Compara pipeline
| Species | Stress- responsive genes in STIFDB2 | Orthologs retrieved using BioMart | ||
|---|---|---|---|---|
| Sorghum ( | Maize ( | Soybean ( | ||
| 3,150 | 2,515 | 2,506 | 2,519 | |
| 1,118 | 909 | 838 | 727 | |
| 1,716 | 1,155 | 1,055 | 926 | |
Fig. 1Bioinformatics pipeline used to develop STIFDB2.
Fig. 2Distribution of transcription factor binding sites predicted in the up-regulated stress-responsive genes due to the perturbation of various stress signals in (a) Arabidopsis thaliana (b) Oryza sativa subsp. japonica and (c) Oryza sativa subsp. indica.
Fig. 3Chromosomal distribution of stress-responsive genes in STIFDB2. The x-axis shows the number of stress-up-regulated genes curated from publicly available microarray data and the y-axis shows chromosome numbers.
GO terms used for target text mining of stress-responsive genes to find transcriptional regulatory cascades
| GO identifier | GO term |
|---|---|
| GO:0003700 | Sequence-specific DNA binding transcription factor activity |
| GO:0001073 | DNA binding transcription antitermination factor activity |
| GO:0001199 | Metal ion-regulated sequence-specific DNA binding transcription factor activity |
| GO:0001130 | Sequence-specific DNA binding bacterial-type RNA polymerase transcription factor activity |
| GO:0001142 | Sequence-specific DNA binding mitochondrial RNA polymerase transcription factor activity |
| GO:0001167 | Sequence-specific DNA binding RNA polymerase I transcription factor activity |
| GO:0000981 | Sequence-specific DNA binding RNA polymerase II transcription factor activity |
| GO:0001034 | Sequence-specific DNA binding RNA polymerase III transcription factor activity |
| GO:0001011 | Sequence-specific DNA binding RNA polymerase recruiting transcription factor activity |
| GO:0001010 | Sequence-specific DNA binding transcription factor recruiting transcription factor activity |
| GO:0000975 | Regulatory region DNA binding |
| GO:0043565 | Sequence-specific DNA binding |
| GO:0000975 | Regulatory region DNA binding |
| GO:0043565 | Sequence-specific DNA binding |
| GO:0043566 | Structure-specific DNA binding |
Summary of the transcription factors identified using the targeted annotation mining approach
| Target genes in STIFDB | GO term hits | Pfam domain hits | |
|---|---|---|---|
| ABRE_ABI3_VP1 | 5 | 1 | 1 |
| AuxRE_ARF | 1,405 | 117 | 40 |
| C_ABRE_bZIP | 248 | 27 | 8 |
| DREB_AP2_EREBP | 952 | 82 | 31 |
| GCC_box_AP2_EREBP | 299 | 12 | 4 |
| G_ABRE_bZIP | 173 | 16 | 4 |
| G_box1_bZIP | 111 | 6 | 2 |
| G_box2_bZIP | 1,020 | 60 | 18 |
| G_box_bHLH | 931 | 87 | 28 |
| HBE_HB | 188 | 23 | 8 |
| HSE1_HSF | 2,472 | 197 | 74 |
| Myb_box1_MYB | 2,318 | 176 | 68 |
| Myb_box2_MYB | 1,050 | 103 | 34 |
| Myb_box3_MYB | 1,269 | 108 | 38 |
| Myb_box4_MYB | 659 | 58 | 21 |
| Myb_box5_MYB | 3,113 | 250 | 92 |
| N_box_bHLH | 683 | 47 | 15 |
| Nac_box_NAC | 1,020 | 81 | 30 |
| W_box_WRKY | 2,077 | 167 | 64 |
GO terms used to filter genes associated with transcription factor/transcription factor activities are provided in Table 7.
Pfam domains used to define the functional role pertaining to transcription factor/transcription factor activity are filtered from Pfam2GO annotation (see Supplementary Table S1).
Fig. 4Schematic diagram of (a) normal transcription factor activity and (b) transcriptional regulatory cascade network. Blue rectangles indicate untranslated regions, red rectangles indicate promoter regions, exon regions are highlighted using green, creme and violet color. Introns are defined using a white region interspersed between exons, and polyadenylation sites in the 3′ untranslated regions are colored in orange.
Stress-specific distribution of Arabidopsis thaliana genes in STIFDB2 with predicted binding sites using the STIF algorithm
| Transcription factors | ABA | Alu | Col | CDS | Dehy | Dro | Heat | Light | Iron | NaCl | OS | OxS | UV-B | Wounding |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ABRE_ABI3_VP1 | 1 | 0 | 2 | 0 | 0 | 2 | 0 | 1 | 0 | 3 | 0 | 0 | 0 | 0 |
| AuxRE_ARF | 299 | 13 | 520 | 17 | 48 | 372 | 32 | 313 | 70 | 473 | 31 | 15 | 54 | 23 |
| C_ABRE_bZIP | 52 | 1 | 105 | 3 | 5 | 86 | 6 | 48 | 12 | 80 | 5 | 4 | 10 | 6 |
| DREB_AP2_EREBP | 230 | 8 | 375 | 23 | 29 | 344 | 12 | 198 | 34 | 362 | 13 | 14 | 29 | 17 |
| GCC_box_AP2_EREBP | 70 | 2 | 112 | 5 | 12 | 98 | 2 | 63 | 11 | 88 | 1 | 6 | 9 | 5 |
| G_ABRE_bZIP | 54 | 0 | 63 | 3 | 1 | 79 | 8 | 39 | 1 | 90 | 7 | 2 | 7 | 3 |
| G_box1_bZIP | 33 | 0 | 46 | 3 | 1 | 51 | 6 | 19 | 6 | 61 | 5 | 1 | 8 | 4 |
| G_box2_bZIP | 234 | 7 | 346 | 9 | 29 | 294 | 26 | 223 | 50 | 356 | 24 | 17 | 38 | 12 |
| G_box_bHLH | 241 | 4 | 361 | 20 | 17 | 369 | 33 | 201 | 33 | 405 | 31 | 10 | 40 | 20 |
| HBE_HB | 48 | 2 | 71 | 2 | 6 | 65 | 5 | 34 | 9 | 81 | 5 | 3 | 11 | 2 |
| HSE1_HSF | 563 | 19 | 892 | 30 | 73 | 728 | 51 | 558 | 106 | 876 | 47 | 34 | 94 | 35 |
| Myb_box1_MYB | 502 | 16 | 832 | 29 | 56 | 691 | 49 | 502 | 114 | 822 | 45 | 33 | 95 | 38 |
| Myb_box2_MYB | 247 | 10 | 396 | 15 | 28 | 315 | 13 | 224 | 51 | 381 | 15 | 14 | 47 | 24 |
| Myb_box3_MYB | 307 | 4 | 470 | 15 | 25 | 395 | 24 | 280 | 53 | 472 | 24 | 21 | 55 | 20 |
| Myb_box4_MYB | 142 | 5 | 242 | 4 | 19 | 172 | 8 | 150 | 24 | 248 | 7 | 9 | 29 | 10 |
| Myb_box5_MYB | 703 | 24 | 1,133 | 41 | 93 | 922 | 63 | 674 | 144 | 1,117 | 58 | 40 | 120 | 47 |
| N_box_bHLH | 163 | 6 | 228 | 13 | 13 | 224 | 12 | 164 | 25 | 261 | 10 | 10 | 26 | 15 |
| Nac_box_NAC | 225 | 6 | 352 | 14 | 25 | 330 | 30 | 227 | 50 | 382 | 24 | 12 | 42 | 16 |
| W_box_WRKY | 464 | 20 | 741 | 24 | 64 | 612 | 47 | 463 | 102 | 716 | 45 | 27 | 84 | 30 |
Alu, aluminum; Col, cold; CDS, cold–drought–salt, Dehy, dehydration; Dro, drought; OS, osmotic stress; OxS, oxidative stress.
Stress-specific distribution of genes of Oryza sativa subsp. japonica genes in STIFDB2 with predicted binding sites using the STIF algorithm
| Transcription factors | ABA | Bacterial blight | Cold | Drought | Heat | Iron | NaCl |
|---|---|---|---|---|---|---|---|
| ABRE_ABI3_VP1 | 30 | 424 | 11 | 87 | 531 | 157 | 33 |
| ABRE_bZIP | 3 | 63 | 0 | 14 | 71 | 15 | 6 |
| DREB_AP2_EREBP | 20 | 260 | 9 | 64 | 289 | 102 | 20 |
| G_box1_bZIP | 1 | 18 | 0 | 2 | 13 | 5 | 1 |
| Nac_box_NAC | 16 | 208 | 5 | 54 | 330 | 97 | 21 |
| OsIRO2_bHLH | 6 | 69 | 2 | 17 | 82 | 21 | 3 |
| PRE2_WRKY | 0 | 1 | 2 | 1 | 7 | 2 | 0 |
| PRE4_WRKY | 2 | 26 | 1 | 2 | 30 | 11 | 0 |
Stress-specific distribution of Oryza sativa subsp. indica genes in STIFDB2 with predicted binding sites using the STIF algorithm
| Transcription factors | Cold | Drought | NaCl |
|---|---|---|---|
| ABRE_bZIP | 26 | 98 | 109 |
| DREB_AP2_EREBP | 181 | 473 | 785 |
| G_box1_bZIP | 3 | 29 | 22 |
| Nac_box_NAC | 132 | 321 | 576 |