| Literature DB >> 12877745 |
Diego Lijavetzky1, Pilar Carbonero, Jesús Vicente-Carbajosa.
Abstract
BACKGROUND: Dof proteins are a family of plant-specific transcription factors that contain a particular class of zinc-finger DNA-binding domain. Members of this family have been found to play diverse roles in gene regulation of processes restricted to the plants. The completed genome sequences of rice and Arabidopsis constitute a valuable resource for comparative genomic analyses, since they are representatives of the two major evolutionary lineages within the angiosperms. In this framework, the identification of phylogenetic relationships among Dof proteins in these species is a fundamental step to unravel functionality of new and yet uncharacterised genes belonging to this group.Entities:
Mesh:
Substances:
Year: 2003 PMID: 12877745 PMCID: PMC184357 DOI: 10.1186/1471-2148-3-17
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Rice Dof transcription factors
| Sequence Namea | Predicted Gene structured | Chomosomee | Groupf | |||
| OsDof-1 | P0505D12 | AAAA01001390.1 | ___Dof___ | (T) | 1 | c1 |
| OsDof-2 | P0453A06 | AAAA01021216.1 | _▼_Dof___ | (T) | 1 | d1 |
| OsDof-3 | P0001B06 | AAAA01003833.1 | ___Dof___ | (T) | 1 | d2 |
| OsDof-4 | P0038F12 | AAAA01000894.1 | _▼2_Dof___ | (T) | 1 | d1 |
| OsDof-5 | P0007F06 | AAAA01006178.1i | ___Dof___ | (T) | 1 | d1 |
| OsDof-6 | B1131G08 | AAAA01001653.1 | ___Dof___ | (R) | 1 | d1 |
| OsDof-7 | 49D11 | AAAA01000209.1 | _▼_Dof___ | (T) | 2 | c1 |
| OsDof-8 | B1121A12 | AAAA01004110.1 | _▼2_Dof_▼_ | (T) | 2 | a |
| OsDof-9 | P0657H12 | AAAA01000342.1 | ___Dof___ | (T) | 2 | c2 |
| OsDof-10 | OSJNBa0009N02 | AAAA01004826.1 | ___Dof_▼2_ | (T) | 2 | c2 |
| OsDof-11 | OSJNBa0010D22 | AAAA01014987.1 | ___Dof___ | (T) | 3 | a |
| OsDof-12 | OSJNBa0091P11 | AAAA01007635.1 | _▼_Dof___ | (T) | 3 | d1 |
| OsDof-13 | OSJNBa0063J18 | AAAA01000005.1 | ___Dof_▼_ | (T) | 3 | d3 |
| OsDof-14 | OSJNBa0002D01 | AAAA01000212.1 | _▼_Dof___ | (T) | 3 | b |
| OsDof-15 | OSJNBa0079B15 | AAAA01003692.1 | ___Dof___ | (T) | 3 | a |
| OsDof-16 | OJ1754_E06 | AAAA01000954.1 | ___Dof___ | (R) | 3 | b |
| OsDof-17 | OSJNBa0064G10 | AAAA01000011.1 | _▼_Dof___ | (T) | 4 | d2 |
| OsDof-18 | OSJNB0005N02 | AAAA01002346.1 | ___Dof_▼2_ | (T) | 4 | c2 |
| OsDof-19 | P0016H04 | AAAA01007236.1 | _▼_Dof___ | (T) | 5 | b |
| OsDof-20 | P0491D10 | AAAA01062012.1j | ___Dof_▼_ | (T) | 6 | a |
| OsDof-21 | P0407H12 | AAAA01008294.1 | ___Dof___ | (T) | 7 | b |
| OsDof-22 | P0483G08 | AAAA01004849.1 | ___Dof___ | (T) | 7 | b |
| OsDof-23 | OSJNBa0060O17 | AAAA01004298.1 | ___Dof___ | (R) | 7 | d1 |
| OsDof-24 | P0605H02 | AAAA01033718.1 | ___Dof___ | (R) | 8 | d3 |
| OsDof-25 | P0556A05 | AAAA01001409.1 | _▼2_Dof___ | (R) | 9 | d3 |
| OsDof-26 | OSJNBa0060A14 | AAAA01002383.1 | _▼3_Dof___ | (R) | 10 | d1 |
| OsDof-27 | OSJNBa0066I08 | AAAA01003749.1 | _▼2_Dof_▼2_ | (T) | 10 | d2 |
| OsDof-28 | OSJNBa0016C14 | AAAA01003609.1 | _▼_Dof___ | (T) | 12 | b |
| OsDof-29 | CL012178.39g | AAAA01002638.1 | ___Dof___ | (R) | c1 | |
| OsDof-30 | CL037947.70h | AAAA01068763.1 | ___Dof___ | (R) | d3 | |
a Sequence name designation is arbitrary. b O. sativa ssp. japonica BAC/PAC clone name at the rice TIGR db. c O. sativa ssp. indica contig name at the NCBI O. sativa BLAST page. d Gene structure prediction according to the rice TIGR db (T) or to the RiceGAAS system (R). Intron (▼) relative position respect to the Dof domain. Sub-index indicates number of introns. e Chromosome assignment according to O. sativa ssp. japonica at the rice TIGR db. f Group designation after the phylogenetic analysis displayed in Figure 2B. g O. sativa ssp. japonica contig name at the rice TMRI db. h O. sativa ssp. japonica contig (at TMRI) containing incomplete gene (lacking the Dof domain) i O. sativa ssp. indica contig containing incomplete gene (lacking the Dof domain) j O. sativa ssp. indica contig containing a misassembled gene.
Figure 2(A) Joined phylogenetic tree of the rice and Arabidopsis Dof gene families. The unrooted tree was inferred by the neighbor-joining method after the alignment of the Dof domain amino acid sequences of the 66 Arabidopsis and rice genes listed in Table 1 and 2 respectively. Arabidopsis (normal case) and rice genes (italics) are indicated at the end of solid and dotted branches, respectively. Bootstrapping values are indicated as percentages (when >50 %) along the branches. The resulting major clusters of orthologous genes (MCOG) are shown in different colors: Aa = orange, Bb = blue, Cc = red and Dd = green. Subscript numbers indicate the defined subgroups. The scale bar corresponds to 0.1 estimated amino acid substitution per site. Phylogenetic trees of rice (B) and Arabidopsis (C) Dof gene families. The unrooted trees were inferred by the neighbor-joining method after the alignment of the Dof domain amino acid sequences of the 30 rice and 36 Arabidopsis genes listed in Tables 1 and 2, respectively. The resulting groups are shown in different colors: A or a = orange, B or b = blue, C or c = red and D or d = green. Subscript numbers indicate the defined subgroups. The scale bar corresponds to 0.05 estimated amino acid substitutions per site. Genes belonging to the different groups are listed in Table 1 and 2.
Arabidopsis Dof transcription factors
| AGI codea | Predicted gene structureb | Chromosome | Groupc | Gene Named |
| At5g60850 | ___Dof___ | 5 | A | |
| At1g517001 | ___Dof___ | 1 | A | |
| At3g212701 | ___Dof___ | 3 | A | |
| At1g07640 | _▼_Dof___ | 1 | B1 | |
| At3g553702 | ___Dof_▼_ | 3 | B1 | |
| At2g375902 | _▼_Dof___ | 2 | B1 | |
| At2g28810 | _▼_Dof_▼_ | 2 | B1 | |
| At5g02460 | _▼_Dof___ | 5 | B1 | |
| At5g65590 | ___Dof___ | 5 | B2 | |
| At4g38000 | ___Dof___ | 4 | B2 | |
| At1g28310 | ___Dof___ | 1 | B2 | |
| At2g28510 | _▼_Dof___ | 2 | C1 | |
| At3g456103 | _▼_Dof___ | 3 | C1 | |
| At5g602003 | _▼_Dof___ | 5 | C1 | |
| At5g62940 | _▼_Dof___ | 5 | C1 | |
| At1g64620 | _▼_Dof___ | 1 | C2.1 | |
| At4g00940 | ___Dof___ | 4 | C2.1 | |
| At4g24060 | _▼_Dof___ | 4 | C2.1 | |
| At2g465904 | _▼_Dof___ | 2 | C2.1 | |
| At3g618504 | _▼_Dof_▼_ | 3 | C2.1 | |
| At3g52440 | ___Dof___ | 3 | C2.2 | |
| At1g21340 | ___Dof___ | 1 | C2.2 | |
| At4g210305 | ___Dof___ | 4 | C3 | |
| At4g210505 | ___Dof___ | 4 | C3 | |
| At4g210406 | ___Dof___ | 4 | C3 | |
| At4g210806 | ___Dof___ | 4 | C3 | |
| At2g341407 | ___Dof___ | 2 | D1 | |
| At1g291607 | ___Dof___ | 1 | D1 | |
| At3g475008 | _▼_Dof___ | 3 | D1 | |
| At5g624308 | _▼_Dof___ | 5 | D1 | |
| At5g39660 | _▼_Dof___ | 5 | D1 | |
| At1g267909 | _▼2_Dof___ | 1 | D1 | |
| At1g695709 | _▼_Dof___ | 1 | D1 | |
| At1g47650 | ___Dof___ | 1 | D2 | |
| At5g6694010 | ___Dof___ | 5 | D2 | |
| At3g5041010 | ___Dof___ | 3 | D2 |
a Names from the Arabidopsis Genome Initiative (AGI). Genes sharing the same superscript number were found in duplicated genomic regions according to the Redundancy Viewer (MIPS/MATDB). b Data from the At TIGR db. Intron (▼) relative position respect to the Dof domain. Sub-index indicates number of introns. c Group designation after the phylogenetic analysis displayed in Figure 2C. d Gene names originally utilized for publication or database registration.
Figure 1Dof domain sequence alignment of the annotated rice proteins. The four cysteine residues putatively responsible of the zinc-finger structure are indicated. Identical amino acids are highlighted in black. Gene names correspond to those listed in Table 1.
Figure 3Schematic distribution of conserved motifs among the defined gene clusters in Figure 2A. Motifs were identified by means of the MEME software using the complete amino acid sequences of the 66 rice and Arabidopsis Dof genes listed in Tables 1 and 2. Positions of the identified motifs are relative to the Dof domain. Multilevel consensus sequences for the MEME defined motifs are listed in Table 3.
Group and sub-group specific conserved motifs
| Motifa | Multilevel consensus sequenceb |
| 1 | |
| 2 | KGEGCLWVPKTLRIDDPDEAAKSSIWTTLGIK |
| 3 | DDPGIKLFGKTIPF |
| 4 | KALKKPDKILP |
| 5 | LQANPAALSRSQNFQE |
| 6 | PMDRLAFGDESFEQDYYDVGSDDLIVNPLIGGS |
| 7 | KRAKIDQPSVAQMVSVEIQPGNHQPFFNVQENNDFVGSF |
| 8 | MDNLNVFANEDNQVNDVKPPP |
| 9 | SPTLGKHSRDE |
| 10 | YHMNPVDQFKWNQSFNNAMNMNYNN |
| 11 | RVLWGFPWQM |
| 12 | ERKARPQKDQ |
| 13 | IDLALAYAKFLKHH |
| 14 | ATALKFASDSPLCESMASVLDIGEK |
| 15 | RLLFPFEDLKPLVS |
| 16 | HGGFRHDF |
| 17 | MVFSSVPAYLD |
| 18 | FYPVPAYWGC |
| 19 | KNPKLLHEGAQDLNLAFPHH |
| 20 | GHVDQIDSGREIW |
| 21 | VAAVGNHFGSLSEIHG |
| 22 | MMDSNSVLYSSLGFPTMPDYK |
| 23 | CFPGVPWPYTW |
| 24 | FGHRFHGPVRPDMILEGM |
| 25 | INVKPMEEI |
| 26 | IESLSCFNQDLHQKLQQQR |
| 27 | YWSGMI |
| 28 | WTDLAMNRAEK |
| 29 | LEQWRLPQIQQFPFFHAMDAM |
| 30 | WGCFSGWPNGAWNAPWI |
| 31 | EAGRRPAPQFAGVDLRRPKGY |
| 32 | NKGWPSSDHYLHITSEDCDN |
| 33 | HAAPIPATWQFEGLE |
| 34 | MELLRSTGCYM |
| 35 | MHPCHLEK |
| 36 | PIEFLGLPGNLQFW |
| 37 | DEEAKYDPFDSFPDDALSLHDCI |
a Numbers correspond to the motifs described in Figure 3. b Sequences obtained from the analysis of the 66 rice and Arabidopsis Dof complete proteins with the MEME system. c Dof consensus sequence (in italics). d Predicted nuclear localization signal, according to Park et al [50] (underlined).