| Literature DB >> 17397551 |
Frank Bedon1, Jacqueline Grima-Pettenati, John Mackay.
Abstract
BACKGROUND: Several members of the R2R3-MYB family of transcription factors act as regulators of lignin and phenylpropanoid metabolism during wood formation in angiosperm and gymnosperm plants. The angiosperm Arabidopsis has over one hundred R2R3-MYBs genes; however, only a few members of this family have been discovered in gymnosperms.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17397551 PMCID: PMC1851958 DOI: 10.1186/1471-2229-7-17
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Predicted lengths and C-terminal motifs of spruce MYB proteins
| Sg1 | Full length cDNA2 | DNA Binding Domain3 | C-terminal Domain3 | Motifs | Consensus sequences | Angio-Gymno4 | MEME E-value | Start motif5 | Ref. | |
| 4 | Pg MYB5 | -b- | 115 | 142 | F | LlsrGiDP(at)tHrp(li)n | 13/13-5/5 | 6.00e-14 | 1 | a) |
| G | e(re)cpdLNLel(cr)ispp | 13/13-4/5 | 3.31e-16 | 67 | a), b) | |||||
| Pg MYB10 | -a- | 115 | 95 | F | LlsrGiDP(at)tHrp(li)n | 13/13-5/5 | 6.52e-14 | 1 | a) | |
| G | e(re)cpdLNLel(cr)ispp | 13/13-4/5 | 4.32e-15 | 67 | a), b) | |||||
| Pg MYB13 | -b- | 116 | 80 | F | LlsrGiDP(at)tHrp(li)n | 13/13-5/5 | 1.18e-14 | 1 | a) | |
| G | e(re)cpdLNLel(cr)ispp | 13/13-4/5 | 5.43e-14 | 65 | a), b) | |||||
| 22 | Pg MYB6 | -a- | 115 | 235 | H | (cs)s(sv)DPpT(ls)LsLslPg | 7/7-14/14 | 2.02e-14 | 99 | d) |
| I | YlkaedaismmsaAv | 0/7-13/14 | 1.87e-13 | 141 | d) | |||||
| J | vmremvakEVrsYmn | 7/7-14/14 | 1.07e-17 | 188 | a), b), c) | |||||
| Pg MYB7 | -c- | 116 | 257 | K | egdyEVesrgLKRln | 0/7-13/14 | 1.34e-12 | 43 | d) | |
| H | (cs)s(sv)DPpT(ls)LsLslPg | 7/7-14/14 | 4.28e-13 | 113 | d) | |||||
| I | YlkaedaismmsaAv | 0/7-13/14 | 2.30e-10 | 161 | d) | |||||
| J | vmremvakEVrsYmn | 7/7-14/14 | 6.15e-16 | 206 | a), b), c) | |||||
| Pg MYB9 | -a- | 119 | 297 | K | egdyEVesrgLKRln | 0/7-13/14 | 6.03e-16 | 60 | d) | |
| P | hRQSAFksYesqktp | 0/7-11/14 | 1.19e-13 | 116 | d) | |||||
| H | (cs)s(sv)DPpT(ls)LsLslPg | 7/7-14/14 | 2.91e-13 | 144 | d) | |||||
| I | YlkaedaismmsaAv | 0/7-13/14 | 9.50e-16 | 205 | d) | |||||
| J | vmremvakEVrsYmn | 7/7-14/14 | 1.09e-16 | 256 | a), b), c) | |||||
| 8 | Pg MYB1 | -b- | 115 | 217 | A | lr(kq)mGiDP(lv)THkpl | 5/5-2/2 | 1.79e-18 | 1 | a) |
| 21 | Pg MYB3 | -c- | 130 | 177 | C | (fg)Re(rq)S(rs)(is)(rg)(kr)R | 4/5-2/2 | 4.69e-14 | 1 | d) |
| D | e(en)s(l)(vs)(pt)ffDfl(g)vG(cn) | 5/5-2/2 | 1.26e-13 | 35 | a), b) | |||||
| E | (cy)xi(sg)h(in)nh(v)q(sf)(jr)Kef | 3/5-2/2 | 4.76e-14 | 123 | d) | |||||
| 13 | Pg MYB8 | -c- | 115 | 411 | L | LrrGIDP(n)THkpl | 4/4-2/2 | 2.54e-17 | 1 | a) |
| M | VC(dv)(yk)(np)SIm(al)nPsm(yn) | 2/4-2/2 | 1.94e-18 | 199 | d) | |||||
| N | e(ye)(ae)vKWSEml | 2/4-2/2 | 6.45e-14 | 317 | d) | |||||
| O | (pk)D(fl)(hq)R(im)Aa(vs)(lf)(dg)q | 2/4-2/2 | 4.89e-15 | 399 | a) | |||||
| 9 | Pg MYB11 | -a- | 115 | 384 | Q | L(lv)kMGIDPvTHkp(k) | 6/6-1/1 | 4.08e-16 | 1 | a), b), c) |
| R | h(m)AQWEsARleAear | 6/6-1/1 | 3.10e-13 | 35 | a), b), c) | |||||
| S | (yc)eDnknYw(nd)silnlV | 4/6-1/1 | 6.79e-12 | 360 | c) | |||||
| 2 | Pg MYB12 | -a- | 115 | 254 | T | MdfW(fl)(dn)v(fl)(t) | 5/5-1/1 | 2.39e-09 | 237 | a) |
| nd | Pg MYB2 | -c- | 115 | 333 | B | (c)SylPPL(y)d(v) | 2/2-2/2 | 3.29e-13 | 249 | d) |
| Pg MYB4 | -c- | 120 | 214 | none | none | 0/3-0/2 | none | none | none |
Conserved amino acid regions were identified in angiosperm and gymnosperm C-terminal sequences by the use of MEME software (setting described in Methods). Motifs were detected among the sequences belonging to each phylogenetic clade comprised of at least one spruce MYB (Additional File 2). Sequences from Additional File 3 were used to identify more conifer members of the PgMYB6, 7, 9 clade. Within the consensus sequences, upper-case letters indicate amino acids found in all members of a subgroup, lower-case letters indicate amino acids conserved in more than 50% of the members, pairs of lower-case amino acid in brackets show the two most abundant amino acids present for 50% each and above, x indicates that no amino acid is conserved among the sequences.
1Sg, MYB subgroups identified by Kranz et al. [18].
2Source of full length cDNA sequence: -a-, full length cDNA clone identified from EST of Picea glauca database; b- partial cDNA clone identified from EST database of P. glauca, extended by RACE amplifications and finally amplified as a single clone by PCR with gene specific primers, and -c-, from non degenerates primers based on Pinus taeda MYB sequences and used on spruce cDNA followed by RACE amplifications.
3Lengths are expressed in amino acid (aa) residues.
4The number of MYB sequences, separately from angiosperm and gymnosperms, sharing the motif among all those used in each case.
5The position of the motif relative to the beginning of the C-terminal domain (5' end).
Ref: references for previously reported motifs, a) Kranz et al. [18], b) Stracke et al. [31] and c) Jiang et al. [30] and d) new motifs.
Figure 1Alignment of predicted MYB domain protein sequences from spruce and pine. Amino acid sequence alignments of the 21 conifer MYB R2R3 domains were obtained with Clustal W (see Methods) and then separated into three groups based on their homologies to the consensus R2R3-MYB DNA-binding domain (MYBR2R3-DBD, top panel), the bHLH protein-binding motif (bHLH motif, middle panel) or the Arabidopsis calmodulin-interaction motif (AtMYB2 CaMBD, bottom panel), as indicated. Black shading indicates identical amino acid residues and grey shading the similar residues that agree with the fraction sequence of 0,4 (BoxShade 3.21) and dashes indicate gaps. The numbers on the left and right indicate the amino acid position relative to the translation start codon. The boxes and dotted line above the sequences show the predicted helix and turn structures in the R2 and R3 regions of the MYB domain. Stars show positions of conserved tryptophan residues and black arrows indicate unusual amino acid residues compared to the consensus amino acid sequence of the MYB DNA-binding domains of several plant R2R3-MYB proteins described by Avila et al. [27]. The bHLH protein-binding motif ([DE]L × 2 [RK] × 3L × 6L × 3R) identified by Zimmerman et al. [28] and the calmodulin-interaction motif [29] are shown above the middle and bottom panels, respectively (major amino acids in upper-case, bold). Ia or Ib and II indicate the positions of the first and second introns, respectively (Ib is specific to PgMYB3). Accession numbers of the newly identified spruce and pine MYBs are listed in Methods. Pg, Picea glauca; Pt, Pinus taeda; Pm, Picea mariana; At, Arabidopsis thaliana.
Figure 2Phylogenetic tree of gymnosperm and angiosperm R2R3-MYB proteins. This neighbour-joining (1000 Bootstraps) tree was based on the Clustal W alignment of the complete coding sequences of 13 spruce and five pine MYB proteins identified in this study (represented by filled and empty lozenges, respectively). The bar indicates an evolutionary distance of 0.2%. Arabidopsis proteins were chosen as landmarks representing the three main groups (circles A, B and C) and subgroups (Sg next to bracket; nd, not determined) defined by Romero et al. [19] and Kranz et al. [18]. Human c-MYB [GenBank: P10242] and Mus musculus MmMYBA [GenBank: X82327] were not used as out groups but as landmarks. The accession numbers of the Arabidopsis genes are given in Methods. Other abbreviations are in Figure 1.
Pair-wise sequence amino acids identities of the DBD and full CDS of closest spruce and pine homologs
| DNA Binding Domains | Full coding sequences | |||
| Amino acids percentage | Identity | Similarity | Identity | Similarity |
| PgMYB1/PtMYB1 | 99,1 | 100 | 87,1 | 91,3 |
| PgMYB2/PtMYB2 | 100 | 100 | 88 | 91,6 |
| PgMYB3/PtMYB3 | 94,6 | 95,4 | 79,2 | 82,3 |
| PgMYB4/PtMYB4 | 96,6 | 97,5 | 84,1 | 88,6 |
| PgMYB7/PtMYB7 | 94,8 | 96,5 | 90,4 | 93,6 |
| PgMYB8/PtMYB8 | 98,3 | 100 | 93,1 | 95,5 |
| PgMYB5/PtMYB14 | 88 | 94,8 | 67,3 | 77,4 |
| PgMYB10/PtMYB14 | 87,8 | 95,6 | 64,3 | 73 |
| PgMYB13/PtMYB14 | 82,7 | 92,2 | 60 | 70 |
Percent similarity calculations were performed using pairwise optimal alignment with Bioedit software (Clustal W, matrix blosum62).
Length of spruce MYB coding sequences and introns with their predicted splice junctions
| Length (bp) | Intron I (phase 1) | Intron II (phase 2) | |||||
| Coding sequence | Intron I | Intron II | 5'Splice site | 3'Splice site | 5'Splice site | 3'Splice site | |
| 999 | 83 | 101 | CCG: | TTGC | TAG: | CACC | |
| 1347 | 501 | 88 | CAG: | TGAC | CAG: | ||
| 924 | 1427 | none | CAG: | none | none | ||
| 1005 | 194 | 267 | CTG: | GTAC | CAG: | ||
| 774 | 191 | 186 | CAG: | TTGC | |||
| 1053 | none | none | none | none | none | none | |
| 1122 | none | none | none | none | none | none | |
| 1581 | 97 | 90 | CTG: | TCGC | CAG: | ACAC | |
| 1251 | none | none | none | none | none | none | |
| 633 | 94 | 187 | |||||
| 1500 | 644 | 132 | CAG: | CAA: | TTAC | ||
| 1110 | 94 | 294 | CAG: | TTGC | CAG: | ATGT | |
| 591 | 94 | 139 | |||||
Coding sequences indicate the length in nucleotides from the translation start codon to the stop codon. Introns I and II represent the first and second introns, respectively. Intron phase refers to the position in a codon where the intron is inserted: after the first nucleotide (phase 1) or after the second nucleotide (phase 2) of a codon. Italic nucleotide pairs GT and AG represent the beginning and the end of the introns, respectively; and underlined nucleotides are conserved splice-site sequences.
Figure 3Transcript abundance for 13 spruce . Transcript abundance was determined by Q-RTPCR of six tissues from two different 33-year-old trees (number of molecules per ng of total RNA, see methods). The transcript level of an elongation factor (EF1-α) gene was used as an RNA control. N, needles; Stem tissues: P, periderm; Ph, differentiating phloem; X, differentiating xylem; Root tissues: PPh, root periderm with differentiating phloem; X, root differentiating xylem. Data are based on three technical repetitions per tree, i.e. six measurements per data point. Vertical bars represent the standard error. 4CL: 4-coumarate: CoA ligase. NS, no PCR product detected.
Figure 4Transcript abundance for . Transcript abundance was determined as in Figure 4 for, a) 13 spruce MYB genes, and b) five cell-wall-related genes in differentiating secondary xylem from stem and in the elongating terminal leader (apical stem) from 3-year-old spruce seedlings. The standard error (bars) was calculated from three biological replicates and two independent technical repetitions (i.e. six independent measurements). PAL, phenylalanine ammonia lyase; 4CL, 4-coumarate: CoA ligase; CCOaOMT, caffeoyl-CoA 3-O-methyltransferase; AGP, arabinogalactan protein; CAD, cinnamyl alcohol dehydrogenase. NS, no PCR product detected.
Figure 5Transcript accumulation for . a) Compression wood and opposite wood formed in a leaning spruce seedling after 21 days of treatment, compared to the control from vertical seedling. Exposed wood (compression wood is light brown) and wood cross-sections (10 μm thick) were stained by the safranin-orange procedure [53] (magnification, ×40). Steady-state mRNA levels were determined as in Figures 4 and 5 for cell-wall-related genes (b) and for several PgMYB genes (c) in the compression wood (left panels) and opposite side wood (right panels) of spruce seedlings leaning at a 45° angle from vertical. Continuous lines indicate genes with significant variation, and standard error bars are shown three trees (biological replicates) with two independent technical repetitions). Discontinuous lines indicate examples of gene transcripts that do not fluctuate in abundance. The zero time point represents vertical control trees only. PgMYB4 (1/15) means that mRNA level is divided by 15.