| Literature DB >> 22776508 |
Hai Du1, Si-Si Yang, Zhe Liang, Bo-Run Feng, Lei Liu, Yu-Bi Huang, Yi-Xiong Tang.
Abstract
BACKGROUND: The MYB superfamily constitutes one of the most abundant groups of transcription factors described in plants. Nevertheless, their functions appear to be highly diverse and remain rather unclear. To date, no genome-wide characterization of this gene family has been conducted in a legume species. Here we report the first genome-wide analysis of the whole MYB superfamily in a legume species, soybean (Glycine max), including the gene structures, phylogeny, chromosome locations, conserved motifs, and expression patterns, as well as a comparative genomic analysis with Arabidopsis.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22776508 PMCID: PMC3462118 DOI: 10.1186/1471-2229-12-106
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1R2 and R3 MYB repeats are highly conserved across all R2R3-MYB proteins in the soybean genome. The sequence logos of the R2 (a) and R3 (b) MYB repeats are based on full-length alignments of all soybean R2R3-MYB domains. Multiple alignment analysis of 244 soybean-typical R2R3-MYB domains was performed with ClustalW (for full representation of the alignment, see Additional file 2). The bit score indicates the information content for each position in the sequence. Asterisks indicate the conserved tryptophan residues (Trp) in the MYB domain.
Figure 2Phylogenetic relationships and subgroup designations in MYB proteins from soybean (Gm),(At) and other plants. (a) Neighbor-joining tree representing relationships among 252 MYB proteins from soybean and 132 MYB proteins from Arabidopsis, including five 3R-MYB proteins from Arabidopsis and six 3R-MYB proteins from soybean. The proteins are clustered into 47 subgroups, which are designated with a subgroup number (e.g., C1) and marked with different alternating tones of a gray background to facilitate subfamily identification with high predictive value. The numbers beside the branches represent bootstrap support values (>50%) from 1000 replications. Sixteen proteins did not fit well into clusters. Colored circles indicate the corresponding intron distribution patterns, as shown in Figure 3. (b) Structure of MYB genes in soybean and Arabidopsis. Exon(s) are indicated by green boxes, MYB domain(s) by red boxes, untranslated region(s) by blue boxes, and spaces between the colored boxes correspond to introns. The sizes of exons and introns can be estimated using the horizontal scale bar. (c) Architecture of conserved protein motifs in 47 subfamilies. The motifs on the right were detected using MEME and are graphically represented as white boxes drawn to scale for a representative plant MYB protein of each subfamily. (d) Expression patterns of MYB genes in soybean and Arabidopsis in different organs. R, root; L, leaf; F, flower; S, seed; N, legume-specific nodulation. In this expression pattern analysis, the highest values among the expression values of the four organs published in the AtGenExpress and SoySeq databases were selected.
Figure 3Schematic diagram of intron distribution patterns within the MYB domains of soybean MYB (GmMYB) proteins. Alignment of MYB domains is representative of 14 intron patterns, designated a to n. Locations of introns are indicated by white triangles, and the number within each triangle indicates the splicing phases of the MYB domain sequences: 0 refers to phase 0; 1 to phase 1; and 2 to phase 2. The number of GmMYB proteins with each pattern is presented on the right. The correlation of intron distribution patterns and phylogenetic subfamilies is provided in Figure 2 and Additional file 3.
One-to-one orthologous relationships betweenand soybean
| 1 | AtMYB103 | 1 | GmMYB232 | 1 | 52 | AtMYB11 | 3 | GmMYB190 | 3 |
| 2 | AtMYB124 | 1 | GmMYB138 | 1 | 53 | AtMYB77 | 3 | GmMYB011 | 4 |
| 3 | AtMYB61 | 1 | GmMYB214 | 10 | 53 | AtMYB84 | 3 | GmMYB161 | 4 |
| 4 | AtMYB50 | 1 | GmMYB214 | 10 | 54 | AtMYB57 | 3 | GmMYB027 | 5 |
| 5 | AtMYB20 | 1 | GmMYB116 | 11 | 55 | AtMYB94 | 3 | GmMYB207 | 5 |
| 6 | AtMYB103 | 1 | GmMYB165 | 11 | 56 | AtMYB125 | 3 | GmMYB143 | 7 |
| 7 | AtMYB112 | 1 | GmMYB014 | 17 | 57 | AtMYB26 | 3 | GmMYB102 | 7 |
| 8 | AtMYB31 | 1 | GmMYB153 | 17 | 58 | AtMYB121 | 3 | GmMYB108 | 8 |
| 9 | AtMYB124 | 1 | GmMYB091 | 18 | 59 | AtMYB67 | 3 | GmMYB065 | 8 |
| 10 | AtMYB20 | 1 | GmMYB129 | 18 | 60 | AtMYB26 | 3 | GmMYB107 | 8 |
| 11 | AtMYB61 | 1 | GmMYB094 | 19 | 61 | AtMYB85 | 4 | GmMYB097 | 1 |
| 12 | AtMYB20 | 1 | GmMYB098 | 2 | 62 | AtMYB73 | 4 | GmMYB032 | 1 |
| 13 | AtMYB124 | 1 | GmMYB051 | 2 | 63 | AtMYB42 | 4 | GmMYB097 | 1 |
| 14 | AtMYB61 | 1 | GmMYB039 | 3 | 64 | AtMYB18 | 4 | GmMYB148 | 10 |
| 15 | AtMYB31 | 1 | GmMYB172 | 4 | 65 | AtMYB55 | 4 | GmMYB213 | 10 |
| 16 | AtMYB103 | 1 | GmMYB209 | 5 | 66 | AtMYB42 | 4 | GmMYB114 | 11 |
| 17 | AtMYB31 | 1 | GmMYB173 | 6 | 67 | AtMYB85 | 4 | GmMYB114 | 11 |
| 18 | AtMYB103 | 1 | GmMYB163 | 8 | 68 | AtMYB41 | 4 | GmMYB242 | 13 |
| 19 | AtMYB124 | 1 | GmMYB067 | 8 | 69 | AtMYB85 | 4 | GmMYB126 | 17 |
| 20 | AtMYB54 | 1 | GmMYB068 | 9 | 70 | AtMYB55 | 4 | GmMYB009 | 19 |
| 21 | AtMYB88 | 2 | GmMYB138 | 1 | 71 | AtMYB55 | 4 | GmMYB184 | 2 |
| 22 | AtMYB70 | 2 | GmMYB032 | 1 | 72 | AtMYB55 | 4 | GmMYB185 | 3 |
| 23 | AtMYB70 | 2 | GmMYB003 | 11 | 73 | AtMYB85 | 4 | GmMYB101 | 5 |
| 24 | AtMYB104 | 2 | GmMYB075 | 12 | 74 | AtMYB102 | 4 | GmMYB210 | 7 |
| 25 | AtMYB101 | 2 | GmMYB075 | 12 | 75 | AtMYB74 | 4 | GmMYB210 | 7 |
| 26 | AtMYB104 | 2 | GmMYB181 | 13 | 76 | AtMYB97 | 4 | GmMYB109 | 8 |
| 27 | AtMYB25 | 2 | GmMYB081 | 14 | 77 | AtMYB18 | 4 | GmMYB145 | 9 |
| 28 | AtMYB38 | 2 | GmMYB018 | 19 | 78 | AtMYB44 | 5 | GmMYB032 | 1 |
| 29 | AtMYB14 | 2 | GmMYB132 | 19 | 79 | AtMYB86 | 5 | GmMYB214 | 10 |
| 30 | AtMYB12 | 2 | GmMYB200 | 2 | 80 | AtMYB19 | 5 | GmMYB148 | 10 |
| 31 | AtMYB2 | 2 | GmMYB099 | 3 | 81 | AtMYB59 | 5 | GmMYB074 | 11 |
| 32 | AtMYB91 | 2 | GmMYB240 | 7 | 82 | AtMYB49 | 5 | GmMYB117 | 12 |
| 33 | AtMYB77 | 3 | GmMYB032 | 1 | 83 | AtMYB37 | 5 | GmMYB217 | 12 |
| 34 | AtMYB1 | 3 | GmMYB049 | 1 | 84 | AtMYB49 | 5 | GmMYB079 | 13 |
| 35 | AtMYB27 | 3 | GmMYB146 | 10 | 85 | AtMYB59 | 5 | GmMYB037 | 13 |
| 36 | AtMYB45 | 3 | GmMYB148 | 10 | 86 | AtMYB119 | 5 | GmMYB235 | 14 |
| 37 | AtMYB48 | 3 | GmMYB074 | 11 | 87 | AtMYB78 | 5 | GmMYB028 | 15 |
| 38 | AtMYB30 | 3 | GmMYB004 | 13 | 88 | AtMYB59 | 5 | GmMYB082 | 15 |
| 38 | AtMYB35 | 3 | GmMYB119 | 13 | 89 | AtMYB40 | 5 | GmMYB124 | 16 |
| 39 | AtMYB27 | 3 | GmMYB120 | 13 | 90 | AtMYB24 | 5 | GmMYB125 | 16 |
| 40 | AtMYB109 | 3 | GmMYB081 | 14 | 91 | AtMYB36 | 5 | GmMYB220 | 17 |
| 41 | AtMYB77 | 3 | GmMYB019 | 14 | 92 | AtMYB78 | 5 | GmMYB014 | 17 |
| 41 | AtMYB84 | 3 | GmMYB219 | 14 | 92 | AtMYB111 | 5 | GmMYB204 | 17 |
| 42 | AtMYB108 | 3 | GmMYB028 | 15 | 93 | AtMYB80 | 5 | GmMYB089 | 18 |
| 43 | AtMYB67 | 3 | GmMYB085 | 15 | 94 | AtMYB24 | 5 | GmMYB026 | 19 |
| 44 | AtMYB21 | 3 | GmMYB125 | 16 | 95 | AtMYB40 | 5 | GmMYB238 | 19 |
| 45 | AtMYB57 | 3 | GmMYB125 | 16 | 96 | AtMYB86 | 5 | GmMYB023 | 20 |
| 46 | AtMYB94 | 3 | GmMYB153 | 17 | 97 | AtMYB96 | 5 | GmMYB172 | 4 |
| 47 | AtMYB30 | 3 | GmMYB007 | 18 | 98 | AtMYB78 | 5 | GmMYB105 | 7 |
| 48 | AtMYB21 | 3 | GmMYB130 | 19 | 98 | AtMYB111 | 5 | GmMYB174 | 7 |
| 48 | AtMYB35 | 3 | GmMYB131 | 19 | 99 | AtMYB80 | 5 | GmMYB144 | 8 |
| 49 | AtMYB30 | 3 | GmMYB046 | 19 | 100 | AtMYB78 | 5 | GmMYB110 | 9 |
| 50 | AtMYB11 | 3 | GmMYB176 | 19 | 100 | AtMYB111 | 5 | GmMYB197 | 9 |
| 51 | AtMYB45 | 3 | GmMYB025 | 20 |
Figure 4Chromosomal locations, region duplication, and predicted cluster for soybean MYB genes. Chromosomal positions of the MYB genes in soybean are mapped on the basis of JGI soybean Genome version 7.0. The chromosome number is indicated above each chromosome. The scale is in megabases (Mb). The number below the chromosome name indicates the length. The phylogenetic category of each gene (Figure 2) is indicated by the subgroup number. Each pair of duplicated MYB genes is connected with a red line. Connecting lines mark the specific cases in which there is a strong correlation between duplicated genomic regions and the presence of MYB genes with closely related predicted amino acid sequences. Colored boxes indicate groups of gene clusters with paralogous and syntenic genes on the chromosomes. Yellow bars on the chromosomes and blue numbers beside the bars indicate the 24 predicted duplication regions. The green and white bars on the chromosomes indicate the centromeres and pericentromeres, respectively.
Group and sub-group specific conserved motifs
| 1 | 3.60E-146 | L[RK]MGIDPVTH[ST]PRLDLLD[IL]SSIL[NRS][SA]S[LFI][YG][NG] |
| 2 | 1.80E-44 | [AHY][ANS][NY][DS][SK]N[FY][DN]D[GD]M[DE]FW[YF]D[IV][FL][TAI][RKS][ST][GN][ED][SI]IEL[LPS]E |
| 3 | 7.40E-32 | [EP][ETD][INTV][MN][PAL][APV][IL]DE[SD]FWSE[VA][LAT][SIV][DS][DY][EN] |
| 4 | 8.10E-13 | [KD][PK][NP]xL[IL]EIP[FW][DE][LPS]D[HPY]D[FI]W[SK][FL][LI]D |
| 5 | 2.20E-17 | PKL[LF]F[SA]EWL |
| 6 | 1.00E-90 | [HIK][KV][AK][PS][SAE]S[PT]STRHM[AV]QWES[AV]R[LV]EAEARLS[RN]E |
| 7 | 2.80E-58 | [SY]W[NP][NT][NS]KT[DC][SP]D[HY]FL[RQ][IL]WNSEVG[EK][SA]FR[DMN][VI][HK][GK] |
| 8 | 1.90E-27 | D[TS][AS]L[QK][LH]LL[DH][FM]P[ID][NS][ND][DM] |
| 9 | 7.60E-95 | [RK]GQWERRLQTDI[NH]MAK[QKR]AL[SC]EALS[PL][DN]K[PA] |
| 10 | 8.00E-64 | [PT][TN][TP][TY][QP][ST][GMT][CTV]YASS[AT][DE]NIA[RK]LL[KQ][GN][WF]MK[ND][TS]PK |
| 11 | 4.40E-31 | [QD][GM][PS][FL]SL[FL]EKWL[FL]D[DE][QA][GAS][CGH]Q[EMQ] |
| 12 | 1.20E-145 | [KT]N[DE][AT]LLS[ST][DEL]G[QP]SK[DNT]AANLSHMAQWESARLEAEARL[VA]RESK[LI] |
| 13 | 8.30E-22 | [IT]T[SV]N[AV][GT][RV][SV][GQS][DQ]LESPTST[LV]S[FE][SN][EN]N[AV]P[MPS] |
| 14 | 2.00E-48 | [DN][LFY]YE[DE]N[KN]NYWN[SN]I[LF]NL[VM][ND][SY]S[PS] |
| 15 | 3.70E-12 | GIG[LP][EQ][KS]DNIRDPL[EV]PNLTSPSHS[IV]TSAKLLNKMATSLPHKV[HY]GLEA[AV]KAVFSKLME |
| 16 | 3.60E-19 | [FQ]VG[GS]CLNALSPFED[GK]PHI[ST]K[IT]IDSPS[GS]PTLMFN[HQ]VTTPSCFSEDNEDNWQCVSSNYVSF[KR][IN]CAA[NS]GG |
| 17 | 5.40E-33 | [LG][GM][AH]Q[APY][LV][VM]NPE[LFI]LKLA |
| 18 | 4.10E-12 | S[VE][LS][SV]TP[SL]S[ST]PTPLNS |
| 19 | 2.90E-37 | [YFIS][VNS][NG][SCG][SNT]TE[DE]ER[DE][STQ][YS][CYG][SD] |
| 20 | 4.20E-22 | [LQ]A[ELR][ALQ][QTV][EQ][LM]AKLQ[YCL][LF][QH][YQ] |
| 21 | 1.40E-12 | [NY][AGV][EG][DG][ANP][SC][ST][YI]W[PS][ED][LH]L[LF][ED] |
| 22 | 2.30E-18 | [EF][EL][MT]WGDFM[DY]EE |
| 23 | 8.20E-18 | KT[QL]VYLPKPIRVKA[ML][YS][LS][QC][RI][TP][DR] |
| 24 | 7.30E-15 | HG[TN]L[QE][QK]LYEEY[FL]QLLN[MV][DE][EHQ][GK][QP][FD]E[LR][ND]SFA[EQ]SLL |
| 25 | 6.90E-36 | [PYL][NG][HFNQ][DQT][NE][ND][LQV][QTE][TA][KN][PI][PK][VIM]QE[TA][LF]FS[SH][TK][CP]PLF[MI][FV]DT |
| 26 | 1.70E-35 | [FL][EG][SP][VN][NK][IM]G[VAL][EG]GD[FM][SYF][VL]P[PS]LE[SN][RV][ST][ICT][ET][SR][ND] |
| 27 | 1.90E-55 | [EIV][ELS]E[RP][CH][PH][DQ]LNL[ED]L[SRT]I[SG][PL]P[SRW]Q[PQ]Q |
| 28 | 5.80E-38 | ERCPDLNLEL[RT]ISPP[RW]QQQ |
| 29 | 3.90E-21 | CF[AV]C[SR][LF]G[LM]HN[SG][KM][DE]C[SR] |
| 30 | 3.20E-29 | [LF]DMK[GK]I[IM][ANS]LLE[DE][ND]NHRVP[SY]IS |
| 31 | 2.00E-25 | M[HY][APQ][NA][TC][ST][EP][ER][QK]GY[FL]Y[SN]M[FV][NI][PV][NF][CD] |
| 32 | 5.40E-12 | [WF][EG][LV][GMR][SK]SPYE[TN]RI[LS]DWI[AS]E[LI][QS][NT][GD][YQ][GS][ED][AKN][EN]L[ES][QE]D[CH][NS]S[TN][GS][TC][CS][NE][NY][YN] |
| 33 | 8.80E-28 | NYWS[MI][ED]DIWS[MS] |
| 34 | 1.00E-46 | W[TND][DV]E[NS][MFI][WLC][FL]LQQ |
| 35 | 2.00E-37 | [NS]SGLLD[AD]LLEEA[QEK]A[LM][SA]S[GS][KEG][NK]SK[KS][DR][KR]S |
| 36 | 1.70E-36 | GHRA[LW][AG][TD]HKKEAAWRL[SR]R[VL]ELQLESEK[TA][NC]R[RQ]REK[IM]EE[FI]EAK[IM]KAL[QR]EE[EQ][KL][AN]A[LM][GE][RK]IE[AG]EYREQL[ADV][AG]LRRDAE[NA]K[ED]QKLA |
| 37 | 6.80E-20 | [LV][FHY]RPV[AP]R[VLM][SG][AS]F[SN][VAI][YC][NHK] |
| 38 | 5.20E-19 | [CDY][CDGSY][ED][PQR][FIMQS][VI]P[SHQ][KQT]CG[HG]GC[CS] |
| 39 | 7.10E-26 | [EA][LF][IA][SA]I[AS][TN][DE][LI][NGS][NS][ILT]AW[IL][KR]SG[LK][ED] |
| 40 | 1.70E-55 | [ET]T[SA]SS[SE]DDP[PA]TSLSLSLPG[VAF]D |
| 41 | 1.90E-124 | F[SG][GA]EF[ML][AT]V[MV]QEMI[RK]KEVRSYM[SAE]x[ML][QE][QR][GN]NG |
| 42 | 3.40E-33 | [PKV][RG][PN][NS][IL]L[EQ][DN]YI[KR]S[IL][TNE][ILQ][NI] |
| 43 | 2.50E-150 | [RM][LV][SI]RVELQLESEK[AT][GN]RRREK[IM]EE[FI]EAKIKAL[QR]EE[EQ][TKL]AAL[DG]RIEAEYREQL[AD][AG]LRRDAE[NS]KEQKLAEQW[AD]AKHLR[LF]T[KR][FL]LEQ[LSV][GD][CT][RI] |
| 44 | 4.10E-109 | RIGMTPTRDGFSFSMTPKGTPLRD[AE]LHINEDMNMHDSTKLELQRQADMRRSLRSGLGSLPQPKNEYQIVM[PQ]PV[LP]EDAEEPEEKIEEDMSDRIAREKAEEEARQQALLRKRSKVLQRELPRPPTASLELIRNSLMRTD[GV]DKSSFVPPTSIEQADEMIRRELL[ST]LLEHDN[AG]KYPLD[DE]KVIKEKKKGAKRAVNGSAVPVIEDF[EQ]EDEMKEADKLIKEEALYLCAAMGHEDEPLDEFIEAHRTCLNDLMYFPTR |
| 45 | 4.10E-171 | A[QE]NNKIQGTFLKKDDPK[IV][NST]ALMQQAELLSSLA[LQ]KV[DN]A[ED]N[MT]DQS[LM]ENAWKVLQ[ED]FLN[RK][TS]KE[SN]D[IL][PF][RG][YQ][KG]IPD |
| 46 | 1.00E-48 | [LF]QLV[DE][LF]KDL[IL][EA]DL[KR]S[GS][NS]E[ED][GI]Q[AP][CS]WRQ[MPV][DE]L[YH |
| 47 | 1.80E-123 | [EK]FSSP[LI]QVTPLFRSLA[AD]GIPSPQFSESER[SN]FL[MRL]KTLG[MI]ES[PS]S[ILP][NC]PS[AV]N[PS]S[QK]PP[PL]CKR[AV]LL[IPH] |
| 48 | 4.90E-79 | GIDYNAEIPFEKR[PA]P[PA]GF[FY]D[VT][TA]DEDRP[VA][ED]Q[PV][QK]FPTTIEELEGKRR[VA]DVEA[QH]LRKQD[IV]A[KR]NKIAQRQDAP[SA]AIL[HQ]ANKLNDPE[TV]VRKRSKLMLPPPQISD[QH] |
| 49 | 3.50E-92 | KGDAIMMEAENLARLR[ED]SQTPLLGGENPELHPSDF[NST]GVTP[KR]KKEIQTPNPMLTPS[AM]TPG[GA]AGLTPRIG[ML]TP[TS]RDG[FS]SFSMTPKGTP[LF]RD[EA]LHINEDM[ND]MH |
| 50 | 6.20E-59 | I[ED]DF[DEQ]E[DN]E[ML][KQ]EADK[LM]IKEE[AG][LK][YF]LC[AV][AS]MGHE[DN][EK][PT]LD[ED]F[IV]EAH[RN]TC[LV]NDLMYFPTR[NS]AY[GE]LSSVAGN[MA][ED]K[LV][AT]A[LF]Q[NE]E[FM]ENVR |