| Literature DB >> 27596925 |
Nicolas Rispail1, Diego Rubiales1.
Abstract
MLO proteins are highly conserved proteins with seven trans-membrane domains. Specific MLO genes have been linked to plant disease susceptibility. Others are involved in plant reproduction and in root thigmomorphogenesis. Functions of the remaining MLOs are still unknown. Here we performed a genome-wide survey of the MLO family in eight legume species from different clades of the Papillionoideae sub-family. A total of 118 MLO sequences were identified and characterized. Their deduced protein sequences shared the characteristics of MLO proteins. The total number of MLO genes per legume species varied from 13 to 20 depending on the species. Legume MLOs were evenly distributed over their genomes and tended to localize within syntenic blocks conserved across legume genomes. Phylogenetic analysis indicated that these sequences clustered in seven well-defined clades. Comparison of MLO protein sequences revealed 34 clade-specific motifs in the variable regions of the proteins. Comparative analyses of the MLO family between legume species also uncovered several evolutionary differences between the tropical legume species from the Phaseoloid clades and the other legume species. Altogether, this study provides interesting new features on the evolution of the MLO family. It also provides valuable clues to identify additional MLO genes from non-sequenced species.Entities:
Mesh:
Year: 2016 PMID: 27596925 PMCID: PMC5011691 DOI: 10.1038/srep32673
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
MLO family members of legume and non-legume species and their phylogenetic classificationa.
| Plant species | Common name | Total | Clade | Reference | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | ||||
| Thale cress | 15 | 3 | 3 | 5 | 0 | 3 | 1 | 0 | Chen | |
| Grapevine | 14 | 3 | 3 | 2 | 1 | 4 | 2 | 2 | Feechan | |
| Cucumber | 13 | 4 | 2 | 3 | 0 | 3 | 1 | 0 | Zhou | |
| Tomato | 15 | 3 | 3 | 3 | 0 | 4 | 1 | 1 | Chen | |
| Barley | 11 | 2 | 7 | 1 | 1 | 0 | 0 | 0 | Kusch | |
| Pepper | 2 | 2 | Kim and Huang | |||||||
| Barrel medic | 14 | 3 | 4 | 3 | 1 | 2 | 0 | 1 | This study | |
| Chickpea | 13 | 3 | 3 | 3 | 1 | 2 | 0 | 1 | This study | |
| Pea | 4 | 1 | 1 | 1 | 1 | This study | ||||
| Narrow-leaf lupin | 15 | 3 | 3 | 6 | 0 | 3 | 0 | 0 | This study | |
| Peanut | 14 | 4 | 4 | 2 | 1 | 2 | 0 | 1 | This study | |
| Soybean | 31 | 5 | 5 | 6 | 2 | 9 | 2 | 2 | Deshmukh | |
| Pigeonpea | 20 | 4 | 3 | 3 | 1 | 6 | 1 | 2 | This study | |
| Common bean | 19 | 4 | 3 | 3 | 1 | 6 | 1 | 1 | This study | |
| Mungbean | 18 | 3 | 3 | 3 | 1 | 6 | 1 | 1 | This study | |
aMLO classification is based on Neighbor-Joining phylogenetic analysis and literature.
bActualized G. max MLO sequences after removing partial sequences and those classified by Genbank as obsolete sequences.
Legume genomic databases used in this study.
| Species | Common name | Legume clade | Depository (Bioproject) | Version | Web address | Reference |
|---|---|---|---|---|---|---|
| Barrel medic | Galegoid | JCVI | v.4 | Young | ||
| Chickpea | Galegoid | NCBI (PRJNA190909) | v.1 | Varshney | ||
| GigaDB | ||||||
| Narrow-leaf lupin | Genistoid | NCBI (PRJNA179231) | Draft | Yang | ||
| Peanut | Dalbergioid | PeanutBase | v.1 | Bertioli | ||
| Pigeonpea | Phaseoloid | GigaGB | v.5 | Varshney | ||
| Common bean | Phaseoloid | Phytozome | v.1 | Schmut | ||
| Mungbean | Phaseoloid | NCBI (PRJNA243847) | v.1 | Kang |
Figure 1Domain organization of legume MLOs.
The figure represents the domain organization of all MLO protein sequences isolated from (a) Genistoid and Galegoid legumes and (b) Dalbergioid and Phaseoloid legumes. Sequences are drawn to scale with the IBS server51 following Interproscan 529, TMHMM27 and SignalP52 predictions (Supplementary Table S2).
Conserved motifs common to all sequences as detected by MEME software.
| Motif | Sequence | e-value | N° sequence |
|---|---|---|---|
| 1 | RSL[EDA]ETPTWAVAVVC[TF]V[FIL][VLI][AL][IV]S | 6.4e−1500 | 208/212 |
| 2 | [LAI][ILV]E[RHK][SI]H[KR][LI]GKWLKKK[HN]KKAL[LYF] | 1.6e−997 | 205/212 |
| 3 | E[AS]LEK[IV]K[EA]ELMLLGFISLLLT[VF] | 1.6e−1654 | 209/212 |
| 4 | [SA][KR]IC[IV][PS][ES][KS]VA[DNS][ST][MW][LHF]PC | 1.4e−821 | 193/212 |
| 5 | P[LF][VLI]S[VY]EG[LI][HE]QLH[IR]FIF[VF]LA[VI][FT]H[VI]L[YF][SC][VIA][LI]T[MVL][LA]L[GA]R[AL]K[IM]R[RS] | 3.0e−2726 | 211/212 |
| 6 | WK[ARK]WE[EAD]ET[KS][TS][LH]EY[QE]F[ASY][NH]DP[ES]RFR[FL][AT][RH][EDQ]T[ST]F[GV]RRHLS | 2.5e−1389 | 127/212 |
| 7 | [CS]FFRQF[YF][GR]SVT[KR][VA]DYL[TA]LR[HL]GF | 1.2e−1793 | 212/212 |
| 8 | KF[DN]F[HQ]KY[IM]KRS[LM]E[DE]DFKV[VI]VG[IV]S[PW]PLW[FA][FS][VA]V | 2.9e−2348 | 210/212 |
| 9 | N[IVT][HN]GW[HYN][TS]YFW[LI][SPA]FIP[LV][IV][IL][ILV]L[LA]VGTKL[QE][HV][IV]I[TA] | 1.9e−2051 | 211/212 |
| 10 | M[AG]L[ERD]I[QAT][ED][RK][HG][AE]V[VI][KQ]GI[PL][LV]V[QE]P[SG]D | 1.6e−956 | 156/212 |
| 11 | FWF[NG][RK]P[RQ]L[VL]L[FH]LIH[FL][IV]LFQNAF | 1.7e−1699 | 212/212 |
| 12 | [AT][FY]F[FL]W[TIS]W[YW][EQT][FY]GFDSC[FI] | 3.8e−946 | 207/212 |
| 13 | R[LV][AI][LM]GV[FA][VI]Q[VF]LCSY[VSI]TLPLYA[LI]VTQMG[ST][TR]MK | 7.8e−2592 | 212/212 |
| 14 | K[ATS]IF[DN]E[QR][VT][ARS]KALK[KNG]WHK[TA][AV]KKKxKHKKxGSS | 2.2e−1128 | 189/212 |
Conservation of previously identified invariable amino acid residues33 in legume MLO sequences at species and clade level.
| Barley Residue | Legume species | MLO clade | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Mt | Ca | La | Ar | Cc | Pv | Vr | Gm | Ia | Ib | II | III | IV | V | VI | VII | |
| E35 | E/N | E/H | E/Q | E/H/D | E/Q/D | E/Q/D | E/Q | E/Q | Q/H/N | E | E | E | E | E/D | E | E |
| M65 | M | M | M | M/I | M | M | M | M | M | M | M | M | M | M/I | M | M |
| G68 | G | G | G | G | G | G | G | G | G | G | G | G | G | G | G | G |
| S71 | S | S | S | S | S | S | S | S | S | S | S | S | S | S | S | S |
| L74 | L/M | L/M | L/M | L/M | L/M | L/M | L/M | L/M | M | L | L/M | L | L/I | L | L | L |
| C86 | C | C | C | C | C | C | C | C | C | C | C | C | C | C | C | C |
| C98 | C | C | C | C | C | C | C | C | C | C | C | C | C | C | C | C |
| C114 | C | C | C | C | C | C | C | C | C | C | C | C | C | C | C | C |
| F135 | F/S | F/S | F | F/S | F/S | F/S | F/S | F/S | F | F | F | F | F | F | F | S |
| W158 | W | W | W | W | W | W | W | W | W | W | W | W | W | W | W | W |
| E163 | E | E | E/Q | E | E/D | E/D | E/D | E/D | E | E | E | E/Q | E | E | D | E |
| F207 | F | F | F | F | F/L | F | F | F | F | F | F | F | F | F | F | F/L |
| Q210 | Q | Q | Q | Q | Q | Q | Q | Q | Q | Q | Q | Q | Q | Q | Q | Q |
| Y220 | Y | Y | Y/F | Y | Y/F | Y | Y | Y | Y | Y | Y | Y/F | Y | Y | Y | Y |
| R224 | R | R | R | R | R | R | R | R | R | R | R | R | R | R | R | R |
| F227 | F | F | F | F | F | F | F | F | F | F | F | F | F | F | F | F |
| F240 | F | F | F | F | F | F | F | F | F | F | F | F | F | F | F | F |
| Y243 | Y/F | Y/F | Y | Y/F | Y/F | Y/F | Y/F | Y/F | Y | Y | Y | Y | Y | Y | Y | F |
| W263 | W | W | W | W | W | W | W | W | W | W | W | W | W | W | W | W |
| P287 | P | P | P | P | P | P | P | P | P | P | P | P | P | P | P | P |
| F329 | F | F | F | F | F | F | F | F | F | F | F | F | F | F | F | F |
| W330 | W | W | W | W | W | W | W | W | W | W | W | W | W | W | W | W |
| P334 | P | P | P | P | P | P | P | P | P | P | P | P | P | P | P | P |
| F346 | F/I | F/I | F | F/L | F/I | F/I | F/I | F/I | F | F | F | F | F | F | F | I/L |
| N348 | N | N | N/I | N/T | N | N | N | N | N | N | N/I | N | N | N | N | N/T |
| F350 | F | F | F/I | F | F | F | F | F | F | F | F/I | F | F | F | F | F |
| C367 | C | C | C | C | C | C | C | C | C | C | C | C | C | C | C | C |
| T393 | T | T | T | T | T | T | T | T | T | T | T | T | T | T | T | T |
| P395 | P | P | P | P | P | P | P | P | P | P | P | P | P | P | P | P |
| W423 | W/L | W | W | W | W | W | W | W | W | W | W | W | W | W | W | W |
Figure 2Phylogenetic relationship of legume MLOs.
The phylogenetic relationship of legume MLO protein sequences was estimated with the Maximum likelihood (ML) method with MEGA648 software with 1,000 bootstrap independent replicates. The tree was drawn to scale, with branch lengths measured as the number of substitutions per site. Number on a node indicates the percentage of bootstrap when higher than 75%.
Figure 3Motif organization of legume MLOs.
The figure shows the predicted topology of a typical MLO protein (a) and the schematic organization of the common and specific motifs for each MLO clade (b). Common and clade-specific motifs are represented by white and colored boxes, respectively. These motifs were identified by scanning the MLO sequences with the MEME suite software30 (Supplemental Table S1). Common and clade-specific amino acid motifs are listed in Tables 3 and 4 respectively. Localization of transmembrane domains is shown as dashed horizontal lines.
Clade-specific motifs in MLO sequences as detected by MEME software.
| Clade | Motif | Sequence | e-value | N° Sequence |
|---|---|---|---|---|
| I | 1 | [DN][SG][LN]S[QE][IS][TK][RKS][ES][LIK][TR][ML]RR[QL][ST]TF[VI][FK][HS]H[TA]S[HN]P[WL]S[RHK][NH][KSP] | 1.2e−405 | 42/212 |
| I | 2 | I[TM][NE]HNL[PS]L[KTS] | 1.1e−100 | 29/212 |
| I | 3 | [TK]LA[LV]E[NI]A[GE][IQR][TC][GP][FP][FM][SKP][EPR][AHT][KQ][LFV][RKN][PL]RDELFWF[KNG]KP[ERD] | 8.5e−565 | 40/212 |
| I | 4 | TRS[EQ]ID[EK][EQ][MI]E[ED]NGSE[EG]RKLL[MT]A | 4.1e−151 | 18/212 |
| I | 5 | [YA][PY][HR][LV][IF][RG]RML[ND]G[IM]NR[SN][ST] | 3.9e−082 | 13/212 |
| I | 6 | TIHTDTSTVLSIEEDDQLID[DAT]PE | 1.2e−161 | 16/212 |
| I | 7 | [AT]VT[SA]TPSPIANETSSRA[VA]TPLLRPSASISS[SV][HQV][PCS][SF]S | 1.0e−190 | 14/212 |
| II | 8 | G[AL][RK]IR[QE]WKHWEDSIAK[QE]NYETx | 2.1e−334 | 50/212 |
| II | 9 | [RP]VL[KE]P[KT]VT[HN]V[HQ]QH[ADE]FI | 8.3e−146 | 26/212 |
| II | 10 | GIQLGS[VI]F[RKQ][KR][AR][SA][AS][PA][EP][DE] | 3.7e−126 | 22/212 |
| II | 11 | [AKN][KR][KR][KR][GL]L[KRS][AG]D[SGN][NQ][SHP][SGQ][HS][GC]S | 8.1e−111 | 28/212 |
| III | 12 | [EG][EG]EH[RH]R[KR]LLSYERR[YF]L[AS][AG][DG][GTA][TG][SG] | 8.1e−314 | 33/212 |
| III | 13 | [DGS]ST[VI]HSSGPTLHR[FY]KTTGHSTR | 7.1e−258 | 26/212 |
| III | 14 | Y[DE]D[QD]D[DE]Y[HEQ]SDIE | 4.5e−079 | 18/212 |
| III | 15 | [PQ]T[AT][SNT][LI][IV][VT]RVD[HN][GD][ED]Q[QE][AQ]EE[EN]E[HE]H | 3.0e−144 | 26/212 |
| IV | 16 | [FD][DE][DE]N[MLV][EV]WRRVLA | 7.3e−076 | 11/212 |
| IV | 17 | A[AS][SG]G[GD]DYCS[QN]KGKV[PS]LISQSGV | 9.5e−091 | 9/212 |
| IV | 18 | SGE[TA]TPSQGTSP[LI]HLL[HQ]K[YF]KPS | 1.2e−099 | 12/212 |
| IV | 19 | [HQ]TDTDSVLYSPRSYQSD | 1.1e−080 | 8/212 |
| IV | 20 | TD[LF]S[DE]TEGS[ST]HQLN[EL]I[TQ][QI][TM][HS]Q[PA] | 3.4e−066 | 8/212 |
| IV | 21 | P[RN]N[GQ][EL]THNI[DE]FSF[VD][KS]P | 2.5e−044 | 8/212 |
| V | 22 | LA[TAG][GK]GYDKC | 8.5e−169 | 28/212 |
| V | 23 | FW[TS][QK][SN][PT][IV][LS][LV]WIV | 2.7e−203 | 50/212 |
| V | 24 | FHSTT[EA]D[VI]VIR[IL] | 2.4e−198 | 51/212 |
| V | 25 | STTPFSSRP[ST]TPTHGMSP[VA]HLL | 4.4e−360 | 30/212 |
| V | 26 | [APR][GRS][RHE]SDS[AFP][QP]TSPR[TAR]SNY[ED]NEQWD | 3.0e−275 | 50/212 |
| V | 27 | P[ITV][SR][SHT][QE][HIL]EI[NR][IV][SA][SL][SK][ED]FSF[EDG][KR][RG][HP][THI] | 9.7e−142 | 24/212 |
| VI | 28 | [SC]KALAK[IM]L[KR]QWH[VL]EVRERR | 1.4e−131 | 5/212 |
| VI | 29 | [QN][RELQ][KE][LQ][VL]KSFSF[RS][HR] | 1.3e−009 | 5/212 |
| VI | 30 | MSSEWSQGNKSAP[ED]FSSTL[CR]E[SN][IANT]RSSDEGEIVEELEH[MPRS][VDEM]KTKA[SCNT]SSSDPP | 2.3e−103 | 5/212 |
| VII | 31 | N[PG]KIITRG[TI]YDGEISFGS[SY][WV][KG][NS] | 9.5e−100 | 10/212 |
| VII | 32 | SSRGI[GR]EI[GV]SI[TAI]EE[DE]D | 2.7e−046 | 9/212 |
| VII | 33 | AT[RH]TSTS[EGQ][LF]D[VI]A[PH]ATN[EQ]S[TAEN][IV]E[VF] | 8.2e−038 | 5/212 |
| VII | 34 | NNSTSSKHSDSLHSK[EG][GC]DNS[AV]RG[ST][VM]DSVH[TN]PDNV[VA][LV]T[SN][NP]P[SF][PH] | 1.8e−071 | 5/212 |