| Literature DB >> 26891798 |
Panagiotis F Sarris1,2, Volkan Cevik1, Gulay Dagdas1, Jonathan D G Jones1, Ksenia V Krasileva3,4.
Abstract
BACKGROUND: Plants deploy immune receptors to detect pathogen-derived molecules and initiate defense responses. Intracellular plant immune receptors called nucleotide-binding leucine-rich repeat (NLR) proteins contain a central nucleotide-binding (NB) domain followed by a series of leucine-rich repeats (LRRs), and are key initiators of plant defense responses. However, recent studies demonstrated that NLRs with non-canonical domain architectures play an important role in plant immunity. These composite immune receptors are thought to arise from fusions between NLRs and additional domains that serve as "baits" for the pathogen-derived effector proteins, thus enabling pathogen recognition. Several names have been proposed to describe these proteins, including "integrated decoys" and "integrated sensors". We adopt and argue for "integrated domains" or NLR-IDs, which describes the product of the fusion without assigning a universal mode of action.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26891798 PMCID: PMC4759884 DOI: 10.1186/s12915-016-0228-7
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Phylogeny of the plant species and corresponding number of detected NLR and NLR-fusion proteins. The phylogeny of the plants used in the analyses was constructed using their corresponding NCBI taxon identification numbers. For the summary of NLR-IDs detected in each species, see Table 1 and Additional files 2 and 3. Annotation of all domains in NB-ARC-containing proteins and NLR-IDs and corresponding FASTA sequences are included in Additional files 4, 5, 6 and 7. NB: NB-ARC domain-containing proteins; NB-ID: NB-ARC plus any other canonical domains together with non-canonical domains; NLR: subset of NB with clearly identified LRRs; TLR: TIR-NB-ARC-LRR proteins; TN: TIR-NB-ARC proteins; T2N: TIR2-NB-ARC proteins
Most prevalent integrated domains in flowering plants
| Integrated domaina | Species | Families | Domain description |
|---|---|---|---|
| Pkinase |
|
| Protein kinase domain |
| DUF3542 |
|
| Protein of unknown function (DUF3542) |
| Pkinase_Tyr |
|
| Protein tyrosine kinase |
| WRKY |
|
| WRKY DNA-binding domain |
| RVT_3 |
|
| Reverse transcriptase-like |
| WD40 |
|
| WD domain, G-beta repeat |
| zf-BED |
|
| BED zinc finger |
| B3 |
|
| B3 DNA-binding domain |
| NAM |
|
| No apical meristem (NAM) protein |
| DUF761 |
|
| Cotton fiber-expressed protein |
| UBN2 |
|
| Gag-polypeptide of LTR copia-type |
| HMA |
|
| Heavy metal-associated domain |
| Thioredoxin |
|
| Thioredoxin |
| VQ |
|
| VQ motif |
| LIM |
|
| LIM domain |
| zf-RVT |
|
| Zinc-binding in reverse transcriptase |
| C1_2 |
|
| C1 domain |
| DUF4219 |
|
| Domain of unknown function (DUF4219) |
| EF_hand_5 |
|
| EF-hand domain pair |
| Myb_DNA-binding |
|
| Myb-like DNA-binding domain |
| Peptidase_C48 |
|
| Ulp1 protease family, C-terminal catalytic domain |
| gag_pre-integrs |
|
| GAG-pre-integrase domain |
| rve |
|
| Integrase core domain |
| Jacalin |
|
| Jacalin-like lectin domain |
| DUF3633 |
|
| Protein of unknown function (DUF3633) |
| FNIP |
|
| FNIP repeat |
| Kelch_1 |
|
| Kelch motif |
| PP2C |
|
| Protein phosphatase 2C |
| AvrRpt-cleavage |
|
| Cleavage site for pathogenic type III effector avirulence factor Avr |
| CBFB_NFYA |
|
| CCAAT-binding transcription factor (CBF-B/NF-YA) subunit B |
| DUF4283 |
|
| Domain of unknown function (DUF4283) |
| F-box |
|
| F-box domain |
| Glutaredoxin |
|
| Glutaredoxin |
| PP2 |
|
| Phloem protein 2 |
| PPR_2 |
|
| PPR repeat family |
| PRK |
|
| Phosphoribulokinase/uridine kinase family |
| U-box |
|
| U-box domain |
| UBN2_3 |
|
| Gag-polypeptide of LTR copia-type |
| Abhydrolase_6 |
|
| Alpha/beta hydrolase family |
| B_lectin |
|
| D-mannose binding lectin |
| C1_3 |
|
| C1-like domain |
| Cyclin_C |
|
| Cyclin, C-terminal domain |
| Cyclin_N |
|
| Cyclin, N-terminal domain |
| DUF247 |
|
| Plant protein of unknown function |
| FBD |
|
| FBD |
| Myb_DNA-bind_3 |
|
| Myb/SANT-like DNA-binding domain |
| PA |
|
| PA domain |
| PAH |
|
| Paired amphipathic helix repeat |
| PARP |
|
| Poly(ADP-ribose) polymerase catalytic domain |
| PPR_1 |
|
| PPR repeat |
| PTEN_C2 |
|
| C2 domain of PTEN tumour suppressor protein |
| Proteasome_A_N |
|
| Proteasome subunit A N-terminal signature |
| RVT_2 |
|
| Reverse transcriptase (RNA-dependent DNA polymerase) |
| S_locus_glycop |
|
| S-locus glycoprotein family |
| Sugar_tr |
|
| Sugar (and other) transporter |
| TPR_11 |
|
| TPR repeat |
| TPR_12 |
|
| Tetratricopeptide repeat |
| UPF0114 |
|
| Uncharacterized protein family, UPF0114 |
| XH |
|
| XH domain |
| zf-CCHC_4 |
|
| Zinc knuckle |
| zf-RING_2 |
|
| Ring finger domain |
aIntegrated domains present across at least two plant families. Additional file 3 contains the full list of integrated domains. Additional file 6 contains list of domains for each protein
Fig. 2Word cloud analysis of the putative sensor domains found in fusion to NLRs. The word cloud represents relative abundance of different domains found in fusion. To correct for biases of sequencing particular plant families, word cloud was constructed on the plant family level (list of all domains occurring in NLR-ID in each plant family)
Fig. 3Overlap between IDs and domains present in host targets of plant pathogens. a An overlap between NLR-IDs from this study and functional domains present in the published Arabidopsis “effector interactome”. b Two examples of NLRs and their guardees, RIN4 and Exo70, that are known to be targeted by effectors in rice and Arabidopsis and which are found as fusions in other plant species
Pathogenic effectors, their previously identified interacting Arabidopsis proteins and corresponding domains that were also detected in NLR-IDs
| Effector | Interacting Arabidopsis gene | Common domain with NLR-ID | Domain description |
|---|---|---|---|
| OEC55 | AT1G04690 | Aldo_ket_red | Aldo/keto reductase family |
| ATR1_group | AT4G00980 | DUF4219 | Domain of unknown function (DUF4219) |
| ATR1_group | AT4G00980 | UBN2_2 | Gag-polypeptide of LTR copia-type |
| ATR1_group | AT4G00980 | zf-CCHC_4 | Zinc knuckle |
| ATR13_group | AT5G66560 | BTB | BTB/POZ domain |
| ATR13_group | AT5G52750 | HMA | Heavy metal-associated domain |
| avrB_group | AT3G25070 | AvrRpt-cleavage | Cleavage site for pathogenic type III effector avirulence factor Avr |
| avrB_group | AT1G14920 | GRAS | GRAS domain family |
| avrB_group | AT2G47060 | Pkinase | Protein kinase domain |
| avrB_group | AT3G17410 | Pkinase_Tyr | Protein tyrosine kinase |
| avrC_group | AT4G17680 | Apolipoprotein | Apolipoprotein A1/A4/E domain |
| avrPto_group | AT5G22355 | C1_2 | C1 domain |
| avrPto_group | AT5G22355 | C1_3 | C1-like domain |
| avrPto_group | AT4G11890 | Pkinase | Protein kinase domain |
| avrPto_group | AT3G48150 | TPR_11 | TPR repeat |
| AvrRps4_Pph_1448A | AT4G11070 | WRKY | WRKY DNA-binding domain |
| AVRRPT2_group | AT4G00710 | Pkinase_Tyr | Protein tyrosine kinase |
| AVRRPT2_group | AT4G00710 | TPR_11 | TPR repeat |
| HARXL10_WACO9 | AT1G50420 | GRAS | GRAS domain family |
| HARXL106_group | AT1G09270 | Arm | Armadillo/beta-catenin-like repeat |
| HARXL106_group | AT4G02150 | Arm | Armadillo/beta-catenin-like repeat |
| HARXL106_group | AT1G32230 | PARP | Poly(ADP-ribose) polymerase catalytic domain |
| HARXL106_group | AT1G32230 | RST | RCD1-SRO-TAF4 (RST) plant domain |
| HARXL14 | AT5G66200 | Arm | Armadillo/beta-catenin-like repeat |
| HARXL149 | AT4G35580 | NAM | No apical meristem (NAM) protein |
| HARXL16 | AT1G18400 | HLH | Helix-loop-helix DNA-binding domain |
| HARXL16 | AT4G32570 | tify | tify domain |
| HARXL21 | AT1G15750 | WD40 | WD domain, G-beta repeat |
| HARXL44 | AT4G25920 | DUF295 | Protein of unknown function (DUF295) |
| HARXL44 | AT4G16380 | HMA | Heavy metal-associated domain |
| HARXL45_group | AT4G02550 | Myb_DNA-bind_3 | Myb/SANT-like DNA-binding domain |
| HARXL68 | AT1G45145 | Thioredoxin | Thioredoxin |
| HARXL68 | AT5G42980 | Thioredoxin | Thioredoxin |
| HARXL73 | AT4G39050 | Kinesin | Kinesin motor domain |
| HARXL79 | AT5G56950 | NAP | Nucleosome assembly protein (NAP) |
| HARXLL445_group | AT2G35500 | CS | CS domain |
| HARXLL470_WACO9 | AT5G49000 | F-box | F-box domain |
| HARXLL470_WACO9 | AT5G49000 | Kelch_1 | Kelch motif |
| HARXLL470_WACO9 | AT1G79430 | Myb_DNA-binding | Myb-like DNA-binding domain |
| HARXLL492 | AT3G60600 | Motile_Sperm | Major sperm protein (MSP) domain |
| HARXLL60 | AT2G23290 | Myb_DNA-bind_6 | Myb-like DNA-binding domain |
| HARXLL73_group | AT1G03960 | EF_hand_5 | EF-hand domain pair |
| HARXLL73_group | AT4G26110 | NAP | Nucleosome assembly protein (NAP) |
| HARXLL73_group | AT5G56290 | TPR_11 | TPR repeat |
| HOPAB_group | AT3G57720 | Pkinase | Protein kinase domain |
| HOPAB_group | AT3G46370 | Pkinase_Tyr | Protein tyrosine kinase |
| HOPAB_group | AT3G27960 | TPR_12 | Tetratricopeptide repeat |
| HopBB1_Pmo_M301020 | AT3G17860 | CCT_2 | Divergent CCT motif |
| HopBB1_Pmo_M301020 | AT3G17860 | tify | tify domain |
| HOPD1_group | AT5G22290 | NAM | No apical meristem (NAM) protein |
| HOPF_group | AT2G04740 | BTB | BTB/POZ domain |
| HOPH1_group | AT5G43700 | AUX_IAA | AUX/IAA family |
| HopP1_Pto_DC3000 | AT4G36540 | HLH | Helix-loop-helix DNA-binding domain |
| HOPR1_group | AT5G60120 | AP2 | AP2 domain |
| HopX_group | AT5G13810 | Glutaredoxin | Glutaredoxin |
| OEC115 | AT4G28640 | AUX_IAA | AUX/IAA family |
| OEC115 | AT5G08130 | HLH | Helix-loop-helix DNA-binding domain |
| OEC115 | AT3G21490 | HMA | Heavy metal-associated domain |
| OEC115 | AT3G10480 | NAM | No apical meristem (NAM) protein |
| OEC45 | AT1G63480 | DUF296 | Domain of unknown function (DUF296) |
| OEC45 | AT4G00120 | HLH | Helix-loop-helix DNA-binding domain |
| OEC45 | AT1G12520 | HMA | Heavy metal-associated domain |
| OEC45 | AT3G22420 | Pkinase | Protein kinase domain |
| OEC59 | AT4G08320 | TPR_11 | TPR repeat |
| OEC67 | AT1G25550 | Myb_DNA-binding | Myb-like DNA-binding domain |
| OEC78 | AT4G30080 | AUX_IAA | AUX/IAA family |
| OEC78 | AT4G30080 | B3 | B3 DNA-binding domain |
| OEC78 | AT4G02590 | HLH | Helix-loop-helix DNA-binding domain |
Fig. 4Domain architectures and structural analyses of NLR-kinase fusions. a Distribution of identified integrated kinase domains. b Complex fusions of domains on top of NLR-kinases. c Three-dimensional structural model of the kinase domain of an Arabidopsis NLR-kinase At4G12020 (aa 8–258) modelled after the best structural match, human serine/threonine protein kinase PAK 6 (PDB: 2C30). Conservation profile across all plant kinases found in NLR-IDs is overlapped on the structure, with most conserved residues depicted in red and most divergent in blue. d Zoom-in on the active site and its critical lysine and aspartate residues and a corresponding alignment logo show that the active site of kinases is completely conserved across all fusions
Fig. 5Evolutionary analyses of the WRKY domain in NLR-WRKYs family. a Multiple sequence alignment logo of the WRKY domains found in all NLR-IDs shows conserved core structural tryptophan and incomplete conservation of tyrosine and lysine that have been shown to be essential for recognizing the W-box DNA. b Maximum likelihood phylogeny of all Arabidopsis WRKY domain-containing proteins (black nodes) and the WRKYs detected as fusions in all flowering plants (strawberry nodes). Distinct Arabidopsis WRKY clades that form fusions are highlighted as the following: green, RRS1 clade; yellow, WRKY46; pink, WRKY70 and WRKY54; blue, WRKY16; and purple, WRKY19/MEKK4. Arabidopsis WRKY41 known to be the host target is marked with red asterisk
Fig. 6Summary of the information encoded in the discovered NLRs that possess “integrated domains”