| Literature DB >> 26287177 |
Manoj Kumar Sekhwal1, Pingchuan Li2, Irene Lam3, Xiue Wang4, Sylvie Cloutier5, Frank M You6,7.
Abstract
Plants have developed effective mechanisms to recognize and respond to infections caused by pathogens. Plant resistance gene analogs (RGAs), as resistance (R) gene candidates, have conserved domains and motifs that play specific roles in pathogens' resistance. Well-known RGAs are nucleotide binding site leucine rich repeats, receptor like kinases, and receptor like proteins. Others include pentatricopeptide repeats and apoplastic peroxidases. RGAs can be detected using bioinformatics tools based on their conserved structural features. Thousands of RGAs have been identified from sequenced plant genomes. High-density genome-wide RGA genetic maps are useful for designing diagnostic markers and identifying quantitative trait loci (QTL) or markers associated with plant disease resistance. This review focuses on recent advances in structures and mechanisms of RGAs, and their identification from sequenced genomes using bioinformatics tools. Applications in enhancing fine mapping and cloning of plant disease resistance genes are also discussed.Entities:
Keywords: disease resistance gene; gene mining; nucleotide binding site leucine rich repeat (NBS-LRR); pentatricopeptide repeats (PPRs); receptor like kinase (RLK); receptor like protein (RLP); resistance gene analog (RGA); small RNA (sRNA)
Mesh:
Year: 2015 PMID: 26287177 PMCID: PMC4581296 DOI: 10.3390/ijms160819248
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Cloned R-genes from plant species.
| Species a | Accession ID | Domain b | Class | Chr | Disease c | Pathogen c | Reference | ||
|---|---|---|---|---|---|---|---|---|---|
|
| AAQ01784 | CNL | NBS | 1A | Leaf rust |
|
| [ | |
|
| ABS29034 | CNL | NBS | 5D | Leaf rust |
|
| [ | |
|
| AAQ96158 | CNL | NBS | 1A | Powdery mildew |
|
| [ | |
|
| AGQ17390 | CNL | NBS | 1D | Stem rust |
|
| [ | |
|
| AGP75918 | CNL | NBS | 3A | Stem rust |
|
| [ | |
|
| AAP74647 | NL | NBS | 1D | Leaf rust |
|
| [ | |
|
| AAC05834 | NL | NBS | 2D | Cereal cyst |
|
| [ | |
|
| AAM94164 | NL | NBS | 2B | Cereal cyst |
|
| [ | |
|
| AAG42168 | CNL | NBS | 1B | Stripe rust |
|
| [ | |
|
| AEF30547 | STK | Oth-R | 6V | Powdery mildew |
|
| [ | |
|
| ACN41354 | Oth-R | 7D | Leaf rust, Powdery mildew, Stripe rust |
| [ | |||
|
| ACF33195 | Kinase-START | Oth-R | 6B | Stripe rust |
|
| [ | |
|
| CAC29241 | CNL | NBS | 1 | Powdery mildew |
|
| [ | |
|
| AAG37356 | CNL | NBS | 1 | Powdery mildew |
|
| [ | |
|
| AAO16014 | CNL | NBS | 1 | Powdery mildew |
|
| [ | |
|
| AAM81980 | LRR-PK | RLK | 7 | Stem rust |
|
| [ | |
|
| CAB06083 | TM | Oth-R | 4 | Powdery mildew |
|
| [ | |
|
| AAC49408 | CNL | NBS | 5 | Bacterial speck |
|
| [ | |
|
| AAC67238 | CNL | NBS | 6 | Root knot |
|
| [ | |
|
| AAB63274 | NL | NBS | 11 | Fusarium wilt |
|
| [ | |
| AIB02970 | CNL | NBS | 9 | Late blight |
|
| [ | ||
|
| AAG31013 | CNL | NBS | 9 | Tomato spotted wilt |
| TSWV | [ | |
|
| AAQ10735 | CNL | NBS | 9 | Tobacco mosaic | TMV | [ | ||
|
| AAR21295 | TNL | NBS | 5 | Bacterial spot |
|
| [ | |
|
| CAD29729 | CNL | NBS | 4 | Potato cyst |
|
| [ | |
|
| 2207203A | LRR-TM | RLP | 6 | Leaf mold |
|
| [ | |
|
| CAA05268 | LRR-TM | RLP | 1 | Leaf mold |
|
| [ | |
|
| AAC78591 | LRR-TM | RLP | 6 | Leaf mold |
|
| [ | |
|
| CAA05274 | LRR-TM | RLP | 1 | Leaf mold |
|
| [ | |
| AAK58681.2 | LRR-TM | RLP | 9 | Verticillium wilt |
|
| [ | ||
|
| CAA05269 | LRR-TM | RLP | 1 | Leaf mold |
|
| [ | |
|
| 2115395A | STK | Oth-R | 5 | Bacterial speck |
|
| [ | |
|
| A49332 | STK | Oth-R | 5 | Bacterial speck |
|
| [ | |
|
| NP_001233803 | STK | Oth-R | 12 | Bacterial speck |
|
| [ | |
|
| CAB50786 | CNL | NBS | 12 | PVX | PVX | [ | ||
|
| Q7XBQ9 | CNL | NBS | 8 | Late blight |
| [ | ||
|
| CAB56299 | LZ-NL | NBS | 5 | PVX | PVX | [ | ||
|
| AAL39063 | LZ-NL | NBS | 5 | Late blight |
|
| [ | |
|
| Q9ZSD1 | NL | NBS | Downy mildew |
|
| [ | ||
|
| AAF09256 | CNL | NBS | Bacterial spot |
|
| [ | ||
|
| BAA25068 | NL | NBS | 4 | Bacterial blight |
|
| [ | |
|
| BAA76282 | CNL | NBS | 2 | Blast |
|
| [ | |
|
| AAK00132 | NL | NBS | 12 | Blast |
|
| [ | |
|
| ABI64281 | CNL | NBS | 8 | Blast |
|
| [ | |
|
| BAK39926 | CNL | NBS | 11 | Blast |
|
| [ | |
|
| AAY33493 | NL | NBS | 11 | Blast |
|
| [ | |
|
| ABI94578 | NL | NBS | 1 | Blast |
|
| [ | |
|
| A2XZI2 | NL | NBS | 5 | Bacterial blight |
|
| [ | |
|
| ABD78944 | SET | Oth-R | 8 | Bacterial blight |
|
| [ | |
|
| CCD33085 | CNL | NBS | Blast |
|
| [ | ||
|
| ABB88855 | CNL | NBS | 6 | Blast |
|
| [ | |
|
| ABC73398 | CNL | NBS | 6 | Blast |
|
| [ | |
|
| BAA75812 | CNL | NBS | 11 | Blast |
|
| [ | |
|
| ACN62386 | CNL | NBS | 6 | Blast |
|
| [ | |
|
| AAC49123 | LRR-STK | RLK | 11 | Bacterial blight |
|
| [ | |
|
| ABD36512 | LRR-STK | RLK | 11 | Bacterial blight |
|
| [ | |
|
| BAE95828 | RLK | 3 |
|
| [ | |||
|
| AGE45112 | Oth-R | 11 | Bacterial blight |
|
| [ | ||
|
| AGS56390 | TM | Oth-R | 12 | Bacterial blight |
|
| [ | |
|
| AEW90324 | LRR-TM | RLP | 6 | Bacterial blight |
|
| [ | |
|
| ACR15163 | B-lectin, STK | RLK | 6 | Blast |
|
| [ | |
|
| AAD47197 | NL | NBS | 10 | Rust |
|
| [ | |
|
| Q41867 | 1 | Corn leaf blight |
| C. carbonum | [ | |||
|
| CAA61131 | CNL | NBS | 3 | Downy mildew |
| [ | ||
|
| AAA21874 | NL | NBS | 4 | Downy mildew |
|
| [ | |
|
| AAC83165 | CNL | NBS | 5 | Downy mildew |
|
| [ | |
|
| AAF42832 | CNL | NBS | 3 | Downy mildew |
|
| [ | |
|
| BAC67706 | CNL | NBS | 5 | Mosaic type |
| CMV | [ | |
|
| AAC72977 | TNL | NBS | 3 | Downy mildew |
|
| [ | |
|
| AAM18462 | TNL | NBS | 4 | Downy mildew |
|
| [ | |
|
| CAB50708 | TNL | NBS | 5 | Powdery mildew |
|
| [ | |
|
| AAF08790 | TNL | NBS | 4 | Downy mildew |
|
| [ | |
|
| AAC26126 | NL | NBS | 1 | Downy mildew |
|
| [ | |
|
| ADM88042 | WRKY-TNL | NBS | 5 | Bacterial wilt |
|
| [ | |
|
| CAE51864 | LRR-TM | RLP | 1 | Downy mildew |
|
| [ | |
|
| AAY86486 | LRR-STK | RLK | 1 | Fusarium wilt |
|
| [ | |
|
| AAG38109 | STK | Oth-R | 5 |
|
| [ | ||
|
| AED95370 | LRR-STK | RLK | 5 | Powdery mildew |
| [ | ||
|
| AT4G33430 | LRR-STK | RLK | 4 |
| [ | |||
|
| AAB95208 | TM | Oth-R | 3 | [ | ||||
|
| AAK09267 | RPW8 | Oth-R | 3 | Powdery mildew |
|
| [ | |
|
| AAA91022 | TNL | NBS | 5 | Rust |
|
| [ | |
| AAD25974 | TNL | NBS | 5 | Rust |
|
| [ | ||
|
| AAB47618 | TNL | NBS | 8 | Rust |
|
| [ | |
| AAK28806 | TNL | NBS | 14 | Rust |
|
| [ | ||
|
| AAB48305 | LRR-TM | RLP | 1 | Beet cyst |
|
| [ | |
|
| AAA50763 | TNL | NBS | Tobacco mosaic |
| TMV | [ |
a: A. thaliana, Arabidopsis thaliana; B. vulgaris, Beta vulgaris; H. vulgare, Hordeum vulgare; L. sativa, Lactuca sativa; L. usitatissimum, Linum usitatissimum; N. tabacum, Nicotiana tabacum; O. sativa, Oryza sativa; P. nigrum, Piper nigrum; S. lycopersicum, Solanum lycopersicum; S. tuberosum, Solanum tuberosum; T. aestivum, Triticum aestivum; Z. mayes, Zea mayes; b: SET, sugar efflux transporter; TM, transmembrane; STK, serine/threonine protein kinase; c: PVX, potato virus X; B. graminis, Blumeria graminis; B. lactucae, Bremia lactucae; C. fulvum, Cladosporium fulvum; C. carbonum, Cochliobolus carbonum; E. cruciferarum, Erysiphe cruciferarum; E. graminis, Erysiphe graminis; F. oxysporum, Fusarium oxysporum; G. rostochiensis, Globodera rostochiensis; H. avenae, Heterodera avenae; H. schachtii, Heterodera schachtii; M. grisea, Magnaporthe grisea; M. oryzae, Magnaporthe oryzae; M. lini, Melampsora lini; M. javanica, Meloidogyne javanica; P. parasitica, Peronospora parasitica; P. infestans, Phytophthora infestans; P. syringae, Pseudomonas syringae; P. graminis, Puccinia graminis; P. sorghi, Puccinia sorghi; P. striiformis, Puccinia striiformis; P. triticina, Puccinia triticina; R. solanacearum, Ralstonia solanacearum; V. dahliae, Verticillium dahliae; X. campestris, Xanthomonas campestris; X. oryzae, Xanthomonas oryzae pv. oryzae (Xoo); CMV, cucumber mosaic virus; TMV, tobacco mosaic virus; TSWV, tomato spotted wilt virus; d: The chromosome numbers of genes were based on unpublished data.
Genome-wide identification of RGAs in plant genomes.
| Species a | Genome Size (Mb) b | Total Annotated Genes b | Total RGAs (%) c | NBS Coding Genes d | PPR e | RLK f | RLP g | Other h | Identification Method Used i | Reference | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CNL | TNL | CN | NL | TN | N | Total | ||||||||||
|
| ||||||||||||||||
| 125 | 25,498 | 5.27 | 51 | 79 | 8 | 20 | 17 | 26 | 201 | 441 | 600 | 56 | 46 | H, P, B | [ | |
| 207 | 32,670 | 0.56 | 21 | 103 | 17 | 14 | 20 | 10 | 185 | H, B | [ | |||||
| 485 | 45,555 | 3.18 | 119 | 64 | 19 | 83 | 13 | 46 | 344 | 600 | 379 | 127 | MEME, CO, Paircoil2, MC | [ | ||
| 475 | 30,434 | 3.81 | 203 | 97 | 26 | 12 | 14 | 0 | 352 | 600 | 210 | H, B, MEME | [ | |||
| 373 | 43,484 | 0.34 | 31 | 57 | 10 | 5 | 22 | 7 | 132 | 16 | MEME/MAST | [ | ||||
| 900 | 34,727 | 0.84 | 118 | 18 | 19 | 43 | 5 | 49 | 252 | 16 | 13 | 13 | H, B | [ | ||
| 372 | 28,629 | 0.18 | 4 | 6 | 44 | 54 | TBN, MEME, CW, MC, H | [ | ||||||||
| 367 | 26,682 | 0.26 | 25 | 19 | 1 | 17 | 5 | 3 | 70 | H, CO, ME, CX, SMART, P, B | [ | |||||
| 844 | 39,031 | 1.47 | 65 | 37 | 24 | 184 | 12 | 113 | 435 | 142 | H, B | [ | ||||
| 454 | 62,388 | 1.20 | 152 | 118 | 25 | 0 | 38 | 328 | 661 | 92 | B, H | [ | ||||
| 880 | 40,976 | 1.19 | 35 | 41 | 18 | 96 | 9 | 31 | 230 | 60 | 144 | 56 | B, CO, SMART, MC, CW, IPS, ME5, | [ | ||
| 485 | 41,174 | 0.60 | 19 | 93 | 15 | 27 | 23 | 29 | 206 | 42 | B, H | [ | ||||
| 630 | 45,758 | 0.52 | 6 | 40 | 5 | 24 | 29 | 53 | 157 | 82 | B, H | [ | ||||
| 240 | 34,809 | 0.27 | 61 | 16 | 8 | 1 | 86 | 8 | B, MU, ME, MEME | [ | ||||||
| 742 | 57,386 | 1.86 | 218 | 161 | 54 | 276 | 69 | 182 | 960 | 110 | H, B, CW, MEME | [ | ||||
| 472 | 19,848 | 0.42 | 9 | 8 | 19 | 3 | 16 | 29 | 84 | BP, CO, P, MEME | [ | |||||
| 430 | 28,798 | 1.09 | 82 | 8 | 46 | 104 | 4 | 53 | 297 | 17 | B, H | [ | ||||
| 510 | 35,938 | 0.46 | 9 | 3 | 2 | 5 | 0 | 1 | 20 | 103 | 45 | B, CO, MU, ME | [ | |||
| Average | 500 | 37,433 | 1 | 69 | 56 | 19 | 55 | 18 | 56 | 263 | 436 | 264 | 71 | 72 | ||
|
| ||||||||||||||||
| 420 | 59,855 | 4.22 | 159 | 0 | 7 | 40 | 3 | 45 | 254 | 477 | 1429 | 90 | 281 | H, B, MEME, P | [ | |
| 17,000 | 94,000 | 2.37 | 98 | 0 | 555 | 318 | 971 | 1266 | H, B, MEME | [ | ||||||
| 2300 | 32,540 | 0.90 | 58 | 0 | 21 | 31 | 0 | 69 | 179 | 113 | 2 | P, H, B, CO | [ | |||
| 739 | 34,496 | 1.29 | 36 | 0 | 99 | 133 | 0 | 64 | 332 | 114 | P, H, B, CO, ME, CW | [ | ||||
| 5100 | 30,400 | 1.38 | 101 | 51 | 145 | 34 | 331 | 89 | [ | |||||||
| 272 | 25,532 | 1.23 | 133 | 0 | 28 | 87 | 0 | 34 | 282 | 34 | P, H, B, CO, CW | [ | ||||
| 4940 | 34,879 | 1.63 | 235 | 0 | 44 | 218 | 38 | 535 | 35 | H | [ | |||||
| 4360 | 43,150 | 1.94 | 296 | 0 | 63 | 288 | 81 | 728 | 112 | H | [ | |||||
| Average | 4391 | 44,357 | 2 | 140 | 0 | 39 | 187 | 1 | 85 | 452 | 477 | 771 | 90 | 242 | ||
a: A. tauschii, Aegilops tauschii; A. lyrata, Arabidopsis lyrata; B. distachyon, Brachypodium distachyon; B. oleracea, Brassica oleracea; B. rapa, Brassica rapa; C. papaya, Carica papaya; C. sativus, Cucumis sativus; F. vesca, Fragaria vesca; G. raimondii, Gossypium raimondii; L. japonicus, Lotus japonicus; M. truncatula, Medicago truncatula; M. x domestica, Malus x domestica; P. patens, Physcomitrella patens; P. trichocarpa, Populus trichocarpa; S. bicolor, Sorghum bicolor; T. cacao, Theobroma cacao; T. urartu, Triticum urartu; V. vinifera, Vitis vinifera; b: Most of the information concerning the genome sizes and the total number of annotated genes was obtained from [144]; c: The percentages calculated based on present data, not referred from references; d: CNL, CC-NBS-LRR; TNL, TIR-NBS-LRR; CN, CC-NBS; NL, NBS-LRR; TN, TIR-NBS; N, NBS; e: PPR, pentatricopeptide repeat; f: RLK, receptor like kinase; g: RLP, receptors like proteins; h: Other, includes TIRX, XN, TNLX, TNTNL, TTNL, XTNX, CNX, TX and Partial NBS–LRR; i: B, BLAST; CO, COILS; CW, ClustalW; CX, ClustalX; H, HMM; MC, MARCOIL; IPS, InterProScan; ME, MEGA; MU, MUSCLE; P, Pfam.
Figure 1Schematic representation of common structures of four major plant R proteins. Motifs are depicted as colored boxes and labeled under the domain names. Note: the domain lengths are not drawn to scale for ease of visualization. (A) Typical domain dissection for TNL and CNL proteins. Only highly conserved motifs are illustrated; (B) Domain structures for RLKs and RLPs. The kinase domain is absent in RLPs. Other common domains utilized in our genome-wide identification pipeline are labeled above the colored boxes. TIR: Toll/interleukin-1 receptor; NB: nucleotide-binding site; ARC: abbreviated from Apaf-1, R proteins and CED-4; CC: coiled-coil; SP: signal peptide; TM: transmembrane; LRR: leucine-rich repeats.
Figure 2Intracellular signaling mechanisms of RGAs in plant defense. RIN4, PBS1, Pto and EDS1 are targeted and modified by numerous effectors and, as a result, their corresponding TNL or CNL will detect the modification to initiate ETI responses [162,163,164]. TIR-TIR interactions occur between RPS4 and RRS1 to further activate defense genes [169]. Flg22, a bacterial PAMP, activates FLS2 and BAK1 RLKs to initiate the MAP kinase cascade that triggers PTI/MTI responses [170]. MAP kinase cascade signaling can be interrupted by pathogenic effectors. When MPK4 is compromised, SUMM2 will not be inactivated and will initiate PCD [165]. Effector Avr4 is recognized by Cf-4 RLP to initiate MAP kinase cascade and ROS production while simultaneously increasing calcium levels in the cytosol [171]. Upon Erysiphe cruciferarum infection, RPW8.2 can translocate from the Golgi to the extrahaustorial membrane where the fungal haustorium has penetrated to activate the downstream signaling of PCD [172,173]. Under normal conditions, NBS-LRR transcripts derived from the PHAS locus are regulated through transcript degradation by miRNAs [174]. Such miRNAs include, among others, miR1510, miR1507, miR2109, miR482/2118, miR5668, miR5376, miR172 and miR5041 [174,175,176]. Single arrows may indicate multi-step processes.
Other cloned genes relevant to plant defense.
| Species |
| Accession ID | Domain a | Chr | Disease | Avr |
| Reference |
|---|---|---|---|---|---|---|---|---|
|
| AAF18432 | CHORD | 2 | Powdery mildew |
| [ | ||
|
| AAF67518 | TLC | 3 | Black mold rot; Black shoulder |
| [ | ||
|
| BAC77666 | PPR | 10 | [ | ||||
|
| TM | [ | ||||||
|
| AAC49371 | PPR | 9 | [ | ||||
|
| NP_176454 | PPR | 1 | [ | ||||
|
| NP_176481 | PPR | 1 | [ | ||||
|
| AT1G05760 | Jacalin like | 1 | Tobacco etch | TEV | [ | ||
|
| AEE33357 | PGK | 1 | Mosaic type | WMV | [ | ||
|
| AAD20950 | Lipase-like | 3 |
|
| [ | ||
|
| AAC49611 | Ankyrin | 1 |
| [ |
a: CHORD, cysteine and histidine-rich domain; PGK, phosphoglycerate kinase; TLC, tram-lag1-cln8; TM, transmembrane; b: A. alternata, Alternaria alternata; TEV, Tobacco etch virus; WMV, watermelon mosaic virus.
Common software used for RGA domain and motif identification.
| Software | Latest Version | Input Type a | Required Database | Description | Parallel Support b | URL c | Reference |
|---|---|---|---|---|---|---|---|
| HMMER | 3.1b2 | D/P | HMM model | Protein or DNA sequence homolog search toolkits using profile hidden Markov models and featured by remote homolog identification. The latest version is as fast as BLAST thanks to the underlying mathematical models. | HT/MPI | [ | |
| MEME | 4.10 | D/P | Discover novel and ungapped motifs from nucleotide or protein sequences without well trained dataset samples. | MPI | [ | ||
| mCUDA-MEME | 3.0.15 | D/P | An ultrafast scalable motif discovery program running on graphics processing unit (GPU). The algorithm is based on MEME using a hybrid combination of CUDA, MPI and OpenMPI parallel programming models. | CUDA/MPI | [ | ||
| BLAST+ | 2.30 | D/P | BLAST databases, like nr or nt database | Classical similarity search toolkits for bioinformatics data mining. The latest version significantly improves the speed on CPU and efficiency on RAM for long queries. | HT | [ | |
| pfam_scan.pl | 1.0 | P | Pfam-A HMM model | A Perl script for PFAM database search, which invokes “hmmscan” in the HMMER toolkit package to search known HMM models. | |||
| InterproScan | 5.9 | P | PFAM, SMART, PANTHER, PROSITE, Superfamily,
| A tool that combines different protein signature recognition methods native to the InterPro member databases into one resource with lookup of corresponding InterPro and GO annotations. | HT | [ | |
| Phobius | 1.01 | P | HMM model | A HMM based tool for transmembrane (TM) topology and signal peptides (SP) prediction from proteins. A pre-training HMM model has been embedded in the tool. | [ | ||
| TMHMM | 2.0 | P | HMM model | A HMM based tool with similar functions to Phobius. | [ | ||
| nCOILS | 2.2 | P | Scoring matrix | A program to detect CC domains by comparing and scoring protein sequences with a known coiled-coils database with the MTK or MTIDK calculation matrix, which reports a probability that the sequence adopts a coiled-coil conformation. | [ |
a: D, nucleotide; P, amino acid; b: HT, hyper-thread; MPI, message passing interface; CUDA, a computing platform implemented by nVIDIA on GPUs; c: abbreviated bitly URL links were used to replace real URL, case sensitive.
Figure 3A common procedure for identification and characterization of plant RGAs.
Figure 4Phylogenetic analysis of RGAs in plants. The protein sequences of 63 RGAs or cloned R-genes from eleven plant species were selected for this analysis using MEGA 6 [258]. The protein sequences were aligned using the Muscle algorithm, and then clustered using the neighbor-joining algorithm with the p-distance model, pairwise deletion for gaps or missing data treatment, and 500 bootstrap replicates. The phylogenetic tree consists of two large clades (Clades I and II), representing the NBS-LRR class and the RLK/RLP class of proteins, respectively. Clade I may be divided into two sub-groups (Clades Ia and Ib), containing TNL and CNL proteins, respectively, while Clade Ib may be further split into several diverged CNL clusters. The bootstrap values are labelled on branches.