| Literature DB >> 23216678 |
Walter Sanseverino1, Maria Raffaella Ercolano.
Abstract
BACKGROUND: Plant resistance genes, which encode R-proteins, constitute one of the most important and widely investigated gene families. Thanks to the use of both genetic and molecular approaches, more than 100 R genes have been cloned so far. Analysis of resistance proteins and investigation of domain properties may afford insights into their role and function. Moreover, genomic experiments and availability of high-throughput sequence data are very useful for discovering new R genes and establish hypotheses about R-genes architecture. RESULT: We surveyed the PRGdb dataset to provide valuable information about hidden R-protein features. Through an in silico approach 4409 putative R-proteins belonging to 33 plant organisms were analysed for domain associations frequency. The proteins showed common domain associations as well as previously unknown classes. Interestingly, the number of proteins falling into each class was found inversely related to domain arrangement complexity. Out of 31 possible theoretical domain combinations, only 22 were found. Proteins retrieved were filtered to highlight, through the visualization of a Venn diagram, candidate classes able to exert resistance function. Detailed analyses performed on conserved profiles of those strong putative R proteins revealed interesting domain features. Finally, several atypical domain associations were identified.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23216678 PMCID: PMC3532234 DOI: 10.1186/1756-0500-5-678
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Figure 1R-domain associations found in the UniGene PRG dataset. Predicted sequences are grouped according to the number of domains identified (A-E). Box F reports the associations not found in our dataset. LRR = Leucine-rich repeat; NBS = Nucleotide-binding site; TIR = Toll-interleukin like receptor; KIN = kinase.
Figure 2A) Distribution of theoretical (red) and observed (blue) domain associations. B) R proteins arranged in ascending order with corresponding observed frequencies.
Proteins containing putative transmembrane motifs
| CNL | 20 | 6.4% | 17 | 3 | 0 |
| RLP | 107 | 100% | 63 | 41 | 3 |
| TNL | 17 | 16.5% | 12 | 3 | 2 |
| TIR | 9 | 10.9% | 6 | 2 | 1 |
| NBS-LRR | 1 | 2.2% | 1 | 0 | 0 |
| NBS | 3 | 8.1% | 8 | 0 | 0 |
| TIR-NBS | 2 | 9% | 0 | 2 | 0 |
| NBS-OthR | 1 | 8.3% | 1 | 0 | 0 |
| PTO-like | 9 | 56.2% | 1 | 8 | 0 |
| MLO-like | 11 | 100% | 0 | 4 | 7 |
| TNL-OthR | 1 | 7.1% | 0 | 1 | 0 |
For each class we report the number of sequences with a TM signature (n.), the percentage of sequences with a TM motif on the total (%) and the number of sequences with one, two or more TM domains.
Figure 3Edwards Venn diagram in which proteins are grouped according to their interpro-scan profile. The edge colors represent the five domains [LRR, Kinase, NBS, Other (OthR) and TIR] reported in legend. The group’s intersections showed all possible domain combinations. The colour filling intensity indicated the affinity of a determinate group to exert a resistance function. Stronger colours showed a high probability to exert only a resistance function; Purple red has been used for the CNL and TNL families. The families that could exert a resistance function, but have been not yet described in the literature, were brown coloured. Green and yellow colours have been used for families that need further analysis to validate their putative resistance function. Grey areas showed classes not found in our dataset. Numbers indicated proteins belonging to a specific group. Arrows were used to highlight the analyses performed for a given group in order to select further R-protein candidates.
Multiple alignment comparison of 10 R protein groups composed by typical resistance domains
| CNL | 310 | 1014 | 5050 | 16.3% | 0.3% |
| RLP | 107 | 651 | 1734 | 17.1% | 0.0% |
| TNL | 103 | 1146 | 3717 | 25.6% | 0.1% |
| TIR | 82 | 226 | 801 | 29.3% | 0.5% |
| NBS-LRR | 45 | 831 | 2750 | 15.4% | 0.4% |
| NBS | 37 | 361 | 1070 | 14.5% | 0.1% |
| CC-NBS | 30 | 395 | 804 | 16.4% | 0.1% |
| TIR-NBS | 22 | 502 | 833 | 31.8% | 1.0% |
| PTO-like | 16 | 561 | 921 | 35.7% | 6.0% |
| MLO-like | 10 | 427 | 622 | 55.5% | 14.1% |
Groups were ordered according to the number of retrieved sequences. Average sequence lengths, percentage of identity and percentage of identical sites are reported.
Figure 4A) Interproscan results of consensus sequences of each analysed class B) Pattern of conservation along consensus sequence among members of each class.
Atypical R-domain combinations found in the UniGene PRGdb dataset
| At.43365 | NBS-LRR-AtyR | Mob1 | |
| Os.24417 | NBS-LRR-AtyR | Zinc finger, ZZ-type | |
| Os.25168 | NBS-AtyR | GTPase Containing Family | |
| Os.25222 | NBS-AtyR | Phosphatase 2C | |
| Os.25268 | NBS-AtyR | WRKY transcription factor; Gag-Pol-related Retrotransposon | |
| Os.27097 | NBS-AtyR | WRKY transcription factor | |
| Os.78619 | NBS-AtyR | Zinc finger, CCHC-type | |
| Os.79969 | NBS-AtyR | DUF1979 | |
| Os.93921 | NBS-AtyR | Cecropin; Origin replication binding protein | |
| Pth.15498 | NBS-AtyR | Gag-Pol-related Retrotransposon | |
| Pth.15636 | NBS-AtyR | Zinc finger, BED-type | |
| At.3076 | TIR-LRR-NBS-KIN-Serthr-AtyR | Phenylalanine Hydroxylase (PAH); WRKY transcription factor | |
| At.38115 | TIR-AtyR | DUF541 | |
| At.51652 | TIR-AtyR | Toll-IL-1 receptor domain-containing adapter protein (TIRAP) | |
| Ghi.9199 | TIR-AtyR | Toll-IL-1 receptor domain-containing adapter protein (TIRAP) | |
| Gma.3221 | TIR-AtyR | Helix-loop-helix structural domain | |
| Vvi.2456 | TIR-AtyR | Toll-IL-1 receptor domain-containing adapter protein (TIRAP) | |
| At.46853 | TNL-AtyR | Pleckstrin homology domain (PH); Regulator of chromosome condensation | |
| Pth.16040 | TNL-AtyR | CALCINEURIN B | |
| Pth.16041 | TNL-AtyR | DNA-Directed RNA Polymerase II | |
| Pth.16058 | TNL-AtyR | Gag-Pol-related Retrotransposon | |
| Han.34 | TIR-NBS-AtyR | Steroid Binding Protein | |
| Psi.4721 | TIR-NBS-AtyR | Toll-IL-1 receptor domain-containing adapter protein (TIRAP) |
Sequences are divided according to the R domain and sorted by species.
Candidate R genes showing a transposon insertion
| CNL-WRKY | ||
| CNL | ||
| Ser/thr | ||
| CNL | ||
| RLP | ||
| LRR | ||
| RLP | ||
| NBS | ||
| TNL-TIR | ||
| TNL-TIR | ||
| TNL |
Sequences are divided according to species and resistance classes. Gene names correspond to UniGene Id.