| Literature DB >> 21408199 |
Abstract
LRR-kinases constitute the largest subfamily of receptor-like kinases in plants and regulate a wide variety of processes related to development and defense. Through a reiterative process of sequence analysis and re-annotation, we identified 309 LRR-kinase genes in the rice genome (Nipponbare). Among them, 127 genes in the Rice Annotation Project Database and 85 in Refseq of NCBI were amended (in addition, 62 LRR-kinase genes were not annotated in Refseq). The complete set of LRR-kinases was characterized. These LRR-kinases were classified into five groups according to phylogenetic analysis, and the genes in groups 1, 2, 3 and 4 usually have fewer introns than those in group 5. The introns in the LRR domain, which are highly conserved in regards to their positions and configurations, split the first Leu or other amino residues at this position of the 'xxLxLxx' motif with phase 2 and usually separate one or more LRR repeats exactly. Tandemly repeated LRR motifs have evolved from exon duplication, mutation and exon shuffling. The extensive distribution and diversity of the LRR-kinase genes have been mainly generated by tandem duplication and mutation after whole genome duplication. Positive selection has made a limited contribution to the sequence diversity after duplication, but positively selected sites located in the LRR domain are thought to involve in the protein-protein interaction.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21408199 PMCID: PMC3050792 DOI: 10.1371/journal.pone.0016079
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Numbers of rice LK genes with annotation errors.
| Annotation errors | In RAP-DB | In Refseq of NCBI |
| Including introns | 59 | 13 |
| Missed exons inside the gene | 10 | 11 |
| Missed exons inside the gene with other errors | 8 | 4 |
| gene fusion | 1 | |
| Incorrect intron/exon splice boundaries or numbers of exons | 7 | 5 |
| Truncated gene or wrong terminal exon or premature stop codon | 29 | 41 |
| Truncated gene with other errors | 4 | 3 |
| Wrong start | 2 | 5 |
| Wrong start and other errors | 7 | 4 |
| No annotation | 2 | 62 |
| Total | 129 | 147 |
Figure 1Gene structure and Introns/Exons of LK-encoding genes in rice.
Introns and exons were drawn to scale with the full encoding regions of their respective gene. Filled boxes indicate the exon, and white boxes indicate the intron. Numbers under “# in NPB” indicate the number of LKs with this structure in the Nipponbare genomic sequence. 0 = Intron phrase 0; 1 = Intron phrase 1 and 2 = Intron Phase 2. *The first exon of Os01g0810533 is too small to be shown.
Figure 2The distribution of the different sizes of exons in the LRR region.
Figure 3Phylogenetic trees of protein kinases of rice LKs.
A ML tree with the Jones-Taylor-Thornton amino acid change model was generated with the protein kinase domain aligned with DIALIGN2. The distance scale is under the figure, and branch lengths are proportional to genetic distance. The LKs were classified into 5 groups, and group and subgroup names are shown on the right. The accession numbers of the LKs in the same subgroup that cannot be shown are replaced by the name of the subgroup.
Major motifs in signal peptide and transmembrane region within rice LKs.
| Domain | Motifs | Sequence |
| Partial LRRNT | S1 | Cx |
| Partial LRRNT | S2 | D/exxA |
| Partial LRRNT | S4 | xx |
| Signal peptide | S13 | lLL |
| T1 | txxsfx | |
| T2 | xxxv/s/I S/tYxd/e L/ixx | |
| T7 | gxxrx | |
| TM domain | T9 | x v/i xx v/i xxxxxxlxlx |
| T16 | x |
*If the bits value (see fig. S2) of the amino acid at this position is smaller than 0.5, it is represented with x; 1> bits ≧0.5, with lowercase; 2> bits ≧1, with capital letter; 3> bits ≧2, with bold capital; bits >3, with underlined capital letter in bold.
Major motifs in predicted LRR domain.
| Motifs | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 1 |
| x | L | G | X | L | x | x | L | x | X | L | x | L | x | x | N | x | L | t/s | g | x | I | P | x | x | L | G | x | L | x | x | L | |
| L1 | x | l/f/i | G | X |
| x | x |
| x | X |
|
|
|
| x |
| x |
| t/s |
| x |
|
| x | ||||||||
| L2 | x | l/i | G | X |
| x | x |
| x | X |
| x |
| x | x |
| x |
| S |
| x |
|
| x | ||||||||
| L3 | l | x | x |
| x | X |
| n/d | L |
| x |
| x | L/f | x |
| x | I/v |
| x | x | x | x | x | ||||||||
| L4 | x | L/i | G | X |
| x | x |
| x | X |
| x |
| x | x |
| x | L/f | t/s |
| x |
|
| x | ||||||||
| L5 | x | L/i | x | X |
| t | x |
| x | X | L | x | L | x | x |
| x | L/f | x |
| x | I/l |
| x | ||||||||
| L6 | x | x |
| x | L | x | x |
| x | F/l | t/s |
| x | I |
| x | S | L/i | g | n | l | x | x |
| ||||||||
| L7 | x | l/i/f | x | X |
| x | x |
| x | x |
| x |
| x | x |
| x | L/f | s |
| x | I |
| x | x | |||||||
| L8 | x | x | x |
| x | x |
| x |
| x | x |
| x | L/f | S |
| x | I |
| x | x | |||||||||||
| L9 | x | L/i | G | N | L | t | x |
| x | x |
| x | L | x | x |
| x | L | x |
| x | I/l |
| |||||||||
| L10 | x | L | x | X | C | x | x |
| x | x | L | x | L | x | x |
| x | L/f | x |
| x | l/i |
| x | x | |||||||
| L11 | L | P | x |
| x | x | L | x | L | x | x |
| x | F/l | x |
| x | I |
| x | ||||||||||||
| L12 | x | l/i | x | N | l | s | x |
| x | x | L | x | L | x | x |
| x |
| x |
| x | I |
| x | ||||||||
| L13 | x | x |
| x |
| x | x |
| x | L/f | x |
| x | I |
| x | x | I/l | G | x | L | x | x |
| ||||||||
| L14 | x | L | x | X | L | x | x |
| x | x | L | x | L | x | x |
| x | L | x |
| x | l |
| x | ||||||||
| L15 | X | x | x | x |
| x | x | L | x | L | x | x |
| x | F/L | x |
| x | i/l |
| ||||||||||||
| L16 | x | L | x | X | l | x | x |
| x | x |
| x |
| x | x |
| x |
| x |
| x | l |
| x | ||||||||
| L17 | X | x | x | x | L | x | x | L | x |
| s | x |
| x |
| x |
| x | I/l |
| ||||||||||||
| L18 | x | x | L | x | L | x | x |
| x | L | x |
| x | l/i |
| x | x | i/l | g | x | ||||||||||||
| L19 | l/i | G | X | L | x | x |
| x | x | L | x | L | x | x |
| x | L | x | x | x | ||||||||||||
| L20 | x | x | g | X | L | S | s | L | x | x | L | x | l | x | x |
| x | L | s |
| x | I/l |
| |||||||||
| L21 | x | L | x | X | x | x | x |
| x | x | L | x | L | x | x |
| x | L | x |
| x | I/l |
| x | ||||||||
| L22 | x | x | x | L | x | x | L | x | L | x | x |
| x |
| x |
| x | I/l |
| |||||||||||||
| L23 | x |
| x | L | x | x |
| q |
| x |
| x | x | x | x | D/g |
| e |
| l/m | ||||||||||||
| L24 | x | l | x | x | L | x | x |
| x | L | s | x |
| x | L | x |
| x | I/l |
| ||||||||||||
| L25 | r |
| t | a | L | x |
| x | g | x | x |
| x |
| x | L/i |
| P | x | l | ||||||||||||
| L26 | x | L/i | x | n | x | x | x |
| x | x | l/i | x | L | x | x |
| x | F/L | x | G | x | i/l |
| x |
*The plant LRR motif and amino acid position in the motif come from Bella et al. (2008).
**If the bits value (see fig. S2) of the amino acid at this position is smaller than 0.5, it is represented with x; 1> bits ≧0.5, with lowercase; 2> bits ≧1, with capital letter; 3> bits ≧2, with bold capital; bits >3, with underlined capital letter in bold.
Major motifs of protein kinase domain in predicted rice LKs.
| Subdomain | Motifs | Sequence | Corresponding motif sequence in human PK |
| IX | P1 | Ks/g | Yxxxx |
| VIB | P2 | xI/vv/i |
|
| III | P3 | xs |
|
| V | P4 | LdW/lxx | xxQI/va/lx |
| IV | P5 |
| l/iV/im |
| I | P6 | xxNlI/L | L/I |
| VIII | P7 | xxxxxxxx | xtxx |
| XI | P8 | lxxvlxl/v/iA/gL | axdL/fl/ixxl/m |
| VII | P9 | dmx | v/i |
| II | P10 | x | V |
| P11 |
| ||
| P12 | x | ||
| P13 | xev/iv/l | ||
| P14 |
| ||
| P15 | xx | ||
| VIII | P16 | xxxxxx |
|
| P17 |
| ||
| P18 |
| ||
| P19 | SsrxxseqsxVrxAxxqLxDi | ||
| P20 | xssxsGSt/sxxefsxqxExxP |
*[9].
**If the bits value (see fig. S2) of the amino acid at this position is smaller than 0.5, it is represented with x; 1> bits ≧0.5, with lowercase; 2> bits ≧1, with capital letter; 3> bits ≧2, with bold capital; bits >3, with underlined capital letter in bold.
***The sequences of the human protein kinases come from Manning et al.(2002).
Likelihood ratio test of positive selection in family proteins.
| Sub-group | n | 2Δ | 2Δ | 2Δ | 2Δ | M8estimates | Positively selected sites (Posterior>0.90) | Percent of positively selected sites in xxLxLxx motif to LRR domain |
| 1-1 | 20 | 3587.8** | 0 | 185.4** | 122.8** | p1 = 0.069, ω = 2.26 | 57, | 37/38 |
| 1-3 | 19 | 5657.6** | 413.4** | 98.8** | 53.6** | p1 = 0.067, ω = 1.68 |
| 34/38 |
| 1-4 | 10 | 1481.2** | 280.4** | 14.8** | 8.2** | p1 = 0.041, ω = 1.71 | Posteriors of the sites <0.90 | 14/19 |
| 1-5 | 21 | 4145.8** | 84.8** | 59.4** | 57.8** | p1 = 0.069, ω = 1.81 |
| 24/27 |
| 1-6 | 22 | 2440.8** | 0.0 | 1.2 | 0.0 | p1 = 0.037, ω = 1.0 | None | |
| 2-10 | 9 | 1127.4** | 0.0 | 1.7 | 0.0 | p1 = 0.037, ω = 1.12 | None | |
| 3-2 | 17 | 2503.2** | 0.0 | 6.2* | 4.6* | p1 = 0.022, ω = 1.56 | Posteriors of the sites <0.90 | 6/13 |
| 4-1 | 25 | 2634.6** | 0.0 | 7.7* | 4.8* | p1 = 0.016, ω = 2.36 |
| none |
| 5-2 | 22 | 2786.8** | 0.0 | 120.0** | 46** | p1 = 0.080, ω = 1.64 |
| 3/4 |
| 5-3 | 20 | 2846.4** | 0.0 | 24.5** | 22.4** | p1 = 0.026, ω = 2.51 | 352, 467, 452, 446 | none |
| 5-10 | 11 | 1279.0** | 0.0 | 0.0 | 0.0 | p1 = 0.00001, ω = 1.0 | None |
Number of sequences in the group.
*: significant at 5% level, **: significant at 1% level.
ω is dN:dS estimated under M8; p1 is the inferred proportion of positively selected sites.
Sites potentially under positive selection identified under model M8 are listed according to conserved sequence numbering respectively. Positively selected sites with posterior probability >0.99 are underlined in bold, 0.95–0.99 in bold.
Figure 4The Model for LRR domain evolution of LKs.
Boxes indicate exons, and lines indicate introns.