| Literature DB >> 28066499 |
Ping-Li Liu1, Lu-Lu Xie2, Peng-Wei Li3, Jian-Feng Mao1, Hui Liu1, Shu-Min Gao1, Peng-Hao Shi1, Jun-Qing Gong1.
Abstract
Leucine-rich repeat receptor-like protein kinases (LRR-RLKs) are the largest group of receptor-like kinases, which are one of the largest protein superfamilies in plants, and play crucial roles in development and stress responses. Although the evolution of LRR-RLK families has been investigated in some eudicot and monocot plants, no comprehensive evolutionary studies have been performed for these genes in basal angiosperms like Amborella trichopoda. In this study, we identified 94 LRR-RLK genes in the genome of A. trichopoda. The number of LRR-RLK genes in the genome of A. trichopoda is only 17-50% of that of several eudicot and monocot species. Tandem duplication and whole-genome duplication have made limited contributions to the expansion of LRR-RLK genes in A. trichopoda. According to the phylogenetic analysis, all A. trichopoda LRR-RLK genes can be organized into 18 subfamilies, which roughly correspond to the LRR-RLK subfamilies defined in Arabidopsis thaliana. Most LRR-RLK subfamilies are characterized by highly conserved protein structures, motif compositions, and gene structures. The unique gene structure, protein structures, and protein motif compositions of each subfamily provide evidence for functional divergence among LRR-RLK subfamilies. Moreover, the expression data of LRR-RLK genes provided further evidence for the functional diversification of them. In addition, selection analyses showed that most LRR-RLK protein sites are subject to purifying selection. Our results contribute to a better understanding of the evolution of LRR-RLK gene family in angiosperm and provide a framework for further functional investigation on A. trichopoda LRR-RLKs.Entities:
Keywords: Amborella trichopoda; expression; functional divergence; gene structure; leucine-rich repeat receptor-like kinase (LRR-RLK); protein motif; protein structure
Year: 2016 PMID: 28066499 PMCID: PMC5179525 DOI: 10.3389/fpls.2016.01952
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Subgroups of LRR-RLK proteins from .
| I | 8.2 (41/5) | Mc, M3, M8, M11, M13 | 10, 11, 15 | MEE39, ISO1, RHS6, FRK1 | |
| II | 2.3 (14/6) | Mc,M3, M9, M11, M13 | SERK1-2, BAK1, BKK1, SARK, ATNIK1-3 | ||
| III | 2.6 (41/16) | 3~17, | Mc, M13 | PRK1-6, RLK, RUL1, TMKL1, PXC1, RKL1, SIRK1, IMK2-3 | |
| IV | 3 (3/1) | Mc, M3, M11 | |||
| V | 4.5 (9/2) | 6, 8 | Mc, M11, M13 | SRF1, SRF3-8, SUB/SRF9 | |
| VI-1 | 1.7 (5/3) | 7~9 | Mc, | ||
| VI-2 | 4 (4/1) | Mc, | MRH1 | ||
| VII-1 | 2 (2/1) | Mc, M3, M11, M13 | |||
| VII-2 | 1.7 (5/3) | 13~18 | M | 1, | PXC2 |
| VIII-1 | 2 (8/4) | 7, | Mc, M3, M11, M13 | ||
| VIII-2 | 4 (12/3) | 4, | Mc, M3, M11, | 16, 17, 22 | |
| IX | 1.3 (4/3) | 8, 9, 11 | Mc, M3, M11, M13 | 0, | BARK1, TMK1 |
| X* | 1.7 (15/9) | 14~28 | Mc, M11, M6, M13 | BIR1, BRL1-3, PSKR1-2, EMS1 | |
| XI* | 1.3 (32/24) | 17~27 | Mc, M3, M8, M11, M13 | PXY, BAM1-3, PEPR | |
| XII | 1 (7/7) | 10~26 | Mc, M3, M11, M13 | FLS2, EFR | |
| XIII-1 | 1 (3/3) | 3, | Mc, M3, M8, M11, M13 | FEI1-2 | |
| XIII-2 | 3 (3/1) | Mc, M3, M8, M11, M13 | ERECTA, ERL1-2, | ||
| XV | 1 (2/2) | 15, 18 | Mc, M3, M11, M13 | 0, 1 |
R is the ratio of gene number from A. thaliana to that from A. trichopoda. .
Figure 1Distribution of LRR-RLK genes on . The scaffold numbers are given at the top of each scaffold, and genes probably derived from tandem duplication are highlight with blue and in blue boxes.
Figure 2Phylogenetic tree of LRR-RLK genes. (A) Maximum likelihood tree of LRR-RLK genes in Amborella trichopoda. (B) Maximum likelihood tree of LRR-RLK genes in Amborella trichopoda and Arabidopsis thaliana. Bootstrap values of major clades are shown above branches. The full phylogeny is shown in Supplemental Figure 1.
Figure 3ML tree of 94 LRR-RLK proteins from ML tree of 94 LRR-RLK proteins from Amborella trichopoda. The full names of each LRR-RLK protein are shown in Supplemental Table 1. The subfamily names are shown on the left. (B) Protein structures of LRR-RLK proteins. (C) MEME motif distribution of the kinase domain of each protein. (D) Gene structure of each LRR-RLK protein. The green boxes represent exons, lines represent introns, and each line with double slash indicates a long intron.
Figure 4Conserved motifs in kinase domain of LRR-RLK proteins and their consensus sequences. CON indicates consensus sequence. If the bits value of amino acid at this position is smaller than 1, it is represent with x; 2 > bits ≥ 1, with lowercase; 3 > bits ≥ 2, with capital letter; bits ≥ 3, with bold capital.
Codeml site-models parameter values and likelihood scores estimated for each individual selected subfamilies.
| I | ω0 = 0.3743 | −11330.92 |
| II | ω0 = 0.0920 | −11063.62 |
| III | ω0 = 0.2292 | −40462.98 |
| VIII_1 | ω0 = 0.2949 | −11381.07 |
| XII | ω0 = 0.2182 | −23228.48 |
| I | (0 < ω0 <1, ω0 = 0.1112) | −6245.62 |
| II | (0 < ω0 <1, ω0 = 0.0807) | −10906.03 |
| III | (0 < ω0 <1, ω0 = 0.1643) | −39806.09 |
| VIII_1 | (0 < ω0 <1, ω0 = 0.0727) | −11239.85 |
| XII | (0 < ω0 <1, ω0 = 0.1336) | −22876.97 |
| I | (0 < ω0 <1, ω0 = 0.1112) | −6245.62 |
| II | (0 < ω0 <1, ω0 = 0.0807) | −10906.03 |
| III | (0 < ω0 <1, ω0 = 0.1643) | −39806.06 |
| VIII_1 | (0 < ω0 <1, ω0 = 0.0727) | −11239.85 |
| XII | (0 < ω0 <1, ω0 = 0.1336) | −22876.97 |
| I | (ω0 = 0) | −6236.24 |
| II | (ω0 = 0) | −10795.06 |
| III | (ω0 = 0.0437) | −39489.33 |
| VIII_1 | (ω0 = 0.0157) | −11218.68 |
| XII | (ω0 = 0.0140) | −22754.11 |
| I | −6238.20 | |
| II | −10794.58 | |
| III | −39479.63 | |
| VIII_1 | −11221.78 | |
| XII | −22753.97 | |
| I | −6237.41 | |
| II | −10794.53 | |
| III | −39479.63 | |
| VIII_1 | −11219.10 | |
| XII | ||
Likelihood ratio test of positive selection in LRR-RLK subfamily proteins.
| I | 306.86551 | 0 | 5.479124 |
| II | 537.108832 | 0 | 0.101476 |
| III | 1947.3058 | 0.69886 | 0.00078 |
| VIII_1 | 324.776964 | 0 | 5.357916 |
| XII | 948.721848 | 0 | 0.353648 |
indicates significant at p < 0.001 level.
Figure 5Transcript abundance of 94 . The genes were grouped according to subfamily and the color scale represents the expression values. AMYL indicates apical meristem and young leaves.