| Literature DB >> 30501109 |
Ruibin Sun1, Shaohui Wang2, Dan Ma3, Chuanliang Liu4,5.
Abstract
Leucine-rich repeat receptor-like kinases (LRR-RLKs) have been reported to play important roles in plant growth, development, and stress responses. However, no comprehensive analysis of this family has been performed in cotton (Gossypium spp.), which is an important economic crop that suffers various stresses in growth and development. Here we conducted a comprehensive analysis of LRR-RLK family in four Gossypium species (Gossypium arboreum, Gossypium barbadense, Gossypium hirsutum, and Gossypium raimondii). A total of 1641 LRR-RLK genes were identified in the four Gossypium species involved in our study. The maximum-likelihood phylogenetic tree revealed that all the LRR-RLK genes were divided into 21 subgroups. Exon-intron organization structure of LRR-RLK genes kept relatively conserved within subfamilies and between Arabidopsis and Gossypium genomes. Notably, subfamilies XI and XII were found dramatically expanded in Gossypium species. Tandem duplication acted as an important mechanism in expansion of the Gossypium LRR-RLK gene family. Functional analysis suggested that Gossypium LRR-RLK genes were enriched for plant hormone signaling and plant-pathogen interaction pathways. Promoter analysis revealed that Gossypium LRR-RLK genes were extensively regulated by transcription factors (TFs), phytohormonal, and various environmental stimuli. Expression profiling showed that Gossypium LRR-RLK genes were widely involved in stress defense and diverse developmental processes including cotton fiber development and provides insight into potential functional divergence within and among subfamilies. Our study provided valuable information for further functional study of Gossypium LRR-RLK genes.Entities:
Keywords: Gossypium; LRR-RLK family; expansion; gene expression profile; phylogenetic analysis; stress defense
Year: 2018 PMID: 30501109 PMCID: PMC6316826 DOI: 10.3390/genes9120592
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Phylogenetic tree of leucine-rich repeat receptor-like kinases (LRR-RLK) genes from four Gossypium species and Arabidospsis thaliana. The phylogenetic tree was constructed by maximum likelihood (ML) method based on kinase domain amino acid sequences of LRR-RLKs. All LRR-RLK genes were divided into 21 distinct clades, marked by bold curves with different colors. LRR-RLKs from A. thaliana, Gossypium arboreum, Gossypium barbadense, Gossypium hirsutum, and Gossypium raimondii were represented by branches colored within green, red, yellow, purple, and blue, respectively.
Statistics of A. thaliana and Gossypium LRR-RLK gene distribution among different subfamilies. For both A. thaliana and four Gossypium species, the number of LRR-RLK genes belonging to each subfamily were counted respectively. The corresponding percentage in all LRR-RLK genes from specific species was computed and indicated in brackets.
| Subfamily | Number (Percentage) of Genes in Each Subfamily | ||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
| I | 41 (19.2%) | 4 (1.3%) | 6 (1.2%) | 7 (1.4%) | 3 (0.9%) |
| II | 14 (6.6%) | 17 (5.7%) | 23 (4.5%) | 29 (5.6%) | 18 (5.7%) |
| III | 41 (19.2%) | 45 (15.1%) | 68 (13.3%) | 83 (16.1%) | 46 (14.5%) |
| IV | 3 (1.4%) | 5 (1.7%) | 10 (2.0%) | 10 (1.9%) | 5 (1.6%) |
| V | 9 (4.2%) | 10 (3.4%) | 15 (2.9%) | 19 (3.7%) | 9 (2.8%) |
| VI-1 | 5 (2.3%) | 7 (2.3%) | 11 (2.2%) | 14 (2.7%) | 7 (2.2%) |
| VI-2 | 4 (1.9%) | 2 (0.7%) | 2 (0.4%) | 6 (1.2%) | 3 (0.9%) |
| VII-1 | 2 (0.9%) | 4 (1.3%) | 11 (2.2%) | 7 (1.4%) | 4 (1.3%) |
| VII-2 | 5 (2.3%) | 6 (2.0%) | 6 (1.2%) | 12 (2.3%) | 6 (1.9%) |
| VIII-1 | 8 (3.8%) | 4 (1.3%) | 7 (1.4%) | 7 (1.4%) | 4 (1.3%) |
| VIII-2 | 12 (5.6%) | 17 (5.7%) | 26 (5.1%) | 30 (5.8%) | 17 (5.4%) |
| IX | 4 (1.9%) | 12 (4.0%) | 18 (3.5%) | 19 (3.7%) | 11 (3.5%) |
| X | 15 (7.0%) | 18 (6.0%) | 32 (6.3%) | 35 (6.8%) | 18 (5.7%) |
| XI-1 | 28 (13.1%) | 75 (25.2%) | 135 (26.4%) | 132 (25.6%) | 81 (25.6%) |
| XI-2 | 2 (0.9%) | 3 (1.0%) | 3 (0.6%) | 7 (1.4%) | 3 (0.9%) |
| XI-3 | 2 (0.9%) | 1 (0.3%) | 1 (0.2%) | 2 (0.4%) | 1 (0.3%) |
| XII | 7 (3.3%) | 49 (16.4%) | 102 (20.0%) | 61 (11.8%) | 63 (19.9%) |
| XIII-1 | 3 (1.4%) | 3 (1.0%) | 6 (1.2%) | 5 (1.0%) | 2 (0.6%) |
| XIII-2 | 3 (1.4%) | 3 (1.0%) | 6 (1.2%) | 6 (1.2%) | 3 (0.9%) |
| XIV | 2 (0.9%) | 5 (1.7%) | 10 (2.0%) | 9 (1.7%) | 5 (1.6%) |
| XV | 3 (1.4%) | 8 (2.7%) | 13 (2.5%) | 15 (2.9%) | 8 (2.5%) |
Figure 2Exon-intron structures of representative LRR-RLK genes of each subfamily from four Gossypium species and A. thaliana. LRR (leucine-rich repeat), KD (kinase catalytic domain), and TM (transmembrane) domain coding regions were marked on exons by different colored rectangles. Based on the exon-intron structures of KD domain, Gossypium LRR-RLK genes were classified into Group A (A), Group B (B), and Group C (C). The number of introns in ECD (extracellular domain) and KD domain were counted and then average values were computed for each subfamily (D). The error bar on the column represented standard deviation of the subfamily.
Figure 3Comparison of LRR-RLK gene distribution among different subfamilies between A. thaliana and Gossypium. For each species, all the LRR-RLKs were divided into 21 subfamilies, represented by rectangles within different colors. The area of each rectangle within a specific color represented the proportion of the corresponding subfamily.
Number of identified tandem duplicated LRR-RLK genes among different subfamilies in four Gossypium species. Number of tandem duplicated gene sets in each subfamily were indicated in brackets.
| Subfamily | Number of Tandem Duplicated Genes (Gene Sets) | |||
|---|---|---|---|---|
|
|
|
|
| |
| II | 5(2) | 2(1) | 5(2) | 5(2) |
| III | 2(1) | 4(2) | 4(2) | 2(1) |
| IX | - | 2(1) | - | - |
| VII-1 | 4(2) | - | - | |
| VIII-2 | 10(4) | 11(4) | 14(6) | 11(4) |
| XI-1 | 31(11) | 59(19) | 42(16) | 35(9) |
| XI-2 | - | - | 2(1) | - |
| XII | 31(8) | 64(18) | 25(6) | 43(9) |
| Total | 79(26) | 146(47) | 92(33) | 96(25) |
Figure 4KEGG (Kyoto Encyclopedia of Genes and Genomes) pathway enrichment result of Gossypium LRR-RLK genes. Results of G. arboreum, G. barbadense, G. hirsutum, and G. raimondii were shown by (A–D) respectively.
Statistics of cis-acting regulatory elements detected in promoter regions of Gossypium LRR-RLK genes. (*cis-acting regulatory elements that have no functional description were not shown, see Table S3 for details.).
| Element Species | Number (Percentage) of Elements in Promoters of | ||||
|---|---|---|---|---|---|
|
|
|
|
|
| |
| 213(100.0%) | 298(100.0%) | 506(99.0%) | 514(99.8%) | 317(100.0%) | |
| 206(96.7%) | 286(96.0%) | 469(91.8%) | 492(95.5%) | 293(92.4%) | |
| 212(99.5%) | 291(97.7%) | 487(95.3%) | 506(98.3%) | 303(95.6%) | |
| 2(0.9%) | - | - | - | 3(0.9%) | |
| 69(32.4%) | 79(26.5%) | 150(29.4%) | 168(32.6%) | 99(31.2%) | |
| 89(41.8%) | 202(67.8%) | 337(65.9%) | 347(67.4%) | 181(57.1%) | |
| 183(85.9%) | 238(79.9%) | 387(75.7%) | 405(78.6%) | 237(74.8%) | |
| 4(1.9%) | 17(5.7%) | 22(4.3%) | 27(5.2%) | 17(5.4%) | |
| 89(41.8%) | 110(36.9%) | 167(32.7%) | 181(35.1%) | 111(35.0%) | |
| 166(77.9%) | 241(80.9%) | 407(79.6%) | 425(82.5%) | 255(80.4%) | |
| 213(100.0%) | 298(100.0%) | 503(98.4%) | 512(99.4%) | 314(99.1%) | |
| 150(70.4%) | 160(53.7%) | 268(52.4%) | 299(58.1%) | 182(57.4%) | |
| 105(49.3%) | 87(29.2%) | 142(27.8%) | 137(26.6%) | 104(32.8%) | |
| 94(44.1%) | 226(75.8%) | 354(69.3%) | 370(71.8%) | 248(78.2%) | |
| 14(6.6%) | 19(6.4%) | 39(7.6%) | 40(7.8%) | 22(6.9%) | |
| 99(46.5%) | 128(43.0%) | 199(38.9%) | 225(43.7%) | 121(38.2%) | |
| 142(66.7%) | 151(50.7%) | 217(42.5%) | 230(44.7%) | 153(48.3%) | |
| 73(34.3%) | 133(44.6%) | 191(37.4%) | 219(42.5%) | 114(36.0%) | |
| 4(1.9%) | - | 1(0.2%) | - | - | |
Figure 5Statistics of Gossypium LRR-RLK genes regulated by different families of transcription factors (TFs) (genes with TF binding sites were considered to be regulated by TFs).
Figure 6Expression patterns of G. hirsutum LRR-RLK genes in fiber and ovule development. Genes were clustered by the k-means method.
Figure 7Expression patterns of G. hirsutum LRR-RLK genes in response to diverse abiotic stresses (cold, hot, drought simulated by polyethylene glycol (PEG), and salt). Hierarchical clustering analysis classified genes into four distinct groups (colored by blue, green, orange, and red, respectively).
Figure 8Expression patterns of differentially expressed LRR-RLK genes in response to Verticillium dahlia infection. For both G. barbadense (A) and G. hirsutum (B), significant differentially expressed LRR-RLK genes simultaneously detected in both samples infected with V. dahliae strain V991 (highly toxic) and samples infected with V. dahliae strain D07038 (intermediately toxic) were used for heatmap drawings for each subfamily.