| Literature DB >> 34698114 |
Wei Cheng1, Zhoutao Wang1, Fu Xu1, Waqar Ahmad1, Guilong Lu1, Yachun Su1, Liping Xu1.
Abstract
The leucine-rich repeat receptor-like protein kinase (LRR-RLK) gene family is the largest family of the receptor-like protein kinases (RLKs) superfamily in higher plants, which is involved in regulating the plant growth and development, stress responses, signal transduction and so on. However, no comprehensive analyses of LRR-RLKs have been reported in sugarcane. Here, we performed a comprehensive analysis of the LRR-RLK gene family in sugarcane ancestor species Saccharum spontaneum. A total of 437 LRR-RLK genes were identified and categorized into 14 groups based on a maximum likelihood phylogenetic tree. The chromosome location showed an uneven distribution on all 32 chromosomes in sugarcane. Subsequently, the exon-intron organization structure and conserved motif arrangement were relatively conserved among the same groups or subgroups and between Arabidopsis and S. spontaneum genomes. Furthermore, the promoter sequences analyses showed that sugarcane LRR-RLK genes (SsLRR-RLKs) were strongly regulated by various environmental stimuli, phytohormonal factors and transcription factors (TFs). Eventually, the expression profiles of SsLRR-RLK genes at different stresses were analyzed based on RNA-seq data, suggesting their potential roles in the regulation of sugarcane responses to diverse abiotic and biotic stress. Overall, the findings provide insight into the potential functional roles and lay the foundation for further functional study.Entities:
Keywords: LRR-RLK gene family; Saccharum; bioinformatics; gene expression analysis; molecular evolution; stress response
Mesh:
Substances:
Year: 2021 PMID: 34698114 PMCID: PMC8929030 DOI: 10.3390/cimb43030116
Source DB: PubMed Journal: Curr Issues Mol Biol ISSN: 1467-3037 Impact factor: 2.976
LRR-RLK proteins in Saccharum spontaneum and their key features.
| Groups | Subgroups | No. of Genes | Amino Acid Length | With Signal PEPTIDE (%) | Molecular Weight (kDa) | Isoelectric Point |
|---|---|---|---|---|---|---|
| I | a | 12 | 521–1081 | 58.3 | 53.79–117.29 | 5.70–8.57 |
| b | 0 | - | - | - | - | |
| II | 13 | 473–982 | 69.2 | 52.04–105.55 | 5.45–10.63 | |
| III | 75 | 358–1775 | 84.0 | 39.00–189.90 | 5.55–10.49 | |
| IV | 10 | 529–977 | 80.0 | 55.54–108.99 | 6.83–9.07 | |
| V | 15 | 553–1403 | 80.0 | 59.12–153.82 | 5.22–8.69 | |
| VI | a | 8 | 692–771 | 100 | 75.81–82.38 | 6.40–8.87 |
| b | 8 | 572–1070 | 87.5 | 63.67–118.29 | 5.16–6.66 | |
| VII | a | 14 | 925–1930 | 92.9 | 98.09–209.91 | 5.21–7.14 |
| b | 5 | 649–714 | 100 | 69.08–76.10 | 7.25–9.56 | |
| c | 6 | 436–1095 | 33.3 | 48.33–115.15 | 5.54–8.97 | |
| VIII | 6 | 549–1000 | 50.0 | 60.55–110.28 | 5.67–8.77 | |
| IX | 0 | - | - | - | - | |
| X | a | 5 | 560–849 | 80.0 | 61.52–91.70 | 6.44–9.90 |
| b | 59 | 474–1830 | 59.3 | 52.62–197.90 | 5.06–7.25 | |
| c | 3 | 973–983 | 100 | 104.62–106.10 | 5.74–6.18 | |
| XI | a | 124 | 471–2678 | 74.2 | 51.03–292.02 | 5.17–9.50 |
| b | 1 | 3171 | 100 | 349.15 | 5.78 | |
| XII | 51 | 566–2005 | 56.9 | 61.30–213.89 | 5.26–9.56 | |
| XIII | a | 1 | 508 | 100 | 56.19 | 5.60 |
| b | 2 | 958–986 | 100 | 103.67–106.54 | 5.66–5.96 | |
| XIV | 6 | 661–992 | 83.3 | 70.94–107.52 | 8.27–9.78 | |
| XV | 13 | 713–1209 | 92.3 | 76.83–128.91 | 6.02–9.09 |
Note: “-” means no corresponding information found.
Figure 1Chromosome distribution of LRR-RLK family in S. spontaneum. 437 SsLRR-RLK alleles were located in all 32 chromosomes. The figure was drawn using MG2C v2.0.
Figure 2Phylogenetic tree of the LRR-RLK genes between Arabidopsis and S. spontaneum. The phylogenetic tree was generated using FastTree 2.0 using the maximum likelihood method. All sugarcane LRR-RLKs were classified into 14 distinct groups and different groups were shown by different color.
Figure 3Representative exon–intron (A) and conserved motifs structure (B) of each S. spontaneum LRR-RLK subfamily. (A) Exon–intron structures of SsLRR-RLKs. Untranslated regions (UTRs), exons and introns are represented by blue boxes, red boxes and black lines, respectively. (B) The motif distribution of SsLRR-RLK proteins. The 15 predicted motifs are represented by distinct color boxes using the MEME program.
Statistics analysis of cis-acting elements detected in promoter of SsLRR-RLK genes.
| Types | Functional Classification | Element Species (ID of PlantCARE) | No. of Elements |
|---|---|---|---|
| 1 | Light responsive elements | GATT-motif, MRE, 3-AF1 binding site, Sp1, CAG-motif, GA-motif, 3-AF3 binding site, chs-CMA1a/2a/2b, Gap-box, LS7, I-box, 4cl-CMA1b/2b, AAAC-motif, ACA-motif, ACE, GT1-motif, AE-box, TCCC-motif, AT1-motif, Pc-CMA2a/2c, ATC-motif, ATCT-motif, Box4, Box II, chs-Unit, LAMP-element, GATA-motif, G-Box, GTGGC-motif, L-box, sbp-SMA1c, TCT-motif | 36 |
| 2 | Hormone responsive elements | AuxRR-core, P-box, TGA-box, ERE, ABRE, ABRE2, JERE, TCA-element, ABRE3a, TATC-box, ABRE4, AT-ABRE, AuxRE, CARE, CGTCA-motif, SARE, TGACG-motif, TGA-element, GARE-motif | 19 |
| 3 | Environmental stress-related elements | MBS, ACTCATCCT-sequence, LTR, AP-1, STRE, MYB recognition site, ARE, as-1, box-S, DRE, MYC, DRE core, DRE1, W box, GC-motif, MYB-like sequence, MYB, WRE3, WUN-motif, TC-rich repeats | 20 |
| 4 | Development-related elements | GCN4_motif, AACA_motif, CCGTCC-motif, AC-I, CCGTCC-box, circadian, dOCT, E2Fb, HD-Zip 1, MSA-like, motif I, Myb-binding site, NON, NON-box, AC-II, O2-site, OCT, RY-element, re2f-1, CAT-box, telo-box | 21 |
| 5 | Promoter-related elements | A-box, AT-TATA-box, Box II-like sequence, CAAT-box, HMG-TATA-region, TATA, TATA-box | 7 |
| 6 | Binding site elements | AT-rich element, HD-Zip 3, AT-rich sequence, MBSI, BOX III, CCAAT-box | 6 |
| 7 | Other no functional description elements | AAGAA-motif, CTAG-motif, F-box, GC-repeat, GRA, H-box, plant_AP-2-like, TCA, Y-box, Unnamed_1/2/3/5/6/8/10/12/14/16 | 19 |
Figure 4Heatmaps of the expression analysis of SsLRR-RLKs in response to diverse abiotic stress. The differential expression profiles in two different sugarcane varieties is shown by heatmap under (A,B) low nitrogen (leaf) stress, (C,D) low nitrogen (root) stress and (E,F) cold stress. Heatmap was generated based on FPKM values. The color scale represents expression values of each sample, with red showing high levels and green showing low levels of transcript abundance.
Figure 5Heatmaps of the expression analysis of SsLRR-RLKs under different biotic stresses. The differential expression patterns in two different sugarcane varieties were shown by heatmap under (A,B) infection by sugarcane leaf blight disease and (C,D) infection by sugarcane smut. Heatmap was generated based on FPKM values. The color scale represents expression values of each sample, with red showing high levels and green showing low levels of transcript abundance.
Figure 6Relative expression levels of nine selected LRR-RLK genes were analyzed using quantitative real-time PCR (qRT-PCR). Among them, three obvious periods were discovered, namely the early, medium and late periods. (A) contains three highly expressed genes that respond to early period (SsLRR-RLK88/146/346) that were identified. (B) contains three highly expressed genes that respond to medium period (SsLRR-RLK19/24/391) that were identified. (C) contains three highly expressed genes that respond to late period (SsLRR-RLK203/249/359) that were identified. X-axes show different stages and y-axes show relative expression level of each LRR-RLK gene. Bars represent the standard deviation (SD) (n = 3) of three technical replicates. Different lowercase letters above the bars in the figure indicate a significant difference (p < 0.05). Lowercase letters reflect the 5% significance level. As shown in the chart, a, b, c, d, e and f are different from each other, and they are all significantly different from each other. In addition, the same lowercase letter means no significantly different.