Literature DB >> 35011598

Analyses of Lysin-motif Receptor-like Kinase (LysM-RLK) Gene Family in Allotetraploid Brassica napus L. and Its Progenitor Species: An In Silico Study.

Amin Abedi1, Zahra Hajiahmadi1, Mojtaba Kordrostami2, Qassim Esmaeel3, Cédric Jacquard3.   

Abstract

The LysM receptor-like kinases (LysM-RLKs) play a crucial role in plant symbiosis and response to environmental stresses. Brassica napus, B. rapa, and B. oleracea are utilized as valuable vegetables. Different biotic and abiotic stressors affect these crops, resulting in yield losses. Therefore, genome-wide analysis of the LysM-RLK gene family was conducted. From the genome of the examined species, 33 LysM-RLK have been found. The conserved domains of Brassica LysM-RLKs were divided into three groups: LYK, LYP, and LysMn. In the BrassicaLysM-RLK gene family, only segmental duplication has occurred. The Ka/Ks ratio for the duplicated pair of genes was less than one indicating that the genes' function had not changed over time. The BrassicaLysM-RLKs contain 70 cis-elements, indicating that they are involved in stress response. 39 miRNA molecules were responsible for the post-transcriptional regulation of 12 Brassica LysM-RLKs. A total of 22 SSR loci were discovered in 16 Brassica LysM-RLKs. According to RNA-seq data, the highest expression in response to biotic stresses was related to BnLYP6. According to the docking simulations, several residues in the active sites of BnLYP6 are in direct contact with the docked chitin and could be useful in future studies to develop pathogen-resistant B. napus. This research reveals comprehensive information that could lead to the identification of potential genes for Brassica species genetic manipulation.

Entities:  

Keywords:  bioinformatics; codon usage bias; expression pattern; in silico study; modeling; molecular docking

Mesh:

Substances:

Year:  2021        PMID: 35011598      PMCID: PMC8750388          DOI: 10.3390/cells11010037

Source DB:  PubMed          Journal:  Cells        ISSN: 2073-4409            Impact factor:   6.600


1. Introduction

Because plants are stationary, they are subjected to a variety of biotic and abiotic stresses throughout their lives. Plants have been developed their immune strategies using signal transduction from the site of infection [1]. Immune receptors are used by plants to detect and respond to invading pathogens [2]. Plants’ immune receptors are classified as either nucleotide-binding leucine-rich repeat receptors (NLR) or pattern recognition receptors (PRRs) [3]. Accordingly, NLR and PRR receptors are known as resistance gene analogs (RGAs). NLRs play a major role in plant disease resistance and are also known as resistance genes (R genes) [4]. PRRs are the main line of defense against infections. They are located on the cell membrane and belong to a receptor kinase family. They contain an intracellular kinase domain, a transmembrane domain, as well as an extracellular ligand-binding domain [5]. The extracellular domain recognizes molecular-associated molecular patterns (MAMPs). MAMPs are cell envelope components such as lipopolysaccharide (LPS), flagellin, chitin, β-glucans, peptidoglycan, and ergosterols [2,6]. Proteins containing Lysin-motif (LysM) are PRRs that recognize MAMPs [3,7]. Plant PRRs are classified into two groups: receptor-like proteins (RLPs) and receptor-like kinases (RLKs). RLKs contain an intracellular kinase domain, a transmembrane domain, and also an extracellular ligand-binding domain that is involved in signal transduction while RLPs lack intracellular regions [8]. Plant RLKs have a similar structure to animal receptor tyrosine kinases. Several extracellular domains have been discovered in plants, but none have been identified in animals [9]. In their extracellular regions, plant RLKs exhibit a diverse set of domains [10]. Plant RLKs can be divided into 14 groups, which include wall-associated kinase (WAK), receptor-like kinase in flowers (RKF), CRINKLY-like (CR-like), Catharanthus roseus like (CrRLK), the domain of unknown function 26 (DUF26), lectin (C-Lectin and L-Lectin), leucine-rich repeat (LRR), lysin motif (LysM), an extension like proline-rich extensin like (PERK), leaf rust kinase-like (LRK), thaumatin, self-incompatibility domain (S-domain), and unknown receptor kinase (URK) with various functions [9]. Few RLKs are known to have crucial roles in plant defense such as lysin motif- receptor-like kinases (LysM-RLKs), one of the most important groups implicated in plant defense response. LysM has a highly conserved βααβ secondary structure containing about 50 amino acids that bind to chitin and peptidoglycans [11]. It was first discovered in lysozyme [12] of a bacteriophage and later shown to be present in a wide range of eukaryotes and prokaryotes [13,14]. LysM family contains LysM-containing receptor-like kinase (LYK) and LysM-containing receptor-like proteins (LYP) which are widely distributed in the plant kingdom [15,16]. LYKs are important in plant-pathogen interactions because they activate the immune system by sensing pathogen entrance into the host cell [17]. The LysM motif is found in the extracellular region of LYKs [18]. LYKs are essential mediators of innate immunity against pathogens in a broad range of plant species. Investigation of the properties of the plant LysM showed that plant genomes have at least 11 distinct types of this motif, which are highly diverse in plants and at least six types of LysM motif in LysM kinase proteins and five other types in non-kinase LysM proteins have been identified [19]. The first receptor protein (Oryza sativa chitin elicitor-binding protein- OsCEBiP) was identified in rice which contains two extracellular LysM domains and a transmembrane LysM domain. Silencing of OsCEBiP prevented the plant from responding to chitin [15]. Wheat and barley have orthologs of OsCEBIP that are engaged in the plant defence system. The Mycosphaerella graminicola pathogen caused disease symptoms in wheat lines that were knocked down for TaCEBIP (Triticum aestivum chitin elicitor binding protein) using the virus-induced gene silencing (VIGS) approach [20]. In the barley lines knocked down for HvCEBIP (Hordeum vulgar chitin elicitor binding protein) via the VIGS, increased lesions owing to Magnaporthe oryzae infection were also observed [21]. Similarly, OsCERK1 (Oryza sativa chitin elicitor receptor kinase 1) is a LysM receptor-like protein kinase with three lysine motifs and a kinase domain that is required for signal transduction in rice. However, OsCEBiP is the main chitin-binding protein that uses OsCERK1 to activate the chitin-stimulated immune response [22,23,24]. OsCERK1 transmits the chitin signal to the cell, which has been sensed by OsCEBiP. CERK1 directly binds to chitin by three extracellular LysM domains without any other proteins indicating that it is the main chitin-binding protein in plants [25]. The activation of Mitogen-activated protein kinase (MAPK), the formation of reactive oxygen species (ROS), and the expression of defense genes are all part of the immunological response caused by fungal chitin [26]. According to certain studies, AtCERK (Arabidopsis thaliana chitin elicitor receptor kinase) has a dual role in biotic and abiotic stress signaling [27]. In Arabidopsis, Atlyk1 and Atlyk4 bind to chitin and transmit extracellular signals to the cell, activating downstream pathways of disease resistance [28]. AtLYM1 and AtLYM3 are similar to OsCEBiP and have a role in binding to peptidoglycans [29]. AtLYM2 has been reported to increase plant resistance against Botrytis cinerea and Alternaria brassicicola [30,31]. VvLYK1-1 and VvLYK1-2 in grapevine are involved in immunity induced by chitin and chitosan, and hence VvLYK1-1 is implicated in Erysiphe necator resistance [32]. In addition to the plant immune response, LysM-RLK genes are also involved in the plant and arbuscular mycorrhizal (AM) fungi interaction. Studies on chickpeas have shown that PsLYK9 is directly involved in the perception of long- and short-chain oligosaccharides in the hydrolyzed cell walls of the fungus and plays an important role in the immune response of chickpeas to fungal pathogens. PsLYK9, on the other hand, is involved in the symbiosis development of AM fungi, so that silencing PsLYK9 reduces levels of colonization by mycorrhizal fungi [33]. In tomatoes, SlLYK1 and SlLYK13 are involved in the chitin-induced immune response and cell death, respectively. While SlLYK10 and SlLYK13 participate in the regulation of AM symbiosis [34,35]. Brassica napus L. (2n = 4x = 38) is one of the most important allopolyploid oilseed crops derived from the hybridization of B. oleracea (2n = 2x = 18), and B. rapa (2n = 2x = 20). These species are used as important vegetables in the form of sauces, oil, and fodder, among many other items. Their crops are subjected to biotic and abiotic stressors throughout their life cycle, resulting in yield reductions. Alternaria spp., Fusarium oxysporum, Albuga candia, and Leptosphaeria maculans are the most prevalent fungal diseases that affect Brassica [36]. Brassica crops are susceptible to bacterial rot disease as well [37]. Similarly, salinity and drought among abiotic stresses have the highest impact on Brassica yield [38]. To boost crop productivity, stress-tolerant Brassica can be developed through genetic engineering. In Arabidopsis, rice, lotus, sweet orange, potato, wheat, Chinese white pear, as well as other crops, the Lysin-Motif Receptor-Like Kinase (LysM-RLK) family has been explored [8,17,39,40,41,42,43,44]. However, no genome-wide association studies in B. napus have been conducted to date. As a result, functional investigations of LysM-RLKs in Brassica have been investigated using bioinformatics techniques, given the importance of Brassica and the decrease of its products owing to biotic and abiotic stressors.

2. Materials and Methods

2.1. In Silico Identification of LysM-RLK Genes

The HMM profile of the LysM domain (Pfam ID: PF01476) was retrieved from the Pfam database [45], and the HmmerSearch tool [46] was used to determine Brassica LysM-RLK proteins in the Ensembl Plants database to determine the LysM-RLK gene family in B. napus, B. oleracea, and B. rapa. Three domains including F-box-like (PF12937), protein tyrosine kinase (PF07714), and protein kinase domain (PF00069) were identified. The default parameters include significance E-values of 0.01 for sequence and 0.03 for hit matches, as well as reporting E-values of 1 for both sequences and hit. The ProtParam tool of the ExPASY bioinformatics resource portal [47] was used to compute the molecular weight, length, and theoretical isoelectric points of Brassica LysM-RLK. CELLO and DeepLoc were used to predict protein cellular localization [48,49].

2.2. Phylogenetic Relationships of Brassica LysM-RLK Gene Family

To study the evolutionary relationships of the LysM-RLK gene family, full-length protein sequence alignments of B. napus (Bn), B. oleracea (Bo), B. rapa (Br), Arabidopsis thaliana (At), Brachypodium distachyon (Bd), Oryza sativa (Os), and Vitis vinifera (Vv) were performed using ClustalX 2.0.8 software. The Neighbor-joining (NJ) method with 1000 bootstraps was used to generate a phylogenetic tree of LysM-RLK proteins using MEGA 7 [50] and the p-distance model [51].

2.3. Investigation of Chromosome Localization, Gene Duplication, and Selection Pressure of LysM-RLK Members

The coding sequence (CDS) of the examined genes was retrieved from the Ensemble Plants database using the biomart program [52] to investigate duplication and selection pressure. Tandem duplication is defined as the duplication of genes on the same chromosome separated by no more than 10 genes [53]. Similarly, two criteria were utilized to detect segmental duplication: the aligned region’s identity had to be larger than 90% and the alignment coverage had to be greater than 90% [54]. Synonymous (Ks) and non-synonymous (Ka) substitution rates were evaluated using DnaSP ver. 5 software [55] to establish the kind of selection pressure. TBtools [56] was used to determine the location of genes on chromosomes and the duplication relationship among them.

2.4. Exon-Intron Structure and Conserved Motifs of BLysM-RLK

The Multiple Em for Motif Elicitation (MEME 5.0.5) algorithm was used to find specific LysM-RLK gene motifs [57]. Twenty motifs with a minimum and maximum length of motifs 6 and two hundred amino acids have been considered. These findings were shown using the TBtools software [56]. The GFF3 file linked to the three Brassica species was retrieved from the Ensemble Plants database and the appropriate analyses were done using TBtools software [56] to illustrate the exon-intron structure of the examined genes.

2.5. The Prediction of Cis-Regulatory Elements, Simple Sequence Repeats (SSR) Markers, and BLysM-RLK-Targeted miRNAs

PlantCare [58] was used to identify cis-acting regulatory elements of 1500 bp upstream of the initiation codon (ATG) of the LysM-RLK genes from the Ensemble Plants database [52]. SSR markers in BLysM-RLK genes were discovered using the BatchPrimer3v1.0 server [59]. CD sequences of them were evaluated in the psRNATarget database using default parameters to detect in BLysM-RLK-targeted miRNAs.

2.6. Codon Usage Bias Analysis

CodonW 1.4.2 was used to analyze the sequences for frequency of optimal codons (FOP), codon adaptation index (CAI), GC content, effective codon number (ENC), GC content at the third site position of a codon (GC3s), and relative synonymous codon usage (RSCU) for Brassica LysM-RLK [60]. The statistical analysis was carried out using Excel software.

2.7. RNA-Seq Analysis of Brassica LysM-RLK Genes

The transcript data for flower, leaf, root, silique, and stem tissues as well as dehydration stress at 1 and 8 h after treatment and ABA (25 M), cold (4 °C), and salinity (200 mM), stresses at 4 and 24 h after treatment were related to the study of Zhang et al. [61] with the project ID CRA001775 [62]. FastQC software [48] was used for the initial quality analysis on FastQ files, and then the raw sequence data was preprocessed and adapter sequences, low-quality reads, and duplicate mapping reads were filtered using Trimmomatic on Linux [49]. The preprocessed FastQ files were aligned to the Brassica napus reference genome using STAR [50]. The counts obtained from STAR normalized to transcript per million (TPM). Log2 (TPM + 1) used to generate the heatmap utilizing TBtools [63]. Clustering the data was performed using the Pearson correlation coincident and the complete linkage method. Similarly, the BrassicaEDB database was used to study the expression of BnLysM-RLK genes in response to fungal infections such as Leptosphaeria maculans and Sclerotinia sclerotiorum. Expression data related to the Leptosphaeria Maculans inoculation is available in the NCBI with the project ID number PRJNA311316. In total, they sequenced 36 samples (18 from the resistant (LepR1) genotype and 18 from the susceptible genotype (Westar)). Samples were collected 0, 3, 7, and 11 days post-inoculation in triplicate. RNA-seq data with accession number PRJNA274853 publicly available on the NCBI SRA database were mined and analyzed for expression patterns of the rapeseed LysM-RLK genes in response to S. sclerotiorum infection. The experiment consisted of 24 samples containing susceptible (J902) and resistant (J964) genotypes and was sampled at 24, 48, and 96 h after treatment with three biological replications.

2.8. Structural Modeling and Validation

Iterative template-based fragment assembly simulations were used to create the full-length atomic structures of BnLYP6 proteins to forecast protein structures on the I-TASSER server [64]. The top models from I-TASSER were refined using the ModRefinder software [65]. Ramachandran plot has been applied to confirm the predicted structures by measuring the backbone dihedral phi (ϕ) and psi (Ψ) angles with the PROCHECK module of the PDBSum server [66].

2.9. Molecular Docking

The chitin ligand structure was retrieved from the PubChem database [67] and converted to PDB format using Discovery Studio software. An improved version of the COACH server (COACH-D) was utilized to discover protein-ligand interaction sites [68]. To suggest protein-ligand binding sites, the aforementioned server employs five approaches, four of which are template-based, including TM-SITE [69], COFACTOR [70], and FINDSITE [71] while the last method (ConCavity) is based on structure [72]. The results of each approach were then combined using the COACH algorithm [69]. The ligand-enzyme interaction was studied using AutoDock v4.2.6 [73]. The Auto Grid application, which was created with AutoDock, was used to create grid maps. The grid box sizes for x, y, and z were set to 82, 90, and 120, respectively. The grid centers for x, y, and z were set at 73.866, 76.789, and 68.402, respectively, with a grid spacing of 0.375. To find the best conformers, the Lamarckian Genetic Algorithm (LGA) was used. During the docking process, a limit of 200 conformers was considered for the ligand. The default AutoDock4 parameters were used for the majority of docking processes [73]. The maximum number of tests was set at 2,500,000, the population size was set at 150, the maximum number of generations was set at 27,000, the maximum number of automatically surviving top individuals was set at 1, the gene mutation rate was set at 0.02 and the crossover rate was set at 0.8. The interaction of enzymes and substrates has been illustrated in 2D and 3D using Discovery Studio Visualizer and Chimera softwares [74].

3. Results

3.1. Identification of Brassica LysM-RLK Genes

In the current investigation, 33 LysM-RLK genes were discovered (17 in B. napus, 8 in each of B. rapa and B. oleracea). The prefix Bn, Bo, and Br, as well as the protected domain discovered in each gene, were used to label the identified LysM-RLK genes. The chromosomal location of the genes has been used to estimate the gene number. They were divided into three groups based on their specific domains including LYK (5 in B. napus, 2 in each of B. rapa and B. oleracea), LYP (10 in B. napus, 5 in each of B. rapa and B. oleracea), and LysMn (2 in B. napus, 1 in each of B. rapa and B. oleracea) (Table 1). LYKs are made up of LysM and protein kinase domains, according to supplementary file 1 (Table S1). LysM domains have been discovered in LYPs. Some LYPs are transmembrane, whereas others use a glycosylphosphatidylinositol anchor to bind to the membrane. LysMn proteins have an F box-like domain with extracellular or plasma membrane localization. The physicochemical characteristics of LysM-RLK were investigated using the ProtParam tool. The length of these 33 BnATGs protein sequences ranged from 260 amino acids (BnLysMn1-2, BrLysMn, and BoLysMn) to 665 amino acids (BnLysMn1-2, BrLysMn, and BoLysMn) (BnLYK3 and BoLYK1). The molecular weights of LysM-RLK proteins ranged from 4.08 to 72.76 kDa, with isoelectric points (pI) ranging from 4.64 to 7.78 (Table 1). Based on the pI value, the majority of proteins (28 members, 84.84%) were acidic.
Table 1

Features of Brassica LysM-RLK proteins.

Gene NameGene Stable IDChromosome/Scaffold NameGene Start (bp)Gene End (bp)StrandLengthWeight (kDa)pILocalozation
BnLYK1GSBRNA2T00068250001A41174472011746625−160265.775.32PlasmaMembrane
BnLYK2GSBRNA2T00021233001A611831291186599163369.265.95PlasmaMembrane
BnLYK3GSBRNA2T00085753001A814744431478188166572.765.74PlasmaMembrane
BnLYK4GSBRNA2T00010694001C362664946270198−166472.615.84PlasmaMembrane
BnLYK5GSBRNA2T00149947001C43770985237711907−159865.245.41PlasmaMembrane
BnLYP1GSBRNA2T00065733001A21106490111067027−142243.955.12Extracellular
BnLYP2GSBRNA2T00125469001A685248368526824141443.224.83PlasmaMembrane
BnLYP3GSBRNA2T00147404001A61794479317946593−136238.197.78Extracellular
BnLYP4GSBRNA2T00047588001A728002692802134−136438.767.31Extracellular
BnLYP5GSBRNA2T00102765001A81573712915739025−141543.354.64PlasmaMembrane
BnLYP6GSBRNA2T00125046001C33255379932555600136238.176.12Extracellular
BnLYP7GSBRNA2T00119287001C51079853410800145141643.474.83Extracellular
BnLYP8GSBRNA2T00145018001C799468919949154136438.837.77Extracellular
BnLYP9GSBRNA2T00066647001C82277936222781576141443.354.64PlasmaMembrane
BnLYP10GSBRNA2T00080311001Cnn1072021610722392−14224.085.12Extracellular
BnLysMn1GSBRNA2T00081535001Ann78385247839838−126028.926.22Extracellular
BnLysMn2GSBRNA2T00148567001C41693931816941025126028.916.22PlasmaMembrane
BoLYK1Bo3g181300C36297351362976832166572.665.76PlasmaMembrane
BoLYK2Bo4g151880C44186296841864776−160265.65.32PlasmaMembrane
BoLYP1Bo2g092750C22521140025213688147556.7Extracellular
BoLYP2Bo3g092250C33386021733861898136238.276.13Extracellular
BoLYP3Bo5g034850C51142332911424940141643.454.83Extracellular
BoLYP4Bo7g014960C757706725773498−136438.867.77Extracellular
BoLYP5Bo8g071130C82352800623530169141443.314.72PlasmaMembrane
BoLysMnBo4g078290C41724154017243011−126028.926.22PlasmaMembrane
BrLYK1Bra032146A41096826310970071−160265.845.4PlasmaMembrane
BrLYK2Bra018937A611517331154752163469.26.04PlasmaMembrane
BrLYP1Bra008320A21369058113692699142243.955.12Extracellular
BrLYP2Bra017956A687775908779193141443.254.83Extracellular
BrLYP3Bra009660A61704597317047648−136238.17.78Extracellular
BrLYP4Bra002021A724929282494398−136438.846.68Extracellular
BrLYP5Bra016402A81751243217514340−141543.334.64PlasmaMembrane
BrLysMn1Bra038977Scaffold000157101880103520−126028.916.52Extracellular

3.2. Phylogenetic Analysis of LysM-RLK Proteins

A neighbor-joining phylogenetic dendrogram was constructed to establish the link between Brassica LysM-RLK proteins and their homologous in other plants. According to Figure 1, the LysM-RLKs in Brassica were highly similar to their counterparts in Arabidopsis (At), rice (Os), and grapevine (Vv). LysM-RLK proteins were divided into four subfamilies: LYK, LYP, LysMe, and LysMn. Except for LysMn, all subfamilies have been identified in the three Brassica species investigated. Based on previous studies in Arabidopsis, AtLYP1 and AtLYP3 recognize peptidoglycan while AtCERK1, AtLYK4, and AtLYK5 recognize chitin. Therefore, due to the existence of the BnLYP2, BrLYP2, BoLYP2, BnLYP3, BrLYP3, BoLYP3, BnLYP4, BrLYP4, BoLYP4, BnLYP5, BrLYP5, BoLYP5, BnLYP6, BoLYP7, BnLYP8, BnLYP9 in the clade of AtLYP1 and AtLYP3, they can recognize peptidoglycan while BnLYK1, BrLYK1, BnLYK5, and BoLYK2 formed a monophyletic cluster with AtLYK4 confirming their ability to recognize chitin. It seems that LYP and LYK subgroups can specifically identify peptidoglycan and chitin, respectively. However, some studies reported that some members of the LYP subfamily can bind to both chitin and peptidoglycan ligands such as LYP1, LYP4, LYP5, and LYP6 [25,63,75,76].
Figure 1

Phylogenetic relationships of LysM-RLK genes from Brassica napus (Bn), Brassica rapa (Br), Brassica oleracea (Bo), Oryza sativa (Os), Arabidopsis thaliana (At), Vitis vinifera (Vv), and Brachypodium distachyon (Bd). Colored branches have been used to depict various subfamilies. The phylogenetic dendrogram was constructed using MEGA 7 software and the neighbor-joining (NJ) method with 1000 bootstraps.

3.3. Gene Duplication, Gene Location on the Chromosomes, and Selection Pressure of LysM-RLK Genes

The chromosomal distribution of 33 Brassica LysM-RLK was unequal on chromosomes (Figure 2). Chromosome (Chr) A6 in B. rapa and B. napus had the most genes, while ChrC3 and ChrC4 revealed the highest number of genes in B. oleracea. The genes of each subfamily were located on different chromosomes. In LYP subfamily of B. napus, LYP1, LYP2-3, LYP4, LYP5, LYP6, LYP7, LYP8, LYP9, and LYP10 were found on ChrA2, ChrA6, ChrA7, ChrA8, ChrC3, ChrC5, ChrC7, and ChrC8, respectively. BnLysMn1 and BnLysMn2 were identified on ChrAnn and ChrC4, respectively, in the LysMn subfamily of B. napus. Only segmental duplication was found in the Brassica LysM-RLK gene family, according to duplication analyses (Supplementary Materials: Table S2). Gene duplication is an effective phenomenon contributing to the abundance of duplicate genes in plant genomes which have contributed to the evolution of novel functions. To indicate selection pressure between duplicated genes, the Ks, Ka, and Ka/Ks parameters were investigated for 43 paired genes (Supplementary Materials: Table S3).
Figure 2

The chromosomal location of LysM-RLK genes and the duplication relationship between them. Colored boxes represent chromosomes. Curves are used to show gene duplications.

Except for BnLysMn2/BnLysMn1, BnLysMn2/BrLysMn, and BnLYP6/BrLYP3, the Ka/Ks ratio of 43 paired genes was less than 1, showing negative selection to maintain their function during Brassica evolution. The Ka/Ks ratio for three paired genes (BnLysMn2/BnLysMn1, BnLysMn2/BrLysMn, and BnLYP6/BrLYP3) was more than one, indicating positive selection, which resulted in their various functions as a result of mutations during their evolution.

3.4. Exon-Intron Structures and Conserved Motifs of Brassica LysM-RLKs

The MEME tool was used to find conserved motifs in Brassica LysM-RLK protein sequences (Supplementary Materials: Table S3). According to the data, 15 conserved motifs have been discovered, although the lowest number of motifs was detected in LysMn with 6 motifs (Figure 3A). The highest number of motifs was related to the BoLYP1 with 13 motifs, followed by BnLYP1, BrLYP1, and BnLYP10 with 12 motifs. As expected, each subgroup showed approximately similar motif compositions. Brassica LysM-RLK contains 0 to 10 introns, with BoLYP4 being the longest intron, according to the exon-intron structural study (Figure 3B). Intron-free Brassica LysM-RLK genes account for 9.09% of the genome. The majority of Brassica LysM-RLK genes exhibited zero, one, or two forms of intron splicing, but BnLYK5, BoLYK2, BoLYK1, BrLYK1, BnLysMn1, BnLysMn2, BoLysMn, and BrLysMn had intron phase splicing zero. Exons ranged from one to five in Brassica LysM-RLKs, whereas BnLYK2, BnLYK4, BnLYK3, BrLYK2, and BoLYK1 contained nine and eleven exons, respectively. The highest amount of diversity in the number of exons was observed in the LYK subfamily, which indicates a selective pressure to obtain different functions during the evolution of Brassica [77]. Each subfamily showed similar intron splicing phases. The LysMn subfamily only displayed splicing phase zero, whereas the LYP subfamily showed all three splicing phases. Based on the splicing phase, the LYK subfamily was separated into two groups: (1) BnLYK5, BoLYK2, BoLYK1, and BrLYK1 with splicing phase zero, and (2) BoLYK1, BnLYK2, BrLYK2, BnLYK3, and BnLYK4 with all three splicing phases. The untranslated region was only found in 10 of the Brassica LysM-RLKs including BnLYP2-3, BnLYp4, BnLYP6, BnLYP8-10, BnLYK1-5, and BnLysMn2.
Figure 3

The conserved motifs (A) and exon-intron structure (B) of LysM-RLK genes in Brassica species. Exons and introns were represented by green boxes and black lines, respectively. Different motifs are shown by different colors Exon-intron structure and Motifs were determined using gene structure display server (GSDS) and MEME online tool, respectively.

3.5. The Prediction of Cis-Regulatory Elements, Simple Sequence Repeats (SSR) Markers, and Brassica LysM-RLK-Targeted miRNAs

PlantCare was used to detect cis-regulatory elements in 1500 bp upstream of the Brassica LysM-RLK start codon (Supplementary Materials: Table S4). The Brassica LysM-RLK gene family has been discovered to have 70 cis-elements that can control gene expression in response to five different factors: environmental stresses, light, circadian, phytohormones, and developmental stages. The highest frequency of cis-acting elements in B. napus, B. oleracea, and B. rapa was related to ARE (94.11%), MYC (100%), and ARE (100%), respectively. The lowest frequency of cis-regulatory elements was related to GC-motif (only in BoLYP2) AT-rich sequence (only in BnLYK4), CARE (only in BnLYK2), GTGGC-motif (only in BnLYR1), MSA-like (only in BoLYR2), and F-box (only in BnLYP5). Brassica LysM-RLK contained 218 stress-responsive elements, indicating that they may have a role in regulating the Brassica response to different environmental challenges. 168, 161, and 75 cis-acting elements associated with phytohormones, light, and different tissues were also detected. Therefore, Brassica LysM-RLKs have the potential to play a role in a variety of processes. 22 SSRs were identified in 16 out of 33 Brassica LysM-RLKs (13 SSRs in B. napus, 5 SSRs in B. rapa, and 4 SSRs in B. oleracea) (Table 2). Most genes had a single SSR except BnLYP5 (2 SSRs), BnLYP2, and BnLYP9 (4 SSRs each). The highest frequency was related to tetra-nucleotide repeats (9 SSRs) followed by di-nucleotide repeats (6 SSRs), tri-nucleotide repeats (4 SSRs), and pentanucleotide repeats (3 SSRs). 39 miRNAs for 12 Brassica LysM-RLKs targets have been detected (Supplementary Materials: Table S5). miRNAs and their targets did not have a one-to-one relationship, and many miRNAs shared a common target. For instance, 10 miRNAs named bra-miR156a-5p, bra-miR156b-5p, bra-miR156c-5p, bra-miR156d-5p, bra-miR156e-5p, bra-miR156f-5p, bra-miR156g-5p, bra-miR5725, bra-miR5721, and bra-miR9565-3p co-targeted BrLYP2 transcript. One miRNA such as bna-miR390a can suppress the expression of multiple targets including BnLYK1, BnLYK3, BnLYK4, and BnLYK5 as well.
Table 2

Simple sequence repeats (SSR) were detected in Brassica LysM-RLK genes.

Seq IDCountMotiif
BnLYP81(CAAG)3
BnLYP41(CAAG)3
BnLYK11(CTC)4
BnLYP52(CCTT)4, (TGTGG)3
BnLYP94(CT)7, (CT)9, (AAG)4, (CCTT)4
BnLYP24(TC)7, (TC)6, (GA)7, (AGTC)3
BrLYP41(CAAG)3
BrLYP21(AGTC)3
BrLYK11(CTC)4
BrLYP51(TGTGG)3
BrLysMn1(TATAT)3
BoLYP41(CAAG)3
BoLYP11(CT)9
BoLYK21(CTC)4
BoLYP51(CCTT)4

3.6. Expression Analysis of BnLysM-RLK Genes at Various Tissues under Biotic and Abiotic Stresses

Because of its high content of unsaturated fatty acids and proteins, B. napus is considered one of the plants that produce the healthiest oils. Due to its outstanding properties, such as rapid growth, this plant is also used as a useful species for genetic and molecular studies of development and adaptation to diverse conditions. Therefore, in the current study, the expression of LysM-RLK genes has been investigated in B.napus. RNA-seq data sets for B. napus at different developmental stages tissues have been studied in leaf, flower, root, seed, stem, and silique to discover the related LysM-RLKs (Figure 4, Supplementary Materials: Table S6).
Figure 4

The expression pattern of LysM-RLK genes in different tissues. The color boxes indicate expression values, the lowest (green), medium (black), and the highest (red).

Different expression patterns have been observed in members of the LysM-RLK family. All members of the LYP subfamily revealed moderate to high transcript levels at all developmental stages and tissues except BnLYP1 (low expression in all tissues except seed), BnLYP2 (low expression in stem and leaf), BnLYP6 (low expression in seed and silique), BnLYP7 and BnLYP10 (low expression in the stem, leaf, and flower), BnLYP8 (low expression in flower and seed), and BnLYP9 ((low expression in leaf) and LYP4 low expression in flower. The highest expression in this subfamily was related to seed (BnLYP8, followed by BnLYP5), leaf (BnLYP3, followed by BnLYP6), silique (BnLYP9, followed by BnLYP5), flower (BnLYP2, followed by BnLYP3), and stem (BnLYP3). In the LYK subfamily, all BnLYKs demonstrated low expression except BnLYK1 and BnLYK5 (high level of transcripts in root and moderated expression in leaf), BnLYK2 (high level of transcripts in seed), and BnLYK3 (moderated expression in flower and silique). However, BnLYK5 showed no obvious expression in the flower. Based on RNA-seq data analysis of the BnLysMn subfamily, all members demonstrated moderate to high levels of transcripts in tissues. The expression patterns of BnLysM-RLK genes have been examined to predict their role in responding to abiotic stresses as well (Figure 5, Supplementary Materials: Table S7).
Figure 5

The expression pattern of LysM-RLK genes under abiotic stresses. The color boxes indicate expression values, the lowest (green), medium (black), and the highest (red).

In response to dehydration after one hour, the down-regulated expression has been observed in all BnLysM-RLKs while the expression of BnLYP3-4, BnLYP6, BnLYP8, and BnLYsMn1-2 was up-regulated. After 8 h of dehydration, the expression of all BnLysM-RLKs has been down-regulated obviously except BnLYK3-4, and BnLysMn1-2 which showed up-regulation. The expression of BnLysMn1-2 could be up-regulated by all the studied stresses except BnLysMn1 and BnLysMn2 with no obvious and down-regulated expression in response to cold and ABA after four hours, respectively. Under NaCl treatment, the expression of BnLYK2-3, BnLYP4, and BnLYP6-8 has been decreased whereas the expression of other BnLysM-RLKs has been induced more significantly at 24 h. The expression of BnLYP1 and BnLYP7 has been suppressed by all the studied stresses except BnLYP1 showed up-regulation in response to NaCl after 24 h. The expression of 9 BnLysM-RLKs genes has been up-regulated under ABA stress after four hours including BnLYk1-5, BnLYP2-3, BnLYP6, and BnLysMn1 while the transcript level of the BnLYK2-4, BnLYP1, BnLYP3-4, and BnLYP6-7 was down-regulated after 24 h of ABA treatment. After 24 h of cold stress, the expression of BnLYK2, BnLYK4, BnLYP1-2, BnLYP5, BnLYP7, and BnLYP9-10 genes has been down-regulated. The RNA-seq data sets were applied for analyzing the expression of BnLysM-RLKs in response to fungal pathogens including Leptosphaeria maculans and Sclerotinia sclerotiorum. In response to S. sclerotiorum, BnLYP3-6, BnLYP8-9, and BnLysMn2 revealed moderate to high expression in resistance, sensitivity, and control B. napus whereas the lowest expression was related to BnLYK4. As illustrated in Figure 6, BnLYP3-4 and BnLYP6 are consistently highly expressed in response to S. sclerotiorum, showing that these genes are likely involved in response to a fungal pathogen. The expression of all BnLysM-RLKs revealed down-regulated expression after L. maculans infection while BnLY5 and BnLY9 showed up-regulation (Figure 7). Similarly, BnLYP7 and BnLYK2 have been up-regulated in both susceptible and resistant cultivars except in resistant B. napus after 72 h of infection. BnLYP6 has been suppressed by L. maculans infection in both susceptible and resistant cultivars. In general, the expression of BnLysM-RLKs in response to S. sclerotiorum infection was much higher than the response to L. maculans infection (Supplementary Materials: Tables S8 and S9).
Figure 6

The expression pattern of LysM-RLK genes in response to Sclerotinia clerotiorum infection. The color boxes indicate expression values, the lowest (green), medium (black), and the highest (red). R, S, and C indicate resistant (J964), susceptible (J902), and control plants, respectively.

Figure 7

The expression pattern of LysM-RLK genes in response to Leptosphaeria maculans infection. The color boxes indicate expression values, the lowest (green), medium (black), and the highest (red). R, S, C, and T indicate resistant (DF78), susceptible (Westar), control, and treatment plants, respectively.

3.7. BnLYP6 Structural Modeling and Docking Studies

In the current investigation, the highest expression in response to biotic stress was related to BnLYP6, thus, its molecular structure and ligand-enzyme interaction were investigated. Because PGN and chitin are structurally similar, LYP4 and LYP6 may also physically bind to chitin [75]. I-TASSER and ModRefinder servers have been used to predict and refine three-dimensional structures of BnLYP6 protein. Based on the results of the Ramachandran analysis of non-refined and refined models, the residue count increased in favored regions from 63.7% to 78.1%, which indicates the efficiency of the refinement stage and increase the quality of the modeled structure (Supplementary Materials, Table S10 and Figure S1). The modeled structure for BnLYP6 revealed 8 helices, 14 strands, 6 beta hairpins, 71 beta turns, and 4 gamma turns (Figure 8A). The BnLYP6 structure contains two domains, including the LysM domain I (residues 113–159) and the LysM domain II (residues 177–220) (Figure 8B). The LysM domain is varied in size, ranging from 35 to 50 amino acids. LysM domain I and II revealed three-dimensional βααβ structure, which is inconsistent with the structure of LysM domains in other studies, implying that this structure is highly conserved [78].
Figure 8

The BnLYP6 protein features. Secondary and three-dimensional model structure (A,B). The residues involved in the BnLYP6-chitin interaction (C). Docking studies of the three-dimensional structure of chitin onto the predicted model of BnLYP6 (D). AutoDock v4.2.6 has been used to analyze ligand-protein interaction.

Docking analyses of chitin on the refined model structure were performed using AutoDock 4.2 to investigate the ligand specificity of B. napus LYP6. According to docking simulation with the ligand-enzyme binding energy of -7.9 kcal/mol, THR26, GLY27, ASN28, PHE29, LYS30, LEU202, ASN203, GLU204, ILE215, PRO216, LEU217, and ASP218 6 formed closed contacts with the docked chitin (Figure 8C,D). The chitin formed a hydrogen band with ASP218 and LYS30 of BnLYP6. Hydrogen bonds are the most significant weak interactions in biology. The ligand-enzyme complex seems to be more stable due to a large number of intermolecular hydrogen bonds [79]. In the current study, two hydrogen bonds have been observed between chitin and BnLYP6. On the other hand, the shorter the hydrogen bond, the stronger the bond and the more stable structure. Therefore, the interaction between BnLYP6 and chitin is stable due to the existence of two hydrogen bonds with a length of about 2 Å.

3.8. The Codon Usage Bias Analysis of Brassica LysM-RLK

The results of the codon usage bias analysis have been shown in Supplementary Materials (Supplementary Materials, Table S11). The GC value for Brassica LysM-RLK genes was between 0.437 and 0.548, while the GC3s value was between 0.383 and 0.617. Because of the strong correlation between GC and GC3, the mutation is the most important factor in codon creation (Table 3).
Table 3

The correlation coefficient between the parameters of codon usage of Brassica LysM-RLK gene family.

CAICBIFopENCGC3s
CBI0.7 **
Fop0.73 **0.99 **
ENC−0.12 ns−0.03 ns−0.1 ns
GC3s0.59 **0.81 **0.78 **0.18 ns
GC0.69 **0.88 **0.85 **0.25 ns0.90 **

ns and ** are not-significant and significant at 1% probability level respectively.

The CAI (codon adaptation index), which was in the range of 0.221–0.262 in Brassica LysM-RLKs, is typically used to predict gene expression levels. The closer CAI is to 1, the stronger the codon preference and the higher the gene expression. A relative synonymous codon usage (RSCU) > 1 implies that codons are used more frequently than other synonymous, an RSCU = 1 indicates that codons are not preferred, and an RSCU of 1 indicates that codons are rarely utilized by genes [80]. There are 21 codons in BnLYP7, 22 BnLYP9, 23 codons in BoLYP3-5, BnLYP4-5, and BnLYK4, 24 codons in BrLYP4-5 and BnLYP2, 25 codons in BoLYK1-2, BoLYP1, BrLYP2, BrLYK1, BnLYP5, and BnLYK5, 26 codons in BnLYK3, 27 codons in BrLYP3, BrLysMn, and BnLYK1, 28 codons in BoLYP2, BnLYK2, BnLYP1, BnLysMn1, and BnLYP3, 29 codons in BrLYP1 and BnLYP6, and 30 codons in BoLysMn, BrLYK2, BnLysMn2, and BnLYP1 with RSCU > 1 indicating that these are the most desired codons for each gene. The higher RSCU value (the more frequent codons for each gene) is shown in red, while the lower RSCU value is shown in blue (Figure 9). According to the RSCU value, Brassica LysM-RLKs were divided into four clusters: cluster I (BnLYK2-3, BoLYK1-2, BrLYK1, and BnLYK4-5), cluster II (BnLysMn1-2, BrLysMn, and BoLysMn), cluster III (BrLyP1-2, BnLYP1-2, BoLYP1, BoLYP3, BoLYP5, BrLYP5, BnLYP5, and BnLYP7), and cluster IV (BoLYP2, BnLYP3-4, BrLYP3-4, BoLYP4, BnLYP6, and BnLYP8). Each cluster had a similar preference for codons.
Figure 9

Relative synonymous codon usage analysis (RSCU) values of Brassica LysM-RLKs are represented as a heat map. The color boxes represent RSCU values, with the lowest (blue) and maximum (red) codon usage. TBtools was used to construct the heatmap.

4. Discussion

LysM-RLKs play an important role in the plant immune system against pathogens [17]. In the current study, 33 LysM-RLK genes were found among three Brassica species. In B. napus, 17 genes were detected, while only 8 genes were identified in each of B. oleracea, and B. rapa. Study of RGAs of 30 species of Brassicaceae showed that between 5 and 14 LysM-RLK genes are present in the different assembly versions of B. napus [4]. The study of RLK and RLP in Brassica juncea also showed that the number of LysM-RLK genes in this plant is low and in contrast, LRR-RLK genes have a high frequency [4,81].The observed difference in the number of LysM-RLK genes identified in this study may be due to differences in detection criteria and differences in B. napus assembly versions. RLKs has been identified in many plants such as Arabidopsis thaliana, Oryza sativa, Brachypodium distachyon, Citrus sinensis, Triticum aestivum, Gossypium hirsutum, Pyrus bretschneideri, Malus domestica, Solanum tuberosum and B. juncea that containing 14, 20, 11, 9, 117, 60, 18, 21, 10 and 11 RLKs genes, respectively [16,17,39,40,43,78,81,82]. Due to the variability of the number of genes in different plant species, it can be concluded that the expansion of the LysM-RLK gene family is species-specific resulted from gene duplication events [83]. Based on the number of detected LysM-RLKs, it may be concluded that there is no meaningful association between genome size and the number of genes in plants. For instance, Triticum aestivum and Gossypium hirsutum each have 117 and 60 LysM-RLK genes, while their genome sizes are 17 Gb and 2.5 Gb, respectively. The identified Brassica LysM-RLK were categorized into LYK, LYP, and LysMe groups. The distribution of LysM-RLK was uneven Brassica genome. Most, if not all, flowering plants had one or more genome duplication events in their evolution [84]. In the current study, only segmental duplication resulted in multiple copies of LysM-RLK genes in Brassica. The Ka/Ks ratios of the most duplicated LysM-RLKs were less than 1 except for three duplicated gene pairs (BnLysMn2/BnLysMn1, BnLysMn2/BrLysMn, and BnLYP6/BrLYP3) with Ka/Ks more than 1 and two duplicated gene pairs (BnLYP7/BoLYP3 and BnLYP8/BoLYP4) with no Ka/Ks value due to the same sequence. It should be noted that during evolution, changes in the coding region of duplicated genes resulted in various functions due to amino acid substitution or exon-intron structural divergence [85]. Because of the high purifying selection in the LysM-RLK gene family, the importance of the functional role of Brassica LysM-RLK genes has been determined. According to the phylogenetic tree, it was shown that Brassica LysM-RLKs have a close relationship with their counterparts due to their sequence conservation and similar function. The amino acid compositions of each cluster were similar, implying that the phylogenetic distribution of Brassica LysM-RLK proteins is associated with their motif contents. All members of the LYK cluster contained 10 common motifs 1, 3-4, 7-8, and 11-15. The difference between the members of this subfamily was related to motif 5 in the clade of BoLYK1, BnLYK2, BrLYK2, BnLYK3, and BnLYK4, while the clade of BnLYK1, BrLYK1, BoLYK2, and BnLYK5 contained motif 2. These results are completely consistent with the results of the phylogenetic tree. The LysMn subfamily had common motifs 1, 4, 8, 12, and 13. In the LYP subfamily, three groups were observed. The first group (Cluster I) consisted of BrLYP2, BnLYP2, BoLYP3, BoLYP5, BrLYP, BnLYP5, BnLYP7, and BnLYP9 proteins with common motifs 1-7 and 9-12. The second group (Cluster II), including BnLYP1, BoLYP1, BrLYP1, BnLYP10 with common motifs 1-7, 9-12, and 14 except for BoLYP1 with extra specific motif 13. The third group (Cluster III) consisted of BoLYP2, BrLYP3, BnLYP3, BnLYP4, BrLYP4, BoLYP4, BnLYP6, and BnLYP8 demonstrated 11 same motifs 1-7 and 10-13. The difference between clusters I and II was related to the existence of motifs 14 in the second cluster while cluster III was separated from the above two clusters due to the lack of motif 9. The structure of exons and introns, as well as the splicing phase, play crucial roles in the evolution of gene families [86]. The high and highest conservations were found in intron phases 0 and 1, respectively, while the lowest conservation was found in intron phase 2 [87,88]. The frequency of phases 0 and 1 in all subfamilies was higher than in phase 2, including LYK (63.63%), LYP (55%), and LysMn (100%) indicating high conservation of protein function during Brassica evolution. 9.09% of Brassica LysM-RLK genes were intronless. The study of promoter regions is necessary to understand the function of Brassica LysM-RLK genes. In response to environmental stresses, transcription factors play a significant role. They bind to the target genes’ promoters, regulating their expression [89]. The presence of regulatory components related to stress, developmental stage, light, and phytohormones suggests that LysM-RLKs have a role in the plant’s response to a variety of biological processes. Several cis-elements associated to plant resistance against biotic and abiotic stresses were identified based on the promoter analysis, including ARE, DRE, GC-motif, LTR, MBS, MYB, MYC, STRE, AP1, S-box, W-box, WUN-motif, and WRE3. The TGACG and CGTCA motifs are found on methyl jasmonate-responsive genes [90]. Senescence, seed germination, and response to biotic and abiotic stressors are all affected by jasmonate as well [91]. In response to ABA, the ABRE, ABRE3a, and ABRE4 motifs are activated, resulting in drought and salinity tolerance in plants. The high frequency of cis-acting elements associated with response to drought, pathogen, cold, ABA, auxin, jasmonate, gibberellin, and ethylene suggests that LysM-RLK genes are active in a variety of stress responses in Brassica species. However, the existence of specific regulatory elements is not sufficient evidence for these genes’ responses to specific hormones or stresses, requiring the use of laboratory procedures to precisely determine their function. SSRs are 1-6 nucleotide tandem repeats that have been shown to play a crucial function in gene regulation [92]. In the current study, tetra-nucleotide repeats (40.91%) were found to be more common than other SSRs. The type of dominant SSRs varies in various plant species, and the abundance of AT repeats is higher in the dicots genome than monocots [93]. SSR polymorphisms in LysM-RLK may be examined in different cultivars in the future, and they may be useful for marker-assisted selection (MAS) development in Brassica genetic improvement to choose genotypes with higher resistance to various stresses. MicroRNAs (miRNA) are non-coding small RNAs with a length of 19-24 bp. They are crucial in the regulation of post-transcriptional modifications. Plants, animals, and viruses all have miRNAs. Plant development and responses to environmental stressors are also influenced by them [94]. Brassica miRNAs targeted 6, 5, and 1 transcript in the LYK (BnLYK1, BrLYK1, BoLYK1, BnLYK3-5), LYP (BnLYP2-3 and BrLYP2-3), and LysMn (BnLysMn2) subfamilies, respectively. No LysM-RLK-targeted miRNA was found in LysMn and LYP subfamilies. miR156 is required for the vegetative phase transition of a plant from a juvenile to an adult [95]. Under normal growth conditions, auxin-induced miR390 stimulates lateral root development [96]. Therefore, BnLYK1 and BnLYK3-5 are likely to play a role in root development. miR396 with reduced activity has been demonstrated to give widespread resistance to necrotrophic and hemibiotrophic fungal infections in Arabidopsis [97], thus, BnLysMn2 may be involved in the B.napus defense against fungal infections. miR397 has been reported that target laccase family genes through transcript cleavage in Arabidopsis and rice [98]. As a result, they are required for the maintenance of cell walls and vascular integrity, implying that they play a role in plant defense against various stresses [99]. In Arabidopsis, banana, and rice, miR397 has been shown to have a major impact on plant biomass and yield [99,100,101] that targets BnLYK4 in this study. miR5717 regulates genes involved in lipid metabolism and pollen tube growth [102]. Therefore, BrLYP3 is likely to have a role in reproductive development. It was hypothesized that miR5721 may target genes that encode biotinyl-lipoyl-containing proteins [103]. In B. napus, miR2111 plays a significant role in the response to phosphorus deficiency [104]. Finally, miR6029 has been reported to regulate fatty acid production during the development of B. napus seeds [105]. The expression profile of genes provides important information about the function of the genes that have been found. According to recent studies, RLKs are thought to play a crucial role in stress responses [106,107]. The highest number of BnLysM-RLK genes with moderate to high expression was observed in seeds (76.92%) followed by roots (76.47%), and silique (52.94%) while the lowest number of moderate to highly expressed genes was related to stem (35.39%) preceded by leaf and flower (41.17% each). BnLYK5 was considered not expressed in flower tissue. The highest expression in root and flower tissues was related to BnLYP6 and BnLYP2, respectively, while in stem and leaves the highest expression was related to BnLYP3 and in seed and silique was related to BnLYP9. Most of the low-expression and high-expression genes were related to LYK and LYP subfamilies, respectively. The results demonstrated that the expression patterns of genes belonging to the same subfamily can differ significantly. For instance, the BnLYP3 and BnLYP8 of the LYP subfamily are consistently expressed at high levels while other LYP genes demonstrated a minimum expression except for BnLYP4-6 and BnLYP9 with moderate to high expressions. The results reinforced the hypothesis of divergence that the duplicated genes may be the result of one of two processes: 1) subfunctionalization, and 2) neofunctionalization. In the subfunctionalization process, some of the characteristics of new genes vary from the parental genes [108], whereas the new gene plays a different role in the neofunctionalization process due to differences in amino acid content [109]. Drought is one of the important environmental stresses that have negative effects on plant growth. RLKs’ response to drought stress is influenced by ABA [107]. ABA is a key plant hormone, regulating the expression of genes involved in drought, salt, and osmotic stress responses [110]. As an ABA-dependent pathway, Arabidopsis receptor dead kinase 1 (RDK1) plays an important role in drought stress response. The Arabidopsis rdk1 mutants were hypersensitive to drought stress due to the down-regulation of ABA-responsive genes [111]. Considering the present study, the expression of LysM-RLK genes in response to abiotic stresses varies depending on the stress type and duration. Thus, BnLYP3 and BnLYP8 genes were up-regulated by salt after 4 h of treatment, while they were down-regulated after 24 h under salinity condition. The highest transcript level under dehydration conditions after 1 and 8 h was related to BnLYP3 and BnLysMN2, respectively. Interestingly, in all treatments including salt (after 4 h), ABA (after 4 h), and cold (after 4 and 24 h) BnLYP3 showed the highest expression while the transcript levels of BnLYP9 and BnLysMn2 was higher than other LysM-RLKs in response to salinity and ABA treatments after 24 h, respectively. These findings suggested that the BnLYP3 gene may play a critical role in B. napus response to abiotic stresses, which can be utilized to improve the resistance of B. napus cultivars in future researches. We can also suggest this gene as a marker of abiotic stresses in B. napus. Pathogens and pests are believed to be capable of causing 50–60% losses in Brassica crop yield and quality, resulting in significant economic losses [112]. Sclerotinia stem rot is one of the most destructive diseases for B. napus, caused by S. sclerotiorum. The highest expression in response to S. sclerotiorum was related to BnLYP6, followed by BnLYP4, BnLYP3, BnLYP8, and BnLYP5 in both susceptible and resistant cultivars except in resistant cultivar after 96 h that the highest expression was related to BnLYP3, followed by BnLYP6, BnLYP8, BnlYP9, and BnLYP4. Based on the results of Brotman et al. (2012), CERK1 (LysM-RLK1) receptor is required for chitinase-induced salt and heavy metal tolerance in plants. Likewise, they suggested that ectopic chitinases are largely involved in inducing plant immune response against pathogens mediated by the CERK1 receptor [26]. June et al. (2015) revealed that GbRLK plays an important role in modulating a variety of plant-pathogen interactions in Gossypium barbadense. According to their findings, the majority of the up-regulated genes associated with disease resistance were chitin responsive, implying that the transgenic Arabidopsis showed improved resistance against Verticillium dahlia by modulating the chitin response signaling pathway [113]. Blackleg disease, caused by L. maculans, is a serious production limitation in B. napus. It has been observed in all canola-growing regions except China and causes yearly yield losses of 10–20% [114]. Expression in BnLysM-RLKs is suppressed after L. maculans infection, except in BnLYP5 and BnLYP9 that their expression was slightly increased after pathogen infection. Taken collectively, all members of the gene family are expressed in B. napus. The study of the LysM-RLK gene family in other plants also shows the response of these genes to fungal and bacterial pathogens. A study of transcriptome data has shown that the expression of wheat LysM-RLK genes is induced in response to Flg22 and chitin. Therefore, these genes are involved in wheat resistance to fungal and bacterial pathogens [40]. In Citrus sinensis, the expression of LYK genes have increased in response to Xanthomonas citri, the Citrus bacterial canker (CBC) causing plant bacterial pathogen, and the salicylic acid (SA), methyl jasmonate (MeJA), and abscisic acid (ABA) hormones. Accordingly, there is a link between the LYK genes, the ABA, SA, and MeJA signaling pathways, and CBC resistance [17]. Fusarium graminearum (Fg), the causative agent of Fusarium head blight (FHB), induces the expression of BdLYK2, BdLYK3, and BdLYK4 genes in Brachypodium distachyon. On the other hand, the expression of BdLYP1 and BdLYP4 genes has decreased in response to this pathogen. The function of these genes seems to be similar to the Arabidopsis AtLYP2 and AtLYP3 genes, which are involved in responding to bacterial pathogens [78]. Although these results can confirm the specificity of LYP genes to bacterial PGN, in rice, LYP4 and LYP6 genes are dual-functional and can respond simultaneously to fungal chitin and PGN [115]. The present study also showed that BnLYP6 gene expression is induced in response to fungal pathogens. On the other hand, molecular docking analysis showed that BnLYP6 has a high affinity for chitin, which indicates the role of this gene in responding to fungal pathogens in B. napus. These results together indicate the different functions of LysM-RLK genes in the response of plants to bacterial and fungal pathogens as well as abiotic stresses [116]. CUB can represent the origin of a gene and can be utilized as a theoretical model for analyzing gene evolution and function [117]. The amount of ENC varies between 20 to 61, and the higher the ENC value, the weaker the CUB. The ENC of the Brassica LysM-RLKs ranged from 48.97 to 59.64, indicating that the codons of this family are not affected by strong codon bias and there are various synonymous codons [118]. The CAI index varies from 0 to 1, which is typically applied to measure expression levels [119]. According to the CAI index (0.221-0.2621), the expression efficiency of the BnLysM-RLKs is almost low. Although the codon preference of highly expressed genes is stronger with a higher CAI and lower NC values, low-expression genes have more rare codons, resulting in a lower CAI and a higher NC. For instance, BnLYP6 showed increased expression in response to biotic stresses with almost larger CAI and relatively lower NC. The optimal codon frequency is represented by the FOP and CBI indices, which range from 0 to 1 and -1 to 1, respectively. Based on the results of the FOP and CBI, the frequency of optimum codons in this gene family was low. The majority of Brassica LysM-RLKs showed a GC content of more than 0.5, implying that Brassica LysM-RLKs have obvious preference for GC. 69.69% of Brassica LysM-RLKs demonstrated a GC3s value greater than 0.5, indicating that G/C end codons are preferred.

5. Conclusions

Bioinformatic analyses were performed in this work to discover 33 LysM-RLK genes with significant structural diversity in three Brassica species. Based on the phylogenetic analysis, Brassica LysM-RLK genes were divided into three groups including LYK, LYP, and LysMn. Only segmental duplication was found during the investigation of the mechanism of gene family expansion. The function of most duplicated Brassica LysM-RLK genes has been conserved over evolution due to negative selection. During promoter analysis, several elements in the Brassica LysM-RLK promoters were found, showing that they play a role in stress response and plant growth. 22 SSR and 39 miRNA were detected which can be employed in MAS and genetic transformation, respectively. The functional involvement of LysM-RLK genes in Brassica tissues in response to environmental stressors was revealed by their expression patterns in diverse tissues. Due to the high expression of BnLYP3 genes in response to Sclerotinia stem rot infection and BnLYP3 in response to abiotic stresses, these genes can be exploited in the production of B. napus plants resistant to biotic and abiotic stresses. The discovery of these residues might be important in future investigations to improve the efficiency of the LYP6 enzymes and generate pathogen-resistant B. napus by site-directed mutagenesis. This research has given fundamental information on the LysM-RLK genes, which will be useful in future investigations aimed at improving Brassica quality.
  108 in total

1.  The monosaccharide transporter gene family in Arabidopsis and rice: a history of duplications, adaptive evolution, and functional divergence.

Authors:  Deborah A Johnson; Michael A Thomas
Journal:  Mol Biol Evol       Date:  2007-09-06       Impact factor: 16.240

2.  Evolution and regulation of the Lotus japonicus LysM receptor gene family.

Authors:  Gitte Vestergaard Lohmann; Yoshikazu Shimoda; Mette Wibroe Nielsen; Frank Grønlund Jørgensen; Christina Grossmann; Niels Sandal; Kirsten Sørensen; Søren Thirup; Lene Heegaard Madsen; Satoshi Tabata; Shusei Sato; Jens Stougaard; Simona Radutoiu
Journal:  Mol Plant Microbe Interact       Date:  2010-04       Impact factor: 4.171

Review 3.  Effector-triggered immunity: from pathogen perception to robust defense.

Authors:  Haitao Cui; Kenichi Tsuda; Jane E Parker
Journal:  Annu Rev Plant Biol       Date:  2014-12-08       Impact factor: 26.379

4.  LYK4, a lysin motif receptor-like kinase, is important for chitin signaling and plant innate immunity in Arabidopsis.

Authors:  Jinrong Wan; Kiwamu Tanaka; Xue-Cheng Zhang; Geon Hui Son; Laurent Brechenmacher; Tran Hong Nha Nguyen; Gary Stacey
Journal:  Plant Physiol       Date:  2012-06-28       Impact factor: 8.340

5.  F-box genes: Genome-wide expansion, evolution and their contribution to pollen growth in pear (Pyrus bretschneideri).

Authors:  Guo-Ming Wang; Hao Yin; Xin Qiao; Xu Tan; Chao Gu; Bao-Hua Wang; Rui Cheng; Ying-Zhen Wang; Shao-Ling Zhang
Journal:  Plant Sci       Date:  2016-09-29       Impact factor: 4.729

6.  LYM2-dependent chitin perception limits molecular flux via plasmodesmata.

Authors:  Christine Faulkner; Elena Petutschnig; Yoselin Benitez-Alfonso; Martina Beck; Silke Robatzek; Volker Lipka; Andrew J Maule
Journal:  Proc Natl Acad Sci U S A       Date:  2013-05-14       Impact factor: 11.205

7.  CircRNA Expression Pattern and ceRNA and miRNA-mRNA Networks Involved in Anther Development in the CMS Line of Brassica campestris.

Authors:  Yuwei Liang; Yuzhi Zhang; Liai Xu; Dong Zhou; Zongmin Jin; Huiyan Zhou; Sue Lin; Jiashu Cao; Li Huang
Journal:  Int J Mol Sci       Date:  2019-09-27       Impact factor: 5.923

8.  BatchPrimer3: a high throughput web application for PCR and sequencing primer design.

Authors:  Frank M You; Naxin Huo; Yong Qiang Gu; Ming-Cheng Luo; Yaqin Ma; Dave Hane; Gerard R Lazo; Jan Dvorak; Olin D Anderson
Journal:  BMC Bioinformatics       Date:  2008-05-29       Impact factor: 3.169

9.  PubChem 2019 update: improved access to chemical data.

Authors:  Sunghwan Kim; Jie Chen; Tiejun Cheng; Asta Gindulyte; Jia He; Siqian He; Qingliang Li; Benjamin A Shoemaker; Paul A Thiessen; Bo Yu; Leonid Zaslavsky; Jian Zhang; Evan E Bolton
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  Overexpression of native Musa-miR397 enhances plant biomass without compromising abiotic stress tolerance in banana.

Authors:  Prashanti Patel; Karuna Yadav; Ashish Kumar Srivastava; Penna Suprasanna; Thumballi Ramabhatta Ganapathi
Journal:  Sci Rep       Date:  2019-11-11       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.