| Literature DB >> 33225897 |
Vinod Kumar Prajapati1, Mahendra Varma1,2, Jyothilakshmi Vadassery3.
Abstract
BACKGROUND: The common cutworm, Spodoptera litura Fabricius is a leaf and fruit feeding generalist insect of the order Lepidoptera and a destructive agriculture pest. The broad host range of the herbivore is due to its ability to downregulate plant defense across different plants. The identity of Spodoptera litura released effectors that downregulate plant defense are largely unknown. The current study aims to identify genes encoding effector proteins from salivary glands of S. litura (Fab.).Entities:
Keywords: Effector proteins; Herbivory; Plant defense; Spodoptera litura; de novo transcriptomics
Mesh:
Year: 2020 PMID: 33225897 PMCID: PMC7681983 DOI: 10.1186/s12864-020-07196-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Salivary glands present in S. litura after dissection a) Labial salivary gland b) Ventral eversible gland c) Head
Fig. 2Overview of in silico pipeline to identify potential effector proteins from generalist herbivore, Spodoptera litura. Spodoptera litura 4th instar larvae were used for salivary gland isolation. RNA was isolated from two samples; head (mandibular glands) and salivary gland (LSG and VEG) and was sequenced using Illumina platform. Raw reads were processed and later four filtering steps; presence of signal peptide, no transmembrane helices (TMH), extracellular targeting and clustering into families were sequentially applied to obtain a database of in silico secretory proteins and potential effector proteins. Finally, 808 proteins from head and 267 proteins from salivary gland were identified to be potential effector protein encoding genes. Red boxes are the software’s used, corresponding to each filtering step. Taxonomic distribution and functional annotation was performed using six different databases (Nr, UniProt, Pfam, KO, GO, and COG). Numbers shown here are predicted peptides remaining after each step in the pipeline
Summary of transcriptome data from Spodoptera litura head and salivary gland
| Statistics | Head | SG |
|---|---|---|
| Raw reads (in million) | 44.53 | 48.16 |
| Processed reads (in million) | 41.68 | 44.05 |
| Percentage of high-quality reads (%) | 93.6 | 91.47 |
| Transcripts generated | 91,927 | 71,706 |
| Average transcript length (bp) | 1106 | 957 |
| Number of predicted proteins | 31,450 | 22,486 |
| Number of complete proteins | 18,754 | 11,992 |
| Number of secretory proteins | 899 | 330 |
| Number of potential effector proteins | 808 | 267 |
Fig. 3a Length distribution of transcripts after de novo assembly of Spodoptera litura head and salivary gland (SG) transcriptome using Trinity. b The number of potential effector proteins predicted in the head and salivary gland of Spodoptera litura using de novo based assembly and reference based assembly
Protein family prediction using MCL clustering
| Groups | Family size | Frequency | |
|---|---|---|---|
| Group 1 | 92 | 1 | |
| Group 2 | 61 | 1 | |
| Group 3 | 28 | 1 | |
| Group 4 | 23 | 1 | |
| Group 5 | 16 | 3 | |
| Group 6 | 11 ≤ 15 | 5 | |
| Group 7 | 6 ≤ 10 | 17 | |
| Group 8 | 2 ≤ 5 | 130 | |
| Group 1 | 19 | 1 | |
| Group 2 | 16 | 1 | |
| Group 3 | 12 | 2 | |
| Group 4 | 6 ≤ 10 | 6 | |
| Group 5 | 2 ≤ 5 | 54 |
Fig. 4Potential effector proteins annotated with six public databases. In total, 808 proteins from the head and 267 proteins from the salivary gland were predicted to be potential effector proteins. These set of proteins from both the samples were simultaneously annotated with six different databases; NR (RefSeq non-reductant NCBI) and UniProt for taxonomic distribution and Pfam, KO (KEGG Orthology), GO (Gene Ontology) and COG (Clusters of Orthologous Groups) for functional annotation. Values at the top of each bar show the number of proteins annotated to corresponding database. Annotation of head effector proteins (a) and SG effector proteins (b)
Fig. 5Phylogenetic Relationship and similarity percentage of all the potential effector proteins in the head and salivary gland. Similarity search of the proteins was carried out using BLASTp for species and similarity distribution. BLASTp was performed against all the sequences from the RefSeq non-redundant NCBI database (Order: Lepidoptera) at an e-value cut-off of 1e-5. Fig. A and B are for species distribution of the BLAST result for each protein in the head and salivary gland, respectively. Whereas Figs. C and D are, similarity percentage distribution of BLAST hits for each protein
Fig. 6The enriched Gene Ontology categories (biological process and molecular function) for all predicted potential effector proteins. In total, 693 of 808 proteins in the head and 194 of 267 in salivary gland were assigned GO term. GO categories were assigned to proteins using PANNZER-2. GO terms with ≥1% abundance in annotated proteins are shown here. GO terms with < 1% abundance are grouped into “other biological processes” and “Other molecular function”
Fig. 7a The predicted KEGG pathways for all potential effector proteins. In KO database, a small number of proteins were annotated i.e. 142 out of 808 in the head and 80 out of 267 in the salivary gland were annotated. The proteins in the second hierarchy of the KEGG pathway were assigned to 5 categories A: Metabolism, B: Genetic Information Processing, C: Environmental Information Processing, D: Cellular Processes, E: Organismal Systems. b Predicted Pfam domains in the transcriptome of the head and Salivary gland of S. litura. In Pfam database, 558 out of 808 proteins in the head and 178 out of 267 proteins in the salivary gland were annotated. Pfam domains were mapped on potential effector proteins via Pfam standalone software using the default parameters. Numerical values at the top of each bar represent the number of times that domain has occurred in the annotated proteins. The domains which have occurred ≥5 times in the annotated proteins are shown here
The S. litura transcriptome protein families according to MCL analysis with their predicted Pfam domains
| Family size (number of proteins) | Frequency (number of families) | Major predicted Pfam domains | |
|---|---|---|---|
| 92 | 1 | Insect cuticle protein | |
| 61 | 1 | Regulatory CLIP domain of proteinases//Trypsin | |
| 28 | 1 | Haemolymph juvenile hormone binding protein (JHBP) | |
| 23 | 1 | Insect pheromone-binding family | |
| 16 | 3 | Lectin C-type domain, PBP/GOBP family, Reeler domain//Spondin_N | |
| 14 | 2 | Serpin (serine protease inhibitor), Trypsin Inhibitor like cysteine rich domain | |
| 12 | 1 | Lectin C-type domain | |
| 11 | 2 | Immunoglobulin domain | |
| 10 | 2 | Protein of unknown function (DUF563), Leucine rich repeat | |
| 9 | 1 | Metallo-peptidase family M12B Reprolysin-like | |
| 8 | 4 | Attacin, Kazal-type serine protease inhibitor domain, Lipase | |
| 7 | 3 | N-acetylmuramoyl-L-alanine amidase, Chitin binding Peritrophin-A domain | |
| 6 | 7 | Gamma interferon inducible lysosomal thiol reductase (GILT), GMC oxidoreductase, Neuroendocrine protein | |
| 5 | 13 | Cysteine-rich secretory protein family, Chitin binding domain, Cathepsin propeptide inhibitor domain | |
| 4 | 23 | EF-hand domain pair, EB module, Lectin C-type domain, Sulfotransferase family | |
| 3 | 25 | Ferritin-like domain, Carboxylesterase family, Aldo/keto reductase family, Copper/zinc superoxide dismutase | |
| 2 | 69 | Sulfatase, PBP/GOBP family, Thioredoxin | |
| 19 | 1 | Insect cuticle protein | |
| 16 | 1 | Trypsin | |
| 12 | 2 | Lectin C-type domain//Lectin C-type domain, sulfatase | |
| 10 | 1 | Haemolymph juvenile hormone binding protein (JHBP) | |
| 9 | 2 | Ferritin-like domain, Trypsin Inhibitor like cysteine rich domain | |
| 8 | 1 | Serpin (serine protease inhibitor) | |
| 6 | 2 | EF-hand domain pair, Immunoglobulin domain | |
| 5 | 6 | Cathepsin propeptide inhibitor domain (I29), Chitin binding Peritrophin-A domain | |
| 4 | 10 | WIF domain//EGF-like domain, Destabilase, Chitin binding domain, Gamma interferon inducible lysosomal thiol reductase (GILT) | |
| 3 | 14 | Thioredoxin-like domain, Carboxylesterase family, ML domain, Sema domain | |
| 2 | 24 | Lipase, Attacin, Aldo/keto reductase family, Sulfotransferase family, Peptidase S24 |
Fig. 8a Potential effector proteins from salivary glands present in two different environment conditions i.e. artificial diet fed and Arabidopsis fed. Venn diagram depicting the number of potential effector proteins in artificial diet fed and Arabidopsis fed. 110 potential effector proteins are common under both conditions. b Predicted Pfam domains in the common effector proteins. In total, there are 110 common proteins between artificial diet fed and Arabidopsis fed salivary gland. Pfam domains were assigned to these proteins through Pfam standalone software using the default parameters. Numerical values at the top of each bar represent the number of times that domain has occurred in the common proteins. The domains that occurred ≥5 times in the common proteins are shown here
Fig. 9RT-PCR amplification of most highly expressed potential effector encoding genes in (a) salivary gland (b) head. The expression is checked in different tissues, SG (salivary gland), WB (whole body of insect), WB-(H + SG) (whole body without head and salivary gland)