| Literature DB >> 26931371 |
Vishal S Somvanshi1, Shachi Gahoi2, Prakash Banakar3, Prasoon Kumar Thakur4, Mukesh Kumar5, Manisha Sajnani6, Priyatama Pandey7, Uma Rao8.
Abstract
BACKGROUND: Nematodes are the most numerous animals in the soil. Insect parasitic nematodes of the genus Heterorhabditis are capable of selectively seeking, infecting and killing their insect-hosts in the soil. The infective juvenile (IJ) stage of the Heterorhabditis nematodes is analogous to Caenorhabditis elegans dauer juvenile stage, which remains in 'arrested development' till it finds and infects a new insect-host in the soil. H. indica is the most prevalent species of Heterorhabditis in India. To understand the genes and molecular processes that govern the biology of the IJ stage, and to create a resource to facilitate functional genomics and genetic exploration, we sequenced the transcriptome of H. indica IJs.Entities:
Mesh:
Year: 2016 PMID: 26931371 PMCID: PMC4774024 DOI: 10.1186/s12864-016-2510-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Assembly statistics of H. indica transcriptome generated by Velvet and Oases
| Assembly statistics of | ||||
|---|---|---|---|---|
| k-mer length | 71 | 75 | 79 | 83 |
| No. of contig | 22,698 | 21,760 | 20,363 | 18,710 |
| Min contig length (bp) | 200 | 200 | 200 | 200 |
| Max contig length (bp) | 12,876 | 11,783 | 10,340 | 10,673 |
| N50 (bp) | 810 | 828 | 872 | 909 |
| Assembly statistics of | ||||
| Parameters | Reading | |||
| No. of transcripts | 23,827 | |||
| Total assembly (Mb) | 22 | |||
| Min transcript length (bp) | 102 | |||
| Max transcript length (bp) | 12,876 | |||
| N50 | 1,292 | |||
| Final assembly statistics of the | ||||
| Parameters | Reading | |||
| No. of unique transcripts | 13,593 | |||
| Total assembly (Mb) | 15 | |||
| min transcript length (bp) | 300 | |||
| Max transcript length (bp) | 12,876 | |||
| N50 | 1,371 | |||
Fig. 1a Distribution of the top 10 nematode species with most homologs to Heterorhabditis indica. The distribution was calculated using best blastx hits. b. Venn diagram of H. indica transcripts matching H. bacteriophora proteins in a standalone blast
Fig. 2a H. indica orthologs present in selected completely sequenced genomes of free-living nematodes C. elegans, C. remanei, C. briggsae, Pristionchus pacificus and Heterorhabditis bacteriophora. b H. indica orthologs in animal and plant parasitic nematodes
Gene ontology analysis of proteins, conceptually translated from contigs of H. indica
| GO category | GO code | GO term | No. of proteins (%) |
|---|---|---|---|
| Cellular component | GO:0005623 | Cell | 2368 (29.1) |
| GO:0043226 | Organelle | 981 (12.1) | |
| GO:0032991 | Macromolecular complex | 741 (9.1) | |
| GO:0005576 | Extracellular region | 99 (1.2) | |
| GO:0031974 | Membrane-enclosed lumen | 60 (0.7) | |
| GO:0031975 | Envelope | 47 (0.6) | |
| GO:0045202 | Synapse | 1 (0) | |
| Molecular function | GO:0005488 | Binding | 4027 (49.6) |
| GO:0003824 | Catalytic activity | 3293 (40.5) | |
| GO:0005215 | Transporter activity | 456 (5.6) | |
| GO:0005198 | Structural molecule activity | 427 (5.3) | |
| GO:0030528 | Transcription regulator activity | 207 (2.5) | |
| GO:0030234 | Enzyme regulator activity | 179 (2.2) | |
| GO:0060089 | Molecular transducer activity | 169 (2.1) | |
| GO:0009055 | Electron carrier activity | 79 (1) | |
| GO:0045182 | Translation regulator activity | 53 (0.7) | |
| GO:0016209 | Antioxidant activity | 49 (0.6) | |
| Biological process | GO:0008152 | Metabolic process | 3466 (42.7) |
| GO:0009987 | Cellular process | 3293 (40.5) | |
| GO:0051179 | Localization | 763 (9.4) | |
| GO:0065007 | Biological regulation | 762 (9.4) | |
| GO:0043473 | Pigmentation | 748 (9.2) | |
| GO:0016043 | Cellular component organization | 158 (1.9) | |
| GO:0050896 | Response to stimulus | 152 (1.9) | |
| GO:0044085 | Cellular component biogenesis | 113 (1.4) | |
| GO:0010926 | Anatomical structure formation | 73 (0.9) | |
| GO:0032501 | Multicellular organismal process | 31 (0.4) | |
| GO:0022610 | Biological adhesion | 31 (0.4) | |
| GO:0032502 | Developmental process | 18 (0.2) | |
| GO:0051704 | Multi-organism process | 6 (0.1) | |
| GO:0016265 | Death | 4 (0) | |
| GO:0002376 | Immune system process | 2 (0) | |
| GO:0000003 | Reproduction | 1 (0) | |
| GO:0022414 | Reproductive process | 1 (0) | |
| GO:0016032 | Viral reproduction | 1 (0) |
Fig. 3GO term analysis for all predicted proteins in IJ transcriptome of H. indica
The sixty pathways most represented in the transcriptome of H. indica IJs
| Pathway ID | Pathway term | No of proteins |
|---|---|---|
| 3010 | Ribosome | 117 |
| 3040 | Spliceosome | 95 |
| 4141 | Protein processing in endoplasmic reticulum | 88 |
| 3013 | RNA transport | 87 |
| 5200 | Pathways in cancer | 86 |
| 5016 | Huntington's disease | 85 |
| 190 | Oxidative phosphorylation | 74 |
| 5010 | Alzheimer's disease | 73 |
| 230 | Purine metabolism | 70 |
| 4144 | Endocytosis | 70 |
| 5012 | Parkinson's disease | 68 |
| 5169 | Epstein-Barr virus infection | 68 |
| 5166 | HTLV-I infection | 67 |
| 1200 | Carbon metabolism | 63 |
| 4932 | Non-alcoholic fatty liver disease, NAFLD | 63 |
| 4120 | Ubiquitin mediated proteolysis | 62 |
| 5203 | Viral carcinogenesis | 61 |
| 4110 | Cell cycle | 59 |
| 240 | Pyrimidine metabolism | 58 |
| 5205 | Proteoglycans in cancer | 55 |
| 3008 | Ribosome biogenesis in eukaryotes | 53 |
| 4151 | PI3K-Akt signaling pathway | 53 |
| 3015 | mRNA surveillance pathway | 52 |
| 4010 | MAPK signaling pathway | 52 |
| 4111 | Cell cycle-yeast | 48 |
| 4015 | Rap1, signaling pathway | 47 |
| 4510 | Focal adhesion | 47 |
| 4014 | Ras signaling pathway | 46 |
| 4810 | Regulation of actin cytoskeleton | 46 |
| 4910 (ko04910) | Insulin signaling pathway | 46 |
| 4142 | Lysosome | 45 |
| 4068 | FoxO signaling pathway | 42 |
| 4152 | AMPK signaling pathway | 42 |
| 4114 | Oocyte meiosis | 42 |
| 3018 | RNA degradation | 41 |
| 4024 | cAMP signaling pathway | 41 |
| 4310 | Wnt signaling pathway | 39 |
| 4390 | Hippo signaling pathway | 39 |
| 5206 | MicroRNAs in cancer | 38 |
| 1230 | Biosynthesis of amino acids | 37 |
| 4530 | Tight junction | 37 |
| 4062 | Chemokine signaling pathway | 37 |
| 4722 | Neurotrophin signaling pathway | 37 |
| 4146 | Peroxisome | 36 |
| 4113 | Meiosis-yeast | 36 |
| 5168 | Herpes simplex infection | 36 |
| 564 | Glycerophospholipid metabolism | 35 |
| 4071 | Sphingolipid signaling pathway | 35 |
| 4145 | Phagosome | 35 |
| 4914 | Progesterone-mediated oocyte maturation | 35 |
| 4921 | Oxytocin signaling pathway | 35 |
| 4919 | Thyroid hormone signaling pathway | 35 |
| 3420 | Nucleotide excision repair | 32 |
| 4022 (ko04022) | cGMP-PKG signaling pathway | 32 |
| 4550 | Signaling pathways regulating pluripotency of stem cells | 32 |
| 3050 | Proteasome | 31 |
| 4721 | Synaptic vesicle cycle | 30 |
| 510 | N-Glycan biosynthesis | 29 |
| 3022 | Basal transcription factors | 29 |
| 5100 | Bacterial invasion of epithelial cells | 29 |
Fig. 4Metabolic pathways active in H. indica infective juveniles as revealed by the transcriptomic analysis using iPATH2 interactive pathway explorer
KOG analysis of genes with FPKM Values ≥ 1000 and ≥100
| KOG function | Function ID | Gene count |
|---|---|---|
|
| ||
| Translation, ribosomal structure and biogenesis | J | 27 |
| Posttranslational modification, protein turnover, chaperones | O | 10 |
| Multiple classes | 9 | |
| Intracellular trafficking, secretion, and vesicular transport | U | 7 |
| Lipid transport and metabolism | I | 7 |
| Cytoskeleton | Z | 4 |
| Defense mechanisms | V | 2 |
| Energy production and conversion | C | 2 |
| General function prediction only | R | 2 |
| Transcription | K | 2 |
| Carbohydrate transport and metabolism | G | 1 |
| Chromatin structure and dynamics | B | 1 |
| Extracellular structures | W | 1 |
| Inorganic ion transport and metabolism | P | 1 |
| Grand total | 76 | |
|
| ||
| Posttranslational modification, protein turnover, chaperones | O | 279 |
| Multiple classes | 264 | |
| General function prediction only | R | 235 |
| Translation, ribosomal structure and biogenesis | J | 189 |
| Function unknown | S | 160 |
| Intracellular trafficking, secretion, and vesicular transport | U | 155 |
| Signal transduction mechanisms | T | 131 |
| Energy production and conversion | C | 124 |
| RNA processing and modification | A | 119 |
| Transcription | K | 108 |
| Cytoskeleton | Z | 90 |
| Lipid transport and metabolism | I | 84 |
| Carbohydrate transport and metabolism | G | 77 |
| Amino acid transport and metabolism | E | 66 |
| Inorganic ion transport and metabolism | P | 53 |
| Cell cycle control, cell division, chromosome partitioning | D | 39 |
| Replication, recombination and repair | L | 36 |
| Nucleotide transport and metabolism | F | 25 |
| Chromatin structure and dynamics | B | 24 |
| Secondary metabolites biosynthesis, transport and catabolism | Q | 21 |
| Extracellular structures | W | 18 |
| Coenzyme transport and metabolism | H | 18 |
| Cell wall/membrane/envelope biogenesis | M | 15 |
| Defense mechanisms | V | 13 |
| Nuclear structure | Y | 1 |
| Cell motility | N | 1 |
| Grand total | 2345 | |
Kinases belonging to different group/family/subfamily represented in IJ stage of H. indica
| S. No. | Kinase group/family/subfamily |
|
| S. No. | Kinase group/family/subfamily |
|
|
|---|---|---|---|---|---|---|---|
| 1 | AGC/Akt | 2 | 1 | 71 | CMGC/DYRK/DYRK1 | 1 | 1 |
| 2 | AGC/DMPK/GEK | 1 | 3 | 72 | CMGC/DYRK/DYRK2 | 3 | 1 |
| 3 | AGC/DMPK/ROCK | 1 | 2 | 73 | CMGC/DYRK/PRP4 | 1 | 1 |
| 4 | AGC/GRK/BARK | 1 | 3 | 74 | CMGC/GSK | 7 | 4 |
| 5 | AGC/GRK/GRK | 1 | 2 | 75 | CMGC/MAPK/ERK1 | 1 | 1 |
| 6 | AGC/MAST/MAST | 1 | 3 | 76 | CMGC/MAPK/ERK7 | 1 | 1 |
| 7 | AGC/NDR/LATS | 1 | 1 | 77 | CMGC/MAPK/JNK | 5 | 1 |
| 8 | AGC/NDR/NDR | 1 | 2 | 78 | CMGC/MAPK/nmo | 1 | 1 |
| 9 | AGC/PDK1 | 2 | 1 | 79 | CMGC/MAPK/MAPK-Unclassified | 3 | 2 |
| 10 | AGC/PKA | 2 | 4 | 80 | CMGC/MAPK/p38 | 3 | 2 |
| 11 | AGC/PKC/PKCa | 1 | 3 | 81 | CMGC/SRPK | 1 | 1 |
| 12 | AGC/PKC/PKCd | 1 | 1 | 82 | Other/Aur | 2 | 2 |
| 13 | AGC/PKC/PKCh | 1 | 1 | 83 | Other/BUB | 1 | 1 |
| 14 | AGC/PKC/PKCi | 1 | 1 | 84 | Other/Bud32 | 1 | 2 |
| 15 | AGC/PKG | 2 | 3 | 85 | Other/Haspin | 13 | 1 |
| 16 | AGC/PKN | 1 | 1 | 86 | Other/NAK/BIKE | 1 | 2 |
| 17 | AGC/RSK/MSK | 1 | 1 | 87 | Other/NEK/NEK6 | 1 | 2 |
| 18 | AGC/RSK/RSKp70 | 2 | 1 | 88 | Other/NEK/NEK8 | 2 | 2 |
| 19 | AGC/RSK/RSKp90 | 1 | 1 | 89 | Other/NKF2 | 1 | 1 |
| 20 | AGC/SGK | 1 | 1 | 90 | Other/NRBP | 1 | 3 |
| 21 | AGC/YANK | 1 | 1 | 91 | Other/Other-Unique | 10 | 2 |
| 22 | Atypical/ABC1/ABC1-A | 1 | 1 | 92 | Other/PEK/GCN2 | 1 | 2 |
| 23 | Atypical/ABC1/ABC1-B | 1 | 1 | 93 | Other/PLK/PLK1 | 3 | 1 |
| 24 | Atypical/BRD | 3 | 3 | 94 | Other/SCY1 | 2 | 4 |
| 25 | Atypical/PDHK/PDHK | 2 | 2 | 95 | Other/TLK | 1 | 3 |
| 26 | Atypical/PIKK/FRAP | 1 | 3 | 96 | Other/ULK/ULK | 2 | 1 |
| 27 | Atypical/PIKK/SMG1 | 1 | 2 | 97 | Other/WEE/Myt1 | 2 | 1 |
| 28 | Atypical/PIKK/TRRAP | 1 | 1 | 98 | Other/WNK | 1 | 2 |
| 29 | Atypical/RIO/RIO1 | 1 | 2 | 99 | Other/Worm3 | 2 | 1 |
| 30 | Atypical/RIO/RIO2 | 1 | 1 | 100 | RGC/RGC | 27 | 13 |
| 31 | Atypical/TAF1 | 1 | 3 | 101 | STE/STE11/ASK | 1 | 3 |
| 32 | CAMK/CAMK1 | 1 | 2 | 102 | STE/STE11/MEKK4 | 1 | 1 |
| 33 | CAMK/CAMK2 | 1 | 1 | 103 | STE/STE20/FRAY | 1 | 2 |
| 34 | CAMK/CAMKL/AMPK | 2 | 1 | 104 | STE/STE20/KHS | 1 | 1 |
| 35 | CAMK/CAMKL/LKB | 1 | 2 | 105 | STE/STE20/MSN | 1 | 3 |
| 36 | CAMK/CAMKL/MARK | 2 | 2 | 106 | STE/STE20/MST | 1 | 2 |
| 37 | CAMK/CAMKL/MELK | 1 | 2 | 107 | STE/STE20/PAKA | 2 | 3 |
| 38 | CAMK/CAMKL/NIM1 | 1 | 2 | 108 | STE/STE20/SLK | 1 | 3 |
| 39 | CAMK/CAMKL/QIK | 1 | 2 | 109 | STE/STE20/TAO | 1 | 1 |
| 40 | CAMK/CAMKL/SNRK | 1 | 1 | 110 | STE/STE20/YSK | 1 | 1 |
| 41 | CAMK/CASK | 1 | 1 | 111 | STE/STE7/MEK1 | 1 | 1 |
| 42 | CAMK/DAPK/DAPK | 1 | 2 | 112 | STE/STE7/MEK3 | 1 | 1 |
| 43 | CAMK/DCAMKL | 2 | 1 | 113 | STE/STE7/MEK4 | 3 | 1 |
| 44 | CAMK/MAPKAPK/MK2 | 2 | 2 | 114 | STE/STE7/MEK7 | 2 | 1 |
| 45 | CAMK/MAPKAPK/MNK | 1 | 1 | 115 | TK/Abl | 1 | 2 |
| 46 | CAMK/MLCK | 4 | 17 | 116 | TK/Ack | 2 | 4 |
| 47 | CAMK/PHK | 1 | 2 | 117 | TK/ALK | 1 | 1 |
| 48 | CAMK/PIM | 2 | 1 | 118 | TK/Csk | 1 | 1 |
| 49 | CAMK/PKD | 2 | 4 | 119 | TK/DDR | 2 | 2 |
| 50 | CAMK/PSK | 1 | 1 | 120 | TK/Eph | 1 | 1 |
| 51 | CAMK/TSSK | 3 | 2 | 121 | TK/Fer | 38 | 4 |
| 52 | CK1/CK1/CK1-A | 1 | 1 | 122 | TK/FGFR | 1 | 1 |
| 53 | CK1/CK1/CK1-D | 1 | 1 | 123 | TK/InsR | 1 | 3 |
| 54 | CK1/CK1/CK1-G | 1 | 1 | 124 | TK/KIN6 | 5 | 2 |
| 55 | CK1/CK1-Unique | 6 | 1 | 125 | TKL/IRAK | 1 | 1 |
| 56 | CK1/TTBK | 1 | 1 | 126 | TKL/LRRK | 1 | 2 |
| 57 | CK1/TTBKL | 31 | 4 | 127 | TKL/MLK/HH498 | 1 | 1 |
| 58 | CK1/Worm6 | 28 | 15 | 128 | TKL/MLK/ILK | 1 | 1 |
| 59 | CMGC/CDK/CDC2 | 2 | 3 | 129 | TKL/MLK/MLK | 1 | 1 |
| 60 | CMGC/CDK/CDK4 | 1 | 1 | 130 | TKL/RAF/RAF | 2 | 1 |
| 61 | CMGC/CDK/CDK5 | 1 | 2 | 131 | TKL/STKR/STKR1 | 2 | 2 |
| 62 | CMGC/CDK/CDK7 | 1 | 1 | 132 | TK/Src/Src-Unclassified | 2 | 2 |
| 63 | CMGC/CDK/CDK8 | 1 | 1 | 133 | TKL/STKR/STKR2 | 1 | 2 |
| 64 | CMGC/CDK/CDK9 | 1 | 1 | 134 | TK/Met | 2 | 2 |
| 65 | CMGC/CDK/CRK7 | 1 | 1 | 135 | TK/Ror | 1 | 1 |
| 66 | CMGC/CDK/PCTAIRE | 1 | 2 | 136 | TK/Src/Frk | 1 | 1 |
| 67 | CMGC/CDK/PFTAIRE | 1 | 2 | 137 | TK/TK-Unique | 6 | 2 |
| 68 | CMGC/CDK/PITSLRE | 2 | 1 | ||||
| 69 | CMGC/CK2 | 1 | 1 | ||||
| 70 | CMGC/CLK | 3 | 1 |
Members of flp, nlp, unc, ins, daf, far, nhr, lec, let and lys gene class present in the transcriptome of H. indica IJs. Gene counts for each gene class for C. elegans were taken from wormbase
| Gene class |
|
| |
|---|---|---|---|
| At ≥ 25 % sequence similarity and query coverage | At ≥ 30 % sequence similarity and query coverage | ||
|
| 31 | 25 | 22 |
|
| 44 | 35 | 25 |
|
| 111 | 77 | 69 |
|
| 39 | 33 | 18 |
|
| 34 | 24 | 21 |
|
| 8 | 4 | 0 |
|
| 283 | 98 | 36 |
|
| 12 | 7 | 6 |
|
| 642 | 15 | 13 |
|
| 10 | 0 | 0 |
RNAi effector genes discovered in the IJ stage of the H. indica
| S. No. |
|
|
|
|---|---|---|---|
|
| |||
| Total | 10 | 6 | 2 |
| 1 |
| + | - |
|
|
| + | + |
|
|
| + | - |
|
|
| + | - |
|
|
| + | + |
|
|
| + | - |
|
| |||
| Total | 12 | 4 | 5 |
| 7 |
| + | + |
| 8 |
| + | - |
| 9 |
| + | + |
| 10 |
| - | + |
| 11 |
| + | + |
| 12 |
| - | + |
|
| |||
| Total |
| 3 | 1 |
| 13 |
| + | - |
| 14 |
| - | + |
| 15 |
| + | - |
| 16 |
| + | - |
|
| |||
| Total | 4 | 2 | 2 |
| 17 |
| + | + |
| 18 |
| + | - |
| 19 |
| - | + |
|
| |||
| Total | 9 | 2 | 0 |
| 20 |
| + | - |
| 21 |
| + | - |
|
| |||
| Total | 15 | 7 | 2 |
| 22 |
| + | - |
| 23 |
| + | - |
| 24 |
| + | + |
| 25 |
| + | - |
| 26 |
| + | - |
| 27 |
| + | + |
| 28 |
| + | - |