| Literature DB >> 29204349 |
Vasanthakumar Ponesakki1, Sayan Paul1, Dinesh Kumar Sudalai Mani1, Veeraragavan Rajendiran1, Paulkumar Kanniah1, Sudhakar Sivasubramaniam1.
Abstract
In annelid worms, the nerve cord serves as a crucial organ to control the sensory and behavioral physiology. The inadequate genome resource of earthworms has prioritized the comprehensive analysis of their transcriptome dataset to monitor the genes express in the nerve cord and predict their role in the neurotransmission and sensory perception of the species. The present study focuses on identifying the potential transcripts and predicting their functional features by annotating the transcriptome dataset of nerve cord tissues prepared by Gong et al., 2010 from the earthworm Eisenia fetida. Totally 9762 transcripts were successfully annotated against the NCBI nr database using the BLASTX algorithm and among them 7680 transcripts were assigned to a total of 44,354 GO terms. The conserve domain analysis indicated the over representation of P-loop NTPase domain and calcium binding EF-hand domain. The COG functional annotation classified 5860 transcript sequences into 25 functional categories. Further, 4502 contig sequences were found to map with 124 KEGG pathways. The annotated contig dataset exhibited 22 crucial neuropeptides having considerable matches to the marine annelid Platynereis dumerilii, suggesting their possible role in neurotransmission and neuromodulation. In addition, 108 human stem cell marker homologs were identified including the crucial epigenetic regulators, transcriptional repressors and cell cycle regulators, which may contribute to the neuronal and segmental regeneration. The complete functional annotation of this nerve cord transcriptome can be further utilized to interpret genetic and molecular mechanisms associated with neuronal development, nervous system regeneration and nerve cord function.Entities:
Keywords: Annotation; Eisenia fetida; Nerve cord; Transcriptome
Year: 2017 PMID: 29204349 PMCID: PMC5688751 DOI: 10.1016/j.gdata.2017.10.002
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Summary Statistics of the Newbler and Seqman assembled contigs.
| Statistics | Newbler contigs | Seqman contigs |
|---|---|---|
| Total sequences | 31,114 | 63,602 |
| Total bases | 4,828,470 | 10,463,348 |
| Min sequence length | 72 | 40 |
| Max sequence length | 1395 | 2167 |
| Average sequence length | 155.19 | 164.51 |
| Median sequence length | 115.00 | 128.00 |
| N25 length | 277 | 303 |
| N50 length | 146 | 170 |
| N75 length | 109 | 119 |
| GC% | 39.89% | 39.89% |
Fig. 1(A) A three way Venn diagram denoting the unique and overlapped transcripts annotated against the public databases nr, SwissProt and EST (BLASTX algorithm; E-value threshold 1E-5). (B) The correlation between contigs length and percentage of sequences with BLAST matches. The longer assembled contigs have more number of sequences with BLAST matches against NCBI nr database (cut-off E-value < 1E− 5). (C) The data distribution details obtained from BLAST2GO annotation depicts the InterProScan annotation, BLAST hits, mapping and annotation summary of the assembled contigs.
Fig. 2(A) E-value distribution of the nr BLAST hits for each sequence of contigs with a cut-off E-value of 1E-5. (B) Sequence similarity distribution (in percentage) of top BLAST hits for each contigs sequences. (C) Top hit species distribution summary of the E. fetida contigs annotated against nr database with an E-value cut-off of 1E-5.
Fig. 3Histogram of top 30 InterPro domains distribution obtained from InterProScan annotation of the E. fetida nerve cord transcriptome dataset using the BLAST2GO software.
Fig. 4Histogram representing the gene ontology distribution of the assembled E. fetida contigs. The functionally annotated contigs were classified into three main GO categories: Biological Process (BP), Molecular Function (MF) and Cellular Component (CC).
List of transcripts associated with embryonic development and reproduction.
| SeqName | Description | Sim mean | |
|---|---|---|---|
| Newbler_Contig00646 | Ras-related Rab-2 | 5.46E-13 | 100% |
| Newbler_Contig08599 | cAMP-dependent kinase catalytic subunit beta isoform × 3 | 3.47E-21 | 97% |
| Newbler_Contig11354 | Dihydrolipoyl dehydrogenase | 6.86E-15 | 100% |
| Newbler_Contig12349 | DEAD box ATP-dependent RNA helicase | 3.51E-13 | 100% |
| Newbler_Contig17280 | Pre-mRNA-splicing factor ATP-dependent RNA helicase DHX16 | 1.47E-14 | 98.05% |
| Newbler_Contig19026 | Guanine nucleotide-binding subunit beta-2-like 1 | 2.42E-20 | 99.85% |
| Newbler_Contig19125 | Ras-related Rac1 | 3.65E-15 | 99.85% |
| Newbler_Contig20803 | Transmembrane emp24 domain-containing 2 | 1.25E-100 | 92% |
| Newbler_Contig21587 | Ras-related Rap-1b isoform × 2 | 1.86E-74 | 94.65% |
| Newbler_Contig23046 | DNA excision repair ERCC-1 isoform × 1 | 5.65E-15 | 97% |
| Newbler_Contig24204 | Structural maintenance of chromosomes 1A | 3.58E-21 | 97.30% |
| Newbler_Contig25563 | Centrin-2 isoform × 2 | 1.02E-35 | 100% |
| Newbler_Contig28743 | Calmodulin | 1.00E-51 | 100% |
| Newbler_Contig28993 | Ubiquitin-conjugating enzyme E2 2 | 6.26E-13 | 99.80% |
| Newbler_Contig30617 | BRICK1 | 1.58E-20 | 92.75% |
| Seqman_Contig_4254 | TALDO_DROME ame: Full = Probable transaldolase | 1.49E-35 | 80.40% |
| Seqman_Contig_16470 | Elongation factor 1- oocyte form | 1.10E-23 | 98.60% |
| Seqman_Contig_23100 | frataxin mitochondrial | 6.56E-14 | 87.10% |
| Seqman_Contig_31323 | 14-3-3 zeta | 1.46E-26 | 97.70% |
| Seqman_Contig_40681 | Casein kinase I isoform alpha | 1.23E-20 | 95% |
| Seqman_Contig_50528 | Angiopoietin-related protein 2 | 1.45E-19 | 97% |
| Seqman_Contig_60730 | GTP-binding 128up | 2.73E-25 | 87.70% |
| Seqman_Contig_60731 | 14–3-3 GF14 omicron | 3.95E-12 | 99.70% |
Fig. 5(A) Distribution of the enzyme classes in the annotated contig sequences of earthworm E. fetida. (B) Histogram presenting the Clusters of Orthologous Groups (COG) classification of the assembled contig sequences.
Fig. 6Pie chart denoting the distribution of top 20 KEGG pathways associated with the annotated contigs of earthworm Eisenia fetida.
Fig. S1Bar chart representing top 30 functionally enriched gene ontology terms from three different categories (Biological Process, Cellular Component and Molecular Function), associated with the nerve cord transcripts of earthworm E. fetida in comparison to their whole genome dataset.
List of Neuropeptides with their gene ontology details.
| Sl. No. | SeqName | Description | Length | Sim mean | GO names | |
|---|---|---|---|---|---|---|
| 1 | Newbler_Contig0720505 | CLCCY neuropeptide precursor [ | 235 | 4.58E-07 | 70 | P:neuropeptide signaling pathway; C:extracellular region |
| 2 | Newbler_Contig08652 | XPRLamide neuropeptide precursor [Eisenia fetida] | 183 | 1.03E-32 | 100 | P:signal transduction |
| 3 | Newbler_Contig11852 | XPRLamide neuropeptide precursor [Eisenia fetida] | 146 | 1.03E-21 | 100 | P:signal transduction |
| 4 | Newbler_Contig13284 | FMRF-amide neuropeptides-like [ | 237 | 7.94E-16 | 67.5 | P:neuropeptide signaling pathway |
| 5 | Newbler_Contig18293 | whitnin-1 neuropeptide precursor SPTR [ | 254 | 1.54E-16 | 71.67 | C:cellular_component |
| 6 | Newbler_Contig24715 | FMRFamide precursor [ | 485 | 2.48E-06 | 66 | P:neuropeptide signaling pathway |
| 7 | Newbler_Contig25668 | neuropeptide F [ | 309 | 4.55E-08 | 69.33 | P:neuropeptide signaling pathway; C:membrane; C:integral component of membrane; C:extracellular region; F:hormone activity |
| 8 | Newbler_Contig26118 | SVPGVLRF-amide 3 [ | 305 | 4.87E-17 | 61.56 | C:extracellular space; P:neuropeptide signaling pathway; P:embryo development ending in birth or egg hatching; F:neuropeptide hormone activity; C:extracellular region; F:neuropeptide receptor binding; F:molecular_function; F:neuropeptide Y receptor binding |
| 9 | Newbler_Contig26513 | whitnin-1 neuropeptide precursor SPTR [ | 577 | 7.10E-20 | 77.83 | P:signal transduction; C:cellular_component |
| 10 | Newbler_Contig29900 | WI neuropeptide partial [ | 351 | 1.41E-15 | 77 | P:neuropeptide signaling pathway |
| 11 | Newbler_Contig30157 | prohormone-3 neuropeptide precursor [ | 481 | 1.83E-43 | 60.5 | P:neuropeptide signaling pathway |
| 12 | Seqman_Contig_11518 | neuropeptide Y [ | 224 | 4.04E-14 | 73.25 | P:neuropeptide signaling pathway; C:extracellular region; F:hormone activity |
| 13 | Seqman_Contig_11560 | MIP allatostatin B neuropeptide precursor [ | 816 | 1.01E-17 | 52.5 | P:neuropeptide signaling pathway |
| 14 | Seqman_Contig_18656 | pedal peptide neuropeptide precursor 2 MLDpeptide [ | 231 | 1.10E-10 | 61 | P:neuropeptide signaling pathway |
| 15 | Seqman_Contig_1992 | WI neuropeptide partial [ | 253 | 2.76E-12 | 84 | P:neuropeptide signaling pathway |
| 16 | Seqman_Contig_20437 | Mesotocin-neurophysin partial [ | 280 | 7.46E-10 | 57.75 | P:neuropeptide signaling pathway; F:neurohypophyseal hormone activity; C:extracellular region |
| 17 | Seqman_Contig_2485 | whitnin-1 neuropeptide precursor SPTR [ | 695 | 6.60E-28 | 72.2 | P:neuropeptide signaling pathway |
| 18 | Seqman_Contig_27117 | allatostatin-C neuropeptide precursor [ | 682 | 1.59E-15 | 67 | P:neuropeptide signaling pathway |
| 19 | Seqman_Contig_3225 | FVRIamide neuropeptide precursor [ | 296 | 2.35E-07 | 57.75 | P:neuropeptide signaling pathway |
| 20 | Seqman_Contig_36821 | oxytocin-neurophysin 1 [ | 153 | 2.43E-08 | 68.27 | P:neuropeptide signaling pathway; F:neurohypophyseal hormone activity; C:extracellular region |
| 21 | Seqman_Contig_39560 | allatostatin-C neuropeptide precursor [ | 604 | 1.80E-06 | 80 | P:neuropeptide signaling pathway |
| 22 | Seqman_Contig_40507 | XPRLamide neuropeptide precursor [ | 214 | 7.63E-39 | 80 | P:neuropeptide signaling pathway |
| 23 | Seqman_Contig_40845 | LYamide FDSIG neuropeptide precursor [ | 136 | 2.23E-09 | 79.5 | P:neuropeptide signaling pathway |
| 24 | Seqman_Contig_5359 | CLCCY neuropeptide precursor [ | 314 | 2.46E-07 | 64.5 | P:neuropeptide signaling pathway; F:ion channel inhibitor activity; C:extracellular region; P:pathogenesis |
| 25 | Seqman_Contig_55026 | whitnin-1 neuropeptide precursor SPTR [ | 160 | 9.93E-12 | 70.75 | C:membrane |
| 26 | Seqman_Contig_57817 | FMRF-amide neuropeptides-like [ | 258 | 1.16E-17 | 62.63 | P:neuropeptide signaling pathway |
| 27 | Seqman_Contig_58801 | whitnin-1 neuropeptide precursor SPTR [ | 605 | 2.32E-12 | 95.5 | P:neuropeptide signaling pathway; C:integral component of membrane |
| 28 | Seqman_Contig_59269 | QERAS neuropeptide precursor [ | 235 | 5.04E-08 | 94 | P:neuropeptide signaling pathway |
| 29 | Seqman_Contig_59283 | neuropeptide F [ | 591 | 6.28E-14 | 70.4 | P:neuropeptide signaling pathway; C:extracellular region; F:hormone activity |
| 30 | Seqman_Contig_59977 | prohormone-3 neuropeptide precursor [ | 473 | 1.87E-12 | 49 | P:neuropeptide signaling pathway |
| 31 | Seqman_Contig_60850 | CLCCY neuropeptide precursor [ | 340 | 3.14E-09 | 72 | P:neuropeptide signaling pathway; C:extracellular region |
| 32 | Seqman_Contig_61369 | Neuroendocrine 7B2 [ | 344 | 1.19E-34 | 63.15 | P:neuropeptide signaling pathway; C:secretory granule |
| 33 | Seqman_Contig_61694 | prohormone-3 neuropeptide precursor [ | 619 | 1.78E-28 | 52.5 | P:neuropeptide signaling pathway |
| 34 | Seqman_Contig_62842 | XPRLamide neuropeptide precursor [ | 233 | 2.59E-26 | 95 | P:neuropeptide signaling pathway |
| 35 | Seqman_Contig_7798 | neuropeptide precursor [ | 830 | 1.08E-10 | 54 | P:neuropeptide signaling pathway |
| 36 | Seqman_Contig_7835 | MIP allatostatin B neuropeptide precursor [ | 191 | 6.26E-09 | 68.5 | P:neuropeptide signaling pathway |
| 37 | Seqman_Contig_8949 | MIP allatostatin B neuropeptide precursor [ | 1026 | 4.21E-10 | 53.5 | P:neuropeptide signaling pathway |
Fig. S2Phylogenetic tree representing the evolutionary relationship of Eisenia fetida calcium binding proteins to their orthologs in the leech Helobdella robusta and the polychaete Capitella teleta. The tree was constructed using the UPGMA method and bootstrapping of 100 replicates.