Literature DB >> 28241869

De novo transcriptome assembly, functional annotation and differential gene expression analysis of juvenile and adult E. fetida, a model oligochaete used in ecotoxicological studies.

Michelle Thunders1, Jo Cavanagh2, Yinsheng Li3.   

Abstract

BACKGROUND: Earthworms are sensitive to toxic chemicals present in the soil and so are useful indicator organisms for soil health. Eisenia fetida are commonly used in ecotoxicological studies; therefore the assembly of a baseline transcriptome is important for subsequent analyses exploring the impact of toxin exposure on genome wide gene expression.
RESULTS: This paper reports on the de novo transcriptome assembly of E. fetida using Trinity, a freely available software tool. Trinotate was used to carry out functional annotation of the Trinity generated transcriptome file and the transdecoder generated peptide sequence file along with BLASTX, BLASTP and HMMER searches and were loaded into a Sqlite3 database. To identify differentially expressed transcripts; each of the original sequence files were aligned to the de novo assembled transcriptome using Bowtie and then RSEM was used to estimate expression values based on the alignment. EdgeR was used to calculate differential expression between the two conditions, with an FDR corrected P value cut off of 0.001, this returned six significantly differentially expressed genes. Initial BLASTX hits of these putative genes included hits with annelid ferritin and lysozyme proteins, as well as fungal NADH cytochrome b5 reductase and senescence associated proteins. At a cut off of P = 0.01 there were a further 26 differentially expressed genes.
CONCLUSION: These data have been made publicly available, and to our knowledge represent the most comprehensive available transcriptome for E. fetida assembled from RNA sequencing data. This provides important groundwork for subsequent ecotoxicogenomic studies exploring the impact of the environment on global gene expression in E. fetida and other earthworm species.

Entities:  

Keywords:  E. fetida; Earthworm; Ecotoxicology; Gene expression; RNA; Sequence; Transcriptome; Trinity

Mesh:

Substances:

Year:  2017        PMID: 28241869      PMCID: PMC5327576          DOI: 10.1186/s40659-017-0114-y

Source DB:  PubMed          Journal:  Biol Res        ISSN: 0716-9760            Impact factor:   5.612


Background

Earthworms are exposed to environmental contaminants through their intestine, skin, and alimentary and dermal uptake routes and are widely used as a model organism for assessing impact of environmental chemicals and toxins [1-3]. Earthworms are sensitive to soil pollution that arises from intensive use of biocides in agriculture, industrial activity and atmospheric deposition [4] and so are useful bio-indicator organisms (Sivakumar). Eisenia fetida is a commonly used test species in ecotoxicological studies [5]. Molecular genetic data are now available on the response of worms to toxin exposure, particularly focusing on the antioxidant and stress protein response to pollutants [6-8]. In order to establish the impact of exogenous toxins on gene expression at the genome level it is first important to assemble a baseline transcriptome for the model organism. Gong et al. produced transcriptome wide oligonucleotide probes for E. fetida in their [9] report, comprising 63541 ESTs, but a complete annotated transcriptome for E. fetida has not been available thus far for this organism. Many environmental toxins have putative roles in impeding reproduction [10], therefore identifying differentially expressed genes between adult and juvenile E. fetida can be useful to identify possible genes associated with sexual maturity and reproduction that may be more susceptible to such toxins.

Methods

Worms were established in a composting wormery from an initial population of 150 E. fetida. A box of worms was bought from Bunnings (Lyall Bay. Wellington, NZ) and grown in a composting wormery supplemented with fruit and vegetable waste for 6 months. Adult worms were selected on the basis of well-developed clitella, present only at sexual maturity; juveniles were selected on the basis of the absence of well-developed clitella. Adult worms were dissected to enhance samples for the reproductive organs including gonadal structures. For each extraction four gonadal segment samples were pooled to maximize RNA content. RNA was extracted from pooled juvenile and fully clitellated adult samples respectively and then sequenced using Illumina HiSeq 2500, sequencing was carried out by New Zealand Genomics Ltd (http://www.nzgenomics.co.nz/). Prior to RNA extraction, worms were removed, washed with RNAse, DNAse free water and then used for total RNA extraction. The total RNA was isolated using the QIAGEN Mini kit and by following the manufacturers protocol. Extra vigilance was taken to minimise degradation of RNA; RNAse zap was used to clean all dissection tools and equipment used in the RNA extraction process. Trimmomatic was used to quality trim reads using the default settings and Trinity v2.1.1, and Edger 3.3 were used for de novo transcriptome assembly,including, identification of candidate coding regions, and differential gene expression analysis as outlined by Haas et al. [11]. Transdecoder v2.01, Trinotate v2.0.2, BLASTX v2.3.1, BLASTP v2.3.0+ and HMMER 3.0 searches were used for functional annotation of the transcriptome produced and to populate an Sqlite database. All bioinformatic analyses were carried out via VPN connection to a remote high speed computer (CPU 16) on the BioIT, NZGL network via a standard PC laptop.

Results

De novo transcriptome assembly

Trinity is a freely available software tool used for de novo transcriptome assembly that uses three software modules (Inchworm, Chrysalis and Butterfly) to assemble a de novo transcriptome when no model data is available [12], it does this by first assembling the RNA sequence data into transcripts or ‘genes’, clustering these contigs, constructing de Bruijn graphs and then processing the graphs to report full length transcripts for all alternatively spliced isoforms. The resulting transcriptome file was generated in fasta (.fq) format and was 32 MB in size comprising 74,381 ‘genes’, 90,862 ‘transcripts’ and with a GC percentage of 41.74. The N50 was 339 and 329 for transcripts and genes, respectively, which is the maximum length where at least 50% total assembled sequence resides in contigs of at least that length; further details on the transcriptome are found in Table 1. At a stringency of a minimum 5 TPM there were 21282 estimated ‘genes’.
Table 1

Summary of constituent data of Trinity-assembled E. fetida transcriptome

Total Trinity ‘genes’74,381
Total Trinity transcripts90,862
Percent GC41.74
Stats based on ALL transcript contigs
 Contig N10788
 Contig N20586
 Contig N30472
 Contig N40395
 Contig N50339
 Median contig length276
 Average contig length340.18
 Total assembled bases30,909,011
Stats based on ONLY LONGEST ISOFORM per ‘GENE’
 Contig N10773
 Contig N20573
 Contig N30460
 Contig N40384
 Contig N50329
 Median contig length272
 Average contig length334.57
 Total assembled bases24,885,474
Summary of constituent data of Trinity-assembled E. fetida transcriptome

Transcriptome annotation

Transdecoder 2.0.1 (http://transdecoder.sf.net) was used to identify candidate coding regions within the generated transcriptome and looked for an open reading frame of at least 100 amino acids long. This generated four output files (.pep peptide sequence for final candidate ORFs;.cds nucleotide sequence for coding regions;.gff3 position in target transcript of final ORFs and.bed format describing ORF that can be used in IGV). A BLASTP v2.3.0+ search against known proteins and HMMER 3.0 search to identify common protein domains were also carried out. Trinotate was used to carry out functional annotation of the transcriptome using the Trinity generated transcriptome file and the transdecoder generated peptide sequence file for final candidate ORFs, along with BLASTX v2.3.1, BLASTP v2.3.0+ and HMMER 3.0 searches and were loaded into an Sqlite database (see Additional file 1: Table S1). Figure 1 summarises the annotation data based on the classification by PFAM Gene Ontology (GO), the figure shows frequency of genes grouped on the basis of the GO classification that have a frequency in the annotated transcriptome of at least 15, the most frequent classification of genes were in the protein and metal binding category and cellular component membrane. Interestingly there were also over 30 hits with methyltransferases indicative that a potential DNA methylation mechanism could exist in the earthworm [13].
Fig. 1

Analysis of transcriptome annotation by frequency of gene ontology description

Analysis of transcriptome annotation by frequency of gene ontology description

Differential gene expression analysis

To identify differentially expressed transcripts; RSEM was first used to estimate expression values. Each of the six original.fasta raw sequence files (three adult and three juvenile pooled paired samples) were aligned to the de novo assembled Trinity.fasta transcript using Bowtie and RSEM used to estimate expression values based on the alignment. EdgeR was then used to calculate differential expression between the two conditions; with a P value cut off of 0.001, this returned six significantly differentially expressed genes, (most significant FDR) (Table 2). Initial BLASTX hits (Table 2) included hits with annelid ferritin and lysozyme proteins, as well as fungal NADH cytochrome b5 reductase and senescence associated proteins. At a cut off of P = 0.01 there were a further 26 differentially expressed genes.
Table 2

BLASTX v2.3.1 results of differentially expressed genes between adult and juvenile samples

Gene IDP value and FDRSequenceBLASTX
TRINITY_DN212688_c0_g3_i15.70E−090.000322CCTCAGTATGAGGATCCTCGTGCTTGATGGCCAACTTGTGCAATTCCAGCAAAGCCTCGTTAAGGCTCTTCTCCATATCCAGAGATGTCTGCATTGCTCCCAATGCGGTTCCCCAGGAGTCAACAGCCGGTTTCGAGACGTCGTCAAGAACAACGCAACCGCCACGCTTGTTCATGTACTTCATCATGTCACCAGCATGGCTTCTCTCTTCCTTTGAGTTTTCACTGAAGAACTTCGATAAGCTAGGCAGAGCCACTGAGTCATTATCAAAGTAGATCGCCATCGAATGAAAGTTGTAGGAAGCCTCAAGCTCCAGATTGATCTGCTTGTTCAGACACTTCTCAACCTCGACATGGAAGTTCTGACGAATCTGCGAGTCAGACAAAGATTCFerritin (Eisenia andrei)
TRINITY_DN228981_c22_g3_i11.15E−080.000322TTTATCGTCTTTTTACTTATTCGCTGAAGCGGAGCTGGGTTTAACGACCCACCTTTTGGCTTTAAGGTCCTATACGGGCTGATCCGGGTTGAAGACATTGTCAGGTGGGGAGTTTGGCTGGGGCGGCACATCTGTTAAACCATAACGCAGGTGTCCTAAGGGGGACTCATGGAGAACAGAAATCTCCAGTAGAGCAAAAGGGCAAAAGTCCCCTTGATTTTGATTTTCAGTGTGAATACAAACCATGAAAGSenescence associated protein (Cennococcum geophilum)
TRINITY_DN211598_c1_g4_i11.49E−080.000322AGGTAAAGTTAACCAATGTAGTGGTGGCTACTGTGGACCGTATAAAATCTCGGAACCTTACTATAAGGACTGCGGAAAGCCAGGAGCCGGCTACCAACGATGTACTAAGCAGAAAGCTTGTTCGGAATTGTGCATCAAGTCTTACATGGAGCGCTACGCCACGAAATGTCCCAGACCTATTACCTGTGAAGACTATGCTCGCGTACACGTAGGAGGACCTGGCGGATGCACGAAGCCTTCAACCGLysozyme (Eisenia fetida)
TRINITY_DN203900_c3_g1_i14.58E−080.000741GAAAATCCTAGGGAACGAATAATTTTCACGCCTGGTCGTACTCATAACCGCAGCAGGTCTCCAAGGTGAACAGCCTCTAGTTGATAGAACAATGTAGATAAGGGAAGTCGGCAAAATAGATCCGTAACTTCGGGATAAGGATTGGCTCTAAGGGTTGGGTGCGTTGGGCCTTGAGCAGAAGCCCTGGGAGCAGGTTGGCACTAGCCTCACGGCCGGCGCCTTCCAGCACCCGGTGGCGGACGCTCTTGGCAGGGTTCGCCCGTCCGGCGCACGCTTAACAACCAACTTAGAACTGGTACGGACAAGGGGAATCTGACTGHypthetical protein (Tricoderma virens)
TRINITY_DN218384_c10_g9_i18.03E−080.000948GTTTCTTTTCCTCCGCTTATTGATATGCTTAAGTTCAGCGGGTATCCCTACCTGATCCGAGGTCAAAAGTGATAAATAATCTTGATGGATGCCAACAAAAATTGGTCTGTCACGCAAATTGTGCTGCGCTTCAAACCATTATATTAGCTGCCAATATATTTAAGGCGAGTCTAGGCTAAGCAAAGACAAACACCCAACACCAAGCAAGGCTTGAGAGTACAAATGACGCTCGAACAGGCATGCCCCCCGGAATACCAGGGGGCGCAATGTGCGTTCAAAGATTCGATGATTCACTGAATTCTGCAATTCACACTACTTATCGCATTTCGCTGCGTTCTTCATCGATGCCAGAACCAAGAGATCCGTTGTTGAAAGTTGTAATTATTAAATTTATTCAGACGCTGATTGAAAATAAAAAGGTTATAGAGTTGTCCAGCCGGCAGGCAAGCCTGCCGAGGAAACATGAGTGCGCAAAAAGTCAGGGGTTAAGAAAAGAGGCGTACTTCTTCTCAAGCAAGGCCCAAGAAGTTTCCGTCTCCCTCAAACAGTAGTAAACTACTGAAATTAATGATCCTTCCGCAGGTTCACCTACGGAAANADH cytochrome b5 reductase (Aspergillus)
TRINITY_DN13199_c0_g1_i18.80E−080.000948CCCCGTTACCCGTTGATACCATGGTAGGCCACTATCCTACCATCGAAAGTTGATAGGGCAGAAATTTGAATGAACCATCGCCGGCGCAAGGCCATGCGATTCGTTAAGTTATTATGATTCACCAAGGAGCCCCGAAGGGCATTGGTTTTTTATCTAATAAATACACCCCTTCCGAAGTCGAGGTTTTTAGCATGTATTAGCTCTAGAATTACCACGGTTATCCAGGTAAAGGTACTATCAAATAAACGATAHypothetical protein (Capitella teleta)
BLASTX v2.3.1 results of differentially expressed genes between adult and juvenile samples

Discussion

Differential gene expression analysis was carried out on the de novo generated transcriptome to compare adult and juvenile samples. At a threshold of significance of P = 0.001 there were six significantly differentially expressed genes identified between adult and juvenile, potentially highlighting genes associated with sexual maturity for follow up in subsequent analyses with this and other earthworm species. Such targets may be important for investigating the impact of toxins on reproductive capability. The annotation of the transcriptome was dominated by protein and metal ion binding proteins, as expected, but there was also a suggestion of potential epigenetic mechanisms for gene control existing in E. fetida with methyltransferase function also occurring with relatively high frequency in the annotated transcript. This supports of the findings of Santoyo et al. [6] in that earthworms may be useful in experiments investigating the impact of terrestrial toxins on DNA methylation. From the perspective of genes involved specifically in reproduction, the annotation identified multiple sperm, spermatogenesis, testis, ova and other and reproductive associated proteins that could be potentially good targets for further studies exploring the impact of toxin on reproductive capability. Subsequent studies investigating a potential epigenome in E. fetida could also provide important data in terms of further understanding the relationship between environmental toxins and gene expression. Future work will involve using both transcriptome and epigenome analysis to explore the impact of the environment on gene expression and health in E. fetida and other earthworm species.

Conclusions

This report illustrates the scope of what can be done with limited resources using Trinity and Trinotate freeware to assemble and annotate a RNA sequenced derived transcriptome de novo. The data generated have been made publicly available. To our knowledge this is the most comprehensive transcriptome for E. fetida and will provide an important baseline for further ecotoxicogenomic studies focusing on the impact of environmental toxins on global gene expression in E. fetida and other earthworm species.
  13 in total

1.  Possible functions of oxytocin/vasopressin-superfamily peptides in annelids with special reference to reproduction and osmoregulation.

Authors:  Y Fujino; T Nagahama; T Oumi; K Ukena; F Morishita; Y Furukawa; O Matsushima; M Ando; H Takahama; H Satake; H Minakata; K Nomoto
Journal:  J Exp Zool       Date:  1999-09-01

Review 2.  Epigenetics, spermatogenesis and male infertility.

Authors:  Singh Rajender; Kelsey Avery; Ashok Agarwal
Journal:  Mutat Res       Date:  2011-04-16       Impact factor: 2.433

3.  Biomarker response in the earthworm Lumbricus terrestris exposed to chemical pollutants.

Authors:  A Calisi; M G Lionetto; T Schettino
Journal:  Sci Total Environ       Date:  2011-07-23       Impact factor: 7.963

4.  Global DNA methylation in earthworms: a candidate biomarker of epigenetic risks related to the presence of metals/metalloids in terrestrial environments.

Authors:  María Maldonado Santoyo; Crescencio Rodríguez Flores; Adolfo Lopez Torres; Kazimierz Wrobel; Katarzyna Wrobel
Journal:  Environ Pollut       Date:  2011-07-22       Impact factor: 8.071

5.  Effects of historic metal(loid) pollution on earthworm communities.

Authors:  Thibaut Lévêque; Yvan Capowiez; Eva Schreck; Stéphane Mombo; Christophe Mazzia; Yann Foucault; Camille Dumat
Journal:  Sci Total Environ       Date:  2015-01-21       Impact factor: 7.963

Review 6.  Effects of metals on earthworm life cycles: a review.

Authors:  S Sivakumar
Journal:  Environ Monit Assess       Date:  2015-07-28       Impact factor: 2.513

7.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

8.  Molecular toxicity of earthworms induced by cadmium contaminated soil and biomarkers screening.

Authors:  Xiaohui Mo; Yuhui Qiao; Zhenjun Sun; Xiaofei Sun; Yang Li
Journal:  J Environ Sci (China)       Date:  2012       Impact factor: 5.565

9.  Design, validation and annotation of transcriptome-wide oligonucleotide probes for the oligochaete annelid Eisenia fetida.

Authors:  Ping Gong; Mehdi Pirooznia; Xin Guan; Edward J Perkins
Journal:  PLoS One       Date:  2010-12-08       Impact factor: 3.240

10.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

View more
  6 in total

1.  The transcriptome of anterior regeneration in earthworm Eudrilus eugeniae.

Authors:  Sayan Paul; Subburathinam Balakrishnan; Arun Arumugaperumal; Saranya Lathakumari; Sandhya Soman Syamala; Vaithilingaraja Arumugaswami; Sudhakar Sivasubramaniam
Journal:  Mol Biol Rep       Date:  2020-12-11       Impact factor: 2.316

Review 2.  A simple guide to de novo transcriptome assembly and annotation.

Authors:  Venket Raghavan; Louis Kraft; Fantin Mesny; Linda Rigerte
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

3.  Annotation of nerve cord transcriptome in earthworm Eisenia fetida.

Authors:  Vasanthakumar Ponesakki; Sayan Paul; Dinesh Kumar Sudalai Mani; Veeraragavan Rajendiran; Paulkumar Kanniah; Sudhakar Sivasubramaniam
Journal:  Genom Data       Date:  2017-10-12

4.  The up-regulation of two identified wound healing specific proteins-HSP70 and lysozyme in regenerated Eisenia fetida through transcriptome analysis.

Authors:  Yuwei Yang; Yujie Sun; Na Zhang; Jianhao Li; Chenning Zhang; Xiaojie Duan; Yuting Ding; Renyun Zhao; Zhuhong Zheng; Di Geng; Yikun Sun
Journal:  J Ethnopharmacol       Date:  2019-03-19       Impact factor: 4.360

5.  Inferring the genetic responses to acute drought stress across an ecological gradient.

Authors:  Jessica K Devitt; Albert Chung; John J Schenk
Journal:  BMC Genomics       Date:  2022-01-04       Impact factor: 3.969

6.  Species-specific transcriptomic changes upon respiratory syncytial virus infection in cotton rats.

Authors:  Britton A Strickland; Seesandra V Rajagopala; Arash Kamali; Meghan H Shilts; Suman B Pakala; Marina S Boukhvalova; Shibu Yooseph; Jorge C G Blanco; Suman R Das
Journal:  Sci Rep       Date:  2022-10-04       Impact factor: 4.996

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.