Literature DB >> 27909700

Dataset for a Dugesia japonica de novo transcriptome assembly, utilized for defining the voltage-gated like ion channel superfamily.

John D Chan1, Dan Zhang1, Xiaolong Liu1, Magdalena Z Zarowiecki2, Matthew Berriman2, Jonathan S Marchant3.   

Abstract

This data article provides a transcriptomic resource for the free living planarian flatworm Dugesia japonica related to the research article entitled 'Utilizing the planarian voltage-gated ion channel transcriptome to resolve a role for a Ca2+ channel in neuromuscular function and regeneration (J.D. Chan, D. Zhang, X. Liu, M. Zarowiecki, M. Berriman, J.S. Marchant, 2016) [1]. Data provided in this submission comprise sequence information for the unfiltered de novo assembly, the filtered assembly and a curated analysis of voltage-gated like (VGL) ion channel sequences mined from this resource. Availability of this data should facilitate further adoption of this model by laboratories interested in studying the role of individual genes of interest in planarian physiology and regenerative biology.

Entities:  

Year:  2016        PMID: 27909700      PMCID: PMC5124351          DOI: 10.1016/j.dib.2016.11.022

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data Provision of a de novo transcriptome assembly for Dugesia japonica will act as a resource to facilitate investigation of the role of individual genes in this model system. Curation of the voltage-gated ion channel like superfamily in this system provides a benchmark for further annotation and study of the role of these channels in planarian regenerative physiology.

Data

The dataset of this article comprises three data files as follows: (i) Dataset 1. FASTA file (raw_trinity_assembly.fasta) of the unfiltered Trinity assembly, (ii) Dataset 2. FASTA file (filtered_trinity_cds.fasta) of the filtered Trinity assembly containing 44,857 contigs, and (iii) Table 1: voltage gated like (VGL) ion channel sequences resolved from the Dugesia japonica transcriptome. Contig IDs for ion channel sequences contained in the D. japonica de novo assembly organized by putative VGL ion channel family following manual inspection of transmembrane helix organization, structural motifs and ion selectivity residues. FPKM values reflect expression levels in whole (non-regenerating) animals. Additional analysis of these datasets are presented in the associated publication (‘Utilizing the planarian voltage-gated ion channel transcriptome to resolve a role for a Ca2+ channel in neuromuscular function and regeneration’, Chan et al. [1]).

Experimental design, materials and methods

Sequencing was performed on individuals from a clonal, asexual laboratory strain of the planarian D. japonica (GI strain). In order to sample a diversity of expressed transcripts, total RNA was extracted from intact (non-regenerating) worms (3 biological replicates of 100 individuals), as well as anterior worm fragments harvested at various intervals following tail amputation (1, 12, 24 h; 3 biological replicates of 200 heads per time point) using Trizol reagent. mRNA was purified using oligo(dT) beads (Dynal), yielding approximately 2 µg mRNA per biological sample. RNA-seq libraries were prepared according to the Illumina mRNA-Seq Sample Prep kit and Illumina TruSeq kit manufacturer protocols. Libraries were sequenced on Illumina HiSeq 2000 machines (Sanger Center, Hinxton) and the resulting 100 bp paired end reads were processed with Trimmomatic version 0.22 [2] to remove adapter sequences and low quality reads (sliding window quality filter, window size=4, minimum average quality score=25) while retaining reads ≥50 bp. In order to generate the de novo transcriptome assembly, overlapping paired-end reads were merged using FLASH [3] and fed into the Trinity pipeline [4], carried out with a minimum k-mer coverage of 2 and default k-mer size of 25. Graphs not resolving within a 6 h window were excised to allow the assembly to proceed and the minimum contig or transcript length was set to 100 nt. Relative transcript abundance was estimated using bowtie (version 2) to align trimmed reads to the de novo assembly and RSEM (version 1.2.11) to quantify read mapping, yielding FPKM (Fragments Per Kilobase of transcript per Million mapped reads) values for each contig. Assembled contigs were annotated using the TransDecoder package to predict translated open reading frames, which were searched against the NCBI Conserved Domain Database. The initial Trinity de novo assembly of D. japonica RNAseq data produced a dataset with 195,271 sequences and an N50 of 1,587 bp. This number of contigs exceeds the number of predicted gene models in published flatworm genomes [5], [6], [7], [8], [9], likely due to a high number of redundant or incorrectly/partially assembled transcripts in the D. japonica assembly. Therefore, this preliminary dataset was filtered to retain only (i) sequences with predicted open reading frames (ORFs) ≥100 amino acids that contain an assignment to a known PFAM structural domain, or (ii) sequences with predicted open reading frames (ORFs) ≥100 amino acids that were evidenced by read mapping (FPKM value ≥1). The resulting filtered assembly retained 44,857 sequences with an N50 of 2444 bp. Sequences from the unfiltered assembly are provided as Dataset 1. The filtered ORF assembly of 44,857 sequences is provided Dataset 2. Sequences belonging to the voltage-gated like ion channel superfamily were then curated by searching the translated D. japonica transcriptome for Pfam protein family hits corresponding to domains such as ion transport (PF00520, PF07885, PF08412), Cav (PF08763), Nav (PF06512), PKD (PF08016), BK (PF03493), SK (PF035630) or cyclic nucleotide gated channels (PF08412, PF00027). Sequences were inspected to confirm the presence of the appropriate number of transmembrane helixes and pore forming domains and expected architecture/topology for each family of ion channels. This analysis resulted in the prediction of 114 unique pore-containing channel sequences that could be assigned to VGL ion channel families. The appended Table 1 details the contig identifier and assignment of each of the D. japonica sequences based upon our assembly and current filtering methods. Within each class, assignments are ordered by FPKM values (fragments per kilobase of transcript per million mapped reads) to convey which transcripts predominate within each class of channels.
Subject areaBiology
More specific subject areaTranscriptomics
Type of dataSequence data (2 files), Summary Table referencing VGL information
How data was acquiredRNAseq, de novo assembly, post-hoc curation
Data formatText files (.fasta). Unfiltered assembly (Dataset 1), Filtered assembly (Dataset 2). Table (.xls) of evidenced voltage-gated like ion channels (Table 1)
Experimental factorsSequencing was performed on samples from a clonal, asexual laboratory strain of the planarian D. japonica (GI strain) and this data used to generate a de novo transcriptome assembly.
Experimental featuresThe resulting assembly was analyzed for the presence of members of the voltage-gated like (VGL) superfamily of ion channels.
Data source locationn/a
Data accessibilityAnalyzed and filtered datasets are contained within this article.
  9 in total

1.  FLASH: fast length adjustment of short reads to improve genome assemblies.

Authors:  Tanja Magoč; Steven L Salzberg
Journal:  Bioinformatics       Date:  2011-09-07       Impact factor: 6.937

2.  Utilizing the planarian voltage-gated ion channel transcriptome to resolve a role for a Ca2+ channel in neuromuscular function and regeneration.

Authors:  John D Chan; Dan Zhang; Xiaolong Liu; Magdalena Zarowiecki; Matthew Berriman; Jonathan S Marchant
Journal:  Biochim Biophys Acta Mol Cell Res       Date:  2016-10-19       Impact factor: 4.739

3.  SmedGD 2.0: The Schmidtea mediterranea genome database.

Authors:  Sofia M C Robb; Kirsten Gotting; Eric Ross; Alejandro Sánchez Alvarado
Journal:  Genesis       Date:  2015-07-17       Impact factor: 2.487

4.  Characterization of a stable form of tryptophan hydroxylase from the human parasite Schistosoma mansoni.

Authors:  F F Hamdan; P Ribeiro
Journal:  J Biol Chem       Date:  1999-07-30       Impact factor: 5.157

5.  The genome of the blood fluke Schistosoma mansoni.

Authors:  Matthew Berriman; Brian J Haas; Philip T LoVerde; R Alan Wilson; Gary P Dillon; Gustavo C Cerqueira; Susan T Mashiyama; Bissan Al-Lazikani; Luiza F Andrade; Peter D Ashton; Martin A Aslett; Daniella C Bartholomeu; Gaelle Blandin; Conor R Caffrey; Avril Coghlan; Richard Coulson; Tim A Day; Art Delcher; Ricardo DeMarco; Appolinaire Djikeng; Tina Eyre; John A Gamble; Elodie Ghedin; Yong Gu; Christiane Hertz-Fowler; Hirohisha Hirai; Yuriko Hirai; Robin Houston; Alasdair Ivens; David A Johnston; Daniela Lacerda; Camila D Macedo; Paul McVeigh; Zemin Ning; Guilherme Oliveira; John P Overington; Julian Parkhill; Mihaela Pertea; Raymond J Pierce; Anna V Protasio; Michael A Quail; Marie-Adèle Rajandream; Jane Rogers; Mohammed Sajid; Steven L Salzberg; Mario Stanke; Adrian R Tivey; Owen White; David L Williams; Jennifer Wortman; Wenjie Wu; Mostafa Zamanian; Adhemar Zerlotini; Claire M Fraser-Liggett; Barclay G Barrell; Najib M El-Sayed
Journal:  Nature       Date:  2009-07-16       Impact factor: 49.962

6.  SmedGD: the Schmidtea mediterranea genome database.

Authors:  Sofia M C Robb; Eric Ross; Alejandro Sánchez Alvarado
Journal:  Nucleic Acids Res       Date:  2007-09-18       Impact factor: 16.971

7.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

8.  Unusually Large Number of Mutations in Asexually Reproducing Clonal Planarian Dugesia japonica.

Authors:  Osamu Nishimura; Kazutaka Hosoda; Eri Kawaguchi; Shigenobu Yazawa; Tetsutaro Hayashi; Takeshi Inoue; Yoshihiko Umesono; Kiyokazu Agata
Journal:  PLoS One       Date:  2015-11-20       Impact factor: 3.240

9.  The genomes of four tapeworm species reveal adaptations to parasitism.

Authors:  Isheng J Tsai; Magdalena Zarowiecki; Nancy Holroyd; Alejandro Garciarrubio; Alejandro Sánchez-Flores; Karen L Brooks; Alan Tracey; Raúl J Bobes; Gladis Fragoso; Edda Sciutto; Martin Aslett; Helen Beasley; Hayley M Bennett; Xuepeng Cai; Federico Camicia; Richard Clark; Marcela Cucher; Nishadi De Silva; Tim A Day; Peter Deplazes; Karel Estrada; Cecilia Fernández; Peter W H Holland; Junling Hou; Songnian Hu; Thomas Huckvale; Stacy S Hung; Laura Kamenetzky; Jacqueline A Keane; Ferenc Kiss; Uriel Koziol; Olivia Lambert; Kan Liu; Xuenong Luo; Yingfeng Luo; Natalia Macchiaroli; Sarah Nichol; Jordi Paps; John Parkinson; Natasha Pouchkina-Stantcheva; Nick Riddiford; Mara Rosenzvit; Gustavo Salinas; James D Wasmuth; Mostafa Zamanian; Yadong Zheng; Jianping Cai; Xavier Soberón; Peter D Olson; Juan P Laclette; Klaus Brehm; Matthew Berriman
Journal:  Nature       Date:  2013-03-13       Impact factor: 49.962

  9 in total
  4 in total

Review 1.  Planarian regeneration as a model of anatomical homeostasis: Recent progress in biophysical and computational approaches.

Authors:  Michael Levin; Alexis M Pietak; Johanna Bischof
Journal:  Semin Cell Dev Biol       Date:  2018-05-01       Impact factor: 7.727

2.  Cysteinyl-specialized proresolving mediators link resolution of infectious inflammation and tissue regeneration via TRAF3 activation.

Authors:  Nan Chiang; Xavier de la Rosa; Stephania Libreros; Hui Pan; Jonathan M Dreyfuss; Charles N Serhan
Journal:  Proc Natl Acad Sci U S A       Date:  2021-03-09       Impact factor: 11.205

3.  Planarian regeneration in space: Persistent anatomical, behavioral, and bacteriological changes induced by space travel.

Authors:  Junji Morokuma; Fallon Durant; Katherine B Williams; Joshua M Finkelstein; Douglas J Blackiston; Twyman Clements; David W Reed; Michael Roberts; Mahendra Jain; Kris Kimel; Sunia A Trauger; Benjamin E Wolfe; Michael Levin
Journal:  Regeneration (Oxf)       Date:  2017-06-13

4.  Anti-schistosomal action of the calcium channel agonist FPL-64176.

Authors:  Paul McCusker; John D Chan
Journal:  Int J Parasitol Drugs Drug Resist       Date:  2019-09-14       Impact factor: 4.077

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.