Literature DB >> 28066712

De novo transcriptome assembly of shrimp Palaemon serratus.

Alejandra Perina1, Ana M González-Tizón1, Iago F Meilán2, Andrés Martínez-Lage1.   

Abstract

The shrimp Palaemon serratus is a coastal decapod crustacean with a high commercial value. It is harvested for human consumption. In this study, we used Illumina sequencing technology (HiSeq 2000) to sequence, assemble and annotate the transcriptome of P. serratus. RNA was isolated from muscle of adults individuals and, from a pool of larvae. A total number of 4 cDNA libraries were constructed, using the TruSeq RNA Sample Preparation Kit v2. The raw data in this study was deposited in NCBI SRA database with study accession number of SRP090769. The obtained data were subjected to de novo transcriptome assembly using Trinity software, and coding regions were predicted by TransDecoder. We used Blastp and Sma3s to annotate the identified proteins. The transcriptome data could provide some insight into the understanding of genes involved in the larval development and metamorphosis. SPECIFICATIONS: [Table: see text].

Entities:  

Keywords:  Illumina; Larvae; Muscle; Palaemon serratus; RNA-seq; Transcriptome

Year:  2016        PMID: 28066712      PMCID: PMC5200869          DOI: 10.1016/j.gdata.2016.12.009

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Introduction

The common littoral shrimp Palaemon serratus (Pennant, 1777) is a coastal decapod crustacean that inhabits the intertidal and subtidal soft-sediment of estuaries and rocky bottoms covered with seagrass and algae [1]. The world distribution covers the Atlantic Ocean, from Scotland and Denmark to Mauritania, and all the Mediterranean Sea, Marmara and the Black Sea [2]. The capture of P. serratus maintains a very important traditional activity in some fishing communities due to its high commercial value, mainly in North of Spain (up to 140€/kg on Christmas). In fact, the P. serratus fishery contributes annually more than ten million Euros to the European economy [3]. Despite its high economic value, the availability of genomic and transcriptomic data for this shrimp in public databases is limited. In addition to its ecological and commercial importance, these species have proved to be suitable indicator species in ecotoxicology [4], [5]. In this study, we performed de novo transcriptome assembly and annotation for P. serratus from adults individuals, and from a pool of larvae, by next-generation sequencing. These transcriptomic data provide useful information to reveal putative genes involved in the larval development and metamorphosis and help identify novel genes.

Experimental design, materials and methods

Animal materials

Specimens of P. serratus were collected from the Artabro Gulf (43° 22′00″N, 8°28′00′W) in the northwest of Spain. Animals were captured with a fish trap and some individuals were preserved in RNAlater® (Life Technologies). The rest of them were carried alive to the laboratory where they were kept at 18 °C in an aerated aquarium and fed with frozen brine shrimp for at least 24 h, until larvae were released. All samples were kept at − 80 °C until they were processed.

RNA isolation, library construction and sequencing

RNA isolation and library construction was carried out at AllGenetics (A Coruña, Spain) according to the following procedure. RNA was isolated from muscle of adults individuals (Pser), and from a pool of larvae (LPser), using the reagent NZYol (NZYTech). Briefly, frozen samples were homogenised using a mortar and pestle under liquid nitrogen. 1 mL of NZYol was added directly to the homogenate, and transferred to a nuclease-free 1.5 mL tube. Then, we added 0.2 volumes of chloroform-isoamil alcohol (24:1), centrifuged the mixture, and recovered the supernatant into a new tube. One volume of ice-cold isopropanol was added, and the mixture was kept at − 20 °C overnight in order to precipitate the RNA. The samples were centrifuged, and the supernatant was discarded. The pellet was washed with 96% ethanol. The ethanol was discarded, and the pellet resuspended in a final volume of 30 μL. RNA concentration and integrity were measured in an Agilent 2100 Bioanalyzer. A total number of 4 cDNA libraries were constructed, using the TruSeq RNA Sample Preparation Kit v2 (Illumina Inc. San Diego, CA), strictly following the manufacturer's instructions. From each of the RNA samples, we constructed 2 different libraries (one ‘original’ library and its replicate). All the ‘original’ libraries were run in a HiSeq 2000 PE100 lane, whereas all the ‘replicates’ were run in a different HiSeq 2000 PE100 lane. Within each lane, the libraries were pooled in equimolar amounts, according to the quantication data provided by the Qubit dsDNA HS Assay Kit, before high throughput sequencing.

De novo transcriptome assembly, identification of protein coding region, and annotation

We obtained 9.5 and 7.5 GB of raw data from Pser and Pser_rep respectively (original and replicate respectively), and 11.6 and 7.7 GB of raw data from LPser and LPser_rep respectively, by paired-end sequencing (deposited in NCBI SRA database with study accession number of SRP090769). Quality control for the raw reads was performed using FastQC [6]. After the removal Illumina adaptors and filter sequences with the Trimmomatic v0.35 [7] a total of 65,765,083 cleaned reads were obtained from adults individuals of P. serratus, and 75,307,090 cleaned reads from larvae. The specific parameters to obtain high quality reads were: 1) cut the 12 bases from the start of the read, 2) trimming sequences by the end of them and based on the value of quality, establishing a minimum quality value 25 and, 3) removing reads with a length less than 40 nucleotides. These high quality reads were de novo assembled using Trinity software v.2.2.0 [8] with default parameters settings (K mer = 25). Detailed information on the de novo trasncriptome assembly is summmarized in Table 1. The coding regions prediction of assembled transcripts was carried out by TransDecoder (implemented in the Trinity software). The results showed 35,364 and 42,244 ORFs for adults and larvae, respectively. We carried out a local Blastp on the predicted proteins against NCBI non-redundant protein sequences (nr) database (September 2016) to predict the putative functions of the identified proteins. The Blastp results can be found in Supplementary material 1. The predicted proteins, too, were functionally annotated using a modified version of the Sma3s program [9], which allows the tracing of the source of each annotation and initially tries to discover the query sequences in the annotated database. It uses the UniProt database to assign gene names, descriptions and EC (Enzyme Commission) numbers to the query sequences and adds GO terms, UniProt keywords and pathways. The predicted amino acid sequences was used as input for two executions of the Sma3s, one against Swiss-Prot database (manually curated) and another against TrEMBL database (automatically annotated and not reviewed) from unannotated sequences against Swiss-Prot database. The annotation results and their statistics can be found in Supplementary material 2. An annotation statistic comparison of adult and larvae transcriptomes against Swiss-Prot database was summarized in Fig. 1. All large-scale computational analyses were performed on a high performance computing cluster, The Supercomputing Centre of Galicia (CESGA). The transcriptome data in this work will be usefully applied to study genes involved in the larval development and metamorphosis.
Table 1

Summary of the de novo transcriptome assembly for P. serratus.

IndexAdults transcriptomeLarvae transcriptome
Total trinity ‘genes’95,601124,389
Total trinity transcripts112,716152,110
Percent GC39.3339.36
Contig N5023112596
Median contig length405401
Average contig996.971047.88
Total assembled bases112,374,970159,393,572
Fig. 1

Comparison of the annotation of adult vs larvae transcriptome against Swiss-Prot database.

Conflict of interest

The authors declare that they have no competing interests.
Organism/cell line/tissuePalaemon serratus/muscle adults individuals and pool of larvae
SexN/A
Sequencer or array typeIllumina HiSeq2000
Data formatRaw or processed
Experimental factorsDe novo transcriptome assembly of Palaemon serratus.
Experimental featuresRNA was isolated from muscle of adults individuals and, from a pool of larvae. A total number of 4 cDNA libraries were constructed, using the TruSeq RNA Sample Preparation Kit v2. The obtained data were subjected to de novo transcriptome assembly using Trinity, and coding regions were predicted by TransDecoder. We used Blastp and Sma3s_v2 to annotate the identified proteins.
ConsentN/A
Sample source locationArtabro Gulf (43° 22′00″N, 8°28′00′′’W) in the northwest of Spain.
  5 in total

1.  A multiple stressor approach to study the toxicity and sub-lethal effects of pharmaceutical compounds on the larval development of a marine invertebrate.

Authors:  E González-Ortegón; J Blasco; L Le Vay; L Giménez
Journal:  J Hazard Mater       Date:  2013-09-23       Impact factor: 10.588

2.  Swimming velocity, avoidance behavior and biomarkers in Palaemon serratus exposed to fenitrothion.

Authors:  Cristiana Oliveira; Joana R Almeida; Lúcia Guilhermino; Amadeu M V M Soares; Carlos Gravato
Journal:  Chemosphere       Date:  2012-07-21       Impact factor: 7.086

3.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

4.  Sma3s: a three-step modular annotator for large sequence datasets.

Authors:  Antonio Muñoz-Mérida; Enrique Viguera; M Gonzalo Claros; Oswaldo Trelles; Antonio J Pérez-Pulido
Journal:  DNA Res       Date:  2014-02-05       Impact factor: 4.458

5.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

  5 in total
  4 in total

1.  Molecular characterization of putative neuropeptide, amine, diffusible gas and small molecule transmitter biosynthetic enzymes in the eyestalk ganglia of the American lobster, Homarus americanus.

Authors:  Andrew E Christie; Meredith E Stanhope; Helen I Gandler; Tess J Lameyer; Micah G Pascual; Devlin N Shea; Andy Yu; Patsy S Dickinson; J Joe Hull
Journal:  Invert Neurosci       Date:  2018-10-01

2.  Single-molecule long-read sequencing facilitates shrimp transcriptome research.

Authors:  Digang Zeng; Xiuli Chen; Jinxia Peng; Chunling Yang; Min Peng; Weilin Zhu; Daxiang Xie; Pingping He; Pinyuan Wei; Yong Lin; Yongzhen Zhao; Xiaohan Chen
Journal:  Sci Rep       Date:  2018-11-16       Impact factor: 4.379

3.  De novo gonad transcriptome analysis of the common littoral shrimp Palaemon serratus: novel insights into sex-related genes.

Authors:  Inés González-Castellano; Chiara Manfrin; Alberto Pallavicini; Andrés Martínez-Lage
Journal:  BMC Genomics       Date:  2019-10-22       Impact factor: 3.969

4.  Transcriptome Analysis Reveals Putative Target Genes of APETALA3-3 During Early Floral Development in Nigella damascena L.

Authors:  Yves Deveaux; Natalia Conde E Silva; Domenica Manicacci; Martine Le Guilloux; Véronique Brunaud; Harry Belcram; Johann Joets; Ludivine Soubigou-Taconnat; Etienne Delannoy; Hélène Corti; Sandrine Balzergue; Jose Caius; Sophie Nadot; Catherine Damerval
Journal:  Front Plant Sci       Date:  2021-06-04       Impact factor: 5.753

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.