| Literature DB >> 29896516 |
André M Machado1, Mónica Felício2, Elza Fonseca2,3, Rute R da Fonseca4, L Filipe C Castro1,3.
Abstract
Mackerels represent a valuable fishery worldwide. Their ample geographic distribution and capture levels make them an insightful model to address stock management strategies in the context of global changes. Yet, and despite recent impressive genome and transcriptome sequencing efforts from teleost species, available resources from the Scombridae family are comparatively scarce. Here, we generated the first high-quality de novo assembly of the liver transcriptome of the Atlantic chub mackerel (Scomber colias). Through the use of RNA-Seq Illumina technology, 111,124,228 clean reads were obtained for the liver transcriptome. De novo assembly resulted in 93,731 transcripts with an N50 of 1462 bp. This dataset provides an important insight into the context of fisheries management.Entities:
Keywords: Atlantic chub mackerel; Liver; RNA-Seq; Scombridae; Stock management
Year: 2018 PMID: 29896516 PMCID: PMC5996228 DOI: 10.1016/j.dib.2018.03.013
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Transrate and Trinity Statistics of the original, decontaminated and final transcriptome assembly of liver transcriptome of S. colias.
| 121,656,039 | |||
| 111,124,228 | |||
| 91.34% | |||
| 72,618 | 54,876 | 35,386 | |
| 114,174 | 93,731 | 44,345 | |
| 1299 | 1462 | 1288 | |
| 544 | 593 | 593 | |
| 899 | 975 | 912 | |
| 301 | 301 | 301 | |
| 14,405 | 14,405 | 13,899 | |
| 30,346 | 28,819 | 12,962 | |
| 28 | 26 | 3 | |
| 45.33 | 45.32 | 46.64 | |
| 102,639,256 | 91,425,968 | 40,443,374 | |
| – | – | 81.37% | |
Fig. 1Length distribution of the final transcriptome contigs. The x-axis represents the length, and the y-axis represents the number of Contigs.
BUSCO statistics of completeness of S. colias Liver final transcriptome assembly against the metazoa and eukaryota gene sets.
| 82.90 | 83.20 | |
| 73.60 | 71.60 | |
| 9.30 | 11.60 | |
| 13.00 | 13.20 | |
| 4.10 | 3.60 |
The top 30 species for which there was a top blastx hit. These blastx results from the queried of final transcriptome assembly against Non-Redundant Database of NCBI.
| 8187 | 8,503 | 28.20 | |
| 215358 | 4,696 | 15.58 | |
| 144197 | 2,683 | 8.90 | |
| 80966 | 2,597 | 8.61 | |
| 56723 | 1,447 | 4.80 | |
| 8255 | 1,346 | 4.46 | |
| 43700 | 1,270 | 4.21 | |
| 8128 | 822 | 2.73 | |
| 8208 | 551 | 1.83 | |
| 32507 | 431 | 1.43 | |
| 8153 | 375 | 1.24 | |
| 13489 | 322 | 1.07 | |
| 28743 | 290 | 0.96 | |
| 106582 | 262 | 0.87 | |
| 109280 | 256 | 0.85 | |
| 8078 | 255 | 0.85 | |
| 244447 | 220 | 0.73 | |
| 37003 | 205 | 0.68 | |
| 52670 | 203 | 0.67 | |
| 8090 | 199 | 0.66 | |
| 105023 | 175 | 0.58 | |
| 48699 | 164 | 0.54 | |
| 150288 | 152 | 0.50 | |
| 99883 | 151 | 0.50 | |
| 303518 | 144 | 0.48 | |
| 8022 | 142 | 0.47 | |
| 48701 | 142 | 0.47 | |
| 31033 | 139 | 0.46 | |
| 7962 | 133 | 0.44 | |
| 163134 | 120 | 0.40 |
Fig. 2Blastx analysis of Scomber colias final transcriptome assembly. (A) Homologous gene-species distribution (B) E-value distribution, (C) Similarity distribution.
E-value distribution of blastx hits of final transcriptome assembly against NR database.
| 10492 | 34.80 | |
| 7057 | 23.41 | |
| 4366 | 14.48 | |
| 3667 | 12.16 | |
| 2899 | 9.62 | |
| 1667 | 5.53 |
Similarity distribution of blastx hits of final transcriptome assembly against NR database.
| 15 | 0.05 | |
| 660 | 2.19 | |
| 4212 | 13.97 | |
| 5949 | 19.73 | |
| 5859 | 19.43 | |
| 13453 | 44.62 |
Functional annotation categories and statistics for final transcriptome assembly and for a subset of unigenes.
| 21,981 | – | |
| – | 21,981 | |
| 27,772 | 21,981 | |
| 27,426 | 21,707 | |
| 26,780 | 21,215 | |
| 27,519 | 21,791 | |
| 26,889 | 21,307 | |
| 23,927 | 18,867 | |
| 24,086 | 19,013 | |
| 23,550 | 18,567 | |
| 21,247 | 16,911 | |
| 21,005 | 16,754 | |
| 18,885 | 14,557 |
Fig. 3Histogram of the clusters of orthologous groups (COG).
| Transcriptomics of Atlantic chub mackerel Liver | |
| Raw reads of DNA sequences | |
| A liver sample of Atlantic chub mackerel, | |
| Raw data in FASTQ, transcriptome assembly and Final transcriptome assembly in FASTA format. | |
| One specimen of | |
| The | |
| Portugal (41.501944 N 8.851667 W) | |
| The raw FASTQ files were deposited in the NCBI SRA database with accession number SRX3462868 ( | |
| The decontaminated transcriptome assembly was deposited in the NCBI TSA database with accession number GGCI00000000.1 ( | |
| The final transcriptome assembly was deposited in the figshare digital repository ( |