| Literature DB >> 35346024 |
Kishor Kumar Sarker1,2, Liang Lu1,2, Junman Huang1,2, Tao Zhou1,2, Li Wang1,2, Yun Hu1,2, Lei Jiang1,2, Habibon Naher3, Mohammad Abdul Baki3, Anirban Sarker3, Chenhong Li4,5.
Abstract
OBJECTIVES: American shad (Alosa sapidissima) is an important migratory fish under Alosinae and has long been valued for its economic, nutritional and cultural attributes. Overfishing and barriers across the passage made it vulnerable to sustain. To protect this valuable species, aquaculture action plans have been taken though there are no published genetic resources prevailing yet. Here, we reported the first de novo assembled and annotated transcriptome of A. sapidissima using blood and brain tissues. DATA DESCRIPTION: We generated 160,481 and 129,040 non-redundant transcripts from brain and blood tissues. The entire work strategy involved RNA extraction, library preparation, sequencing, de novo assembly, filtering, annotation and validation. Both coding and non-coding transcripts were annotated against Swissprot and Pfam datasets. Nearly, 83% coding transcripts were functionally assigned. Protein clustering with clupeiform and non-clupeiform taxa revealed ~ 82% coding transcripts retained the orthologue relationship which improved confidence over annotation procedure. This study will serve as a useful resource in future for the research community to elucidate molecular mechanisms for several key traits like migration which is fascinating in clupeiform shads.Entities:
Keywords: Alosa sapidissima; Annotation; Brain & Blood; De novo transcriptome
Mesh:
Year: 2022 PMID: 35346024 PMCID: PMC8960216 DOI: 10.1186/s12863-022-01043-z
Source DB: PubMed Journal: BMC Genom Data ISSN: 2730-6844
Overview of all data files/data sets
| Label | Name of data file/data set | File types (file extensions) | Data repository and identifier (DOI or accession number) |
|---|---|---|---|
| Data file 1 | Method and Code availability | Document file (.docx) | |
| Data file 2 | RNAseq-Brain | SRA file (.sra) | NCBI |
| Data file 3 | RNAseq-Blood | SRA file (.sra) | NCBI |
| Data file 4 | FigS1 Complete work flow | Image file (.jpg) | |
| Data file 5 | FigS2 Post trimming quality assessment | Image file (.jpg) | |
| Data file 6 | FigS3 Transcript length distribution | Image file (.jpg) | |
| Data file 7 | FigS4 BUSCO assessment | Image file (.jpg) | |
| Data file 8 | FigS5 Phylogenetic relationship | Image file (.jpg) | |
| Data file 9 | Table S1 Preliminary assembly statistics | Document file (.docx) | |
| Data file 10 | Table S2 Final non-redundant assembly statistics | Document file (.docx) | |
| Data file 11 | Table S3 Annotation summery | Document file (.docx) | |
| Data file 12 | Table S4 Species description | Document file (.docx) | |
| Data file 13 | Table S5 Homologue information | Document file (.docx) | |
| Data file 14 | brain.Trinotate.filtered.xls | Spreadsheet (.xls) | |
| Data file 15 | brain.Trinity.RSEM.retained.clustered.fasta | Fasta file(.fasta) | |
| Data file 16 | brain.Trinity.RSEM.retained.clustered.fasta.transdecoder.pep | Fasta file(.pep) | |
| Data file 17 | blood.Trinotate.filtered.xls | Spreadsheet (.xls) | |
| Data file 18 | blood.Trinity.RSEM.retained.clustered.fasta | Fasta file(.fasta) | |
| Data file 19 | blood.Trinity.RSEM.retained.clustered.fasta.transdecoder.pep | Fasta file(.pep) | |
| Data file 20 | Annotation from combined reads | Document file (.docx) |