Literature DB >> 30271423

Multi-Tissue Transcriptome Profiling of North American Derived Atlantic Salmon.

Amin R Mohamed1,2, Harry King3, Bradley Evans4, Antonio Reverter1, James W Kijas1.   

Abstract

The availability of a reference genome assembly for Atlantic salmon, Salmo salar, SNP genotyping platforms and low cost sequencing are enhancing the understanding of both life history and production-related traits in this important commercial species. We collected and analyzed transcriptomes from selected tissues of Atlantic salmon to inform future functional and comparative genomics studies. Messenger RNA (mRNA) was isolated from pituitary gland, brain, ovary, and liver before Illumina sequencing produced a total of 640 million 150-bp paired-end reads. Following read mapping, feature counting, and normalization, cluster analysis identified genes highly expressed in a tissue-specific manner. We identified a cluster of 508 tissue specific genes for pituitary gland, 3395 for brain, 2939 for ovary, and 539 for liver. Functional profiling identified gene clusters describing the unique functions of each tissue. Moreover, highly-expressed transcription factors (TFs) present in each tissue-specific gene cluster were identified. TFs belonging to homeobox and bhlh families were identified for pituitary gland, pou and zf-c2h2 families for brain, arid, and zf-c2h2 for ovary and rxr-like family for liver. The data and analysis presented are relevant to the emerging Functional Annotation of All Salmonid Genomes (FAASG) initiative that is seeking to develop a detailed understanding of both salmonid evolution and the genomic elements that drive gene expression and regulation.

Entities:  

Keywords:  Atlantic salmon; RNA-Seq; gene expression; next-gen sequencing; transcription factors; transcriptome

Year:  2018        PMID: 30271423      PMCID: PMC6146974          DOI: 10.3389/fgene.2018.00369

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

The Atlantic salmon (Salmo salar), a member of the family Salmonidae, is endemic to the northern Atlantic Ocean and cultivated worldwide including in Tasmania, Australia. The Australian salmon industry is founded on introduced North American wild stock originating from the River Philip in Nova Scotia (Jungalwalla, 1991). To enhance productivity using genetics, an applied breeding program has been in operation since 2004, and the Tasmanian salmon industry is now the highest valued commercial fishery-related industry in the state (Australian Bureau of Agricultural Resource Economics and Sciences, 2014). A number of production issues remain which are potentially tractable using genomic approaches, including unwanted early maturation, disease susceptibility, product quality traits, and imperfect DNA based diagnostics for sex determination. The ability to elucidate the basis of complex salmonid traits has been dramatically improved by the availability of reference genomes for both rainbow trout (Oncorhynchus mykiss) (Berthelot et al., 2014) and Atlantic salmon (Lien et al., 2016). To date, their major impact has been through enabling the development and application of high density SNP genotyping platforms and GWAS. The availability of annotated protein coding gene models supports the interpretation of GWAS to identify putatively causal variants contributing to trait variation (Ayllon et al., 2015; Barson et al., 2015). However, these reference genomes currently lack annotation of the sequences that regulate gene expression, including proximal, and distal promotors and enhancer elements. Functional annotation requires significant time and cost, and the animal science community has self-assembled into consortia to tackle the task for livestock genomes (FAANG consortium, Andersson et al., 2015) and salmonids inside the “Functional Annotation of All Salmonid Genomes” (FAASG, Macqueen et al., 2017). The experimental component involves collection of multiple data types to support functional annotation, including histone modification marks, DNA methylation status and gene expression. RNA-Seq has revolutionized transcriptomic research, previously based on testing expression of candidate genes using RT-qPCR or even thousands of genes using cDNA microarrays. RNA-Seq provides an efficient and unbiased tool for large scale gene expression analyses in non-model organisms (for example Mohamed et al., 2016, 2018) as well as in salmonids (Salem et al., 2015; Kim et al., 2016; Robinson et al., 2017). Additional background information about available genomic tools for salmonid research, in the broader context of aquaculture genomics, is captured in a recent review (Abdelrahman et al., 2017). The objective of this study was to begin contributions to FAASG by transcriptomic profiling selected tissues. We prioritized the brain-pituitary-gonad (BPG) axis, which plays a critical role in the development and regulation of the reproductive and immune systems in vertebrates including teleosts (Weltzien et al., 2004; Cowan et al., 2017). Fluctuations in this axis regulate hormone production, which exert local and systemic effects on the animal relating to sexual development, maturation, and other traits. The liver is an essential metabolic organ as it regulates most chemical levels in the blood, detoxifies various metabolites, synthesizes proteins, and produces energy. Hence, analyzing the complement of expressed genes within and across these four tissues will provide a basis for understanding key production and life history traits.

Materials and methods

Ethics statement

The animals used are from the SALTAS selective breeding program which has been described previously (Dominik et al., 2010; Eisbrenner et al., 2014; Kijas et al., 2017). The four animals used were sexually immature 3 year old female fish from the 2014 year class. The fish were reared and euthanized in compliance with the CSIRO Animal Ethics Committee, under approval number 2017–02.

Tissue collection and total RNA isolation

Samples from mid-brain, pituitary gland, ovary and liver were collected and stored in RNA-later at −80°C until RNA isolation. Total RNA was isolated from each tissue using RNeasy Kit (QIAGEN), eluted in 40 μL RNase-Free Water and stored at −80°C. RNA quantity were assessed using a Nanodrop™ ND-1000 spectrophotometer (Thermo Scientific), and integrity checked by electrophoretic profiling with Agilent Bioanalyzer 2100 (Agilent, CA).

High-throughput mRNA sequencing (illumina RNA-Seq)

Messenger RNA (mRNA) was isolated from 500 ng total RNA and a total of 16 RNA-Seq libraries (4 tissues × 4 biological replicates per tissue) were prepared using the Illumina TruSeq stranded library Preparation Kit. Libraries were sequenced on an Illumina NovaSeq 6,000 sequencing platform producing 640.3 million individual 150-bp paired-end reads.

Read mapping, differential gene expression, and clustering analyses

Illumina reads were checked for quality using FastQC software. High quality reads (Q>30) were mapped to Atlantic salmon genome ICSASG_v2 (Lien et al., 2016) using TopHat2 version 2.1.1 (Kim et al., 2009) with default parameters. Alignment BAM files were sorted by read name and converted into SAM files using SAMtools version 1.4 (Li et al., 2009). The Python package HTSeq version 0.7.2 (Anders et al., 2015) was applied to count unique reads mapped to exons, which is suitable for RNA-Seq analysis. Raw counts were analyzed using the edgeR package (Robinson et al., 2010) in the R statistical computing environment to infer differential gene expression among tissues. To identify differential expression between samples and tissues, we applied both fold-change and false discovery rate (FDR) thresholds. Specifically, we set FDR to ≤ 0.001 and log2 (fold change) ≥2 to declare significance in line with previous recommendations (Haas et al., 2013). It is important to note a log2 (fold change) >2 equates with a minimum 4-fold change. To compare multiple expression profiles, we used the value of log2 (FPKM+1) centered by subtracting the mean across profiles (Haas et al., 2013), i.e. the mean-centered log2 (FPKM+1) value in which the FPKM+1 circumvents the non-definable instances of log2(0) if FPKM = 0. Tissue-specific gene clusters were identified as those exhibiting high expression in one tissue versus all others.

Gene ontology (GO) enrichment analysis of the tissue-specific gene clusters

To infer function for identified tissue-specific clusters, GO enrichment analyses were performed using the Database For Annotation, Visualization And Integrated Discovery (DAVID) (Huang et al., 2009). The ENTREZ_GENE_ID of the gene clusters was used as identifier and Salmo salar was selected as the background dataset for the enrichment analysis. DAVID uses Fisher's exact test to ascertain statistically significant enrichment of pathways amongst differentially expressed genes (DEGs) relative to the background transcriptome. For the purpose of the enrichment analysis, GO categories with a Benjamini-corrected P ≤ 0.05 were considered significant (Huang et al., 2009). The RNA-Seq reads used in this study have been submitted to the NCBI Gene Expression Omnibus (GEO) database under Accession no. GSE114247.

Identification of highly expressed transcription factors (TFs)

In order to determine transcription factors (TFs) in the tissue-specific gene clusters in Salmon, BLASTX analysis were performed (E-value < 10−3) against the Cod annotated transcription factors database downloaded from the AnimalTFDB database (http://bioinfo.life.hust.edu.cn/AnimalTFDB/#!/tf_summary?species=Gadus_morhua) for Gadus morhua. The retrieved transcription factors were ranked based on their expression in each of the tissue-specific clusters.

Results

Expression within and between four selected tissues

Transcriptomic data was collected from 16 RNA-Seq libraries constructed in a 4 (tissues) × 4 (replicates) experimental design. Sequencing yielded a total of 640 M raw reads, which after mapping to the Atlantic salmon reference assembly ICSASG_v2 yielded an average of 28 M reads per library (Table S1). To begin assessment between libraries, multidimensional scaling (based on the top 500 genes that best differentiate the samples) was performed (Figure S1). This confirmed biological replicates within tissue clustered together, and libraries from different tissues took non-overlapping positions. We performed hierarchical clustering using Spearman correlations of pairwise normalized expression for all 16 samples to explore the relative amount of variation between replicates within a tissue compared to that between the four tissues. This revealed that brain displayed the highest variation between the four biological replicates (animals) while the lowest was observed for ovary tissue. Further, clustering grouped the brain and pituitary as the two tissues with the most similar patterns of gene expression (Figure S2). Together these assessments indicated gene expression variability was much lower between replicates of the same tissue as compared with variation between tissues, as expected for a high quality dataset.

Gene clusters with tissue specific expression

Genes exhibiting significant differential expression (DEG, ≥4-fold change with false discovery rate FDR ≤ 0.001) were identified and used to search for gene clusters exhibiting tissue specific expression. Hierarchical clustering using only significant DEGs revealed distinctive expression profiles for each tissue (Figure 1). Brain appeared to have the highest occurrence of tissue specific expression (3395 genes) followed by ovary (2939), liver (539), and the pituitary gland (508) (Figure 2). Apart from 17542 genes that were shared among the four tissues, the highest number of shared genes were 8819 between pituitary gland and brain followed by 5547 between brain and ovary, and interestingly 1839 among pituitary gland, brain and ovary (Figure 2). This can be expected given the important role of the brain-pituitary-gonad (BPG) axis in regulating sexual development in vertebrates including fish.
Figure 1

DEGs hierarchical clustering. Heat map showing genes (rows) with differential expression (fold ≥ 4, FDR ≤ 0.001) among the four replicates of the pituitary gland, brain, ovary, and liver samples. Expression values are log2-transformed and median-centered by gene.

Figure 2

Four-way Venn diagram showing the number of shared and unique expressed genes across four salmon tissues (pituitary gland, brain, ovary, and liver).

DEGs hierarchical clustering. Heat map showing genes (rows) with differential expression (fold ≥ 4, FDR ≤ 0.001) among the four replicates of the pituitary gland, brain, ovary, and liver samples. Expression values are log2-transformed and median-centered by gene. Four-way Venn diagram showing the number of shared and unique expressed genes across four salmon tissues (pituitary gland, brain, ovary, and liver). The relative expression of the identified tissue specific clusters is compared across tissues in Figure 3. We sought to explore if tissue specific gene clusters were enriched for gene ontologies (GO) relating to molecular function (GO-MF), cellular component (GO-CC), or biological process (GO-BP). Pituitary gland-specific genes contained significant enrichment for all three gene ontology classes (GO-BP, GO-CC GO-MF terms at corrected P ≤ 0.05, Table 1). Despite being comprised of only three genes (growth hormone pre-peptide, prolactin, somatotropin-2), the most highly over-represented term (149 fold) was regulation of immunoglobulin secretion that includes genes coding for growth hormones, prolactin, and somatotropin. The GO-CC extracellular region was identified via pituitary gland hormones such as somatolactins, thyroid stimulating hormone, pro-opiomelanocortin, and gonadotropin which is consistent with the expected gene expression profile of the pituitary (Table 1, Table S3). The list of highly expressed genes in the pituitary gland included genes with function related to hormone activity such as somatolactin beta, pro-opiomelanocortin A1 and B (Figure 4; Table S2). The list of highly expressed transcription factors (TFs) identified in the pituitary gland (Table 2) included lhx3, six1b, other homeodomain proteins, ascl1a, bhlha15, zf-c2h2, and zbtb. Brain-specific genes were enriched for the GO-CC extracellular region term due to expression of neuropeptides including proenkephalin, myostatins, cholecystokinins, and thyrotropin-releasing hormone (Table 1, Table S4). Further, GO-MF term calcium ion binding was enriched based on expressed neuronal calcium-sensor/binding proteins such as visinin-like proteins, calretinin-like proteins and calmodulins (Table 1; Table S4). The list of highly expressed genes in the brain (Figure 4; Table S2) included Vesicular glutamate transporter 2.1 (gene ID = gene48716:106587477) and several genes coding for neuropeptides such as Glucagon family neuropeptides-like (gene ID = gene39762:106578701). The list of highly expressed TFs identified in the brain included zic2a, zic5, pou3f2a, pou3f3b, esrrb (Table 2).
Figure 3

Tissue-specific clusters in Atlantic Salmon from Tasmania. Cluster of 508 genes was identified for pituitary gland (A), 3395 for brain (B), 2939 for ovary (C), and 539 for liver (D). The y-axis in each graph represents the mean-centered log2 (FPKM+1) value. Expression of single genes is plotted in gray, while the mean expression of the genes in each cluster is plotted in blue.

Table 1

Gene Ontology (GO) categories enriched (corrected P ≤ 0.05) among tissue-specific gene clusters.

TissueCategoryGO term IDGO term descriptionGene countFold enrichment
BrainGO_CCGO:0005576Extracellular region213.56
GO_MFGO:0005509Calcium ion binding172.86
PituitaryGO_BPGO:0051023Regulation of immunoglobulin secretion3149.66
GO:0050766Positive regulation of phagocytosis3112.25
GO:0032930Positive regulation of superoxide anion generation374.83
GO_CCGO:0005576Extracellular region117.18
GO_MFGO:0043565Sequence-specific DNA binding74.13
OvaryGO_CCGO:0005730Nucleolus95.39
GO:0016021Integral component of membrane1151.31
GO_MFGO:0016853Isomerase activity69.05
LiverGO_BPGO:0006869Lipid transport451.31
GO:0042157Lipoprotein metabolic process444.9
GO_CCGO:0005576Extracellular region116.47
GO:0005615Extracellular space67.03
GO_MFGO:0008289Lipid binding611.11

Significant enrichment for biological process (GO_BP), cellular component (GO_CC) and molecular function (GO_MF) are indicated along with their term descriptors. The identity of the genes driving each enrichment are provided in the Supplementary material.

Figure 4

Top 10 differential expressed gene expression (DEGs). Heat map and dendrogram of the 10 most highly expressed genes in each tissue-specific cluster. The red-blue spectrum represents the scaled expression values.

Table 2

Transcription factors highly expressed (top 10 ranked) in each tissue-specific gene clusters along with their expression.

Salmon gene IDENSB IDGene symbolFamilyDescriptionPituitaryBrainOvaryLiver
gene29567:106568496ENSGMOG00000003038znf532zf-c2h2Zinc Finger Protein 5327.076638−1.81893−1.75861−3.4991
gene16311:106609474ENSGMOG00000002268ascl1abhlhAchaete-scute homolog 16.812102−1.07506−2.32828−3.40876
gene15527:106608660ENSGMOG00000007895homeoboxHomeodomain protein6.511932−1.66275−1.85147−2.99771
gene24096:106563263ENSGMOG00000015270lhx3homeoboxLIM homeobox 36.489042−2.28645−0.13382−4.06877
gene38047:106576936ENSGMOG00000000185homeoboxHomeodomain protein6.357463−1.26608−0.20003−4.89135
gene37389:106576285ENSGMOG00000002268ascl1abhlhAchaete-scute homolog 16.204624−0.17073−2.43396−3.59994
gene51125:106589580ENSGMOG00000003972bhlha15bhlhBasic Helix-Loop-Helix Family Member A156.181782−1.24124−2.72763−2.21291
gene1631:106610965ENSGMOG00000006624homeoboxHomeodomain protein6.1682650.592351−2.62659−4.13403
gene32739:106571735ENSGMOG00000009091zbtb49zbtbzinc finger and BTB domain containing 496.0541750.013972−2.49383−3.57431
gene598:106598867ENSGMOG00000010826six1bhomeoboxHomeobox protein six1b5.7808140.274847−2.0676−3.98806
gene14956:106608144ENSGMOG00000009083myt1azf–c2hcMyelin Transcription Factor 1−1.561626.430779−2.08082−2.78834
gene48129:106586884ENSGMOG00000018584olig2bhlhOligodendrocyte Transcription Factor 2−1.559546.344058−1.78626−2.99826
gene14733:106607949ENSGMOG00000007407pou3f2apouPOU class 3 homeobox 2a−1.518586.216699−2.34906−2.34906
gene42796:106581638ENSGMOG00000019585zic2azf-c2h2zic family member 2a−1.311376.161513−1.80161−3.04854
gene47289:106586041ENSGMOG00000019582zic5zf-c2h2Zic Family Member 5−1.519156.075141−1.75899−2.79699
gene34430:106573296ENSGMOG00000004267rfx4rfxRegulatory Factor X4−1.385986.045839−0.24805−4.4118
gene35702:106574614ENSGMOG00000015654pou3f3bpouPOU class 3 homeobox 3b−0.369716.030068−2.43394−3.22642
gene17894:106611051ENSGMOG00000015180esrrbesr-likeSteroid hormone receptor ERR2−1.563745.9837−2.20998−2.20998
gene4377:106584047ENSGMOG00000005989pou3f2bpouPOU class 3 homeobox 2b−1.462085.928734−2.10832−2.35832
gene11670:106605033ENSGMOG00000005989pou3f2bpouPOU class 3 homeobox 2b−2.107645.926683−1.4614−2.35764
gene50315:106588731ENSGMOG00000013263thapTHAP domain containing−1.75755−1.611315.62642−2.25755
gene31170:106569971ENSGMOG00000001476kdm5baaridLysine (K)–specific demethylase 5Ba−1.22916−1.913915.570603−2.42752
gene45968:106584708ENSGMOG00000002555arid3baridAT-Rich Interaction Domain 3B0.31235−2.448045.494693−3.359
gene22766:106561922ENSGMOG00000019253e2f8e2fE2F Transcription Factor 8−1.32298−2.065595.406993−2.01843
gene27650:106566562ENSGMOG00000000127snai1bzf-c2h2snail family zinc finger 10.220371−2.854295.304473−2.67055
gene14139:106607389ENSGMOG00000000115si:dkey-208k4.2zf-c2h20.193994−1.890315.266383−3.57007
gene29784:106568797ENSGMOG00000002773mkxahomeoboxHomeodomain protein0.302587−1.296495.174461−4.18055
gene36444:106575359ENSGMOG00000020149si:dkeyp-113d7.1zf-c2h2−1.49236−1.63865.019571−1.8886
gene32174:106571028ENSGMOG00000011046fostf_bzipFos Proto-Oncogene, AP-1 Transcription Factor Subunit0.419622−1.944314.658686−3.134
gene11227:106604581ENSGMOG00000016612lbx1ahomeoboxladybird homeobox 1a−0.64393−1.299664.654205−2.71062
gene20791:106613765ENSGMOG00000011735creb3l3atf_bzipcAMP responsive element binding protein 3-like 3a−4.39031−1.12504−1.927377.442716
gene48589:106587347ENSGMOG00000002967rxr-likeThe retinoid X receptor−3.43146−2.16225−1.568667.162369
gene22207:106561422ENSGMOG00000003773nr1h4thr-likeNuclear Receptor Subfamily 1 Group H Member 4−3.21863−2.26679−1.516797.002213
gene6072:106599177ENSGMOG00000002344si:ch1073-291l11.1zbtb−2.5624−2.08568−2.31246.960481
gene22802:106561958ENSGMOG00000007809znf408zf-c2h2Zinc Finger Protein 408−2.94014−2.30503−1.499486.744649
gene31376:106570251ENSGMOG00000000984nr0b2aNuclear receptor subfamily 0, group B, member 2a−2.77805−2.31892−1.478956.57592
gene48566:106587325ENSGMOG00000002967rxr-likeThe retinoid X receptor−3.19692−0.82396−2.446926.467793
gene52087:106590549ENSGMOG00000001972hnf4grxr-likeHepatocyte Nuclear Factor 4 Gamma−2.72588−1.74915−1.829636.304663
gene33755:106572609ENSGMOG00000011421hnf4arxr-likeHepatocyte Nuclear Factor 4 Alpha−2.33726−2.04478−1.837266.219307
gene27953:106566843ENSGMOG00000015966nr1h5thr-likeNuclear receptor subfamily 1, group H, member 5−4.12551−0.64098−1.35636.12279

The red-blue spectrum represents the scaled mean expression values.

Tissue-specific clusters in Atlantic Salmon from Tasmania. Cluster of 508 genes was identified for pituitary gland (A), 3395 for brain (B), 2939 for ovary (C), and 539 for liver (D). The y-axis in each graph represents the mean-centered log2 (FPKM+1) value. Expression of single genes is plotted in gray, while the mean expression of the genes in each cluster is plotted in blue. Gene Ontology (GO) categories enriched (corrected P ≤ 0.05) among tissue-specific gene clusters. Significant enrichment for biological process (GO_BP), cellular component (GO_CC) and molecular function (GO_MF) are indicated along with their term descriptors. The identity of the genes driving each enrichment are provided in the Supplementary material. Top 10 differential expressed gene expression (DEGs). Heat map and dendrogram of the 10 most highly expressed genes in each tissue-specific cluster. The red-blue spectrum represents the scaled expression values. Transcription factors highly expressed (top 10 ranked) in each tissue-specific gene clusters along with their expression. The red-blue spectrum represents the scaled mean expression values. Ovary-specific genes were enriched for two GO-CC terms nucleolus and integral component of membrane (Table S5) and one GO-MF term isomerase activity (Table 1). The drivers of enrichment included hormone receptors such as follicle stimulating hormone receptor, growth hormone receptor, trans-membrane proteins members of solute carrier family and 3-oxo-5-alpha-steroid 4-dehydrogenase. The list of highly expressed genes in the ovary is shown in Figure 4 and Table S2. The list of highly expressed TFs identified in the ovary (Table 2) included e2f8, fos, kdm5ba, arid3b, two Homeodomain proteins (lbx1a and mkxa) and two zinc finger proteins (snai1b and si:dkey-208k4.2). The final tissue investigated was the liver, which had enriched GO terms in all three categories that included lipid transport, lipoprotein metabolic process, extracellular region, extracellular space, and lipid binding (Table 1; Table S6). The lipid transport term was the most highly over-represented (51-fold) based on genes coding for lipid transport-proteins (apolipoproteins). Other notable genes expressed primarily in the liver included coagulation factor, insulin-like growth factor-binding protein, peptide hormones, and genes with immune-related functions. The list of highly expressed genes in the liver (Figure 4; Table S2) included serum albumins (gene ID = gene1886:100136575, gene38403:100136922), eggshell protein (gene ID = gene22108:100136930), and saxitoxin and tetrodotoxin-binding protein 1-like (gene ID = gene18707:106611802). The list of highly expressed TFs identified in the liver (Table 2) included creb3l3a, nuclear receptors (nr1h4, nr0b2a, nr1h5), and hepatocyte nuclear factors (hnf4g, hnf4a).

Discussion

Identifying the genes expressed in cells of a given tissue represents important baseline information essential for understanding tissue function and physiology more broadly. This study cataloged the collection of expressed genes in four salmon tissues, and documented significant changes that characterized tissues from each other. We identified the most abundant transcripts and the collection of genes and transcription factors (TFs) exhibiting tissue specific expression, and our findings were in broad agreement with expectation about the specific role of each organ. It is worthwhile noting that tissue specificity was considered only within the context of the tissues tested, and that examination of a more diverse collection of tissues may alter these findings. Nonetheless, the profile of each transcriptome captured GO enrichment terms consistent with expectation, and provided confidence that the dataset is of both high quality and useful for subsequent analysis. For example the enrichment of the GO terms regulation of immunoglobulin secretion and extracellular region derived from the presence of genes coding for pituitary hormones. Similarly in the ovary the enrichment of the GO term integral component of membrane derived from the presence of many hormone receptors such as follicle stimulating hormone receptor and growth hormone receptor which aligns with functions of the ovary. The identified signals with high expression reflect the physiological function of each tissue. For example, the high expression of the pituitary hormone somatolactin beta and pro-opiomelanocortin (pomc) was observed in the pituitary gland. Similarly the pomc gene is expressed in both the anterior and intermediate lobes of the pituitary gland. The same is true for other tissues as expression of genes encoding neuropeptides in the brain and genes encoding liver proteins such as albumin and eggshell proteins in the liver. The dataset adds to the increasing volume of RNA-Seq information being made available by the research community. The majority of existing Atlantic salmon RNA-Seq datasets have been generated to address specific biological questions, including host response to disease challenge (Dettleff et al., 2017; Valenzuela-Muñoz et al., 2017; Rozas-Serri et al., 2018), developmental progression (Wang et al., 2014) or nutritional requirements (Jin et al., 2018). The consequence is a paucity of transcriptomic profiling of animals not subject to an experimental regime designed to elicit specific responses or that characterize a biological extreme. The survey sequencing performed here can therefore be considered baseline data, making it applicable to general studies seeking to understand normal state expression levels across tissues. Importantly, when co-analyzed with the other datatypes identifying gene regulatory elements inside FAASG, we anticipate utility for at least three experimental applications. Firstly, for interpretation of GWAS to prioritize candidate genes on the basis of their tissue specific expression in relation to the biology of the trait under investigation. Secondly, it will assist assigning putative function to salmonid specific genes without annotation using the “guilt by association” principal. Specifically, the function of uncharacterized genes can be deduced from their tissue specific co-expression with genes of known function. Finally, we anticipate the data to contribute to characterization of transcriptomic complexity that includes expression of transcript isoforms and temporal variation in expression that can only be addressed by extensive tissue sampling across multiple life history time points.

Author contributions

AM, HK, JK, and BE conceived the study and contributed to the experimental design. HK and BE performed animal management and AM and JK contributed to sample acquisition. AM generated and analyzed the RNA-Seq data. TR contributed to the analysis. AM and JK drafted the manuscript which all authors contributed to.

Conflict of interest statement

BE was employed by company Tassal. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  27 in total

1.  Transcriptomic profiles of post-smolt Atlantic salmon challenged with Piscirickettsia salmonis reveal a strategy to evade the adaptive immune response and modify cell-autonomous immunity.

Authors:  Marco Rozas-Serri; Andrea Peña; Lucerina Maldonado
Journal:  Dev Comp Immunol       Date:  2017-12-27       Impact factor: 3.636

2.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

3.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

4.  The Atlantic salmon genome provides insights into rediploidization.

Authors:  Sigbjørn Lien; Ben F Koop; Simen R Sandve; Jason R Miller; Matthew P Kent; Torfinn Nome; Torgeir R Hvidsten; Jong S Leong; David R Minkley; Aleksey Zimin; Fabian Grammes; Harald Grove; Arne Gjuvsland; Brian Walenz; Russell A Hermansen; Kris von Schalburg; Eric B Rondeau; Alex Di Genova; Jeevan K A Samy; Jon Olav Vik; Magnus D Vigeland; Lis Caler; Unni Grimholt; Sissel Jentoft; Dag Inge Våge; Pieter de Jong; Thomas Moen; Matthew Baranski; Yniv Palti; Douglas R Smith; James A Yorke; Alexander J Nederbragt; Ave Tooming-Klunderud; Kjetill S Jakobsen; Xuanting Jiang; Dingding Fan; Yan Hu; David A Liberles; Rodrigo Vidal; Patricia Iturra; Steven J M Jones; Inge Jonassen; Alejandro Maass; Stig W Omholt; William S Davidson
Journal:  Nature       Date:  2016-04-18       Impact factor: 49.962

5.  The rainbow trout genome provides novel insights into evolution after whole-genome duplication in vertebrates.

Authors:  Camille Berthelot; Frédéric Brunet; Domitille Chalopin; Amélie Juanchich; Maria Bernard; Benjamin Noël; Pascal Bento; Corinne Da Silva; Karine Labadie; Adriana Alberti; Jean-Marc Aury; Alexandra Louis; Patrice Dehais; Philippe Bardou; Jérôme Montfort; Christophe Klopp; Cédric Cabau; Christine Gaspin; Gary H Thorgaard; Mekki Boussaha; Edwige Quillet; René Guyomard; Delphine Galiana; Julien Bobe; Jean-Nicolas Volff; Carine Genêt; Patrick Wincker; Olivier Jaillon; Hugues Roest Crollius; Yann Guiguen
Journal:  Nat Commun       Date:  2014-04-22       Impact factor: 14.919

6.  Transcriptome assembly, gene annotation and tissue gene expression atlas of the rainbow trout.

Authors:  Mohamed Salem; Bam Paneru; Rafet Al-Tobasei; Fatima Abdouni; Gary H Thorgaard; Caird E Rexroad; Jianbo Yao
Journal:  PLoS One       Date:  2015-03-20       Impact factor: 3.240

7.  HTSeq--a Python framework to work with high-throughput sequencing data.

Authors:  Simon Anders; Paul Theodor Pyl; Wolfgang Huber
Journal:  Bioinformatics       Date:  2014-09-25       Impact factor: 6.937

8.  Functional Annotation of All Salmonid Genomes (FAASG): an international initiative supporting future salmonid research, conservation and aquaculture.

Authors:  Daniel J Macqueen; Craig R Primmer; Ross D Houston; Barbara F Nowak; Louis Bernatchez; Steinar Bergseth; William S Davidson; Cristian Gallardo-Escárate; Tom Goldammer; Yann Guiguen; Patricia Iturra; James W Kijas; Ben F Koop; Sigbjørn Lien; Alejandro Maass; Samuel A M Martin; Philip McGinnity; Martin Montecino; Kerry A Naish; Krista M Nichols; Kristinn Ólafsson; Stig W Omholt; Yniv Palti; Graham S Plastow; Caird E Rexroad; Matthew L Rise; Rachael J Ritchie; Simen R Sandve; Patricia M Schulte; Alfredo Tello; Rodrigo Vidal; Jon Olav Vik; Anna Wargelius; José Manuel Yáñez
Journal:  BMC Genomics       Date:  2017-06-27       Impact factor: 3.969

9.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

10.  TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions.

Authors:  Daehwan Kim; Geo Pertea; Cole Trapnell; Harold Pimentel; Ryan Kelley; Steven L Salzberg
Journal:  Genome Biol       Date:  2013-04-25       Impact factor: 13.583

View more
  3 in total

1.  Leveraging transcriptome and epigenome landscapes to infer regulatory networks during the onset of sexual maturation.

Authors:  Amin R Mohamed; Marina Naval-Sanchez; Moira Menzies; Bradley Evans; Harry King; Antonio Reverter; James W Kijas
Journal:  BMC Genomics       Date:  2022-06-01       Impact factor: 4.547

2.  Transcriptome Profiling across Five Tissues of Giant Panda.

Authors:  Feng Li; Chengdong Wang; Zhongxian Xu; Mingzhou Li; Linhua Deng; Ming Wei; Hemin Zhang; Kai Wu; Ruihong Ning; Diyan Li; Mingyao Yang; Mingwang Zhang; Qingyong Ni; Bo Zeng; Desheng Li; Ying Li
Journal:  Biomed Res Int       Date:  2020-08-10       Impact factor: 3.411

3.  Comprehensive Characterization of Multitissue Expression Landscape, Co-Expression Networks and Positive Selection in Pikeperch.

Authors:  Julien Alban Nguinkal; Marieke Verleih; Lidia de Los Ríos-Pérez; Ronald Marco Brunner; Arne Sahm; Saptarshi Bej; Alexander Rebl; Tom Goldammer
Journal:  Cells       Date:  2021-09-02       Impact factor: 6.600

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.