Literature DB >> 22353237

An ovary transcriptome for all maturational stages of the striped bass (Morone saxatilis), a highly advanced perciform fish.

Benjamin J Reading1, Robert W Chapman, Jennifer E Schaff, Elizabeth H Scholl, Charles H Opperman, Craig V Sullivan.   

Abstract

BACKGROUND: The striped bass and its relatives (genus Morone) are important fisheries and aquaculture species native to estuaries and rivers of the Atlantic coast and Gulf of Mexico in North America. To open avenues of gene expression research on reproduction and breeding of striped bass, we generated a collection of expressed sequence tags (ESTs) from a complementary DNA (cDNA) library representative of their ovarian transcriptome.
RESULTS: Sequences of a total of 230,151 ESTs (51,259,448 bp) were acquired by Roche 454 pyrosequencing of cDNA pooled from ovarian tissues obtained at all stages of oocyte growth, at ovulation (eggs), and during preovulatory atresia. Quality filtering of ESTs allowed assembly of 11,208 high-quality contigs ≥ 100 bp, including 2,984 contigs 500 bp or longer (average length 895 bp). Blastx comparisons revealed 5,482 gene orthologues (E-value < 10-3), of which 4,120 (36.7% of total contigs) were annotated with Gene Ontology terms (E-value < 10-6). There were 5,726 remaining unknown unique sequences (51.1% of total contigs). All of the high-quality EST sequences are available in the National Center for Biotechnology Information (NCBI) Short Read Archive (GenBank: SRX007394). Informative contigs were considered to be abundant if they were assembled from groups of ESTs comprising ≥ 0.15% of the total short read sequences (≥ 345 reads/contig). Approximately 52.5% of these abundant contigs were predicted to have predominant ovary expression through digital differential display in silico comparisons to zebrafish (Danio rerio) UniGene orthologues. Over 1,300 Gene Ontology terms from Biological Process classes of Reproduction, Reproductive process, and Developmental process were assigned to this collection of annotated contigs.
CONCLUSIONS: This first large reference sequence database available for the ecologically and economically important temperate basses (genus Morone) provides a foundation for gene expression studies in these species. The predicted predominance of ovary gene expression and assignment of directly relevant Gene Ontology classes suggests a powerful utility of this dataset for analysis of ovarian gene expression related to fundamental questions of oogenesis. Additionally, a high definition Agilent 60-mer oligo ovary 'UniClone' microarray with 8 × 15,000 probe format has been designed based on this striped bass transcriptome (eArray Group: Striper Group, Design ID: 029004).

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22353237      PMCID: PMC3305648          DOI: 10.1186/1756-0500-5-111

Source DB:  PubMed          Journal:  BMC Res Notes        ISSN: 1756-0500


Background

The striped bass and its relatives in the genus Morone (the temperate basses) are ecologically and economically important aquaculture and fisheries species native to estuaries and rivers of the Atlantic coast and Gulf of Mexico in North America [1,2]. Although the striped bass and its hybrids have been reared as commercial aquaculture products in the United States since the late 1980s, little genetic information is available for these species in public databases at the National Center for Biotechnology Information (NCBI) or elsewhere, consisting only of microsatellite DNA markers [3,4], the mitochondrial genome (GenBank: HM447585), and a medium density genetic linkage map [5]. A major factor contributing to restricted growth of hybrid striped bass farming nationwide is reproductive dysfunction of female striped bass, resulting in non-viable eggs, embryos, and larvae [6]. These reproductive failures hamper selective breeding efforts required for species domestication and improvement. The exact cause(s) of poor egg quality and embryonic mortality in farmed fishes, however, still remain to be discovered, making appropriate and timely corrective measures difficult to achieve [review: [7,8]]. Functional genomics has emerged as a major research field and gene expression (transcriptomics) and proteomics studies are promising approaches to gain new insights into reproductive molecular biology [7,9-12]. Marked advancement in striped bass reproductive technology based on such "Omic" analyses is, however, currently restricted due to the lack of an available, comprehensive sequence database for this species or for other members of the genus Morone that are important in aquaculture (e.g. hybrid striped bass) or as research models (e.g. white perch, M. americana). Transcriptome resources are currently available for other commercially important fishes, including rainbow trout (Oncorhynchus mykiss) [13-16], coho salmon (Oncorhynchus kisutch) [17], tilapia (Oreochromis mossambicus) [18], Atlantic halibut (Hippoglossus hippoglossus) [19], Senegalese sole (Solea senegalensis) [20], Atlantic salmon (Salmo salar) [21], and cod (Gadus morhua) [22]. The emergence of pyrosequencing and later generation DNA sequencing technologies has made acquisition of significant genomic resources accessible and affordable for non-model organisms [23-25]. Vast numbers of expressed sequence tags (ESTs) can readily be generated using these methods, providing direct evidence of gene transcription, and collections of such EST sequences are presently the most important resources used for transcriptome exploration [26]. Depending on the number of ESTs sequenced, resulting databases can represent a high proportion of the total number of gene transcripts expressed by a given tissue (i.e. transcriptome), making downstream procedures for transcriptome profiling, such as oligo microarray or real-time quantitative reverse transcription PCR, tractable without the need for an entire genome sequence. When sequencing depth is limited, organ specific EST collections permit more efficient gene expression analyses using 'UniClone' microarrays, which are comprised of probe sequences isolated from a single organ type [27-30]. UniClone arrays represent a larger proportion of a target organ transcriptome and have reduced redundancy when compared to arrays comprised of ESTs derived from several different tissue types. Additionally, to realize the full benefits of proteomic analyses based on mass spectrometry, species-specific ESTs are required, since algorithms used for spectral analyses (e.g. SEQUEST, Proteome Discoverer Software, Thermo Scientific, West Palm Beach, FL) require a homologous reference sequence database. For non-model organisms, sequence information from even closely related species can be insufficient for the accurate identification of peptides, since these algorithms tend to be conservative and heterospecific amino acid substitutions can result in peptide misidentification or an inability to detect orthologues [31]. Therefore, the goal of the present study was to provide an ovary transcriptome database representative of all stages of oogenesis and atresia in striped bass, one that could provide the requisite foundation for functional genomics and proteomics investigations of reproduction and egg quality in this species and that would support similar studies in the other temperate basses.

Results

A total of 230,151 EST short read sequences with a combined length of 51,259,448 bp (average length 224 bp) were generated from cDNA pooled from ovarian tissues and eggs encompassing the various stages of ovary growth, maturation and atresia. A total of 11,208 high-quality contigs with a length of at least 100 bp were assembled and these included 2,984 contigs that were 500 bp or longer (average length 895 bp; total length 5,068,343 bp) (Additional File 1). Blastx comparisons revealed 5,482 orthologues, of which 4,120 (36.7%) were annotated with Gene Ontology (GO) terms. The number of unknown, unique sequences was 5,726 (51.1%). The breakdown of GO annotation classes within the three categories of GO terms for all annotated sequences is shown in Figure 1: Biological Process (2nd level) and Molecular Function and Cellular Component (3rd level). A complete list, in FASTA format, of the contig assemblies identified by their annotations are included as Additional File 2 and a list of the assemblies and their GO terms are included as Additional File 3.
Figure 1

Gene ontology graph of A. Cellular Component (3rd level GO terms), B. Molecular Function (3rd level GO terms), and C. Biological Process (2nd level GO terms) of annotated genes in the striped bass ovary transcriptome. The number of GOs in each class is shown and sections that contained 50-150 entities are represented in black, 151-500 by dark gray, 500 and up by light gray, and the predominant class is indicated in white.

Gene ontology graph of A. Cellular Component (3rd level GO terms), B. Molecular Function (3rd level GO terms), and C. Biological Process (2nd level GO terms) of annotated genes in the striped bass ovary transcriptome. The number of GOs in each class is shown and sections that contained 50-150 entities are represented in black, 151-500 by dark gray, 500 and up by light gray, and the predominant class is indicated in white. There were 66 contigs that were each assembled from groups of ESTs that comprised ≥ 0.15% of the total 230,151 reads (i.e. ≥ 345 reads per contig) and these contigs were considered to have abundant ovary expression. These contigs were identified by NCBI UniGene cluster and compared to zebrafish, Danio rerio, orthologues evaluated by Digital Differential Display (DDD) (Table 1). Twenty-two striped bass genes from this list (33.3% of the total listed) either had no blastx returns (i.e. were novel), or were identified as being unnamed gene products, or had gene names but no zebrafish UniGene orthologues. These were excluded from further evaluation. Of the remaining informative 44 genes, 23 (52.5%) are predicted to have predominant ovary expression based on DDD of zebrafish orthologues, 11 (25.0%) would be expected to have no difference in expression between ovary and other tissues of the body based on the DDD results, and 10 (22.7%) would likely have predominant expression in other tissues of the body based on the DDD comparison. Overall, the estimated 66 most abundantly expressed striped bass ovary genes were assembled from ~1/6 of the total number of short read sequences (Table 1).
Table 1

Transcripts abundantly expressed in the striped bass ovary.

Contig NumberBLAST 2GO AnnotationGeneGeneID zebrafish taxid: 7955 orthologueAssembled contig length (bp)Number of observe sequence reads% Total sequence reads (230,151)Fraction of ESTs that mapped to the zebrafish UniGene by DDDZebrafish UniGene

OvaryBody
110186cyclin b2ccnb2368316128411460.49793400.0025>0.0001Dr.80580

210415zona pellucida glycoproteinzp2.3114439132910760.46751920.0429>0.0012Dr.143785

310181novel protein with zona pellucida-like domainsi: ch211-14a17.736866964610010.43493180.0015>0.0001Dr.75717

49349zona pellucida czpcx33401120369230.40104110.0013>0.0001Dr.80433

5146nad h quinone 1nqo13225069169080.3945236n.d.=n.d.Dr.4189

68878tubulin beta 2czgc: 12319464142115108690.3775782n.d.=n.d.Dr.52550

79768egg envelope component zpaxsi: dkeyp-50f7.233403628908640.37540570.0017>0.0003Dr.105787

810472fatty acid binding protein liverfabp1b.15540954198480.3684538n.d.=n.d.Dr.24261

99294--NA------8128390.3645433------

1010137choriogenin 1zp3b6469213898170.35498430.0029>0.0003Dr.75734

1111102hypothetical protein LOC100049339polr2a5533477747670.3332595**Dr.79109

1211074--NA------1817620.3310870------

1310663zgc: 175135 proteinzgc: 1655511000039696367060.30675510.0039>0.0003Dr.106137

149917heat shock protein 8hspa857337622666990.30371360.0011<0.0029Dr.75087

1511091novel protein with zona pellucida-like domainLOC10033170710033170712196750.2932857------

163--NA------15856540.2841613------

1711147fatty acid-binding heartfabp11a4479445816380.2772093n.d.=n.d.Dr.78045

1810883mgc86501 proteinwu: ft38e017989965686230.270699190.0024>0.0002Dr.106837

199329histoneh3f3c3362319456190.26895390.0001<0.0003Dr.75577

2010302voltage gated chloride channel domain-containing protein----9966160.2676504------

2111112egg envelope component zpczp3c56317915276100.26504340.0002>0Dr.113688

2230histone h2aLOC573838 (h2af1o)1003322294476070.26373990.0024>0.0002Dr.75698

2310079--NA------8115850.2541810------

2410058beta-actinbactin25793518745780.25113950.0026<0.0077Dr.75125

2510823apolipoprotein dzgc: 1233395679728165600.2433185**Dr.15815

2610825--NA------1545550.2411460------

2710773hypothetical protein LOC100049339--307057565550.2411460------

286635h1 histone member oocyte-specifich1m3274038235230.2272421n.d.=n.d.Dr.75735

2911098adp atp translocaseslc25a519232112435150.22376610.0015<0.0078Dr.30295

30127nucleoside diphosphate kinase bnme2b.1300838345110.2220281n.d.=n.d.Dr.11052

Contig NumberBLAST 2GO AnnotationGeneGeneID zebrafish taxid: 7955 OrthologueAssembled contig length (bp)Number of observe sequence reads% Total sequence reads (230,151)Fraction of ESTs that mapped to the zebrafish UniGene by DDDZebrafish UniGene

OvaryBody

311030960 s acidic ribosomal protein p0rplp0581019324970.21594520.0008<0.0033Dr.55617

3211081loc494706 protein (oogenesis-related gene)org1000011106014950.21507620.0016>0.0001Dr.80745

3310120elongation factor 1 alphaefla3051617444920.21377270.0032<0.0108Dr.31797

3410015heat shock protein 90hsp90ab13057319004850.21073120.0006<0.0020Dr.35688

3511073unnamed protein product----4144810.2089932------

3610797complement component (3b 4b) receptor 1LOC56554156554116964700.2042138**Dr.91858

3792cyclin b1ccnb1580257384700.20421380.0035>0.0002Dr.121261

3810403--NA------3274690.2037793------

39126karyopherin alpha 2 (rag cohort importin alpha 1)zgc: 5587740634310854690.20377930.0010>0.0002Dr.20877

4010948--NA------2484650.2020413------

4110900zpb proteinLOC10033427510033427515614610.2003033**Dr.141250

4236claudin 4cldnd815837314560.19813080.0004>0.0001Dr.75663

43216stathmin 1 oncoprotein 18 variant 8stmn1b5505489644500.19552380<0.0004Dr.105609

4410949--NA----5501341514200.1824889------

459337Securin [Anoplopoma fimbria]LOC5666905666904354140.17988190.0002>0Dr.118007

469321dna replication inhibitorgmnn36832011214120.1790129n.d.=n.d.Dr.119358

4710986cell division cycle 20 homolog (cerevisiae)cdc2040635315974100.17814390.0005>0.0001Dr.105018

4811071--NA------2154020.1746679------

4910743--NA------2733980.1729299------

501174cyclin kLOC10033130410033130433313970.17249540.0009>0Dr.148591

5110438ribonucleotide reductase m2 polypeptiderrm23073316213960.17206100.0018>0.0003Dr.75098

5211198ribosomal protein s20rps204064854773930.17075750.0014>0.0008Dr.18943

5311014karyopherin alpha 2 (rag cohort importin alpha 1)kpna24366075343800.16510900.0009>0.0002Dr.75709

5410351--NA------2993750.1629365------

5510265unnamed protein product----10753750.1629365------

56771cytochrome c oxidase copper chaperonecox174479144103750.16293650.0007>0.0001Dr.82168

5710107tubulin, alpha 1cMGC1714075731226973740.1625020n.d.=n.d.Dr.120425

58161--NA------25323710.1611985------

59231epididymal secretory protein e1 precursornpc22826737283600.1564190------

6011090--NA------3083560.1546811------

Contig NumberBLAST 2GO AnnotationGeneGeneID zebrafish taxid: 7955 orthologueAssembled contig length (bp)Number of observed sequence reads% Total sequence reads (230,151)Fraction of ESTs that mapped to the zebrafish UniGene by DDDZebrafish UniGene

OvaryBody

6110741ppia protein (pepitidylprolyl isomerase A)ppia3366128253560.15468110.0005<0.0011Dr.104642

629354superoxide dismutasesod1305537953560.1546811n.d.=n.d.Dr.75822

6310048ubiquitin bubb5501341693550.1542466n.d.=n.d.Dr. 104259

6410083cyclin a2ccna219229521083510.1525086n.d.=n.d.Dr.121874

6510746eukaryotic translation elongation factor 1 gammaeef1g19582215333500.15207410.0006<0.0011Dr.75657

6610761egg envelope component zpaxsi: dkeyp-50f7.233403627313470.15077060.0017>0.0003Dr.105787

TOTALS691733653215.8730570

Genes are ranked (1-66) by number of observed 454 short read sequences used in each contig assembly. Digital Differential Display (DDD) results of orthologous sequences in zebrafish are also shown

Annotation "--NA--"indicates no blastx return; Dashes (--) indicate unknown or data not available; asterisks (*) indicate the UniGene was not present in the EST libraries used for DDD. Sequences with expression differences evaluated by DDD (FET, P ≤ 0.05) are indicated by ">" (enhanced ovary expression) or "<" (enhanced body expression); "n.d." indicates no significant difference in expression between ovary and body (=)

Transcripts abundantly expressed in the striped bass ovary. Genes are ranked (1-66) by number of observed 454 short read sequences used in each contig assembly. Digital Differential Display (DDD) results of orthologous sequences in zebrafish are also shown Annotation "--NA--"indicates no blastx return; Dashes (--) indicate unknown or data not available; asterisks (*) indicate the UniGene was not present in the EST libraries used for DDD. Sequences with expression differences evaluated by DDD (FET, P ≤ 0.05) are indicated by ">" (enhanced ovary expression) or "<" (enhanced body expression); "n.d." indicates no significant difference in expression between ovary and body (=) All of the high-quality ESTs have been deposited in the NCBI Short Read Archive (GenBank: SRX007394) and annotated contigs are posted under "Resources" on the National Animal Genome Research Program Aquaculture Genome Projects website (http://www.animalgenome.org/aquaculture/database/) [32]. These contigs also have been submitted to Agilent Technologies eArray (Santa Clara, CA) for ovary UniClone microarray design (http://www.chem.agilent.com/). We designed a high definition 60-mer SurePrint oligo array with 8 × 15,000 probe format comprised of 11,145 UniGene probes from the transcriptome, plus an additional 3,854 probes printed in duplicate or selected from Morone cDNAs available from NCBI or from our own unpublished results (B.J. Reading and C.V. Sullivan, unpublished data) and datasets (eArray Group: Striper Group, Design ID: 029004).

Discussion

This collection of ESTs represents the first contribution of a large reference sequence database for species of the genus Morone and provides a basis for future gene expression studies in these temperate basses. Availability of characterized ovarian transcriptomes from fishes other than zebrafish is limited. Partial transcriptomes have been reported for tilapia (474 EST assemblies) [18] and for cod (1,361 EST assemblies) [22]. Several thousand ovarian ESTs have been reported for salmonid fishes [[13,15,33] and references therein], but to our knowledge these have not been assembled into a comprehensive ovarian transcriptome. Numbers of total ESTs currently available in the NCBI EST database for some other commercially important finfishes are as follows: rainbow trout (287,967), coho salmon (4,942), tilapia (Genus Oreochromis, 121,346), Atlantic halibut (20,836), Senegalese sole (10,631), Atlantic salmon (498,212), and cod (229,094). Therefore, the 230,151 ESTs reported herein represent a comparatively valuable transcriptome resource for striped bass. If the 11,208 contigs are considered to be UniGenes, this represents a substantial proportion of the estimated total protein-coding gene transcripts expressed by the striped bass ovary (i.e. transcriptome) as the average number of mRNA transcripts expressed by a single tissue type is estimated to be between 10,000-15,000 [34], but can be as low as 8,200 [35]. Since over 1,300 GOs from Biological Process classes of Reproduction (121), Reproductive process (55), and Developmental process (1,188) were assigned to the annotated contigs (Figure 1), this sequence collection should prove to be a powerful tool for analysis of ovarian gene expression related to fundamental questions of oogenesis. Approximately 52.5% of the informative contigs considered to have abundant ovary expression (i.e. those with ≥ 345 reads per contig) were also predicted to have predominant expression in striped bass ovary through DDD comparisons to zebrafish orthologues (Table 1). These include cyclin B2 (ccnb2, contig10186), several egg envelope and zona pellucida proteins, histone H2A (h2af1o, contig00030), oogenesis-related gene (org, contig11081), cyclin B1 (ccnb1 contig00092), karyopherin alpha 2 (kpna2, contigs 00126 and 11014), claudin 4 (cldnd, contig00036), securin (LOC566690, contig 09337), cell division cycle 20 homolog (cdc20, contig10986), cyclin K (LOC100331304, contig11174), ribonucleotide reductase M2 polypeptide (rrm2, contig10438), ribosomal protein S20 (rps20, contig11198), cytochrome C oxidase copper chaperone (cox17, contig00771), and epididymal secretory protein E1 (npc2, contig00231). Many of these are well-characterized ovary transcripts and several recent and informative papers have been published detailing the functions of these genes and their protein products in fish oocytes and embryos [see: [7,8,13-20,27,28,36-38]]; others are briefly detailed below. The remaining 47.5% of abundant striped bass ovary genes that were compared to zebrafish orthologues in the DDD were predicted to have indifferent or predominant expression levels in other tissues of the body relative to the ovary. These may represent constitutively expressed genes or those expressed at high levels in the ovary albeit comparatively lower than in other tissues of the body, respectively. Examples of potential genes with constitutive expression include NADH quinone 1 (nqo1, contig00146), tubulin (zgc:123194, contig08878 and MGC171407, contig10107), fatty acid binding proteins (fabp1b, contig10472 and fabp11a, contig11147), H1 histone member oocyte-specific (h1m, contig06635), nucleoside diphosphate kinase B (nme2b, contig00127), geminin DNA replication inhibitor (gmnn, contig09321), superoxide dismutase (sod1, contig09354), ubiquitin B (ubb, contig10048), and cyclin A2 (ccna2, contig10083). Of these, fatty acid-binding protein heart (fabp11a) has been shown to be up-regulated in ovary of rainbow trout females that mature precociously [13] and an orthologue of h1m (H1foo) is generally considered to be an oocyte specific histone in mouse (Mus musculus) [39,40], contrary to the DDD prediction. The UniGene EST Profile of zebrafish h1m (Dr. 75735) indicates that it is predominantly expressed in skin, however the second most abundant site of expression is the reproductive system. The following genes expressed in striped bass ovary are also expressed in zebrafish ovary, however the DDD indicates that they are predominantly expressed in other tissues of the body (Table 1): histone (h3f3c, contig09329), beta-actin (bactin2, contig10058), ADP/ATP translocase (slc25a5, contig11098), 60S acidic ribosomal protein P0 (rplp0, contig10309), elongation factor 1 alpha (ef1a, contig10120), peptidylprolyl isomerase A (ppia, contig10741), eukaryotic translation elongation factor 1 gamma (eef1g, contig10746), stathmin 1 oncoprotein 18 variant 8 (stmn1b, contig00216), and heat-shock proteins 8 (hspa8, contig09917) and 90 (hsp90ab1, contig10015). Ovarian representation of gene transcripts that show predominant expression in other tissues of the body is not surprising given the heterogeneous complexity of the ovary, which is comprised of vasculature, blood and other connective tissues, the somatic follicle, and germ cells. Furthermore, most of these genes, for example ef1a and bactin2, are considered to have constitutively high expression in most tissues, and this is supported by the corresponding zebrafish UniGene EST Profiles (Dr. 31797 and Dr.75125, respectively). There were, however, three exceptional genes whose expression, although considered to be lower in comparison to other tissues of the body by DDD, have been shown to be highly expressed in ovary. Stathmin (stmn) is expressed in oocytes and pre-implantation embryos of mice [41] and in cod ovary [22], and Stmn proteins have been detected in zebrafish ovary [36]. Contig00216 encodes a full-length, 147 amino acid Stmn and has been putatively identified as stmn1b, however it is highly similar to two zebrafish stmn isoforms (95% and 94% amino acid identity with stmn1b and stmn1a, respectively). Although stmn1b has body predominant expression in zebrafish by DDD (Table 1), zebrafish stmn1a (UniGene Dr.52664) shows ovary predominant expression and, therefore, contig00216 may actually be orthologous to stmn1a. Given the high similarity of this sequence to both zebrafish stmn1 isoforms, it is not possible to definitively assign identity without comparison to the other striped bass stmn isoform, which is unavailable. Recently, hsp8 and hsp90 (corresponding to striped bass hspa8 and hsp90ab1, respectively) have been characterized as some of the most abundant genes expressed in mouse and fish eggs at both the transcript and protein levels [36,37,42]. This inconsistent result may relate to the inherent weaknesses of DDD, since only highly expressed genes are adequately represented in the EST libraries used to conduct the in silico comparisons and the Fisher's exact test (FET) is conservative [43]. Although this method does not offer quantitation, ranking of the striped bass contigs by number of short reads used in assembly paired with comparisons to zebrafish orthologues evaluated by DDD proved to be a useful tool for estimating relative ovarian abundance of the striped bass gene transcripts. Reservation must be taken when considering such interspecific DDD comparisons for the purpose of excluding genes that are predicted to have less predominant expression in one tissue compared to another, since they may be highly expressed in both. This is a promising approach for characterization of novel gene transcripts from EST libraries and has recently been used to identify ovary specific genes in zebrafish [44] and rainbow trout [15], however such results should be further validated using an experimental evaluation of gene expression. The growing oocyte is considered to be largely transcriptionally inactive, acting as a storehouse of specific maternal RNAs, proteins, and other molecules required for competency for fertilization, initiation of zygotic development, and transition to embryonic gene expression [review: [37,38]]. These maternal factors may be stored in oocytes for extended periods of time until use (e.g. months to years). Therefore, a system of regulatory proteins and RNAs must mediate the oocyte cell cycle during growth, ovarian maturation (OM), and zygotic development from fertilization until activation of the embryonic genome at the mid-blastula transition [45]. A number of known cell-cycle regulators and proteins critical for these processes have been identified as predominantly expressed in striped bass ovary (Table 1). Examples include cyclins B1 and B2 (ccnb1, ccnb2) [46-49], cyclin K (ccnk) [50], securin [51], cdc20 [27], kpna2 [22,52], gmnn [53], h2af1o [54] and org [44]. Transcripts encoding several different cell division and cell cycle regulatory proteins were similarly reported in the ovaries of cod [22] and rainbow trout [13]. Solute carrier protein (SLC) family members are selected to illustrate representation of sequences in the striped bass ovary transcriptome encoding proteins from a large gene series. The SLCs are a diverse group of eukaryotic membrane proteins that control cellular influx and efflux of solutes, including ions, fatty acids, amino acids, sugars, drugs, and vitamins [55,56]. The Human Genome Gene Nomenclature Committee [57] classifies approximately 400 different human SLCs into 47 families. At least one representative protein from 19 (~40.4%) of these families was identified in the striped bass ovary transcriptome (Table 2). Characterization of SLC gene expression in growing oocytes and during OM would be of direct importance to understanding mechanisms of oogenesis and egg quality in light of what is known of oocyte and egg physiology. Due to osmoregulatory requirements imposed by both fresh and marine waters, embryos of egg-laying fishes develop within the confines of an established chorion that becomes osmotically closed after fertilization. Therefore, ovulated eggs must contain all of the water required during embryogenesis as a medium and substrate for biochemical reactions and as a diluent for waste products (e.g. ammonia). Furthermore, water contributes to appropriate egg buoyancy, especially in marine fishes that spawn pelagic eggs. Prior to ovulation, a hyperosmotic solute concentration develops within the oocytes of these species, followed by passive influx of water through aquaporin membrane channels [review: [58,59]]. Inorganic ions have primarily been implicated in this phenomenon, however the exact mechanisms of their entry have not been verified. Bobe et al. [14] demonstrated up regulation of slc26 (Pendrin) and aqp4 (aquaporin 4) expression in ovary of rainbow trout during OM. Gene transcripts encoding a slc26a6-like protein, along with several other ion transporters (Table 2) and aquaporin 1 (contig08717) were identified in striped bass ovary. This indicates the potential for discovery of previously unknown mechanisms of teleost oocyte hydration by gene expression analyses of these particular SLCs and water transport genes in the striped bass and related species (genus Morone), which can tolerate a wide range of environmental salinities.
Table 2

Solute carrier family members identified in the striped bass ovary transcriptome

ContigGeneGene ID Danio orthologueContig Length (bp)Solute carrier family function
04292aslc3a2796322629Heavy subunit of the heteromeric amino acid transporters (Na+-independent, transport of large neutral amino acids: phenylalanine, tyrosine, leucine, arginine and tryptophan)

10145slc3a2-like1000038051740Heavy subunit of the heteromeric amino acid transporters (Na+-independent, transport of large neutral amino acids: phenylalanine, tyrosine, leucine, arginine and tryptophan)

09132bslc4a7568872563Electroneutral Na+ and HCO3--dependent cotransporter

11036cslc7a2100007793815Cationic amino acid transporter/glycoprotein-associated amino-acid transporter (transport of the cationic amino acids including arginine, lysine and ornithine)

00672dslc7a8100007704987Na+-independent, transporter of small and large neutral amino acids such as alanine, serine, threonine, cysteine, phenylalanine, tyrosine, leucine, arginine and tryptophan; when associated with Slc3a2, acts as an amino acid exchanger

05979slc7a10567420240Na+-independent, high affinity transport of small neutral D- and L-amino acids

04450slc9a3r1327272385Na+/H+ exchanger

02807slc10a3406519692Na+/bile acid cotransporter

06556slc10a4556491249Na+/bile acid cotransporter

03289slc12a5-like572215251Electroneutral cation/Cl- cotransporter (K+/Cl- transporter)

04100slc19a2-like100329244778Thiamine transporter

00585slc20a1a4064582129Na+-dependent PO43- transporter

05003slc20a1b321541246Na+-dependent PO43- transporter

00176slc25a33223621448Mitochondrial carrier (PO43- transporter)

01147slc25a51923211302Mitochondrial carrier (ADT/ATP translocator)

01400eslc25a12337675693Mitochondrial carrier (aspartate/glutamate transporter)

01037slc25a26560478349Mitochondrial carrier (S-adenosylmethionine transporter)

09234slc25a29569608579Mitochondrial carrier (carnitine/acylcarnitine transporter)

06849slc25a43796731254Mitochondrial carrier

07197fslc25a46436831251Mitochondrial carrier

08784slc26a6-like557779215Multifunctional anion exchanger (Pendrin-like; Cl-, oxalate, SO42-, and HCO3- transporter)

04105slc27a1541410265Fatty acid transporter (FATP-1; long-chain fatty acid translocator)

01322gslc29a1563580260Facilitative nucleoside transporter (cellular uptake of nucleosides)

05237slc30a2563540293Zinc transporter

06016slc30a2-like560642608Zinc transporter

05293hslc30a5436594506Zinc transporter

03716slc30a7327439392Zinc transporter (zinc efflux transporter)

09883slc31a2--2142Copper transporter (low affinity copper uptake)

02632slc35a2368487186Nucleoside-sugar transporter (UDP-galactose transporter)

07709slc35e1-like100332364249Nucleoside-sugar transporter

04693slc38a8-like795255414Na+-coupled neutral amino acid transporter

05870slc38a9562137243Na+-coupled neutral amino acid transporter

02706slc38a11550337347Na+-coupled neutral amino acid transporter

02072slc39a3321324414Metal ion transporter (zinc influx transporter)

08253islc39a13368686239Metal ion transporter (zinc influx transporter)

05275jslc44a1100333377256Choline transporter

02670slc44a2-like321056269Choline transporter

07718slc44a4-like393385255Choline transporter

05152slc48a1a436697853Heme transporter hrg1-B

For contigs with superscripted letters, see also the following corresponding contigs: a02586 (slc3a2); b07936, b03956 (slc4a7); c00241, c09351, c04452 (slc7a2); d04260 (slc7a8); e07750 (slc25a12); f06248 (slc25a46); g04062 (slc29a1); h09154 (slc30a5); i06486 (slc39a13); j02053 (slc44a1)

Solute carrier family members identified in the striped bass ovary transcriptome For contigs with superscripted letters, see also the following corresponding contigs: a02586 (slc3a2); b07936, b03956 (slc4a7); c00241, c09351, c04452 (slc7a2); d04260 (slc7a8); e07750 (slc25a12); f06248 (slc25a46); g04062 (slc29a1); h09154 (slc30a5); i06486 (slc39a13); j02053 (slc44a1)

Conclusions

In summary, as we continue to advance our understanding of reproduction in temperate basses of the genus Morone, this reference sequence database of ovarian transcripts will provide the requisite foundation for gene expression studies and will open avenues of research related to reproduction and egg quality. Several important candidate genes have already been identified for future study. Furthermore, these sequences have been used to design an ovary UniClone oligo microarray for assessing changes in gene expression during oogenesis and in female striped bass spawning good and poor quality eggs. Our recent deployment of this microarray in a study of striped bass egg quality has allowed us to detect differences in ovarian gene expression explaining and predicting most of the eventual variance in early embryo mortality among good and poor quality spawners.

Methods

Sample collection and preparation

Striped bass were reared in outdoor tanks at the N.C. State University Pamlico Aquaculture Field Laboratory [60]. As the striped bass is a group synchronous, single clutch, iteroparous spawner, ovarian tissues were collected by dissection or through ovarian biopsy [61] from females whose most advanced clutch of oocytes/eggs represented one of several stages (≥ 3 females/stage) of oocyte growth (early primary growth oocytes, diameter 49-81 μm; late primary growth oocytes showing evidence of lipid droplet accumulation, diameter 162-184 μm; vitellogenic growth oocytes, diameter 558-764 μm [see:[62][63]]), oocyte maturation (post-vitellogenic and maturing oocytes, diameter > 900 μm), and atresia [64], and ovulated eggs. All samples were preserved in RNAlater® (Applied Biosystems/Ambion; Austin, TX). Tissues were pooled in equal weight by oocyte/egg stage and total RNA was extracted in TRIzol® Reagent (Invitrogen; Carlsbad, CA). RNA quality was assessed by agarose gel electrophoresis and NanoDrop™ spectrophotometry (Fisher Scientific; Pittsburgh, PA). Dynabeads® (Invitrogen) were used to purify mRNA as described by the manufacturer.

cDNA library construction and sequencing

Ovary mRNA was submitted for cDNA synthesis at the N.C. State University Genomic Sciences Laboratory (Raleigh, NC). First and second strand cDNA was synthesized from 2.5 μg of Dnase treated mRNA using the SuperScript™ Double-Stranded cDNA Synthesis Kit (Invitrogen) and oligo (dT)17 according to the manufacturer. Approximately 2 μg of cDNA was prepared for FLX sequencing using standard Roche protocols [65]. Briefly, cDNA was nebulized to generate fragments averaging ~500 bp in length, fragment ends were repaired, and adapters containing PCR and sequencing primer annealing sites were ligated. Fragments were immobilized on beads, clonally amplified and then sequenced on a 1/2 plate using standard FLX platform (Roche; Indianapolis, IN).

Sequence assembly and annotation

Short reads were assembled into contigs using Roche's Newbler software (gsAssembler) with default settings except that the minimum overlap was set to 30 bp. Parameters were set to generate files for large contigs (> 500 bp) and for all contigs > 100 bp. High quality contig assemblies were subjected to BLAST (blastx) [66] of the NCBI database and annotated according to the Gene Ontology Consortium [67] using Blast2GO 2048 M version 12.2.0 [10,68,69]. Parameters for blastx were: Expect value 1.0E-3 and HSP Length Cutoff 33. Parameters for the GO annotations were: E-value-hit-filter 1.0E-6, Annotation Cutoff 55, GO Weight 5, and HSP-Hit Coverage Cutoff 0. Combined GO graphs for the annotated sequences (4,120 total) were created using percentages of 2nd level GO terms for Biological Process and 3rd level GO terms for Molecular Function and Cellular Component. Represented GO classes were restricted to those with 50 or more entities (sequence cutoff = 50.0); Sequence Filter = 50, Score alpha = 0.6, Node Score Filter = 10. Parameters for the Combined Graphs, Level Pie Configuration were: Ontology Level = Level 2 or 3 as described above.

Estimation of abundant gene transcripts

Contigs that were assembled from a number of ESTs comprising ≥ 0.15% of the total 230,151 short reads (i.e. those having ≥ 345 reads per contig) were considered to be abundant [see: [38]]. These contigs were ranked by relative abundance and compared to zebrafish orthologues shown to be ovary predominant by NCBI UniGene DDD [70], see: [15,44]. Zebrafish EST libraries were used to determine relative representation by DDD of orthologous UniGene clusters in ovary (104, 986 ESTs; Lib.IDs 20503, 15519, 20772, 20502, 19214, 15930, 9874, 9767) and body tissues excluding gonads (714, 604 ESTs; Lib.IDs 1520, 1521, 15438, 1028, 17704, 17768, 19753, 1522, 19745, 19746, 20694, 20725, 15518, 21372, 19747, 19748, 4913, 9766, 21371, 19741, 19749, 20771, 19739, 19740, 10504, 19737, 13027, 1029, 17276, 15077, 19752, 15517, 2387, 17282, 17284, 19738, 9968, 9993, 14182, 14249, 19217, 24670, 20072, 20071, 19253, 19219, 19218, 19215, 17283, 17275, 14410, 14409, 13866, 12106, 9706, 4264, 1727). Libraries with sequences derived from embryos, larvae, or whole bodies including gonads were excluded. The Fisher's exact test (FET) was used to determine difference between the number of times sequences from the ovary or body libraries were assigned to a specific UniGene cluster (P ≤ 0.05). Numerical DDD scores of genes with significantly different expression profiles were reported as the fraction of sequences within the EST libraries that mapped to the UniGene cluster.

Availability of supporting data

The data sets supporting the results of this article are available in the National Center for Biotechnology Information repository, Short Read Archive: SRX007394 and the National Animal Genome Research Program Aquaculture Genome Projects repository, http://www.animalgenome.org/aquaculture/database/.

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

BJR conducted the sample preparation, DDD statistical analyses, and drafted the manuscript. JES performed the FLX pyrosequencing and contig sequence assemblies. RWC performed the GO annotations. RWC, EHS, and CHO participated in design of the study and critical review of the manuscript. CVS conceived the study, participated in its design and coordination, and helped draft the manuscript. All authors read and approved the final manuscript.

Additional file 1

Striped bass ovary contig assemblies in FASTA format. Click here for file

Additional file 2

Striped bass ovary contig assemblies identified by their annotations in FASTA format. Click here for file

Additional file 3

List of striped bass ovary contig assemblies and their GO terms. Click here for file
  56 in total

1.  Utilization of digital differential display to identify novel targets of Oct3/4.

Authors:  Yoshimi Tokuzawa; Masayoshi Maruyama; Shinya Yamanaka
Journal:  Methods Mol Biol       Date:  2006

2.  A microsatellite linkage map of striped bass (Morone saxatilis) reveals conserved synteny with the three-spined stickleback (Gasterosteus aculeatus).

Authors:  Sixin Liu; Caird E Rexroad; Charlene R Couch; Jan F Cordes; Kimberly S Reece; Craig V Sullivan
Journal:  Mar Biotechnol (NY)       Date:  2011-10-04       Impact factor: 3.619

Review 3.  17 alpha,20 beta-dihydroxy-4-pregnen-3-one, a maturation-inducing hormone in fish oocytes: mechanisms of synthesis and action.

Authors:  Y Nagahama
Journal:  Steroids       Date:  1997-01       Impact factor: 2.668

4.  Maternal gene expression in Atlantic halibut (Hippoglossus hippoglossus L.) and its relation to egg quality.

Authors:  Maren Mommens; Jorge Mo Fernandes; Teshome T Bizuayehu; Sylvie L Bolla; Ian A Johnston; Igor Babiak
Journal:  BMC Res Notes       Date:  2010-05-24

5.  Novel expression of importin alpha homologue in marine teleost, Pagrus major.

Authors:  Koichiro Gen; Sonoko Yamaguchi; Koichi Okuzawa; Hirohiko Kagawa; Md Samsul Alam
Journal:  Comp Biochem Physiol B Biochem Mol Biol       Date:  2008-08-27       Impact factor: 2.231

6.  Molecular characterization and expression profiles of cyclin B1, B2 and Cdc2 kinase during oogenesis and spermatogenesis in rainbow trout (Oncorhynchus mykiss).

Authors:  Gao-Feng Qiu; Raghuveer K Ramachandra; Caird E Rexroad; Jianbo Yao
Journal:  Anim Reprod Sci       Date:  2007-03-07       Impact factor: 2.145

7.  Securin' M-phase entry.

Authors:  Mark J Solomon; Janet L Burton
Journal:  Nat Cell Biol       Date:  2008-04       Impact factor: 28.824

Review 8.  RNA-Seq: a revolutionary tool for transcriptomics.

Authors:  Zhong Wang; Mark Gerstein; Michael Snyder
Journal:  Nat Rev Genet       Date:  2009-01       Impact factor: 53.242

9.  Transcriptomic analyses reveal novel genes with sexually dimorphic expression in the zebrafish gonad and brain.

Authors:  Rajini Sreenivasan; Minnie Cai; Richard Bartfai; Xingang Wang; Alan Christoffels; Laszlo Orban
Journal:  PLoS One       Date:  2008-03-12       Impact factor: 3.240

10.  Molecular phenotype of zebrafish ovarian follicle by serial analysis of gene expression and proteomic profiling, and comparison with the transcriptomes of other animals.

Authors:  Anja Knoll-Gellida; Michèle André; Tamar Gattegno; Jean Forgue; Arie Admon; Patrick J Babin
Journal:  BMC Genomics       Date:  2006-03-09       Impact factor: 3.969

View more
  20 in total

1.  Identification of reproduction-related genes and pathways in the Culter alburnus H-P-G axis and characterization of their expression differences in malformed and normal gynogenetic ovaries.

Authors:  Yong Y Jia; Mei L Chi; Wen P Jiang; Shi L Liu; Shun Cheng; Jian B Zheng; Zhi M Gu
Journal:  Fish Physiol Biochem       Date:  2020-11-06       Impact factor: 2.794

2.  Transcriptome of Atlantic cod (Gadus morhua L.) early embryos from farmed and wild broodstocks.

Authors:  Carlos Frederico Ceccon Lanes; Teshome Tilahun Bizuayehu; Jorge Manuel de Oliveira Fernandes; Viswanath Kiron; Igor Babiak
Journal:  Mar Biotechnol (NY)       Date:  2013-07-27       Impact factor: 3.619

3.  Transcriptomic signatures for ovulation in vertebrates.

Authors:  Dong Teng Liu; Michael S Brewer; Shixi Chen; Wanshu Hong; Yong Zhu
Journal:  Gen Comp Endocrinol       Date:  2017-01-19       Impact factor: 2.822

4.  Stage Specific Transcriptomic Analysis and Database for Zebrafish Oogenesis.

Authors:  Yoel Bogoch; Allison Jamieson-Lucy; Charles E Vejnar; Karine Levy; Antonio J Giraldez; Mary C Mullins; Yaniv M Elkouby
Journal:  Front Cell Dev Biol       Date:  2022-06-06

5.  Lrp13 is a novel vertebrate lipoprotein receptor that binds vitellogenins in teleost fishes.

Authors:  Benjamin J Reading; Naoshi Hiramatsu; Justin Schilling; Katelyn T Molloy; Norm Glassbrook; Hiroko Mizuta; Wenshu Luo; David A Baltzegar; Valerie N Williams; Takashi Todo; Akihiko Hara; Craig V Sullivan
Journal:  J Lipid Res       Date:  2014-09-12       Impact factor: 5.922

6.  RNA-Seq-based transcriptome analysis of reproduction- and growth-related genes in Lateolabrax japonicus ovaries at four different ages.

Authors:  Chao Zhao; Pengfei Wang; Lihua Qiu
Journal:  Mol Biol Rep       Date:  2018-11-01       Impact factor: 2.316

7.  Transcriptome Profile Analysis of Ovarian Tissues from Diploid and Tetraploid Loaches Misgurnus anguillicaudatus.

Authors:  Weiwei Luo; Chuanshu Liu; Xiaojuan Cao; Songqian Huang; Weimin Wang; Yeke Wang
Journal:  Int J Mol Sci       Date:  2015-07-14       Impact factor: 5.923

8.  Adaptation to Low Salinity Promotes Genomic Divergence in Atlantic Cod (Gadus morhua L.).

Authors:  Paul R Berg; Sissel Jentoft; Bastiaan Star; Kristoffer H Ring; Halvor Knutsen; Sigbjørn Lien; Kjetill S Jakobsen; Carl André
Journal:  Genome Biol Evol       Date:  2015-05-20       Impact factor: 3.416

9.  Characterization of the heart transcriptome of the white shark (Carcharodon carcharias).

Authors:  Vincent P Richards; Haruo Suzuki; Michael J Stanhope; Mahmood S Shivji
Journal:  BMC Genomics       Date:  2013-10-11       Impact factor: 3.969

10.  Gene expression networks underlying ovarian development in wild largemouth bass (Micropterus salmoides).

Authors:  Christopher J Martyniuk; Melinda S Prucha; Nicholas J Doperalski; Philipp Antczak; Kevin J Kroll; Francesco Falciani; David S Barber; Nancy D Denslow
Journal:  PLoS One       Date:  2013-03-20       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.