| Literature DB >> 26599404 |
Peter A Bain1, Alexie Papanicolaou2, Anupama Kumar1.
Abstract
Murray-Darling rainbowfish (Melanotaenia fluviatilis [Castelnau, 1878]; Atheriniformes: Melanotaeniidae) is a small-bodied teleost currently under development in Australasia as a test species for aquatic toxicological studies. To date, efforts towards the development of molecular biomarkers of contaminant exposure have been hindered by the lack of available sequence data. To address this, we sequenced messenger RNA from brain, liver and gonads of mature male and female fish and generated a high-quality draft transcriptome using a de novo assembly approach. 149,742 clusters of putative transcripts were obtained, encompassing 43,841 non-redundant protein-coding regions. Deduced amino acid sequences were annotated by functional inference based on similarity with sequences from manually curated protein sequence databases. The draft assembly contained protein-coding regions homologous to 95.7% of the complete cohort of predicted proteins from the taxonomically related species, Oryzias latipes (Japanese medaka). The mean length of rainbowfish protein-coding sequences relative to their medaka homologues was 92.1%, indicating that despite the limited number of tissues sampled a large proportion of the total expected number of protein-coding genes was captured in the study. Because of our interest in the effects of environmental contaminants on endocrine pathways, we manually curated subsets of coding regions for putative nuclear receptors and steroidogenic enzymes in the rainbowfish transcriptome, revealing 61 candidate nuclear receptors encompassing all known subfamilies, and 41 putative steroidogenic enzymes representing all major steroidogenic enzymes occurring in teleosts. The transcriptome presented here will be a valuable resource for researchers interested in biomarker development, protein structure and function, and contaminant-response genomics in Murray-Darling rainbowfish.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26599404 PMCID: PMC4658143 DOI: 10.1371/journal.pone.0142636
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistics of the transcriptome assembly.
|
| 262,050 |
|
| 149,742 |
|
| 44.51 |
|
| 3,367 |
|
| 655 |
|
| 1539 |
|
| 85 |
|
| 137 |
|
| 403,186,007 |
Fig 1Analysis of the completeness of the rainbowfish transcriptome.
Sequence similarity searches were conducted using the full set of predicted proteins derived from the Japanese medaka genome (Oryzias latipes; Ensembl release 76) as BLASTP queries against translated open reading frames from the Murray-Darling rainbowfish transcriptome. From 24,674 medaka query sequences, 23,601 high-quality (BLASTP e-value ≤ 1e-10) matches were obtained (95.7%) with a mean query coverage of 92.1%. Alignment length distributions are shown (bin size 5%), expressed as a proportion of medaka protein sequence lengths (% query coverage). INSET: A similar analysis conducted using the highly conserved subset of core eukaryotic genes (CEGs) from medaka.
Fig 2Phylogenetic analysis of candidate nuclear receptors from rainbowfish and nuclear receptors sequences from the Japanese medaka (Oryzias latipes) genome.
Node labels indicate percent bootstrap support from 100 replications. Branch lengths are not to scale. Candidate nuclear receptors from the rainbowfish transcriptome are indicated by their coding sequence identifiers (m.*). Abbreviations: AR, androgen receptor; BAR, bile acid receptor; COUP-TF, chicken ovalbumin upstream promoter transcription factor; DAX-1, dosage-sensitive sex reversal, adrenal hypoplasia critical region, on chromosome X, gene 1; PNR, photoreceptor cell-specific nuclear receptor; ER, estrogen receptor; ERR, estrogen receptor-related; FTZ-F1, fushi tarazu factor 1; GR, glucocorticoid receptor; HNF, hepatocyte nuclear factor; LRH, liver receptor homologue; LXR, liver X receptor; MR, mineralocorticoid receptor; NR, nuclear receptor; PPAR, peroxisome proliferator-activated receptor; PR, progestin receptor; RAR, retinoic acid receptor; ROR, RAR-related orphan receptor; RXR, retinoid X receptor; TR, thyroid hormone receptor; VDR, vitamin D receptor. NRs not assigned a common name are denoted by their subfamily, group and member number, i.e. ‘NR5A2’ represents nuclear receptor subfamily 5, group A, member 2.
Fig 3Schematic diagram of likely steroidogenic pathways in Murray-Darling rainbowfish.
ORF identifiers corresponding to deduced proteins displaying high similarity to biosynthetic enzymes are shown alongside enzyme descriptions at each step of the biosynthetic pathway. Adapted from Häggström and Richfield [85] with reference to known teleost steroidogenic pathways [86,87,88].
Rainbowfish transcripts with high similarity to estrogen-responsive genes from related teleosts.
| Description | Rainbowfish transcript|ORF | Deduced protein length (aa) | Closest BLAST match (GenBank non-redundant protein sequences, | Length of closest match (aa) | GenBank Accession |
|---|---|---|---|---|---|
| Vitellogenin A | comp98304_c0_seq1|m.58126, comp98304_c0_seq2|m.58130, comp98304_c0_seq3|m.58134 | 1,717, 1,699, 1,690 | vitellogenin [ | 1,695 | BAD93697 |
| Vitellogenin B | comp102542_c3_seq1|m.120426, comp102542_c3_seq2|m.120431, comp102542_c3_seq3|m.120436, comp102542_c3_seq4|m.120441 | 1,733, 1,717, 1,713, 1,709 | vitellogenin Ab [ | 1,692 | AFA26670 |
| Vitellogenin C | comp42500_c0_seq1|m.2380, | 1,244 | vitellogenin C [ | 1,275 | AFA26671 |
| Estrogen receptor α | comp105150_c0_seq9|m.186792 | 611 | estrogen receptor alpha [ | 602 | ABY19510 |
| Estrogen receptor β1 | transcript from alternative assembly (ovary) | 559 | estrogen receptor beta 1 [ | 558 | ABY19511 |
| Estrogen receptor β2 | comp103476_c2_seq6|m.142553 (manually edited to remove likely assembly error in poly-U tract and verified by sequencing) | 665 | estrogen receptor beta 2 [ | 666 | NP_001266406 |
| Choriogenin H | comp78177_c1_seq1|m.9381 | 466 | choriogenin H minor [ | 452 | BAJ07537 |
| Choriogenin L | comp78177_c1_seq1|m.9382 | 440 | L-SF precursor [ | 420 | NP_001098273 |
Fig 4Confirmation of selected transcript variants of a putative rainbowfish ERα cluster by endpoint reverse-transcriptase PCR (RT-PCR).
(A) Schematic diagram of three transcripts in the cluster–comp105150_c0_seq9, comp105150_c0_seq11 and comp105150_c0_seq12, showing binding sites for the PCR primers listed in Table 4. (B) Agarose gel electrophoresis of RT-PCR products generated using the indicated primer pairs. Primers targeting 18S rRNA were used to confirm the quality of cDNA in all samples. Abbreviations: kb, kilobases; M, marker; NT, no-template control; MB, male brain; MG, male gonad; ML, male liver; FB, female brain; FG, female gonad; FL, female liver; 18S, 18S rRNA.
Primers used for RT-PCR analysis of rainbowfish ERα 5’-end variants.
|
|
|
|
|---|---|---|
| 1 | Seq9/11/12 | CACCATGATTATTGATTCGGC |
| 2 | Seq12 | ATGTTTAAGAGGCAGAGCCTG |
| 3 | Seq9/11 | GCTGTGATGTTGCTCAGGCAG |
| 4 | Seq9/11 | ATGTTGCTCAGGCAGAGCCT |
| 5 | Seq9/11/12 | GCAGAAAGAAAAGGCACCAG |
| 6 | Seq9/11/12 | TCATAGTGCGTGGGCGCA |
| 7 | Seq9/11/12 | CCATCTCATAGTGCGTGGGC |
Transcripts involved in aryl hydrocarbon receptor (AhR)-mediated adaptive stress responses.
Transcripts were classified into two functional groups–the nuclear AhR receptor complex (GO: 0034753) or AhR target genes comprising selected transcripts strongly upregulated by 2,3,7,8-Tetrachlorodibenzo-p-dioxin selected from Li et al. [124] and Watson et al. [125].
| Putative description | Transcript/CDS identifier | Deduced protein length (aa) | Closest match (GenBank non-redundant protein sequences, | Length of closest match (aa) | Accession |
|---|---|---|---|---|---|
| AhR receptor complex | |||||
| Aryl hydrocarbon receptor 1A (AhR1A) | comp328301_c0_seq1|m.267373 | 268 | aryl hydrocarbon receptor 1 [ | 944 | AAR19369 |
| Aryl hydrocarbon receptor 2A (AhR2A) | comp94961_c0_seq1|m.36116 | 914 | aryl hydrocarbon receptor 2 [ | 990 | BAE02825 |
| Aryl hydrocarbon receptor nuclear translocator (ARNT) | comp106403_c1_seq1|m.227926 | 638 | PREDICTED: aryl hydrocarbon receptor nuclear translocator-like protein 1-like isoform X2 [ | 638 | XP_005449385 |
| Aryl hydrocarbon receptor repressor (AhRR) | comp89767_c1_seq1|m.20660 | 639 | PREDICTED: uncharacterized protein LOC100696058 [ | 694 | XP_003460145 |
| AhR target genes | |||||
| Cytochrome P450 1A (CYP1A) | comp94347_c0_seq1|m.33721 | 421 | Cytochrome P450 1A [ | 521 | AFN02446 |
| Cytochrome P450, family 1, subfamily B, polypeptide 1 (CYP1B1) | comp83107_c0_seq4|m.12292 | 583 | PREDICTED: cytochrome P450 1B1-like [ | 537 | XP_008278388 |
| Cytochrome P450, family 1, subfamily D, polypeptide 1 | comp81594_c0_seq2|m.11394 | 537 | PREDICTED: cytochrome P450 1A1-like [ | 535 | XP_007564479 |
| Stromal cell-derived factor 2 (SDF2) | comp88906_c0_seq1|m.19082 | 245 | PREDICTED: stromal cell-derived factor 2-like [ | 226 | XP_006784901 |
| Heat-shock protein A5 (HSPA5) / Glucose-regulated protein, 78 kDa (GRP78) | comp99062_c0_seq1|m.65001 | 564 | Glucose-regulated protein 78 [ | 654 | ABG56392 |
| Aldehyde dehydrogenase 3 family, member A1 (ALDH3A1) | comp101664_c3_seq2|m.102761 | 532 | PREDICTED: aldehyde dehydrogenase, dimeric NADP-preferring-like [Stegastes partitus] | 509 | XP_008297392 |
| Nuclear factor erythroid 2-related factor 2 (Nrf2) | comp93427_c0_seq1|m.30054 | 617 | PREDICTED: nuclear factor erythroid 2-related factor 2 [ | 613 | XP_008304424 |
| NADPH:quinone oxidoreductase 1 (NQO1) | comp95672_c0_seq1|m.40009 | 272 | NAD(P)H dehydrogenase quinone 1 [ | 277 | ABD43173 |
| Fibroblast growth factor 13 (FGF13) | comp105488_c2_seq1|m.196118 | 247 | PREDICTED: fibroblast growth factor 13-like isoform X1 [ | 246 | XP_003445268 |