| Literature DB >> 27504260 |
Shengming Sun1, Xianping Ge1, Jian Zhu1, Wuxiao Zhang2, Fujun Xuan3.
Abstract
De novo transcriptome sequencing is a robust method for microRNA (miRNA) target gene prediction, especially for organisms without reference genomes. Following exposure of Megalobrama amblycephala to ammonia (0.1 or 20 mg L(-1) ), two cDNA libraries were constructed from the fish gills and sequenced using Illumina HiSeq 2000. Over 90 million reads were generated and de novo assembled into 46, 615 unigenes, which were then extensively annotated by comparing to different protein databases, followed by biochemical pathway prediction. The expression of 2666 unigenes significantly differed; 1961 were up-regulated, while 975 were down-regulated. Among these, 250 unigenes were identified as the targets for 10 conserved and 4 putative novel miRNA families by miRNA target computational prediction. We examined expression of ssa-miRNA-21 and its target genes by real-time quantitative PCR and found agreement with the sequencing data. This study demonstrates the feasibility of identifying miRNA targets by transcriptome analysis. The transcriptome assembly data represent a substantial increase in the genomic resources available for Megalobrama amblycephala and will be useful for gene expression profile analysis and miRNA functional annotation.Entities:
Keywords: Ammonia; Megalobrama amblycephala; Transcriptome; mRNA; miRNA
Year: 2016 PMID: 27504260 PMCID: PMC4969262 DOI: 10.1016/j.rinim.2016.03.001
Source DB: PubMed Journal: Results Immunol ISSN: 2211-2839
Bluntnose black bream transcriptome sequencing results.
| Sample | Raw data | Valid data | Valid ratio (%) | Q20 percentage (%) | ||
|---|---|---|---|---|---|---|
| Read | Base | Read | Base | |||
| CG | 44,578,630 | 4,457,863,000 | 44,103,206 | 4,410,320,600 | 98.93 | 95.24 |
| TG | 46,295,912 | 4,629,591,200 | 45,331,332 | 4,533,133,200 | 98.88 | 95.42 |
CG: Gills of Blunt nose black bream without ammonia exposure; TG: gills of Blunt nose black bream under ammonia exposure 48 h.
cDNA library sequencing results.
| Parameter | Number |
|---|---|
| Number of unigene | 46,615 |
| Total bases of unigene (bp) | 43,774,977 |
| Number of unigene ≥500 bp | 24,715 |
| Mean length of unigenes (bp) | 939 |
| N50 | 1593 |
| Maximal length of unigenes (bp) | 16,157 |
Fig. 1Size distribution of the assembled unigenes in the sequenced cDNA library.
BLAST analysis of non-redundant unigenes against public databases.
| Database | Number of annotated unigenes | Percentage of annoted unigenes (%) |
|---|---|---|
| Nr | 25,180 | 50.02 |
| Pfam | 18,803 | 40.34 |
| Swiss-prot | 19,997 | 40.90 |
| KO | 14,385 | 30.86 |
| KOG | 18,789 | 40.31 |
| GO | 17,773 | 38.13 |
Nr: NCBI non-redundant protein sequences, Pfam: Protein family, Swiss-Prot: A manually annotated and reviewed protein sequence database, KO: KEGG Ortholog database, KOG: Clusters of Orthologous Groups of proteins and GO: Gene Ontology.
Fig. 2KOG annotations. of the identified M. amblycephala unigenes.
Fig. 3Level 2 GO term distributions for the biological process, cellular component and molecular function categories.
Top 10 list of the gene number of pathway.
| Pathway hierarchy 2 | Unigene number |
|---|---|
| Signal transduction | 1502 |
| Cell communication | 1185 |
| Immune system | 1140 |
| Signaling molecules and interaction | 802 |
| Endocrine system | 707 |
| Cancers | 704 |
| Carbohydrate metabolism | 542 |
| Amino acid metabolism | 485 |
| Nervous system | 450 |
| Lipid metabolism | 407 |
Fig. 4qRT-PCR validation of transcriptome sequencing results for the ammonia exposure group vs. the control group. One down-regulated miRNA (ssa-miR-21b) and three up-regulated genes (JNK1, CCR9 and FRIS) were identified.
Fig. 5MicroRNA-gene network analysis. Depiction of gene targets of the differentially expressed miRNA: miRNAs identified as being differentially expressed following M. amblycephala ammonia exposure were assembled in a network based on GO annotations. The ovals represent the miRNAs (red, down-regulated; green, up-regulated), while the rectangles represent the target genes.