Literature DB >> 26981358

In silico genome wide mining of conserved and novel miRNAs in the brain and pineal gland of Danio rerio using small RNA sequencing data.

Suyash Agarwal1, Naresh Sahebrao Nagpure1, Prachi Srivastava2, Basdeo Kushwaha1, Ravindra Kumar1, Manmohan Pandey1, Shreya Srivastava1.   

Abstract

MicroRNAs (miRNAs) are small, non-coding RNA molecules that bind to the mRNA of the target genes and regulate the expression of the gene at the post-transcriptional level. Zebrafish is an economically important freshwater fish species globally considered as a good predictive model for studying human diseases and development. The present study focused on uncovering known as well as novel miRNAs, target prediction of the novel miRNAs and the differential expression of the known miRNA using the small RNA sequencing data of the brain and pineal gland (dark and light treatments) obtained from NCBI SRA. A total of 165, 151 and 145 known zebrafish miRNAs were found in the brain, pineal gland (dark treatment) and pineal gland (light treatment), respectively. Chromosomes 4 and 5 of zebrafish reference assembly GRCz10 were found to contain maximum number of miR genes. The miR-181a and miR-182 were found to be highly expressed in terms of number of reads in the brain and pineal gland, respectively. Other ncRNAs, such as tRNA, rRNA and snoRNA, were curated against Rfam. Using GRCz10 as reference, the subsequent bioinformatic analyses identified 25, 19 and 9 novel miRNAs from the brain, pineal gland (dark treatment) and pineal gland (light treatment), respectively. Targets of the novel miRNAs were identified, based on sequence complementarity between miRNAs and mRNA, by searching for antisense hits in the 3'-UTR of reference RNA sequences of the zebrafish. The discovery of novel miRNAs and their targets in the zebrafish genome can be a valuable scientific resource for further functional studies not only in zebrafish but also in other economically important fishes.

Entities:  

Keywords:  MiRNA; NcRNAs; Novel miRs; Phylogenetic analysis; Zebrafish

Year:  2015        PMID: 26981358      PMCID: PMC4778606          DOI: 10.1016/j.gdata.2015.11.013

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Introduction

The discovery of the first miRNAs in Caenorhabditis elegans in 1993 paved the way for unearthing of thousands of the mature miRNAs in a wide variety of organisms, including animals, plants and even viruses [1], [2], [3]. In recent years, intensive studies have been carried out on identification of novel miRNAs and their targets, with the addition of newly discovered miRNAs to miRBase and subsequently resulting in its new version release. The current miRBase release 21 has miR information on 9 fish species. MiRNAs are a distinct class of endogenous RNA molecules, which do not code for any protein and are about 22 nucleotides in length [4]. MicroRNAs (miRNAs) are small, non-coding RNA molecules that function as master regulators of the genome. They bind to the mRNA of the target genes; thus, regulating the gene expression at the post-transcriptional level. Many miR-related discoveries have come from zebrafish investigations [5], [6], [7], [8], [9], [10]. After the release of Zv9 (zebrafish genome draft), the zebrafish genome project joined the Genome Reference Consortium (GRC) for further improvement and ongoing maintenance. The GRC has now released a new reference assembly, GRCz10. Correlating the miR functions from a model organism to that of human health largely depends on recognizing true orthologs of human miRs. Thus, for the benefit of the entire miR community, a better understanding of the miRNome is essential [11]. Each miRNA appears to regulate the expression of tens to hundreds of genes to efficiently coordinate multiple cellular pathways. Precursor-miRNAs are usually 60–80 nucleotides in length with a hairpin secondary structure while mature miRNAs are mostly 18–26 nucleotides [12]. Many miRNAs are conserved across vertebrates [13]. Mutation in miRNA genes or the improper miRNA and target gene interaction may become a cause of various genetic diseases. With the advent of high-throughput sequencing technologies, non-conserved or weakly expressed miRNAs, along with species-specific miRNAs can be identified from a wide range of organisms [14], [15], [16], [17], [18]. Recent studies have focused on bioinformatic analysis of the NGS data obtained from small RNA sequencing, where algorithms predict miRNA precursor molecules based on the presence of hairpins and other associated parameters and how they are processed into mature miRNAs. This sort of analysis can lead to the discovery of both novel and evolutionary conserved miRNAs. Kloosterman et al. [19] reported 66 new miRNAs and 11 star sequences corresponding to 116 potential miRNA hairpins in the zebrafish genome by deep sequencing of two small RNA cDNA libraries. Bizuayehu et al. [20] worked on the Atlantic Halibut and the results indicate a wide conservation of miRNA precursors and involvement of miRNA in multiple regulatory pathways. Despite enormous research on zebrafish, the annotation of miR-producing genes remains limited. Zebrafish is a fish species of freshwater ecosystem and considered popular organism for studying the gene functions of the vertebrate, especially human development and genetic diseases. It is a favored model organism due to their specific features such as virtually transparent embryos, small size, ability to keep them together in large numbers, ease of breeding and easy to maintain/manipulate/observe in the lab experiments. The critical role of miRNAs in gene expression is highly evident from the recent studies in zebrafish. The miRNAs play key roles in zebrafish organ formation and their expression at different time points. In the present work, we used Illumina HiSeq2000 small RNA sequencing data from the brain and pineal gland (dark and light treatments) of zebrafish from NCBI SRA database. The total number of reads in the data obtained from NCBI SRA was ~ 6 million, ~ 10.4 million and ~ 14.8 million for the pineal light, pineal dark and brain, respectively. An integrative bioinformatic strategy was applied to detect and analyze the whole miRNA transcriptome of zebrafish. The present study led to the discovery of novel miRNAs in the brain and pineal gland of zebrafish, which will contribute for a better understanding of the role miRNAs play in regulating diverse biological processes.

Materials and methods

Raw data retrieval and pre-processing

The small RNA sequencing data from three mature miRNA libraries (pineal light, pineal dark and brain) of zebrafish were downloaded from NCBI SRA (SRX363296, SRX363297 and SRX363298) and were subsequently used for analysis in the present study. The downloaded data was in SRA format, which was subsequently converted to fastq format using sratoolkit (version 2.3.4-2) [30], fastq-dump option. The obtained data was generated on HiSeq2000 using standard Illumina sequencing workflow with the multiplexing option. A custom Perl script was written to remove low quality bases, adaptor sequences, count the number of occurrences of each read, and eliminate reads outside the targeted size range (≥ 16 and ≤ 30).

Identification of conserved miRNAs and other ncRNAs in zebrafish and other fishes

The filtered reads were further aligned onto the latest released version of zebrafish genome GRCz10, using bowtie [31] with two mismatches and zero gaps. Only the aligned reads were used for the downstream analysis. MiRBase [32], [33] release 21 was used for annotation of known miRNA. A custom based Perl script was written in order to extract only the fish miRNA from the miRBase, along with the preparation of unique fish miRNA database, and its annotation files. Aligned reads were annotated against the unique fish miRNA database, RefSeq database, as well as noncoding RNA sequences of zebrafish from Rfam (version 11) [34]. A custom Perl script was written in order to extract the best read hit, which depicts the fish miRNA, and to segregate the miRNA hits into known zebrafish miRNA and other fish miRNA.

Differential expression of known miRNAs

The individual read counts of the 3 data sets were fed into a custom based Perl script to prepare the final read count table, which was taken as input for DeSeq [35]. The results were further segregated into up-regulated, down-regulated and neutral miRs based on the log2 fold change value. For the log2 fold change greater than 1, less than − 1 and between 1 and − 1, the miRs were designated as up, down and neutral miRs. A heatmap of few highly regulated miRNAs was drawn using R script.

Identification of novel miRNA candidates

The reads with no hits in the unique fish miRNA database and Rfam were further used for the prediction of putative new miRs using Mireap (version 0.2). All novel pre-miRNAs were identified based on the presence of a classic hairpin structure [36]. These filtered small RNA reads were aligned with the zebrafish genome using bowtie with strict parameters (number of mismatch; — v = 0). Mireap was used for the detection of novel miRs based on alignment, secondary structure, free energy and location on the precursor arm. The parameters used for Mireap prediction included: i) minimal miRNA length = 18; ii) maximal miRNA length = 26; iii) minimal miRNA reference length = 20; iv) maximal miRNA reference length = 24; v) uniqueness of miRNA = 20; vi) maximal energy allowed for a miRNA precursor = − 18 kcal/mol; vii) minimal and maximal space between the miRNA and miRNA* = 5 and 35 respectively; viii) minimal mature pair = 14; ix) maximal mature bulge = 4; x) maximal duplex asymmetry = 5; and xi) flank sequence length = 100 [37]. It is evident from the previous studies that more than 90% of miRNA precursors have MFEIs greater than 0.85 (tRNAs ~ 0.64, rRNAs ~ 0.59, and mRNAs ~ 0.65) [38]. Therefore, minimal folding free energy index (MFEI) is a new criterion for assaying miRNAs and distinguishing miRNAs from other non-coding and coding RNAs. MFEI is equal to MFE / (precursor length) × 100 / (G + C)%. In the present study, reads which showed MFEI value to be greater than 0.85, were considered as novel miRNAs candidates.

Target prediction of novel miRs

Target prediction of the novel miRNA was done against the 3′-UTR sequences of zebrafish (www.ensembl.org/biomart) using miRanda (version 3.3a) [39]. The miRanda results were parsed based on score > 150 and energy <− 20. The gene ontology (GO) terms of the predicted targets were taken from UniProt database and were further classified into most abundant GO terms for biological process, cellular component and molecular function.

Results and discussion

NCBI SRA was the main source of NGS data, from where the small RNA sequencing data of three mature miRNA libraries (pineal light, pineal dark, and brain) of zebrafish (SRX363296, SRX363297 and SRX363298) was downloaded and was subsequently used in the present study. The total numbers of reads were found to be ~ 6 million, ~ 10.4 million and ~ 14.8 million for the pineal light, pineal dark and brain, respectively. A custom Perl script was written to remove low quality sequences, adapter sequences and to count the number of occurrences of each read, with the elimination of reads outside the targeted size range (≥ 16 and ≤ 30). A read length distribution graph (Fig. 1) was prepared to get insight into the number of reads at each read length. The maximum numbers of distinct sequences were found to be of 22 bp and most of the data was falling between 21 and 23 bp in all the 3 data sets. The numbers of distinct sequences were calculated both before and after length filtering. Finally, the total numbers of distinct sequences after length filtering (Table 1), which were considered for downstream analyses were 200,520, 204,696 and 351,638 for the brain, pineal light and pineal dark, respectively.
Fig. 1

Read length distribution. The figure represents the read length distribution of the 3 data sets after length filtering.

Table 1

Statistics of the 3 data sets before and after length filtering.

ParameterBrainPineal gland (light treatment)Pineal gland (dark treatment)
Total number of sequences14,781,5695,933,63810,385,264
Total number of distinct sequences259,897265,007463,789
Total number of sequences after length filtering14,347,8695,724,36110,008,434
Total number of distinct sequences after length filtering200,520204,696351,638
The filtered reads were further aligned onto the latest released version of zebrafish genome GRCz10, using bowtie with two mismatches and zero gaps. A total of 88.58%, 60.00% and 61.14% reads aligned to the zebrafish genome GRCz10 for the brain, pineal gland (dark) and pineal gland (light), respectively. Only the aligned reads were used for the downstream analysis. miRBase release 21 was used to determine the known and conserved miRNAs in fishes. MiRNA data of 9 fishes was extracted from miRBase, comprising of 1637 miRNA sequences. Unique fish miRNA database was prepared by eliminating the redundant sequences, which comprised of 1029 sequences. The aligned reads from all the 3 data sets were blast against the unique fish miRNA database. A total of 165, 151 and 145 known zebrafish miRNAs were found in the brain, pineal gland (dark treatment) and pineal gland (light treatment) along with their expression values and were further annotated for their presence in other fishes (Table S1). A total of 221, 196 and 195 distinct reads showed hits with other fish miRNAs in the brain, pineal gland (dark treatment) and pineal gland (light treatment), respectively (Table S1). A comparative study of all the detected fish miRNAs showed 321 miRNAs to be common along all the 3 data sets, with 40, 6 and 6 miR genes to be uniquely expressed in the brain, pineal gland (dark treatment) and pineal gland (light treatment), respectively (Fig. 2). The maximum read count was observed for miR-181a and miR-182 in the brain and pineal gland, respectively. The miR-181a is mainly involved in proliferation, and is an active miRNA regulating regeneration process. Rudnicki et al. [21] and Frucht et al. [22] also reported that miR-181a can encourage proliferation in both quiescent and proliferating chick basilar papilla. It is also reported that over-expression of miR-182 results in production of ectopic hair cells [21]. The miR-182 forms a part of miR-183/96/182 cluster, whose expression is considerably enriched in the pineal gland and up-regulated by light [23], [24]. Because of the role of these miRNA in promoting regeneration and mediating the targets and transcriptional factors involved in regeneration, these miRNAs may prove to be promising in therapeutics.
Fig. 2

Venn diagram of conserved miRNAs in the brain, pineal gland (dark treatment) and pineal gland (light treatment).

Chromosome wise coverage of the reads aligning to zebrafish genome GRCz10 showed that the maximum numbers of miR genes were coded by chromosomes 4 and 5 (Fig. 3). Thatcher et al. [25] also confirmed the presence of miR-430 family in two large clusters of 10 and 57 genes on chromosome 4. The reads which did not show any hits in miRBase were analyzed by BLAST against the Rfam database (ftp.sanger.-ac.uk/pub/databases/Rfam) and Refseq proteins to annotate rRNA, tRNA, snRNA, mRNA and other ncRNA sequences. The maximum numbers of reads were annotated against eukaryotic small subunit ribosomal RNA (SSU_rRNA_eukarya) and tRNA, followed by other ncRNAs (Fig. 4).
Fig. 3

Distribution of unique miRNA reads across the zebrafish genome.

Fig. 4

Unique reads representing other ncRNAs in the data.

The differential expression of the miRNA was computed as 2 different sets: 1) brain v/s pineal dark, and 2) brain v/s pineal light. For the brain v/s pineal dark, a total of 93, 99 and 145 miRs were found to be up, down and neutrally regulated, with 49 and 10 miRNAs only expressed in the brain and pineal dark, respectively (Table S2). For the brain v/s pineal light, a total of 105, 101 and 124 miRs were found to be up, down and neutrally regulated, with 56 and 10 miRNAs only expressed in the brain and pineal light, respectively (Table S3). Few up-regulated, down-regulated and neutral miRs from the brain v/s pineal dark are depicted as a heatmap (Fig. 5). The results from both the expression analyses showed that the miR-183/96/182 cluster is highly up-regulated, along with miR-726. The expression of miR-183/96/182 cluster is considerably enriched in the pineal gland and up-regulated by light [23], [24]. These findings are consistent with the light-regulation of this cluster in the mouse retina [26]. This cluster also plays a major role in the regulation of circadian rhythm via its targeting of adcy6, a clock-controlled gene that modulates melatonin synthesis [27]. Dre-miR-726 is found to be expressed in the retina of larval and adult zebrafish. From the previous studies it is evident that many miRNAs are transcribed along with their regulating genes, the proximity of miR-726 to SWS2 and LWS opsins suggests that dre-miR-726 could play a vital role in opsin regulation [28].
Fig. 5

Heatmap of few up-regulated, down-regulated and neutral miRs from the brain and pineal gland.

Mireap was used for the prediction of novel miRs, using reads that did not align to the unique fish miRNA database and Rfam. The reads were aligned onto the zebrafish genome using bowtie (with − v 0). A total of 84.40%, 83.68% and 87.72% of reads aligned for the brain, pineal dark and pineal light, respectively. For a small RNA to be considered as a potential miRNA candidate, it should meet the following strict criteria: 1) the miRNA precursor sequence should fold into an appropriate stemloop hairpin secondary structure, 2) the mature miRNA sits in one arm of the hairpin structure, 3) a maximum of 6 mismatches between the predicted mature miRNA sequence and its opposite miRNA* sequence in the secondary structure, 4) there should be no loop or break in the miRNA or miRNA* sequences, and 5) minimal folding free energy index and negative minimal folding free energy of the predicted secondary structure should be higher. The stem-loop hairpin structures with free energy lower than − 18 kcal/mol as per RNAfold were retained. Mireap predicted a total of 50, 34, and 16 novel miRNAs from the brain, pineal gland (dark treatment) and pineal gland (light treatment), respectively. All of these novel miRNAs were named temporarily in the form of Dre-mir-novnumber, e.g. Dre-mir-nov1. To increase the authenticity of the predicted novel miRNA, Zhang et al. [29] combined several parameters to form a new criterion called minimal folding free energy index (MFEI). The MFEI value from all the 3 data sets ranged from − 0.42 to − 1.56. A total of 25, 19 and 9 novel miRNA precursors, with a MFEI greater than 0.85, were identified from the brain, pineal gland (dark treatment) and pineal gland (light treatment), respectively (Table 2). This indicates that the RNA sequences with MFEI greater than 0.85 are most likely to be miRNAs. Most of the novel miRs were also found to originate from chr 4 and chr 5 of zebrafish. The novel miRs ranged from 21 nt to 24 nt in length, with the precursors ranging from 75 nt to 99 nt in length. The free energy of the precursors ranged from − 28.5 to − 71.5 kcal/mol. The secondary structure of a novel hairpin with the highest read count, i.e. Dre-mir-nov74, is depicted in Fig. 6. Biological experiments can be undertaken to validate the authenticity of the reported novel miRs.
Table 2

Zebrafish novel miRNA and their characteristics.

Tissue typeChromosomeNovel miRNASequenceMiRNA length (nt)Read countPrecursor positionStrandPrecursor length% GCMFEI (− kcal/mol)
BrainChr 1Dre-mir-nov49AGAGGCUGUCCGAGUGCUGAU2115610303253–10303335+8349.4− 1.56
Chr 20Dre-mir-nov8GACCUGUAACCAUUGACUUCCU22102962031–962117+8736.78− 1.4
Chr 3Dre-mir-nov46AAAAGCGUACCAAACUGAACCGU2312246496327–46496412+8648.84− 1.3
Chr 17Dre-mir-nov19AGACUAGUAGCCAUUGAGAUCUU2326420769194–207692717833.33− 1.27
Chr 18Dre-mir-nov16UGUUUUUUUAGGUUUUGAUUUUU23114245964887–45964967+8133.33− 1.26
Chr 4Dre-mir-nov38GAAUAACUCAAACCGGAGGACU2210631535442–31535523+8247.56− 1.26
Chr 4Dre-mir-nov42AAUAACUCAAACCAGAGGACU2112654513045–54513126+8243.9− 1.19
Chr 18Dre-mir-nov15UUUCCAGAAAGGUCUGUAUGUGU2313026647915–26647993+7946.84− 1.16
GRCZ10_NA262Dre-mir-nov2AGACUCUCCAGUACACUGGCCCU23862744–8258258.54− 1.16
Chr 10Dre-mir-nov25GAAAACCUGUAACCAUUGACUUCU2422129746032–29746122+9135.16− 1.14
Chr 20Dre-mir-nov10AAACUUGUAAUCACUGACUUCCU2312535014623–35014703+8133.33− 1.12
Chr 2Dre-mir-nov48AAUGACUCAAACCCAAGGACUCG2311828746947–8747032+8640.7− 1.1
Chr 6Dre-mir-nov29AAACUCUGCAGGACACCAGCUGU2326818113026–18113108+8346.99− 1.05
Chr 17Dre-mir-nov18AAACUCUGCAGGACACCAGCUG2274847039884–47039964+8146.91− 1.02
Chr 4Dre-mir-nov39UAAACUCUGCAGGACACCAGCUG23102042571226–42571306+8146.91− 1.02
Chr 6Dre-mir-nov30ACUGUACAGACUACUGCCUUGC2236141457248–41457346+9945.45− 0.98
Chr 18Dre-mir-nov17ACUCUUCACUCGUCUGUGUUCA2211650849750–508498358655.81− 0.96
Chr 23Dre-mir-nov3UCAAAAGGCGUACCAAACUGUAC2315518099730–180998077843.59− 0.95
Chr 5Dre-mir-nov37UCCAUCAGUCACGUGACCUACCA23710635345784–353458759251.09− 0.94
Chr 5Dre-mir-nov33GGCCCGUCCGGUGCGCUCGGAUCC24767824650–8247409184.62− 0.93
Chr 5Dre-mir-nov31UAAACUCUGCAGGACACCAGCUGU2415810220904–10221001+9846.94− 0.93
Chr 5Dre-mir-nov32GAUGACUCAAACCCAAGGACUCA2311515221789–15221886+9839.8− 0.89
Chr 8Dre-mir-nov28AAUGACUCAAACCCAAGGACUCGU241082766509–2766596+8844.32− 0.88
Chr 4Dre-mir-nov44AGUGAGGUCCUCGGAUCGGCCC22183576320181–76320279+9968.69− 0.87
Chr 22Dre-mir-nov4AACGACUCAAGAACCAGAAGACU2331518571986–18572067+8246.34− 0.86
Pineal gland (dark treatment)Chr 1Dre-mir-nov83AUCAGCACUCGGACAGCCUCUU2217310303251–10303335+8550.59− 1.53
Chr 17Dre-mir-nov59AGACUAGUAGCCAUUGAGAUCUU2310120769194–207692717833.33− 1.27
Chr 18Dre-mir-nov58UGUUUUUUUAGGUUUUGAUUUUU2338145964887–45964967+8133.33− 1.26
Chr 9Dre-mir-nov66UACGGGCUGAAUUUAGACAAAUU231775548615–5548705+9127.47− 1.25
Chr 24Dre-mir-nov52UAACGUUUCGAGCCCACUGACUG231181825424–18254987546.67− 1.19
GRCZ10_NA262Dre-mir-nov51AGACUCUCCAGUACACUGGCCCUC24997743–8268458.33− 1.14
Chr 12Dre-mir-nov62UUUCCGGAAAGGUCUGUAUGUGC23165444383412–44383489+7847.44− 1.14
Chr 4Dre-mir-nov77GAAUAACUCAAACCGGAGGACU2210349472235–49472316+8246.34− 1.11
Chr 6Dre-mir-nov72UACGGAUAGAAUCAGCGGAGCGA2322156909252–56909333+8260.98− 1.11
Chr 8Dre-mir-nov70AACCUGUAACCAUUGACUUCCU2214010454696–10454780+8538.82− 1.04
Chr 1Dre-mir-nov84UAAAGCGUACCAAACUGAACCGU2315041154258–41154343+8646.51− 1.00
Chr 11Dre-mir-nov63GCUGAAGUCAUUAUUAUUAGGGC231165486604–5486685+8235.37− 0.98
Chr 6Dre-mir-nov71ACUGUACAGACUACUGCCUUGC2222041457248–41457346+9945.45− 0.98
Chr 4Dre-mir-nov76GCAGAGAGAAAUGUCUAUGGCUU2317125403781–5403855+7548.00− 0.97
Chr 5Dre-mir-nov74UCCAUCAGUCACGUGACCUACCA2315,88035345784–353458759251.09− 0.94
Chr 20Dre-mir-nov54CUCAUCCCUCUGCUCUAUCCCCU2311537928243–37928325+8350.60− 0.93
Chr 8Dre-mir-nov69AAUGACUCAAACCCAAGGACUCG232242766510–2766595+8644.19− 0.90
Chr 12Dre-mir-nov61AACGACUCAAGAGCCAGAAGACU233233216161–3216240+8045.00− 0.88
Chr 20Dre-mir-nov55CAGGGGGUCGGGAAGCACUGCCU23487648676465–486765538958.43− 0.85
Pineal gland (light treatment)Chr 1Dre-mir-nov100AUCAGCACUCGGACAGCCUCUU2210010303251–10303335+8550.59− 1.53
Chr 9Dre-mir-nov91UACGGGCUGAAUUUAGACAAAUU231015548615–5548705+9127.47− 1.25
GRCZ10_NA262Dre-mir-nov85AGACUCUCCAGUACACUGGCCCUC24491743–8268458.33− 1.14
Chr 12Dre-mir-nov89UUUCCGGAAAGGUCUGUAUGUGC23107544383412–44383489+7847.44− 1.14
Chr 6Dre-mir-nov94UACGGAUAGAAUCAGCGGAGCGA2313056909252–56909333+8260.98− 1.11
Chr 4Dre-mir-nov97GCAGAGAGAAAUGUCUAUGGCUU239035403781–5403855+7548.00− 0.97
Chr 5Dre-mir-nov95UCCAUCAGUCACGUGACCUACCA23825635345784–353458759251.09− 0.94
Chr 4Dre-mir-nov99AGUGAGGUCCUCGGAUCGGCCCC2314476320181–76320279+9968.69− 0.87
Chr 20Dre-mir-nov88CAGGGGGUCGGGAAGCACUGCCU23210448676465–486765538958.43− 0.85
Fig. 6

Hairpin structures of Dre-mir-nov74 predicted using RNAfold software.

MicroRNAs bind to the mRNA of the target genes, thus regulating the gene expression at the post-transcriptional level. To gain insight into the function of the newly identified miRNAs, the putative target genes of these miRNAs were predicted using miRanda. The 3′-UTR regions of zebrafish mRNAs were downloaded from ensemble biomart and were checked for their complementarity against the novel miRs. Each miRNA was found to target more than one mRNA. The 25, 19 and 9 novel miRs in the 3 data sets were found to regulate 3737, 2527 and 1719 transcripts (Table S4), with the number of targets ranging from 12 to 873 for each miRNA. In order to infer the functional annotation of the novel miRs, GO analysis was done for the predicted Zebrafish targets, which indicated their involvement in the regulation of diverse physiological processes (Fig. 7). The novel miRNAs were found to play a major role in transcription regulation, signal transduction, organism development, RNA polymerase II promoter regulation, protein transport and homophilic cell adhesion. The pathway analysis also revealed the involvement of these novel miRs in amine and polyamine biosynthesis, carbohydrate degradation, iron-sulfur cluster biosynthesis, urea cycle, AMP and XMP biosynthesis via de novo pathway, CTP biosynthesis via salvage pathway and tRNA modification. Our study provided further insight into the novel miRNA-mediated regulation of target genes.
Fig. 7

Gene ontology of the target genes of novel miRNAs. The graph represents the maximum represented GO in terms of biological process, molecular function and cellular component.

Conclusion

In this study, a total of 25, 19 and 9 novel miRNAs were identified from the brain, pineal gland (dark treatment) and pineal gland (light treatment), respectively, using deep sequencing data and in silico bioinformatic analysis. Most of the conserved and novel miRs were found to originate from chr 4 and chr 5 of zebrafish. To gain insight into the function of the newly identified miRNAs, the putative target genes of these miRNAs were predicted using miRanda. Gene ontology analysis was done for the predicted zebrafish targets to infer the functional annotation of the novel miRs, which indicated their involvement in the regulation of diverse physiological processes. The novel miRNAs were found to play a major role in transcription regulation, signal transduction, organism development, RNA polymerase II promoter regulation, protein transport and homophilic cell adhesion. The pathway analysis also revealed the involvement of these novel miRs in amine and polyamine biosyntheses, carbohydrate degradation, ironsulfur cluster biosynthesis, urea cycle, AMP and XMP biosyntheses via de novo pathway, CTP biosynthesis via salvage pathway and tRNA modification. A total of 165, 151 and 145 known zebrafish miRNAs were found in the brain, pineal gland (dark treatment) and pineal gland (light treatment) along with their expression values and were further annotated for their presence in other fishes. MiR-181a and miR-182 were the highly expressed miRNAs in the brain and pineal gland of zebrafish. Because of the role of these miRNAs in promoting regeneration and mediating the targets and transcriptional factors involved in regeneration, these miRNAs may prove to be promising in therapeutics. The expression analysis of the known miR genes showed that miR-183/96/182 cluster is highly up-regulated, along with miR-726. The expression of miR-183/96/182 cluster is considerably enriched in the pineal gland and up-regulated by light. The authenticity of the reported novel miRs may further be validated by different biological experiments.

Competing interests

The authors have declared that no competing interest exists.
  38 in total

1.  Rfam: an RNA family database.

Authors:  Sam Griffiths-Jones; Alex Bateman; Mhairi Marshall; Ajay Khanna; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

2.  Identification of homologous microRNAs in 56 animal genomes.

Authors:  Sung-Chou Li; Wen-Ching Chan; Ling-Yueh Hu; Chun-Hung Lai; Chun-Nan Hsu; Wen-chang Lin
Journal:  Genomics       Date:  2010-03-27       Impact factor: 5.736

3.  Observation of miRNA gene expression in zebrafish embryos by in situ hybridization to microRNA primary transcripts.

Authors:  Xinjun He; Yi-Lin Yan; April DeLaurier; John H Postlethwait
Journal:  Zebrafish       Date:  2011-02-02       Impact factor: 1.985

4.  MicroRNA (miRNA) transcriptome of mouse retina and identification of a sensory organ-specific miRNA cluster.

Authors:  Shunbin Xu; P Dane Witmer; Stephen Lumayag; Beatrix Kovacs; David Valle
Journal:  J Biol Chem       Date:  2007-06-27       Impact factor: 5.157

5.  MicroRNA discovery and profiling in human embryonic stem cells by deep sequencing of small RNA libraries.

Authors:  Merav Bar; Stacia K Wyman; Brian R Fritz; Junlin Qi; Kavita S Garg; Rachael K Parkin; Evan M Kroh; Ausra Bendoraite; Patrick S Mitchell; Angelique M Nelson; Walter L Ruzzo; Carol Ware; Jerald P Radich; Robert Gentleman; Hannele Ruohola-Baker; Muneesh Tewari
Journal:  Stem Cells       Date:  2008-06-26       Impact factor: 6.277

6.  Gene expression analysis of forskolin treated basilar papillae identifies microRNA181a as a mediator of proliferation.

Authors:  Corey S Frucht; Mohamed Uduman; Jamie L Duke; Steven H Kleinstein; Joseph Santos-Sacchi; Dhasakumar S Navaratnam
Journal:  PLoS One       Date:  2010-07-09       Impact factor: 3.240

7.  Characterization of Epstein-Barr virus miRNAome in nasopharyngeal carcinoma by deep sequencing.

Authors:  Shu-Jen Chen; Gian-Hung Chen; Yi-Hsuan Chen; Cheng-Yuan Liu; Kai-Ping Chang; Yu-Sun Chang; Hua-Chien Chen
Journal:  PLoS One       Date:  2010-09-20       Impact factor: 3.240

8.  Differential expression analysis for sequence count data.

Authors:  Simon Anders; Wolfgang Huber
Journal:  Genome Biol       Date:  2010-10-27       Impact factor: 13.583

9.  The light-induced transcriptome of the zebrafish pineal gland reveals complex regulation of the circadian clockwork by light.

Authors:  Zohar Ben-Moshe; Shahar Alon; Philipp Mracek; Lior Faigenbloom; Adi Tovin; Gad D Vatine; Eli Eisenberg; Nicholas S Foulkes; Yoav Gothilf
Journal:  Nucleic Acids Res       Date:  2014-01-13       Impact factor: 16.971

10.  miRBase: annotating high confidence microRNAs using deep sequencing data.

Authors:  Ana Kozomara; Sam Griffiths-Jones
Journal:  Nucleic Acids Res       Date:  2013-11-25       Impact factor: 16.971

View more
  3 in total

1.  Expression characteristics of pineal miRNAs at ovine different reproductive stages and the identification of miRNAs targeting the AANAT gene.

Authors:  Ran Di; Qiu-Yue Liu; Shu-Hui Song; Dong-Mei Tian; Jian-Ning He; Ying Ge; Xiang-Yu Wang; Wen-Ping Hu; Joram-Mwashigadi Mwacharo; Zhang-Yuan Pan; Jian-Dong Wang; Qing Ma; Gui-Ling Cao; Hui-Hui Jin; Xiao-Jun Liang; Ming-Xing Chu
Journal:  BMC Genomics       Date:  2021-03-25       Impact factor: 3.969

2.  High-throughput sequencing analysis of differentially expressed miRNAs and target genes in ischemia/reperfusion injury and apelin-13 neuroprotection.

Authors:  Chun-Mei Wang; Xue-Lu Yang; Ming-Hui Liu; Bao-Hua Cheng; Jing Chen; Bo Bai
Journal:  Neural Regen Res       Date:  2018-02       Impact factor: 5.135

3.  Identification of MicroRNAs and Their Target Genes Associated with Ovarian Development in Black Tiger Shrimp (Penaeus monodon) Using High-Throughput Sequencing.

Authors:  Chao Zhao; Sigang Fan; Lihua Qiu
Journal:  Sci Rep       Date:  2018-08-02       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.