| Literature DB >> 24894665 |
Zhifu Sun1, Jared Evans, Aditya Bhagwate, Sumit Middha, Matthew Bockol, Huihuang Yan, Jean-Pierre Kocher.
Abstract
BACKGROUND: miRNAs play a key role in normal physiology and various diseases. miRNA profiling through next generation sequencing (miRNA-seq) has become the main platform for biological research and biomarker discovery. However, analyzing miRNA sequencing data is challenging as it needs significant amount of computational resources and bioinformatics expertise. Several web based analytical tools have been developed but they are limited to processing one or a pair of samples at time and are not suitable for a large scale study. Lack of flexibility and reliability of these web applications are also common issues.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24894665 PMCID: PMC4070549 DOI: 10.1186/1471-2164-15-423
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Workflow diagram of CAP-miRSeq.
Data summary for MCF7 miRNA-seq data
| SRA_ID | Treatment | miRNA source | Read number |
|---|---|---|---|
| SRR326279 | control | cytoplasm | 15,493,265 |
| SRR326280 | control | total | 14,670,735 |
| SRR326281 | Dicer knock-down | cytoplasm | 9,237,490 |
| SRR326282 | Dicer knock-down | total | 8,689,337 |
Figure 2Example outputs from CPAP-miRSeq (MCF7 dataset). A. Summary statistics of each miRNA-seq library processed. B. Raw mature miRNA expression table, weighted count for miRNAs coded by multiple pre-miRNAs. C. Normalized mature miRNA expression table by number of reads per million (RPM). D. Summarized mature miRNA expression table for those with multiple pre-cursors. These miRNAs are indistinguishable by sequence but have the same biological effect. This table is used for final differential expression analysis. E. Merged novel miRNA expression across samples. F. Integrative Genome Viewer (IGV) to visualize sequence level data for a single nucleotide variant. G. Mapped read distribution in different parts of the hairpin structure of miRNA precursor which provides a strong evidence of an authentic miRNA.
Figure 3Distribution of all detected RNAs by category for MCF7 dataset. A. RNA extracted from cytoplasm without the Dicer treatment. B. RNA extracted from total RNA (both cytoplasm and nucleus) without the Dicer treatment. C. RNA extracted from cytoplasm with the Dicer treatment. D. RNA extracted from total RNA (both cytoplasm and nucleus) with the Dicer treatment. miRNA expression is significantly repressed in the Dicer treated cell lines (C and D).
Figure 4Changed miRNAs in renal cancer carcinoma validated by CAP-miRSeq. A. Significantly up and down expressed miRNAs in renal cell carcinoma compared with paired normal kidney as previous reported and described in the text. B. Significantly down expressed miRNAs in chromosome Xq27.3 locus as reported.
Comparison of different tools in miRNA detection
| CAP-miRSeq | omiRAS | miRTools2 | Novoalign | |
|---|---|---|---|---|
| Version | v1.1 | 12/2013 | v2 | 2.07.13 |
| Aligner | Bowtie | Bowtie | SOAP | Novoalign+ |
| Ref Genome | hg19 | hg19 | hg19 | hg19 |
| Time | 5 hrs | 12 hrs | Variable | 6 hrs |
| Memory | 4 GB | - | - | 10 G |
| Mature miRNA (> = 2) | 860 | 739 | 172 | 881 |
| Novel miRNA | 194 | NA* | 35 | No prediction |
| Correlation with other tools | - | 0.99 | 0.81 | 0.95 |
*NA–omiRAS requires at least two samples to run with differential expression analysis; however, it does not report novel miRNAs for each individual sample. +Novoalign is just an alignment toot and it does not perform miRNA prediction and quantification. omiRAS and miRTool2 are web based tools and no memory usage information is available.
Figure 5Simulation result of sequence depth vs. miRNA capture. The original library has ~25.4 million single end reads. miRNAs are defined as expressed at an arbitrary cutoff of > =2 reads and clustered into five expression level groups based on the number of assigned reads. The original library is randomly sampled at 14 different depths from 0.5M to 24M. For each of the five groups from each sub-library, the number of miRNAs with > =2 reads are calculated and the ratio over the corresponding total from the original library is displayed. About 78% of expressed miRNAs can be detected at sequence depth of 12–18 million of reads.