| Literature DB >> 25379175 |
Regina Z Cer1, J Enrique Herrera-Galeano1, Joseph J Anderson2, Kimberly A Bishop-Lilly1, Vishwesh P Mokashi3.
Abstract
BACKGROUND: Understanding the biological roles of microRNAs (miRNAs) is a an active area of research that has produced a surge of publications in PubMed, particularly in cancer research. Along with this increasing interest, many open-source bioinformatics tools to identify existing and/or discover novel miRNAs in next-generation sequencing (NGS) reads become available. While miRNA identification and discovery tools are significantly improved, the development of miRNA differential expression analysis tools, especially in temporal studies, remains substantially challenging. Further, the installation of currently available software is non-trivial and steps of testing with example datasets, trying with one's own dataset, and interpreting the results require notable expertise and time. Subsequently, there is a strong need for a tool that allows scientists to normalize raw data, perform statistical analyses, and provide intuitive results without having to invest significant efforts.Entities:
Keywords: DE; Differential expression; Linear model; Normal quantile transformation; Quantile normalization; Time series; miRNA Temporal Analyzer; microRNA; mirnaTA
Year: 2014 PMID: 25379175 PMCID: PMC4212236 DOI: 10.1186/2047-217X-3-20
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1An overview of the mirnaTA workflow using a three time point example. mirnaTA converts the tab-delimited, matrix raw data into quantiles using Normal Quantile Transformation (NQT). Then, using a linear regression model, mirnaTA identifies miRNA species which either increase or decrease linearly. Any miRNA species with P < 0.05 are considered to be statistically significant (shown in the black box). miRNAs that did not fit into a linear model with statistical significance are further analyzed using either cumulative distribution function (CDF) or analysis of variance (ANOVA) depending on the number of time points.
Figure 2mirnaTA Outputs. (A) A Portable Network Graphic (PNG) image showing the raw data (before normalization) vs. NQT data (after normalization). (B) A heat map of differentially expressed miRNAs with statistical significance (P < 0.05) which were identified to be linearly increasing or decreasing using linear regression model. (C) One of the many intermediate data files (shown here is a text file of P-values, intercept and slope data of significant miRNAs). (D) A heat map of differentially expressed miRNAs with statistical significance (P < 0.05) which were identified to be increasing or decreasing using either cumulative distribution function (CDF) of the normal distribution or analysis of variance (ANOVA). Note that all these images and text files are available for viewing on web browsers by opening ‘mirnata.html’ file in ‘output_files’ directory.