Literature DB >> 19181683

CORNA: testing gene lists for regulation by microRNAs.

X Wu1, M Watson.   

Abstract

MOTIVATION: With the increasing use of post-genomics techniques to examine a wide variety of biological systems in laboratories throughout the world, scientists are often presented with lists of genes that they must make sense of. A consistently challenging problem is that of defining co-regulated genes within those gene lists. In recent years, microRNAs have emerged as a mechanism for regulating several cellular processes. In this article, we report on how gene lists and microRNA targets data may be integrated to test for significant associations between gene lists and microRNAs.
RESULTS: We discuss CORNA, a package written in R and released under the GNU GPL, which allows users to test gene lists for significant microRNA-target associations using one of three separate statistical tests, to link microRNA targets to functional annotation and to visualize quantitative data associated with those data. AVAILABILITY: CORNA is available as an R package from http://corna.sf.net.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19181683      PMCID: PMC2654799          DOI: 10.1093/bioinformatics/btp059

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 INTRODUCTION

Experiments involving post-genomics technologies such as microarrays, proteomics and systems biology often present scientists with gene lists that they must attempt to make sense of. Several software packages exist that allow scientists to assign functional annotation to gene lists, and to assign statistical significance to those associations. These include tools for associating genes with biological ontologies (e.g. Falcon and Gentleman, 2007) and with biological pathways (e.g. Salomonis et al., 2007). A particular challenge is that of assessing which genes in a given gene list are co-regulated. miRBase (Griffiths-Jones et al., 2006), a database of all known microRNAs, has been created and there have been several published software tools that attempt to predict the targets of microRNAs (Brennecke et al., 2005). An excel-based tool (Creighton et al., 2008) has been produced for linking microarray data to microRNA targets information. Here we describe CORNA, a package for R that allows scientists to analyse gene lists in the context of microRNA–target predictions. Methods exist to test for significant microRNA–target relationships in gene lists, and to test for significant associations between microRNAs and pathways and GO terms. The software is flexible and can read data from public databases or from a scientists own data files. CORNA is released as open-source under the GNU GPL.

2 FLOW OF INFORMATION

Central to the flow of information through CORNA is the gene list from which the user may test for significant microRNA–target associations. The user may also start with a microRNA, find genes that are associated with that microRNA and then test that gene list for significant associations with KEGG pathways or GO terms. The user may also plot quantitative data associated with the targets of a particular microRNA.

2.1 Inputs

CORNA exclusively uses R vectors and data frames. CORNA includes functions for reading microRNA–target data directly from miRBase and microRNA.org (Betel et al., 2008). There are also helper functions to read gene and GO term data using biomaRt (Durinck et al., 2005); microarray data directly from GEO (Barrett et al., 2008); and pathway data directly from KEGG (Kanehisa et al., 2004).

2.2 Methods

CORNA employs three complementary statistical methods for enrichments analysis of relationships within lists of genes. These are the HyperGeometric test, Fisher's exact test and the χ2-test.

2.3 Outputs

If the user tests a gene list for significant microRNA associations, then the output is an R data frame with one row per microRNA, the observed and expected frequencies from sample and population, and the range of user-selected P-values. Where the user begins with a particular microRNA, the targets information is used to create a gene list and that gene list is tested for enrichment of pathways and GO terms. There is also a range of plotting functions for plotting quantitative data associated with microRNA targets.

3 EXAMPLE ANALYSIS

3.1 Using CORNA to test for enrichment of microRNA–target relationships in a gene list

The list in this example, tsam, consists of 1000 ensembl transcript ids; 940 of these were chosen at random, then 30 predicted targets for two microRNAs were added. The example assumes that the file ‘arch.v5.txt.mus_musculus.zip’ has been downloaded from miRBase targets. The only two microRNAs with a significant adjusted P-values are those used to bias the transcript list. The user may work with genes simply by converting the transcript list to microRNA–gene relationships using the BioMart2df.fun and corna.map.fun functions.

3.2 Using CORNA to test for KEGG pathways associated with a microRNA list

The microRNA used in this example is “mmu-mir-155”, and we use the predicted targets from miRBase to test for enrichment of KEGG pathways. We first convert the microRNA–transcript relationship to a microRNA–gene relationship using the BioMart2df.fun and corna.map.fun functions. We then find those genes predicted to be targets of mmu-mir-155. The next stage is to use the KEGG2df.fun function to obtain links between genes and pathways from KEGG for Mus musculus. Finally, we set the sample to be only those genes targeted by mmu-mir-155 that have a pathway link, and perform hypergeometric and Fisher's exact tests for the KEGG pathways involved. The top five pathways can be seen in Table 1.
Table 1.

Top five significant pathways from CORNA for mmu-mir-155

IDDescriptionExpectedObservationP-value
00190Oxidative phosphorylation5120.002
00400Phenylalanine etc. biosynthesis030.003
00500Starch and sucrose metabolism260.012
05020Parkinson's disease5100.016
04010MAPK signaling pathway9150.044
Top five significant pathways from CORNA for mmu-mir-155

4 SUMMARY

With increasing use of large-scale post-genomics techniques, scientists are often presented with lists of genes. MicroRNAs have emerged as an important regulator of gene function. In this article, we have shown that CORNA can be used to test for significant associations between genes, microRNAs, pathways and GO terms. CORNA can also be used to plot quantitative data associated with microRNA targets. CORNA is flexible and can read data from public databases or from a user's own files. CORNA has been tested on both Microsoft Windows and Red Hat Linux. CORNA is released under the GNU GPL and is available from http://corna.sf.net. Funding: BBSRC (core strategic grant of the Institute for Animal Health). Conflict of Interest: none declared.
  9 in total

1.  The KEGG resource for deciphering the genome.

Authors:  Minoru Kanehisa; Susumu Goto; Shuichi Kawashima; Yasushi Okuno; Masahiro Hattori
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis.

Authors:  Steffen Durinck; Yves Moreau; Arek Kasprzyk; Sean Davis; Bart De Moor; Alvis Brazma; Wolfgang Huber
Journal:  Bioinformatics       Date:  2005-08-15       Impact factor: 6.937

3.  Using GOstats to test gene lists for GO term association.

Authors:  S Falcon; R Gentleman
Journal:  Bioinformatics       Date:  2006-11-10       Impact factor: 6.937

4.  A bioinformatics tool for linking gene expression profiling results with public databases of microRNA target predictions.

Authors:  Chad J Creighton; Ankur K Nagaraja; Samir M Hanash; Martin M Matzuk; Preethi H Gunaratne
Journal:  RNA       Date:  2008-09-23       Impact factor: 4.942

5.  NCBI GEO: archive for high-throughput functional genomic data.

Authors:  Tanya Barrett; Dennis B Troup; Stephen E Wilhite; Pierre Ledoux; Dmitry Rudnev; Carlos Evangelista; Irene F Kim; Alexandra Soboleva; Maxim Tomashevsky; Kimberly A Marshall; Katherine H Phillippy; Patti M Sherman; Rolf N Muertter; Ron Edgar
Journal:  Nucleic Acids Res       Date:  2008-10-21       Impact factor: 16.971

6.  miRBase: microRNA sequences, targets and gene nomenclature.

Authors:  Sam Griffiths-Jones; Russell J Grocock; Stijn van Dongen; Alex Bateman; Anton J Enright
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

7.  Principles of microRNA-target recognition.

Authors:  Julius Brennecke; Alexander Stark; Robert B Russell; Stephen M Cohen
Journal:  PLoS Biol       Date:  2005-03       Impact factor: 8.029

8.  The microRNA.org resource: targets and expression.

Authors:  Doron Betel; Manda Wilson; Aaron Gabow; Debora S Marks; Chris Sander
Journal:  Nucleic Acids Res       Date:  2007-12-23       Impact factor: 16.971

9.  GenMAPP 2: new features and resources for pathway analysis.

Authors:  Nathan Salomonis; Kristina Hanspers; Alexander C Zambon; Karen Vranizan; Steven C Lawlor; Kam D Dahlquist; Scott W Doniger; Josh Stuart; Bruce R Conklin; Alexander R Pico
Journal:  BMC Bioinformatics       Date:  2007-06-24       Impact factor: 3.169

  9 in total
  38 in total

1.  Computational methods for the identification of microRNA targets.

Authors:  Yang Dai; Xiaofeng Zhou
Journal:  Open Access Bioinformatics       Date:  2010-05-01

Review 2.  MicroRNA expression profiling in archival tissue specimens: methods and data processing.

Authors:  Thomas Streichert; Benjamin Otto; Ulrich Lehmann
Journal:  Mol Biotechnol       Date:  2012-02       Impact factor: 2.695

Review 3.  A network-biology perspective of microRNA function and dysfunction in cancer.

Authors:  Cameron P Bracken; Hamish S Scott; Gregory J Goodall
Journal:  Nat Rev Genet       Date:  2016-10-31       Impact factor: 53.242

4.  A microRNA signature associated with chondrogenic lineage commitment.

Authors:  Behnaz Bakhshandeh; Masoud Soleimani; Seyed Hassan Paylakhi; Nasser Ghaemi
Journal:  J Genet       Date:  2012-08       Impact factor: 1.166

5.  miRPathDB 2.0: a novel release of the miRNA Pathway Dictionary Database.

Authors:  Tim Kehl; Fabian Kern; Christina Backes; Tobias Fehlmann; Daniel Stöckel; Eckart Meese; Hans-Peter Lenhof; Andreas Keller
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

6.  Genes located in a chromosomal inversion are correlated with territorial song in white-throated sparrows.

Authors:  W M Zinzow-Kramer; B M Horton; C D McKee; J M Michaud; G K Tharp; J W Thomas; E M Tuttle; S Yi; D L Maney
Journal:  Genes Brain Behav       Date:  2015-10-14       Impact factor: 3.449

7.  Digital gene expression analysis of the zebra finch genome.

Authors:  Robert Ekblom; Christopher N Balakrishnan; Terry Burke; Jon Slate
Journal:  BMC Genomics       Date:  2010-04-01       Impact factor: 3.969

8.  Admixture mapping in a hybrid zone reveals loci associated with avian feather coloration.

Authors:  Alan Brelsford; David P L Toews; Darren E Irwin
Journal:  Proc Biol Sci       Date:  2017-11-15       Impact factor: 5.349

9.  miEAA: microRNA enrichment analysis and annotation.

Authors:  Christina Backes; Qurratulain T Khaleeq; Eckart Meese; Andreas Keller
Journal:  Nucleic Acids Res       Date:  2016-04-29       Impact factor: 16.971

10.  GeneSet2miRNA: finding the signature of cooperative miRNA activities in the gene lists.

Authors:  Alexey V Antonov; Sabine Dietmann; Philip Wong; Dominik Lutter; Hans W Mewes
Journal:  Nucleic Acids Res       Date:  2009-05-06       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.