• We are sorry, but NCBI web applications do not support your browser and may not function properly. More information
Logo of bioinfoLink to Publisher's site
Bioinformatics. Mar 15, 2009; 25(6): 832–833.
Published online Jan 29, 2009. doi:  10.1093/bioinformatics/btp059
PMCID: PMC2654799

CORNA: testing gene lists for regulation by microRNAs

Abstract

Motivation: With the increasing use of post-genomics techniques to examine a wide variety of biological systems in laboratories throughout the world, scientists are often presented with lists of genes that they must make sense of. A consistently challenging problem is that of defining co-regulated genes within those gene lists. In recent years, microRNAs have emerged as a mechanism for regulating several cellular processes. In this article, we report on how gene lists and microRNA targets data may be integrated to test for significant associations between gene lists and microRNAs.

Results: We discuss CORNA, a package written in R and released under the GNU GPL, which allows users to test gene lists for significant microRNA–target associations using one of three separate statistical tests, to link microRNA targets to functional annotation and to visualize quantitative data associated with those data.

Availability: CORNA is available as an R package from http://corna.sf.net

Contact: ku.ca.crsbb@uw.nukix

1 INTRODUCTION

Experiments involving post-genomics technologies such as microarrays, proteomics and systems biology often present scientists with gene lists that they must attempt to make sense of. Several software packages exist that allow scientists to assign functional annotation to gene lists, and to assign statistical significance to those associations. These include tools for associating genes with biological ontologies (e.g. Falcon and Gentleman, 2007) and with biological pathways (e.g. Salomonis et al., 2007).

A particular challenge is that of assessing which genes in a given gene list are co-regulated. miRBase (Griffiths-Jones et al., 2006), a database of all known microRNAs, has been created and there have been several published software tools that attempt to predict the targets of microRNAs (Brennecke et al., 2005). An excel-based tool (Creighton et al., 2008) has been produced for linking microarray data to microRNA targets information.

Here we describe CORNA, a package for R that allows scientists to analyse gene lists in the context of microRNA–target predictions. Methods exist to test for significant microRNA–target relationships in gene lists, and to test for significant associations between microRNAs and pathways and GO terms. The software is flexible and can read data from public databases or from a scientists own data files. CORNA is released as open-source under the GNU GPL.

2 FLOW OF INFORMATION

Central to the flow of information through CORNA is the gene list from which the user may test for significant microRNA–target associations. The user may also start with a microRNA, find genes that are associated with that microRNA and then test that gene list for significant associations with KEGG pathways or GO terms. The user may also plot quantitative data associated with the targets of a particular microRNA.

2.1 Inputs

CORNA exclusively uses R vectors and data frames. CORNA includes functions for reading microRNA–target data directly from miRBase and microRNA.org (Betel et al., 2008). There are also helper functions to read gene and GO term data using biomaRt (Durinck et al., 2005); microarray data directly from GEO (Barrett et al., 2008); and pathway data directly from KEGG (Kanehisa et al., 2004).

2.2 Methods

CORNA employs three complementary statistical methods for enrichments analysis of relationships within lists of genes. These are the HyperGeometric test, Fisher's exact test and the χ2-test.

2.3 Outputs

If the user tests a gene list for significant microRNA associations, then the output is an R data frame with one row per microRNA, the observed and expected frequencies from sample and population, and the range of user-selected P-values.

Where the user begins with a particular microRNA, the targets information is used to create a gene list and that gene list is tested for enrichment of pathways and GO terms.

There is also a range of plotting functions for plotting quantitative data associated with microRNA targets.

3 EXAMPLE ANALYSIS

3.1 Using CORNA to test for enrichment of microRNA–target relationships in a gene list

The list in this example, tsam, consists of 1000 ensembl transcript ids; 940 of these were chosen at random, then 30 predicted targets for two microRNAs were added. The example assumes that the file ‘arch.v5.txt.mus_musculus.zip’ has been downloaded from miRBase targets.

An external file that holds a picture, illustration, etc.
Object name is btp059i1.jpg

The only two microRNAs with a significant adjusted P-values are those used to bias the transcript list. The user may work with genes simply by converting the transcript list to microRNA–gene relationships using the BioMart2df.fun and corna.map.fun functions.

3.2 Using CORNA to test for KEGG pathways associated with a microRNA list

The microRNA used in this example is “mmu-mir-155”, and we use the predicted targets from miRBase to test for enrichment of KEGG pathways.

An external file that holds a picture, illustration, etc.
Object name is btp059i2.jpg

We first convert the microRNA–transcript relationship to a microRNA–gene relationship using the BioMart2df.fun and corna.map.fun functions. We then find those genes predicted to be targets of mmu-mir-155. The next stage is to use the KEGG2df.fun function to obtain links between genes and pathways from KEGG for Mus musculus. Finally, we set the sample to be only those genes targeted by mmu-mir-155 that have a pathway link, and perform hypergeometric and Fisher's exact tests for the KEGG pathways involved. The top five pathways can be seen in Table 1.

Table 1.
Top five significant pathways from CORNA for mmu-mir-155

4 SUMMARY

With increasing use of large-scale post-genomics techniques, scientists are often presented with lists of genes. MicroRNAs have emerged as an important regulator of gene function. In this article, we have shown that CORNA can be used to test for significant associations between genes, microRNAs, pathways and GO terms. CORNA can also be used to plot quantitative data associated with microRNA targets. CORNA is flexible and can read data from public databases or from a user's own files. CORNA has been tested on both Microsoft Windows and Red Hat Linux. CORNA is released under the GNU GPL and is available from http://corna.sf.net.

Funding: BBSRC (core strategic grant of the Institute for Animal Health).

Conflict of Interest: none declared.

REFERENCES

  • Barrett T, et al. NCBI GEO: archive for high-throughput functional genomic data. Nucleic Acids Res. 2008;37:D5–D15. [PMC free article] [PubMed]
  • Betel D, et al. The microRNA.org resource: targets and expression. Nucleic Acids Res. 2008;36:D149–D153. [PMC free article] [PubMed]
  • Brennecke J, et al. Principles of microRNA-target recognition. PLoS Biol. 2005;3:e85. [PMC free article] [PubMed]
  • Creighton CJ, et al. A bioinformatics tool for linking gene expression profiling results with public databases of microRNA target predictions. RNA. 2008;14:2290–2296. [PMC free article] [PubMed]
  • Durinck S, et al. BioMart and Bioconductor: a powerful link between biological databases and microarray data analysis. Bioinformatics. 2005;21:3439–3440. [PubMed]
  • Falcon S, Gentleman R. Using GOstats to test gene lists for GO term association. Bioinformatics. 2007;23:257–258. [PubMed]
  • Griffiths-Jones S, et al. miRBase: microRNA sequences, targets and gene nomenclature. Nucleic Acids Res. 2006;34:D140–D144. [PMC free article] [PubMed]
  • Kanehisa M, et al. The KEGG resource for deciphering the genome. Nucleic Acids Res. 2004;32:D277–D280. [PMC free article] [PubMed]
  • Salomonis N, et al. GenMAPP 2: new features and resources for pathway analysis. BMC Bioinformatics. 2007;8:217. [PMC free article] [PubMed]

Articles from Bioinformatics are provided here courtesy of Oxford University Press
PubReader format: click here to try

Formats:

Related citations in PubMed

See reviews...See all...

Cited by other articles in PMC

See all...

Links

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...