Association analysis for large-scale gene set data

Methods Mol Biol. 2007:408:19-33. doi: 10.1007/978-1-59745-547-3_2.

Abstract

High-throughput experiments in biology often produce sets of genes of potential interests. Some of those gene sets might be of considerable size. Therefore, computer-assisted analysis is necessary for the biological interpretation of the gene sets, and for creating working hypotheses, which can be tested experimentally. One obvious way to analyze gene set data is to associate the genes with a particular biological feature, for example, a given pathway. Statistical analysis could be used to evaluate if a gene set is truly associated with a feature. Over the past few years many tools that perform such analysis have been created. In this chapter, using WebGestalt as an example, it will be explained in detail how to associate gene sets with functional annotations, pathways, publication records, and protein domains.

MeSH terms

  • Computational Biology
  • Data Interpretation, Statistical
  • Databases, Genetic*
  • Gene Expression Profiling / statistics & numerical data
  • Genetic Techniques / statistics & numerical data*
  • Genomics / statistics & numerical data
  • Oligonucleotide Array Sequence Analysis / statistics & numerical data
  • Software*