Study of gene function based on spatial co-expression in a high-resolution mouse brain atlas

BMC Syst Biol. 2007 Apr 16:1:19. doi: 10.1186/1752-0509-1-19.

Abstract

Background: The Allen Brain Atlas (ABA) project systematically profiles three-dimensional high-resolution gene expression in postnatal mouse brains for thousands of genes. By unveiling gene behaviors at both the cellular and molecular levels, ABA is becoming a unique and comprehensive neuroscience data source for decoding enigmatic biological processes in the brain. Given the unprecedented volume and complexity of the in situ hybridization image data, data mining in this area is extremely challenging. Currently, the ABA database mainly serves as an online reference for visual inspection of individual genes; the underlying rich information of this large data set is yet to be explored by novel computational tools. In this proof-of-concept study, we studied the hypothesis that genes sharing similar three-dimensional expression profiles in the mouse brain are likely to share similar biological functions.

Results: In order to address the pattern comparison challenge when analyzing the ABA database, we developed a robust image filtering method, dubbed histogram-row-column (HRC) algorithm. We demonstrated how the HRC algorithm offers the sensitivity of identifying a manageable number of gene pairs based on automatic pattern searching from an original large brain image collection. This tool enables us to quickly identify genes of similar in situ hybridization patterns in a semi-automatic fashion and consequently allows us to discover several gene expression patterns with expression neighborhoods containing genes of similar functional categories.

Conclusion: Given a query brain image, HRC is a fully automated algorithm that is able to quickly mine vast number of brain images and identify a manageable subset of genes that potentially shares similar spatial co-distribution patterns for further visual inspection. A three-dimensional in situ hybridization pattern, if statistically significant, could serve as a fingerprint of certain gene function. Databases such as ABA provide valuable data source for characterizing brain-related gene functions when armed with powerful image querying tools like HRC.

MeSH terms

  • Adenylyl Cyclases / genetics
  • Adenylyl Cyclases / physiology
  • Animals
  • Brain / anatomy & histology*
  • Brain / metabolism*
  • Calcium Signaling / genetics
  • Corpus Striatum / anatomy & histology
  • Corpus Striatum / metabolism
  • Databases, Genetic
  • Dopamine / metabolism
  • Dopamine Plasma Membrane Transport Proteins / genetics
  • Dopamine Plasma Membrane Transport Proteins / physiology
  • Gene Expression Profiling
  • Gene Expression*
  • Genes / physiology*
  • Isoenzymes / genetics
  • Isoenzymes / physiology
  • Mice
  • Neurons / metabolism
  • Parkinson Disease / genetics
  • Phosphoproteins / genetics
  • Phosphoproteins / physiology
  • Substantia Nigra / anatomy & histology
  • Substantia Nigra / metabolism

Substances

  • Dopamine Plasma Membrane Transport Proteins
  • Isoenzymes
  • Phosphoproteins
  • Slc6a3 protein, mouse
  • cyclic AMP-regulated phosphoprotein ARPP-21
  • Adenylyl Cyclases
  • adenylyl cyclase type V
  • Dopamine