ARACNe-based inference, using curated microarray data, of Arabidopsis thaliana root transcriptional regulatory networks

BMC Plant Biol. 2014 Apr 16:14:97. doi: 10.1186/1471-2229-14-97.

Abstract

Background: Uncovering the complex transcriptional regulatory networks (TRNs) that underlie plant and animal development remains a challenge. However, a vast amount of data from public microarray experiments is available, which can be subject to inference algorithms in order to recover reliable TRN architectures.

Results: In this study we present a simple bioinformatics methodology that uses public, carefully curated microarray data and the mutual information algorithm ARACNe in order to obtain a database of transcriptional interactions. We used data from Arabidopsis thaliana root samples to show that the transcriptional regulatory networks derived from this database successfully recover previously identified root transcriptional modules and to propose new transcription factors for the SHORT ROOT/SCARECROW and PLETHORA pathways. We further show that these networks are a powerful tool to integrate and analyze high-throughput expression data, as exemplified by our analysis of a SHORT ROOT induction time-course microarray dataset, and are a reliable source for the prediction of novel root gene functions. In particular, we used our database to predict novel genes involved in root secondary cell-wall synthesis and identified the MADS-box TF XAL1/AGL12 as an unexpected participant in this process.

Conclusions: This study demonstrates that network inference using carefully curated microarray data yields reliable TRN architectures. In contrast to previous efforts to obtain root TRNs, that have focused on particular functional modules or tissues, our root transcriptional interactions provide an overview of the transcriptional pathways present in Arabidopsis thaliana roots and will likely yield a plethora of novel hypotheses to be tested experimentally.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms*
  • Arabidopsis / genetics*
  • Arabidopsis / radiation effects
  • Arabidopsis Proteins / genetics
  • Arabidopsis Proteins / metabolism
  • Cell Wall / metabolism
  • Cell Wall / radiation effects
  • Data Mining*
  • Databases, Genetic*
  • Epistasis, Genetic / radiation effects
  • Gene Expression Profiling
  • Gene Expression Regulation, Plant / radiation effects
  • Gene Regulatory Networks / genetics*
  • Genomics
  • High-Throughput Nucleotide Sequencing
  • Mutation / genetics
  • Oligonucleotide Array Sequence Analysis*
  • Plant Roots / genetics*
  • Plant Roots / radiation effects
  • Time Factors
  • Transcription Factors / genetics
  • Transcription Factors / metabolism
  • Ultraviolet Rays

Substances

  • Arabidopsis Proteins
  • Transcription Factors