Microarray analysis of autoimmune diseases by machine learning procedures

IEEE Trans Inf Technol Biomed. 2009 May;13(3):341-50. doi: 10.1109/TITB.2008.2011984.

Abstract

Microarray-based global gene expression profiling, with the use of sophisticated statistical algorithms is providing new insights into the pathogenesis of autoimmune diseases. We have applied a novel statistical technique for gene selection based on machine learning approaches to analyze microarray expression data gathered from patients with systemic lupus erythematosus (SLE) and primary antiphospholipid syndrome (PAPS), two autoimmune diseases of unknown genetic origin that share many common features. The methodology included a combination of three data discretization policies, a consensus gene selection method, and a multivariate correlation measurement. A set of 150 genes was found to discriminate SLE and PAPS patients from healthy individuals. Statistical validations demonstrate the relevance of this gene set from an univariate and multivariate perspective. Moreover, functional characterization of these genes identified an interferon-regulated gene signature, consistent with previous reports. It also revealed the existence of other regulatory pathways, including those regulated by PTEN, TNF, and BCL-2, which are altered in SLE and PAPS. Remarkably, a significant number of these genes carry E2F binding motifs in their promoters, projecting a role for E2F in the regulation of autoimmunity.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Analysis of Variance
  • Antiphospholipid Syndrome / genetics*
  • Artificial Intelligence*
  • Bayes Theorem
  • Cluster Analysis
  • Female
  • Gene Expression Profiling / methods*
  • Gene Expression Regulation
  • Humans
  • Logistic Models
  • Lupus Erythematosus, Systemic / genetics*
  • Models, Genetic
  • Oligonucleotide Array Sequence Analysis / methods*
  • Reproducibility of Results
  • Reverse Transcriptase Polymerase Chain Reaction