Discovering genes-diseases associations from specialized literature using the grid

IEEE Trans Inf Technol Biomed. 2009 Jul;13(4):554-60. doi: 10.1109/TITB.2008.2007755. Epub 2008 Oct 31.

Abstract

This paper proposes a novel method for text mining on the Grid, aimed at pointing out hidden relationships for hypothesis generation and suitable for semi-interactive querying. The method is based on unsupervised clustering and the outputs are visualized with contextual information. Grid implementation is crucial for feasibility. We demonstrate it with a mining run for discovering genes-diseases associations from bibliographic sources and annotated databases. The proposed methodology is in view of a Grid architecture specialized in bioinformatics mining tasks. Some performance considerations are provided.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Cluster Analysis
  • Computational Biology / methods*
  • Cystic Fibrosis / genetics
  • Databases, Bibliographic
  • Disease / genetics*
  • Genetic Predisposition to Disease*