An integrated web interface for large-scale characterization of sequence data

Funct Integr Genomics. 2000 May;1(1):70-5. doi: 10.1007/s101420000004.

Abstract

Large-scale genome projects require the analysis of large amounts of raw data. This analysis often involves the application of a chain of biology-based programs. Many of these programs are difficult to operate because they are non-integrated, command-line driven, and platform-dependent. The problem is compounded when the number of data files involved is large, making navigation and status-tracking difficult. To demonstrate how this problem can be addressed, we have created a platform-independent Web front end that integrates a set of programs used in a genomic project analyzing gene function by transposon mutagenesis in Saccharomyces cerevisiae. In particular, these programs help define a large number of transposon insertion events within the yeast genome, identifying both the precise site of transposon insertion as well as potential open reading frames disrupted by this insertion event. Our Web interface facilitates this analysis by performing the following tasks. Firstly, it allows each of the analysis programs to be launched against multiple directories of data files. Secondly, it allows the user to view, download, and upload files generated by the programs. Thirdly, it indicates which sets of data directories have been processed by each program. Although designed specifically to aid in this project, our interface exemplifies a general approach by which independent software programs may be integrated into an efficient protocol for large-scale genomic data processing.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Clinical Laboratory Information Systems
  • Computational Biology / methods*
  • DNA Transposable Elements / genetics
  • DNA, Fungal / genetics
  • Databases, Genetic
  • Databases, Protein
  • Gene Expression Profiling
  • Genome*
  • Genome, Fungal
  • Internet
  • Mutagenesis, Insertional
  • Open Reading Frames
  • Saccharomyces cerevisiae / genetics
  • Sequence Analysis, DNA / methods*
  • Sequence Homology, Nucleic Acid
  • Software
  • User-Computer Interface

Substances

  • DNA Transposable Elements
  • DNA, Fungal