Mulan: multiple-sequence local alignment and visualization for studying function and evolution

Genome Res. 2005 Jan;15(1):184-94. doi: 10.1101/gr.3007205. Epub 2004 Dec 8.

Abstract

Multiple-sequence alignment analysis is a powerful approach for understanding phylogenetic relationships, annotating genes, and detecting functional regulatory elements. With a growing number of partly or fully sequenced vertebrate genomes, effective tools for performing multiple comparisons are required to accurately and efficiently assist biological discoveries. Here we introduce Mulan (http://mulan.dcode.org/), a novel method and a network server for comparing multiple draft and finished-quality sequences to identify functional elements conserved over evolutionary time. Mulan brings together several novel algorithms: the TBA multi-aligner program for rapid identification of local sequence conservation, and the multiTF program for detecting evolutionarily conserved transcription factor binding sites in multiple alignments. In addition, Mulan supports two-way communication with the GALA database; alignments of multiple species dynamically generated in GALA can be viewed in Mulan, and conserved transcription factor binding sites identified with Mulan/multiTF can be integrated and overlaid with extensive genome annotation data using GALA. Local multiple alignments computed by Mulan ensure reliable representation of short- and large-scale genomic rearrangements in distant organisms. Mulan allows for interactive modification of critical conservation parameters to differentially predict conserved regions in comparisons of both closely and distantly related species. We illustrate the uses and applications of the Mulan tool through multispecies comparisons of the GATA3 gene locus and the identification of elements that are conserved in a different way in avians than in other genomes, allowing speculation on the evolution of birds. Source code for the aligners and the aligner-evaluation software can be freely downloaded from http://www.bx.psu.edu/miller_lab/.

Publication types

  • Comparative Study
  • Evaluation Study
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Anura / genetics
  • Binding Sites / genetics
  • Chickens / genetics
  • Computational Biology / methods
  • Computer Graphics*
  • Conserved Sequence / genetics
  • DNA-Binding Proteins / genetics
  • Evolution, Molecular*
  • Fishes / genetics
  • GATA3 Transcription Factor
  • Genome
  • Genome, Human
  • Humans
  • Mice
  • Phylogeny
  • Rats
  • Sequence Alignment / methods*
  • Sequence Homology, Nucleic Acid
  • Software
  • Trans-Activators / genetics

Substances

  • DNA-Binding Proteins
  • GATA3 Transcription Factor
  • GATA3 protein, human
  • Gata3 protein, mouse
  • Trans-Activators