Display Settings:


Send to:

Choose Destination
PLoS Curr. 2011 Feb 7;3:RRN1213. doi: 10.1371/currents.RRN1213.

Hal: an automated pipeline for phylogenetic analyses of genomic data.

Author information

  • 1National Center for Biotechnology Information, Bethesda, Maryland; Peace Corps; Oregon State University and Bonzi Software Development.


The rapid increase in genomic and genome-scale data is resulting in unprecedented levels of discrete sequence data available for phylogenetic analyses. Major analytical impasses exist, however, prior to analyzing these data with existing phylogenetic software. Obstacles include the management of large data sets without standardized naming conventions, identification and filtering of orthologous clusters of proteins or genes, and the assembly of alignments of orthologous sequence data into individual and concatenated super alignments. Here we report the production of an automated pipeline, Hal that produces multiple alignments and trees from genomic data. These alignments can be produced by a choice of four alignment programs and analyzed by a variety of phylogenetic programs. In short, the Hal pipeline connects the programs BLASTP, MCL, user specified alignment programs, GBlocks, ProtTest and user specified phylogenetic programs to produce species trees. The script is available at sourceforge (http://sourceforge.net/projects/bio-hal/). The results from an example analysis of Kingdom Fungi are briefly discussed.

Free PMC Article

Images from this publication.See all images (2)Free text

Fig. 1.
Fig. 2
PubMed Commons home

PubMed Commons

How to join PubMed Commons

    Supplemental Content

    Icon for Public Library of Science Icon for PubMed Central
    Loading ...
    Write to the Help Desk