Bioinformatics. 2013 May 15;29(10):1341-2. doi: 10.1093/bioinformatics/btt128. Epub 2013 Mar 16.

HAL: a hierarchical format for storing and analyzing multiple genome alignments.

Author information

Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz CA 95064, USA. hickey@soe.ucsc.edu

Abstract

MOTIVATION:

Large multiple genome alignments and inferred ancestral genomes are ideal resources for comparative studies of molecular evolution, and advances in sequencing and computing technology are making them increasingly obtainable. These structures can provide a rich understanding of the genetic relationships between all subsets of species they contain. However, current formats for storing genomic alignments, such as XMFA and MAF, are all indexed or ordered using a single reference genome, which limits the information that can be queried with respect to other species and clades. This loss of information grows with the number of species under comparison, as well as with their phylogenetic distance.

RESULTS:

We present HAL, a compressed, graph-based hierarchical alignment format for storing multiple genome alignments and ancestral reconstructions. HAL graphs are indexed on all genomes they contain. Furthermore, they are organized phylogenetically, which allows modular and parallel access to arbitrary subclades without the fragmentation caused by rearrangements that have occurred in other lineages. HAL graphs can be created or read with a comprehensive C++ API. A set of tools is also provided to perform basic operations, such as importing and exporting data, identifying mutations and mapping coordinates (liftover).
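
To make the idea of a phylogenetically organized, all-genome-indexed alignment concrete, the following is a minimal Python sketch of coordinate mapping (liftover) through a common ancestor. It is a toy model written for illustration only: it is not the HAL file layout and does not use the HAL C++ API. The names `Segment`, `Genome`, `lift_to_parent`, `lift_to_child` and `liftover`, as well as the three-genome example, are hypothetical. The sketch assumes that each genome stores only its alignment to its parent genome, so a position is mapped between two species by walking up to their lowest common ancestor and back down.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Segment:
    """A gapless block of a genome, optionally aligned to its parent genome."""
    start: int                          # start position in this genome
    length: int                         # block length in bases
    parent_start: Optional[int] = None  # start in the parent; None if unaligned
    reverse_strand: bool = False        # True if aligned in reverse orientation


@dataclass
class Genome:
    name: str
    parent: Optional[str]               # None for the root (ancestral) genome
    segments: List[Segment] = field(default_factory=list)


def lift_to_parent(genome: Genome, pos: int) -> Optional[int]:
    """Map a position in a genome to its parent genome, if aligned."""
    for seg in genome.segments:
        if seg.start <= pos < seg.start + seg.length:
            if seg.parent_start is None:
                return None
            offset = pos - seg.start
            if seg.reverse_strand:
                return seg.parent_start + seg.length - 1 - offset
            return seg.parent_start + offset
    return None


def lift_to_child(child: Genome, parent_pos: int) -> Optional[int]:
    """Map a position in the parent genome down to a child genome, if aligned."""
    for seg in child.segments:
        if seg.parent_start is None:
            continue
        if seg.parent_start <= parent_pos < seg.parent_start + seg.length:
            offset = parent_pos - seg.parent_start
            if seg.reverse_strand:
                return seg.start + seg.length - 1 - offset
            return seg.start + offset
    return None


def path_to_root(genomes: Dict[str, Genome], name: str) -> List[str]:
    """Genome names from `name` up to the root, inclusive."""
    path = [name]
    while genomes[path[-1]].parent is not None:
        path.append(genomes[path[-1]].parent)
    return path


def liftover(genomes: Dict[str, Genome], src: str, pos: int, dst: str) -> Optional[int]:
    """Map a position from src to dst via their lowest common ancestor."""
    up = path_to_root(genomes, src)
    down = path_to_root(genomes, dst)
    on_down_path = set(down)
    lca = next(name for name in up if name in on_down_path)
    cur: Optional[int] = pos
    for name in up[:up.index(lca)]:                # climb from src to the ancestor
        cur = lift_to_parent(genomes[name], cur)
        if cur is None:
            return None
    for name in reversed(down[:down.index(lca)]):  # descend from the ancestor to dst
        cur = lift_to_child(genomes[name], cur)
        if cur is None:
            return None
    return cur


# Hypothetical example: one ancestor ("anc") with two descendants.
genomes = {
    "anc": Genome("anc", parent=None),
    "human": Genome("human", parent="anc", segments=[
        Segment(start=0, length=10, parent_start=5),
        Segment(start=10, length=5),               # insertion, unaligned
    ]),
    "mouse": Genome("mouse", parent="anc", segments=[
        Segment(start=0, length=4, parent_start=11, reverse_strand=True),
        Segment(start=4, length=6, parent_start=0),
    ]),
}

print(liftover(genomes, "human", 7, "mouse"))
```

Because no genome is singled out as the reference in this toy model, the same function maps mouse to human, human to the ancestor, or the ancestor to either descendant. A linear scan over segments is used here for brevity; a practical implementation would need an indexed lookup.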

AVAILABILITY:

All documentation and source code for the HAL API and tools are freely available at http://github.com/glennhickey/hal.

CONTACT:

hickey@soe.ucsc.edu or haussler@soe.ucsc.edu

SUPPLEMENTARY INFORMATION:

Supplementary data are available at Bioinformatics online.

PMID: 23505295
PMCID: PMC3654707
DOI: 10.1093/bioinformatics/btt128
[Indexed for MEDLINE]
Free PMC Article
