Format

Send to

Choose Destination
Version 2. F1000Res. 2018 Mar 14 [revised 2018 Jun 8];7:319. doi: 10.12688/f1000research.14148.2. eCollection 2018.

segment_liftover : a Python tool to convert segments between genome assemblies.

Gao B1,2, Huang Q1,2, Baudis M1,2.

Author information

1
Institute of molecular Life Sciences, University of Zürich, Zürich, CH-8057, Switzerland.
2
Swiss Institute of Bioinformatics, University of Zürich, Zürich, CH-8057, Switzerland.

Abstract

The process of assembling a species' reference genome may be performed in a number of iterations, with subsequent genome assemblies differing in the coordinates of mapped elements. The conversion of genome coordinates between different assemblies is required for many integrative and comparative studies. While currently a number of bioinformatics tools are available to accomplish this task, most of them are tailored towards the conversion of single genome coordinates. When converting the boundary positions of segments spanning larger genome regions, segments may be mapped into smaller sub-segments if the original segment's continuity is disrupted in the target assembly. Such a conversion may lead to a relevant degree of data loss in some circumstances such as copy number variation (CNV) analysis, where the quantitative representation of a genomic region takes precedence over base-specific accuracy. segment_liftover aims at continuity-preserving remapping of genome segments between assemblies and provides features such as approximate locus conversion, automated batch processing and comprehensive logging to facilitate processing of datasets containing large numbers of structural genome variation data.

KEYWORDS:

Genome assembly; copy number segment.; liftover; remap

Supplemental Content

Full text links

Icon for F1000 Research Ltd Icon for PubMed Central
Loading ...
Support Center