Sorting Signed Permutations by Intergenic Reversals

IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2870-2876. doi: 10.1109/TCBB.2020.2993002. Epub 2021 Dec 8.

Abstract

Genome rearrangements are mutations affecting large portions of a genome, and a reversal is one of the most studied genome rearrangements in the literature through the Sorting by Reversals (SbR) problem. SbR is solvable in polynomial time on signed permutations (i.e., the gene orientation is known), and it is NP-hard on unsigned permutations. This problem (and many others considering genome rearrangements) models genome as a list of its genes in the order they appear, ignoring all other information present in the genome. Recent works claimed that the incorporation of the size of intergenic regions, i.e., sequences of nucleotides between genes, may result in better estimators for the real distance between genomes. Here we introduce the Sorting Signed Permutations by Intergenic Reversals problem, that sorts a signed permutation using reversals both on gene order and intergenic sizes. We show that this problem is NP-hard by a reduction from the 3-partition problem. Then, we propose a 2-approximation algorithm for it. Finally, we also incorporate intergenic indels (i.e., insertions or deletions of intergenic regions) to overcome a limitation of sorting by conservative events (such as reversals) and propose two approximation algorithms.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • DNA, Intergenic / genetics*
  • Gene Rearrangement / genetics*
  • Genomics / legislation & jurisprudence*
  • INDEL Mutation / genetics
  • Models, Genetic
  • Mutation / genetics

Substances

  • DNA, Intergenic