Rearrangement distance with reversals, indels, and moves in intergenic regions on signed and unsigned permutations

J Bioinform Comput Biol. 2023 Apr;21(2):2350009. doi: 10.1142/S0219720023500099. Epub 2023 Apr 27.

Abstract

Genome rearrangement events are widely used to estimate a minimum-size sequence of mutations capable of transforming a genome into another. The length of this sequence is called distance, and determining it is the main goal in genome rearrangement distance problems. Problems in the genome rearrangement field differ regarding the set of rearrangement events allowed and the genome representation. In this work, we consider the scenario where the genomes share the same set of genes, gene orientation is known or unknown, and intergenic regions (structures between a pair of genes and at the extremities of the genome) are taken into account. We use two models, the first model allows only conservative events (reversals and moves), and the second model includes non-conservative events (insertions and deletions) in the intergenic regions. We show that both models result in NP-hard problems no matter if gene orientation is known or unknown. When the information regarding the orientation of genes is available, we present for both models an approximation algorithm with a factor of 2. For the scenario where this information is unavailable, we propose a 4-approximation algorithm for both models.

Keywords: Genome rearrangements; approximation algorithms; complexity analysis.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • DNA, Intergenic / genetics
  • Gene Rearrangement*
  • Genome
  • Models, Genetic*
  • Mutation

Substances

  • DNA, Intergenic