Characterization and evolutionary dynamics of complex regions in eukaryotic genomes

Sci China Life Sci. 2019 Apr;62(4):467-488. doi: 10.1007/s11427-018-9458-0. Epub 2019 Feb 22.

Abstract

Complex regions in eukaryotic genomes are typically characterized by duplications of chromosomal stretches that often include one or more genes repeated in a tandem array or in relatively close proximity. Nevertheless, the repetitive nature of these regions, together with the often high sequence identity among repeats, have made complex regions particularly recalcitrant to proper molecular characterization, often being misassembled or completely absent in genome assemblies. This limitation has prevented accurate functional and evolutionary analyses of these regions. This is becoming increasingly relevant as evidence continues to support a central role for complex genomic regions in explaining human disease, developmental innovations, and ecological adaptations across phyla. With the advent of long-read sequencing technologies and suitable assemblers, the development of algorithms that can accommodate sample heterozygosity, and the adoption of a pangenomic-like view of these regions, accurate reconstructions of complex regions are now within reach. These reconstructions will finally allow for accurate functional and evolutionary studies of complex genomic regions, underlying the generation of genotype-phenotype maps of unprecedented resolution.

Keywords: complex genomic regions; evolutionary novelty; genome assembly; sequencing technologies; structural variation; tandem gene duplicates.

Publication types

  • Review

MeSH terms

  • Eukaryota / genetics*
  • Evolution, Molecular*
  • Gene Duplication
  • Genome / genetics*
  • Genomic Structural Variation
  • Genomics / standards
  • Multigene Family
  • Sequence Analysis, DNA / standards