mapDamage: testing for damage patterns in ancient DNA sequences

Bioinformatics. 2011 Aug 1;27(15):2153-5. doi: 10.1093/bioinformatics/btr347. Epub 2011 Jun 9.

Abstract

Summary: Ancient DNA extracts consist of a mixture of contaminant DNA molecules, most often originating from environmental microbes, and endogenous fragments exhibiting substantial levels of DNA damage. The latter introduce specific nucleotide misincorporations and DNA fragmentation signatures in sequencing reads that could be advantageously used to argue for sequence validity. mapDamage is a Perl script that computes nucleotide misincorporation and fragmentation patterns using next-generation sequencing reads mapped against a reference genome. The Perl script outputs are further automatically processed in embedded R script in order to detect typical patterns of genuine ancient DNA sequences.

Availability and implementation: The Perl script mapDamage is freely available with documentation and example files at http://geogenetics.ku.dk/all_literature/mapdamage/. The script requires prior installation of the SAMtools suite and R environment and has been validated on both GNU/Linux and MacOSX operating systems.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • Computational Biology / methods
  • DNA Contamination*
  • DNA Damage / genetics*
  • DNA Restriction Enzymes
  • Genome, Human
  • Humans
  • Paleontology
  • Reference Standards
  • Sequence Analysis, DNA / methods*
  • Software*

Substances

  • DNA Restriction Enzymes