Mol Biol Evol. 2016 Jul;33(7):1875-86. doi: 10.1093/molbev/msw056. Epub 2016 Mar 23.

BaitFisher: A Software Package for Multispecies Target DNA Enrichment Probe Design.

Author information

1
Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, Bonn, Germany.
2
Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, Bonn, Germany; Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Berlin, Germany.
3
Services in Molecular Biology GmbH, Rüdersdorf, Germany.
4
University of Bonn, Institute of Evolutionary Biology and Ecology, Bonn, Germany.
5
Department Arthropoda, Zoological Research Museum Alexander Koenig, Bonn, Germany.
6
Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, Bonn, Germany; Australian National Insect Collection, CSIRO National Research Collections Australia, Acton, Canberra, ACT, Australia.
7
Zoological Research Museum Alexander Koenig, Bonn, Germany.
8
Molecular Evolution and Systematics of Animals, Institute for Biology, University of Leipzig, Leipzig, Germany; German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Leipzig, Germany; Museo Nacional de Ciencias Naturales, Spanish National Research Council (CSIC), Madrid, Spain.
9
Museum für Naturkunde, Leibniz Institute for Evolution and Biodiversity Science, Berlin, Germany. o.niehuis@zfmk.de, michael.ohl@mfn-berlin.de
10
Center for Molecular Biodiversity Research, Zoological Research Museum Alexander Koenig, Bonn, Germany. o.niehuis@zfmk.de, michael.ohl@mfn-berlin.de

Abstract

Target DNA enrichment combined with high-throughput sequencing technologies is a powerful approach to probing a large number of loci in genomes of interest. However, no software algorithms have been developed that explicitly use the nucleotide sequence information of target loci in multiple reference species to optimize the design of target enrichment baits so that the baits are applicable across a wide range of species. Here we present an algorithm that infers target DNA enrichment baits from multiple nucleotide sequence alignments. By applying clustering methods and the combinatorial 1-center sequence optimization to bait design, we are able to minimize the total number of baits required to efficiently probe target loci in multiple species. Consequently, more loci can be probed across species with a given number of baits. Using transcript sequences of 24 apoid wasps (Hymenoptera: Crabronidae, Sphecidae) from the 1KITE project and the gene models of Nasonia vitripennis, we inferred 57,650 baits, each 120 bp long, for capturing 378 coding sequence sections of 282 genes in apoid wasps. Illumina reduced-representation library sequencing confirmed successful enrichment of the target DNA when applying these baits to DNA of various apoid wasps. The designed baits furthermore enriched a major fraction of the target DNA in distantly related Hymenoptera, such as Formicidae and Chalcidoidea, highlighting the baits' broad taxonomic applicability. The availability of baits with broad taxonomic applicability is of major interest in numerous disciplines, ranging from phylogenetics to biodiversity monitoring. We implemented our new approach in a software package, called BaitFisher, which is open source and freely available at https://github.com/cmayer/BaitFisher-package.git.
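
To illustrate the 1-center idea named in the abstract, the sketch below picks, from a small set of aligned sequences of equal length, the one whose largest Hamming distance to any other sequence is smallest. It is a simplified illustration and not the BaitFisher implementation: restricting candidate centers to the input sequences is an assumption made for brevity, and the function names `hamming` and `one_center` and the toy sequences are hypothetical.

```python
from typing import List, Tuple


def hamming(a: str, b: str) -> int:
    """Count mismatching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def one_center(seqs: List[str]) -> Tuple[str, int]:
    """Return the input sequence that minimizes the maximum Hamming
    distance to all input sequences, together with that distance.

    Assumption: candidate centers are limited to the inputs themselves,
    which is a simplification of the general 1-center problem.
    """
    if not seqs:
        raise ValueError("need at least one sequence")
    best = min(seqs, key=lambda c: max(hamming(c, s) for s in seqs))
    radius = max(hamming(best, s) for s in seqs)
    return best, radius


# Toy alignment window (hypothetical data): one row per reference species.
window = ["ACGTACGT", "ACGTACGA", "ACGAACGA", "TCGTACGA"]
center, radius = one_center(window)
```

In this toy case a single center sequence lies within one mismatch of every species' sequence, which is the sense in which one bait can stand in for several species and reduce the total bait count.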

KEYWORDS:

Hymenoptera; comparative genomics; hybrid enrichment; phylogenetics; phylogenomics

PMID: 27009209
DOI: 10.1093/molbev/msw056
[Indexed for MEDLINE]
