Format

Send to

Choose Destination
G3 (Bethesda). 2018 Oct 3;8(10):3131-3141. doi: 10.1534/g3.118.200160.

Highly Contiguous Genome Assemblies of 15 Drosophila Species Generated Using Nanopore Sequencing.

Author information

1
Stowers Institute for Medical Research, Kansas City, MO danny.miller@seattlechildrens.org.
2
MD-PhD Physician Scientist Training Program, University of Kansas Medical Center, Kansas City, KS.
3
Stowers Institute for Medical Research, Kansas City, MO.
4
Department of Pathology, University of Kansas Medical Center, Kansas City, KS 66160.
5
Department of Molecular and Integrative Physiology, University of Kansas Medical Center, Kansas City, KS.

Abstract

The Drosophila genus is a unique group containing a wide range of species that occupy diverse ecosystems. In addition to the most widely studied species, Drosophila melanogaster, many other members in this genus also possess a well-developed set of genetic tools. Indeed, high-quality genomes exist for several species within the genus, facilitating studies of the function and evolution of cis-regulatory regions and proteins by allowing comparisons across at least 50 million years of evolution. Yet, the available genomes still fail to capture much of the substantial genetic diversity within the Drosophila genus. We have therefore tested protocols to rapidly and inexpensively sequence and assemble the genome from any Drosophila species using single-molecule sequencing technology from Oxford Nanopore. Here, we use this technology to present highly contiguous genome assemblies of 15 Drosophila species: 10 of the 12 originally sequenced Drosophila species (ananassae, erecta, mojavensis, persimilis, pseudoobscura, sechellia, simulans, virilis, willistoni, and yakuba), four additional species that had previously reported assemblies (biarmipes, bipectinata, eugracilis, and mauritiana), and one novel assembly (triauraria). Genomes were generated from an average of 29x depth-of-coverage data that after assembly resulted in an average contig N50 of 4.4 Mb. Subsequent alignment of contigs from the published reference genomes demonstrates that our assemblies could be used to close over 60% of the gaps present in the currently published reference genomes. Importantly, the materials and reagents cost for each genome was approximately $1,000 (USD). This study demonstrates the power and cost-effectiveness of long-read sequencing for genome assembly in Drosophila and provides a framework for the affordable sequencing and assembly of additional Drosophila genomes.

KEYWORDS:

Drosophila; Genome assembly; Nanopore sequencing; third-generation sequencing

Supplemental Content

Full text links

Icon for HighWire Icon for PubMed Central
Loading ...
Support Center