De novo assembly of a chromosome-scale reference genome for the northern flicker Colaptes auratus

G3 (Bethesda). 2021 Jan 18;11(1):jkaa026. doi: 10.1093/g3journal/jkaa026.

Abstract

The northern flicker, Colaptes auratus, is a widely distributed North American woodpecker and a long-standing focal species for the study of ecology, behavior, phenotypic differentiation, and hybridization. We present here a highly contiguous de novo genome assembly of C. auratus, the first such assembly for the species and the first published chromosome-level assembly for woodpeckers (Picidae). The assembly was generated using a combination of short-read Chromium 10× and long-read PacBio sequencing, and further scaffolded with chromatin conformation capture (Hi-C) reads. The resulting genome assembly is 1.378 Gb in size, with a scaffold N50 of 11 and a scaffold L50 of 43.948 Mb. This assembly contains 87.4-91.7% of genes present across four sets of universal single-copy orthologs found in tetrapods and birds. We annotated the assembly both for genes and repetitive content, identifying 18,745 genes and a prevalence of ∼28.0% repetitive elements. Lastly, we used fourfold degenerate sites from neutrally evolving genes to estimate a mutation rate for C. auratus, which we estimated to be 4.007 × 10-9 substitutions/site/year, about 1.5× times faster than an earlier mutation rate estimate of the family. The highly contiguous assembly and annotations we report will serve as a resource for future studies on the genomics of C. auratus and comparative evolution of woodpeckers.

Keywords: Colaptes auratus; Hi-C; PacBio; genome assembly; woodpeckers.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Birds
  • Chromosomes*
  • Genome*
  • Genomics
  • Repetitive Sequences, Nucleic Acid

Associated data

  • figshare/10.25387/g3.12821822