Genetics. 2015 Jul;200(3):975-89. doi: 10.1534/genetics.115.175950. Epub 2015 May 19.

Remarkably Divergent Regions Punctuate the Genome Assembly of the Caenorhabditis elegans Hawaiian Strain CB4856.

Author information

Department of Genome Sciences, University of Washington, Seattle, Washington 98195.
Laboratory of Nematology, Wageningen University, 6708 PB Wageningen, The Netherlands.
Laboratory of Bioinformatics, Wageningen University, NL-6708 PB Wageningen, The Netherlands.
Centre for Genome Research, Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom.
Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, CH-1015 Lausanne, Switzerland.
Department of Biology, Technion-Israel Institute of Technology, Haifa, Israel 32000.
Institute of Molecular Life Sciences, University of Zurich, CH-8057 Zurich, Switzerland.
Department of Zoology and Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada V6T 1Z3.
Howard Hughes Medical Institute, Department of Human Genetics and Department of Biological Chemistry, David Geffen School of Medicine, University of California, Los Angeles, California 90095.
Department of Molecular Biosciences, Northwestern University, Evanston, Illinois 60208.


The Hawaiian strain (CB4856) of Caenorhabditis elegans is one of the most divergent from the canonical laboratory strain N2 and has been widely used in developmental, population, and evolutionary studies. To enhance the utility of the strain, we have generated a draft sequence of the CB4856 genome, exploiting a variety of resources and strategies. When compared against the N2 reference, the CB4856 genome has 327,050 single nucleotide variants (SNVs) and 79,529 insertion-deletion events that result in a total of 3.3 Mb of N2 sequence missing from CB4856 and 1.4 Mb of sequence present in CB4856 but not present in N2. As previously reported, the density of SNVs varies along the chromosomes, with the arms of chromosomes showing greater average variation than the centers. In addition, we find 61 regions totaling 2.8 Mb, distributed across all six chromosomes, which have a greatly elevated SNV density, ranging from 2 to 16% SNVs. A survey of other wild isolates shows that the two alternative haplotypes for each region are widely distributed, suggesting they have been maintained by balancing selection over long evolutionary times. These divergent regions contain an abundance of genes from large rapidly evolving families encoding F-box, MATH, BATH, seven-transmembrane G-coupled receptors, and nuclear hormone receptors, suggesting that they provide selective advantages in natural environments. The draft sequence makes available a comprehensive catalog of sequence differences between the CB4856 and N2 strains that will facilitate the molecular dissection of their phenotypic differences. Our work also emphasizes the importance of going beyond simple alignment of reads to a reference genome when assessing differences between genomes.
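To make the notion of a region with "greatly elevated SNV density" concrete, the following Python sketch computes per-window SNV density and merges adjacent high-density windows into intervals. It is an illustration only and is not the procedure used in the paper: the function names, the fixed non-overlapping 10 kb window, and the use of the 2% lower bound from the abstract as a cutoff are all assumptions made here.

```python
from collections import Counter


def snv_density_windows(snv_positions, chrom_length, window=10_000):
    """Fraction of variant sites in each fixed, non-overlapping window.

    snv_positions: 1-based SNV coordinates on one chromosome (assumed input).
    window: window size in bp (hypothetical choice, not from the paper).
    """
    counts = Counter((pos - 1) // window for pos in snv_positions)
    n_windows = (chrom_length + window - 1) // window  # ceiling division
    densities = []
    for i in range(n_windows):
        start = i * window
        end = min(start + window, chrom_length)  # last window may be short
        densities.append(counts.get(i, 0) / (end - start))
    return densities


def flag_divergent(densities, chrom_length, window=10_000, threshold=0.02):
    """Merge runs of windows with density >= threshold into intervals.

    Returns 0-based, half-open (start, end) intervals. The 0.02 default
    mirrors the 2% lower bound quoted in the abstract; using it as a hard
    cutoff is an assumption of this sketch.
    """
    regions = []
    run_start = None
    for i, density in enumerate(densities):
        if density >= threshold:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            regions.append((run_start * window, i * window))
            run_start = None
    if run_start is not None:
        regions.append((run_start * window, chrom_length))
    return regions
```

For example, on a toy 50 bp chromosome with 10 bp windows, `flag_divergent(snv_density_windows([11, 12, 13, 25], 50, window=10), 50, window=10, threshold=0.2)` flags only the second window. As the abstract's final sentence cautions, SNV calls from simple read alignment may be unreliable in exactly these regions, so such a scan is at best a first pass.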


Keywords: C. elegans; evolution; genome assembly; variation

[Indexed for MEDLINE]