Chromosoma. 2014 Mar;123(1-2):165-8. doi: 10.1007/s00412-013-0443-8. Epub 2013 Nov 10.

Assembly errors cause false tandem duplicate regions in the chicken (Gallus gallus) genome sequence.

Author information

1. Department of Human Evolutionary Biology, Harvard University, 11 Divinity Avenue, Cambridge, MA, 02138, USA, quzhang@post.harvard.edu.

Abstract

The complexity of eukaryotic genomes makes assembly errors inevitable when constructing reference genomes. Next-generation sequencing (NGS) could provide an efficient way to validate previously assembled genomes. Here, we exploited NGS data to interrogate the chicken reference genome and identified 35 pairs of nearly identical regions with >99.5% sequence similarity and a median size of 109 kb. Several lines of evidence, including read depth, the composition of junction sequences, and sequence similarity, suggest that these regions represent genome assembly errors and should be excluded from forthcoming genomic studies.
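
The read-depth line of evidence mentioned above can be illustrated with a minimal sketch. It is not the authors' method. It assumes that when one real locus appears twice in an assembly, reads from that locus are split between the two copies, so each copy shows roughly half the genome-wide depth. The function names and the 0.35 to 0.65 window are hypothetical choices made for illustration only.

```python
from statistics import median


def depth_ratio(region_depths, genome_median_depth):
    """Median per-base depth in a region divided by the genome-wide median depth."""
    if genome_median_depth <= 0:
        raise ValueError("genome_median_depth must be positive")
    return median(region_depths) / genome_median_depth


def is_candidate_false_duplicate(ratio, low=0.35, high=0.65):
    """Flag a region whose depth is near half the genome-wide level.

    The window (low, high) is an assumed, illustrative threshold,
    not a value taken from the paper.
    """
    return low <= ratio <= high


# Toy per-base depths for one copy of a suspected duplicate pair.
suspect = depth_ratio([14, 15, 16, 15], genome_median_depth=30)
# Toy per-base depths for an ordinary single-copy region.
normal = depth_ratio([29, 30, 31], genome_median_depth=30)

print(suspect, is_candidate_false_duplicate(suspect))
print(normal, is_candidate_false_duplicate(normal))
```

A true tandem duplication would be expected to show depth close to the genome-wide level in both copies, which is why a ratio near one half points toward an assembly artifact under this assumption. Depth alone is only suggestive, and the abstract combines it with junction sequence composition and sequence similarity.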

PMID: 24213641
DOI: 10.1007/s00412-013-0443-8
[Indexed for MEDLINE]
