AAGH00000000.1 Drosophila simulans
The Drosophila simulans whole genome shotgun (WGS) project has the project accession AAGH00000000. This version of the project (01) has the accession number AAGH01000000, and consists of sequences AAGH01000001-AAGH01031198.
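As a minimal sketch of how the accession range above translates into a record count, the snippet below parses a range of the form PREFIX+digits and counts the accessions it spans. The function name `accession_range_size` is hypothetical, introduced only for illustration. It assumes that both ends of a range share the same alphabetic prefix and that accessions are numbered consecutively.

```python
import re

# Hypothetical helper (not part of any GenBank tool): counts the accessions
# in a range such as "AAGH01000001-AAGH01031198", assuming consecutive numbering.
_ACCESSION = re.compile(r"([A-Z]+)(\d+)")


def accession_range_size(span: str) -> int:
    first, last = span.split("-")
    m_first = _ACCESSION.fullmatch(first)
    m_last = _ACCESSION.fullmatch(last)
    if m_first is None or m_last is None:
        raise ValueError(f"unrecognized accession in {span!r}")
    if m_first.group(1) != m_last.group(1):
        raise ValueError(f"prefixes differ in {span!r}")
    # Inclusive range: last - first + 1
    return int(m_last.group(2)) - int(m_first.group(2)) + 1


wgs_contigs = accession_range_size("AAGH01000001-AAGH01031198")
print(wgs_contigs)
```

Under these assumptions, version 01 of the project spans 31,198 sequence records.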
The sequenced line is derived from a white501 (w501) stock obtained from the Drosophila Species Stock Center in Bowling Green, OH in 1997. It was subsequently inbred by sib-pair mating for nine generations by Daniel Barbash at UC Davis. The provenance of this line is poorly documented. However, it is likely derived from a Drosophila simulans stock collected in North America in the 1940s.

----- Drosophila simulans Reference Assembly -----

The Drosophila simulans reference assembly consists of the records CM000361-CM000366 (the chromosomes), CH981541-CH984324 (the random linked, unplaced scaffolds) and CH984325-CH991539 (the unlinked scaffolds). This is the CAF1 assembly of the Drosophila simulans genome. It represents a mosaic of several different D. simulans lines.

The assembly process began with a 4x WGS assembly of the D. simulans white501 (w501) line, AAGH00000000. The w501 contigs were initially anchored, ordered and oriented by alignment with the D. melanogaster genome. The assembly was then examined for places where the w501 assembly suggested inversions with respect to the D. melanogaster assembly. One major inversion was found, confirming the inversion previously documented by Lemeunier and Ashburner (1976).

Six other D. simulans lines (C167.4, MD106TS, MD199S, New Caledonia 48S, SIM4, and SIM6) were assembled with approximately 1x coverage (WGS projects AASR00000000-AASW00000000, respectively). The 4x WGS assembly of the D. simulans w501 genome was used as a scaffold, and the contigs and unplaced reads from the 1x assemblies of the other individual D. simulans lines were used to cover gaps in the w501 assembly where possible. The resulting assembly is thus a mosaic, with the w501 contigs as the primary scaffolding and contigs and unplaced reads from the other lines filling gaps in the w501 assembly.

Total size is 142,405,747 bp including gaps and 127,241,461 bp excluding gaps. For more information about the D. simulans assembly and its statistics, see the Drosophila simulans web page of the WUSTL Genome Sequencing Center, reachable from the home page, http://genome.wustl.edu/home.cgi.

The gene annotation is based on FlyBase Release 1.3, which contains some corrections of the original annotation published by the Drosophila 12 Genomes Consortium. Annotation was added to the scaffolds in July 2008.
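The two total sizes quoted for the assembly imply how much of it consists of gaps. The short calculation below is only a sketch of that arithmetic, using the two figures stated in this record. The variable names are illustrative.

```python
# Figures as stated in this record (bp).
total_with_gaps = 142_405_747
total_without_gaps = 127_241_461

# Gap content is the difference between the two totals.
gap_bp = total_with_gaps - total_without_gaps
gap_fraction = gap_bp / total_with_gaps

print(f"gap bases: {gap_bp:,} bp")
print(f"gap fraction of assembly: {gap_fraction:.1%}")
```

By this arithmetic, gaps account for 15,164,286 bp, or a little under 11% of the assembly length including gaps.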