
idEriAene1.1

Organism name:
Eristalinus aeneus (flies)
BioSample:
SAMEA112221990
BioProject:
PRJEB63370
Submitter:
WELLCOME SANGER INSTITUTE
Date:
2023/07/14
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_955652365.1 (latest)
RefSeq assembly accession:
n/a
RefSeq assembly and GenBank assembly identical:
n/a
WGS Project:
CATPCP01
Assembly method:
various
Genome coverage:
56x
Sequencing technology:
PacBio,Arima2
Linked assembly:
GCA_955652355.1 (alternate pseudohaplotype of diploid)

IDs: 17612731 [UID] 45146588 [GenBank]


There are 2 assemblies for this organism


Comment

The assembly idEriAene1.1 is based on 56x PacBio data and Arima2 Hi-C data generated by the Darwin Tree of Life Project (https://www.darwintreeoflife.org/). The assembly process included the following sequence of steps: initial PacBio assembly generation with Hifiasm, retained haplotig separation ... with purge_dups, and Hi-C based scaffolding with YaHS. The mitochondrial genome was assembled using MitoHiFi. Finally, the primary assembly was analysed and manually improved using rapid curation. Chromosome-scale scaffolds confirmed by the Hi-C data have been named in order of size.

Global statistics

Total sequence length: 495,389,070
Total ungapped length: 495,339,070
Gaps between scaffolds: 0
Number of scaffolds: 198
Scaffold N50: 85,846,282
Scaffold L50: 3
Number of contigs: 448
Contig N50: 5,283,291
Contig L50: 29
Total number of chromosomes and plasmids: 7
Number of component sequences (WGS or clone): 198
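
The scaffold N50 and L50 reported above can be reproduced from the scaffold lengths listed under Assembly statistics. The following is a minimal sketch, not NCBI's implementation: the function name `n50_l50` and its `total_length` parameter are illustrative. It assumes the five chromosome-scale scaffolds passed in are the five largest in the assembly, which holds here because all unplaced scaffolds together total only 13,352,378 bp.

```python
def n50_l50(lengths, total_length=None):
    """Return (N50, L50) for a collection of sequence lengths.

    If total_length is given, `lengths` may contain only the largest
    sequences, provided they cover at least half of total_length.
    """
    ordered = sorted(lengths, reverse=True)
    total = total_length if total_length is not None else sum(ordered)
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        # N50 is the length of the sequence at which the running sum
        # first reaches half of the total; L50 is how many were needed.
        if 2 * running >= total:
            return length, count
    raise ValueError("lengths cover less than half of total_length")


# Assembled-molecule lengths of chromosomes 1-5 from the Assembly statistics table.
largest_scaffolds = [131_691_179, 100_305_615, 85_846_282, 79_051_686, 78_373_126]

n50, l50 = n50_l50(largest_scaffolds, total_length=495_389_070)
print(n50, l50)
```

With the total sequence length of 495,389,070, the running sum first passes the halfway point at the third scaffold (chromosome 3), which matches the reported Scaffold N50 of 85,846,282 and Scaffold L50 of 3.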


Global assembly definition

Assembly Unit Name
Primary Assembly
non-nuclear
Assembly Unit: Primary Assembly (GCA_955652364.1)
Molecule name  GenBank sequence  GenBank/RefSeq identical  RefSeq sequence  Unlocalized sequences count
Chromosome 1   OY019130.1        n/a                       n/a              0
Chromosome 2   OY019131.1        n/a                       n/a              0
Chromosome 3   OY019132.1        n/a                       n/a              0
Chromosome 4   OY019133.1        n/a                       n/a              0
Chromosome 5   OY019134.1        n/a                       n/a              1
Chromosome 6   OY019135.1        n/a                       n/a              0
unplaced       n/a               n/a                       n/a              190

Assembly statistics

Molecule      Sequence Role          Total Length  Scaffold Count  Ungapped Length  Scaffold N50  Spanned Gaps  Unspanned Gaps
All           Assembled molecule     495,373,098   197             495,323,098      85,846,282    250           0
Chromosome 1  Assembled molecule     131,691,179   1               131,675,179      131,691,179   80            0
Chromosome 2  Assembled molecule     100,305,615   1               100,294,215      100,305,615   57            0
Chromosome 3  Assembled molecule     85,846,282    1               85,839,282       85,846,282    35            0
Chromosome 4  Assembled molecule     79,051,686    1               79,045,286       79,051,686    32            0
Chromosome 5  All                    78,562,708    2               78,554,908       78,373,126    39            0
Chromosome 5  Assembled molecule     78,373,126    1               78,365,326       78,373,126    39            0
Chromosome 5  Unlocalized scaffolds  189,582       1               189,582          189,582       0             0
Chromosome 6  Assembled molecule     6,563,250     1               6,562,050        6,563,250     6             0
unplaced      Assembled molecule     13,352,378    190             13,352,178       871,046       1             0
Molecule          Total Length  Scaffold Count  Ungapped Length  Scaffold N50  Spanned Gaps  Unspanned Gaps
Mitochondrion MT  15,972        1               15,972           15,972        0             0
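
The per-molecule figures can be cross-checked against the global statistics. The sketch below is an illustrative consistency check, not part of the record: it sums the rows of the Assembly statistics tables and derives the gap size implied by the difference between total and ungapped length. The uniform 200 bp per spanned gap is an inference from these numbers, not something the page states, and the variable names are hypothetical.

```python
# (total length, ungapped length, spanned gaps) per molecule,
# copied from the Assembly statistics tables.
rows = {
    "Chromosome 1": (131_691_179, 131_675_179, 80),
    "Chromosome 2": (100_305_615, 100_294_215, 57),
    "Chromosome 3": (85_846_282, 85_839_282, 35),
    "Chromosome 4": (79_051_686, 79_045_286, 32),
    "Chromosome 5": (78_562_708, 78_554_908, 39),  # includes 1 unlocalized scaffold
    "Chromosome 6": (6_563_250, 6_562_050, 6),
    "unplaced": (13_352_378, 13_352_178, 1),
    "Mitochondrion MT": (15_972, 15_972, 0),
}

total_length = sum(total for total, _, _ in rows.values())
ungapped_length = sum(ungapped for _, ungapped, _ in rows.values())
spanned_gaps = sum(gaps for _, _, gaps in rows.values())

# Implied bases per gap for every molecule that has gaps.
implied_gap_sizes = {
    name: (total - ungapped) / gaps
    for name, (total, ungapped, gaps) in rows.items()
    if gaps
}

# Each spanned gap splits one scaffold into one additional contig.
number_of_scaffolds = 198
implied_contigs = number_of_scaffolds + spanned_gaps

print(total_length, ungapped_length, spanned_gaps, implied_contigs)
print(set(implied_gap_sizes.values()))
```

The sums agree with the global statistics: 495,389,070 total bases, 495,339,070 ungapped bases, and 198 scaffolds plus 250 spanned gaps giving the reported 448 contigs.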