U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Download Assembly



daLinVulg1.1

Organism name:
Linaria vulgaris (common toadflax)
BioSample:
SAMEA7522288
BioProject:
PRJEB59030
Submitter:
WELLCOME SANGER INSTITUTE
Date:
2023/01/28
Assembly type:
haploid (principal pseudohaplotype of diploid)
Assembly level:
Chromosome
Genome representation:
full
RefSeq category:
representative genome
GenBank assembly accession:
GCA_948329865.1 (latest)
RefSeq assembly accession:
n/a
RefSeq assembly and GenBank assembly identical:
n/a
WGS Project:
CAOJCB01
Assembly method:
various
Genome coverage:
28x
Sequencing technology:
PacBio,Arima2
Linked assembly:
GCA_948329855.1 (alternate pseudohaplotype of diploid)

IDs: 15661271 [UID] 40236238 [GenBank]

See Genome Information for Linaria vulgaris

There are 2 assemblies for this organism

See more

History (Show revision history)

Comment

The assembly daLinVulg1.1 is based on 28x PacBio data and Arima2 Hi-C data generated by the Darwin Tree of Life Project
(https://www.darwintreeoflife.org/). The assembly process included the following sequence of steps: initial PacBio assembly generation with Hifiasm, retained haplotig separation ... with purge_dups, and Hi-C based scaffolding with YaHS. The mitochondrial and chloroplast genomes were assembled using MBG from PacBio HiFi reads mapping to related genomes. A representative circular sequence was selected for each from the graph based on read coverage. Finally, the primary assembly was analysed and manually improved using gEVAL. Chromosome-scale scaffolds confirmed by the Hi-C data have been named in order of size.  more

Global statistics

Total sequence length760,456,579
Total ungapped length760,438,379
Gaps between scaffolds0
Number of scaffolds41
Scaffold N50127,462,361
Scaffold L503
Number of contigs132
Contig N5012,652,593
Contig L5020
Total number of chromosomes and plasmids9
Number of component sequences (WGS or clone)41

Supplemental Content

Recent activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...

Global assembly definition

Download the full sequence report
Click on the table row to see sequence details in the table to the right
Assembly Unit Name
Primary Assembly
non-nuclear
Assembly Unit: Primary Assembly (GCA_948329864.1)
Molecule nameGenBank sequenceRefSeq sequenceUnlocalized
sequences count
Chromosome 1OX415249.1n/an/a7
Chromosome 2OX415250.1n/an/a0
Chromosome 3OX415251.1n/an/a4
Chromosome 4OX415252.1n/an/a0
Chromosome 5OX415253.1n/an/a0
Chromosome 6OX415254.1n/an/a0
unplacedn/an/an/a21

Assembly statistics

MoleculeSequence RoleTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
AllAssembled molecule759,825,09838759,806,898127,462,361910
Chromosome 1AllAssembled moleculeUnlocalized scaffolds154,061,409153,338,251723,158817154,058,409153,335,251723,158153,338,251153,338,251209,51815150000
Chromosome 2Assembled molecule133,811,3861133,808,586133,811,386140
Chromosome 3AllAssembled moleculeUnlocalized scaffolds128,002,239127,462,361539,878514127,999,639127,459,761539,878127,462,361127,462,361114,00013130000
Chromosome 4Assembled molecule120,922,4961120,919,096120,922,496170
Chromosome 5Assembled molecule110,685,2381110,682,638110,685,238130
Chromosome 6Assembled molecule110,092,4591110,089,259110,092,459160
unplacedAssembled molecule2,249,871212,249,271265,13530
MoleculeTotal
Length
Scaffold
Count
Ungapped
Length
Scaffold
N50
Spanned
Gaps
Unspanned
Gaps
All631,4813631,481330,80200
Mitochondrion MT1330,8021330,802330,80200
Mitochondrion MT2143,9711143,971143,97100
Chloroplast Pltd156,7081156,708156,70800