Skip navigation and go to main content

Genome Assemblies

The GRC has built tools to facilitate the curation of genome assemblies based on the sequence overlaps of long, high quality sequences (clones and PCR products, not short sequence reads). The GRC currently supports production of assemblies for human, mouse or zebrafish. If your assembly data fits this model and you are interested in using these tools, please contact us. Subscribe to the grc-announce email list to receive email notification for all GRC assembly updates.

Human

Human

The human genome assembly was produced as part of the Human Genome Project (HGP). The previous assembly (NCBI36) was the last one produced by the HGP and was described in 2004 (PMID: 15496913); this was the starting point for the GRC. The assembly is based largely on assembling overlapping clone sequences.

Human assembly information
Current major assembly GRCh38
Regions with alternate loci 178
Assembly N50 67,794,873 bp
Remaining gaps 875
Patch release version p14
Patches released FIX: 164, NOVEL: 90
Mouse

Mouse

The GRC has produced an updated assembly (GRCm38). This is an update of the last MGSC assembly (MGSCv37) which was described in 2009 (PMID: 19468303). The primary assembly is based on assembling overlapping BAC clones derived from the C57BL/6J strain and several loci have sequence available from other strains.

Mouse assembly information
Current major assembly GRCm39
Regions with alternate loci 0
Assembly N50 106,145,001 bp
Remaining gaps 347
Patch release version None
Patches released FIX: 0, NOVEL: 0
Zebrafish

Zebrafish

The zebrafish genome assembly was produced at the Wellcome Sanger Institute. The last assembly produced from the original project was Zv9 and was described in 2013 (PMID: 23594743). This assembly is the starting point for the GRC. The assembly is based on assembling overlapping BAC clones and integrating these sequences with the whole genome shotgun assembly.

Zebrafish assembly information
Current major assembly GRCz11
Regions with alternate loci 607
Assembly N50 7,379,053 bp
Remaining gaps 18,736
Rat

Rat

The mRatBN7 assembly was generated by the Darwin Tree of Life Project at the Wellcome Sanger Institute and is the starting point for the GRC. The previous rat assemblies were generated by the Rat Genome Sequence Consortium (PMID:15057822). mRatBN7 was derived from a male BN/NHsdMcwi rat (a direct descendent from the previously sequenced rat) and was generated using multiple technologies including PacBio long reads, 10X linked reads, Bionano maps and Arima Hi-C.

Rat assembly information
Current major assembly mRatBN7.2
Regions with alternate loci 0
Assembly N50 135,012,528 bp
Remaining gaps 581
Chicken

Chicken

The chicken genome assembly was produced by the International Chicken Genome Consortium. Gallus_gallus-5.0 is the latest assembly produced from this project. This assembly is the starting point for the GRC. It is comprised primarily of WGS contigs, into which overlapping genomic clones from the same DNA source have been integrated.

Chicken assembly information
Current major assembly GRCg6a
Regions with alternate loci 0
Assembly N50 20,785,086 bp
Remaining gaps 946