
Genome Assemblies
The GRC has built tools to facilitate the curation of genome assemblies based on the sequence overlaps of long, high quality sequences (Clones and PCR products, not short sequence reads). The GRC currently supports production of assemblies for human, mouse or zebrafish. If your assembly data fits this model and you are interested in using these tools please contact us using the 'Contact Us' page.
Human
The human genome assembly was produced as part of the Human Genome Project (HGP). The previous assembly (NCBI36) was the last one produced by the HGP and was described in 2004(PMID: 15496913); this was the starting point for the GRC. The assembly is based largely on assembling overlapping clone sequences.
| Current Major Assembly | GRCh37 |
|---|---|
| Regions with Alternate Loci | 3 |
| Assembly N50 | 46,395,641 bp |
| Remaining Gaps | 357 |
| Patch Release version | p8 |
| Patches Released | Fix: 69; Novel: 71 |
More Human assembly statistics...
Mouse
The GRC has produced an updated assembly (GRCm38). This is an update of the last MGSC assembly (MGSCv37) which was described in 2004(PMID: 19468303). The primary assembly is based on assembling overlapping BAC clones derived from the C57BL/6J strain and several loci have sequence available from other strains.
| Current Assembly | GRCm38 |
|---|---|
| Regions with Alternate Loci | 70 |
| Assembly N50 | 54,517,951 bp |
| Remaining Gaps | 437 |
More Mouse assembly statistics...
Zebrafish
The zebrafish genome assembly was produced at the Sanger Institute. The last assembly produced from the original project was Zv9 and will be described in 2010. This assembly is the starting point for the GRC. The assembly is based on assembling overlapping BAC clones and integrating these sequences with the whole genome shotgun assembly.
| Current Assembly | Zv9 |
|---|---|
| Regions with Alternate Loci | 0 |
| Assembly N50 | 1,551,602 |
| Remaining Gaps | 26,921 |
More Zebrafish assembly statistics...


