Organism name: Human gammaherpesvirus 4 (Epstein-Barr virus)
Infraspecific name: Strain: B95-8
Assembly type:
Assembly level: Complete Genome
Genome representation:
Relation to type material: ICTV species exemplar
GenBank assembly accession: GCA_002402265.1 (latest)
RefSeq assembly accession: GCF_002402265.1 (latest)
RefSeq assembly and GenBank assembly identical:

IDs: 5139508 [UID] 5139508 [GenBank] 5139518 [RefSeq]

See Genome Information for Lymphocryptovirus humangamma4

There are 586 assemblies for this organism


CONSTRUCTION: This sequence was assembled from data for the B95-8 (V01555) and Raji (M35547) strains, with corrections. The number of IR1 (W) repeats in V01555 has been reduced from 11.6 to a more typical 7.6, and the missing B95-8 ... sequence has been restored to give a sequence more representative of wild-type EBV.
NUMBERING: Like the modified B95-8 sequence in V01555, this sequence starts 1 bp to the left of the EcoRI site separating EcoRI Dhet from EcoRI I (i.e. the first A of AGAATTC).
TRANSCRIPTION: Long-range splicing of the EBNA and BART genes generates multiple, complex mRNAs. A single mRNA structure is shown for each gene, but alternatives may initiate at other promoters and may involve various exon combinations. TATA and polyA signals are shown as non-experimental where predicted or as experimental where supporting data are available. Predictions of TATA or polyA signals are not made for some genes.
NOMENCLATURE: The original gene nomenclature has been retained. Genes presumably inherited from the common ancestor of alpha-, beta- and gammaherpesviruses (core genes), from the common ancestor of beta- and gammaherpesviruses (betagamma genes), or from the common ancestor of gammaherpesviruses (gamma genes) are indicated. A standard protein nomenclature has been applied so that orthologs have the same name in all herpesviruses.
CONFIDENCE: Evidence for encoded protein functions is more convincing for some ORFs than others, and rests on a combination of factors, including evolutionary conservation, expression and functional data. The most questionable ORFs are annotated as such.
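
The NUMBERING note above can be illustrated with a minimal sketch. It assumes only what the note states: position 1 is the first A of AGAATTC, so the EcoRI recognition site GAATTC begins at position 2. The function names and the toy sequence are hypothetical and are not taken from the record.

```python
ECORI_SITE = "GAATTC"  # EcoRI recognition sequence


def starts_at_numbering_origin(seq: str) -> bool:
    """Return True if seq begins 1 bp to the left of an EcoRI site (AGAATTC)."""
    return seq.upper().startswith("A" + ECORI_SITE)


def ecori_site_positions(seq: str) -> list[int]:
    """Return the 1-based start positions of every GAATTC occurrence in seq."""
    seq = seq.upper()
    positions = []
    i = seq.find(ECORI_SITE)
    while i != -1:
        positions.append(i + 1)  # convert 0-based index to 1-based coordinate
        i = seq.find(ECORI_SITE, i + 1)
    return positions


# Toy sequence for illustration only; it is NOT part of the EBV genome record.
toy = "AGAATTCGGCTAGAATTCAA"
print(starts_at_numbering_origin(toy))
print(ecori_site_positions(toy))
```

Under this convention, any sequence that follows the same numbering should report its first EcoRI site at position 2.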

Global statistics

Total sequence length: 171,823
Total ungapped length: 171,823
Total number of viral segments: 1

Global assembly definition

Assembly Unit Name: Primary Assembly
Assembly Unit: Primary Assembly (GCF_000255725.1)
Molecule name    GenBank sequence    Relation    RefSeq sequence
Segment          AJ507799.2          =           NC_007605.1

Assembly statistics

Segment 171,823
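
As a hedged sketch of how the reported length could be checked against a downloaded copy of the segment: the helper below builds an NCBI E-utilities `efetch` URL for the RefSeq accession listed above and counts the bases in FASTA text. The function names are hypothetical, the code performs no network access itself, and retrieving the record this way is an assumption about workflow, not something the record prescribes.

```python
from urllib.parse import urlencode

EFETCH_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/efetch.fcgi"
REPORTED_LENGTH = 171_823  # total sequence length stated in this record


def efetch_fasta_url(accession: str) -> str:
    """Build an efetch URL that requests a nucleotide record as FASTA text."""
    query = urlencode(
        {"db": "nuccore", "id": accession, "rettype": "fasta", "retmode": "text"}
    )
    return f"{EFETCH_BASE}?{query}"


def fasta_sequence_length(fasta_text: str) -> int:
    """Count sequence characters in FASTA text, skipping '>' header lines."""
    return sum(
        len(line.strip())
        for line in fasta_text.splitlines()
        if not line.startswith(">")
    )


print(efetch_fasta_url("NC_007605.1"))

# Toy FASTA text for illustration only; it is not the EBV sequence.
toy_fasta = ">toy\nACGT\nAC\n"
print(fasta_sequence_length(toy_fasta))
```

After downloading the FASTA text from the printed URL, comparing `fasta_sequence_length(text)` with `REPORTED_LENGTH` would confirm that the local copy matches the length listed here.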