CENPU centromere protein U [ Homo sapiens (human) ]

Gene ID: 79682, updated on 1-Jun-2020

Summary

Official Symbol
CENPU (provided by HGNC)
Official Full Name
centromere protein U (provided by HGNC)
Primary source
HGNC:HGNC:21348
See related
Ensembl:ENSG00000151725; MIM:611511
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Also known as
KLIP1; PBIP1; CENP50; MLF1IP; CENPU50
Summary
The centromere is a specialized chromatin domain, present throughout the cell cycle, that acts as a platform on which the transient assembly of the kinetochore occurs during mitosis. All active centromeres are characterized by the presence of long arrays of nucleosomes in which CENPA (MIM 117139) replaces histone H3 (see MIM 601128). MLF1IP, or CENPU, is an additional factor required for centromere assembly (Foltz et al., 2006 [PubMed 16622419]). [supplied by OMIM, Mar 2008]
Expression
Biased expression in testis (RPKM 44.2), bone marrow (RPKM 27.7) and 10 other tissues

Genomic context

Location: 4q35.1
Exon count: 14
Annotation release | Status | Assembly | Chr | Location
109.20200522 | current | GRCh38.p13 (GCF_000001405.39) | 4 | NC_000004.12 (184694085..184734096, complement)
105 | previous assembly | GRCh37.p13 (GCF_000001405.25) | 4 | NC_000004.11 (185615219..185655286, complement)
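
The Location column packs a sequence accession, a 1-based inclusive coordinate range, and an optional strand flag into one string. As a minimal sketch, assuming the strings always follow the pattern seen in the two rows above, they could be split into fields as follows. The function name `parse_location` and the returned dictionary keys are illustrative choices, not part of any NCBI tool.

```python
import re

# Pattern for location strings of the form shown in the table, e.g.
# "NC_000004.12 (184694085..184734096, complement)".
# Assumption: the ", complement" suffix is the only strand marker used.
LOCATION_RE = re.compile(
    r"(?P<accession>[A-Z]{2}_\d+\.\d+)\s*"
    r"\((?P<start>\d+)\.\.(?P<end>\d+)(?:,\s*(?P<strand>complement))?\)"
)


def parse_location(text):
    """Split a location string into accession, coordinates, strand, and length."""
    match = LOCATION_RE.search(text)
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "accession": match["accession"],
        "start": start,
        "end": end,
        # "complement" means the gene lies on the minus strand.
        "strand": "-" if match["strand"] else "+",
        # Coordinates are inclusive on both ends, hence the +1.
        "length": end - start + 1,
    }


grch38 = parse_location("NC_000004.12 (184694085..184734096, complement)")
grch37 = parse_location("NC_000004.11 (185615219..185655286, complement)")
print(grch38["length"], grch37["length"])
```

The two annotated spans differ slightly in length (40,012 bp on GRCh38.p13 versus 40,068 bp on GRCh37.p13), which is a reminder that coordinates are specific to an assembly and annotation release and cannot be compared directly across builds.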

Chromosome 4 - NC_000004.12
Genomic context describing neighboring genes:

  • long intergenic non-protein coding RNA 2365
  • caspase 3
  • primase and DNA directed polymerase
  • acyl-CoA synthetase long chain family member 1
  • proteoglycan 3, pro eosinophil major basic protein 2 pseudogene
  • uncharacterized LOC105377587
  • MIR3945 host gene

Genomic regions, transcripts, and products

Expression

  • Project title: HPA RNA-seq normal tissues
  • Description: RNA-seq was performed on tissue samples from 95 human individuals, representing 27 different tissues, to determine the tissue specificity of all protein-coding genes
  • BioProject: PRJEB4337
  • Publication: PMID 24309898
  • Analysis date: Wed Apr 4 07:08:55 2018

Bibliography

GeneRIFs: Gene References Into Functions

Pathways from PubChem

Interactions

General gene information

Markers

Homology

Clone Names

  • FLJ23468

Gene Ontology (provided by GOA)

Function | Evidence Code | Pubs
protein binding | IPI (Inferred from Physical Interaction) | PubMed

Process | Evidence Code | Pubs
CENP-A containing nucleosome assembly | TAS (Traceable Author Statement) |
chordate embryonic development | IEA (Inferred from Electronic Annotation) |
viral process | IEA (Inferred from Electronic Annotation) |

Component | Evidence Code | Pubs
centriolar satellite | IDA (Inferred from Direct Assay) |
condensed chromosome kinetochore | IEA (Inferred from Electronic Annotation) |
cytosol | TAS (Traceable Author Statement) |
nucleoplasm | IDA (Inferred from Direct Assay) |
nucleoplasm | TAS (Traceable Author Statement) |
nucleus | IBA (Inferred from Biological aspect of Ancestor) | PubMed

General protein information

Preferred Names
centromere protein U
Names
KSHV latent nuclear antigen interacting protein 1
MLF1 interacting protein
centromere protein of 50 kDa
interphase centromere complex protein 24
polo-box-interacting protein 1

NCBI Reference Sequences (RefSeq)

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds.

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
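
The version check described above amounts to comparing the number after the dot in two accession.version identifiers for the same record. A minimal sketch, assuming identifiers of the form `NM_024629.4`; the helper names are illustrative, and the second identifier in the usage line is a hypothetical value, since this record does not list the version used in the genome build.

```python
def split_accession(identifier):
    """Split 'NM_024629.4' into ('NM_024629', 4)."""
    base, sep, version = identifier.rpartition(".")
    if not sep or not base or not version.isdigit():
        raise ValueError(f"not an accession.version identifier: {identifier!r}")
    return base, int(version)


def version_mismatch(curated, annotated):
    """Return True when two identifiers name the same record at different versions."""
    curated_base, curated_version = split_accession(curated)
    annotated_base, annotated_version = split_accession(annotated)
    if curated_base != annotated_base:
        raise ValueError("identifiers refer to different records")
    return curated_version != annotated_version


# "NM_024629.3" is a hypothetical build version used only for illustration.
print(version_mismatch("NM_024629.4", "NM_024629.3"))
```

Raising an error when the base accessions differ keeps the check from silently comparing unrelated records.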

mRNA and Protein(s)

  1. NM_024629.4 → NP_078905.2  centromere protein U

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) represents the protein-coding transcript.
    Source sequence(s): AF469667, AF516710
    Consensus CDS: CCDS3838.1
    UniProtKB/Swiss-Prot: Q71F23
    Related: ENSP00000281453.5, ENST00000281453.10
    Conserved Domains (2)
      pfam13097, Location: 150-320, CENP-U; CENP-A nucleosome associated complex (NAC) subunit
      cl25732, Location: 245-417, SMC_N; RecF/RecN/SMC N terminal domain

RNA

  1. NR_104593.2 RNA Sequence

    Status: VALIDATED

    Description
    Transcript Variant: This variant (2) lacks an alternate internal exon compared to variant 1. This variant is represented as non-coding because the use of the 5'-most expected translational start codon, as used in variant 1, renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
    Source sequence(s): AC079257, BC131556, BC141854

RefSeqs of Annotated Genomes: Homo sapiens Updated Annotation Release 109.

The following sections contain reference sequences that belong to a specific genome build.

Reference GRCh38.p13 Primary Assembly

Genomic

  1. NC_000004.12 Reference GRCh38.p13 Primary Assembly

    Range: 184694085..184734096 (complement)

mRNA and Protein(s)

  1. XM_005263218.4 → XP_005263275.2  centromere protein U isoform X1

    Conserved Domains (1)
      pfam13097, Location: 180-350, CENP-U; CENP-A nucleosome associated complex (NAC) subunit