U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Cenpk centromere protein K [ Mus musculus (house mouse) ]

Gene ID: 60411, updated on 26-Sep-2022

Summary

Official Symbol
Cenpkprovided by MGI
Official Full Name
centromere protein Kprovided by MGI
Primary source
MGI:MGI:1926210
See related
Ensembl:ENSMUSG00000021714 AllianceGenome:MGI:1926210
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as
Solt; Solzt; Cenp-K; B130045K24Rik; C530004N04Rik
Summary
Acts upstream of or within positive regulation of transcription by RNA polymerase II. Located in nucleus. Is expressed in several structures, including central nervous system and neural retina. Orthologous to human CENPK (centromere protein K). [provided by Alliance of Genome Resources, Apr 2022]
Expression
Biased expression in liver E14 (RPKM 9.5), CNS E11.5 (RPKM 6.6) and 9 other tissues See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See Cenpk in Genome Data Viewer
Location:
13; 13 D1
Exon count:
12
Annotation release Status Assembly Chr Location
109 current GRCm39 (GCF_000001635.27) 13 NC_000079.7 (104365119..104386130)
108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 13 NC_000079.6 (104228611..104249622)

Chromosome 13 - NC_000079.7Genomic Context describing neighboring genes Neighboring gene trafficking protein particle complex 13 Neighboring gene tripartite motif-containing 23 Neighboring gene shieldin complex subunit 3 Neighboring gene peptidylprolyl isomerase domain and WD repeat containing 1 Neighboring gene a disintegrin-like and metallopeptidase (reprolysin type) with thrombospondin type 1 motif, 6 Neighboring gene predicted gene 8680 Neighboring gene predicted gene, 53810 Neighboring gene CWC27 spliceosome-associated protein

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

Variation

Alleles

Alleles of this type are documented at Mouse Genome Informatics  (MGI)
  • Endonuclease-mediated (2) 

Pathways from PubChem

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General gene information

Markers

Homology

Gene Ontology Provided by MGI

Function Evidence Code Pubs
enables protein binding IPI
Inferred from Physical Interaction
more info
PubMed 
Process Evidence Code Pubs
involved_in kinetochore assembly IEA
Inferred from Electronic Annotation
more info
 
involved_in mitotic sister chromatid segregation IBA
Inferred from Biological aspect of Ancestor
more info
PubMed 
acts_upstream_of_or_within positive regulation of transcription by RNA polymerase II IDA
Inferred from Direct Assay
more info
PubMed 
Component Evidence Code Pubs
located_in chromosome IEA
Inferred from Electronic Annotation
more info
 
located_in chromosome, centromeric region IEA
Inferred from Electronic Annotation
more info
 
located_in kinetochore IEA
Inferred from Electronic Annotation
more info
 
located_in nucleus IDA
Inferred from Direct Assay
more info
PubMed 

General protein information

Preferred Names
centromere protein K
Names
SoxLZ/Sox6 leucine zipper binding protein in testis

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001377093.1NP_001364022.1  centromere protein K isoform 3

    Status: VALIDATED

    Source sequence(s)
    AC154216
    UniProtKB/Swiss-Prot
    Q8C469
    Conserved Domains (1) summary
    pfam11802
    Location:12271
    CENP-K; Centromere-associated protein K
  2. NM_001377094.1NP_001364023.1  centromere protein K isoform 3

    Status: VALIDATED

    Source sequence(s)
    AC154216
    UniProtKB/Swiss-Prot
    Q8C469
    Conserved Domains (1) summary
    pfam11802
    Location:12271
    CENP-K; Centromere-associated protein K
  3. NM_001377095.1NP_001364024.1  centromere protein K isoform 3

    Status: VALIDATED

    Source sequence(s)
    AC154216
    UniProtKB/Swiss-Prot
    Q8C469
    Conserved Domains (1) summary
    pfam11802
    Location:12271
    CENP-K; Centromere-associated protein K
  4. NM_021790.2NP_068562.1  centromere protein K isoform 1

    See identical proteins and their annotated locations for NP_068562.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) uses alternate 5' exon structure, differs in the 5' UTR, and includes an alternate 3' terminal exon compared to variant 2. This transcript also initiates translation at an alternate start codon, resulting in isoform 1, which is longer and has distinct N- and C-termini compared to isoform 2.
    Source sequence(s)
    AB043687, AC154216, AV367397
    Consensus CDS
    CCDS26749.1
    UniProtKB/Swiss-Prot
    Q9ESN5
    UniProtKB/TrEMBL
    A0A0R4J037, A0A8C6GI50
    Related
    ENSMUSP00000022227.7, ENSMUST00000022227.8
    Conserved Domains (1) summary
    pfam11802
    Location:47306
    CENP-K; Centromere-associated protein K
  5. NM_181061.6NP_851406.1  centromere protein K isoform 2

    See identical proteins and their annotated locations for NP_851406.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (2) represents the longest transcript and encodes the shorter isoform (2).
    Source sequence(s)
    AC154216
    Consensus CDS
    CCDS26750.1
    UniProtKB/Swiss-Prot
    Q9ESN5
    Related
    ENSMUSP00000070910.4, ENSMUST00000070761.10
    Conserved Domains (1) summary
    pfam11802
    Location:12220
    CENP-K; Centromere-associated protein K

RNA

  1. NR_075088.2 RNA Sequence

    Status: VALIDATED

    Description
    Transcript Variant: This variant (3) uses an alternate splice site in an internal exon and includes an alternate 3' terminal exon, compared to variant 2. This variant is represented as non-coding due to the presence of an upstream ORF that is predicted to interfere with translation of the longest ORF; translation of the upstream ORF renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
    Source sequence(s)
    AC154216
    Related
    ENSMUST00000224500.2

RefSeqs of Annotated Genomes: Mus musculus Annotation Release 109 details...Open this link in a new tab

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000079.7 Reference GRCm39 C57BL/6J

    Range
    104365119..104386130
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)