U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Cggbp1 CGG triplet repeat binding protein 1 [ Mus musculus (house mouse) ]

Gene ID: 106143, updated on 4-Dec-2022

Summary

Official Symbol
Cggbp1provided by MGI
Official Full Name
CGG triplet repeat binding protein 1provided by MGI
Primary source
MGI:MGI:2146370
See related
Ensembl:ENSMUSG00000054604 AllianceGenome:MGI:2146370
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Mus musculus
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Summary
Predicted to enable DNA-binding transcription factor binding activity; double-stranded DNA binding activity; and identical protein binding activity. Predicted to be involved in negative regulation of transcription by RNA polymerase II and regulation of gene expression, epigenetic. Predicted to be located in nucleoplasm. Predicted to be active in nucleus. Is expressed in embryo; inner ear; and otocyst. Orthologous to human CGGBP1 (CGG triplet repeat binding protein 1). [provided by Alliance of Genome Resources, Apr 2022]
Expression
Ubiquitous expression in CNS E11.5 (RPKM 17.5), bladder adult (RPKM 15.6) and 28 other tissues See more
Orthologs
NEW
Try the new Gene table
Try the new Transcript table

Genomic context

See Cggbp1 in Genome Data Viewer
Location:
16; 16 C1.3
Exon count:
3
Annotation release Status Assembly Chr Location
109 current GRCm39 (GCF_000001635.27) 16 NC_000082.7 (64672364..64679854)
108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 16 NC_000082.6 (64852001..64859491)

Chromosome 16 - NC_000082.7Genomic Context describing neighboring genes Neighboring gene RIKEN cDNA 4930453N24 gene Neighboring gene zinc finger protein 654 Neighboring gene predicted gene, 33971 Neighboring gene glyceraldehyde-3-phosphate dehydrogenase pseudogene

Genomic regions, transcripts, and products

Expression

  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General gene information

Markers

Homology

Gene Ontology Provided by MGI

Function Evidence Code Pubs
enables DNA binding IEA
Inferred from Electronic Annotation
more info
 
enables DNA-binding transcription factor binding ISO
Inferred from Sequence Orthology
more info
 
enables double-stranded DNA binding ISO
Inferred from Sequence Orthology
more info
 
enables identical protein binding ISO
Inferred from Sequence Orthology
more info
 
Process Evidence Code Pubs
involved_in epigenetic regulation of gene expression ISO
Inferred from Sequence Orthology
more info
 
involved_in negative regulation of transcription by RNA polymerase II ISO
Inferred from Sequence Orthology
more info
 
involved_in regulation of gene expression IBA
Inferred from Biological aspect of Ancestor
more info
PubMed 
involved_in regulation of transcription by RNA polymerase II IEA
Inferred from Electronic Annotation
more info
 
Component Evidence Code Pubs
located_in nucleoplasm ISO
Inferred from Sequence Orthology
more info
 
is_active_in nucleus IBA
Inferred from Biological aspect of Ancestor
more info
PubMed 
located_in nucleus ISO
Inferred from Sequence Orthology
more info
 

General protein information

Preferred Names
CGG triplet repeat-binding protein 1
Names
20 kDa CGG-binding protein
CGG-binding protein 1
p20-CGGBP DNA-binding protein

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001357416.1NP_001344345.1  CGG triplet repeat-binding protein 1

    Status: VALIDATED

    Source sequence(s)
    CT030641
    Consensus CDS
    CCDS28266.1
    UniProtKB/Swiss-Prot
    Q8K2K1
  2. NM_178647.3NP_848762.1  CGG triplet repeat-binding protein 1

    See identical proteins and their annotated locations for NP_848762.1

    Status: VALIDATED

    Source sequence(s)
    CT030641
    Consensus CDS
    CCDS28266.1
    UniProtKB/Swiss-Prot
    Q8BHG9, Q8K2K1
    Related
    ENSMUSP00000065845.8, ENSMUST00000067744.8

RefSeqs of Annotated Genomes: Mus musculus Annotation Release 109 details...Open this link in a new tab

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCm39 C57BL/6J

Genomic

  1. NC_000082.7 Reference GRCm39 C57BL/6J

    Range
    64672364..64679854
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)