U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination
    • Showing Current items.

    Tcerg1 transcription elongation regulator 1 (CA150) [ Mus musculus (house mouse) ]

    Gene ID: 56070, updated on 18-Sep-2024

    Summary

    Official Symbol
    Tcerg1provided by MGI
    Official Full Name
    transcription elongation regulator 1 (CA150)provided by MGI
    Primary source
    MGI:MGI:1926421
    See related
    Ensembl:ENSMUSG00000024498 AllianceGenome:MGI:1926421
    Gene type
    protein coding
    RefSeq status
    VALIDATED
    Organism
    Mus musculus
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
    Also known as
    p144; FBP23; FBP28; Taf2s; ca150; CA150b; 2410022J09Rik; 2900090C16Rik
    Summary
    Enables proline-rich region binding activity. Predicted to be involved in RNA processing and regulation of transcription by RNA polymerase II. Predicted to act upstream of or within transcription, DNA-templated. Predicted to be located in nuclear matrix and nucleoplasm. Predicted to be active in nucleus. Is expressed in several structures, including branchial arch; limb bud; liver; metanephros; and pancreas epithelium. Orthologous to human TCERG1 (transcription elongation regulator 1). [provided by Alliance of Genome Resources, Apr 2022]
    Expression
    Ubiquitous expression in CNS E11.5 (RPKM 20.3), CNS E14 (RPKM 15.7) and 28 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    See Tcerg1 in Genome Data Viewer
    Location:
    18 B3; 18 22.66 cM
    Exon count:
    24
    Annotation release Status Assembly Chr Location
    RS_2024_02 current GRCm39 (GCF_000001635.27) 18 NC_000084.7 (42644369..42708850)
    108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 18 NC_000084.6 (42511304..42575785)

    Chromosome 18 - NC_000084.7Genomic Context describing neighboring genes Neighboring gene predicted gene, 31501 Neighboring gene STARR-seq mESC enhancer starr_44441 Neighboring gene ribosomal protein S2 pseudogene Neighboring gene CapStarr-seq enhancer MGSCv37_chr18:42670844-42671027 Neighboring gene STARR-seq mESC enhancer starr_44443 Neighboring gene STARR-positive B cell enhancer mm9_chr18:42717560-42717861 Neighboring gene STARR-positive B cell enhancer ABC_E400 Neighboring gene G protein-coupled receptor 151 Neighboring gene RIKEN cDNA C030004G16 gene

    Genomic regions, transcripts, and products

    Expression

    • Project title: Mouse ENCODE transcriptome data Mouse ENCODE transcriptome data
    • Description: RNA profiling data sets generated by the Mouse ENCODE project.
    • BioProject: PRJNA66167
    • Publication: PMID 25409824
    • Analysis date: n/a

    Bibliography

    Variation

    Alleles

    Alleles of this type are documented at Mouse Genome Informatics  (MGI)
    • Endonuclease-mediated (1) 

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by MGI

    Function Evidence Code Pubs
    enables DNA-binding transcription factor activity ISS
    Inferred from Sequence or Structural Similarity
    more info
    PubMed 
    enables RNA polymerase II-specific DNA-binding transcription factor binding ISO
    Inferred from Sequence Orthology
    more info
     
    enables RNA polymerase binding IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    enables RNA polymerase binding IEA
    Inferred from Electronic Annotation
    more info
     
    enables identical protein binding ISO
    Inferred from Sequence Orthology
    more info
     
    enables proline-rich region binding IDA
    Inferred from Direct Assay
    more info
    PubMed 
    enables transcription coregulator activity IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    enables transcription corepressor activity ISO
    Inferred from Sequence Orthology
    more info
     
    enables ubiquitin-like protein conjugating enzyme binding ISO
    Inferred from Sequence Orthology
    more info
     
    Process Evidence Code Pubs
    acts_upstream_of_or_within DNA-templated transcription ISS
    Inferred from Sequence or Structural Similarity
    more info
    PubMed 
    involved_in RNA splicing ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in mRNA processing ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in negative regulation of transcription by RNA polymerase II ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in positive regulation of transcription by RNA polymerase II ISO
    Inferred from Sequence Orthology
    more info
     
    involved_in regulation of DNA-templated transcription IEA
    Inferred from Electronic Annotation
    more info
     
    Component Evidence Code Pubs
    located_in nuclear matrix ISS
    Inferred from Sequence or Structural Similarity
    more info
    PubMed 
    located_in nucleoplasm IEA
    Inferred from Electronic Annotation
    more info
     
    located_in nucleoplasm ISO
    Inferred from Sequence Orthology
    more info
     
    is_active_in nucleus IBA
    Inferred from Biological aspect of Ancestor
    more info
     

    General protein information

    Preferred Names
    transcription elongation regulator 1
    Names
    TATA box binding protein (TBP)-associated factor, RNA polymerase II, S, 150kD
    TATA box-binding protein-associated factor 2S
    coactivator of 150 kD
    formin-binding protein 28
    transcription factor CA150
    transcription factor CA150b

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001039474.1NP_001034563.1  transcription elongation regulator 1 isoform 1

      See identical proteins and their annotated locations for NP_001034563.1

      Status: VALIDATED

      Description
      Transcript Variant: This variant (1) encodes the longest isoform (1).
      Source sequence(s)
      AK082750, AK168434, CJ052824
      Consensus CDS
      CCDS29214.1
      UniProtKB/Swiss-Prot
      Q61051, Q8C490, Q8CGF7, Q8CHT8, Q9R0R5
      UniProtKB/TrEMBL
      Q3TH57
      Related
      ENSMUSP00000158288.2, ENSMUST00000237602.2
      Conserved Domains (4) summary
      smart00441
      Location:9571012
      FF; Contains two conserved F residues
      smart00456
      Location:132164
      WW; Domain with 2 conserved Trp (W) residues
      cd00201
      Location:435464
      WW; Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other ...
      pfam01846
      Location:796845
      FF; FF domain
    2. NM_001289526.1NP_001276455.1  transcription elongation regulator 1 isoform 2

      Status: VALIDATED

      Description
      Transcript Variant: This variant (2) lacks an alternate in-frame exon in the 5' coding region and contains an alternate 3' exon, compared to variant 1. This results in a shorter isoform (2), compared to isoform 1.
      Source sequence(s)
      AC121603, AK076993, AK082750, AK150312, CJ052824
      UniProtKB/Swiss-Prot
      Q8CGF7
      Conserved Domains (4) summary
      smart00441
      Location:9531008
      FF; Contains two conserved F residues
      smart00456
      Location:132164
      WW; Domain with 2 conserved Trp (W) residues
      cd00201
      Location:414443
      WW; Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other ...
      pfam01846
      Location:792841
      FF; FF domain
    3. NM_001360881.1NP_001347810.1  transcription elongation regulator 1 isoform 3

      Status: VALIDATED

      Description
      Transcript Variant: This variant (3) lacks an alternate in-frame exon in the 5' coding region, compared to variant 1. This results in a shorter isoform (3), compared to isoform 1.
      Source sequence(s)
      AC121603
      Consensus CDS
      CCDS89230.1
      Related
      ENSMUSP00000025375.9, ENSMUST00000025375.15
      Conserved Domains (4) summary
      smart00441
      Location:9941058
      FF; Contains two conserved F residues
      smart00456
      Location:132164
      WW; Domain with 2 conserved Trp (W) residues
      cl00157
      Location:402999
      WW; Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other ...
      cl25496
      Location:258346
      Herpes_BLLF1; Herpes virus major outer envelope glycoprotein (BLLF1)

    RNA

    1. NR_110344.1 RNA Sequence

      Status: VALIDATED

      Description
      Transcript Variant: This variant (3) contains an alternate 3' exon, compared to variant 1. This variant is represented as non-coding because the use of the 5'-most expected translational start codon, as used in variant 1, renders the transcript a candidate for nonsense-mediated mRNA decay (NMD).
      Source sequence(s)
      AC121603, AK082750, AK168434, CJ052824

    RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCm39 C57BL/6J

    Genomic

    1. NC_000084.7 Reference GRCm39 C57BL/6J

      Range
      42644369..42708850
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    mRNA and Protein(s)

    1. XM_006526111.4XP_006526174.1  transcription elongation regulator 1 isoform X2

      Conserved Domains (4) summary
      smart00441
      Location:10321096
      FF; Contains two conserved F residues
      smart00456
      Location:132164
      WW; Domain with 2 conserved Trp (W) residues
      COG5104
      Location:4231037
      PRP40; Splicing factor [RNA processing and modification]
      pfam03154
      Location:258418
      Atrophin-1; Atrophin-1 family
    2. XM_006526110.4XP_006526173.1  transcription elongation regulator 1 isoform X1

      Conserved Domains (4) summary
      smart00441
      Location:10381102
      FF; Contains two conserved F residues
      smart00456
      Location:132164
      WW; Domain with 2 conserved Trp (W) residues
      COG5104
      Location:4291043
      PRP40; Splicing factor [RNA processing and modification]
      pfam05109
      Location:258352
      Herpes_BLLF1; Herpes virus major outer envelope glycoprotein (BLLF1)
    3. XM_006526112.5XP_006526175.1  transcription elongation regulator 1 isoform X3

      Conserved Domains (4) summary
      smart00441
      Location:10211085
      FF; Contains two conserved F residues
      smart00456
      Location:132164
      WW; Domain with 2 conserved Trp (W) residues
      COG5104
      Location:4291026
      PRP40; Splicing factor [RNA processing and modification]
      pfam05109
      Location:258352
      Herpes_BLLF1; Herpes virus major outer envelope glycoprotein (BLLF1)
    4. XM_006526113.4XP_006526176.1  transcription elongation regulator 1 isoform X4

      Conserved Domains (4) summary
      smart00441
      Location:10171081
      FF; Contains two conserved F residues
      smart00456
      Location:132164
      WW; Domain with 2 conserved Trp (W) residues
      COG5104
      Location:4081022
      PRP40; Splicing factor [RNA processing and modification]
      pfam03154
      Location:258403
      Atrophin-1; Atrophin-1 family
    5. XM_030250545.2XP_030106405.1  transcription elongation regulator 1 isoform X5

      Conserved Domains (4) summary
      smart00441
      Location:10001064
      FF; Contains two conserved F residues
      smart00456
      Location:132164
      WW; Domain with 2 conserved Trp (W) residues
      COG5104
      Location:4081005
      PRP40; Splicing factor [RNA processing and modification]
      pfam03154
      Location:258403
      Atrophin-1; Atrophin-1 family

    RNA

    1. XR_004939874.1 RNA Sequence

    2. XR_001782385.3 RNA Sequence

    3. XR_001782384.3 RNA Sequence

    4. XR_004939873.1 RNA Sequence

    5. XR_386004.5 RNA Sequence

    6. XR_001782386.3 RNA Sequence

    7. XR_001782383.3 RNA Sequence

    Suppressed Reference Sequence(s)

    The following Reference Sequences have been suppressed. Explain

    1. NM_019512.1: Suppressed sequence

      Description
      NM_019512.1: This RefSeq record was removed by NCBI staff. Contact info@ncbi.nlm.nih.gov for further information.