U.S. flag

An official website of the United States government


Send to:

Choose Destination

Sohlh2 spermatogenesis and oogenesis specific basic helix-loop-helix 2 [ Mus musculus (house mouse) ]

Gene ID: 74434, updated on 12-May-2024


Official Symbol
Sohlh2provided by MGI
Official Full Name
spermatogenesis and oogenesis specific basic helix-loop-helix 2provided by MGI
Primary source
See related
Ensembl:ENSMUSG00000027794 AllianceGenome:MGI:1921684
Gene type
protein coding
RefSeq status
Mus musculus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae; Murinae; Mus; Mus
Also known as
Sosf2; 4933406N12Rik
Enables DNA-binding transcription activator activity, RNA polymerase II-specific; RNA polymerase II transcription regulatory region sequence-specific DNA binding activity; and protein dimerization activity. Involved in oocyte differentiation; positive regulation of transcription by RNA polymerase II; and spermatogenesis. Acts upstream of or within primary ovarian follicle growth and regulation of gene expression. Predicted to be located in cytoplasm. Predicted to be active in nucleus. Is expressed in early conceptus; eye; genitourinary system; heart; and liver. Orthologous to several human genes including CCDC169-SOHLH2 (CCDC169-SOHLH2 readthrough). [provided by Alliance of Genome Resources, Apr 2022]
Biased expression in testis adult (RPKM 4.3), placenta adult (RPKM 0.7) and 1 other tissue See more
Try the new Gene table
Try the new Transcript table

Genomic context

See Sohlh2 in Genome Data Viewer
3 C; 3 26.53 cM
Exon count:
Annotation release Status Assembly Chr Location
RS_2024_02 current GRCm39 (GCF_000001635.27) 3 NC_000069.7 (55089465..55117378)
108.20200622 previous assembly GRCm38.p6 (GCF_000001635.26) 3 NC_000069.6 (55182044..55209957)

Chromosome 3 - NC_000069.7Genomic Context describing neighboring genes Neighboring gene spartin Neighboring gene coiled-coil domain containing 169 Neighboring gene STARR-seq mESC enhancer starr_07696 Neighboring gene doublecortin-like kinase 1 Neighboring gene predicted gene, 25132 Neighboring gene predicted gene 9831

Genomic regions, transcripts, and products


  • Project title: Mouse ENCODE transcriptome data
  • Description: RNA profiling data sets generated by the Mouse ENCODE project.
  • BioProject: PRJNA66167
  • Publication: PMID 25409824
  • Analysis date: n/a


GeneRIFs: Gene References Into Functions

What's a GeneRIF?



Alleles of this type are documented at Mouse Genome Informatics  (MGI)
  • Endonuclease-mediated (3) 
  • Targeted (3)  1 citation

General gene information


Gene Ontology Provided by MGI

Component Evidence Code Pubs
located_in cytoplasm IEA
Inferred from Electronic Annotation
more info
is_active_in nucleus IBA
Inferred from Biological aspect of Ancestor
more info

General protein information

Preferred Names
spermatogenesis- and oogenesis-specific basic helix-loop-helix-containing protein 2

NCBI Reference Sequences (RefSeq)

NEW Try the new Transcript table

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_028937.3NP_083213.2  spermatogenesis- and oogenesis-specific basic helix-loop-helix-containing protein 2

    See identical proteins and their annotated locations for NP_083213.2

    Status: VALIDATED

    Source sequence(s)
    AI413378, DQ086115
    Consensus CDS
    Q4JI40, Q9D489
    ENSMUSP00000029369.5, ENSMUST00000029369.5
    Conserved Domains (1) summary
    HLH; Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, ...

RefSeqs of Annotated Genomes: GCF_000001635.27-RS_2024_02

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCm39 C57BL/6J


  1. NC_000069.7 Reference GRCm39 C57BL/6J

    GenBank, FASTA, Sequence Viewer (Graphics)