    SEMG1 semenogelin I [ Homo sapiens (human) ]

    Gene ID: 6406, updated on 6-Aug-2017
    Official Symbol
    SEMG1 (provided by HGNC)
    Official Full Name
    semenogelin I (provided by HGNC)
    Primary source
    See related
    Ensembl:ENSG00000124233; MIM:182140; Vega:OTTHUMG00000032565
    Gene type
    protein coding
    RefSeq status
    Organism
    Homo sapiens
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
    Also known as
    SGI; SEMG; CT103; dJ172H20.2
    Summary
    The protein encoded by this gene is the predominant protein in semen. The encoded secreted protein is involved in the formation of a gel matrix that encases ejaculated spermatozoa. This preproprotein is proteolytically processed by the prostate-specific antigen (PSA) protease to generate multiple peptide products that exhibit distinct functions. One of these peptides, SgI-29, is an antimicrobial peptide with antibacterial activity. This proteolysis process also breaks down the gel matrix and allows the spermatozoa to move more freely. This gene and another similar semenogelin gene are present in a gene cluster on chromosome 20. [provided by RefSeq, Feb 2016]
    Exon count:
    Annotation release  Status             Assembly                       Chr  Location
    108                 current            GRCh38.p7 (GCF_000001405.33)   20   NC_000020.11 (45206964..45209773)
    105                 previous assembly  GRCh37.p13 (GCF_000001405.25)  20   NC_000020.10 (43835605..43838414)
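The two locations above can be compared directly. The following is a minimal sketch, assuming the `start..end` coordinates are 1-based and inclusive (the usual GenBank-style convention); the helper names `parse_location` and `span_length` are hypothetical and not part of any NCBI tool.

```python
import re

def parse_location(text):
    """Split 'ACCESSION.VERSION (start..end)' into (accession, start, end)."""
    m = re.fullmatch(r"\s*(\S+)\s*\((\d+)\.\.(\d+)\)\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1

# Location strings copied from the table above.
acc38, start38, end38 = parse_location("NC_000020.11 (45206964..45209773)")
acc37, start37, end37 = parse_location("NC_000020.10 (43835605..43838414)")

len38 = span_length(start38, end38)
len37 = span_length(start37, end37)
shift = start38 - start37  # offset of the gene start between the two assemblies

print(acc38, len38)
print(acc37, len37)
print("start offset between assemblies:", shift)
```

Under that assumption the gene spans the same number of bases in both assemblies, and only its position on the chromosome differs.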

    Chromosome 20 - NC_000020.11
    Genomic context, neighboring genes: uncharacterized LOC101929866; peptidase inhibitor 3; uncharacterized LOC105372630; semenogelin II; uncharacterized LOC107985424

    • Project title: HPA RNA-seq normal tissues
    • Description: RNA-seq was performed on tissue samples from 95 human individuals representing 27 different tissues in order to determine the tissue specificity of all protein-coding genes
    • BioProject: PRJEB4337
    • Publication: PMID 24309898
    • Analysis date: Wed Jun 15 11:32:44 2016

    GeneRIFs: Gene References Into Functions

    Protein interactions

    Protein    Gene  Interaction
    Pr55(Gag)  gag   Cellular biotinylated semenogelin I protein (SEMG1) is incorporated into HIV-1 Gag virus-like particles

    • Amyloid fiber formation, organism-specific biosystem (from REACTOME)
      Amyloid is a term used to describe deposits of fibrillar proteins, typically extracellular. The abnormal accumulation of amyloid, amyloidosis, is a term associated with tissue damage caused by amyloi...
    • Antimicrobial peptides, organism-specific biosystem (from REACTOME)
      Antimicrobial peptides (AMPs) are small molecular weight proteins with a broad spectrum of antimicrobial activity against bacteria, viruses, and fungi (Zasloff M 2002; Radek K & Gallo R 2007). The majo...
    • Immune System, organism-specific biosystem (from REACTOME)
      Humans are exposed to millions of potential pathogens daily, through contact, ingestion, and inhalation. Our ability to avoid infection depends on the adaptive immune system and during the first crit...
    • Innate Immune System, organism-specific biosystem (from REACTOME)
      Innate immunity encompasses the nonspecific part of immunity that is part of an individual's natural biologic makeup.
    • Metabolism of proteins, organism-specific biosystem (from REACTOME)
      Protein metabolism comprises the pathways of translation, post-translational modification and protein folding.

    Potential readthrough

    Included gene: SEMG2

    Clone Names

    • FLJ78262, MGC14719

    Gene Ontology Provided by GOA

    Function               Evidence Code
    metal ion binding      IMP (Inferred from Mutant Phenotype)
    protein binding        IPI (Inferred from Physical Interaction)

    Component              Evidence Code
    extracellular exosome  IDA (Inferred from Direct Assay)
    extracellular region   TAS (Traceable Author Statement)
    extracellular space    IDA (Inferred from Direct Assay)
    extracellular space    TAS (Traceable Author Statement)
    nucleus                IDA (Inferred from Direct Assay)
    protein complex        IDA (Inferred from Direct Assay)

    Preferred Names
    cancer/testis antigen 103
    semen coagulating protein

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds.

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.
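The version comparison described above can be sketched as follows. This is only an illustration: the helper names `split_accession` and `version_mismatches` are hypothetical, and the second accession list in the example is invented to show a mismatch, not taken from this record.

```python
def split_accession(acc_ver):
    """Split 'NM_003007.4' into ('NM_003007', 4)."""
    accession, sep, version = acc_ver.partition(".")
    if not sep or not version.isdigit():
        raise ValueError(f"expected ACCESSION.VERSION, got {acc_ver!r}")
    return accession, int(version)

def version_mismatches(curated, annotated):
    """Return (accession, curated_version, annotated_version) for each
    accession present in both lists with different version numbers."""
    annotated_versions = dict(split_accession(a) for a in annotated)
    mismatches = []
    for acc_ver in curated:
        accession, version = split_accession(acc_ver)
        other = annotated_versions.get(accession)
        if other is not None and other != version:
            mismatches.append((accession, version, other))
    return mismatches

# Curated RefSeq listed in this section.
curated = ["NM_003007.4"]

# Hypothetical genome-annotation list, made up purely to demonstrate a mismatch.
hypothetical_annotated = ["NM_003007.3"]

print(version_mismatches(curated, curated))                 # same versions
print(version_mismatches(curated, hypothetical_annotated))  # differing versions
```

Comparing on the accession without its version suffix is the key step: the suffix is the only part that changes when a sequence record is updated.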

    mRNA and Protein(s)

    1. NM_003007.4 → NP_002998.1  semenogelin-1 preproprotein

      Status: REVIEWED

      Source sequence(s)
      AA687874, BC055416, BP325874, J04440
      Consensus CDS
      ENSP00000361867.3, OTTHUMP00000031098, ENST00000372781.3, OTTHUMT00000079416
      Conserved Domains (1) summary
      Semenogelin; Semenogelin

    RefSeqs of Annotated Genomes: Homo sapiens Annotation Release 108

    The following sections contain reference sequences that belong to a specific genome build.

    Reference GRCh38.p7 Primary Assembly


    1. NC_000020.11 Reference GRCh38.p7 Primary Assembly

    Alternate CHM1_1.1


    1. NC_018931.2 Alternate CHM1_1.1

    Suppressed Reference Sequence(s)

    The following Reference Sequences have been suppressed.

    1. NM_198139.1: Suppressed sequence

      NM_198139.1: This RefSeq was permanently suppressed because the transcript lacked a 180 nt repeat unit in the coding sequence compared to the reference genome sequence.
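As a quick check of what the missing repeat would mean at the protein level, the arithmetic can be sketched as below. It assumes only what the note states, namely that the 180 nt unit lies within the coding sequence; nothing here is taken from the sequence records themselves.

```python
CODON_LENGTH = 3   # nucleotides per codon
repeat_nt = 180    # repeat unit length stated in the suppression note

# A whole number of codons means the deletion does not shift the reading frame.
codons, remainder = divmod(repeat_nt, CODON_LENGTH)
in_frame = remainder == 0

print("codons in repeat unit:", codons)
print("reading frame preserved:", in_frame)
```

Because 180 is a multiple of 3, a transcript lacking the unit would encode a protein shorter by that many residues rather than a frameshifted one.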