U.S. flag

An official website of the United States government

Format

Send to:

Choose Destination

Links from GEO Profiles

    • Showing Current items.

    MUC1 mucin 1, cell surface associated [ Homo sapiens (human) ]

    Gene ID: 4582, updated on 7-Apr-2024

    Summary

    Official Symbol
    MUC1provided by HGNC
    Official Full Name
    mucin 1, cell surface associatedprovided by HGNC
    Primary source
    HGNC:HGNC:7508
    See related
    Ensembl:ENSG00000185499 MIM:158340; AllianceGenome:HGNC:7508
    Gene type
    protein coding
    RefSeq status
    REVIEWED
    Organism
    Homo sapiens
    Lineage
    Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
    Also known as
    EMA; MCD; PEM; PUM; KL-6; MAM6; MCKD; PEMT; CD227; H23AG; MCKD1; MUC-1; ADMCKD; ADTKD2; Ca15-3; ADMCKD1; CA 15-3; MUC-1/X; MUC1/ZD; MUC-1/SEC
    Summary
    This gene encodes a membrane-bound protein that is a member of the mucin family. Mucins are O-glycosylated proteins that play an essential role in forming protective mucous barriers on epithelial surfaces. These proteins also play a role in intracellular signaling. This protein is expressed on the apical surface of epithelial cells that line the mucosal surfaces of many different tissues including lung, breast stomach and pancreas. This protein is proteolytically cleaved into alpha and beta subunits that form a heterodimeric complex. The N-terminal alpha subunit functions in cell-adhesion and the C-terminal beta subunit is involved in cell signaling. Overexpression, aberrant intracellular localization, and changes in glycosylation of this protein have been associated with carcinomas. This gene is known to contain a highly polymorphic variable number tandem repeats (VNTR) domain. Alternate splicing results in multiple transcript variants.[provided by RefSeq, Feb 2011]
    Annotation information
    Annotation category: suggests misassembly
    Expression
    Biased expression in stomach (RPKM 319.2), lung (RPKM 118.1) and 6 other tissues See more
    Orthologs
    NEW
    Try the new Gene table
    Try the new Transcript table

    Genomic context

    Location:
    1q22
    Exon count:
    11
    Annotation release Status Assembly Chr Location
    RS_2023_10 current GRCh38.p14 (GCF_000001405.40) 1 NC_000001.11 (155185824..155192915, complement)
    RS_2023_10 current T2T-CHM13v2.0 (GCF_009914755.1) 1 NC_060925.1 (154325440..154331478, complement)
    105.20220307 previous assembly GRCh37.p13 (GCF_000001405.25) 1 NC_000001.10 (155158300..155162706, complement)

    Chromosome 1 - NC_000001.11Genomic Context describing neighboring genes Neighboring gene keratinocyte associated protein 2 Neighboring gene ATAC-STARR-seq lymphoblastoid active region 1809 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:155145543-155146051 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:155146052-155146560 Neighboring gene Sharpr-MPRA regulatory region 7923 Neighboring gene tripartite motif containing 46 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:155157269-155157776 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 1397 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:155165957-155166792 Neighboring gene THBS3 antisense RNA 1 Neighboring gene H3K27ac-H3K4me1 hESC enhancer GRCh37_chr1:155166793-155167628 Neighboring gene microRNA 92b Neighboring gene thrombospondin 3 Neighboring gene H3K4me1 hESC enhancer GRCh37_chr1:155176689-155177472 Neighboring gene ATAC-STARR-seq lymphoblastoid silent region 1398 Neighboring gene metaxin 1

    Genomic regions, transcripts, and products

    Expression

    • Project title: HPA RNA-seq normal tissues
    • Description: RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes
    • BioProject: PRJEB4337
    • Publication: PMID 24309898
    • Analysis date: Wed Apr 4 07:08:55 2018

    Bibliography

    GeneRIFs: Gene References Into Functions

    What's a GeneRIF?

    Phenotypes

    EBI GWAS Catalog

    Description
    A shared susceptibility locus in PLCE1 at 10q23 for gastric adenocarcinoma and esophageal squamous cell carcinoma.
    EBI GWAS Catalog
    Genome-wide association studies of serum magnesium, potassium, and sodium concentrations identify six Loci influencing serum magnesium levels.
    EBI GWAS Catalog
    Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci.
    EBI GWAS Catalog

    HIV-1 interactions

    Replication interactions

    Interaction Pubs
    HIV-1 is inhibited by TNC (tenascin C), MUC1, and LTF (lactoferrin) as shown through neutralization assays in TZM-bl cells PubMed

    Protein interactions

    Protein Gene Interaction Pubs
    Nef nef Both HIV-1 Nef and Vpu downregulate the cell surface expression of mucin 1 (CD227) PubMed
    Vpu vpu Both HIV-1 Nef and Vpu downregulate the cell surface expression of mucin 1 (CD227) PubMed

    Go to the HIV-1, Human Interaction Database

    Pathways from PubChem

    Interactions

    Products Interactant Other Gene Complex Source Pubs Description

    General gene information

    Markers

    Gene Ontology Provided by GOA

    Function Evidence Code Pubs
    enables RNA polymerase II cis-regulatory region sequence-specific DNA binding IDA
    Inferred from Direct Assay
    more info
    PubMed 
    enables p53 binding IPI
    Inferred from Physical Interaction
    more info
    PubMed 
    enables protein binding IPI
    Inferred from Physical Interaction
    more info
    PubMed 
    enables transcription coregulator activity IDA
    Inferred from Direct Assay
    more info
    PubMed 
    Component Evidence Code Pubs
    located_in Golgi lumen TAS
    Traceable Author Statement
    more info
     
    is_active_in apical plasma membrane IBA
    Inferred from Biological aspect of Ancestor
    more info
     
    part_of chromatin IDA
    Inferred from Direct Assay
    more info
    PubMed 
    located_in extracellular exosome HDA PubMed 
    located_in extracellular exosome IDA
    Inferred from Direct Assay
    more info
    PubMed 
    located_in extracellular space HDA PubMed 
    located_in nucleus IEA
    Inferred from Electronic Annotation
    more info
     
    located_in plasma membrane IDA
    Inferred from Direct Assay
    more info
     
    located_in plasma membrane TAS
    Traceable Author Statement
    more info
     
    located_in vesicle HDA PubMed 

    General protein information

    Preferred Names
    mucin-1
    Names
    H23 antigen
    breast carcinoma-associated antigen DF3
    cancer antigen 15-3
    carcinoma-associated mucin
    episialin
    krebs von den Lungen-6
    mucin 1, transmembrane
    peanut-reactive urinary mucin
    polymorphic epithelial mucin
    tumor associated epithelial mucin
    tumor-associated epithelial membrane antigen

    NCBI Reference Sequences (RefSeq)

    NEW Try the new Transcript table

    RefSeqs maintained independently of Annotated Genomes

    These reference sequences exist independently of genome builds. Explain

    These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

    mRNA and Protein(s)

    1. NM_001018016.3NP_001018016.1  mucin-1 isoform 2 precursor

      See identical proteins and their annotated locations for NP_001018016.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (2) uses alternate in-frame splice sites for two exons, compared to variant 1. The resulting isoform (2) is shorter than isoform 1.
      Source sequence(s)
      AA425062, BC120975, J05581
      Consensus CDS
      CCDS30882.1
      UniProtKB/TrEMBL
      A5YRU8
      Related
      ENSP00000357377.3, ENST00000368392.7
      Conserved Domains (1) summary
      smart00200
      Location:45161
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
    2. NM_001018017.3NP_001018017.1  mucin-1 isoform 3 precursor

      See identical proteins and their annotated locations for NP_001018017.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (3) uses an alternate in-frame splice site, compared to variant 1. The resulting isoform (3) is shorter than isoform 1.
      Source sequence(s)
      AA425062, AC234582, BC120974, J05581
      Consensus CDS
      CCDS30883.1
      UniProtKB/TrEMBL
      A5YRU8
      Related
      ENSP00000357375.3, ENST00000368390.7
      Conserved Domains (1) summary
      smart00200
      Location:36152
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
    3. NM_001044390.3NP_001037855.1  mucin-1 isoform 5 precursor

      See identical proteins and their annotated locations for NP_001037855.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (5) has multiple differences which result in a frameshift in the central coding region, compared to variant 1. These differences result in a shorter isoform (5), compared to isoform 1.
      Source sequence(s)
      AA425062, AC234582, AY327582, J05581
      Consensus CDS
      CCDS41409.1
      UniProtKB/Swiss-Prot
      P15941
      Related
      ENSP00000339690.5, ENST00000343256.9
    4. NM_001044391.3NP_001037856.1  mucin-1 isoform 6 precursor

      Status: REVIEWED

      Description
      Transcript Variant: This variant (6) has multiple differences but maintains the reading frame, compared to variant 1. These differences result in a shorter isoform (6), compared to isoform 1.
      Source sequence(s)
      AA425062, AC234582, AY327583, J05581
      UniProtKB/TrEMBL
      Q7Z551
      Conserved Domains (2) summary
      cl00944
      Location:110145
      KdpC; K+-transporting ATPase, c chain
      cl02507
      Location:3676
      SEA; SEA domain
    5. NM_001044392.3NP_001037857.1  mucin-1 isoform 7 precursor

      See identical proteins and their annotated locations for NP_001037857.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (7) has multiple differences but maintains the reading frame, compared to variant 1. These differences result in a shorter isoform (7), compared to isoform 1.
      Source sequence(s)
      AA425062, AC234582, AY327584, J05581
      Consensus CDS
      CCDS41408.1
      UniProtKB/TrEMBL
      A0A384NPK6, A6ZIE7
      Related
      ENSP00000357381.4, ENST00000368396.8
      Conserved Domains (1) summary
      cl02507
      Location:4585
      SEA; SEA domain
    6. NM_001044393.3NP_001037858.1  mucin-1 isoform 8 precursor

      See identical proteins and their annotated locations for NP_001037858.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (8) lacks an exon in the coding region, which results in a frameshift and an early stop codon, compared to variant 1. The encoded isoform (8) is shorter and has a distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AA425062, AC234582, AY327598, J05581
      UniProtKB/TrEMBL
      Q7Z538
      Related
      ENSP00000484730.1, ENST00000471283.5
    7. NM_001204285.2NP_001191214.1  mucin-1 isoform 9 precursor

      See identical proteins and their annotated locations for NP_001191214.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (9) has multiple differences but maintains the reading frame, compared to variant 1. The encoded isoform (9) is longer than isoform 1.
      Source sequence(s)
      AA425062, AC234582, J05581
      Consensus CDS
      CCDS55640.2
      UniProtKB/TrEMBL
      A0A0C4DGW3, B4DWK6
      Related
      ENSP00000481231.1, ENST00000620103.4
      Conserved Domains (1) summary
      smart00200
      Location:254372
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
    8. NM_001204286.1NP_001191215.1  mucin-1 isoform 10 precursor

      Status: REVIEWED

      Description
      Transcript Variant: This variant (10) has multiple differences but maintains the reading frame, compared to variant 1. The encoded isoform (10) is longer than isoform 1.
      Source sequence(s)
      AA425062, AC234582, AH001451, J05581
      UniProtKB/TrEMBL
      B4DWK6
      Conserved Domains (1) summary
      smart00200
      Location:263381
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
    9. NM_001204287.2NP_001191216.1  mucin-1 isoform 11 precursor

      See identical proteins and their annotated locations for NP_001191216.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (11) uses an alternate splice site, compared to variant 1. The encoded isoform (11) is longer than isoform 1.
      Source sequence(s)
      AA425062, EF583647, J05581
      UniProtKB/TrEMBL
      A5YRU5, A6ZID5
      Related
      ENSP00000388172.3, ENST00000457295.6
      Conserved Domains (1) summary
      smart00200
      Location:64179
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
    10. NM_001204288.2NP_001191217.1  mucin-1 isoform 12 precursor

      See identical proteins and their annotated locations for NP_001191217.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (12) has multiple differences, one of which results in a frameshift and an early stop codon, compared to variant 1. The encoded isoform (12) is shorter and has a distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AA425062, EF583649, J05581
      UniProtKB/TrEMBL
      A5YRU7, B6ECB2
      Conserved Domains (1) summary
      smart00200
      Location:45150
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
    11. NM_001204289.2NP_001191218.1  mucin-1 isoform 13 precursor

      See identical proteins and their annotated locations for NP_001191218.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (13) has multiple differences but maintains the reading frame, compared to variant 1. The encoded isoform (13) is shorter than isoform 1.
      Source sequence(s)
      AA425062, AC234582, EF583652, J05581
      UniProtKB/TrEMBL
      A5YRV0, A6ZID6
      Related
      ENSP00000389098.1, ENST00000438413.5
      Conserved Domains (1) summary
      cl02507
      Location:63135
      SEA; SEA domain
    12. NM_001204290.2NP_001191219.1  mucin-1 isoform 14 precursor

      See identical proteins and their annotated locations for NP_001191219.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (14) has multiple differences but maintains the reading frame, compared to variant 1. The encoded isoform (14) is shorter than isoform 1.
      Source sequence(s)
      AA425062, EF583654, J05581
      UniProtKB/TrEMBL
      A5YRV2, B1AVQ7
      Conserved Domains (1) summary
      cl02507
      Location:42114
      SEA; SEA domain
    13. NM_001204291.1NP_001191220.1  mucin-1 isoform 15 precursor

      Status: REVIEWED

      Description
      Transcript Variant: This variant (15) has multiple differences but maintains the reading frame, compared to variant 1. The encoded isoform (15) is shorter than isoform 1.
      Source sequence(s)
      AA425062, AC234582, EF670708, J05581
      Consensus CDS
      CCDS72934.1
      UniProtKB/TrEMBL
      A0A0A0MRB3, A6ZID6
      Related
      ENSP00000343482.6, ENST00000338684.9
      Conserved Domains (1) summary
      smart00200
      Location:62138
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
    14. NM_001204292.1NP_001191221.1  mucin-1 isoform 16 precursor

      Status: REVIEWED

      Description
      Transcript Variant: This variant (16) has multiple differences compared to variant 1, one of which results in a translational frameshift in the central coding region. The encoded isoform (16) is shorter than isoform 1.
      Source sequence(s)
      AA425062, AC234582, EF670709, J05581
      Consensus CDS
      CCDS72935.1
      UniProtKB/TrEMBL
      A0A087X0L2, A6ZID7
      Related
      ENSP00000483482.1, ENST00000610359.4
      Conserved Domains (1) summary
      cl02507
      Location:80136
      SEA; SEA domain
    15. NM_001204293.2NP_001191222.1  mucin-1 isoform 17 precursor

      See identical proteins and their annotated locations for NP_001191222.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (17) lacks an exon in the 3' coding region, which results in a frameshift and an early stop codon, compared to variant 1. The encoded isoform (17) is shorter and has a distinct C-terminus, compared to isoform 1.
      Source sequence(s)
      AA425062, AC234582, EF670711, J05581
      Consensus CDS
      CCDS55642.1
      UniProtKB/TrEMBL
      B6ECB2
      Related
      ENSP00000357378.3, ENST00000368393.7
      Conserved Domains (1) summary
      smart00200
      Location:55158
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin
    16. NM_001204294.2NP_001191223.1  mucin-1 isoform 18 precursor

      See identical proteins and their annotated locations for NP_001191223.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (18) has multiple differences compared to variant 1, one of which results in a translational frameshift in the central coding region. The encoded isoform (18) is shorter than isoform 1.
      Source sequence(s)
      AA425062, AC234582, EF670712, J05581
      Consensus CDS
      CCDS55641.1
      UniProtKB/TrEMBL
      A6ZID7
      Related
      ENSP00000357383.3, ENST00000368398.7
      Conserved Domains (1) summary
      cl02507
      Location:71127
      SEA; SEA domain
    17. NM_001204295.1NP_001191224.1  mucin-1 isoform 19 precursor

      See identical proteins and their annotated locations for NP_001191224.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (19) has multiple differences but maintains the reading frame, compared to variant 1. These differences result in a shorter isoform (19), compared to isoform 1.
      Source sequence(s)
      AA425062, AC234582, EF670716, J05581
      Consensus CDS
      CCDS72936.1
      UniProtKB/TrEMBL
      A6ZIE4, Q7Z549
      Related
      ENSP00000483473.1, ENST00000615517.4
      Conserved Domains (1) summary
      cl02507
      Location:4585
      SEA; SEA domain
    18. NM_001204296.2NP_001191225.1  mucin-1 isoform 20 precursor

      See identical proteins and their annotated locations for NP_001191225.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (20) has multiple differences compared to variant 1, one of which results in a translational frameshift in the central coding region. The encoded isoform (20) is shorter than isoform 1.
      Source sequence(s)
      AA425062, FJ226053, J05581
      UniProtKB/TrEMBL
      B6ECB3
    19. NM_001204297.2NP_001191226.1  mucin-1 isoform 21 precursor

      See identical proteins and their annotated locations for NP_001191226.1

      Status: REVIEWED

      Description
      Transcript Variant: This variant (21) has multiple differences but maintains the reading frame, compared to variant 1. These differences result in a shorter isoform (21), compared to isoform 1.
      Source sequence(s)
      AA425062, EF670718, J05581
      UniProtKB/TrEMBL
      A6ZIE6, A6ZIE7
      Related
      ENSP00000357374.2, ENST00000368389.6
      Conserved Domains (2) summary
      cl00944
      Location:137172
      KdpC; K+-transporting ATPase, c chain
      cl02507
      Location:64103
      SEA; SEA domain
    20. NM_001371720.2NP_001358649.2  mucin-1 isoform 22 precursor

      Status: REVIEWED

      Description
      Transcript Variant: This variant (22) has multiple differences but maintains the reading frame, compared to variant 1. The encoded isoform (22) is longer than isoform 1.
      Source sequence(s)
      CP068277
    21. NM_002456.6NP_002447.4  mucin-1 isoform 1 precursor

      See identical proteins and their annotated locations for NP_002447.4

      Status: REVIEWED

      Description
      Transcript Variant: This variant (1) represents isoform 1.
      Source sequence(s)
      AA425062, AC234582, J05581, U60261
      Consensus CDS
      CCDS1098.1
      UniProtKB/TrEMBL
      A6ZID5
      Related
      ENSP00000338983.5, ENST00000337604.6
      Conserved Domains (1) summary
      smart00200
      Location:55170
      SEA; Domain found in sea urchin sperm protein, enterokinase, agrin

    RefSeqs of Annotated Genomes: GCF_000001405.40-RS_2023_10

    The following sections contain reference sequences that belong to a specific genome build. Explain

    Reference GRCh38.p14 Primary Assembly

    Genomic

    1. NC_000001.11 Reference GRCh38.p14 Primary Assembly

      Range
      155185824..155192915 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    Alternate T2T-CHM13v2.0

    Genomic

    1. NC_060925.1 Alternate T2T-CHM13v2.0

      Range
      154325440..154331478 complement
      Download
      GenBank, FASTA, Sequence Viewer (Graphics)

    Suppressed Reference Sequence(s)

    The following Reference Sequences have been suppressed. Explain

    1. NG_029383.1: Suppressed sequence

      Description
      NG_029383.1: This RefSeq was removed because currently there is insufficient data to represent an accurate genomic sequence.
    2. NM_001018021.1: Suppressed sequence

      Description
      NM_001018021.1: This RefSeq was permanently suppressed because it is a nonsense-mediated mRNA decay (NMD) candidate.
    3. NM_182741.1: Suppressed sequence

      Description
      NM_182741.1: This RefSeq was permanently suppressed because currently there is insufficient support from non-cancerous tissues for the transcript and protein.