Format

Send to:

Choose Destination

ZBTB8OS zinc finger and BTB domain containing 8 opposite strand [ Homo sapiens (human) ]

Gene ID: 339487, updated on 8-Nov-2020

Summary

Official Symbol
ZBTB8OSprovided by HGNC
Official Full Name
zinc finger and BTB domain containing 8 opposite strandprovided by HGNC
Primary source
HGNC:HGNC:24094
See related
Ensembl:ENSG00000176261 MIM:615891
Gene type
protein coding
RefSeq status
VALIDATED
Organism
Homo sapiens
Lineage
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia; Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae; Homo
Also known as
ARCH; ARCH2
Expression
Ubiquitous expression in colon (RPKM 4.0), lymph node (RPKM 3.5) and 25 other tissues See more
Orthologs

Genomic context

See ZBTB8OS in Genome Data Viewer
Location:
1p35.1
Exon count:
11
Annotation release Status Assembly Chr Location
109.20200815 current GRCh38.p13 (GCF_000001405.39) 1 NC_000001.11 (32620818..32651008, complement)
105 previous assembly GRCh37.p13 (GCF_000001405.25) 1 NC_000001.10 (33086421..33116565, complement)

Chromosome 1 - NC_000001.11Genomic Context describing neighboring genes Neighboring gene zinc finger and BTB domain containing 8A Neighboring gene uncharacterized LOC102723870 Neighboring gene RB binding protein 4, chromatin remodeling factor Neighboring gene syncoilin, intermediate filament protein Neighboring gene Sharpr-MPRA regulatory region 8469

Genomic regions, transcripts, and products

Expression

  • Project title: HPA RNA-seq normal tissues
  • Description: RNA-seq was performed of tissue samples from 95 human individuals representing 27 different tissues in order to determine tissue-specificity of all protein-coding genes
  • BioProject: PRJEB4337
  • Publication: PMID 24309898
  • Analysis date: Wed Apr 4 07:08:55 2018

Bibliography

Pathways from PubChem

Interactions

Products Interactant Other Gene Complex Source Pubs Description

General gene information

Homology

Clone Names

  • MGC62007

Gene Ontology Provided by GOA

Function Evidence Code Pubs
metal ion binding IEA
Inferred from Electronic Annotation
more info
 
protein binding IPI
Inferred from Physical Interaction
more info
PubMed 
Component Evidence Code Pubs
nucleoplasm TAS
Traceable Author Statement
more info
 
tRNA-splicing ligase complex IBA
Inferred from Biological aspect of Ancestor
more info
PubMed 
tRNA-splicing ligase complex IDA
Inferred from Direct Assay
more info
PubMed 

General protein information

Preferred Names
protein archease
Names
archease (ARCH)
archease-like protein
zinc finger and BTB domain-containing opposite strand protein 8

NCBI Reference Sequences (RefSeq)

RefSeqs maintained independently of Annotated Genomes

These reference sequences exist independently of genome builds. Explain

These reference sequences are curated independently of the genome annotation cycle, so their versions may not match the RefSeq versions in the current genome build. Identify version mismatches by comparing the version of the RefSeq in this section to the one reported in Genomic regions, transcripts, and products above.

mRNA and Protein(s)

  1. NM_001308135.2NP_001295064.1  protein archease isoform 2

    Status: VALIDATED

    Description
    Transcript Variant: This variant (2) lacks an alternate in-frame exon in the 5' coding region compared to variant 1. It encodes isoform 2 which has the same N- and C- termini, but lacks a short internal segment compared to isoform 1.
    Source sequence(s)
    AC114489, AL033529
    Related
    ENSP00000483675.1, ENST00000373506.7
    Conserved Domains (1) summary
    pfam01951
    Location:47172
    Archease; Archease protein family (MTH1598/TM1083)
  2. NM_001308136.2NP_001295065.1  protein archease isoform 3

    See identical proteins and their annotated locations for NP_001295065.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (3) lacks an exon in the 3' coding region, which results in a frameshift and an early stop codon, compared to variant 1. The encoded isoform (3) is shorter and has a distinct C-terminus, compared to isoform 1.
    Source sequence(s)
    AC114489, AL033529
    UniProtKB/Swiss-Prot
    Q8IWT0
    Related
    ENSP00000413485.1, ENST00000436661.5
    Conserved Domains (1) summary
    pfam01951
    Location:43121
    Archease; Archease protein family (MTH1598/TM1083)
  3. NM_001308137.2NP_001295066.1  protein archease isoform 4

    Status: VALIDATED

    Description
    Transcript Variant: This variant (4) uses an alternate splice donor site in the 5' coding region, and lacks exons in the 5' and 3' coding regions, with the latter resulting in a frameshift and an early stop codon, compared to variant 1. The encoded isoform (4) contains two distinct amino acids near the N-terminus, lacks an internal segment, is shorter, and has a distinct C-terminus, compared to isoform 1.
    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:47114
    Archease; Archease protein family (MTH1598/TM1083)
  4. NM_001308138.2NP_001295067.1  protein archease isoform 5

    See identical proteins and their annotated locations for NP_001295067.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (5) has multiple differences in the coding region compared to variant 1. This variant represents translation initiation at a downstream start codon compared to variant 1; the 5'-most initiation codon, as used in variant 1, is associated with a truncated ORF that would render the transcript a candidate for nonsense-mediated decay (NMD). Leaky scanning may allow translation initiation at the downstream start codon to encode an isoform (5) that has a shorter N-terminus, compared to isoform 1. Variants 5, 6, 7, and 8 all encode the same isoform (5).
    Source sequence(s)
    AC114489, AL033529
    Consensus CDS
    CCDS76134.1
    UniProtKB/TrEMBL
    D3DPQ2
    Related
    ENSP00000481039.1, ENST00000465588.2
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  5. NM_001308139.2NP_001295068.1  protein archease isoform 5

    See identical proteins and their annotated locations for NP_001295068.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (6) uses an alternate splice acceptor site in the 5' coding region compared to variant 1. This variant represents translation initiation at a downstream start codon compared to variant 1; the 5'-most initiation codon, as used in variant 1, is associated with a truncated ORF that would render the transcript a candidate for nonsense-mediated decay (NMD). Leaky scanning may allow translation initiation at the downstream start codon to encode an isoform (5) that has a shorter N-terminus, compared to isoform 1. Variants 5, 6, 7, and 8 all encode the same isoform (5).
    Source sequence(s)
    AC114489, AL033529
    Consensus CDS
    CCDS76134.1
    UniProtKB/Swiss-Prot
    Q8IWT0
    UniProtKB/TrEMBL
    D3DPQ2
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  6. NM_001308140.2NP_001295069.1  protein archease isoform 5

    See identical proteins and their annotated locations for NP_001295069.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (7) uses an alternate splice donor site in the 5' coding region compared to variant 1. This variant represents translation initiation at a downstream start codon compared to variant 1; the 5'-most initiation codon, as used in variant 1, is associated with a truncated ORF that would render the transcript a candidate for nonsense-mediated decay (NMD). Leaky scanning may allow translation initiation at the downstream start codon to encode an isoform (5) that has a shorter N-terminus, compared to isoform 1. Variants 5, 6, 7, and 8 all encode the same isoform (5).
    Source sequence(s)
    AC114489, AL033529
    Consensus CDS
    CCDS76134.1
    UniProtKB/TrEMBL
    D3DPQ2
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  7. NM_001308141.2NP_001295070.1  protein archease isoform 5

    See identical proteins and their annotated locations for NP_001295070.1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (8) uses an alternate splice donor site in the 5' coding region compared to variant 1. This variant represents translation initiation at a downstream start codon compared to variant 1; the 5'-most initiation codon, as used in variant 1, is associated with a truncated ORF that would render the transcript a candidate for nonsense-mediated decay (NMD). Leaky scanning may allow translation initiation at the downstream start codon to encode an isoform (5) that has a shorter N-terminus, compared to isoform 1. Variants 5, 6, 7, and 8 all encode the same isoform (5).
    Source sequence(s)
    AC114489, AL033529
    Consensus CDS
    CCDS76134.1
    UniProtKB/TrEMBL
    D3DPQ2
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  8. NM_001330475.2NP_001317404.1  protein archease isoform 6

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Consensus CDS
    CCDS81292.1
    Related
    ENSP00000362600.3, ENST00000373501.6
    Conserved Domains (1) summary
    pfam01951
    Location:6131
    Archease; Archease protein family (MTH1598/TM1083)
  9. NM_001366255.1NP_001353184.1  protein archease isoform 6

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:6131
    Archease; Archease protein family (MTH1598/TM1083)
  10. NM_001366256.1NP_001353185.1  protein archease isoform 5

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  11. NM_001366257.1NP_001353186.1  protein archease isoform 5

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529, HY228209
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  12. NM_001366258.1NP_001353187.1  protein archease isoform 5

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  13. NM_001366259.1NP_001353188.1  protein archease isoform 5

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  14. NM_001366260.1NP_001353189.1  protein archease isoform 5

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  15. NM_001366263.1NP_001353192.1  protein archease isoform 5

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529, CN344730
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  16. NM_001366264.1NP_001353193.1  protein archease isoform 7

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529, HY108987
    Conserved Domains (1) summary
    pfam01951
    Location:43191
    Archease; Archease protein family (MTH1598/TM1083)
  17. NM_001366265.1NP_001353194.1  protein archease isoform 8

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529, CB992352
    Conserved Domains (1) summary
    pfam01951
    Location:43152
    Archease; Archease protein family (MTH1598/TM1083)
  18. NM_001366266.1NP_001353195.1  protein archease isoform 9

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529, HY067040
    Conserved Domains (1) summary
    pfam01951
    Location:43149
    Archease; Archease protein family (MTH1598/TM1083)
  19. NM_001366267.1NP_001353196.1  protein archease isoform 10

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:1127
    Archease; Archease protein family (MTH1598/TM1083)
  20. NM_001366268.1NP_001353197.1  protein archease isoform 11

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:1122
    Archease; Archease protein family (MTH1598/TM1083)
  21. NM_001366269.1NP_001353198.1  protein archease isoform 12

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    pfam01951
    Location:673
    Archease; Archease protein family (MTH1598/TM1083)
  22. NM_001366270.1NP_001353199.1  protein archease isoform 13

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Related
    ENSP00000484207.1, ENST00000492007.5
    Conserved Domains (1) summary
    cl00606
    Location:5493
    Archease; Archease protein family (MTH1598/TM1083)
  23. NM_001366271.1NP_001353200.1  protein archease isoform 14

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
    Conserved Domains (1) summary
    cl00606
    Location:6792
    Archease; Archease protein family (MTH1598/TM1083)
  24. NM_178547.5NP_848642.2  protein archease isoform 1

    Status: VALIDATED

    Description
    Transcript Variant: This variant (1) encodes the longest isoform (1).
    Source sequence(s)
    AL033529, AY151084
    Consensus CDS
    CCDS365.1

RNA

  1. NR_158772.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  2. NR_158773.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  3. NR_158774.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  4. NR_158775.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  5. NR_158776.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  6. NR_158777.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  7. NR_158778.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  8. NR_158779.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  9. NR_158780.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529
  10. NR_158781.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AA927666, AC114489, AL033529
  11. NR_158782.1 RNA Sequence

    Status: VALIDATED

    Source sequence(s)
    AC114489, AL033529, CK819207

RefSeqs of Annotated Genomes: Homo sapiens Updated Annotation Release 109.20200815

The following sections contain reference sequences that belong to a specific genome build. Explain

Reference GRCh38.p13 Primary Assembly

Genomic

  1. NC_000001.11 Reference GRCh38.p13 Primary Assembly

    Range
    32620818..32651008 complement
    Download
    GenBank, FASTA, Sequence Viewer (Graphics)

mRNA and Protein(s)

  1. XM_017001136.2XP_016856625.1  protein archease isoform X4

  2. XM_011541328.2XP_011539630.1  protein archease isoform X2

    See identical proteins and their annotated locations for XP_011539630.1

    Conserved Domains (1) summary
    pfam01951
    Location:6131
    Archease; Archease protein family (MTH1598/TM1083)
  3. XM_017001137.2XP_016856626.1  protein archease isoform X5

    UniProtKB/TrEMBL
    D3DPQ2
    Conserved Domains (1) summary
    pfam01951
    Location:1110
    Archease; Archease protein family (MTH1598/TM1083)
  4. XM_011541327.3XP_011539629.1  protein archease isoform X1

    Conserved Domains (1) summary
    pfam01951
    Location:43122
    Archease; Archease protein family (MTH1598/TM1083)

Suppressed Reference Sequence(s)

The following Reference Sequences have been suppressed. Explain

  1. NM_001366278.1: Suppressed sequence

    Description
    NM_001366278.1: This RefSeq was removed because it is redundant with an existing RefSeq.
Support Center