Report for CCDS43988.1 (current version)

CCDS: 43988.1
Status: Public
Species: Homo sapiens
Chromosome: X
Gene: THOC2 (NCBI Gene ID: 57187)
CCDS Release: 24
NCBI Annotation Release: 110
Ensembl Annotation Release: 108

Public since: CCDS release 5, NCBI annotation release 36.3, Ensembl annotation release 47

Review status: Reviewed (by RefSeq and Havana)

Sequence IDs included in CCDS 43988.1

Original         Current         Source  Nucleotide ID       Protein ID         MANE         Status in CCDS  Seq. Status
Original member  Current member  EBI     ENST00000245838.13  ENSP00000245838.8  MANE Select  Accepted        alive
Original member  Current member  EBI     ENST00000355725.8   ENSP00000347959.4  -            Accepted        alive
Original member  Current member  NCBI    NM_001081550.2      NP_001075019.1     MANE Select  Accepted        alive

RefSeq          Length  Related UniProtKB/SwissProt  Length  Identity  Gaps  Mismatches
NP_001075019.1  1593    Q8NI27-1                     1593    100%      0     0

Chromosomal Locations for CCDS 43988.1

Assembly GRCh38.p14 (GCF_000001405.40)

On '-' strand of Chromosome X (NC_000023.11)

Chromosome  Start      Stop
X           123610936  123610963
X           123611440  123611516
X           123613399  123613556
X           123613639  123613708
X           123614052  123614189
X           123619401  123619436
X           123620907  123620965
X           123621157  123621587
X           123622758  123622860
X           123623105  123623283
X           123623787  123623971
X           123624060  123624191
X           123624541  123624669
X           123625912  123626069
X           123626521  123626662
X           123627693  123627968
X           123631688  123631852
X           123632861  123633040
X           123633953  123634070
X           123636079  123636175
X           123638043  123638123
X           123638934  123639027
X           123640538  123640622
X           123644575  123644676
X           123644779  123644909
X           123645334  123645375
X           123665642  123665837
X           123667106  123667278
X           123668159  123668314
X           123671669  123671761
X           123686548  123686714
X           123696021  123696154
X           123696721  123696842
X           123697681  123697751
X           123703454  123703505
X           123706858  123706949
X           123712850  123712908
X           123732952  123733022
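
As a consistency check, the intervals in the table above can be summed and compared with the 4782 nt CDS length and the 1593 aa translation reported on this page. The following is a minimal sketch, not part of any CCDS tool: it assumes Start and Stop are both inclusive (so each interval spans stop - start + 1 bases), and the names `CDS_INTERVALS` and `interval_length` are illustrative.

```python
# CDS intervals for CCDS43988.1 on chromosome X (GRCh38.p14),
# copied from the Chromosome/Start/Stop table on this page.
# Assumption: Start and Stop are both inclusive coordinates.
CDS_INTERVALS = [
    (123610936, 123610963), (123611440, 123611516),
    (123613399, 123613556), (123613639, 123613708),
    (123614052, 123614189), (123619401, 123619436),
    (123620907, 123620965), (123621157, 123621587),
    (123622758, 123622860), (123623105, 123623283),
    (123623787, 123623971), (123624060, 123624191),
    (123624541, 123624669), (123625912, 123626069),
    (123626521, 123626662), (123627693, 123627968),
    (123631688, 123631852), (123632861, 123633040),
    (123633953, 123634070), (123636079, 123636175),
    (123638043, 123638123), (123638934, 123639027),
    (123640538, 123640622), (123644575, 123644676),
    (123644779, 123644909), (123645334, 123645375),
    (123665642, 123665837), (123667106, 123667278),
    (123668159, 123668314), (123671669, 123671761),
    (123686548, 123686714), (123696021, 123696154),
    (123696721, 123696842), (123697681, 123697751),
    (123703454, 123703505), (123706858, 123706949),
    (123712850, 123712908), (123732952, 123733022),
]


def interval_length(start: int, stop: int) -> int:
    """Length of an interval whose start and stop are both inclusive."""
    return stop - start + 1


total_nt = sum(interval_length(s, e) for s, e in CDS_INTERVALS)

# The CDS lies on the '-' strand, so the table (ascending genomic order)
# lists the intervals in reverse transcript order.
transcript_order = list(reversed(CDS_INTERVALS))
first_exon_len = interval_length(*transcript_order[0])

print("intervals:", len(CDS_INTERVALS))
print("total CDS length (nt):", total_nt)
print("codons including stop:", total_nt // 3)
print("first coding interval in transcript order (nt):", first_exon_len)
```

Under the inclusive-coordinate assumption the 38 intervals sum to 4782 nt, which equals 3 × (1593 + 1): 1593 codons for the protein plus one stop codon.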

CCDS Sequence Data

Nucleotide Sequence (4782 nt):
ATGGCGGCCGCGGCTGTGGTGGTTCCCGCAGAGTGGATAAAGAACTGGGAGAAATCAGGGAGAGGCGAAT
TTTTGCATTTATGTCGGATCCTCAGTGAAAATAAAAGCCATGATAGTTCAACATACAGAGATTTCCAGCA
AGCTCTCTATGAGTTGTCATATCATGTAATTAAAGGAAATCTAAAGCATGAACAGGCATCTAATGTTCTT
AGTGACATTAGTGAATTTCGTGAGGATATGCCCTCCATTCTTGCTGATGTATTCTGCATATTAGACATTG
AGACAAATTGTTTAGAAGAAAAAAGCAAGAGAGACTATTTTACACAGTTGGTATTAGCATGTTTGTATTT
AGTTTCAGACACAGTTCTAAAGGAACGCCTGGATCCAGAAACACTGGAATCATTAGGGCTTATCAAACAA
TCACAGCAATTCAATCAAAAGTCAGTTAAAATCAAGACAAAACTCTTTTATAAGCAGCAAAAATTCAATT
TGTTAAGAGAAGAGAATGAAGGTTATGCCAAGCTGATTGCTGAATTGGGGCAAGATTTATCTGGAAGTAT
TACTAGTGATTTAATCTTAGAAAATATCAAATCTTTAATAGGATGCTTTAATCTGGATCCCAATAGAGTT
TTGGATGTCATTTTAGAAGTGTTTGAATGCAGGCCAGAACACGATGACTTCTTTATATCTTTGTTAGAAT
CTTACATGAGTATGTGTGAACCGCAAACACTGTGTCATATTCTTGGGTTCAAATTCAAGTTTTACCAGGA
ACCAAATGGCGAGACACCATCATCTTTATACAGAGTTGCAGCAGTACTTCTACAATTTAATCTTATTGAT
TTAGATGATCTTTATGTACATCTTCTTCCGGCTGATAATTGCATTATGGATGAACACAAACGAGAAATTG
CGGAAGCTAAGCAAATTGTTAGAAAGCTTACGATGGTTGTGTTGTCTTCTGAAAAAATGGATGAGCGAGA
GAAAGAAAAGGAAAAAGAAGAGGAGAAAGTAGAGAAACCACCTGATAACCAAAAACTTGGCTTGTTGGAA
GCCTTATTAAAGATTGGTGATTGGCAACATGCACAGAACATTATGGATCAGATGCCTCCATACTATGCAG
CTTCACACAAGCTAATAGCCCTTGCTATTTGCAAGCTCATTCATATAACTATTGAGCCTCTCTACCGAAG
AGTTGGAGTTCCTAAAGGTGCTAAAGGCTCACCTGTTAATGCTTTGCAAAACAAGAGAGCACCAAAACAA
GCTGAGAGCTTTGAAGATTTGAGGAGAGACGTGTTCAATATGTTCTGTTACCTTGGTCCTCACCTTTCTC
ACGATCCCATTTTATTTGCAAAAGTGGTGCGCATAGGCAAGTCATTTATGAAGGAGTTTCAGTCTGATGG
AAGCAAACAAGAAGATAAAGAAAAAACGGAAGTTATCCTTAGCTGTTTGCTTAGCATTACTGACCAGGTA
CTACTTCCATCTCTTTCTTTGATGGACTGCAATGCTTGTATGTCTGAGGAACTATGGGGAATGTTTAAAA
CATTTCCATATCAGCATAGATATCGTCTGTATGGCCAGTGGAAGAATGAAACTTATAACAGTCACCCACT
TTTAGTAAAAGTTAAAGCTCAAACAATAGACAGAGCCAAATATATCATGAAGCGCCTAACCAAGGAAAAT
GTGAAGCCTTCTGGAAGACAAATTGGGAAGTTGAGCCACAGCAATCCAACCATTTTGTTTGATTATATCT
TGTCACAAATACAGAAGTATGATAACTTAATAACACCTGTAGTAGATTCATTGAAATACCTCACTTCACT
GAATTATGATGTCTTGGCCTATTGTATCATTGAAGCTTTAGCTAATCCAGAAAAGGAAAGAATGAAACAT
GATGACACAACCATCTCAAGCTGGCTTCAGAGTCTGGCTAGTTTCTGTGGTGCAGTTTTTCGTAAATATC
CAATTGATCTTGCTGGTCTTCTTCAGTATGTTGCCAATCAGCTAAAGGCGGGCAAAAGTTTTGACCTGCT
TATATTGAAAGAAGTGGTACAAAAAATGGCAGGAATAGAAATTACAGAGGAAATGACAATGGAGCAACTA
GAGGCTATGACTGGTGGAGAGCAGCTAAAAGCTGAGGGTGGTTATTTTGGTCAGATCAGAAACACTAAAA
AATCCTCTCAGAGATTAAAGGATGCTCTATTGGACCATGATCTTGCCCTTCCTCTCTGTCTGCTTATGGC
TCAGCAGAGAAATGGGGTAATCTTTCAGGAAGGTGGAGAGAAACATTTGAAACTTGTGGGAAAGCTCTAT
GACCAGTGTCATGATACCCTGGTGCAGTTTGGTGGGTTTTTAGCATCTAATCTGAGCACAGAAGATTATA
TAAAGCGAGTGCCTTCAATTGATGTACTCTGTAATGAATTTCATACACCCCATGATGCAGCATTTTTCCT
GTCTAGGCCAATGTATGCCCATCATATTTCGTCAAAGTATGATGAACTTAAAAAATCAGAAAAGGGAAGT
AAACAGCAACATAAAGTTCATAAGTACATTACATCATGTGAGATGGTGATGGCGCCTGTCCATGAAGCAG
TGGTCTCCTTACATGTTTCCAAAGTCTGGGATGACATCAGCCCTCAATTCTATGCTACATTCTGGTCATT
GACAATGTATGACCTTGCAGTTCCACACACCAGCTATGAACGAGAAGTCAATAAACTTAAAGTCCAGATG
AAAGCAATTGATGACAATCAGGAAATGCCCCCAAATAAAAAGAAAAAAGAGAAGGAGCGCTGTACTGCCC
TTCAGGACAAGCTTCTTGAAGAAGAAAAGAAACAGATGGAACATGTACAGAGAGTTCTACAGAGATTGAA
ACTGGAAAAGGACAACTGGCTTTTAGCAAAATCTACCAAAAATGAGACCATCACAAAATTTCTACAGCTG
TGTATATTTCCTCGATGTATTTTTTCAGCAATTGATGCTGTTTACTGTGCTCGTTTTGTTGAATTGGTAC
ATCAACAGAAAACTCCAAATTTTTCCACACTTCTTTGCTATGATCGAGTTTTCTCTGACATAATTTACAC
AGTTGCAAGCTGTACTGAAAATGAAGCCAGTCGATACGGAAGGTTTCTTTGCTGCATGTTAGAGACTGTG
ACCAGGTGGCATAGTGATAGAGCCACATATGAAAAGGAATGTGGAAACTATCCAGGATTCCTTACCATAT
TACGGGCAACTGGATTTGATGGTGGAAATAAGGCTGATCAATTAGACTATGAAAATTTTCGACATGTTGT
ACATAAATGGCATTACAAACTAACCAAGGCATCGGTACATTGCCTTGAAACAGGCGAATATACTCACATC
AGGAATATCTTGATTGTGCTAACAAAAATACTTCCTTGGTACCCAAAAGTTTTGAATCTGGGTCAAGCTT
TGGAAAGAAGAGTACACAAAATCTGCCAAGAAGAAAAAGAGAAGAGGCCAGATCTATATGCATTGGCTAT
GGGCTACTCTGGGCAGTTGAAAAGTAGAAAGTCATACATGATACCTGAAAATGAGTTTCATCACAAAGAC
CCCCCTCCGAGGAATGCAGTTGCCAGTGTGCAAAATGGGCCTGGTGGTGGGCCTTCTTCATCATCAATAG
GAAGTGCATCTAAATCGGATGAAAGCAGTACTGAGGAGACTGATAAATCAAGGGAGAGATCTCAGTGTGG
TGTGAAAGCTGTTAATAAAGCTTCTAGTACCACACCTAAAGGGAATTCAAGCAATGGAAATAGTGGCTCT
AACAGCAACAAAGCTGTTAAAGAAAATGACAAAGAAAAAGGGAAAGAGAAAGAAAAAGAGAAAAAAGAAA
AGACTCCAGCTACTACTCCAGAGGCCAGGGTACTTGGTAAAGATGGTAAAGAAAAACCAAAGGAAGAGCG
GCCAAATAAAGATGAAAAAGCAAGAGAGACCAAGGAAAGAACGCCGAAGTCTGACAAAGAGAAAGAAAAA
TTCAAGAAGGAAGAAAAAGCTAAAGATGAGAAATTTAAGACCACTGTCCCCAACGCAGAATCAAAATCAA
CTCAAGAAAGGGAAAGAGAGAAGGAGCCATCCAGAGAAAGAGATATAGCAAAGGAAATGAAATCAAAGGA
AAATGTTAAAGGAGGAGAAAAAACACCAGTTTCTGGGTCCTTGAAATCACCTGTTCCCAGATCAGATATT
CCAGAGCCTGAAAGGGAACAAAAACGCCGCAAAATTGATACTCACCCTTCTCCATCACATTCCTCCACAG
TAAAGGACAGTCTCATCGAACTCAAGGAATCTTCAGCAAAGCTCTACATTAATCATACTCCTCCACCACT
GTCCAAGAGTAAGGAGAGAGAAATGGACAAGAAAGATTTGGACAAGTCAAGGGAAAGATCCAGAGAAAGA
GAGAAAAAAGATGAAAAGGACAGGAAAGAGCGGAAAAGGGATCACTCAAACAACGACCGTGAAGTGCCAC
CGGACTTAACCAAGAGACGTAAAGAGGAGAATGGAACAATGGGGGTTTCAAAACATAAAAGTGAAAGTCC
TTGTGAATCTCCTTATCCAAATGAGAAAGACAAGGAAAAAAATAAGTCAAAATCTTCAGGCAAAGAAAAA
GGCAGTGATTCATTTAAATCTGAGAAGATGGATAAAATCTCCTCCGGTGGCAAAAAGGAGTCCAGGCATG
ATAAAGAAAAGATAGAAAAGAAAGAGAAACGGGACAGTTCAGGAGGAAAGGAAGAGAAGAAACATCATAA
GTCCTCGGACAAGCACAGATAA


Translation (1593 aa):
MAAAAVVVPAEWIKNWEKSGRGEFLHLCRILSENKSHDSSTYRDFQQALYELSYHVIKGNLKHEQASNVL
SDISEFREDMPSILADVFCILDIETNCLEEKSKRDYFTQLVLACLYLVSDTVLKERLDPETLESLGLIKQ
SQQFNQKSVKIKTKLFYKQQKFNLLREENEGYAKLIAELGQDLSGSITSDLILENIKSLIGCFNLDPNRV
LDVILEVFECRPEHDDFFISLLESYMSMCEPQTLCHILGFKFKFYQEPNGETPSSLYRVAAVLLQFNLID
LDDLYVHLLPADNCIMDEHKREIAEAKQIVRKLTMVVLSSEKMDEREKEKEKEEEKVEKPPDNQKLGLLE
ALLKIGDWQHAQNIMDQMPPYYAASHKLIALAICKLIHITIEPLYRRVGVPKGAKGSPVNALQNKRAPKQ
AESFEDLRRDVFNMFCYLGPHLSHDPILFAKVVRIGKSFMKEFQSDGSKQEDKEKTEVILSCLLSITDQV
LLPSLSLMDCNACMSEELWGMFKTFPYQHRYRLYGQWKNETYNSHPLLVKVKAQTIDRAKYIMKRLTKEN
VKPSGRQIGKLSHSNPTILFDYILSQIQKYDNLITPVVDSLKYLTSLNYDVLAYCIIEALANPEKERMKH
DDTTISSWLQSLASFCGAVFRKYPIDLAGLLQYVANQLKAGKSFDLLILKEVVQKMAGIEITEEMTMEQL
EAMTGGEQLKAEGGYFGQIRNTKKSSQRLKDALLDHDLALPLCLLMAQQRNGVIFQEGGEKHLKLVGKLY
DQCHDTLVQFGGFLASNLSTEDYIKRVPSIDVLCNEFHTPHDAAFFLSRPMYAHHISSKYDELKKSEKGS
KQQHKVHKYITSCEMVMAPVHEAVVSLHVSKVWDDISPQFYATFWSLTMYDLAVPHTSYEREVNKLKVQM
KAIDDNQEMPPNKKKKEKERCTALQDKLLEEEKKQMEHVQRVLQRLKLEKDNWLLAKSTKNETITKFLQL
CIFPRCIFSAIDAVYCARFVELVHQQKTPNFSTLLCYDRVFSDIIYTVASCTENEASRYGRFLCCMLETV
TRWHSDRATYEKECGNYPGFLTILRATGFDGGNKADQLDYENFRHVVHKWHYKLTKASVHCLETGEYTHI
RNILIVLTKILPWYPKVLNLGQALERRVHKICQEEKEKRPDLYALAMGYSGQLKSRKSYMIPENEFHHKD
PPPRNAVASVQNGPGGGPSSSSIGSASKSDESSTEETDKSRERSQCGVKAVNKASSTTPKGNSSNGNSGS
NSNKAVKENDKEKGKEKEKEKKEKTPATTPEARVLGKDGKEKPKEERPNKDEKARETKERTPKSDKEKEK
FKKEEKAKDEKFKTTVPNAESKSTQEREREKEPSRERDIAKEMKSKENVKGGEKTPVSGSLKSPVPRSDI
PEPEREQKRRKIDTHPSPSHSSTVKDSLIELKESSAKLYINHTPPPLSKSKEREMDKKDLDKSRERSRER
EKKDEKDRKERKRDHSNNDREVPPDLTKRRKEENGTMGVSKHKSESPCESPYPNEKDKEKNKSKSSGKEK
GSDSFKSEKMDKISSGGKKESRHDKEKIEKKEKRDSSGGKEEKKHHKSSDKHR
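
The relationship between the nucleotide sequence and its translation can be spot-checked with a few lines of code. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the function name `translate` and the two short fragments, copied from the start and end of the nucleotide sequence on this page, are illustrative choices rather than anything the CCDS site provides.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in TCAG order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 33 nt and last 21 nt of the CCDS43988.1 nucleotide sequence.
cds_start = "ATGGCGGCCGCGGCTGTGGTGGTTCCCGCAGAG"
cds_end = "TCCTCGGACAAGCACAGATAA"

print(translate(cds_start))  # MAAAAVVVPAE
print(translate(cds_end))    # SSDKHR*
```

Both fragments agree with the ends of the 1593 aa translation, and the final TAA stop codon accounts for the three nucleotides by which 4782 exceeds 3 × 1593.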


