NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|119618940|gb|EAW98534|]
View 

E1A binding protein p400, isoform CRA_a [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
EP400_N super family cl25813
E1A-binding protein p400, N-terminal; EP400_N is a family of eukaryote proteins. the exact ...
1-469 0e+00

E1A-binding protein p400, N-terminal; EP400_N is a family of eukaryote proteins. the exact function of this domain is not known. This family is largely low-complexity residues.


The actual alignment was detected with superfamily member pfam15790:

Pssm-ID: 434938 [Multi-domain]  Cd Length: 489  Bit Score: 580.41  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940     1 MHHGTGPQNVQHQLQRSRACPGSEG---EEQPAHPNPPPSPAAPFAPSASPSAPQSPSYQIqqLMNRSPATGQNVNITLQ 77
Cdd:pfam15790    1 MHHGSGSQNVQRQLQRSKSVSGSEEqqqEQQPATVNHPQSPVTTFAPAASPSAPQSPNYQI--IMSRSPVTGQNVNITLQ 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940    78 SVGPVVGGNQQITLAPLPLPSPTSPGFQFSAQPRRFEHGSPSYIQVTSPLSQQVQTQSPTQPSPGPGQALQnvRAGAPGP 157
Cdd:pfam15790   79 NVGQMVAGNQQITLTPLPLQSPASPGFQHSAPQWRFEHGSPSYIQVTSPLPQQVQPQSPTQHSPVPLQGVQ--RPGAPGT 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   158 GLGLCSSSPTGgFVDASVLVRQISL-SPSSGGHFVFQDGSGLTQIAQG-AQVQLQHPGTPITVRERRPSQPHTQSGGTIH 235
Cdd:pfam15790  157 GLGVCGQSPTR-FVDASMLVRQISLgSPSGGGHFVYQDGTGLAQIAPGaGQVQLASPGTPGSVRERRLSQPHSQTGGTIH 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   236 HLGPQSPAAAGgAGLQPLASPSHITTANLPPQISSIIQGQlvqqqqvlqgppLPRPLGFERTPGVLLPGAGGAAGF-GMT 314
Cdd:pfam15790  236 HLGPQSPAAAG-AGLQTLGSPGHITTSNLPPQISSIIQGQ------------LARPLGFEKTAQVVVAGAGGPAASfGIP 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   315 SPPPPTSPSRTAVPPGLSSLPLTSVGNTG-MKKVPKKLEEIPPASPEMAQMRKQCLDYHYQEMQALKEVFKEYLIELFFL 393
Cdd:pfam15790  303 SSIPPTSPSRTSPPPGLSSNPLTSTGMSGsVKKVPKKLEEIAPATPEIAQLRKQCLDHHTKKMESLKEVFKEYLIELFFL 382
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   394 QHFQGNMMDFLAFKKKHYAPLQAYLRQNDLDIeeeeEEEEEEEEKSEVINDEQ--------------------------- 446
Cdd:pfam15790  383 QHLQGNMMDFLAFKKKHCVPLYTYLRQNDLDL----EEEEEEEEQSEVINDEVkvvtgkdgqtgtpvaiatqlppnvsaa 458
                          490       500       510
                   ....*....|....*....|....*....|.
gi 119618940   447 --------QALAGSLVAGAGSTVETDLFKRQ 469
Cdd:pfam15790  459 fstqqqpfQAHQGTASAGITNTVEMDAFKRQ 489
DEAD-like_helicase_N super family cl28899
N-terminal helicase domain of the DEAD-box helicase superfamily; The DEAD-like helicase ...
1008-1224 1.93e-134

N-terminal helicase domain of the DEAD-box helicase superfamily; The DEAD-like helicase superfamily is a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. The N-terminal domain contains the ATP-binding region.


The actual alignment was detected with superfamily member cd18003:

Pssm-ID: 475120 [Multi-domain]  Cd Length: 223  Bit Score: 419.83  E-value: 1.93e-134
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18003     1 LREYQHIGLDWLATLYEKNLNGILADEMGLGKTIQTIALLAHLACEKGNWGPHLIVVPTSVMLNWEMEFKRWCPGFKILT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSP 1167
Cdd:cd18003    81 YYGSAKERKLKRQGWMKPNSFHVCITSYQLVVQDHQVFKRKKWKYLILDEAHNIKNFKSQRWQTLLNFNTQRRLLLTGTP 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940 1168 LHNTFLELWTMVHFLVPGI------SRPYLSSPLRAPSEESQDYYHKVVIRLHRVTQPFILRR 1224
Cdd:cd18003   161 LQNSLMELWSLMHFLMPHIfqshqeFKEWFSNPLTAMSEGSQEENEELVRRLHKVLRPFLLRR 223
HepA COG0553
Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, ...
768-1291 1.84e-60

Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, and repair];


:

Pssm-ID: 440319 [Multi-domain]  Cd Length: 682  Bit Score: 222.79  E-value: 1.84e-60
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  768 EAPRPKSHWDYLLEEMQWMATDFAQERRWKVAAAKKVVEIKLRVELEEKRKKALNLQKVSRRGKELRPKGFDALQESSLD 847
Cdd:COG0553     1 LLLLLLLLALGALGLLLTELLLLLRLGALLLELVLARELLLLLLAADALLLLALLLLLELLLLLAALLLLALLLLALSAL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  848 SGMSGRKRKASISLTDDEVDDEEETIEEEEANEGVVDHQTELSNLAKEAELPLLDLMKLYEGAFLPSSQWPRPKPDGEDT 927
Cdd:COG0553    81 ALLLLRLLLALLLLALLLLLAGLLALALLLLALLGLLLSLALLLLLLLLLLLLLLALLLVLLAALLLLLLLLLLLALLLG 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  928 SGEEDADDCPGDRESRKDLVLIDSLFIMDQFKAAERMNIGK-PNAKDIADVTAVAEAILPKGSARVTTSVKFNAPSLLYG 1006
Cdd:COG0553   161 RLLLLALLLLALEALLLLGLLLALALLALLELALLAAEAELlLLLELLLELELLAEAAVDAFRLRRLREALESLPAGLKA 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1007 ALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLAcNEGNWGPHLVVV-RSCnILKWELELKRWCPGLKI 1085
Cdd:COG0553   241 TLRPYQLEGAAWLLFLRRLGLGGLLADDMGLGKTIQALALLLELK-ERGLARPVLIVApTSL-VGNWQRELAKFAPGLRV 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1086 LSYIGShrelkAKRQEWAEP-NSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLI 1164
Cdd:COG0553   319 LVLDGT-----RERAKGANPfEDADLVITSYGLLRRDIELLAAVDWDLVILDEAQHIKNPATKRAKAVRALKARHRLALT 393
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1165 DSPLHNTFLELWTMVHFLVPGisrpYLSSPlrapsEESQDYYHKVVI--------RLHRVTQPFILRRTKRDVEKQLTKK 1236
Cdd:COG0553   394 GTPVENRLEELWSLLDFLNPG----LLGSL-----KAFRERFARPIEkgdeealeRLRRLLRPFLLRRTKEDVLKDLPEK 464
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940 1237 YEHVLKCRLSNRQKALYEDVI--LQPGTQEALKSGHFVNVLSILVRLQRICNHPGLV 1291
Cdd:COG0553   465 TEETLYVELTPEQRALYEAVLeyLRRELEGAEGIRRRGLILAALTRLRQICSHPALL 521
SF2_C_SNF cd18793
C-terminal helicase domain of the SNF family helicases; The Sucrose Non-Fermenting (SNF) ...
1814-1939 1.77e-38

C-terminal helicase domain of the SNF family helicases; The Sucrose Non-Fermenting (SNF) family includes chromatin-remodeling factors, such as CHD proteins and SMARCA proteins, recombination proteins Rad54, and many others. They are DEAD-like helicases belonging to superfamily (SF)2, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. Similar to SF1 helicases, SF2 helicases do not form toroidal structures like SF3-6 helicases. Their helicase core consists of two similar protein domains that resemble the fold of the recombination protein RecA. This model describes the C-terminal domain, also called HelicC.


:

Pssm-ID: 350180 [Multi-domain]  Cd Length: 135  Bit Score: 141.46  E-value: 1.77e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1814 SGKLEALAILLQKLKSEGRRVLILSQMILMLDILEMFLNFHYLTYVRIDENASSEQRQELMRSFNRDRRIFCAILSTHSR 1893
Cdd:cd18793    10 SGKLEALLELLEELREPGEKVLIFSQFTDTLDILEEALRERGIKYLRLDGSTSSKERQKLVDRFNEDPDIRVFLLSTKAG 89
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 119618940 1894 TTGINLVEADTVVFYDNDLNPVMDAKAQEWCDRIGRCKDIHIYRLV 1939
Cdd:cd18793    90 GVGLNLTAANRVILYDPWWNPAVEEQAIDRAHRIGQKKPVVVYRLI 135
PAT1 super family cl37801
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
518-780 1.83e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


The actual alignment was detected with superfamily member pfam09770:

Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 56.97  E-value: 1.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   518 PPTPQAAQLAGQRQSQQQYDPSTGPPVQNAASLHT---------PLPQL-----------PGRLPPAGVPTAALSSALQF 577
Cdd:pfam09770  108 AARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTgyekykepePIPDLqvdaslwgvapKKAAAPAPAPQPAAQPASLP 187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   578 AQQPQV-----VEAQTQLQIPVKTQQPNVPIPAPPSSqlpiPPSQPAQLALHVPTPGKVQVQASQLSSLPQMVASTRLPV 652
Cdd:pfam09770  188 APSRKMmsleeVEAAMRAQAKKPAQQPAPAPAQPPAA----PPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPV 263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   653 DPAPPCPRPLPTSSTSSLAPVS-GSGPGPSPARSSPV------NRPSSAtnkalspvtsRTPGVVASAPTKPQSPAQNAT 725
Cdd:pfam09770  264 TILQRPQSPQPDPAQPSIQPQAqQFHQQPPPVPVQPTqilqnpNRLSAA----------RVGYPQNPQPGVQPAPAHQAH 333
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940   726 SSQdSSQDTLTEQITLENQVHQRIAELRKAGLwsqrrlpkLQEAPRPK-SHWDYLL 780
Cdd:pfam09770  334 RQQ-GSFGRQAPIITHPQQLAQLSEEEKAAYL--------DEEAKRAKrNHKIFLL 380
PRK12323 super family cl46901
DNA polymerase III subunit gamma/tau;
2962-3081 2.86e-05

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK14971:

Pssm-ID: 481241 [Multi-domain]  Cd Length: 614  Bit Score: 49.77  E-value: 2.86e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2962 TQIQVAKLPQVVQQQTPVASIQQVASA--SQQAS-PQTVALTQATAAGQQVQMIPAVTATAQVVQQKLIQQQVVT--TAS 3036
Cdd:PRK14971  355 TLIQLAQLTQKGDDASGGRGPKQHIKPvfTQPAAaPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVdpPAA 434
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 119618940 3037 APLQTPGAPNPAQVPASSDSPSQQPKLQMRVPAVRLKTPTKPPCQ 3081
Cdd:PRK14971  435 VPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAE 479
SP1-4_N super family cl41773
N-terminal domain of transcription factor Specificity Proteins (SP) 1-4; Specificity Proteins ...
2798-3023 3.85e-04

N-terminal domain of transcription factor Specificity Proteins (SP) 1-4; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. There are many SPs in vertebrates (9 SPs in humans and mice, 7 SPs in chicken, and 11 SPs in teleost fish), but arthropods only have 3 SPs. SPs belong to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. SP1-4 have similar N-terminal transactivation domains characterized by glutamine-rich regions, which, in most cases, have adjacent serine/threonine-rich regions. This model represents the N-terminal domain of SP1-4.


The actual alignment was detected with superfamily member cd22553:

Pssm-ID: 425404 [Multi-domain]  Cd Length: 384  Bit Score: 45.79  E-value: 3.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2798 GTTVANLQVARLTRVPTSQLQAQGQMQTQAPQPAQVALAKPPVVSVPAAVVSSPGVTTLPMNVAGiSVAIGQPQKAA-GQ 2876
Cdd:cd22553    92 LLQTNNQQAIQLAPGGTQAILANQQTLIRPNTVQGQANASNVLQNIAQIASGGNAVQLPLNNMTQ-TIPVQVPVSTAnGQ 170
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2877 TV---VAQPVHMQQLLKLKQQAVQQQKAIQPQAAQ-GPAAVQQKITA------QQITTPGAQQK-------------VAY 2933
Cdd:cd22553   171 TVyqtIQVPIQAIQSGNAGGGNQALQAQVIPQLAQaAQLQPQQLAQVssqgyiQQIPANASQQQpqmvqqgpnqsgqIIG 250
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2934 AAQPALKTQFLTTPISQAQKLAGAQQVQTQIQVAKLP----QVVQQQTPVASIQQVASASQQASPQTVAL---TQATAAG 3006
Cdd:cd22553   251 QVASASSIQAAAIPLTVYTGALAGQNGSNQQQVGQIVtspiQGMTQGLTAPASSSIPTVVQQQAIQGNPLppgTQIIAAG 330
                         250
                  ....*....|....*..
gi 119618940 3007 QQVQMIPAVTATAQVVQ 3023
Cdd:cd22553   331 QQLQQDPNDPTKWQVVA 347
 
Name Accession Description Interval E-value
EP400_N pfam15790
E1A-binding protein p400, N-terminal; EP400_N is a family of eukaryote proteins. the exact ...
1-469 0e+00

E1A-binding protein p400, N-terminal; EP400_N is a family of eukaryote proteins. the exact function of this domain is not known. This family is largely low-complexity residues.


Pssm-ID: 434938 [Multi-domain]  Cd Length: 489  Bit Score: 580.41  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940     1 MHHGTGPQNVQHQLQRSRACPGSEG---EEQPAHPNPPPSPAAPFAPSASPSAPQSPSYQIqqLMNRSPATGQNVNITLQ 77
Cdd:pfam15790    1 MHHGSGSQNVQRQLQRSKSVSGSEEqqqEQQPATVNHPQSPVTTFAPAASPSAPQSPNYQI--IMSRSPVTGQNVNITLQ 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940    78 SVGPVVGGNQQITLAPLPLPSPTSPGFQFSAQPRRFEHGSPSYIQVTSPLSQQVQTQSPTQPSPGPGQALQnvRAGAPGP 157
Cdd:pfam15790   79 NVGQMVAGNQQITLTPLPLQSPASPGFQHSAPQWRFEHGSPSYIQVTSPLPQQVQPQSPTQHSPVPLQGVQ--RPGAPGT 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   158 GLGLCSSSPTGgFVDASVLVRQISL-SPSSGGHFVFQDGSGLTQIAQG-AQVQLQHPGTPITVRERRPSQPHTQSGGTIH 235
Cdd:pfam15790  157 GLGVCGQSPTR-FVDASMLVRQISLgSPSGGGHFVYQDGTGLAQIAPGaGQVQLASPGTPGSVRERRLSQPHSQTGGTIH 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   236 HLGPQSPAAAGgAGLQPLASPSHITTANLPPQISSIIQGQlvqqqqvlqgppLPRPLGFERTPGVLLPGAGGAAGF-GMT 314
Cdd:pfam15790  236 HLGPQSPAAAG-AGLQTLGSPGHITTSNLPPQISSIIQGQ------------LARPLGFEKTAQVVVAGAGGPAASfGIP 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   315 SPPPPTSPSRTAVPPGLSSLPLTSVGNTG-MKKVPKKLEEIPPASPEMAQMRKQCLDYHYQEMQALKEVFKEYLIELFFL 393
Cdd:pfam15790  303 SSIPPTSPSRTSPPPGLSSNPLTSTGMSGsVKKVPKKLEEIAPATPEIAQLRKQCLDHHTKKMESLKEVFKEYLIELFFL 382
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   394 QHFQGNMMDFLAFKKKHYAPLQAYLRQNDLDIeeeeEEEEEEEEKSEVINDEQ--------------------------- 446
Cdd:pfam15790  383 QHLQGNMMDFLAFKKKHCVPLYTYLRQNDLDL----EEEEEEEEQSEVINDEVkvvtgkdgqtgtpvaiatqlppnvsaa 458
                          490       500       510
                   ....*....|....*....|....*....|.
gi 119618940   447 --------QALAGSLVAGAGSTVETDLFKRQ 469
Cdd:pfam15790  459 fstqqqpfQAHQGTASAGITNTVEMDAFKRQ 489
DEXQc_SRCAP cd18003
DEXH/Q-box helicase domain of SRCAP; Snf2-related CBP activator (SRCAP, also known as SWR1 or ...
1008-1224 1.93e-134

DEXH/Q-box helicase domain of SRCAP; Snf2-related CBP activator (SRCAP, also known as SWR1 or DOMO1) is the core catalytic component of the multiprotein chromatin-remodeling SRCAP complex, that is necessary for the incorporation of the histone variant H2A.Z into nucleosomes. SRCAP is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350761 [Multi-domain]  Cd Length: 223  Bit Score: 419.83  E-value: 1.93e-134
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18003     1 LREYQHIGLDWLATLYEKNLNGILADEMGLGKTIQTIALLAHLACEKGNWGPHLIVVPTSVMLNWEMEFKRWCPGFKILT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSP 1167
Cdd:cd18003    81 YYGSAKERKLKRQGWMKPNSFHVCITSYQLVVQDHQVFKRKKWKYLILDEAHNIKNFKSQRWQTLLNFNTQRRLLLTGTP 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940 1168 LHNTFLELWTMVHFLVPGI------SRPYLSSPLRAPSEESQDYYHKVVIRLHRVTQPFILRR 1224
Cdd:cd18003   161 LQNSLMELWSLMHFLMPHIfqshqeFKEWFSNPLTAMSEGSQEENEELVRRLHKVLRPFLLRR 223
HepA COG0553
Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, ...
768-1291 1.84e-60

Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, and repair];


Pssm-ID: 440319 [Multi-domain]  Cd Length: 682  Bit Score: 222.79  E-value: 1.84e-60
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  768 EAPRPKSHWDYLLEEMQWMATDFAQERRWKVAAAKKVVEIKLRVELEEKRKKALNLQKVSRRGKELRPKGFDALQESSLD 847
Cdd:COG0553     1 LLLLLLLLALGALGLLLTELLLLLRLGALLLELVLARELLLLLLAADALLLLALLLLLELLLLLAALLLLALLLLALSAL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  848 SGMSGRKRKASISLTDDEVDDEEETIEEEEANEGVVDHQTELSNLAKEAELPLLDLMKLYEGAFLPSSQWPRPKPDGEDT 927
Cdd:COG0553    81 ALLLLRLLLALLLLALLLLLAGLLALALLLLALLGLLLSLALLLLLLLLLLLLLLALLLVLLAALLLLLLLLLLLALLLG 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  928 SGEEDADDCPGDRESRKDLVLIDSLFIMDQFKAAERMNIGK-PNAKDIADVTAVAEAILPKGSARVTTSVKFNAPSLLYG 1006
Cdd:COG0553   161 RLLLLALLLLALEALLLLGLLLALALLALLELALLAAEAELlLLLELLLELELLAEAAVDAFRLRRLREALESLPAGLKA 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1007 ALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLAcNEGNWGPHLVVV-RSCnILKWELELKRWCPGLKI 1085
Cdd:COG0553   241 TLRPYQLEGAAWLLFLRRLGLGGLLADDMGLGKTIQALALLLELK-ERGLARPVLIVApTSL-VGNWQRELAKFAPGLRV 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1086 LSYIGShrelkAKRQEWAEP-NSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLI 1164
Cdd:COG0553   319 LVLDGT-----RERAKGANPfEDADLVITSYGLLRRDIELLAAVDWDLVILDEAQHIKNPATKRAKAVRALKARHRLALT 393
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1165 DSPLHNTFLELWTMVHFLVPGisrpYLSSPlrapsEESQDYYHKVVI--------RLHRVTQPFILRRTKRDVEKQLTKK 1236
Cdd:COG0553   394 GTPVENRLEELWSLLDFLNPG----LLGSL-----KAFRERFARPIEkgdeealeRLRRLLRPFLLRRTKEDVLKDLPEK 464
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940 1237 YEHVLKCRLSNRQKALYEDVI--LQPGTQEALKSGHFVNVLSILVRLQRICNHPGLV 1291
Cdd:COG0553   465 TEETLYVELTPEQRALYEAVLeyLRRELEGAEGIRRRGLILAALTRLRQICSHPALL 521
PLN03142 PLN03142
Probable chromatin-remodeling complex ATPase chain; Provisional
982-1303 1.47e-59

Probable chromatin-remodeling complex ATPase chain; Provisional


Pssm-ID: 215601 [Multi-domain]  Cd Length: 1033  Bit Score: 225.84  E-value: 1.47e-59
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  982 EAILPKGSARVTTSvkfnaPSLLYGALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHL 1061
Cdd:PLN03142  149 DGLGGSGGTRLLVQ-----PSCIKGKMRDYQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLGYLHEYRGITGPHM 223
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1062 VVVRSCNILKWELELKRWCPGLKILSYIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRV 1141
Cdd:PLN03142  224 VVAPKSTLGNWMNEIRRFCPVLRAVKFHGNPEERAHQREELLVAGKFDVCVTSFEMAIKEKTALKRFSWRYIIIDEAHRI 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1142 KGMTERHWEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHFLVPGI--SRPYLSSPLRAPSEESQdyyHKVVIRLHRVTQP 1219
Cdd:PLN03142  304 KNENSLLSKTMRLFSTNYRLLITGTPLQNNLHELWALLNFLLPEIfsSAETFDEWFQISGENDQ---QEVVQQLHKVLRP 380
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1220 FILRRTKRDVEKQLTKKYEHVLKCRLSNRQKALYEdVILQPGTQEALKSGHFVNVLSILVRLQRICNHPGLVEPRHPGSS 1299
Cdd:PLN03142  381 FLLRRLKSDVEKGLPPKKETILKVGMSQMQKQYYK-ALLQKDLDVVNAGGERKRLLNIAMQLRKCCNHPYLFQGAEPGPP 459

                  ....
gi 119618940 1300 YVAG 1303
Cdd:PLN03142  460 YTTG 463
SNF2-rel_dom pfam00176
SNF2-related domain; This domain is found in proteins involved in a variety of processes ...
1011-1291 1.97e-59

SNF2-related domain; This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1). SNF2 functions as the ATPase component of the SNF2/SWI multisubunit complex, which utilizes energy derived from ATP hydrolysis to disrupt histone-DNA interactions, resulting in the increased accessibility of DNA to transcription factors.


Pssm-ID: 425504 [Multi-domain]  Cd Length: 289  Bit Score: 207.54  E-value: 1.97e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1011 YQKIGLDWLAKLYRK-NLNGILADEAGLGKTVQIIAFFAHLACNEGNWG-PHLVVVRSCNILKWELELKRWC--PGLKIL 1086
Cdd:pfam00176    1 YQIEGVNWMLSLENNlGRGGILADEMGLGKTLQTISLLLYLKHVDKNWGgPTLIVVPLSLLHNWMNEFERWVspPALRVV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1087 SYIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDS 1166
Cdd:pfam00176   81 VLHGNKRPQERWKNDPNFLADFDVVITTYETLRKHKELLKKVHWHRIVLDEGHRLKNSKSKLSKALKSLKTRNRWILTGT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1167 PLHNTFLELWTMVHFLVPGI--SRPYLSSPLRAPSEESQDYyhKVVIRLHRVTQPFILRRTKRDVEKQLTKKYEHVLKCR 1244
Cdd:pfam00176  161 PLQNNLEELWALLNFLRPGPfgSLSTFRNWFDRPIERGGGK--KGVSRLHKLLKPFLLRRTKKDVEKSLPPKVEYILFCR 238
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 119618940  1245 LSNRQKALYEDVILQPGTQEALKS--GHFVN--VLSILVRLQRICNHPGLV 1291
Cdd:pfam00176  239 LSKLQRKLYQTFLLKKDLNAIKTGegGREIKasLLNILMRLRKICNHPGLI 289
SF2_C_SNF cd18793
C-terminal helicase domain of the SNF family helicases; The Sucrose Non-Fermenting (SNF) ...
1814-1939 1.77e-38

C-terminal helicase domain of the SNF family helicases; The Sucrose Non-Fermenting (SNF) family includes chromatin-remodeling factors, such as CHD proteins and SMARCA proteins, recombination proteins Rad54, and many others. They are DEAD-like helicases belonging to superfamily (SF)2, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. Similar to SF1 helicases, SF2 helicases do not form toroidal structures like SF3-6 helicases. Their helicase core consists of two similar protein domains that resemble the fold of the recombination protein RecA. This model describes the C-terminal domain, also called HelicC.


Pssm-ID: 350180 [Multi-domain]  Cd Length: 135  Bit Score: 141.46  E-value: 1.77e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1814 SGKLEALAILLQKLKSEGRRVLILSQMILMLDILEMFLNFHYLTYVRIDENASSEQRQELMRSFNRDRRIFCAILSTHSR 1893
Cdd:cd18793    10 SGKLEALLELLEELREPGEKVLIFSQFTDTLDILEEALRERGIKYLRLDGSTSSKERQKLVDRFNEDPDIRVFLLSTKAG 89
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 119618940 1894 TTGINLVEADTVVFYDNDLNPVMDAKAQEWCDRIGRCKDIHIYRLV 1939
Cdd:cd18793    90 GVGLNLTAANRVILYDPWWNPAVEEQAIDRAHRIGQKKPVVVYRLI 135
PLN03142 PLN03142
Probable chromatin-remodeling complex ATPase chain; Provisional
1813-1954 8.52e-31

Probable chromatin-remodeling complex ATPase chain; Provisional


Pssm-ID: 215601 [Multi-domain]  Cd Length: 1033  Bit Score: 133.77  E-value: 8.52e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1813 DSGKLEALAILLQKLKSEGRRVLILSQMILMLDILEMFLNFHYLTYVRIDENASSEQRQELMRSFNRD-RRIFCAILSTH 1891
Cdd:PLN03142  469 NSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPgSEKFVFLLSTR 548
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940 1892 SRTTGINLVEADTVVFYDNDLNPVMDAKAQEWCDRIGRCKDIHIYRLVSGNSIEEKLLKNGTK 1954
Cdd:PLN03142  549 AGGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTIEEKVIERAYK 611
HepA COG0553
Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, ...
1793-1950 6.83e-25

Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, and repair];


Pssm-ID: 440319 [Multi-domain]  Cd Length: 682  Bit Score: 113.40  E-value: 6.83e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1793 LRQ-TTAPRLLqFPELRLVQFDSGKLEALAILLQKLKSEGRRVLILSQMILMLDILEMFLNFHYLTYVRIDENASSEQRQ 1871
Cdd:COG0553   511 LRQiCSHPALL-LEEGAELSGRSAKLEALLELLEELLAEGEKVLVFSQFTDTLDLLEERLEERGIEYAYLHGGTSAEERD 589
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 119618940 1872 ELMRSFNRDRRIFCAILSTHSRTTGINLVEADTVVFYDNDLNPVMDAKAQEWCDRIGRCKDIHIYRLVSGNSIEEKLLK 1950
Cdd:COG0553   590 ELVDRFQEGPEAPVFLISLKAGGEGLNLTAADHVIHYDLWWNPAVEEQAIDRAHRIGQTRDVQVYKLVAEGTIEEKILE 668
HSA smart00573
domain in helicases and associated with SANT domains;
764-828 7.09e-18

domain in helicases and associated with SANT domains;


Pssm-ID: 214727 [Multi-domain]  Cd Length: 73  Bit Score: 80.13  E-value: 7.09e-18
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940    764 PKLQEAPRPKSHWDYLLEEMQWMATDFAQERRWKVAAAKKVVEIKLR----VELEEKRKKALN----LQKVSR 828
Cdd:smart00573    1 QKLEEERRRKQHWDHLLEEMIWHAKDFKEEHKWKIAAAKKMAKAVMDyhqnKEKEEERREEKNekrrLRKLAA 73
DEXDc smart00487
DEAD-like helicases superfamily;
1001-1189 9.25e-17

DEAD-like helicases superfamily;


Pssm-ID: 214692 [Multi-domain]  Cd Length: 201  Bit Score: 81.38  E-value: 9.25e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   1001 PSLLYGALRDYQKIGLDWLAKLYRknlNGILADEAGLGKTVQIIAfFAHLACNEGNWGPHLVVV--RScniLK--WELEL 1076
Cdd:smart00487    2 EKFGFEPLRPYQKEAIEALLSGLR---DVILAAPTGSGKTLAALL-PALEALKRGKGGRVLVLVptRE---LAeqWAEEL 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   1077 KRWCP--GLKILSYIGSHRELKAKRQEWAEPnsFHVCITSYTQFFRGLT--AFTRVRWKCLVIDEMQRVKGMTER-HWEA 1151
Cdd:smart00487   75 KKLGPslGLKVVGLYGGDSKREQLRKLESGK--TDILVTTPGRLLDLLEndKLSLSNVDLVILDEAHRLLDGGFGdQLEK 152
                           170       180       190       200
                    ....*....|....*....|....*....|....*....|...
gi 119618940   1152 VFTL--QSQQRLLL---IDSPLHNTFLELWTMVHFLVPGISRP 1189
Cdd:smart00487  153 LLKLlpKNVQLLLLsatPPEEIENLLELFLNDPVFIDVGFTPL 195
HSA pfam07529
HSA domain; This domain is predicted to bind DNA and is often found associated with helicases. ...
766-819 1.46e-14

HSA domain; This domain is predicted to bind DNA and is often found associated with helicases. This region does not form a compact domain in the known structures.


Pssm-ID: 462194 [Multi-domain]  Cd Length: 67  Bit Score: 70.68  E-value: 1.46e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940   766 LQEAPR-PKSHWDYLLEEMQWMATDFAQERRWKVAAAKKVV------------EIKLRVELEEKRKK 819
Cdd:pfam07529    1 RDEPERrEKTHHDYLLEEILWHSKDFKQERRWKRARAKKLAravaqyhkniekEEQKRIEREEKQRL 67
Helicase_C pfam00271
Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, ...
1815-1930 5.86e-14

Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.


Pssm-ID: 459740 [Multi-domain]  Cd Length: 109  Bit Score: 70.32  E-value: 5.86e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1815 GKLEALAILLQKlkSEGRRVLILSQMILMLDIlEMFLNFHYLTYVRIDENASSEQRQELMRSFNRDRRIFcaILSTHSRT 1894
Cdd:pfam00271    1 EKLEALLELLKK--ERGGKVLIFSQTKKTLEA-ELLLEKEGIKVARLHGDLSQEEREEILEDFRKGKIDV--LVATDVAE 75
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 119618940  1895 TGINLVEADTVVFYDNDLNPVMDakAQewcdRIGRC 1930
Cdd:pfam00271   76 RGLDLPDVDLVINYDLPWNPASY--IQ----RIGRA 105
HELICc smart00490
helicase superfamily c-terminal domain;
1845-1930 4.78e-10

helicase superfamily c-terminal domain;


Pssm-ID: 197757 [Multi-domain]  Cd Length: 82  Bit Score: 58.38  E-value: 4.78e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   1845 DILEMFLNFHYLTYVRIDENASSEQRQELMRSFNRDRRIFcaILSTHSRTTGINLVEADTVVFYDNDLNPVMDakAQewc 1924
Cdd:smart00490    1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKV--LVATDVAERGLDLPGVDLVIIYDLPWSPASY--IQ--- 73

                    ....*.
gi 119618940   1925 dRIGRC 1930
Cdd:smart00490   74 -RIGRA 78
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
518-780 1.83e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 56.97  E-value: 1.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   518 PPTPQAAQLAGQRQSQQQYDPSTGPPVQNAASLHT---------PLPQL-----------PGRLPPAGVPTAALSSALQF 577
Cdd:pfam09770  108 AARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTgyekykepePIPDLqvdaslwgvapKKAAAPAPAPQPAAQPASLP 187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   578 AQQPQV-----VEAQTQLQIPVKTQQPNVPIPAPPSSqlpiPPSQPAQLALHVPTPGKVQVQASQLSSLPQMVASTRLPV 652
Cdd:pfam09770  188 APSRKMmsleeVEAAMRAQAKKPAQQPAPAPAQPPAA----PPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPV 263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   653 DPAPPCPRPLPTSSTSSLAPVS-GSGPGPSPARSSPV------NRPSSAtnkalspvtsRTPGVVASAPTKPQSPAQNAT 725
Cdd:pfam09770  264 TILQRPQSPQPDPAQPSIQPQAqQFHQQPPPVPVQPTqilqnpNRLSAA----------RVGYPQNPQPGVQPAPAHQAH 333
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940   726 SSQdSSQDTLTEQITLENQVHQRIAELRKAGLwsqrrlpkLQEAPRPK-SHWDYLL 780
Cdd:pfam09770  334 RQQ-GSFGRQAPIITHPQQLAQLSEEEKAAYL--------DEEAKRAKrNHKIFLL 380
PHA03247 PHA03247
large tegument protein UL36; Provisional
511-734 1.52e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 1.52e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  511 PTAQGGMPPTPQAAQLAGQRQSQQQYDPSTGP--PVQNAASLHTPLPQLPgrlPPAGVPTAALSSALQFAQQPQVVEAQT 588
Cdd:PHA03247 2735 LPAAPAPPAVPAGPATPGGPARPARPPTTAGPpaPAPPAAPAAGPPRRLT---RPAVASLSESRESLPSPWDPADPPAAV 2811
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  589 QLQIPV--KTQQPNVPIPAPPSSQLPIPPSQPAQLALHVPTPGKV------QVQASQLSSLPQMVASTRLPVDPAPPCPR 660
Cdd:PHA03247 2812 LAPAAAlpPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAV 2891
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119618940  661 PLPTSSTSSlapvsgsgPGPSPARSSPVNRPSSATNKALSPVTSRtPGVVASAPTKPQSPAQNATSSQDSSQDT 734
Cdd:PHA03247 2892 SRSTESFAL--------PPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
2962-3081 2.86e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 49.77  E-value: 2.86e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2962 TQIQVAKLPQVVQQQTPVASIQQVASA--SQQAS-PQTVALTQATAAGQQVQMIPAVTATAQVVQQKLIQQQVVT--TAS 3036
Cdd:PRK14971  355 TLIQLAQLTQKGDDASGGRGPKQHIKPvfTQPAAaPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVdpPAA 434
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 119618940 3037 APLQTPGAPNPAQVPASSDSPSQQPKLQMRVPAVRLKTPTKPPCQ 3081
Cdd:PRK14971  435 VPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAE 479
SP1-4_arthropods_N cd22553
N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; ...
2932-3064 2.44e-04

N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. There are many SPs in vertebrates (9 SPs in humans and mice, 7 SPs in the chicken, and 11 SPs in teleost fish), but arthropods only have 3 SPs. One SP is clade SP1-4, which is expressed ubiquitously throughout development. SP1-4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. This model represents the N-terminal domain of SP1-4 from arthropods.


Pssm-ID: 411778 [Multi-domain]  Cd Length: 384  Bit Score: 46.17  E-value: 2.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2932 AYAAQPALKTQFLTtPISQAQKLAGAQ--QVQTQIQVAKLPQVVQQQTPVAsIQQVASASQQASPQTvaltqATAAGQQV 3009
Cdd:cd22553   188 AGGGNQALQAQVIP-QLAQAAQLQPQQlaQVSSQGYIQQIPANASQQQPQM-VQQGPNQSGQIIGQV-----ASASSIQA 260
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940 3010 QMIPAVT-ATAQVVQQKLIQQQVVTTASAPLQTPGAPNPAQVPASSDSPSQQPKLQ 3064
Cdd:cd22553   261 AAIPLTVyTGALAGQNGSNQQQVGQIVTSPIQGMTQGLTAPASSSIPTVVQQQAIQ 316
SP1-4_arthropods_N cd22553
N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; ...
2798-3023 3.85e-04

N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. There are many SPs in vertebrates (9 SPs in humans and mice, 7 SPs in the chicken, and 11 SPs in teleost fish), but arthropods only have 3 SPs. One SP is clade SP1-4, which is expressed ubiquitously throughout development. SP1-4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. This model represents the N-terminal domain of SP1-4 from arthropods.


Pssm-ID: 411778 [Multi-domain]  Cd Length: 384  Bit Score: 45.79  E-value: 3.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2798 GTTVANLQVARLTRVPTSQLQAQGQMQTQAPQPAQVALAKPPVVSVPAAVVSSPGVTTLPMNVAGiSVAIGQPQKAA-GQ 2876
Cdd:cd22553    92 LLQTNNQQAIQLAPGGTQAILANQQTLIRPNTVQGQANASNVLQNIAQIASGGNAVQLPLNNMTQ-TIPVQVPVSTAnGQ 170
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2877 TV---VAQPVHMQQLLKLKQQAVQQQKAIQPQAAQ-GPAAVQQKITA------QQITTPGAQQK-------------VAY 2933
Cdd:cd22553   171 TVyqtIQVPIQAIQSGNAGGGNQALQAQVIPQLAQaAQLQPQQLAQVssqgyiQQIPANASQQQpqmvqqgpnqsgqIIG 250
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2934 AAQPALKTQFLTTPISQAQKLAGAQQVQTQIQVAKLP----QVVQQQTPVASIQQVASASQQASPQTVAL---TQATAAG 3006
Cdd:cd22553   251 QVASASSIQAAAIPLTVYTGALAGQNGSNQQQVGQIVtspiQGMTQGLTAPASSSIPTVVQQQAIQGNPLppgTQIIAAG 330
                         250
                  ....*....|....*..
gi 119618940 3007 QQVQMIPAVTATAQVVQ 3023
Cdd:cd22553   331 QQLQQDPNDPTKWQVVA 347
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
468-583 2.38e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 43.64  E-value: 2.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   468 RQQAMPSTGMAEQSKRPRLEVGHQG---VVFQHPGAD-AGVPL----QQLMPTAQG------GMPPTPQAAQLAGQRQSQ 533
Cdd:TIGR01628  372 QDQFMQLQPRMRQLPMGSPMGGAMGqppYYGQGPQQQfNGQPLgwprMSMMPTPMGpggplrPNGLAPMNAVRAPSRNAQ 451
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 119618940   534 QQYDPSTGPPVQ-----NAASLHTPLPQlPGRLPPAGVPTAALSSALQFA---QQPQV 583
Cdd:TIGR01628  452 NAAQKPPMQPVMyppnyQSLPLSQDLPQ-PQSTASQGGQNKKLAQVLASAtpqMQKQV 508
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
2816-3068 5.54e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 42.30  E-value: 5.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  2816 QLQAQGQMQTQAPQPAQVALAKPPVVSVPAAVVSSPGVTTLPMNVAGISVAIGQPQKAAGQTVvaqPVHMQQllklKQQA 2895
Cdd:pfam09606  222 QAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPM---GPPGQQ----PGAM 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  2896 VQQQKAIQPQAAQGPAAVQQKITAQQITTPGAQQKVAYAAQPAlktqflttpiSQAQKLAGAQQVQTQIQVAKLPQVVQQ 2975
Cdd:pfam09606  295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQG----------GQVVALGGLNHLETWNPGNFGGLGANP 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  2976 QtpvasiqqvasasQQASPQTVALTQATAAGQQVQMipavtATAQVVQQKliQQQVVTTASAPLQTPGA-------PNPA 3048
Cdd:pfam09606  365 M-------------QRGQPGMMSSPSPVPGQQVRQV-----TPNQFMRQS--PQPSVPSPQGPGSQPPQshpggmiPSPA 424
                          250       260
                   ....*....|....*....|
gi 119618940  3049 QVPASSDSPSQQPKLQMRVP 3068
Cdd:pfam09606  425 LIPSPSPQMSQQPAQQRTIG 444
 
Name Accession Description Interval E-value
EP400_N pfam15790
E1A-binding protein p400, N-terminal; EP400_N is a family of eukaryote proteins. the exact ...
1-469 0e+00

E1A-binding protein p400, N-terminal; EP400_N is a family of eukaryote proteins. the exact function of this domain is not known. This family is largely low-complexity residues.


Pssm-ID: 434938 [Multi-domain]  Cd Length: 489  Bit Score: 580.41  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940     1 MHHGTGPQNVQHQLQRSRACPGSEG---EEQPAHPNPPPSPAAPFAPSASPSAPQSPSYQIqqLMNRSPATGQNVNITLQ 77
Cdd:pfam15790    1 MHHGSGSQNVQRQLQRSKSVSGSEEqqqEQQPATVNHPQSPVTTFAPAASPSAPQSPNYQI--IMSRSPVTGQNVNITLQ 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940    78 SVGPVVGGNQQITLAPLPLPSPTSPGFQFSAQPRRFEHGSPSYIQVTSPLSQQVQTQSPTQPSPGPGQALQnvRAGAPGP 157
Cdd:pfam15790   79 NVGQMVAGNQQITLTPLPLQSPASPGFQHSAPQWRFEHGSPSYIQVTSPLPQQVQPQSPTQHSPVPLQGVQ--RPGAPGT 156
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   158 GLGLCSSSPTGgFVDASVLVRQISL-SPSSGGHFVFQDGSGLTQIAQG-AQVQLQHPGTPITVRERRPSQPHTQSGGTIH 235
Cdd:pfam15790  157 GLGVCGQSPTR-FVDASMLVRQISLgSPSGGGHFVYQDGTGLAQIAPGaGQVQLASPGTPGSVRERRLSQPHSQTGGTIH 235
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   236 HLGPQSPAAAGgAGLQPLASPSHITTANLPPQISSIIQGQlvqqqqvlqgppLPRPLGFERTPGVLLPGAGGAAGF-GMT 314
Cdd:pfam15790  236 HLGPQSPAAAG-AGLQTLGSPGHITTSNLPPQISSIIQGQ------------LARPLGFEKTAQVVVAGAGGPAASfGIP 302
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   315 SPPPPTSPSRTAVPPGLSSLPLTSVGNTG-MKKVPKKLEEIPPASPEMAQMRKQCLDYHYQEMQALKEVFKEYLIELFFL 393
Cdd:pfam15790  303 SSIPPTSPSRTSPPPGLSSNPLTSTGMSGsVKKVPKKLEEIAPATPEIAQLRKQCLDHHTKKMESLKEVFKEYLIELFFL 382
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   394 QHFQGNMMDFLAFKKKHYAPLQAYLRQNDLDIeeeeEEEEEEEEKSEVINDEQ--------------------------- 446
Cdd:pfam15790  383 QHLQGNMMDFLAFKKKHCVPLYTYLRQNDLDL----EEEEEEEEQSEVINDEVkvvtgkdgqtgtpvaiatqlppnvsaa 458
                          490       500       510
                   ....*....|....*....|....*....|.
gi 119618940   447 --------QALAGSLVAGAGSTVETDLFKRQ 469
Cdd:pfam15790  459 fstqqqpfQAHQGTASAGITNTVEMDAFKRQ 489
DEXQc_SRCAP cd18003
DEXH/Q-box helicase domain of SRCAP; Snf2-related CBP activator (SRCAP, also known as SWR1 or ...
1008-1224 1.93e-134

DEXH/Q-box helicase domain of SRCAP; Snf2-related CBP activator (SRCAP, also known as SWR1 or DOMO1) is the core catalytic component of the multiprotein chromatin-remodeling SRCAP complex, that is necessary for the incorporation of the histone variant H2A.Z into nucleosomes. SRCAP is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350761 [Multi-domain]  Cd Length: 223  Bit Score: 419.83  E-value: 1.93e-134
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18003     1 LREYQHIGLDWLATLYEKNLNGILADEMGLGKTIQTIALLAHLACEKGNWGPHLIVVPTSVMLNWEMEFKRWCPGFKILT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSP 1167
Cdd:cd18003    81 YYGSAKERKLKRQGWMKPNSFHVCITSYQLVVQDHQVFKRKKWKYLILDEAHNIKNFKSQRWQTLLNFNTQRRLLLTGTP 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940 1168 LHNTFLELWTMVHFLVPGI------SRPYLSSPLRAPSEESQDYYHKVVIRLHRVTQPFILRR 1224
Cdd:cd18003   161 LQNSLMELWSLMHFLMPHIfqshqeFKEWFSNPLTAMSEGSQEENEELVRRLHKVLRPFLLRR 223
HepA COG0553
Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, ...
768-1291 1.84e-60

Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, and repair];


Pssm-ID: 440319 [Multi-domain]  Cd Length: 682  Bit Score: 222.79  E-value: 1.84e-60
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  768 EAPRPKSHWDYLLEEMQWMATDFAQERRWKVAAAKKVVEIKLRVELEEKRKKALNLQKVSRRGKELRPKGFDALQESSLD 847
Cdd:COG0553     1 LLLLLLLLALGALGLLLTELLLLLRLGALLLELVLARELLLLLLAADALLLLALLLLLELLLLLAALLLLALLLLALSAL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  848 SGMSGRKRKASISLTDDEVDDEEETIEEEEANEGVVDHQTELSNLAKEAELPLLDLMKLYEGAFLPSSQWPRPKPDGEDT 927
Cdd:COG0553    81 ALLLLRLLLALLLLALLLLLAGLLALALLLLALLGLLLSLALLLLLLLLLLLLLLALLLVLLAALLLLLLLLLLLALLLG 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  928 SGEEDADDCPGDRESRKDLVLIDSLFIMDQFKAAERMNIGK-PNAKDIADVTAVAEAILPKGSARVTTSVKFNAPSLLYG 1006
Cdd:COG0553   161 RLLLLALLLLALEALLLLGLLLALALLALLELALLAAEAELlLLLELLLELELLAEAAVDAFRLRRLREALESLPAGLKA 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1007 ALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLAcNEGNWGPHLVVV-RSCnILKWELELKRWCPGLKI 1085
Cdd:COG0553   241 TLRPYQLEGAAWLLFLRRLGLGGLLADDMGLGKTIQALALLLELK-ERGLARPVLIVApTSL-VGNWQRELAKFAPGLRV 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1086 LSYIGShrelkAKRQEWAEP-NSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLI 1164
Cdd:COG0553   319 LVLDGT-----RERAKGANPfEDADLVITSYGLLRRDIELLAAVDWDLVILDEAQHIKNPATKRAKAVRALKARHRLALT 393
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1165 DSPLHNTFLELWTMVHFLVPGisrpYLSSPlrapsEESQDYYHKVVI--------RLHRVTQPFILRRTKRDVEKQLTKK 1236
Cdd:COG0553   394 GTPVENRLEELWSLLDFLNPG----LLGSL-----KAFRERFARPIEkgdeealeRLRRLLRPFLLRRTKEDVLKDLPEK 464
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940 1237 YEHVLKCRLSNRQKALYEDVI--LQPGTQEALKSGHFVNVLSILVRLQRICNHPGLV 1291
Cdd:COG0553   465 TEETLYVELTPEQRALYEAVLeyLRRELEGAEGIRRRGLILAALTRLRQICSHPALL 521
PLN03142 PLN03142
Probable chromatin-remodeling complex ATPase chain; Provisional
982-1303 1.47e-59

Probable chromatin-remodeling complex ATPase chain; Provisional


Pssm-ID: 215601 [Multi-domain]  Cd Length: 1033  Bit Score: 225.84  E-value: 1.47e-59
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  982 EAILPKGSARVTTSvkfnaPSLLYGALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHL 1061
Cdd:PLN03142  149 DGLGGSGGTRLLVQ-----PSCIKGKMRDYQLAGLNWLIRLYENGINGILADEMGLGKTLQTISLLGYLHEYRGITGPHM 223
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1062 VVVRSCNILKWELELKRWCPGLKILSYIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRV 1141
Cdd:PLN03142  224 VVAPKSTLGNWMNEIRRFCPVLRAVKFHGNPEERAHQREELLVAGKFDVCVTSFEMAIKEKTALKRFSWRYIIIDEAHRI 303
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1142 KGMTERHWEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHFLVPGI--SRPYLSSPLRAPSEESQdyyHKVVIRLHRVTQP 1219
Cdd:PLN03142  304 KNENSLLSKTMRLFSTNYRLLITGTPLQNNLHELWALLNFLLPEIfsSAETFDEWFQISGENDQ---QEVVQQLHKVLRP 380
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1220 FILRRTKRDVEKQLTKKYEHVLKCRLSNRQKALYEdVILQPGTQEALKSGHFVNVLSILVRLQRICNHPGLVEPRHPGSS 1299
Cdd:PLN03142  381 FLLRRLKSDVEKGLPPKKETILKVGMSQMQKQYYK-ALLQKDLDVVNAGGERKRLLNIAMQLRKCCNHPYLFQGAEPGPP 459

                  ....
gi 119618940 1300 YVAG 1303
Cdd:PLN03142  460 YTTG 463
SNF2-rel_dom pfam00176
SNF2-related domain; This domain is found in proteins involved in a variety of processes ...
1011-1291 1.97e-59

SNF2-related domain; This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1). SNF2 functions as the ATPase component of the SNF2/SWI multisubunit complex, which utilizes energy derived from ATP hydrolysis to disrupt histone-DNA interactions, resulting in the increased accessibility of DNA to transcription factors.


Pssm-ID: 425504 [Multi-domain]  Cd Length: 289  Bit Score: 207.54  E-value: 1.97e-59
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1011 YQKIGLDWLAKLYRK-NLNGILADEAGLGKTVQIIAFFAHLACNEGNWG-PHLVVVRSCNILKWELELKRWC--PGLKIL 1086
Cdd:pfam00176    1 YQIEGVNWMLSLENNlGRGGILADEMGLGKTLQTISLLLYLKHVDKNWGgPTLIVVPLSLLHNWMNEFERWVspPALRVV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1087 SYIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDS 1166
Cdd:pfam00176   81 VLHGNKRPQERWKNDPNFLADFDVVITTYETLRKHKELLKKVHWHRIVLDEGHRLKNSKSKLSKALKSLKTRNRWILTGT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1167 PLHNTFLELWTMVHFLVPGI--SRPYLSSPLRAPSEESQDYyhKVVIRLHRVTQPFILRRTKRDVEKQLTKKYEHVLKCR 1244
Cdd:pfam00176  161 PLQNNLEELWALLNFLRPGPfgSLSTFRNWFDRPIERGGGK--KGVSRLHKLLKPFLLRRTKKDVEKSLPPKVEYILFCR 238
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|.
gi 119618940  1245 LSNRQKALYEDVILQPGTQEALKS--GHFVN--VLSILVRLQRICNHPGLV 1291
Cdd:pfam00176  239 LSKLQRKLYQTFLLKKDLNAIKTGegGREIKasLLNILMRLRKICNHPGLI 289
DEXHc_SMARCA1_SMARCA5 cd17997
DEAH-box helicase domain of SMARCA1 and SMARCA5; SWI/SNF related, matrix associated, actin ...
1006-1226 6.03e-56

DEAH-box helicase domain of SMARCA1 and SMARCA5; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 1 and 5 (SMARCA1 and SMARCA5) are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350755 [Multi-domain]  Cd Length: 222  Bit Score: 194.85  E-value: 6.03e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1006 GALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKI 1085
Cdd:cd17997     2 GTMRDYQIRGLNWLISLFENGINGILADEMGLGKTLQTISLLGYLKHYKNINGPHLIIVPKSTLDNWMREFKRWCPSLRV 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1086 LSYIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLID 1165
Cdd:cd17997    82 VVLIGDKEERADIIRDVLLPGKFDVCITSYEMVIKEKTVLKKFNWRYIIIDEAHRIKNEKSKLSQIVRLFNSRNRLLLTG 161
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119618940 1166 SPLHNTFLELWTMVHFLVPGIsrpYLSSPLRAP---SEESQDYYHKVVIRLHRVTQPFILRRTK 1226
Cdd:cd17997   162 TPLQNNLHELWALLNFLLPDV---FTSSEDFDEwfnVNNCDDDNQEVVQRLHKVLRPFLLRRIK 222
DEXHc_Snf cd17919
DEXH/Q-box helicase domain of DEAD-like helicase Snf family proteins; Sucrose Non-Fermenting ...
1008-1184 6.01e-53

DEXH/Q-box helicase domain of DEAD-like helicase Snf family proteins; Sucrose Non-Fermenting (SNF) proteins DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350677 [Multi-domain]  Cd Length: 182  Bit Score: 184.69  E-value: 6.01e-53
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd17919     1 LRPYQLEGLNFLLELYENGPGGILADEMGLGKTLQAIAFLAYLLKEGKERGPVLVVCPLSVLENWEREFEKWTPDLRVVV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGShRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSP 1167
Cdd:cd17919    81 YHGS-QRERAQIRAKEKLDKFDVVLTTYETLRRDKASLRKFRWDLVVVDEAHRLKNPKSQLSKALKALRAKRRLLLTGTP 159
                         170
                  ....*....|....*..
gi 119618940 1168 LHNTFLELWTMVHFLVP 1184
Cdd:cd17919   160 LQNNLEELWALLDFLDP 176
DEXQc_INO80 cd18002
DEAQ-box helicase domain of INO80; INO80 is the catalytic ATPase subunit of the INO80 ...
1008-1224 4.65e-52

DEAQ-box helicase domain of INO80; INO80 is the catalytic ATPase subunit of the INO80 chromatin remodeling complex. INO80 removes histone H3-containing nucleosomes from associated chromatin, promotes CENP-ACnp1 chromatin assembly at the centromere in a redundant manner with another chromatin-remodeling factor Chd1Hrp1. INO80 mutants have severe defects in oxygen consumption and promiscuous cell division that is no longer coupled with metabolic status. INO80 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350760 [Multi-domain]  Cd Length: 229  Bit Score: 183.86  E-value: 4.65e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18002     1 LKEYQLKGLNWLANLYEQGINGILADEMGLGKTVQSIAVLAHLAEEHNIWGPFLVIAPASTLHNWQQEISRFVPQFKVLP 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQEWAEPN------SFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRL 1161
Cdd:cd18002    81 YWGNPKDRKVLRKFWDRKNlytrdaPFHVVITSYQLVVQDEKYFQRVKWQYMVLDEAQAIKSSSSSRWKTLLSFHCRNRL 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 119618940 1162 LLIDSPLHNTFLELWTMVHFLVPGI------SRPYLSSPLRAPSEESQDYYHKVVIRLHRVTQPFILRR 1224
Cdd:cd18002   161 LLTGTPIQNSMAELWALLHFIMPTLfdshdeFNEWFSKDIESHAENKTGLNEHQLKRLHMILKPFMLRR 229
DEXQc_arch_SWI2_SNF2 cd18012
DEAQ-box helicase domain of archaeal and bacterial SNF2-related proteins; Proteins belonging ...
1006-1226 3.25e-49

DEAQ-box helicase domain of archaeal and bacterial SNF2-related proteins; Proteins belonging to SNF2 family of DNA dependent ATPases are important members of the chromatin remodeling complexes that are implicated in epigenetic control of gene expression. The Snf2 family comprises a large group of ATP-hydrolyzing proteins that are ubiquitous in eukaryotes, but also present in eubacteria and archaea. Archaeal SWI2 and SNF2 are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350770 [Multi-domain]  Cd Length: 218  Bit Score: 175.45  E-value: 3.25e-49
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1006 GALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLAcNEGNWGPHLVVVRSCNILKWELELKRWCPGLKI 1085
Cdd:cd18012     3 ATLRPYQKEGFNWLSFLRHYGLGGILADDMGLGKTLQTLALLLSRK-EEGRKGPSLVVAPTSLIYNWEEEAAKFAPELKV 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1086 LSYIGShrelKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLID 1165
Cdd:cd18012    82 LVIHGT----KRKREKLRALEDYDLVITSYGLLRRDIELLKEVKFHYLVLDEAQNIKNPQTKTAKAVKALKADHRLALTG 157
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940 1166 SPLHNTFLELWTMVHFLVPGisrpYLSSP------LRAPSEESQDyyHKVVIRLHRVTQPFILRRTK 1226
Cdd:cd18012   158 TPIENHLGELWSIFDFLNPG----LLGSYkrfkkrFAKPIEKDGD--EEALEELKKLISPFILRRLK 218
DEXHc_SMARCA2_SMARCA4 cd17996
DEXH-box helicase domain of SMARCA2 and SMARCA4; SWI/SNF related, matrix associated, actin ...
1006-1226 3.16e-48

DEXH-box helicase domain of SMARCA2 and SMARCA4; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, members 2 and 4 (SMARCA2 and SMARCA4) are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350754 [Multi-domain]  Cd Length: 233  Bit Score: 172.94  E-value: 3.16e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1006 GALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKI 1085
Cdd:cd17996     2 GTLKEYQLKGLQWMVSLYNNNLNGILADEMGLGKTIQTISLITYLMEKKKNNGPYLVIVPLSTLSNWVSEFEKWAPSVSK 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1086 LSYIGShRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKgmtERHWEAVFTL----QSQQRL 1161
Cdd:cd17996    82 IVYKGT-PDVRKKLQSQIRAGKFNVLLTTYEYIIKDKPLLSKIKWKYMIIDEGHRMK---NAQSKLTQTLntyyHARYRL 157
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940 1162 LLIDSPLHNTFLELWTMVHFLVPGI--SRP----YLSSPLRAPSEESQDYYHK-----VVIRLHRVTQPFILRRTK 1226
Cdd:cd17996   158 LLTGTPLQNNLPELWALLNFLLPKIfkSCKtfeqWFNTPFANTGEQVKIELNEeetllIIRRLHKVLRPFLLRRLK 233
DEXHc_HELLS_SMARCA6 cd18009
DEXH-box helicase domain of HELLS; HELLS (helicase, lymphoid specific, also known as Lsh or ...
1006-1226 1.71e-47

DEXH-box helicase domain of HELLS; HELLS (helicase, lymphoid specific, also known as Lsh or SMARCA6) is a major epigenetic regulator crucial for normal heterochromatin structure and function. HELLS is part of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350767 [Multi-domain]  Cd Length: 236  Bit Score: 171.03  E-value: 1.71e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1006 GALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNeGNWGPHLVVVRSCNILKWELELKRWCPGLKI 1085
Cdd:cd18009     2 GVMRPYQLEGMEWLRMLWENGINGILADEMGLGKTIQTIALLAHLRER-GVWGPFLVIAPLSTLPNWVNEFARFTPSVPV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1086 LSYIGSHRE-LKAKRQEWAEPNS---FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRL 1161
Cdd:cd18009    81 LLYHGTKEErERLRKKIMKREGTlqdFPVVVTSYEIAMRDRKALQHYAWKYLIVDEGHRLKNLNCRLIQELKTFNSDNRL 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940 1162 LLIDSPLHNTFLELWTMVHFLVPGISRPY-----------LSSPLRAPSEESQDYYHKVVIRLHRVTQPFILRRTK 1226
Cdd:cd18009   161 LLTGTPLQNNLSELWSLLNFLLPDVFDDLssfeswfdfssLSDNAADISNLSEEREQNIVHMLHAILKPFLLRRLK 236
DEXHc_SMARCA5 cd18064
DEAH-box helicase domain of SMARCA5; SWI/SNF related, matrix associated, actin dependent ...
999-1236 2.21e-47

DEAH-box helicase domain of SMARCA5; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 5 (SMARCA5, also called SNF2H) is the catalytic subunit of the four known chromatin-remodeling complexes: CHRAC, RSF, ACF/WCRF, and WICH. SMARCA5 plays a major role organising arrays of nucleosomes adjacent to the binding sites for the architectural transcription factor CTCF sites and acts to promote CTCF binding SMARCA5 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350822 [Multi-domain]  Cd Length: 244  Bit Score: 171.00  E-value: 2.21e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  999 NAPSLL-YGALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELK 1077
Cdd:cd18064     6 DSPSYVkWGKLRDYQVRGLNWLISLYENGINGILADEMGLGKTLQTISLLGYMKHYRNIPGPHMVLVPKSTLHNWMAEFK 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1078 RWCPGLKILSYIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQS 1157
Cdd:cd18064    86 RWVPTLRAVCLIGDKDQRAAFVRDVLLPGEWDVCVTSYEMLIKEKSVFKKFNWRYLVIDEAHRIKNEKSKLSEIVREFKT 165
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1158 QQRLLLIDSPLHNTFLELWTMVHFLVPGISRpylssplrapSEESQDYY---------HKVVIRLHRVTQPFILRRTKRD 1228
Cdd:cd18064   166 TNRLLLTGTPLQNNLHELWALLNFLLPDVFN----------SAEDFDSWfdtnnclgdQKLVERLHMVLRPFLLRRIKAD 235

                  ....*...
gi 119618940 1229 VEKQLTKK 1236
Cdd:cd18064   236 VEKSLPPK 243
DEXHc_SMARCAD1 cd17998
DEXH-box helicase domain of SMARCAD1; SWI/SNF-related matrix-associated actin-dependent ...
1008-1184 8.72e-45

DEXH-box helicase domain of SMARCAD1; SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A containing DEAD/H box 1 (SMARCAD1, also known as ATP-dependent helicase 1 or Hel1) possesses intrinsic ATP-dependent nucleosome-remodeling activity and is required for both DNA repair and heterochromatin organization. SMARCAD1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350756 [Multi-domain]  Cd Length: 187  Bit Score: 161.40  E-value: 8.72e-45
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLAcNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd17998     1 LKDYQLIGLNWLNLLYQKKLSGILADEMGLGKTIQVIAFLAYLK-EIGIPGPHLVVVPSSTLDNWLREFKRWCPSLKVEP 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQEwAEPN--SFHVCITSYT---------QFFRgltaftRVRWKCLVIDEMQRVKGMTERHWEAVFTLQ 1156
Cdd:cd17998    80 YYGSQEERKHLRYD-ILKGleDFDVIVTTYNlatsnpddrSFFK------RLKLNYVVYDEGHMLKNMTSERYRHLMTIN 152
                         170       180
                  ....*....|....*....|....*...
gi 119618940 1157 SQQRLLLIDSPLHNTFLELWTMVHFLVP 1184
Cdd:cd17998   153 ANFRLLLTGTPLQNNLLELMSLLNFIMP 180
DEXHc_CHD1_2 cd17993
DEXH-box helicase domain of the chromodomain helicase DNA binding proteins 1 and 2, and ...
1008-1224 1.08e-44

DEXH-box helicase domain of the chromodomain helicase DNA binding proteins 1 and 2, and similar proteins; Chromodomain-helicase-DNA-binding protein 1 (CHD1) is an ATP-dependent chromatin-remodeling factor which functions as the substrate recognition component of the transcription regulatory histone acetylation (HAT) complex SAGA. It regulates polymerase II transcription and is also required for efficient transcription by RNA polymerase I, and more specifically the polymerase I transcription termination step. It is not only involved in transcription-related chromatin-remodeling, but is also required to maintain a specific chromatin configuration across the genome. CHD1 is also associated with histone deacetylase (HDAC) activity. Chromodomain-helicase-DNA-binding protein 2 (CHD2) is a DNA-binding helicase that specifically binds to the promoter of target genes, leading to chromatin remodeling, possibly by promoting deposition of histone H3.3. It is involved in myogenesis via interaction with MYOD1; it binds to myogenic gene regulatory sequences and mediates incorporation of histone H3.3 prior to the onset of myogenic gene expression, promoting their expression. Both are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350751 [Multi-domain]  Cd Length: 218  Bit Score: 162.53  E-value: 1.08e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd17993     2 LRDYQLTGLNWLAHSWCKGNNGILADEMGLGKTVQTISFLSYLFHSQQQYGPFLVVVPLSTMPAWQREFAKWAPDMNVIV 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQ-EWAEPNS----FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLL 1162
Cdd:cd17993    82 YLGDIKSRDTIREyEFYFSQTkklkFNVLLTTYEIILKDKAFLGSIKWQYLAVDEAHRLKNDESLLYEALKEFKTNNRLL 161
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940 1163 LIDSPLHNTFLELWTMVHFLVPGisRPYLSSPLR-APSEESQDYYHkvviRLHRVTQPFILRR 1224
Cdd:cd17993   162 ITGTPLQNSLKELWALLHFLMPG--KFDIWEEFEeEHDEEQEKGIA----DLHKELEPFILRR 218
DEXHc_SMARCA1 cd18065
DEAH-box helicase domain of SMARCA1; SWI/SNF related, matrix associated, actin dependent ...
1000-1226 1.35e-43

DEAH-box helicase domain of SMARCA1; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 1 (SMARCA1, also called SNF2L) is a component of NURF (nucleosome-remodeling factor) and CERF (CECR2-containing-remodeling factor) complexes which promote the perturbation of chromatin structure in an ATP-dependent manner. SMARCA1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350823 [Multi-domain]  Cd Length: 233  Bit Score: 159.80  E-value: 1.35e-43
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1000 APSLLYGA-LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKR 1078
Cdd:cd18065     7 SPSYVKGGtLRDYQVRGLNWMISLYENGVNGILADEMGLGKTLQTIALLGYLKHYRNIPGPHMVLVPKSTLHNWMNEFKR 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1079 WCPGLKILSYIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQ 1158
Cdd:cd18065    87 WVPSLRAVCLIGDKDARAAFIRDVMMPGEWDVCVTSYEMVIKEKSVFKKFNWRYLVIDEAHRIKNEKSKLSEIVREFKTT 166
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940 1159 QRLLLIDSPLHNTFLELWTMVHFLVPGISRpylssplrapSEESQDYY---------HKVVIRLHRVTQPFILRRTK 1226
Cdd:cd18065   167 NRLLLTGTPLQNNLHELWALLNFLLPDVFN----------SADDFDSWfdtknclgdQKLVERLHAVLKPFLLRRIK 233
DEXHc_CHD6_7_8_9 cd17995
DEXH-box helicase domain of the chromodomain helicase DNA binding protein 6, 7, 8 and 9; ...
1008-1224 1.72e-39

DEXH-box helicase domain of the chromodomain helicase DNA binding protein 6, 7, 8 and 9; Chromodomain-helicase-DNA-binding protein 6-9 (CHD6, CHD7, CHD8, and CHD9) are members of the DEAD-like helicases superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350753 [Multi-domain]  Cd Length: 223  Bit Score: 147.78  E-value: 1.72e-39
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPgLKILS 1087
Cdd:cd17995     1 LRDYQLEGVNWLLFNWYNRRNCILADEMGLGKTIQSIAFLEHLYQVEGIRGPFLVIAPLSTIPNWQREFETWTD-MNVVV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSH--RELKAKRQ-----EWAEPNS----FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQ 1156
Cdd:cd17995    80 YHGSGesRQIIQQYEmyfkdAQGRKKKgvykFDVLITTYEMVIADAEELRKIPWRVVVVDEAHRLKNRNSKLLQGLKKLT 159
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119618940 1157 SQQRLLLIDSPLHNTFLELWTMVHFLVPGisrpylssplRAPSEES--QDY----YHKVVIRLHRVTQPFILRR 1224
Cdd:cd17995   160 LEHKLLLTGTPLQNNTEELWSLLNFLEPE----------KFPSSEEflEEFgdlkTAEQVEKLQALLKPYMLRR 223
DEXHc_CHD2 cd18054
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 2; ...
1001-1224 1.55e-38

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 2; Chromodomain-helicase-DNA-binding protein 2 (CHD2) is a DNA-binding helicase that specifically binds to the promoter of target genes, leading to chromatin remodeling, possibly by promoting deposition of histone H3.3. It is involved in myogenesis via interaction with MYOD1; it binds to myogenic gene regulatory sequences and mediates incorporation of histone H3.3 prior to the onset of myogenic gene expression, promoting their expression. CHD2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350812 [Multi-domain]  Cd Length: 237  Bit Score: 145.53  E-value: 1.55e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1001 PSLLYGA---LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELK 1077
Cdd:cd18054    11 PSYIGGEnleLRDYQLEGLNWLAHSWCKNNSVILADEMGLGKTIQTISFLSYLFHQHQLYGPFLLVVPLSTLTSWQREFE 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1078 RWCPGLKILSYIGSHRELKAKRQ-EWAEPNS----FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAV 1152
Cdd:cd18054    91 IWAPEINVVVYIGDLMSRNTIREyEWIHSQTkrlkFNALITTYEILLKDKTVLGSINWAFLGVDEAHRLKNDDSLLYKTL 170
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940 1153 FTLQSQQRLLLIDSPLHNTFLELWTMVHFLVPGISRPYlssplrapsEESQDYYHKV----VIRLHRVTQPFILRR 1224
Cdd:cd18054   171 IDFKSNHRLLITGTPLQNSLKELWSLLHFIMPEKFEFW---------EDFEEDHGKGrengYQSLHKVLEPFLLRR 237
SF2_C_SNF cd18793
C-terminal helicase domain of the SNF family helicases; The Sucrose Non-Fermenting (SNF) ...
1814-1939 1.77e-38

C-terminal helicase domain of the SNF family helicases; The Sucrose Non-Fermenting (SNF) family includes chromatin-remodeling factors, such as CHD proteins and SMARCA proteins, recombination proteins Rad54, and many others. They are DEAD-like helicases belonging to superfamily (SF)2, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. Similar to SF1 helicases, SF2 helicases do not form toroidal structures like SF3-6 helicases. Their helicase core consists of two similar protein domains that resemble the fold of the recombination protein RecA. This model describes the C-terminal domain, also called HelicC.


Pssm-ID: 350180 [Multi-domain]  Cd Length: 135  Bit Score: 141.46  E-value: 1.77e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1814 SGKLEALAILLQKLKSEGRRVLILSQMILMLDILEMFLNFHYLTYVRIDENASSEQRQELMRSFNRDRRIFCAILSTHSR 1893
Cdd:cd18793    10 SGKLEALLELLEELREPGEKVLIFSQFTDTLDILEEALRERGIKYLRLDGSTSSKERQKLVDRFNEDPDIRVFLLSTKAG 89
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 119618940 1894 TTGINLVEADTVVFYDNDLNPVMDAKAQEWCDRIGRCKDIHIYRLV 1939
Cdd:cd18793    90 GVGLNLTAANRVILYDPWWNPAVEEQAIDRAHRIGQKKPVVVYRLI 135
DEXHc_SMARCA4 cd18062
DEXH-box helicase domain of SMARCA4; SWI/SNF related, matrix associated, actin dependent ...
992-1226 2.45e-35

DEXH-box helicase domain of SMARCA4; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 4 (SMARCA4, also known as transcription activator BRG1) is a component of the CREST-BRG1 complex that regulates promoter activation by orchestrating a calcium-dependent release of a repressor complex and a recruitment of an activator complex. Mutation of SMARCA4 (BRG1), the ATPase of BAF (mSWI/SNF) and PBAF complexes, contributes to a range of malignancies and neurologic disorders. SMARCA4 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350820 [Multi-domain]  Cd Length: 251  Bit Score: 136.71  E-value: 2.45e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  992 VTTSVKFNAPSLLYGALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILK 1071
Cdd:cd18062     8 VTEKVEKQSSLLVNGVLKQYQIKGLEWLVSLYNNNLNGILADEMGLGKTIQTIALITYLMEHKRINGPFLIIVPLSTLSN 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1072 WELELKRWCPGLKILSYIGShrelKAKRQEWA---EPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKgmtERH 1148
Cdd:cd18062    88 WVYEFDKWAPSVVKVSYKGS----PAARRAFVpqlRSGKFNVLLTTYEYIIKDKQILAKIRWKYMIVDEGHRMK---NHH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1149 WEAVFTLQSQ----QRLLLIDSPLHNTFLELWTMVHFLVPGISRP------YLSSPLRAPSEE---SQDYYHKVVIRLHR 1215
Cdd:cd18062   161 CKLTQVLNTHyvapRRLLLTGTPLQNKLPELWALLNFLLPTIFKScstfeqWFNAPFAMTGEKvdlNEEETILIIRRLHK 240
                         250
                  ....*....|.
gi 119618940 1216 VTQPFILRRTK 1226
Cdd:cd18062   241 VLRPFLLRRLK 251
DEXHc_SMARCA2 cd18063
DEXH-box helicase domain of SMARCA2; SWI/SNF related, matrix associated, actin dependent ...
992-1226 4.21e-35

DEXH-box helicase domain of SMARCA2; SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a, member 2 (SMARCA2, also known as brahma homolog) is a component of the BAF complex. SMARCA2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350821 [Multi-domain]  Cd Length: 251  Bit Score: 135.96  E-value: 4.21e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  992 VTTSVKFNAPSLLYGALRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILK 1071
Cdd:cd18063     8 ITERVEKQSSLLINGTLKHYQLQGLEWMVSLYNNNLNGILADEMGLGKTIQTIALITYLMEHKRLNGPYLIIVPLSTLSN 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1072 WELELKRWCPGLKILSYIGShRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKgmtERHWEA 1151
Cdd:cd18063    88 WTYEFDKWAPSVVKISYKGT-PAMRRSLVPQLRSGKFNVLLTTYEYIIKDKHILAKIRWKYMIVDEGHRMK---NHHCKL 163
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1152 VFTLQSQ----QRLLLIDSPLHNTFLELWTMVHFLVPGISRP------YLSSPLRAPSEE---SQDYYHKVVIRLHRVTQ 1218
Cdd:cd18063   164 TQVLNTHyvapRRILLTGTPLQNKLPELWALLNFLLPTIFKScstfeqWFNAPFAMTGERvdlNEEETILIIRRLHKVLR 243

                  ....*...
gi 119618940 1219 PFILRRTK 1226
Cdd:cd18063   244 PFLLRRLK 251
DEXHc_CHD1L cd18006
DEAH/Q-box helicase domain of CHD1L; Chromodomain helicase DNA binding protein 1 like (CHD1L, ...
1008-1224 4.28e-35

DEAH/Q-box helicase domain of CHD1L; Chromodomain helicase DNA binding protein 1 like (CHD1L, also known as ALC1) is involved in DNA repair by regulating chromatin relaxation following DNA damage. CHD1L is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350764 [Multi-domain]  Cd Length: 216  Bit Score: 134.87  E-value: 4.28e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18006     1 LRPYQLEGVNWLLQCRAEQHGCILGDEMGLGKTCQTISLLWYLAGRLKLLGPFLVLCPLSVLDNWKEELNRFAPDLSVIT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSP 1167
Cdd:cd18006    81 YMGDKEKRLDLQQDIKSTNRFHVLLTTYEICLKDASFLKSFPWASLVVDEAHRLKNQNSLLHKTLSEFSVDFRLLLTGTP 160
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940 1168 LHNTFLELWTMVHFLvpgisrpylsSPLRAPSEESQDYYHK---------VVIRLHRVTQPFILRR 1224
Cdd:cd18006   161 IQNSLQELYALLSFI----------EPNVFPKDKLDDFIKAysetddeseTVEELHLLLQPFLLRR 216
DEXHc_Mot1 cd17999
DEXH-box helicase domain of Mot1; Modifier of transcription 1 (Mot1, also known as TAF172 in ...
1008-1224 6.17e-32

DEXH-box helicase domain of Mot1; Modifier of transcription 1 (Mot1, also known as TAF172 in eukaryotes) regulates transcription in association with TATA binding protein (TBP). Mot1, Ino80C, and NC2 function coordinately to regulate pervasive transcription in yeast and mammals. Mot1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350757 [Multi-domain]  Cd Length: 232  Bit Score: 126.31  E-value: 6.17e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQ---IIAFFAHLACNEGNWG--PHLVVVRSCNILKWELELKRWCP- 1081
Cdd:cd17999     1 LRPYQQEGINWLAFLNKYNLHGILCDDMGLGKTLQtlcILASDHHKRANSFNSEnlPSLVVCPPTLVGHWVAEIKKYFPn 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1082 -GLKILSYIGSHRElkaKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQR 1160
Cdd:cd17999    81 aFLKPLAYVGPPQE---RRRLREQGEKHNVIVASYDVLRNDIEVLTKIEWNYCVLDEGHIIKNSKTKLSKAVKQLKANHR 157
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 119618940 1161 LLLIDSPLHNTFLELWTMVHFLVPG-----------ISRPYLSSPLRAPSEESQDYYHKVVIRLHRVTQPFILRR 1224
Cdd:cd17999   158 LILSGTPIQNNVLELWSLFDFLMPGylgtekqfqrrFLKPILASRDSKASAKEQEAGALALEALHKQVLPFLLRR 232
DEXHc_CHD1 cd18053
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 1; ...
1008-1224 3.04e-31

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 1; Chromodomain-helicase-DNA-binding protein 1 (CHD1) is an ATP-dependent chromatin-remodeling factor which functions as substrate recognition component of the transcription regulatory histone acetylation (HAT) complex SAGA. It regulates polymerase II transcription and is also required for efficient transcription by RNA polymerase I, and more specifically the polymerase I transcription termination step. It is not only involved in transcription-related chromatin-remodeling, but also required to maintain a specific chromatin configuration across the genome. CHD1 is also associated with histone deacetylase (HDAC) activity. It is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350811 [Multi-domain]  Cd Length: 237  Bit Score: 124.39  E-value: 3.04e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18053    21 LRDYQLNGLNWLAHSWCKGNSCILADEMGLGKTIQTISFLNYLFHEHQLYGPFLLVVPLSTLTSWQREIQTWAPQMNAVV 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKR-QEWAEPNS----FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLL 1162
Cdd:cd18053   101 YLGDINSRNMIRtHEWMHPQTkrlkFNILLTTYEILLKDKSFLGGLNWAFIGVDEAHRLKNDDSLLYKTLIDFKSNHRLL 180
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 119618940 1163 LIDSPLHNTFLELWTMVHFLVPGISRPYlsSPLRAPSEESQDYYHKvviRLHRVTQPFILRR 1224
Cdd:cd18053   181 ITGTPLQNSLKELWSLLHFIMPEKFSSW--EDFEEEHGKGREYGYA---SLHKELEPFLLRR 237
PLN03142 PLN03142
Probable chromatin-remodeling complex ATPase chain; Provisional
1813-1954 8.52e-31

Probable chromatin-remodeling complex ATPase chain; Provisional


Pssm-ID: 215601 [Multi-domain]  Cd Length: 1033  Bit Score: 133.77  E-value: 8.52e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1813 DSGKLEALAILLQKLKSEGRRVLILSQMILMLDILEMFLNFHYLTYVRIDENASSEQRQELMRSFNRD-RRIFCAILSTH 1891
Cdd:PLN03142  469 NSGKMVLLDKLLPKLKERDSRVLIFSQMTRLLDILEDYLMYRGYQYCRIDGNTGGEDRDASIDAFNKPgSEKFVFLLSTR 548
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940 1892 SRTTGINLVEADTVVFYDNDLNPVMDAKAQEWCDRIGRCKDIHIYRLVSGNSIEEKLLKNGTK 1954
Cdd:PLN03142  549 AGGLGINLATADIVILYDSDWNPQVDLQAQDRAHRIGQKKEVQVFRFCTEYTIEEKVIERAYK 611
DEXHc_ERCC6 cd18000
DEXH-box helicase domain of ERCC6; ERCC excision repair 6, chromatin remodeling factor (ERCC6, ...
1008-1191 1.56e-29

DEXH-box helicase domain of ERCC6; ERCC excision repair 6, chromatin remodeling factor (ERCC6, also known Cockayne syndrome group B (CSB), Rad26 in Saccharomyces cerevisiae, and Rhp26 in Schizosaccharomyces pombe) is a DNA-binding protein that is important in transcription-coupled excision repair. ERCC6 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350758 [Multi-domain]  Cd Length: 193  Bit Score: 117.81  E-value: 1.56e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKIL- 1086
Cdd:cd18000     1 LFKYQQTGVQWLWELHCQRVGGILGDEMGLGKTIQIIAFLAALHHSKLGLGPSLIVCPATVLKQWVKEFHRWWPPFRVVv 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1087 -----------SYIGSHRELKAKRQEWAEPNsfHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTL 1155
Cdd:cd18000    81 lhssgsgtgseEKLGSIERKSQLIRKVVGDG--GILITTYEGFRKHKDLLLNHNWQYVILDEGHKIRNPDAEITLACKQL 158
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 119618940 1156 QSQQRLLLIDSPLHNTFLELWTMVHFLVPgisrPYL 1191
Cdd:cd18000   159 RTPHRLILSGTPIQNNLKELWSLFDFVFP----PYL 190
DEXHc_CHD3_4_5 cd17994
DEAH-box helicase domain of the chromodomain helicase DNA binding proteins 3, 4 and 5; ...
1008-1224 3.14e-28

DEAH-box helicase domain of the chromodomain helicase DNA binding proteins 3, 4 and 5; Chromodomain-helicase-DNA-binding protein 3 (CHD3) is a component of the histone deacetylase NuRD complex which participates in the remodeling of chromatin by deacetylating histones. It is required for anchoring centrosomal pericentrin in both interphase and mitosis, for spindle organization and centrosome integrity. Chromodomain-helicase-DNA-binding protein 4 (CHD4) is a component of the histone deacetylase NuRD complex which participates in the remodeling of chromatin by deacetylating histones. Chromodomain-helicase-DNA-binding protein 5 (CHD5) is a chromatin-remodeling protein that binds DNA through histones and regulates gene transcription. It is thought to specifically recognize and bind trimethylated 'Lys-27' (H3K27me3) and non-methylated 'Lys-4' of histone H3 and plays a role in the development of the nervous system by activating the expression of genes promoting neuron terminal differentiation. In parallel, it may also positively regulate the trimethylation of histone H3 at 'Lys-27' thereby specifically repressing genes that promote the differentiation into non-neuronal cell lineages. As a tumor suppressor, it regulates the expression of genes involved in cell proliferation and differentiation. In spermatogenesis, it probably regulates histone hyperacetylation and the replacement of histones by transition proteins in chromatin, a crucial step in the condensation of spermatid chromatin and the production of functional spermatozoa. CHD3, CHD4, and CHD5 are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350752 [Multi-domain]  Cd Length: 196  Bit Score: 114.46  E-value: 3.14e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLAcNEGNW-GPHLVVVRSCNILKWELELKRWCPGLKIL 1086
Cdd:cd17994     1 LHPYQLEGLNWLRFSWAQGTDTILADEMGLGKTIQTIVFLYSLY-KEGHSkGPFLVSAPLSTIINWEREFEMWAPDFYVV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1087 SYIGShrelkakrqewaepnsfHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDS 1166
Cdd:cd17994    80 TYVGD-----------------HVLLTSYELISIDQAILGSIDWAVLVVDEAHRLKNNQSKFFRILNSYKIGYKLLLTGT 142
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 119618940 1167 PLHNTFLELWTMVHFLVPGisrpyLSSPLRAPSEESQDYYHKVVI-RLHRVTQPFILRR 1224
Cdd:cd17994   143 PLQNNLEELFHLLNFLTPE-----RFNNLQGFLEEFADISKEDQIkKLHDLLGPHMLRR 196
DEXHc_ERCC6L cd18001
DEXH-box helicase domain of ERCC6L; ERCC excision repair 6 like, spindle assembly checkpoint ...
1008-1224 6.05e-27

DEXH-box helicase domain of ERCC6L; ERCC excision repair 6 like, spindle assembly checkpoint helicase (ERCC6L, also known as RAD26L) is an essential component of the mitotic spindle assembly checkpoint, by acting as a tension sensor that associates with catenated DNA which is stretched under tension until it is resolved during anaphase. ERCC6L is proposed to stimulate cancer cell proliferation by promoting cell cycle through a way of RAB31-MAPK-CDK2. ERCC6L is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350759 [Multi-domain]  Cd Length: 232  Bit Score: 111.69  E-value: 6.05e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLAcNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18001     1 LYPHQREGVAWLWSLHDGGKGGILADDMGLGKTVQICAFLSGMF-DSGLIKSVLVVMPTSLIPHWVKEFAKWTPGLRVKV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQEWAEpNSFHVCITSYTQFFRG---LTAFTR--VRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLL 1162
Cdd:cd18001    80 FHGTSKKERERNLERIQ-RGGGVLLTTYGMVLSNteqLSADDHdeFKWDYVILDEGHKIKNSKTKSAKSLREIPAKNRII 158
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119618940 1163 LIDSPLHNTFLELWTMVHFLVPG--------ISRPYLSSPLRAPSEES----QDYYHKVVIRLHRVTQPFILRR 1224
Cdd:cd18001   159 LTGTPIQNNLKELWALFDFACNGsllgtrktFKMEFENPITRGRDKDAtqgeKALGSEVAENLRQIIKPYFLRR 232
DEXHc_ERCC6L2 cd18005
DEXH-box helicase domain of ERCC6L2; ERCC excision repair 6 like 2 (ERCC6L2, also known as ...
1008-1224 1.22e-26

DEXH-box helicase domain of ERCC6L2; ERCC excision repair 6 like 2 (ERCC6L2, also known as RAD26L) may play a role in DNA repair and mitochondrial function. In humans, mutations in the ERCC6L2 gene are associated with bone marrow failure syndrome 2. ERCC6L2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350763 [Multi-domain]  Cd Length: 245  Bit Score: 111.32  E-value: 1.22e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHL---------ACN-----------EGNWGPHLVVVRSC 1067
Cdd:cd18005     1 LRDYQREGVEFMYDLYKNGRGGILGDDMGLGKTVQVIAFLAAVlgktgtrrdRENnrprfkkkppaSSAKKPVLIVAPLS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1068 NILKWELELKRWcpG-LKILSYIGSHRE------LKAKRQEwaepnsfhVCITSYTQFFRGLTAFTRVRWKCLVIDEMQR 1140
Cdd:cd18005    81 VLYNWKDELDTW--GhFEVGVYHGSRKDdelegrLKAGRLE--------VVVTTYDTLRRCIDSLNSINWSAVIADEAHR 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1141 VKGMTERHWEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHFLVPG-----------ISRPYLSSPLRAPSEESQDYYHKV 1209
Cdd:cd18005   151 IKNPKSKLTQAMKELKCKVRIGLTGTLLQNNMKELWCLLDWAVPGalgsrsqfkkhFSEPIKRGQRHTATARELRLGRKR 230
                         250
                  ....*....|....*
gi 119618940 1210 VIRLHRVTQPFILRR 1224
Cdd:cd18005   231 KQELAVKLSKFFLRR 245
DEXHc_CHD7 cd18059
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 7; ...
1008-1224 3.42e-26

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 7; Chromodomain-helicase-DNA-binding protein 7 (CHD7) is a probable transcription regulator. It may be involved in the 45S precursor rRNA production. CHD7 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350817 [Multi-domain]  Cd Length: 222  Bit Score: 109.35  E-value: 3.42e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACnEGNWGPHLVVVRSCNILKWELELKRWCPgLKILS 1087
Cdd:cd18059     1 LREYQLEGVNWLLFNWYNTRNCILADEMGLGKTIQSITFLYEIYL-KGIHGPFLVIAPLSTIPNWEREFRTWTE-LNVVV 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSH---RELKAKRQEWAEPN--------SFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQ 1156
Cdd:cd18059    79 YHGSQasrRTIQLYEMYFKDPQgrvikgsyKFHAIITTFEMILTDCPELRNIPWRCVVIDEAHRLKNRNCKLLEGLKMMD 158
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119618940 1157 SQQRLLLIDSPLHNTFLELWTMVHFLVPGisrpylssplRAPSEES--QDY----YHKVVIRLHRVTQPFILRR 1224
Cdd:cd18059   159 LEHKVLLTGTPLQNTVEELFSLLHFLEPS----------RFPSETTfmQEFgdlkTEEQVQKLQAILKPMMLRR 222
DEXHc_RAD54 cd18004
DEXH-box helicase domain of RAD54; RAD54 proteins play a role in recombination. They are ...
1008-1224 5.24e-26

DEXH-box helicase domain of RAD54; RAD54 proteins play a role in recombination. They are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350762 [Multi-domain]  Cd Length: 240  Bit Score: 109.30  E-value: 5.24e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAK--LYRKNLNG---ILADEAGLGKTVQIIAFFAHLACNEGNWGPHL---VVVRSCNILK-WELELKR 1078
Cdd:cd18004     1 LRPHQREGVQFLYDclTGRRGYGGggaILADEMGLGKTLQAIALVWTLLKQGPYGKPTAkkaLIVCPSSLVGnWKAEFDK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1079 WCPG--LKILSYIGSHRELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTR-VRWKCLVIDEMQRVKGMTERHWEAVFTL 1155
Cdd:cd18004    81 WLGLrrIKVVTADGNAKDVKASLDFFSSASTYPVLIISYETLRRHAEKLSKkISIDLLICDEGHRLKNSESKTTKALNSL 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1156 QSQQRLLLIDSPLHNTFLELWTMVHFLVPGI-----------SRPYLSSPLRAPSEESQDYYHKVVIRLHRVTQPFILRR 1224
Cdd:cd18004   161 PCRRRLLLTGTPIQNDLDEFFALVDFVNPGIlgslasfrkvfEEPILRSRDPDASEEDKELGAERSQELSELTSRFILRR 240
DEXHc_CHD3 cd18055
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 3; ...
1008-1224 9.47e-26

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 3; Chromodomain-helicase-DNA-binding protein 3 (CHD3) is a component of the histone deacetylase NuRD complex which participates in the remodeling of chromatin by deacetylating histones. It is required for anchoring centrosomal pericentrin in both interphase and mitosis, for spindle organization and centrosome integrity. CHD3 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350813 [Multi-domain]  Cd Length: 232  Bit Score: 108.17  E-value: 9.47e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18055     1 LHMYQLEGLNWLRFSWAQGTDTILADEMGLGKTIQTIVFLYSLYKEGHTKGPFLVSAPLSTIINWEREFQMWAPDFYVVT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGS----------------------HRELKAKRQewaEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMT 1145
Cdd:cd18055    81 YTGDkdsraiirenefsfddnavkggKKAFKMKRE---AQVKFHVLLTSYELVTIDQAALGSIRWACLVVDEAHRLKNNQ 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1146 ERHWEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHFLVPGisrpyLSSPLRAPSEESQDYYHKVVI-RLHRVTQPFILRR 1224
Cdd:cd18055   158 SKFFRVLNGYKIDHKLLLTGTPLQNNLEELFHLLNFLTPE-----RFNNLEGFLEEFADISKEDQIkKLHDLLGPHMLRR 232
DEXHc_CHD4 cd18056
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 4; ...
1008-1224 1.10e-25

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 4; Chromodomain-helicase-DNA-binding protein 4 (CHD4) is a component of the histone deacetylase NuRD complex which participates in the remodeling of chromatin by deacetylating histones. CHD4 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350814 [Multi-domain]  Cd Length: 232  Bit Score: 108.23  E-value: 1.10e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18056     1 LHPYQLEGLNWLRFSWAQGTDTILADEMGLGKTVQTAVFLYSLYKEGHSKGPFLVSAPLSTIINWEREFEMWAPDMYVVT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSH------RE-------------LKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERH 1148
Cdd:cd18056    81 YVGDKdsraiiREnefsfednairggKKASRMKKEASVKFHVLLTSYELITIDMAILGSIDWACLIVDEAHRLKNNQSKF 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940 1149 WEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHFLVPgiSRPYlssPLRAPSEESQDYYHKVVI-RLHRVTQPFILRR 1224
Cdd:cd18056   161 FRVLNGYSLQHKLLLTGTPLQNNLEELFHLLNFLTP--ERFH---NLEGFLEEFADIAKEDQIkKLHDMLGPHMLRR 232
DEXHc_CHD6 cd18058
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 6; ...
1008-1224 1.98e-25

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 6; Chromodomain-helicase-DNA-binding protein 6 (CHD6) is a DNA-dependent ATPase that plays a role in chromatin remodeling. It regulates transcription by disrupting nucleosomes in a largely non-sliding manner which strongly increases the accessibility of chromatin. It activates transcription of specific genes in response to oxidative stress through interaction with NFE2L2.2 and acts as a transcriptional repressor of different viruses including influenza virus or papillomavirus. During influenza virus infection, the viral polymerase complex localizes CHD6 to inactive chromatin where it gets degraded in a proteasome independent-manner. CHD6 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350816 [Multi-domain]  Cd Length: 222  Bit Score: 107.05  E-value: 1.98e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACnEGNWGPHLVVVRSCNILKWELELKRWCPgLKILS 1087
Cdd:cd18058     1 LREYQLEGMNWLLFNWYNRKNCILADEMGLGKTIQSITFLSEIFL-MGIRGPFLIIAPLSTITNWEREFRTWTE-MNAIV 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSH--RELKAK-----RQEWAEPNS----FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQ 1156
Cdd:cd18058    79 YHGSQisRQMIQQyemyyRDEQGNPLSgifkFQVVITTFEMILADCPELKKINWSCVIIDEAHRLKNRNCKLLEGLKLMA 158
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 119618940 1157 SQQRLLLIDSPLHNTFLELWTMVHFLVPgISRPYLSSPLrapsEESQDY-YHKVVIRLHRVTQPFILRR 1224
Cdd:cd18058   159 LEHKVLLTGTPLQNSVEELFSLLNFLEP-SQFPSETTFL----EEFGDLkTEEQVKKLQSILKPMMLRR 222
DEXHc_CHD5 cd18057
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 5; ...
1008-1224 4.06e-25

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 5; Chromodomain-helicase-DNA-binding protein 5 (CHD5) is a chromatin-remodeling protein that binds DNA through histones and regulates gene transcription. It is thought to specifically recognize and bind trimethylated 'Lys-27' (H3K27me3) and non-methylated 'Lys-4' of histone H3 and plays a role in the development of the nervous system by activating the expression of genes promoting neuron terminal differentiation. In parallel, it may also positively regulate the trimethylation of histone H3 at 'Lys-27' thereby specifically repressing genes that promote the differentiation into non-neuronal cell lineages. As a tumor suppressor, it regulates the expression of genes involved in cell proliferation and differentiation. In spermatogenesis, it probably regulates histone hyperacetylation and the replacement of histones by transition proteins in chromatin, a crucial step in the condensation of spermatid chromatin and the production of functional spermatozoa. CHD5 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350815 [Multi-domain]  Cd Length: 232  Bit Score: 106.69  E-value: 4.06e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18057     1 LHPYQLEGLNWLRFSWAQGTDTILADEMGLGKTVQTIVFLYSLYKEGHSKGPYLVSAPLSTIINWEREFEMWAPDFYVVT 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGSHRELKAKRQ-EWA-EPNS-----------------FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERH 1148
Cdd:cd18057    81 YTGDKESRSVIREnEFSfEDNAirsgkkvfrmkkeaqikFHVLLTSYELITIDQAILGSIEWACLVVDEAHRLKNNQSKF 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940 1149 WEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHFLVpgisrPYLSSPLRAPSEESQDYYHKVVI-RLHRVTQPFILRR 1224
Cdd:cd18057   161 FRVLNSYKIDYKLLLTGTPLQNNLEELFHLLNFLT-----PERFNNLEGFLEEFADISKEDQIkKLHDLLGPHMLRR 232
DEXHc_CHD9 cd18061
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 9; ...
1008-1224 4.46e-25

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 9; Chromodomain-helicase-DNA-binding protein 9 (CHD9) acts as a transcriptional coactivator for PPARA and possibly other nuclear receptors. It is proposed to be a ATP-dependent chromatin remodeling protein. CHD9 has DNA-dependent ATPase activity and binds to A/T-rich DNA. It also associates with A/T-rich regulatory regions in promoters of genes that participate in the differentiation of progenitors during osteogenesis. CHD9 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350819 [Multi-domain]  Cd Length: 222  Bit Score: 106.24  E-value: 4.46e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLACNeGNWGPHLVVVRSCNILKWELELKRWCpGLKILS 1087
Cdd:cd18061     1 LREYQLEGLNWLLFNWYNRRNCILADEMGLGKTIQSITFLYEILLT-GIRGPFLIIAPLSTIANWEREFRTWT-DLNVVV 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGS--HRELKAKRQEWAEPNS---------FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQ 1156
Cdd:cd18061    79 YHGSliSRQMIQQYEMYFRDSQgriirgayrFQAIITTFEMILGGCPELNAIDWRCVIIDEAHRLKNKNCKLLEGLKLMN 158
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119618940 1157 SQQRLLLIDSPLHNTFLELWTMVHFLvpgisrpylsSPLRAPSEES--QDY----YHKVVIRLHRVTQPFILRR 1224
Cdd:cd18061   159 LEHKVLLTGTPLQNTVEELFSLLHFL----------EPLRFPSESTfmQEFgdlkTEEQVQKLQAILKPMMLRR 222
HepA COG0553
Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, ...
1793-1950 6.83e-25

Superfamily II DNA or RNA helicase, SNF2 family [Transcription, Replication, recombination, and repair];


Pssm-ID: 440319 [Multi-domain]  Cd Length: 682  Bit Score: 113.40  E-value: 6.83e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1793 LRQ-TTAPRLLqFPELRLVQFDSGKLEALAILLQKLKSEGRRVLILSQMILMLDILEMFLNFHYLTYVRIDENASSEQRQ 1871
Cdd:COG0553   511 LRQiCSHPALL-LEEGAELSGRSAKLEALLELLEELLAEGEKVLVFSQFTDTLDLLEERLEERGIEYAYLHGGTSAEERD 589
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 119618940 1872 ELMRSFNRDRRIFCAILSTHSRTTGINLVEADTVVFYDNDLNPVMDAKAQEWCDRIGRCKDIHIYRLVSGNSIEEKLLK 1950
Cdd:COG0553   590 ELVDRFQEGPEAPVFLISLKAGGEGLNLTAADHVIHYDLWWNPAVEEQAIDRAHRIGQTRDVQVYKLVAEGTIEEKILE 668
DEXDc_SHPRH-like cd18008
DEXH-box helicase domain of SHPRH-like proteins; The SHPRH-like subgroup belongs to the ...
1008-1224 1.20e-24

DEXH-box helicase domain of SHPRH-like proteins; The SHPRH-like subgroup belongs to the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350766 [Multi-domain]  Cd Length: 241  Bit Score: 105.45  E-value: 1.20e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLakLYRknlNGILADEAGLGKTVQIIA---------------FFAHLACNEGNWGPH--LVVVRScNIL 1070
Cdd:cd18008     1 LLPYQKQGLAWM--LPR---GGILADEMGLGKTIQALAlilatrpqdpkipeeLEENSSDPKKLYLSKttLIVVPL-SLL 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1071 K-WELELKR--WCPGLKILSYIGSHRELKAKrqewaEPNSFHVCITSYT----------QFFRGLTAFT------RVRWK 1131
Cdd:cd18008    75 SqWKDEIEKhtKPGSLKVYVYHGSKRIKSIE-----ELSDYDIVITTYGtlasefpknkKGGGRDSKEKeasplhRIRWY 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1132 CLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHFL------VPGISRPYLSSPLRAPSEESQDy 1205
Cdd:cd18008   150 RVILDEAHNIKNRSTKTSRAVCALKAERRWCLTGTPIQNSLDDLYSLLRFLrvepfgDYPWFNSDISKPFSKNDRKALE- 228
                         250
                  ....*....|....*....
gi 119618940 1206 yhkvviRLHRVTQPFILRR 1224
Cdd:cd18008   229 ------RLQALLKPILLRR 241
DEXHc_CHD8 cd18060
DEAH-box helicase domain of the chromodomain helicase DNA binding protein 8; ...
1008-1224 3.46e-24

DEAH-box helicase domain of the chromodomain helicase DNA binding protein 8; Chromodomain-helicase-DNA-binding protein 8 (CHD8) is a DNA helicase that acts as a chromatin remodeling factor and regulates transcription. It also acts as a transcription repressor by remodeling chromatin structure and recruiting histone H1 to target genes. It suppresses p53/TP53-mediated apoptosis by recruiting histone H1 and preventing p53/TP53 transactivation activity and of STAT3 activity by suppressing the LIF-induced STAT3 transcriptional activity. It also acts as a negative regulator of Wnt signaling pathway and CTNNB1-targeted gene expression. CHD8 is also involved in both enhancer blocking and epigenetic remodeling at chromatin boundary via its interaction with CTCF. It also acts as a transcription activator via its interaction with ZNF143 by participating in efficient U6 RNA polymerase III transcription. CHD8 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350818 [Multi-domain]  Cd Length: 222  Bit Score: 103.59  E-value: 3.46e-24
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRKNLNGILADEAGLGKTVQIIAFFAHLAcNEGNWGPHLVVVRSCNILKWELELKRWCPgLKILS 1087
Cdd:cd18060     1 LREYQLEGVNWLLFNWYNRQNCILADEMGLGKTIQSIAFLQEVY-NVGIHGPFLVIAPLSTITNWEREFNTWTE-MNTIV 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIGS--HRELKAKRQEWAEPNS---------FHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQ 1156
Cdd:cd18060    79 YHGSlaSRQMIQQYEMYCKDSRgrlipgaykFDALITTFEMILSDCPELREIEWRCVIIDEAHRLKNRNCKLLDSLKHMD 158
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 119618940 1157 SQQRLLLIDSPLHNTFLELWTMVHFLVPGisrpYLSSPLRAPSEESQDYYHKVVIRLHRVTQPFILRR 1224
Cdd:cd18060   159 LEHKVLLTGTPLQNTVEELFSLLHFLEPS----QFPSESEFLKDFGDLKTEEQVQKLQAILKPMMLRR 222
HSA smart00573
domain in helicases and associated with SANT domains;
764-828 7.09e-18

domain in helicases and associated with SANT domains;


Pssm-ID: 214727 [Multi-domain]  Cd Length: 73  Bit Score: 80.13  E-value: 7.09e-18
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940    764 PKLQEAPRPKSHWDYLLEEMQWMATDFAQERRWKVAAAKKVVEIKLR----VELEEKRKKALN----LQKVSR 828
Cdd:smart00573    1 QKLEEERRRKQHWDHLLEEMIWHAKDFKEEHKWKIAAAKKMAKAVMDyhqnKEKEEERREEKNekrrLRKLAA 73
DEXHc_RAD54B cd18066
DEXH-box helicase domain of RAD54B; DNA repair and recombination protein RAD54B, also known as ...
1008-1224 3.13e-17

DEXH-box helicase domain of RAD54B; DNA repair and recombination protein RAD54B, also known as RDH54, binds to double-stranded DNA, displays ATPase activity in the presence of DNA, and may have a role in meiotic and mitotic recombination. RAD54B is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350824 [Multi-domain]  Cd Length: 235  Bit Score: 83.74  E-value: 3.13e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWL-----AKLYRKNLNGILADEAGLGKTVQIIAFFAHLACnEGNWGPH------LVVVRSCNILKWELEL 1076
Cdd:cd18066     1 LRPHQREGIEFLyecvmGMRVNERFGAILADEMGLGKTLQCISLIWTLLR-QGPYGGKpvikraLIVTPGSLVKNWKKEF 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1077 KRWcpglkilsyIGSHR------ELKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWE 1150
Cdd:cd18066    80 QKW---------LGSERikvftvDQDHKVEEFIASPLYSVLIISYEMLLRSLDQISKLNFDLVICDEGHRLKNTSIKTTT 150
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1151 AVFTLQSQQRLLLIDSPLHNTFLELWTMVHFLVPGI-----------SRPYLSSPLRAPSEESQDYYHKVVIRLHRVTQP 1219
Cdd:cd18066   151 ALTSLSCERRIILTGTPIQNDLQEFFALIDFVNPGIlgslstyrkvyEEPIVRSREPTATPEEKKLGEARAAELTRLTGL 230

                  ....*
gi 119618940 1220 FILRR 1224
Cdd:cd18066   231 FILRR 235
DEXDc smart00487
DEAD-like helicases superfamily;
1001-1189 9.25e-17

DEAD-like helicases superfamily;


Pssm-ID: 214692 [Multi-domain]  Cd Length: 201  Bit Score: 81.38  E-value: 9.25e-17
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   1001 PSLLYGALRDYQKIGLDWLAKLYRknlNGILADEAGLGKTVQIIAfFAHLACNEGNWGPHLVVV--RScniLK--WELEL 1076
Cdd:smart00487    2 EKFGFEPLRPYQKEAIEALLSGLR---DVILAAPTGSGKTLAALL-PALEALKRGKGGRVLVLVptRE---LAeqWAEEL 74
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   1077 KRWCP--GLKILSYIGSHRELKAKRQEWAEPnsFHVCITSYTQFFRGLT--AFTRVRWKCLVIDEMQRVKGMTER-HWEA 1151
Cdd:smart00487   75 KKLGPslGLKVVGLYGGDSKREQLRKLESGK--TDILVTTPGRLLDLLEndKLSLSNVDLVILDEAHRLLDGGFGdQLEK 152
                           170       180       190       200
                    ....*....|....*....|....*....|....*....|...
gi 119618940   1152 VFTL--QSQQRLLL---IDSPLHNTFLELWTMVHFLVPGISRP 1189
Cdd:smart00487  153 LLKLlpKNVQLLLLsatPPEEIENLLELFLNDPVFIDVGFTPL 195
DEXHc_ATRX-like cd18007
DEXH-box helicase domain of ATRX-like proteins; This family includes ATRX-like members such as ...
1030-1196 4.28e-15

DEXH-box helicase domain of ATRX-like proteins; This family includes ATRX-like members such as transcriptional regulator ATRX (also called alpha thalassemia/mental retardation syndrome X-linked and X-linked nuclear protein or XNP) which is involved in transcriptional regulation and chromatin remodeling, and ARIP4 (also called androgen receptor-interacting protein 4, RAD54 like 2 or RAD54L2) which modulates androgen receptor (AR)-dependent transactivation in a promoter-dependent manner. They are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350765 [Multi-domain]  Cd Length: 239  Bit Score: 77.33  E-value: 4.28e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1030 ILADEAGLGKTVQIIAFF-AHLACNEGNWGPHLVVVRSCnILKWELELKRWCPGLKILSYIGSH--RELKAKR-----QE 1101
Cdd:cd18007    30 ILAHTMGLGKTLQVITFLhTYLAAAPRRSRPLVLCPAST-LYNWEDEFKKWLPPDLRPLLVLVSlsASKRADArlrkiNK 108
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1102 WAEPNSfhVCITSYTQFfRGLTA---------FTRVRWKC------LVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDS 1166
Cdd:cd18007   109 WHKEGG--VLLIGYELF-RNLASnattdprlkQEFIAALLdpgpdlLVLDEGHRLKNEKSQLSKALSKVKTKRRILLTGT 185
                         170       180       190
                  ....*....|....*....|....*....|
gi 119618940 1167 PLHNTFLELWTMVHFLVPGisrpYLSSPLR 1196
Cdd:cd18007   186 PLQNNLKEYWTMVDFARPK----YLGTLKE 211
HSA pfam07529
HSA domain; This domain is predicted to bind DNA and is often found associated with helicases. ...
766-819 1.46e-14

HSA domain; This domain is predicted to bind DNA and is often found associated with helicases. This region does not form a compact domain in the known structures.


Pssm-ID: 462194 [Multi-domain]  Cd Length: 67  Bit Score: 70.68  E-value: 1.46e-14
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940   766 LQEAPR-PKSHWDYLLEEMQWMATDFAQERRWKVAAAKKVV------------EIKLRVELEEKRKK 819
Cdd:pfam07529    1 RDEPERrEKTHHDYLLEEILWHSKDFKQERRWKRARAKKLAravaqyhkniekEEQKRIEREEKQRL 67
Helicase_C pfam00271
Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, ...
1815-1930 5.86e-14

Helicase conserved C-terminal domain; The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.


Pssm-ID: 459740 [Multi-domain]  Cd Length: 109  Bit Score: 70.32  E-value: 5.86e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1815 GKLEALAILLQKlkSEGRRVLILSQMILMLDIlEMFLNFHYLTYVRIDENASSEQRQELMRSFNRDRRIFcaILSTHSRT 1894
Cdd:pfam00271    1 EKLEALLELLKK--ERGGKVLIFSQTKKTLEA-ELLLEKEGIKVARLHGDLSQEEREEILEDFRKGKIDV--LVATDVAE 75
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 119618940  1895 TGINLVEADTVVFYDNDLNPVMDakAQewcdRIGRC 1930
Cdd:pfam00271   76 RGLDLPDVDLVINYDLPWNPASY--IQ----RIGRA 105
DEXHc_HARP_SMARCAL1 cd18010
DEXH-box helicase domain of SMARCAL1; SMARCAL1 (SWI/SNF related, matrix associated, actin ...
1008-1186 2.82e-13

DEXH-box helicase domain of SMARCAL1; SMARCAL1 (SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily a like 1, also known as HARP) is recruited to stalled replication forks to promote repair and helps restart replication. It plays a role in DNA repair, telomere maintenance and replication fork stability in response to DNA replication stress. Mutations cause Schimke Immunoosseous Dysplasia. SMARCAL1 is part of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350768 [Multi-domain]  Cd Length: 213  Bit Score: 71.47  E-value: 2.82e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAklyRKNLNGILADEAGLGKTVQIIAFFAHLacnEGNWgPHLVVVRSCNILKWELELKRWCPGLKILS 1087
Cdd:cd18010     1 LLPFQREGVCFAL---RRGGRVLIADEMGLGKTVQAIAIAAYY---REEW-PLLIVCPSSLRLTWADEIERWLPSLPPDD 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1088 YIgshreLKAKRQEWAEPNSFHVCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVK-GMTERHWEAVFTLQSQQR-LLLID 1165
Cdd:cd18010    74 IQ-----VIVKSKDGLRDGDAKVVIVSYDLLRRLEKQLLARKFKVVICDESHYLKnSKAKRTKAALPLLKRAKRvILLSG 148
                         170       180
                  ....*....|....*....|.
gi 119618940 1166 SPLHNTFLELWTMVHFLVPGI 1186
Cdd:cd18010   149 TPALSRPIELFTQLDALDPKL 169
DEXHc_RAD54A cd18067
DEXH-box helicase domain of RAD54A; DNA repair and recombination protein RAD54A, also known as ...
1008-1186 1.64e-10

DEXH-box helicase domain of RAD54A; DNA repair and recombination protein RAD54A, also known as RAD54L or RAD54, plays a role in homologous recombination related repair of DNA double-strand breaks. RAD54A is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350825 [Multi-domain]  Cd Length: 243  Bit Score: 64.03  E-value: 1.64e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAK----LYRKNLNG-ILADEAGLGKTVQIIAFFAHLACNEGNWGPHL----VVVRSCNILKWELELKR 1078
Cdd:cd18067     1 LRPHQREGVKFLYRcvtgRRIRGSHGcIMADEMGLGKTLQCITLMWTLLRQSPQCKPEIdkaiVVSPSSLVKNWANELGK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1079 WCpGLKILSYI---GSHRELKAKRQEWAEPNSFH----VCITSYTQFFRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEA 1151
Cdd:cd18067    81 WL-GGRLQPLAidgGSKKEIDRKLVQWASQQGRRvstpVLIISYETFRLHVEVLQKGEVGLVICDEGHRLKNSDNQTYQA 159
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 119618940 1152 VFTLQSQQRLLLIDSPLHNTFLELWTMVHFLVPGI 1186
Cdd:cd18067   160 LDSLNTQRRVLLSGTPIQNDLSEYFSLVNFVNPGI 194
HELICc smart00490
helicase superfamily c-terminal domain;
1845-1930 4.78e-10

helicase superfamily c-terminal domain;


Pssm-ID: 197757 [Multi-domain]  Cd Length: 82  Bit Score: 58.38  E-value: 4.78e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   1845 DILEMFLNFHYLTYVRIDENASSEQRQELMRSFNRDRRIFcaILSTHSRTTGINLVEADTVVFYDNDLNPVMDakAQewc 1924
Cdd:smart00490    1 EELAELLKELGIKVARLHGGLSQEEREEILDKFNNGKIKV--LVATDVAERGLDLPGVDLVIIYDLPWSPASY--IQ--- 73

                    ....*.
gi 119618940   1925 dRIGRC 1930
Cdd:smart00490   74 -RIGRA 78
DEXHc_ARIP4 cd18069
DEXH-box helicase domain of ARIP4; Androgen receptor-interacting protein 4 (ARIP4, also called ...
1030-1203 3.51e-09

DEXH-box helicase domain of ARIP4; Androgen receptor-interacting protein 4 (ARIP4, also called RAD54 like 2 or RAD54L2 ) modulates androgen receptor (AR)-dependent transactivation in a promoter-dependent manner. ARIP4 is part of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350827 [Multi-domain]  Cd Length: 227  Bit Score: 59.83  E-value: 3.51e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1030 ILADEAGLGKTVQIIAF----FAHLACNEGnwgphLVVVRSCNILKWELELKRWCP-----------GLKILSYIGSHRE 1094
Cdd:cd18069    32 ILAHSMGLGKTLQVISFldvlLRHTGAKTV-----LAIVPVNTLQNWLSEFNKWLPppealpnvrprPFKVFILNDEHKT 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1095 LKAKRQ---EWAEPNSfhVCITSYTQFfrgltaftRVR--WKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSPLH 1169
Cdd:cd18069   107 TAARAKvieDWVKDGG--VLLMGYEMF--------RLRpgPDVVICDEGHRIKNCHASTSQALKNIRSRRRIVLTGYPLQ 176
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 119618940 1170 NTFLELWTMVHFLVPGI--SRPYLSSPLRAPSEESQ 1203
Cdd:cd18069   177 NNLIEYWCMVDFVRPDFlgTRQEFSNMFERPILNGQ 212
DEXHc_ATRX cd18068
DEXH-box helicase domain of ATRX; Transcriptional regulator ATRX (also called alpha ...
1029-1186 5.21e-09

DEXH-box helicase domain of ATRX; Transcriptional regulator ATRX (also called alpha thalassemia/mental retardation syndrome X-linked and X-linked nuclear protein or XNP) is involved in transcriptional regulation and chromatin remodeling. Mutations in humans cause mental retardation, X-linked, syndromic, with hypotonic facies 1 (MRXSHF1) and alpha-thalassemia myelodysplasia syndrome (ATMDS). ATRX is part of the a DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350826 [Multi-domain]  Cd Length: 246  Bit Score: 59.52  E-value: 5.21e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1029 GILADEAGLGKTVQIIAFFAHLACNE--GNWGPHLVVVRSCNILKWELELKRWCPGLKI--------LSYIGSHRELKAK 1098
Cdd:cd18068    31 CILAHCMGLGKTLQVVTFLHTVLLCEklENFSRVLVVCPLNTVLNWLNEFEKWQEGLKDeekievneLATYKRPQERSYK 110
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1099 RQEWAEPNSfhVCITSYtQFFRGLTAFTRVRWKC-----------------LVIDEMQRVKGMTERHWEAVFTLQSQQRL 1161
Cdd:cd18068   111 LQRWQEEGG--VMIIGY-DMYRILAQERNVKSREklkeifnkalvdpgpdfVVCDEGHILKNEASAVSKAMNSIRTKRRI 187
                         170       180
                  ....*....|....*....|....*
gi 119618940 1162 LLIDSPLHNTFLELWTMVHFLVPGI 1186
Cdd:cd18068   188 VLTGTPLQNNLIEYHCMVNFVKPNL 212
DEXHc_TTF2 cd18072
DEAH-box helicase domain of TTF2; Transcription termination factor 2 (TTF2 also called ...
1008-1182 1.55e-08

DEAH-box helicase domain of TTF2; Transcription termination factor 2 (TTF2 also called Forkhead-box E1/FOXE1 ) is a transcription termination factor that couples ATP hydrolysis with the removal of RNA polymerase II from the DNA template. Single nucleotide polymorphism (SNP) within the 5'-UTR of TTF2 is associated with thyroid cancer risk.TTF2 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350830 [Multi-domain]  Cd Length: 241  Bit Score: 58.26  E-value: 1.55e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLakLYRKNLN---GILADEAGLGKTVQIIAFFahLACNEG-------------NWGPHL--VVVRSCNI 1069
Cdd:cd18072     1 LLLHQKQALAWL--LWRERQKprgGILADDMGLGKTLTMIALI--LAQKNTqnrkeeekekaltEWESKKdsTLVPSAGT 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1070 L---------KWELELKRWCPG--LKILSYIGSHRELKAKRQEwaepnSFHVCITSYTQFFRGL---------TAFTRVR 1129
Cdd:cd18072    77 LvvcpaslvhQWKNEVESRVASnkLRVCLYHGPNRERIGEVLR-----DYDIVITTYSLVAKEIptykeesrsSPLFRIA 151
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 119618940 1130 WKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHFL 1182
Cdd:cd18072   152 WARIILDEAHNIKNPKVQASIAVCKLRAHARWALTGTPIQNNLLDMYSLLKFL 204
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
518-780 1.83e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 56.97  E-value: 1.83e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   518 PPTPQAAQLAGQRQSQQQYDPSTGPPVQNAASLHT---------PLPQL-----------PGRLPPAGVPTAALSSALQF 577
Cdd:pfam09770  108 AARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTgyekykepePIPDLqvdaslwgvapKKAAAPAPAPQPAAQPASLP 187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   578 AQQPQV-----VEAQTQLQIPVKTQQPNVPIPAPPSSqlpiPPSQPAQLALHVPTPGKVQVQASQLSSLPQMVASTRLPV 652
Cdd:pfam09770  188 APSRKMmsleeVEAAMRAQAKKPAQQPAPAPAQPPAA----PPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPV 263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   653 DPAPPCPRPLPTSSTSSLAPVS-GSGPGPSPARSSPV------NRPSSAtnkalspvtsRTPGVVASAPTKPQSPAQNAT 725
Cdd:pfam09770  264 TILQRPQSPQPDPAQPSIQPQAqQFHQQPPPVPVQPTqilqnpNRLSAA----------RVGYPQNPQPGVQPAPAHQAH 333
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940   726 SSQdSSQDTLTEQITLENQVHQRIAELRKAGLwsqrrlpkLQEAPRPK-SHWDYLL 780
Cdd:pfam09770  334 RQQ-GSFGRQAPIITHPQQLAQLSEEEKAAYL--------DEEAKRAKrNHKIFLL 380
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
496-733 2.68e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 56.70  E-value: 2.68e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   496 QHPGADAGVPLQQLMptAQGGMPPTPQaAQLAGQRQSQQQYDPSTGPPVQNAASLHTPLPQLPGRL----PPAGVPTAAL 571
Cdd:pfam03154  290 QHPVPPQPFPLTPQS--SQSQVPPGPS-PAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMphikPPPTTPIPQL 366
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   572 SSAlQFAQQPQVVEAQTQLQIPVKTQQP-------NVPIPAPPSS------------QLPIPPSQPaqlalhvPTPGKVQ 632
Cdd:pfam03154  367 PNP-QSHKHPPHLSGPSPFQMNSNLPPPpalkplsSLSTHHPPSAhppplqlmpqsqQLPPPPAQP-------PVLTQSQ 438
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   633 VQASQLSSLPQMVASTRLPVDPAPPCPRPLPTSSTSSLAPvsgSGPGPSPARSSPVNRPSSATNKALSpvtsrtpGVVAS 712
Cdd:pfam03154  439 SLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPP---SGPPTSTSSAMPGIQPPSSASVSSS-------GPVPA 508
                          250       260
                   ....*....|....*....|.
gi 119618940   713 APTKPQSPAQNATSSQDSSQD 733
Cdd:pfam03154  509 AVSCPLPPVQIKEEALDEAEE 529
DEXDc_RapA cd18011
DEXH-box helicase domain of RapA; In bacteria, RapA is an RNA polymerase (RNAP)-associated ...
1030-1185 8.48e-07

DEXH-box helicase domain of RapA; In bacteria, RapA is an RNA polymerase (RNAP)-associated SWI2/SNF2 (switch/sucrose non-fermentable) protein that mediates RNAP recycling during transcription. The ATPase activity of RapA is stimulated by its interaction with RNAP and inhibited by its N-terminal domain. The conformational changes of RapA and its interaction with RNAP are essential for RNAP recycling. RapA is part of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350769 [Multi-domain]  Cd Length: 207  Bit Score: 52.29  E-value: 8.48e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1030 ILADEAGLGKTVQIIAfFAHLACNEGNWGPHLVVVRSCNILKWELELKRWCpGLKILSYIGSHRElKAKRQEWAEPNSFH 1109
Cdd:cd18011    21 LLADEVGLGKTIEAGL-IIKELLLRGDAKRVLILCPASLVEQWQDELQDKF-GLPFLILDRETAA-QLRRLIGNPFEEFP 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1110 VCITSYTQFFRG---LTAFTRVRWKCLVIDEMQRVKGM-----TERhWEAVFTLQSQQR--LLLIDSPLHNTFLELWTMV 1179
Cdd:cd18011    98 IVIVSLDLLKRSeerRGLLLSEEWDLVVVDEAHKLRNSgggkeTKR-YKLGRLLAKRARhvLLLTATPHNGKEEDFRALL 176

                  ....*.
gi 119618940 1180 HFLVPG 1185
Cdd:cd18011   177 SLLDPG 182
DEXHc_HLTF1_SMARC3 cd18071
DEXH-box helicase domain of HLTF1; Helicase like transcription factor (HLTF1, also known as ...
1029-1224 9.40e-07

DEXH-box helicase domain of HLTF1; Helicase like transcription factor (HLTF1, also known as HIP116 or SMARCA3) has both helicase and E3 ubiquitin ligase activities and ATP-dependent nucleosome-remodeling activity. HLTF1 is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350829 [Multi-domain]  Cd Length: 239  Bit Score: 52.86  E-value: 9.40e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1029 GILADEAGLGKTVQIIAFFAHlacnegnwGPHLVVVRSCNILKWELELKRWC-PG-LKILSYIGSHRELKAKrqewaEPN 1106
Cdd:cd18071    51 GILADDMGLGKTLTTISLILA--------NFTLIVCPLSVLSNWETQFEEHVkPGqLKVYTYHGGERNRDPK-----LLS 117
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1107 SFHVCITSYTQF-----FRGLTAFTRVRWKCLVIDEMQRVKGMTERHWEAVFTLQSQQRLLLIDSPLHNTFLELWTMVHF 1181
Cdd:cd18071   118 KYDIVLTTYNTLasdfgAKGDSPLHTINWLRVVLDEGHQIRNPNAQQTKAVLNLSSERRWVLTGTPIQNSPKDLGSLLSF 197
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 119618940 1182 L--VPGISRPYLSSPLRAPSEESQDyyhKVVIRLHRVTQPFILRR 1224
Cdd:cd18071   198 LhlKPFSNPEYWRRLIQRPLTMGDP---TGLKRLQVLMKQITLRR 239
DEXQc_SHPRH cd18070
DEXQ-box helicase domain of SHPRH; E3 ubiquitin-protein ligase SHPRH is a ubiquitously ...
1008-1145 5.03e-06

DEXQ-box helicase domain of SHPRH; E3 ubiquitin-protein ligase SHPRH is a ubiquitously expressed protein that contains motifs characteristic of several DNA repair proteins, transcription factors, and helicases. SHPRH is a functional homolog of S. cerevisiae RAD5 and is involved in DNA repair. SHPRH is a member of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350828 [Multi-domain]  Cd Length: 257  Bit Score: 50.81  E-value: 5.03e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLaklyrKNLNGILADEAGLGKTVQIIAF-FAH-----------LACNEGNWGPHLVV---VRSCN---- 1068
Cdd:cd18070     1 LLPYQRRAVNWM-----LVPGGILADEMGLGKTVEVLALiLLHprpdndldaadDDSDEMVCCPDCLVaetPVSSKatli 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1069 -----ILK-WELELKRWCP-GLKILSYIGSHRELKAKRQEWAEPNSFHVCITSYT-------------------QFFRGL 1122
Cdd:cd18070    76 vcpsaILAqWLDEINRHVPsSLKVLTYQGVKKDGALASPAPEILAEYDIVVTTYDvlrtelhyaeanrsnrrrrRQKRYE 155
                         170       180
                  ....*....|....*....|....*.
gi 119618940 1123 ---TAFTRVRWKCLVIDEMQRVKGMT 1145
Cdd:cd18070   156 appSPLVLVEWWRVCLDEAQMVESST 181
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
477-643 6.07e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 52.35  E-value: 6.07e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   477 MAEQSKRPRlevghQGVVFQHPGADAGVPLQQLMPTAQGgmPPTPQAAQLAGQRQSQQQYDPSTGPPVQnaaslhtPLPQ 556
Cdd:pfam09770  203 MRAQAKKPA-----QQPAPAPAQPPAAPPAQQAQQQQQF--PPQIQQQQQPQQQPQQPQQHPGQGHPVT-------ILQR 268
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   557 LPGRLPPAGVPTAALSSALQFAQQPQVVEAQTQ-LQIPVKTQQPNVPIPAPPSSQLPIPPSQPAQLALHV-PTPGKVQVQ 634
Cdd:pfam09770  269 PQSPQPDPAQPSIQPQAQQFHQQPPPVPVQPTQiLQNPNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSfGRQAPIITH 348

                   ....*....
gi 119618940   635 ASQLSSLPQ 643
Cdd:pfam09770  349 PQQLAQLSE 357
PHA03247 PHA03247
large tegument protein UL36; Provisional
511-734 1.52e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 1.52e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  511 PTAQGGMPPTPQAAQLAGQRQSQQQYDPSTGP--PVQNAASLHTPLPQLPgrlPPAGVPTAALSSALQFAQQPQVVEAQT 588
Cdd:PHA03247 2735 LPAAPAPPAVPAGPATPGGPARPARPPTTAGPpaPAPPAAPAAGPPRRLT---RPAVASLSESRESLPSPWDPADPPAAV 2811
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  589 QLQIPV--KTQQPNVPIPAPPSSQLPIPPSQPAQLALHVPTPGKV------QVQASQLSSLPQMVASTRLPVDPAPPCPR 660
Cdd:PHA03247 2812 LAPAAAlpPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVapggdvRRRPPSRSPAAKPAAPARPPVRRLARPAV 2891
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 119618940  661 PLPTSSTSSlapvsgsgPGPSPARSSPVNRPSSATNKALSPVTSRtPGVVASAPTKPQSPAQNATSSQDSSQDT 734
Cdd:PHA03247 2892 SRSTESFAL--------PPDQPERPPQPQAPPPPQPQPQPPPPPQ-PQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
2962-3081 2.86e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 49.77  E-value: 2.86e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2962 TQIQVAKLPQVVQQQTPVASIQQVASA--SQQAS-PQTVALTQATAAGQQVQMIPAVTATAQVVQQKLIQQQVVT--TAS 3036
Cdd:PRK14971  355 TLIQLAQLTQKGDDASGGRGPKQHIKPvfTQPAAaPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVdpPAA 434
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 119618940 3037 APLQTPGAPNPAQVPASSDSPSQQPKLQMRVPAVRLKTPTKPPCQ 3081
Cdd:PRK14971  435 VPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAE 479
PHA03247 PHA03247
large tegument protein UL36; Provisional
469-722 7.27e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 7.27e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  469 QQAMPSTGMAEQSKRPRLEVGHQGVVFQHPGADAGVPLQQLMPTAQGGMPPTPQAAQLAGQRQSQQQYDPSTGPPVQNAA 548
Cdd:PHA03247 2572 RPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPER 2651
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  549 SLHTPLP---QLPGRLPPAGVPTAAlSSALQFAQQPQV---VEAQTQLQIPVKTQQPNVPIPAPPSSQLPIPPSQPAQLA 622
Cdd:PHA03247 2652 PRDDPAPgrvSRPRRARRLGRAAQA-SSPPQRPRRRAArptVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQ 2730
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  623 LHVPTPGKvqvqasqlSSLPQMVASTRLPVDPAPPCPRPLPTSSTSSLAPVSGSGPGPSPARSSPVNRPSSATNKALSPV 702
Cdd:PHA03247 2731 ASPALPAA--------PAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPW 2802
                         250       260
                  ....*....|....*....|
gi 119618940  703 TSRTPGVVASAPTKPQSPAQ 722
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAA 2822
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
495-772 1.35e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.60  E-value: 1.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   495 FQHPGADAGVPLQQLMPT---AQGGMPPTPQAAQL-----AGQRQSQQQYDPSTGP----PVQNAASLHTPLPQLPGRLP 562
Cdd:pfam05109  439 FAAPNTTTGLPSSTHVPTnltAPASTGPTVSTADVtsptpAGTTSGASPVTPSPSPrdngTESKAPDMTSPTSAVTTPTP 518
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   563 PAGVPTAALSSALQFAQQPQVveAQTQLQIPVKTQQPNVPIPAPPSSQlPIPPSQpaqlalhVPTPGKVQVQASQLSSLP 642
Cdd:pfam05109  519 NATSPTPAVTTPTPNATSPTL--GKTSPTSAVTTPTPNATSPTPAVTT-PTPNAT-------IPTLGKTSPTSAVTTPTP 588
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   643 QMVASTrlpvdpappcprPLPTSSTSSLAPVSGSGPGPSPARSSPVNRPSSATNKALSPVTSRTpgvVASAPTKPQSPAQ 722
Cdd:pfam05109  589 NATSPT------------VGETSPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSS---TSSMSLRPSSISE 653
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 119618940   723 NATSsqdSSQDTLTEQITLENQVH----QRIAELRKAGLwSQRRLPKLQEAPRP 772
Cdd:pfam05109  654 TLSP---STSDNSTSHMPLLTSAHptggENITQVTPAST-STHHVSTSSPAPRP 703
SP1-4_arthropods_N cd22553
N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; ...
2932-3064 2.44e-04

N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. There are many SPs in vertebrates (9 SPs in humans and mice, 7 SPs in the chicken, and 11 SPs in teleost fish), but arthropods only have 3 SPs. One SP is clade SP1-4, which is expressed ubiquitously throughout development. SP1-4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. This model represents the N-terminal domain of SP1-4 from arthropods.


Pssm-ID: 411778 [Multi-domain]  Cd Length: 384  Bit Score: 46.17  E-value: 2.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2932 AYAAQPALKTQFLTtPISQAQKLAGAQ--QVQTQIQVAKLPQVVQQQTPVAsIQQVASASQQASPQTvaltqATAAGQQV 3009
Cdd:cd22553   188 AGGGNQALQAQVIP-QLAQAAQLQPQQlaQVSSQGYIQQIPANASQQQPQM-VQQGPNQSGQIIGQV-----ASASSIQA 260
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940 3010 QMIPAVT-ATAQVVQQKLIQQQVVTTASAPLQTPGAPNPAQVPASSDSPSQQPKLQ 3064
Cdd:cd22553   261 AAIPLTVyTGALAGQNGSNQQQVGQIVTSPIQGMTQGLTAPASSSIPTVVQQQAIQ 316
PHA03269 PHA03269
envelope glycoprotein C; Provisional
588-721 2.55e-04

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 46.65  E-value: 2.55e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  588 TQLQIPVKTQQPNVPIP---APPSSQLPIPPSQPAQLALHVPTPGKVQVQAS----QLSSLPQMVASTRLpvDPAPPCPR 660
Cdd:PHA03269   13 ACINLIIANLNTNIPIPelhTSAATQKPDPAPAPHQAASRAPDPAVAPTSAAsrkpDLAQAPTPAASEKF--DPAPAPHQ 90
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 119618940  661 PLPTSSTSSLAPVSGSGPGPSPArSSPVNRPSSATNKALSPVT--SRTPGVVASAPTKPQSPA 721
Cdd:PHA03269   91 AASRAPDPAVAPQLAAAPKPDAA-EAFTSAAQAHEAPADAGTSaaSKKPDPAAHTQHSPPPFA 152
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
586-770 2.81e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.68  E-value: 2.81e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   586 AQTQLQipvKTQQPNVPIPAPPSSQLPIPPSQPAQLALHVPTPGKVQVQ---ASQLSSLPQMVASTRLPVDPAPPCPRPL 662
Cdd:pfam03154  162 AQQQIL---QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPpqgSPATSQPPNQTQSTAAPHTLIQQTPTLH 238
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   663 PTSSTSSLAPVSGSGPGPSPARSSPVNRPSSATNKALSPVtsrtPGVVASAPTKPQSPAQ----NATSSQDSSQDTLTEQ 738
Cdd:pfam03154  239 PQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPM----PHSLQTGPSHMQHPVPpqpfPLTPQSSQSQVPPGPS 314
                          170       180       190
                   ....*....|....*....|....*....|..
gi 119618940   739 ITLENQVHQRIAELRKAGLWSQRRLPKLQEAP 770
Cdd:pfam03154  315 PAAPGQSQQRIHTPPSQSQLQSQQPPREQPLP 346
ResIII pfam04851
Type III restriction enzyme, res subunit;
1008-1137 3.53e-04

Type III restriction enzyme, res subunit;


Pssm-ID: 398492 [Multi-domain]  Cd Length: 162  Bit Score: 43.81  E-value: 3.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  1008 LRDYQKIGLD-WLAKLYRKNLNGILADEAGLGKTvqIIAFFAHLACNEGNWGPH-LVVVRSCNILK-WELELKRWCPGLK 1084
Cdd:pfam04851    4 LRPYQIEAIEnLLESIKNGQKRGLIVMATGSGKT--LTAAKLIARLFKKGPIKKvLFLVPRKDLLEqALEEFKKFLPNYV 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 119618940  1085 ILSYIGSHRELKAKRQEWaepnsfHVCITSYTQFFRGLTAFTRV----RWKCLVIDE 1137
Cdd:pfam04851   82 EIGEIISGDKKDESVDDN------KIVVTTIQSLYKALELASLEllpdFFDVIIIDE 132
SP1-4_arthropods_N cd22553
N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; ...
2798-3023 3.85e-04

N-terminal domain of transcription factor Specificity Protein (SP) 1-4 from arthropods; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. There are many SPs in vertebrates (9 SPs in humans and mice, 7 SPs in the chicken, and 11 SPs in teleost fish), but arthropods only have 3 SPs. One SP is clade SP1-4, which is expressed ubiquitously throughout development. SP1-4 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. This model represents the N-terminal domain of SP1-4 from arthropods.


Pssm-ID: 411778 [Multi-domain]  Cd Length: 384  Bit Score: 45.79  E-value: 3.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2798 GTTVANLQVARLTRVPTSQLQAQGQMQTQAPQPAQVALAKPPVVSVPAAVVSSPGVTTLPMNVAGiSVAIGQPQKAA-GQ 2876
Cdd:cd22553    92 LLQTNNQQAIQLAPGGTQAILANQQTLIRPNTVQGQANASNVLQNIAQIASGGNAVQLPLNNMTQ-TIPVQVPVSTAnGQ 170
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2877 TV---VAQPVHMQQLLKLKQQAVQQQKAIQPQAAQ-GPAAVQQKITA------QQITTPGAQQK-------------VAY 2933
Cdd:cd22553   171 TVyqtIQVPIQAIQSGNAGGGNQALQAQVIPQLAQaAQLQPQQLAQVssqgyiQQIPANASQQQpqmvqqgpnqsgqIIG 250
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2934 AAQPALKTQFLTTPISQAQKLAGAQQVQTQIQVAKLP----QVVQQQTPVASIQQVASASQQASPQTVAL---TQATAAG 3006
Cdd:cd22553   251 QVASASSIQAAAIPLTVYTGALAGQNGSNQQQVGQIVtspiQGMTQGLTAPASSSIPTVVQQQAIQGNPLppgTQIIAAG 330
                         250
                  ....*....|....*..
gi 119618940 3007 QQVQMIPAVTATAQVVQ 3023
Cdd:cd22553   331 QQLQQDPNDPTKWQVVA 347
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
469-730 4.22e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 46.15  E-value: 4.22e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   469 QQAMPSTGMAEQSKRPRLEVGHQGVVFQHPGADAGVPLQQLMPTAQGGMP--------PTPQAAQLAGQRQSQQQYDPST 540
Cdd:pfam09606  171 PNQMGPNGGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGPADAGAQMGqqaqanggMNPQQMGGAPNQVAMQQQQPQQ 250
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   541 GPPVQNAASLHTPLPQLPGRLP---PAGVPTAALSSAlqfAQQPQVVEAQTQLQIP-VKTQQPNVpiPAPPSSQLPIPPS 616
Cdd:pfam09606  251 QGQQSQLGMGINQMQQMPQGVGggaGQGGPGQPMGPP---GQQPGAMPNVMSIGDQnNYQQQQTR--QQQQQQGGNHPAA 325
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   617 QPAQLALHVptpgkvqVQASQLSSLP-QMVASTRLPVDPAPPCPRPL--------------PTSSTSSLAPVSGSGPGPS 681
Cdd:pfam09606  326 HQQQMNQSV-------GQGGQVVALGgLNHLETWNPGNFGGLGANPMqrgqpgmmsspspvPGQQVRQVTPNQFMRQSPQ 398
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|
gi 119618940   682 PARSSPVNrPSSATNKALSPVTSRTPGVVAS-APTKPQSPAQNATSSQDS 730
Cdd:pfam09606  399 PSVPSPQG-PGSQPPQSHPGGMIPSPALIPSpSPQMSQQPAQQRTIGQDS 447
DEXQc_bact_SNF2 cd18013
DEXQ-box helicase domain of bacterial SNF2 family proteins; Proteins belonging to the SNF2 ...
1008-1182 5.02e-04

DEXQ-box helicase domain of bacterial SNF2 family proteins; Proteins belonging to the SNF2 family of DNA dependent ATPases are important members of the chromatin remodeling complexes that are implicated in epigenetic control of gene expression. The Snf2 family comprise a large group of ATP-hydrolyzing proteins that are ubiquitous in eukaryotes, but also present in eubacteria and archaea. The bacterial SNF2 present in this family are members of the DEAD-like helicase superfamily, a diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.


Pssm-ID: 350771 [Multi-domain]  Cd Length: 218  Bit Score: 44.26  E-value: 5.02e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1008 LRDYQKIGLDWLAKLYRknlNGILADeAGLGKTVQIIAFFAHLACnEGNWGPHLV-----VVRScnilKWELELKRW--C 1080
Cdd:cd18013     1 PHPYQKVAINFIIEHPY---CGLFLD-MGLGKTVTTLTALSDLQL-DDFTRRVLViaplrVARS----TWPDEVEKWnhL 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 1081 PGLKILSYIGSHREL-KAKRQEWaepnsfHVCITSYtQFFRGLTAFTRVRW--KCLVIDEMQRVKGMTERHWEAVFTLQS 1157
Cdd:cd18013    72 RNLTVSVAVGTERQRsKAANTPA------DLYVINR-ENLKWLVNKSGDPWpfDMVVIDELSSFKSPRSKRFKALRKVRP 144
                         170       180
                  ....*....|....*....|....*..
gi 119618940 1158 Q-QRLL-LIDSPLHNTFLELWTMVHFL 1182
Cdd:cd18013   145 ViKRLIgLTGTPSPNGLMDLWAQIALL 171
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
480-728 7.35e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.53  E-value: 7.35e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   480 QSKRPRLEVGHQGVVFQHPGADAGVPLQQLMPTAQG------GMPPT---PQAAQLAGQRQSQQQYDPSTGPpvQNAASL 550
Cdd:pfam03154  168 QTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSApsvppqGSPATsqpPNQTQSTAAPHTLIQQTPTLHP--QRLPSP 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   551 HTPLPQLPGRLPPAGVPTAALSSALQFAQQPQvveaqtqLQIPVKTQQPNVPIPAPP----------SSQLPIPPSQ--- 617
Cdd:pfam03154  246 HPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPP-------MPHSLQTGPSHMQHPVPPqpfpltpqssQSQVPPGPSPaap 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   618 ----------PAQLALHVPTPGKVQVQASQLSSLPQMVASTRLPVDPAPPCPRPLPTSSTSSLAPVSGSG-----PGPSP 682
Cdd:pfam03154  319 gqsqqrihtpPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLSGPSPFQMNSnlpppPALKP 398
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*...
gi 119618940   683 ARSSPVNRPSSATNKALS--PVTSRTPGVVASAPTKPQSPAQNATSSQ 728
Cdd:pfam03154  399 LSSLSTHHPPSAHPPPLQlmPQSQQLPPPPAQPPVLTQSQSLPPPAAS 446
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
472-653 8.20e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.98  E-value: 8.20e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  472 MPSTGMAEQSKRPRLEvghqgvvfqhpgadagvPLQQLMPTAQGGMPPTPQAAqlagqrqSQQQYDPSTGPPvqnaaslh 551
Cdd:PRK07764  364 LPSASDDERGLLARLE-----------------RLERRLGVAGGAGAPAAAAP-------SAAAAAPAAAPA-------- 411
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  552 tplpqlPGRLPPAGVPTAALSSALQFAQQPQVVEAQTQLQIPVKTQQPNVPIPAPPSSQLPIPPSQPAQLALHVPTPGKV 631
Cdd:PRK07764  412 ------PAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPP 485
                         170       180
                  ....*....|....*....|..
gi 119618940  632 QVQASQLSSLPQMVASTRLPVD 653
Cdd:PRK07764  486 AAPAPAAAPAAPAAPAAPAGAD 507
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
499-736 1.21e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.48  E-value: 1.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  499 GADAGVPLQQLMPTAQggMPPTPQAAQLAGQRQSQQQYDPSTGPPVQNAASlhtPLPQLPGRLPPAGVPTAALSSALQFA 578
Cdd:PRK12323  369 GGGAGPATAAAAPVAQ--PAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAAR---AVAAAPARRSPAPEALAAARQASARG 443
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  579 QQPQVVEAQTQLQIPVKTQQPNVPIPAPPSSQLPIPPSQPAQLALHVPTPGKVqvqaSQLSSLPQMVAStrlpvdpappc 658
Cdd:PRK12323  444 PGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDP----PPWEELPPEFAS----------- 508
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  659 prplptsstsslAPVSGSGPGPSPARSSPVNRPS----SATNKALSPVTSRTPGVVASAPTKPQSPAQNATSSQDSSQDT 734
Cdd:PRK12323  509 ------------PAPAQPDAAPAGWVAESIPDPAtadpDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDM 576

                  ..
gi 119618940  735 LT 736
Cdd:PRK12323  577 FD 578
PHA03247 PHA03247
large tegument protein UL36; Provisional
538-774 1.44e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 1.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  538 PSTGPPVQNAASLHTPLPQ--LPGRLPPAGVPTAALSSALQFAQQ--PQVVEAQTQLQIPVKTQQPNVPIPaPPSSQL-- 611
Cdd:PHA03247 2686 RAARPTVGSLTSLADPPPPppTPEPAPHALVSATPLPPGPAAARQasPALPAAPAPPAVPAGPATPGGPAR-PARPPTta 2764
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  612 ----PIPPSQPAQLALHVPTPGKVQVQASQLSSLPQMVASTRLPVDPAPPCPRPLPTSSTSSLAPVSGSGPGPSPARSSP 687
Cdd:PHA03247 2765 gppaPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  688 VNRPSSATNKALSP-------VTSRTPGVVASAPTKP------QSPAQNATSSQDSSQDTLTEQITLENQVHQRIAELRK 754
Cdd:PHA03247 2845 PPPPSLPLGGSVAPggdvrrrPPSRSPAAKPAAPARPpvrrlaRPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPP 2924
                         250       260
                  ....*....|....*....|
gi 119618940  755 AglwSQRRLPKLQEAPRPKS 774
Cdd:PHA03247 2925 P---PPQPQPPPPPPPRPQP 2941
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
468-583 2.38e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 43.64  E-value: 2.38e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940   468 RQQAMPSTGMAEQSKRPRLEVGHQG---VVFQHPGAD-AGVPL----QQLMPTAQG------GMPPTPQAAQLAGQRQSQ 533
Cdd:TIGR01628  372 QDQFMQLQPRMRQLPMGSPMGGAMGqppYYGQGPQQQfNGQPLgwprMSMMPTPMGpggplrPNGLAPMNAVRAPSRNAQ 451
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 119618940   534 QQYDPSTGPPVQ-----NAASLHTPLPQlPGRLPPAGVPTAALSSALQFA---QQPQV 583
Cdd:TIGR01628  452 NAAQKPPMQPVMyppnyQSLPLSQDLPQ-PQSTASQGGQNKKLAQVLASAtpqMQKQV 508
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
498-731 2.40e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 43.62  E-value: 2.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  498 PGADAGVPLQQLMPTAQGGMPPTPQAAQLAGQRQSQ--QQYDPSTGPPVQNAASLHTPLPQLPGRLPPAG--VPTAALSS 573
Cdd:PHA03307   73 PGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPpgPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGspGPPPAASP 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  574 ALQFAQQPQVVEAQT---QLQIPVKTQQPNVPIPAPPSSQLPI------------PPSQPAQLALHVPTPGKVQVQASQL 638
Cdd:PHA03307  153 PAAGASPAAVASDAAssrQAALPLSSPEETARAPSSPPAEPPPstppaaasprppRRSSPISASASSPAPAPGRSAADDA 232
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  639 SSLPQMVASTRLPV---DPAPPCPRPLPTSSTSSLAPVSGSGPGPSPARSSPVnRPSSATNKALSPVTSRTPGVVASAPT 715
Cdd:PHA03307  233 GASSSDSSSSESSGcgwGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPA-SSSSSPRERSPSPSPSSPGSGPAPSS 311
                         250
                  ....*....|....*.
gi 119618940  716 KPQSPAQNATSSQDSS 731
Cdd:PHA03307  312 PRASSSSSSSRESSSS 327
PRK10263 PRK10263
DNA translocase FtsK; Provisional
2924-3048 2.64e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.54  E-value: 2.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2924 TPGAQQKVAYAAQPALKTQFlttpiSQAQKLAGAQQVQTQIQ--VAKLPQVVQQQTPVASIQQVASASQQASPQTVALTQ 3001
Cdd:PRK10263  746 TPIVEPVQQPQQPVAPQQQY-----QQPQQPVAPQPQYQQPQqpVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 820
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 119618940 3002 ATAAGQQVQMIPAVTATAQVVQQKLIQQQVVTTA-SAPLQTPGAPNPA 3048
Cdd:PRK10263  821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGdSRPLHKPTTPLPS 868
rne PRK10811
ribonuclease E; Reviewed
2947-3079 3.61e-03

ribonuclease E; Reviewed


Pssm-ID: 236766 [Multi-domain]  Cd Length: 1068  Bit Score: 43.10  E-value: 3.61e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940 2947 PISQAQKLAGAQQVQTQIQVAkLPQVVQQQTPVASIQQVASASQQASPQTVALTQATAAGQQVQMIPAVTATAQVVQQKL 3026
Cdd:PRK10811  846 PVVRPQDVQVEEQREAEEVQV-QPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPV 924
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940 3027 IQQ-QVVT--TASAPLQTPGAPNPAQVPASSDSPSQQPKLQMRVPAVRLKTPTKPP 3079
Cdd:PRK10811  925 TEQpQVITesDVAVAQEVAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPA 980
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
498-648 3.64e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.78  E-value: 3.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  498 PGADAGVPlqqlmPTAQGGMPPTPQAAQLAGQRQSQQQYDPSTGPPV----QNAASLHTPLPQLPGRLPPAGVPTAALSS 573
Cdd:PRK14951  366 PAAAAEAA-----APAEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPaaaaSAPAAPPAAAPPAPVAAPAAAAPAAAPAA 440
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 119618940  574 ALQFAQQPQVVEAQTQ---LQIPVKTQqpnvPIPAPPSSQlPIPPSQPAQLALHVPTPGKVQVQAsqlssLPQMVAST 648
Cdd:PRK14951  441 APAAVALAPAPPAQAApetVAIPVRVA----PEPAVASAA-PAPAAAPAAARLTPTEEGDVWHAT-----VQQLAAAE 508
PRK10263 PRK10263
DNA translocase FtsK; Provisional
470-680 4.06e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.15  E-value: 4.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  470 QAMPSTGMAEQSKRPRLEVGHQGVVFQHPGADAGVPLQQLMPtaqggmPPTPQAAQLAGQRQSQQQYDPSTGPPVQNAAs 549
Cdd:PRK10263  361 QPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQ------PQQPYYAPAAEQPAQQPYYAPAPEQPAQQPY- 433
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  550 lHTPLPQLPGRLPPAGVPTAALSSALQFAQQPQVVEAQTQLQIPVKTQQPNVPipaPPSSQLPIP------PSQPA---- 619
Cdd:PRK10263  434 -YAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQPVE---QQPVVEPEPvveetkPARPPlyyf 509
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  620 ------------QLA---------LHVPTPGKVQVQASQLSSLPQM-VASTRLPVDPAPPCPRPLPTSSTSSLAPV---- 673
Cdd:PRK10263  510 eeveekrarereQLAawyqpipepVKEPEPIKSSLKAPSVAAVPPVeAAAAVSPLASGVKKATLATGAAATVAAPVfsla 589

                  ....*..
gi 119618940  674 SGSGPGP 680
Cdd:PRK10263  590 NSGGPRP 596
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
2816-3068 5.54e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 42.30  E-value: 5.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  2816 QLQAQGQMQTQAPQPAQVALAKPPVVSVPAAVVSSPGVTTLPMNVAGISVAIGQPQKAAGQTVvaqPVHMQQllklKQQA 2895
Cdd:pfam09606  222 QAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPM---GPPGQQ----PGAM 294
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  2896 VQQQKAIQPQAAQGPAAVQQKITAQQITTPGAQQKVAYAAQPAlktqflttpiSQAQKLAGAQQVQTQIQVAKLPQVVQQ 2975
Cdd:pfam09606  295 PNVMSIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQG----------GQVVALGGLNHLETWNPGNFGGLGANP 364
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  2976 QtpvasiqqvasasQQASPQTVALTQATAAGQQVQMipavtATAQVVQQKliQQQVVTTASAPLQTPGA-------PNPA 3048
Cdd:pfam09606  365 M-------------QRGQPGMMSSPSPVPGQQVRQV-----TPNQFMRQS--PQPSVPSPQGPGSQPPQshpggmiPSPA 424
                          250       260
                   ....*....|....*....|
gi 119618940  3049 QVPASSDSPSQQPKLQMRVP 3068
Cdd:pfam09606  425 LIPSPSPQMSQQPAQQRTIG 444
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
500-725 6.45e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 6.45e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  500 ADAGVPLQQLMPTAQGGMPPTPQAAQLAGQRQSQQQYDPSTGPPvqNAASLHTPLPQLPGRLPPAGVPTAALSSALQFAQ 579
Cdd:PRK07764  589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGA--AAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGG 666
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  580 QPQVVEAQTqlqiPVKTQQPNVPIPAPPSSQLPIPPSQPAQLALHVPTPGKVQVQASQlsslPQMVASTRLPVDPAPPCP 659
Cdd:PRK07764  667 DGWPAKAGG----AAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQ----PPQAAQGASAPSPAADDP 738
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 119618940  660 RPLPtsSTSSLAPVSGSGPGPSPARSSPVNRPSSATNKALSPVTSRTPGVVASAPTKPQSPAQNAT 725
Cdd:PRK07764  739 VPLP--PEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAE 802
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
483-721 7.60e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 7.60e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  483 RPRLEVGHQGVVFQHPGADAGVPLQQLMPTAQGGMPPTPQAAqlagqrqsqqqydPSTGPPVQNAASlhtPLPQLPGRLP 562
Cdd:PRK12323  364 RPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAA-------------PAAAPAAAAAAR---AVAAAPARRS 427
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  563 PAGVPTAALSSALQFAQQPQVVEAQTQLQIPVKTQQPNVPIPAPPSSQLPIPPSQPAQLALHVPTPGKVqvqaSQLSSLP 642
Cdd:PRK12323  428 PAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDP----PPWEELP 503
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 119618940  643 QMVASTRLPVDPAPPCPRPLPTSSTSSLAPVSGSGPGPSPARSSPvnrPSSATNKALSPVTSRTPGVVASAPTKPQSPA 721
Cdd:PRK12323  504 PEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAA---PAPRAAAATEPVVAPRPPRASASGLPDMFDG 579
PHA03378 PHA03378
EBNA-3B; Provisional
506-772 9.08e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 41.98  E-value: 9.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  506 LQQLMPTAQGGMPPTPQAAQLAGQrQSQQQYDPSTGPPVQNAASLHtPLPQLPGRLPPAGVPTAALSSALQFAQQPQVVE 585
Cdd:PHA03378  585 LASSAPSYAQTPWPVPHPSQTPEP-PTTQSHIPETSAPRQWPMPLR-PIPMRPLRMQPITFNVLVFPTPHQPPQVEITPY 662
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  586 AQTQLQIPVKTQQPNvpiPAPPSSQLPI---------PPSQPAQLALHVPTPGKVQVQASQLSSLPQMVASTRLPVDPAP 656
Cdd:PHA03378  663 KPTWTQIGHIPYQPS---PTGANTMLPIqwapgtmqpPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAA 739
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 119618940  657 PCPRPLPTSSTSSLAPVSGSGPGPSPARSSPVNRPSSATNKALSPVTSRTPgvvASAPTKPQSPAQNATSSQDSSQDTLT 736
Cdd:PHA03378  740 APGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQQRP---RGAPTPQPPPQAGPTSMQLMPRAAPG 816
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 119618940  737 EQITLENQVHQRIAELRKAG---LWSQRRLPKLQEA-PRP 772
Cdd:PHA03378  817 QQGPTKQILRQLLTGGVKRGrpsLKKPAALERQAAAgPTP 856
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH