|
Name |
Accession |
Description |
Interval |
E-value |
| PRK10263 super family |
cl35903 |
DNA translocase FtsK; Provisional |
288-477 |
2.21e-07 |
|
DNA translocase FtsK; Provisional The actual alignment was detected with superfamily member PRK10263:
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.09 E-value: 2.21e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 288 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 367
Cdd:PRK10263 327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 368 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 447
Cdd:PRK10263 397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
|
170 180 190
....*....|....*....|....*....|
gi 1720400458 448 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 477
Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| PHA03247 super family |
cl33720 |
large tegument protein UL36; Provisional |
46-483 |
1.58e-06 |
|
large tegument protein UL36; Provisional The actual alignment was detected with superfamily member PHA03247:
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 1.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 46 PPQASLSIPVSRGLPQQSSPQQLLSLQGLHSTSLLNGPMLQRALLLQQLQGLDQFAMPPATYDGASLTMPTATLGNLRAF 125
Cdd:PHA03247 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL 2698
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 126 NVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPNRKTVPledrED 205
Cdd:PHA03247 2699 ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP----AA 2774
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 206 PTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAkrcrrvrikgidhhnwlfaylwifASSEESTEK 285
Cdd:PHA03247 2775 PAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPA------------------------AVLAPAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 286 GPTGQPQARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVE 359
Cdd:PHA03247 2819 PPAASPAGPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESF 2898
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 360 PQVPSQPPWQLQPretdppnqaqaqtqpqplwqaqsqkQAQTQAHPQVPTQAQSQEQTSEKTQDQPQtwpqgsvPPPEQA 439
Cdd:PHA03247 2899 ALPPDQPERPPQP-------------------------QAPPPPQPQPQPPPPPQPQPPPPPPPRPQ-------PPLAPT 2946
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1720400458 440 SGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 483
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
646-670 |
1.87e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization. :
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 42.16 E-value: 1.87e-05
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
762-795 |
1.66e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins. :
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.85 E-value: 1.66e-03
10 20 30
....*....|....*....|....*....|....
gi 1720400458 762 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 795
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_SF super family |
cl15257 |
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ... |
615-682 |
2.90e-03 |
|
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions. The actual alignment was detected with superfamily member cd10442:
Pssm-ID: 472790 Cd Length: 92 Bit Score: 37.73 E-value: 2.90e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 615 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 682
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
288-477 |
2.21e-07 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.09 E-value: 2.21e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 288 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 367
Cdd:PRK10263 327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 368 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 447
Cdd:PRK10263 397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
|
170 180 190
....*....|....*....|....*....|
gi 1720400458 448 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 477
Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
46-483 |
1.58e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 1.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 46 PPQASLSIPVSRGLPQQSSPQQLLSLQGLHSTSLLNGPMLQRALLLQQLQGLDQFAMPPATYDGASLTMPTATLGNLRAF 125
Cdd:PHA03247 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL 2698
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 126 NVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPNRKTVPledrED 205
Cdd:PHA03247 2699 ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP----AA 2774
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 206 PTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAkrcrrvrikgidhhnwlfaylwifASSEESTEK 285
Cdd:PHA03247 2775 PAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPA------------------------AVLAPAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 286 GPTGQPQARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVE 359
Cdd:PHA03247 2819 PPAASPAGPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESF 2898
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 360 PQVPSQPPWQLQPretdppnqaqaqtqpqplwqaqsqkQAQTQAHPQVPTQAQSQEQTSEKTQDQPQtwpqgsvPPPEQA 439
Cdd:PHA03247 2899 ALPPDQPERPPQP-------------------------QAPPPPQPQPQPPPPPQPQPPPPPPPRPQ-------PPLAPT 2946
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1720400458 440 SGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 483
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
646-670 |
1.87e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 42.16 E-value: 1.87e-05
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
287-416 |
5.39e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.87 E-value: 5.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 287 PTGQPQARVQPQTQmtAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQP 366
Cdd:pfam09770 222 PAAPPAQQAQQQQQ--FPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP 299
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720400458 367 PWQLQ-------PRETDPPNQAQAQTQPQPLWQAQSQ----KQAQTQAHPQVPTQAQSQEQ 416
Cdd:pfam09770 300 TQILQnpnrlsaARVGYPQNPQPGVQPAPAHQAHRQQgsfgRQAPIITHPQQLAQLSEEEK 360
|
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
762-795 |
1.66e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.85 E-value: 1.66e-03
10 20 30
....*....|....*....|....*....|....
gi 1720400458 762 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 795
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_PLEs |
cd10442 |
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ... |
615-682 |
2.90e-03 |
|
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, RT domain and endonuclease (EN) domain, jointed by a linker region of variable length. Both of these two domains are functionally active.
Pssm-ID: 198389 Cd Length: 92 Bit Score: 37.73 E-value: 2.90e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 615 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 682
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
288-477 |
2.21e-07 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 55.09 E-value: 2.21e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 288 TGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrqaqtqtSPEHLAPQQDQVEPQVPSQPP 367
Cdd:PRK10263 327 TTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAP----------APEGYPQQSQYAQPAVQYNEP 396
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 368 WQlQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPPEQASGPACATE 447
Cdd:PRK10263 397 LQ-QPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQE 475
|
170 180 190
....*....|....*....|....*....|
gi 1720400458 448 PQlsSHAAEAGSDPDKALPEPVSAQSSEDR 477
Cdd:PRK10263 476 PL--YQQPQPVEQQPVVEPEPVVEETKPAR 503
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
46-483 |
1.58e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 52.25 E-value: 1.58e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 46 PPQASLSIPVSRGLPQQSSPQQLLSLQGLHSTSLLNGPMLQRALLLQQLQGLDQFAMPPATYDGASLTMPTATLGNLRAF 125
Cdd:PHA03247 2619 PDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSL 2698
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 126 NVTAPSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPNRKTVPledrED 205
Cdd:PHA03247 2699 ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPP----AA 2774
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 206 PTEGSEEATElqmdtcedqdslVGPDSMLSEPQVPEPEPFETLEPPAkrcrrvrikgidhhnwlfaylwifASSEESTEK 285
Cdd:PHA03247 2775 PAAGPPRRLT------------RPAVASLSESRESLPSPWDPADPPA------------------------AVLAPAAAL 2818
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 286 GPTGQPQARVQPQT--QMTAPKQTQTPDRLPEPPEVQMLP----RIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVE 359
Cdd:PHA03247 2819 PPAASPAGPLPPPTsaQPTAPPPPPGPPPPSLPLGGSVAPggdvRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESF 2898
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 360 PQVPSQPPWQLQPretdppnqaqaqtqpqplwqaqsqkQAQTQAHPQVPTQAQSQEQTSEKTQDQPQtwpqgsvPPPEQA 439
Cdd:PHA03247 2899 ALPPDQPERPPQP-------------------------QAPPPPQPQPQPPPPPQPQPPPPPPPRPQ-------PPLAPT 2946
|
410 420 430 440
....*....|....*....|....*....|....*....|....
gi 1720400458 440 SGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEDRSREASA 483
Cdd:PHA03247 2947 TDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPA 2990
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
130-551 |
1.91e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 51.86 E-value: 1.91e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 130 PSLAAPSLTPPQMVTPNLQQFFPQATRQSLLGPPPVGVPINPSQLNHSG-------RNTQKQARTPSSTTPNRKTVPLED 202
Cdd:PHA03247 2609 RGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPApgrvsrpRRARRLGRAAQASSPPQRPRRRAA 2688
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 203 RedPTegseeatelqmdtcedqdslVGPDSMLSEPQVPEPEPfETLEPPAKRCRRVRIKGIDHHNWLFAYLWIFASSEES 282
Cdd:PHA03247 2689 R--PT--------------------VGSLTSLADPPPPPPTP-EPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVP 2745
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 283 TEKGPTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQ-TQTSPEHLAPQQDQVEPQ 361
Cdd:PHA03247 2746 AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADpPAAVLAPAAALPPAASPA 2825
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 362 VPSQPPWQLQPRETDPPNQAQAQTQPQPLWQA------------QSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWP 429
Cdd:PHA03247 2826 GPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrppsrSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQP 2905
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 430 QgsvPPPEQASGPACATEPQLSSHAAEAGSDPDKALPEPVSAQSSEdrsrEASAGGLDLGECEKRAGEMlgmwgAGSSLK 509
Cdd:PHA03247 2906 E---RPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTD----PAGAGEPSGAVPQPWLGAL-----VPGRVA 2973
|
410 420 430 440
....*....|....*....|....*....|....*....|..
gi 1720400458 510 VTILQSSNSRAFNTTPLTSGPRPGDstSATPAIASTPSKQSL 551
Cdd:PHA03247 2974 VPRFRVPQPAPSREAPASSTPPLTG--HSLSRVSSWASSLAL 3013
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
277-488 |
4.77e-06 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 50.37 E-value: 4.77e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 277 ASSEESTEKGPTGQPQARVQPQ--TQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQ 354
Cdd:PRK07764 595 AGGEGPPAPASSGPPEEAARPAapAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 355 QDQVEPQVPSQPPWQLQPRETDPPNQAQAQTQpqplWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQ----DQPQTWPQ 430
Cdd:PRK07764 675 GAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPA----ATPPAGQADDPAAQPPQAAQGASAPSPAADDPvplpPEPDDPPD 750
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 1720400458 431 GSVPPPEQASGPACATEPQLSSHAAEA-GSDPDKALPEPVSAQSSEDRsREASAGGLDL 488
Cdd:PRK07764 751 PAGAPAQPPPPPAPAPAAAPAAAPPPSpPSEEEEMAEDDAPSMDDEDR-RDAEEVAMEL 808
|
|
| zf-C2H2_jaz |
pfam12171 |
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, ... |
646-670 |
1.87e-05 |
|
Zinc-finger double-stranded RNA-binding; This domain family is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localize in the nucleus, particularly the nucleolus. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localization.
Pssm-ID: 432381 [Multi-domain] Cd Length: 27 Bit Score: 42.16 E-value: 1.87e-05
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
139-461 |
3.25e-05 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 48.01 E-value: 3.25e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 139 PPQMVTPNLQqffPQATRQSLLGPPPVGVPINPSQlnhsgrnTQKQARTPSSTTPNRKTVPLEDREDPtEGSEEATELQM 218
Cdd:PHA03247 2551 PPPPLPPAAP---PAAPDRSVPPPRPAPRPSEPAV-------TSRARRPDAPPQSARPRAPVDDRGDP-RGPAPPSPLPP 2619
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 219 DTCEDQDSLVGPDSMLSEPQVPEPEPFETLEPP--------AKRCRRVRIKGidhhnwlfaylwiFASSEESTEKGPTgq 290
Cdd:PHA03247 2620 DTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPrddpapgrVSRPRRARRLG-------------RAAQASSPPQRPR-- 2684
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 291 PQARVQPQTQMTAPKQTQTPDRLPEPPevqmlPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQ-----VEPQVPSQ 365
Cdd:PHA03247 2685 RRAARPTVGSLTSLADPPPPPPTPEPA-----PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAgpatpGGPARPAR 2759
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 366 PPWQLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTW-PQGSVPPPEQA--SGP 442
Cdd:PHA03247 2760 PPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAAsPAGPLPPPTSAqpTAP 2839
|
330
....*....|....*....
gi 1720400458 443 ACATEPQLSSHAAEAGSDP 461
Cdd:PHA03247 2840 PPPPGPPPPSLPLGGSVAP 2858
|
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
281-439 |
3.43e-05 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 46.98 E-value: 3.43e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 281 ESTEKGPTGQPQARVQPQTQMTAPKQT---QTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQ 357
Cdd:PRK10927 101 EPSAGGEVKTPEQLTPEQRQLLEQMQAdmrQQPTQLVEVPWNEQTPEQRQQTLQRQRQAQQLAEQQRLAQQSRTTEQSWQ 180
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 358 VEPQVPSQPPWQLQPRetdPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPT--QAQSQEQTSEKTQDQPQTWPQGSVPP 435
Cdd:PRK10927 181 QQTRTSQAAPVQAQPR---QSKPASTQQPYQDLLQTPAHTTAQSKPQQAAPVtrAADAPKPTAEKKDERRWMVQCGSFRG 257
|
....
gi 1720400458 436 PEQA 439
Cdd:PRK10927 258 AEQA 261
|
|
| PRK14949 |
PRK14949 |
DNA polymerase III subunits gamma and tau; Provisional |
278-425 |
5.19e-05 |
|
DNA polymerase III subunits gamma and tau; Provisional
Pssm-ID: 237863 [Multi-domain] Cd Length: 944 Bit Score: 47.03 E-value: 5.19e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 278 SSEESTEKGPTGQPQARVQPQTQmTAPKQTQTPDRLPEPPEVQMLPR--IQPQALQIQTQPKLLRQAQTQTSPEHLAPQQ 355
Cdd:PRK14949 639 SSADRKPKTPPSRAPPASLSKPA-SSPDASQTSASFDLDPDFELATHqsVPEAALASGSAPAPPPVPDPYDRPPWEEAPE 717
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 356 DQVEPQVPSQPPwqlqpRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQP 425
Cdd:PRK14949 718 VASANDGPNNAA-----EGNLSESVEDASNSELQAVEQQATHQPQVQAEAQSPASTTALTQTSSEVQDTE 782
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
316-436 |
1.51e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 45.85 E-value: 1.51e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 316 PPEVQMLPRIQPQAlqiQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQPPWQLQPRETDPPNQAQAqtqpqplwqAQS 395
Cdd:PRK10263 740 PHEPLFTPIVEPVQ---QPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQY---------QQP 807
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1720400458 396 QKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVPPP 436
Cdd:PRK10263 808 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHP 848
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
287-419 |
2.81e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 44.47 E-value: 2.81e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 287 PTGQPQARVQPQTQMTAPKQTQTPdrlPEPPEVQMLPRIQPQALQIQTQpklLRQAQTQTSPEHLAPQQDQVEPQVPSQP 366
Cdd:PRK07994 383 ATAAPTAAVAPPQAPAVPPPPASA---PQQAPAVPLPETTSQLLAARQQ---LQRAQGATKAKKSEPAAASRARPVNSAL 456
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 1720400458 367 PW--QLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSE 419
Cdd:PRK07994 457 ERlaSVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPE 511
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
290-375 |
2.89e-04 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 44.69 E-value: 2.89e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 290 QPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrQAQTQTSPEHLAPQQDQVEPQVPSQPpwq 369
Cdd:PRK10263 767 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQPVAP--- 839
|
....*.
gi 1720400458 370 lQPRET 375
Cdd:PRK10263 840 -QPQDT 844
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
287-416 |
5.39e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 43.87 E-value: 5.39e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 287 PTGQPQARVQPQTQmtAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPKLLRQAQTQTSPEHLAPQQDQVEPQVPSQP 366
Cdd:pfam09770 222 PAAPPAQQAQQQQQ--FPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQAQQFHQQPPPVPVQP 299
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720400458 367 PWQLQ-------PRETDPPNQAQAQTQPQPLWQAQSQ----KQAQTQAHPQVPTQAQSQEQ 416
Cdd:pfam09770 300 TQILQnpnrlsaARVGYPQNPQPGVQPAPAHQAHRQQgsfgRQAPIITHPQQLAQLSEEEK 360
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
287-378 |
1.41e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 42.38 E-value: 1.41e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 287 PTGQPQARVQPQTQMTAPKQTQTPDRLPEPPEVQMLPRIQPQALQIQTQPkllrQAQTQTSPEHLAPQQDQVEPQVP--S 364
Cdd:PRK10263 751 PVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP----QPQYQQPQQPVAPQPQYQQPQQPvaP 826
|
90
....*....|....
gi 1720400458 365 QPPWQlQPRETDPP 378
Cdd:PRK10263 827 QPQYQ-QPQQPVAP 839
|
|
| ZnF_U1 |
smart00451 |
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ... |
762-795 |
1.66e-03 |
|
U1-like zinc finger; Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Pssm-ID: 197732 [Multi-domain] Cd Length: 35 Bit Score: 36.85 E-value: 1.66e-03
10 20 30
....*....|....*....|....*....|....
gi 1720400458 762 GYVCQICHKFYDSNSELRlSHCKSLAHFENLQKY 795
Cdd:smart00451 3 GFYCKLCNVTFTDEISVE-AHLKGKKHKKNVKKR 35
|
|
| GIY-YIG_PLEs |
cd10442 |
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This ... |
615-682 |
2.90e-03 |
|
Catalytic GIY-YIG endonuclease domain of penelope-like elements and similar proteins; This model corresponds to the EN domain of PLEs that contains catalytic module of the GIY-YIG endonucleases of group I bacterial/organellar introns, as well as bacterial UvrC DNA repair proteins. It can cleave DNA with low nucleotide sequence specificity. However, the PLEs EN domain is distinct from other GIY-YIG endonucleases by the presence of a well-conserved CCHH motif (CX(2-7)CX(33-39)HX(3-5)H, X can be any residue). The role of the CCHH motif has not yet been identified. Penelope-like elements (PLEs) represent a novel class of eukaryotic retroelements, which do not belong to either long terminal repeat (LTR) retrotransposons or non-LTR retrotransposons (often called LINEs), but instead form a sister clade to telomerase reverse transcriptases (TERTs), highly specialized non-mobile reverse transcriptases (RTs) which are responsible for the addition of telomeric repeats to the ends of eukaryotic chromosomes. The single open reading frame (ORF) encoded by PLE consists of two principal domains, RT domain and endonuclease (EN) domain, jointed by a linker region of variable length. Both of these two domains are functionally active.
Pssm-ID: 198389 Cd Length: 92 Bit Score: 37.73 E-value: 2.90e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 615 WCNTCQVYYVGDLIQ--HRRTQEHKVAKQSlrpfcticNRYFKTPrkFVEHVKSQGHKDKAQELKTLEKE 682
Cdd:cd10442 6 PCPKCGLVYIGETKRplRERMKEHRRAIRL--------SGTKKSA--VAKHFNEEGHSIDSDRVRILDKE 65
|
|
| PHA03379 |
PHA03379 |
EBNA-3A; Provisional |
130-442 |
3.14e-03 |
|
EBNA-3A; Provisional
Pssm-ID: 223066 [Multi-domain] Cd Length: 935 Bit Score: 41.20 E-value: 3.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 130 PSLAAPSLTPPQMVTPNLQQFFP--------QATRQSLLGPPPVGVPINPSQLNHSGRNTQKQARTPSSTTPNRKTVPLE 201
Cdd:PHA03379 508 PWEASLSQVPGVAFAPVMPQPMPvepvpvptVALERPVCPAPPLIAMQGPGETSGIVRVRERWRPAPWTPNPPRSPSQMS 587
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 202 DREDPTEGSEEATELQ--MDTCEDQDSLVGPDSMLSEPQVPEPEPFETlEPPAKRCRRVRIKGIdhhnwlfaylwifass 279
Cdd:PHA03379 588 VRDRLARLRAEAQPYQasVEVQPPQLTQVSPQQPMEYPLEPEQQMFPG-SPFSQVADVMRAGGV---------------- 650
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 280 eestekgPTGQPQARVQPQTQMTAPKQTQTPDR---LPEPPevqmLPRIQPQALQIqtqpKLLRQAQTQTSPEHLAPQQD 356
Cdd:PHA03379 651 -------PAMQPQYFDLPLQQPISQGAPLAPLRasmGPVPP----VPATQPQYFDI----PLTEPINQGASAAHFLPQQP 715
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 357 QVEPQVPSQPPWQ-LQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSqeqtsektqdQPQTWP-QGSVP 434
Cdd:PHA03379 716 MEGPLVPERWMFQgATLSQSVRPGVAQSQYFDLPLTQPINHGAPAAHFLHQPPMEGPW----------VPEQWMfQGAPP 785
|
....*...
gi 1720400458 435 PPEQASGP 442
Cdd:PHA03379 786 SQGTDVVQ 793
|
|
| PRK10927 |
PRK10927 |
cell division protein FtsN; |
292-447 |
7.04e-03 |
|
cell division protein FtsN;
Pssm-ID: 236797 [Multi-domain] Cd Length: 319 Bit Score: 39.66 E-value: 7.04e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 292 QARVQPQTQMTAPKQTQTPDRLpEPPEVQMLPRIQPQALQIQTQpkLLRQAQTQTSPEHLAP--QQDQVEPQVPSQPPWQ 369
Cdd:PRK10927 93 QPGVRAPTEPSAGGEVKTPEQL-TPEQRQLLEQMQADMRQQPTQ--LVEVPWNEQTPEQRQQtlQRQRQAQQLAEQQRLA 169
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 370 LQPRETDPPNQAQAQTQPqplwqaQSQKQAQTQAHPQVPTQAQSQE--QTSEKTQDQPQtwPQGSVPPPEQASGPACATE 447
Cdd:PRK10927 170 QQSRTTEQSWQQQTRTSQ------AAPVQAQPRQSKPASTQQPYQDllQTPAHTTAQSK--PQQAAPVTRAADAPKPTAE 241
|
|
| PTZ00449 |
PTZ00449 |
104 kDa microneme/rhoptry antigen; Provisional |
284-469 |
7.99e-03 |
|
104 kDa microneme/rhoptry antigen; Provisional
Pssm-ID: 185628 [Multi-domain] Cd Length: 943 Bit Score: 40.06 E-value: 7.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 284 EKGPTGQPQARVQPQtqmtAPKQTQTPDRlPEPPEVQMLPRI--------QPQALQIQTQPKLLRQAQTQTSPEHLAPQQ 355
Cdd:PTZ00449 566 EHKPSKIPTLSKKPE----FPKDPKHPKD-PEEPKKPKRPRSaqrptrpkSPKLPELLDIPKSPKRPESPKSPKRPPPPQ 640
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720400458 356 DQVEPQVPSQPPWQLQPRETDPPNQAQAQTQPQPLWQAQSQKQAQTQAHPQVPTQAQSQEQTSEKTQDQPQTWPQGSVP- 434
Cdd:PTZ00449 641 RPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRp 720
|
170 180 190
....*....|....*....|....*....|....*.
gi 1720400458 435 -PPEQASGPACATEPQlsshaaeagSDPDKALPEPV 469
Cdd:PTZ00449 721 lPPKLPRDEEFPFEPI---------GDPDAEQPDDI 747
|
|
|