View
Concise Results
Standard Results
Full Results
janus kinase and microtubule-interacting protein 2 isoform X4 [Canis lupus familiaris]
Protein Classification
List of domain hits
Name
Accession
Description
Interval
E-value
JAKMIP_CC3
pfam16034
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP ...
339-532
9.15e-70
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP family (Janus kinase and microtubule-interacting proteins) and is predicted to be a coiled coil. It interacts with the Janus family kinases Tyk2 and Jak1.
:Pssm-ID: 464986 [Multi-domain]
Cd Length: 199
Bit Score: 227.07
E-value: 9.15e-70
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 339 D R E KLIR R RK H RR SS K PI K RP -- V LDP F I GYDE D - S M DSETSS MA -- SF RTD R TPATPD D DLDE SL AAEE S ELRFRQLT K 413
Cdd:pfam16034 1 E R D KLIR A RK Q RR KE K KK K LR dm V VET F F GYDE E a S V DSETSS LS yn TD RTD S TPATPD E DLDE GV AAEE A ELRFRQLT R 80
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 414 EYQALQRAYALLQEQ T GG II DAEREAK AQ EQLQAE VL R YKAK IEDLE AT LAQKGQDS H WVE D KQ LF I K RNQEL L EKI EKQ 493
Cdd:pfam16034 81 EYQALQRAYALLQEQ S GG TL DAEREAK TR EQLQAE LR R AQSR IEDLE SA LAQKGQDS K WVE E KQ AL I R RNQEL V EKI RIM 160
170 180 190
....*....|....*....|....*....|....*....
gi 1952702176 494 E A E NH RL QQ E L QDARDQNELLEFR N LELEERERRSP PF N 532
Cdd:pfam16034 161 E Q E EG RL KN E I QDARDQNELLEFR I LELEERERRSP AI N 199
Smc super family
cl34174
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
13-530
3.29e-08
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
The actual alignment was detected with superfamily member COG1196 :Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 57.25
E-value: 3.29e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 13 EA LIVA L QAAN E D L RTK L TDI Q I E LHQEKSKVSK LE REKTQEAK R I RELE Q R khtvlvtelkaklh E E KMK E LQ A VR E NL 92
Cdd:COG1196 266 EA ELEE L RLEL E E L ELE L EEA Q A E EYELLAELAR LE QDIARLEE R R RELE E R -------------- L E ELE E EL A EL E EE 331
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 93 IKQH E Q E MSRTVKLRDGEIQR L KS A LCA L RDGSSDKVRTALTIEAR EE ARKLFDA E R L KL L QEI A D L KTAKKQVD EA LSN 172
Cdd:COG1196 332 LEEL E E E LEELEEELEEAEEE L EE A EAE L AEAEEALLEAEAELAEA EE ELEELAE E L L EA L RAA A E L AAQLEELE EA EEA 411
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 173 MIQADKIKAGD L RSEHQSHQ E A iskikw E S E RDIRRLD L RRNQKRI AEL NATIRK L EDRNTL L GD E RNE L LKRVR E tekq 252
Cdd:COG1196 412 LLERLERLEEE L EELEEALA E L ------ E E E EEEEEEA L EEAAEEE AEL EEEEEA L LELLAE L LE E AAL L EAALA E ---- 481
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 253 ckp LLE RNKCL A K R NDE L MVSLQRM E EK L KA V tkensem REKITSHPPLKKLKSLND L DQANEEQ E TEFLKLQVIEQ QNI 332
Cdd:COG1196 482 --- LLE ELAEA A A R LLL L LEAEADY E GF L EG V ------- KAALLLAGLRGLAGAVAV L IGVEAAY E AALEAALAAAL QNI 551
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 333 ID E LTRDREKL I RRR K HRRS ------------- SKPIKRPVLDPF IG YDE D SMD S ETSSMASFRTDRTPATPDDD L DE sl 399
Cdd:COG1196 552 VV E DDEVAAAA I EYL K AAKA gratflpldkira RAALAAALARGA IG AAV D LVA S DLREADARYYVLGDTLLGRT L VA -- 629
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 400 A AE E SE LR FRQLTKEYQALQRAYALLQEQT G GIIDAE R EAKAQEQ L Q AE VLRYKAKIEDL E AT L AQKGQDSHWV E DKQLF 479
Cdd:COG1196 630 A RL E AA LR RAVTLAGRLREVTLEGEGGSAG G SLTGGS R RELLAAL L E AE AELEELAERLA E EE L ELEEALLAEE E EEREL 709
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|.
gi 1952702176 480 IKRNQ E L LE KIEKQ EA ENHR L QQ E LQDARDQNELL E FRNL E LEER E RRS PP 530
Cdd:COG1196 710 AEAEE E R LE EELEE EA LEEQ L EA E REELLEELLEE E ELLE E EALE E LPE PP 760
SMC_prok_B super family
cl37069
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
402-733
6.42e-04
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
The actual alignment was detected with superfamily member TIGR02168 :Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 43.51
E-value: 6.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 402 EE S E LRFRQ L T K EYQA L QRAYAL L Q E QTGGIIDAER E AKA Q - EQ L QAEVL R YK A KI E D LE ATL AQ KGQDSHWV E D - KQLF 479
Cdd:TIGR02168 687 EE L E EKIAE L E K ALAE L RKELEE L E E ELEQLRKELE E LSR Q i SA L RKDLA R LE A EV E Q LE ERI AQ LSKELTEL E A e IEEL 766
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 480 IK R NQ E LL E KIEKQ EAE NHR L QQELQDARDQNEL L EFRNL EL EERERR sppfn L QIH p FSDGVSA L QIYCMKEGVKDVNI 559
Cdd:TIGR02168 767 EE R LE E AE E ELAEA EAE IEE L EAQIEQLKEELKA L REALD EL RAELTL ----- L NEE - AANLRER L ESLERRIAATERRL 840
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 560 P DL IK Q L ----- DI LGDNGNL r N E EQVA I IQASTV L SLAEKWIQQI E G A E A A L HQKMM EL ESDMEQFCKIKGY L EE EL DY 634
Cdd:TIGR02168 841 E DL EE Q I eelse DI ESLAAEI - E E LEEL I EELESE L EALLNERASL E E A L A L L RSELE EL SEELRELESKRSE L RR EL EE 919
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 635 RKQA L D Q AYM R IQE LE ATLY N ALQQ etvikfgel LSE KQQEE L RT A VE klrrqm L RKSR E Y D CQILQE R MEL L QQAHQRI 714
Cdd:TIGR02168 920 LREK L A Q LEL R LEG LE VRID N LQER --------- LSE EYSLT L EE A EA ------ L ENKI E D D EEEARR R LKR L ENKIKEL 984
330 340 350
....*....|....*....|....*....|
gi 1952702176 715 ----------- RD L EDKT D IQKR Q IK DL E E 733
Cdd:TIGR02168 985 gpvnlaaieey EE L KERY D FLTA Q KE DL T E 1014
Name
Accession
Description
Interval
E-value
JAKMIP_CC3
pfam16034
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP ...
339-532
9.15e-70
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP family (Janus kinase and microtubule-interacting proteins) and is predicted to be a coiled coil. It interacts with the Janus family kinases Tyk2 and Jak1.
Pssm-ID: 464986 [Multi-domain]
Cd Length: 199
Bit Score: 227.07
E-value: 9.15e-70
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 339 D R E KLIR R RK H RR SS K PI K RP -- V LDP F I GYDE D - S M DSETSS MA -- SF RTD R TPATPD D DLDE SL AAEE S ELRFRQLT K 413
Cdd:pfam16034 1 E R D KLIR A RK Q RR KE K KK K LR dm V VET F F GYDE E a S V DSETSS LS yn TD RTD S TPATPD E DLDE GV AAEE A ELRFRQLT R 80
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 414 EYQALQRAYALLQEQ T GG II DAEREAK AQ EQLQAE VL R YKAK IEDLE AT LAQKGQDS H WVE D KQ LF I K RNQEL L EKI EKQ 493
Cdd:pfam16034 81 EYQALQRAYALLQEQ S GG TL DAEREAK TR EQLQAE LR R AQSR IEDLE SA LAQKGQDS K WVE E KQ AL I R RNQEL V EKI RIM 160
170 180 190
....*....|....*....|....*....|....*....
gi 1952702176 494 E A E NH RL QQ E L QDARDQNELLEFR N LELEERERRSP PF N 532
Cdd:pfam16034 161 E Q E EG RL KN E I QDARDQNELLEFR I LELEERERRSP AI N 199
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
13-530
3.29e-08
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 57.25
E-value: 3.29e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 13 EA LIVA L QAAN E D L RTK L TDI Q I E LHQEKSKVSK LE REKTQEAK R I RELE Q R khtvlvtelkaklh E E KMK E LQ A VR E NL 92
Cdd:COG1196 266 EA ELEE L RLEL E E L ELE L EEA Q A E EYELLAELAR LE QDIARLEE R R RELE E R -------------- L E ELE E EL A EL E EE 331
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 93 IKQH E Q E MSRTVKLRDGEIQR L KS A LCA L RDGSSDKVRTALTIEAR EE ARKLFDA E R L KL L QEI A D L KTAKKQVD EA LSN 172
Cdd:COG1196 332 LEEL E E E LEELEEELEEAEEE L EE A EAE L AEAEEALLEAEAELAEA EE ELEELAE E L L EA L RAA A E L AAQLEELE EA EEA 411
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 173 MIQADKIKAGD L RSEHQSHQ E A iskikw E S E RDIRRLD L RRNQKRI AEL NATIRK L EDRNTL L GD E RNE L LKRVR E tekq 252
Cdd:COG1196 412 LLERLERLEEE L EELEEALA E L ------ E E E EEEEEEA L EEAAEEE AEL EEEEEA L LELLAE L LE E AAL L EAALA E ---- 481
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 253 ckp LLE RNKCL A K R NDE L MVSLQRM E EK L KA V tkensem REKITSHPPLKKLKSLND L DQANEEQ E TEFLKLQVIEQ QNI 332
Cdd:COG1196 482 --- LLE ELAEA A A R LLL L LEAEADY E GF L EG V ------- KAALLLAGLRGLAGAVAV L IGVEAAY E AALEAALAAAL QNI 551
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 333 ID E LTRDREKL I RRR K HRRS ------------- SKPIKRPVLDPF IG YDE D SMD S ETSSMASFRTDRTPATPDDD L DE sl 399
Cdd:COG1196 552 VV E DDEVAAAA I EYL K AAKA gratflpldkira RAALAAALARGA IG AAV D LVA S DLREADARYYVLGDTLLGRT L VA -- 629
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 400 A AE E SE LR FRQLTKEYQALQRAYALLQEQT G GIIDAE R EAKAQEQ L Q AE VLRYKAKIEDL E AT L AQKGQDSHWV E DKQLF 479
Cdd:COG1196 630 A RL E AA LR RAVTLAGRLREVTLEGEGGSAG G SLTGGS R RELLAAL L E AE AELEELAERLA E EE L ELEEALLAEE E EEREL 709
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|.
gi 1952702176 480 IKRNQ E L LE KIEKQ EA ENHR L QQ E LQDARDQNELL E FRNL E LEER E RRS PP 530
Cdd:COG1196 710 AEAEE E R LE EELEE EA LEEQ L EA E REELLEELLEE E ELLE E EALE E LPE PP 760
SMC_prok_A
TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
3-733
3.23e-06
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain]
Cd Length: 1164
Bit Score: 50.84
E-value: 3.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 3 KKG R N K G E KPE AL IVALQAANEDLRT K - LTDIQIELHQEKSKVSK LE R E KTQEAKR I R ELE Q R KHTVLVT -- EL KA K LHE 79
Cdd:TIGR02169 204 RRE R E K A E RYQ AL LKEKREYEGYELL K e KEALERQKEAIERQLAS LE E E LEKLTEE I S ELE K R LEEIEQL le EL NK K IKD 283
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 80 EKMK E LQA V R E N l I KQH E Q E MSRTVKLRDGEIQR L KS A LCA L RDGSSDKVRTALT IE AR E EARKLFDAE R L KL LQ E I A D L 159
Cdd:TIGR02169 284 LGEE E QLR V K E K - I GEL E A E IASLERSIAEKERE L ED A EER L AKLEAEIDKLLAE IE EL E REIEEERKR R D KL TE E Y A E L 362
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 160 K takk QVD E A L SNMIQADKIKAGDL R S E HQSHQ E AIS K I K W E SERDI R R LD LRRNQKR ----- I A E LNA T I RKL E DRNTL 234
Cdd:TIGR02169 363 K ---- EEL E D L RAELEEVDKEFAET R D E LKDYR E KLE K L K R E INELK R E LD RLQEELQ rlsee L A D LNA A I AGI E AKINE 438
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 235 L GD E RNELLKRVRET E KQCKP L LERNKCLAKRNDE L MVSLQ R M E EK L KAVTK E NS E M -------- REKITSHPPLKK LK S 306
Cdd:TIGR02169 439 L EE E KEDKALEIKKQ E WKLEQ L AADLSKYEQELYD L KEEYD R V E KE L SKLQR E LA E A eaqarase ERVRGGRAVEEV LK A 518
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 307 -------- LND L DQAN E EQE T EF ------- L KLQ V I E QQ ------- NIIDELTRD R EKLIRRR K H R RSSKPIKRPVL D PF 364
Cdd:TIGR02169 519 siqgvhgt VAQ L GSVG E RYA T AI evaagnr L NNV V V E DD avakeai ELLKRRKAG R ATFLPLN K M R DERRDLSILSE D GV 598
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 365 IG YDE D SMDSETSSMAS F RT drtp ATP D DDLD E SL - AA EESELRF R QL T K E YQALQRAY A L lqeq TGG IIDAEREAKAQE 443
Cdd:TIGR02169 599 IG FAV D LVEFDPKYEPA F KY ---- VFG D TLVV E DI e AA RRLMGKY R MV T L E GELFEKSG A M ---- TGG SRAPRGGILFSR 670
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 444 QLQ AE VL R YKAKI E D L EAT L A qkgqdshwvedkqlfikrnq E L LEKIEKQ E AENHR L Q QEL Q DA RDQNELL E FRNLE LE E 523
Cdd:TIGR02169 671 SEP AE LQ R LRERL E G L KRE L S -------------------- S L QSELRRI E NRLDE L S QEL S DA SRKIGEI E KEIEQ LE Q 730
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 524 R E RR spp FNLQIHPFSDGV S A L Q -- I YCM K EGV K DVN -- I PD L IKQ L ---- DI L G D NGNLRNEEQVAI IQA S ----- TVL 590
Cdd:TIGR02169 731 E E EK --- LKERLEELEEDL S S L E qe I ENV K SEL K ELE ar I EE L EED L hkle EA L N D LEARLSHSRIPE IQA E lskle EEV 807
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 591 S LA E KWIQQ IE GAEAA L HQKMME LE SDMEQFCKIKGY L E E ELDYRKQALDQAYMRIQ ELE AT L YNA lqqetvikfgells 670
Cdd:TIGR02169 808 S RI E ARLRE IE QKLNR L TLEKEY LE KEIQELQEQRID L K E QIKSIEKEIENLNGKKE ELE EE L EEL -------------- 873
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1952702176 671 E KQQEE L RTAVEK L RRQ ml R KSR E YDCQI L QERM E L L ---- QQAHQ R IRD L ED K TDIQKRQIKDL E E 733
Cdd:TIGR02169 874 E AALRD L ESRLGD L KKE -- R DEL E AQLRE L ERKI E E L eaqi EKKRK R LSE L KA K LEALEEELSEI E D 938
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
402-733
6.42e-04
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 43.51
E-value: 6.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 402 EE S E LRFRQ L T K EYQA L QRAYAL L Q E QTGGIIDAER E AKA Q - EQ L QAEVL R YK A KI E D LE ATL AQ KGQDSHWV E D - KQLF 479
Cdd:TIGR02168 687 EE L E EKIAE L E K ALAE L RKELEE L E E ELEQLRKELE E LSR Q i SA L RKDLA R LE A EV E Q LE ERI AQ LSKELTEL E A e IEEL 766
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 480 IK R NQ E LL E KIEKQ EAE NHR L QQELQDARDQNEL L EFRNL EL EERERR sppfn L QIH p FSDGVSA L QIYCMKEGVKDVNI 559
Cdd:TIGR02168 767 EE R LE E AE E ELAEA EAE IEE L EAQIEQLKEELKA L REALD EL RAELTL ----- L NEE - AANLRER L ESLERRIAATERRL 840
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 560 P DL IK Q L ----- DI LGDNGNL r N E EQVA I IQASTV L SLAEKWIQQI E G A E A A L HQKMM EL ESDMEQFCKIKGY L EE EL DY 634
Cdd:TIGR02168 841 E DL EE Q I eelse DI ESLAAEI - E E LEEL I EELESE L EALLNERASL E E A L A L L RSELE EL SEELRELESKRSE L RR EL EE 919
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 635 RKQA L D Q AYM R IQE LE ATLY N ALQQ etvikfgel LSE KQQEE L RT A VE klrrqm L RKSR E Y D CQILQE R MEL L QQAHQRI 714
Cdd:TIGR02168 920 LREK L A Q LEL R LEG LE VRID N LQER --------- LSE EYSLT L EE A EA ------ L ENKI E D D EEEARR R LKR L ENKIKEL 984
330 340 350
....*....|....*....|....*....|
gi 1952702176 715 ----------- RD L EDKT D IQKR Q IK DL E E 733
Cdd:TIGR02168 985 gpvnlaaieey EE L KERY D FLTA Q KE DL T E 1014
PTZ00121
PTZ00121
MAEBL; Provisional
39-527
2.63e-03
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 41.28
E-value: 2.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 39 Q E KS K VSKLERE K TQE AK RIR E LEQRKHTVLVT E LK A KLH E EKMK E LQ A VRENLI K QHEQEMSRTV K LRDG E IQRLKS A l 118
Cdd:PTZ00121 1318 D E AK K KAEEAKK K ADA AK KKA E EAKKAAEAAKA E AE A AAD E AEAA E EK A EAAEKK K EEAKKKADAA K KKAE E KKKADE A - 1396
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 119 cal RDGSSDKVRT A LTIEAREE A R K LF D a E RL K LLQ E IADLKT AKK QVD EA L snmi Q AD KI K A gdl RS E HQSHQ E AIS K i 198
Cdd:PTZ00121 1397 --- KKKAEEDKKK A DELKKAAA A K K KA D - E AK K KAE E KKKADE AKK KAE EA K ---- K AD EA K K --- KA E EAKKA E EAK K - 1464
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 199 K W E SERDIRRLDLRRNQKRI A E lnatirkledrntllgdern E LL K RVR E TE K QCKPL le RNKCL AK RNDE lmv SLQRM E 278
Cdd:PTZ00121 1465 K A E EAKKADEAKKKAEEAKK A D -------------------- E AK K KAE E AK K KADEA -- KKAAE AK KKAD --- EAKKA E 1519
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 279 E KL KA VTKENS E MRE K ITSHPPLKKL K SLND L DQ A N E EQET E fl KLQVI E QQNIID E ltrd REKLIR R RKHRRSSKPIK R 358
Cdd:PTZ00121 1520 E AK KA DEAKKA E EAK K ADEAKKAEEK K KADE L KK A E E LKKA E -- EKKKA E EAKKAE E ---- DKNMAL R KAEEAKKAEEA R 1593
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 359 PVLDPFIGYD E DS M DS E tssmasfrtdrtpatpdddld E SLA AEE SELRFRQ L T K EYQALQRAYA L LQEQTGGIID AE RE 438
Cdd:PTZ00121 1594 IEEVMKLYEE E KK M KA E --------------------- E AKK AEE AKIKAEE L K K AEEEKKKVEQ L KKKEAEEKKK AE EL 1652
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 439 A KA Q E Q -- LQ A EVLRY KA KIEDLE A TL A Q K GQDSHWVEDKQ L FIKRNQ - ELL E KIE K Q EAE NHRLQQ EL QD A RDQ N ELLE 515
Cdd:PTZ00121 1653 K KA E E E nk IK A AEEAK KA EEDKKK A EE A K K AEEDEKKAAEA L KKEAEE a KKA E ELK K K EAE EKKKAE EL KK A EEE N KIKA 1732
490
....*....|..
gi 1952702176 516 FRNLELE E RERR 527
Cdd:PTZ00121 1733 EEAKKEA E EDKK 1744
DUF5401
pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
34-297
4.50e-03
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain]
Cd Length: 722
Bit Score: 40.49
E-value: 4.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 34 Q I E LHQEKSKVSKL ER --- E KTQ E AK RIR EL E QRKHTVLVTELKAKLHEEK M K EL QAV ----- RE N LIKQH E Q E MS R T VK 105
Cdd:pfam17380 326 Q A E MDRQAAIYAEQ ER mam E RER E LE RIR QE E RKRELERIRQEEIAMEISR M R EL ERL qmerq QK N ERVRQ E L E AA R K VK 405
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 106 LRDG E I QR LKSA lcal RDGSSDKV R TALT i EAR EEARKLFDA ER LKLLQEIADLKTAKK Q VD E A L SNMIQAD K I K AGD L R 185
Cdd:pfam17380 406 ILEE E R QR KIQQ ---- QKVEMEQI R AEQE - EAR QREVRRLEE ER AREMERVRLEEQERQ Q QV E R L RQQEEER K R K KLE L E 480
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 186 S E HQSHQE A ISK ---- IKW E S E RDIRRLDLRRNQKRIA E lnati RKL E D R NTLLGD E RN ell K R VR E T E KQCKPLL E RN k 261
Cdd:pfam17380 481 K E KRDRKR A EEQ rrki LEK E L E ERKQAMIEEERKRKLL E ----- KEM E E R QKAIYE E ER --- R R EA E E E RRKQQEM E ER - 551
250 260 270
....*....|....*....|....*....|....*.
gi 1952702176 262 cla K R ND E L M VSLQRMEEK L K A VTK E NSE MR EKIT S 297
Cdd:pfam17380 552 --- R R IQ E Q M RKATEERSR L E A MER E REM MR QIVE S 584
hsdR
PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
388-507
5.54e-03
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain]
Cd Length: 1123
Bit Score: 40.32
E-value: 5.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 388 P ATPDDD L DES L AA E ESE L R f R QL TKEYQALQRAY AL LQE Q TGGIIDA E RE A KAQ E QL Q A E VL rykaki ED LE ATLAQKG 467
Cdd:PRK11448 136 P PEDPEN L LHA L QQ E VLT L K - Q QL ELQAREKAQSQ AL AEA Q QQELVAL E GL A AEL E EK Q Q E LE ------ AQ LE QLQEKAA 208
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1952702176 468 QD S H wv E D KQ LFIKRNQELLEKI E KQ E A E NHR L - Q Q E L QD A 507
Cdd:PRK11448 209 ET S Q -- E R KQ KRKEITDQAAKRL E LS E E E TRI L i D Q Q L RK A 247
Name
Accession
Description
Interval
E-value
JAKMIP_CC3
pfam16034
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP ...
339-532
9.15e-70
JAKMIP CC3 domain; This domain is found at the C-terminus of proteins belonging to the JAKMIP family (Janus kinase and microtubule-interacting proteins) and is predicted to be a coiled coil. It interacts with the Janus family kinases Tyk2 and Jak1.
Pssm-ID: 464986 [Multi-domain]
Cd Length: 199
Bit Score: 227.07
E-value: 9.15e-70
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 339 D R E KLIR R RK H RR SS K PI K RP -- V LDP F I GYDE D - S M DSETSS MA -- SF RTD R TPATPD D DLDE SL AAEE S ELRFRQLT K 413
Cdd:pfam16034 1 E R D KLIR A RK Q RR KE K KK K LR dm V VET F F GYDE E a S V DSETSS LS yn TD RTD S TPATPD E DLDE GV AAEE A ELRFRQLT R 80
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 414 EYQALQRAYALLQEQ T GG II DAEREAK AQ EQLQAE VL R YKAK IEDLE AT LAQKGQDS H WVE D KQ LF I K RNQEL L EKI EKQ 493
Cdd:pfam16034 81 EYQALQRAYALLQEQ S GG TL DAEREAK TR EQLQAE LR R AQSR IEDLE SA LAQKGQDS K WVE E KQ AL I R RNQEL V EKI RIM 160
170 180 190
....*....|....*....|....*....|....*....
gi 1952702176 494 E A E NH RL QQ E L QDARDQNELLEFR N LELEERERRSP PF N 532
Cdd:pfam16034 161 E Q E EG RL KN E I QDARDQNELLEFR I LELEERERRSP AI N 199
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
13-530
3.29e-08
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 57.25
E-value: 3.29e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 13 EA LIVA L QAAN E D L RTK L TDI Q I E LHQEKSKVSK LE REKTQEAK R I RELE Q R khtvlvtelkaklh E E KMK E LQ A VR E NL 92
Cdd:COG1196 266 EA ELEE L RLEL E E L ELE L EEA Q A E EYELLAELAR LE QDIARLEE R R RELE E R -------------- L E ELE E EL A EL E EE 331
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 93 IKQH E Q E MSRTVKLRDGEIQR L KS A LCA L RDGSSDKVRTALTIEAR EE ARKLFDA E R L KL L QEI A D L KTAKKQVD EA LSN 172
Cdd:COG1196 332 LEEL E E E LEELEEELEEAEEE L EE A EAE L AEAEEALLEAEAELAEA EE ELEELAE E L L EA L RAA A E L AAQLEELE EA EEA 411
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 173 MIQADKIKAGD L RSEHQSHQ E A iskikw E S E RDIRRLD L RRNQKRI AEL NATIRK L EDRNTL L GD E RNE L LKRVR E tekq 252
Cdd:COG1196 412 LLERLERLEEE L EELEEALA E L ------ E E E EEEEEEA L EEAAEEE AEL EEEEEA L LELLAE L LE E AAL L EAALA E ---- 481
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 253 ckp LLE RNKCL A K R NDE L MVSLQRM E EK L KA V tkensem REKITSHPPLKKLKSLND L DQANEEQ E TEFLKLQVIEQ QNI 332
Cdd:COG1196 482 --- LLE ELAEA A A R LLL L LEAEADY E GF L EG V ------- KAALLLAGLRGLAGAVAV L IGVEAAY E AALEAALAAAL QNI 551
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 333 ID E LTRDREKL I RRR K HRRS ------------- SKPIKRPVLDPF IG YDE D SMD S ETSSMASFRTDRTPATPDDD L DE sl 399
Cdd:COG1196 552 VV E DDEVAAAA I EYL K AAKA gratflpldkira RAALAAALARGA IG AAV D LVA S DLREADARYYVLGDTLLGRT L VA -- 629
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 400 A AE E SE LR FRQLTKEYQALQRAYALLQEQT G GIIDAE R EAKAQEQ L Q AE VLRYKAKIEDL E AT L AQKGQDSHWV E DKQLF 479
Cdd:COG1196 630 A RL E AA LR RAVTLAGRLREVTLEGEGGSAG G SLTGGS R RELLAAL L E AE AELEELAERLA E EE L ELEEALLAEE E EEREL 709
490 500 510 520 530
....*....|....*....|....*....|....*....|....*....|.
gi 1952702176 480 IKRNQ E L LE KIEKQ EA ENHR L QQ E LQDARDQNELL E FRNL E LEER E RRS PP 530
Cdd:COG1196 710 AEAEE E R LE EELEE EA LEEQ L EA E REELLEELLEE E ELLE E EALE E LPE PP 760
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
19-527
2.48e-06
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 51.09
E-value: 2.48e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 19 L Q A AN E D L r TK L T DI QI EL hqe KSKVSK LER EKTQ -------- E AKRIR E L E QRKHTVLVT E LKAKLH E EKMK EL Q A VR E 90
Cdd:COG1196 181 L E A TE E N L - ER L E DI LG EL --- ERQLEP LER QAEK aeryrelk E ELKEL E A E LLLLKLREL E AELEEL E AELE EL E A EL E 256
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 91 N L IKQHEQEMSRTVK LR DG --- EIQR L KS A LCALRDGSSDKV R TALT I EAR EE A R KLFDAERLK L LQ E I A D L KTAKKQVD 167
Cdd:COG1196 257 E L EAELAELEAELEE LR LE lee LELE L EE A QAEEYELLAELA R LEQD I ARL EE R R RELEERLEE L EE E L A E L EEELEELE 336
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 168 E A L SNMIQADKIKAGD L RS --------------- E HQSHQEAISKIKWES E RDIRRLDLRRNQKRIA EL NATIRK L ED R N 232
Cdd:COG1196 337 E E L EELEEELEEAEEE L EE aeaelaeaeeallea E AELAEAEEELEELAE E LLEALRAAAELAAQLE EL EEAEEA L LE R L 416
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 233 TL L GD E RN EL LKRVR E T E KQCKPLL E RNKCL A KRND EL MVSLQRME E K L KAVTK E NSEMREKITS - HPP L KKLKSLND L D 311
Cdd:COG1196 417 ER L EE E LE EL EEALA E L E EEEEEEE E ALEEA A EEEA EL EEEEEALL E L L AELLE E AALLEAALAE l LEE L AEAAARLL L L 496
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 312 QAN E EQETE F lk L QVIEQQNIIDE L TRDREKLIRRRKHRRSSKPIKRPV L DPFIGYDEDSM D SETSSMASFRTDR ----- 386
Cdd:COG1196 497 LEA E ADYEG F -- L EGVKAALLLAG L RGLAGAVAVLIGVEAAYEAALEAA L AAALQNIVVED D EVAAAAIEYLKAA kagra 574
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 387 T PATP D DDLDESLA A EESELRFRQLTKEYQ A LQRAY A LLQEQTG G IIDAE R EAK A QEQLQ A EVLRYKAKIEDL E A TL AQK 466
Cdd:COG1196 575 T FLPL D KIRARAAL A AALARGAIGAAVDLV A SDLRE A DARYYVL G DTLLG R TLV A ARLEA A LRRAVTLAGRLR E V TL EGE 654
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1952702176 467 G QDSHWVEDKQLFIKRNQE LLE KIEKQ E AENH RL QQ E LQDARDQNELL E FRNL EL E E R E RR 527
Cdd:COG1196 655 G GSAGGSLTGGSRRELLAA LLE AEAEL E ELAE RL AE E ELELEEALLAE E EEER EL A E A E EE 715
SMC_prok_A
TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
3-733
3.23e-06
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain]
Cd Length: 1164
Bit Score: 50.84
E-value: 3.23e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 3 KKG R N K G E KPE AL IVALQAANEDLRT K - LTDIQIELHQEKSKVSK LE R E KTQEAKR I R ELE Q R KHTVLVT -- EL KA K LHE 79
Cdd:TIGR02169 204 RRE R E K A E RYQ AL LKEKREYEGYELL K e KEALERQKEAIERQLAS LE E E LEKLTEE I S ELE K R LEEIEQL le EL NK K IKD 283
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 80 EKMK E LQA V R E N l I KQH E Q E MSRTVKLRDGEIQR L KS A LCA L RDGSSDKVRTALT IE AR E EARKLFDAE R L KL LQ E I A D L 159
Cdd:TIGR02169 284 LGEE E QLR V K E K - I GEL E A E IASLERSIAEKERE L ED A EER L AKLEAEIDKLLAE IE EL E REIEEERKR R D KL TE E Y A E L 362
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 160 K takk QVD E A L SNMIQADKIKAGDL R S E HQSHQ E AIS K I K W E SERDI R R LD LRRNQKR ----- I A E LNA T I RKL E DRNTL 234
Cdd:TIGR02169 363 K ---- EEL E D L RAELEEVDKEFAET R D E LKDYR E KLE K L K R E INELK R E LD RLQEELQ rlsee L A D LNA A I AGI E AKINE 438
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 235 L GD E RNELLKRVRET E KQCKP L LERNKCLAKRNDE L MVSLQ R M E EK L KAVTK E NS E M -------- REKITSHPPLKK LK S 306
Cdd:TIGR02169 439 L EE E KEDKALEIKKQ E WKLEQ L AADLSKYEQELYD L KEEYD R V E KE L SKLQR E LA E A eaqarase ERVRGGRAVEEV LK A 518
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 307 -------- LND L DQAN E EQE T EF ------- L KLQ V I E QQ ------- NIIDELTRD R EKLIRRR K H R RSSKPIKRPVL D PF 364
Cdd:TIGR02169 519 siqgvhgt VAQ L GSVG E RYA T AI evaagnr L NNV V V E DD avakeai ELLKRRKAG R ATFLPLN K M R DERRDLSILSE D GV 598
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 365 IG YDE D SMDSETSSMAS F RT drtp ATP D DDLD E SL - AA EESELRF R QL T K E YQALQRAY A L lqeq TGG IIDAEREAKAQE 443
Cdd:TIGR02169 599 IG FAV D LVEFDPKYEPA F KY ---- VFG D TLVV E DI e AA RRLMGKY R MV T L E GELFEKSG A M ---- TGG SRAPRGGILFSR 670
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 444 QLQ AE VL R YKAKI E D L EAT L A qkgqdshwvedkqlfikrnq E L LEKIEKQ E AENHR L Q QEL Q DA RDQNELL E FRNLE LE E 523
Cdd:TIGR02169 671 SEP AE LQ R LRERL E G L KRE L S -------------------- S L QSELRRI E NRLDE L S QEL S DA SRKIGEI E KEIEQ LE Q 730
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 524 R E RR spp FNLQIHPFSDGV S A L Q -- I YCM K EGV K DVN -- I PD L IKQ L ---- DI L G D NGNLRNEEQVAI IQA S ----- TVL 590
Cdd:TIGR02169 731 E E EK --- LKERLEELEEDL S S L E qe I ENV K SEL K ELE ar I EE L EED L hkle EA L N D LEARLSHSRIPE IQA E lskle EEV 807
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 591 S LA E KWIQQ IE GAEAA L HQKMME LE SDMEQFCKIKGY L E E ELDYRKQALDQAYMRIQ ELE AT L YNA lqqetvikfgells 670
Cdd:TIGR02169 808 S RI E ARLRE IE QKLNR L TLEKEY LE KEIQELQEQRID L K E QIKSIEKEIENLNGKKE ELE EE L EEL -------------- 873
730 740 750 760 770 780
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1952702176 671 E KQQEE L RTAVEK L RRQ ml R KSR E YDCQI L QERM E L L ---- QQAHQ R IRD L ED K TDIQKRQIKDL E E 733
Cdd:TIGR02169 874 E AALRD L ESRLGD L KKE -- R DEL E AQLRE L ERKI E E L eaqi EKKRK R LSE L KA K LEALEEELSEI E D 938
Smc
COG1196
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning]; ...
183-734
6.39e-06
Chromosome segregation ATPase Smc [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 440809 [Multi-domain]
Cd Length: 983
Bit Score: 49.94
E-value: 6.39e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 183 D L RS E HQSHQEAISKI kw E S E RDIRRLD L RRNQKRIA EL NATIRK LE DRNTLLGD E RN ELL KRVRET E KQCKP L L ER NKC 262
Cdd:COG1196 236 E L EA E LEELEAELEEL -- E A E LEELEAE L AELEAELE EL RLELEE LE LELEEAQA E EY ELL AELARL E QDIAR L E ER RRE 313
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 263 L AK R ND EL MVS L QRM EE K L KAVTK E NS E MR E K itshpp L KKLKSLNDLDQ A NEEQET E F L KLQVI E QQNIID EL TRDR E K 342
Cdd:COG1196 314 L EE R LE EL EEE L AEL EE E L EELEE E LE E LE E E ------ L EEAEEELEEAE A ELAEAE E A L LEAEA E LAEAEE EL EELA E E 387
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 343 L IRRRKHRRSSKPI krpvldpf IGYD E DSMDSETSSMASFRTDRTPATPDDDLD E SLAA EE S E LRFRQLTK E YQALQRAY 422
Cdd:COG1196 388 L LEALRAAAELAAQ -------- LEEL E EAEEALLERLERLEEELEELEEALAEL E EEEE EE E E ALEEAAEE E AELEEEEE 459
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 423 ALL QEQTGGIID A EREAK A QEQ L QA E VLRYK A KIED L EATL A QKGQDSHW V EDKQ L FIKRN ------ QE L LEKIEKQ EA E 496
Cdd:COG1196 460 ALL ELLAELLEE A ALLEA A LAE L LE E LAEAA A RLLL L LEAE A DYEGFLEG V KAAL L LAGLR glagav AV L IGVEAAY EA A 539
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 497 --------- NHRLQQELQD A RDQN E L L EFRN ------ L E L EERER R SPPFN ---- LQ I HPFS D G V SALQIYC mkegvkdv 557
Cdd:COG1196 540 leaalaaal QNIVVEDDEV A AAAI E Y L KAAK agratf L P L DKIRA R AALAA alar GA I GAAV D L V ASDLREA -------- 611
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 558 NIPDLIKQLDI LG DNGNLRNE E QVAIIQASTVLS L A E K wiqq IEGA E AALHQKMMELE S DM E Q fckikgyl EEE L DYRKQ 637
Cdd:COG1196 612 DARYYVLGDTL LG RTLVAARL E AALRRAVTLAGR L R E V ---- TLEG E GGSAGGSLTGG S RR E L -------- LAA L LEAEA 679
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 638 A L DQAYM R IQ E L E AT L YN AL QQ E TVIKFGELLS E KQQE E LRTAV E K L RR Q MLRKSR E YDCQI L Q E RME L LQQ A HQRIRDL 717
Cdd:COG1196 680 E L EELAE R LA E E E LE L EE AL LA E EEEERELAEA E EERL E EELEE E A L EE Q LEAERE E LLEEL L E E EEL L EEE A LEELPEP 759
570
....*....|....*..
gi 1952702176 718 E D KTDIQKR q IKD LE EK 734
Cdd:COG1196 760 P D LEELERE - LER LE RE 775
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
19-734
9.86e-06
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 49.28
E-value: 9.86e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 19 L QAAN E D L r TK L T DI QI EL hqe KSKVSK LER -- EK TQEA K RIR - EL EQRKHTV LV TE L KA kl HE E KMK ELQ AVRENLIKQ 95
Cdd:TIGR02168 181 L ERTR E N L - DR L E DI LN EL --- ERQLKS LER qa EK AERY K ELK a EL RELELAL LV LR L EE -- LR E ELE ELQ EELKEAEEE 254
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 96 H E q E MSRTVKLRDGEIQR L KSALCA L RDGSSDKVRT -- AL TI E - A R E E AR K LFDA ERL KL L Q ------- EIADLKTA K KQ 165
Cdd:TIGR02168 255 L E - E LTAELQELEEKLEE L RLEVSE L EEEIEELQKE ly AL AN E i S R L E QQ K QILR ERL AN L E rqleele AQLEELES K LD 333
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 166 VDEALSNMIQA dki K AGD L RS E HQ S HQEAISKIKW E S E RDIR RL DLRRN Q ------------ KR IA E LN AT I RK LE D R NT 233
Cdd:TIGR02168 334 ELAEELAELEE --- K LEE L KE E LE S LEAELEELEA E L E ELES RL EELEE Q letlrskvaqle LQ IA S LN NE I ER LE A R LE 410
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 234 L L G D E R NE L LKRVR E ----- T E KQC K P L LERNKC L AKRND EL MVS L Q R M EE K L KAVTK E NS E MREKITS ----- HPPLKK 303
Cdd:TIGR02168 411 R L E D R R ER L QQEIE E llkkl E E AEL K E L QAELEE L EEELE EL QEE L E R L EE A L EELRE E LE E AEQALDA aerel AQLQAR 490
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 304 L K SL ND L d Q A N E E QET E FL K LQVIE Q QNIIDE L TRDR E KLIRRRKHRRSSKPIKRPV L DPFIGYDEDSMDSETSSMASFR 383
Cdd:TIGR02168 491 L D SL ER L - Q E N L E GFS E GV K ALLKN Q SGLSGI L GVLS E LISVDEGYEAAIEAALGGR L QAVVVENLNAAKKAIAFLKQNE 569
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 384 TD R --------- TPATPDDDLD E S L AAE E SE L RFRQLTKEYQA lq RAYAL L QEQT GG IIDAEREAK A Q E QLQA ev LR YKA 454
Cdd:TIGR02168 570 LG R vtflpldsi KGTEIQGNDR E I L KNI E GF L GVAKDLVKFDP -- KLRKA L SYLL GG VLVVDDLDN A L E LAKK -- LR PGY 645
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 455 K I ED L EAT L AQK G ---- QD S HWVEDKQ L FIK R N - Q EL L EKIE KQ E AENHR L QQE L QDA R DQN E L LE FRNLE L EERERRS p 529
Cdd:TIGR02168 646 R I VT L DGD L VRP G gvit GG S AKTNSSI L ERR R E i E EL E EKIE EL E EKIAE L EKA L AEL R KEL E E LE EELEQ L RKELEEL - 724
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 530 pfnlqihpf S DGV SAL QIYCMKEGVKDVNIPDL I K QL DI - L GDNGNLRN E EQVAIIQ A STV L SL AE KW I QQI E GAEAA L H 608
Cdd:TIGR02168 725 --------- S RQI SAL RKDLARLEAEVEQLEER I A QL SK e L TELEAEIE E LEERLEE A EEE L AE AE AE I EEL E AQIEQ L K 795
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 609 QKMME L ESDMEQFCKIKGY L E EE LDYRKQA L DQAYM RI QEL E AT L YNALQ Q ETVIKFGELLSEKQQ EEL RTAV E K L RRQM 688
Cdd:TIGR02168 796 EELKA L REALDELRAELTL L N EE AANLRER L ESLER RI AAT E RR L EDLEE Q IEELSEDIESLAAEI EEL EELI E E L ESEL 875
730 740 750 760
....*....|....*....|....*....|....*....|....*...
gi 1952702176 689 LRKSR E YDCQI -- L QERMEL L QQAHQRI R D LE D K TDIQK R QIKD L E EK 734
Cdd:TIGR02168 876 EALLN E RASLE ea L ALLRSE L EELSEEL R E LE S K RSELR R ELEE L R EK 923
235kDa-fam
TIGR01612
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in ...
23-316
4.28e-05
reticulocyte binding/rhoptry protein; This model represents a group of paralogous families in plasmodium species alternately annotated as reticulocyte binding protein, 235-kDa family protein and rhoptry protein. Rhoptry protein is localized on the cell surface and is extremely large (although apparently lacking in repeat structure) and is important for the process of invasion of the RBCs by the parasite. These proteins are found in P. falciparum, P. vivax and P. yoelii.
Pssm-ID: 130673 [Multi-domain]
Cd Length: 2757
Bit Score: 47.35
E-value: 4.28e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 23 N EDLRT KL T D IQIELHQ E KS K VSKL E REKTQEAKRIR E LEQRKHTVLVT E L K AKL H E E KM K E L QAVR E NL i K QH E Q E M S R 102
Cdd:TIGR01612 691 N TEDKA KL D D LKSKIDK E YD K IQNM E TATVELHLSNI E NKKNELLDIIV E I K KHI H G E IN K D L NKIL E DF - K NK E K E L S N 769
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 103 TV ---- K LR D g E IQRL KS ALCALRDGSS D KVRTALTIE ar E E A RKLF D AERL ---------- KLLQE I ADL K TA K KQVDE 168
Cdd:TIGR01612 770 KI ndya K EK D - E LNKY KS KISEIKNHYN D QINIDNIKD -- E D A KQNY D KSKE yiktisiked EIFKI I NEM K FM K DDFLN 846
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 169 ALSNM I QADKIKAGDLR SEH QSHQ E AIS KIK W E - S ERDIRRLDLRR N QKR -- I A E L N AT I RK - LEDR NTL lg DERN E LL K 244
Cdd:TIGR01612 847 KVDKF I NFENNCKEKID SEH EQFA E LTN KIK A E i S DDKLNDYEKKF N DSK sl I N E I N KS I EE e YQNI NTL -- KKVD E YI K 924
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1952702176 245 RVRE T EKQCKPLLERNKC L AKRNDELMVSLQRMEEKL K AVTKE - NSEMRE KI TSHPPLK K LK SLND LDQA N E E 316
Cdd:TIGR01612 925 ICEN T KESIEKFHNKQNI L KEILNKNIDTIKESNLIE K SYKDK f DNTLID KI NELDKAF K DA SLND YEAK N N E 997
COG5022
COG5022
Myosin heavy chain [General function prediction only];
21-320
9.17e-05
Myosin heavy chain [General function prediction only];
Pssm-ID: 227355 [Multi-domain]
Cd Length: 1463
Bit Score: 46.22
E-value: 9.17e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 21 AA N ED L R TKLT D IQIELH Q EKSKVSK L E R EKT Q EA KRI RELEQRK H TVLV ----- T ELK AK L H --------- EEKM KE L - 85
Cdd:COG5022 734 AA L ED M R DAKL D NIATRI Q RAIRGRY L R R RYL Q AL KRI KKIQVIQ H GFRL rrlvd Y ELK WR L F iklqpllsl LGSR KE Y r 813
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 86 QAVRENLIK Q HEQEMSRTVKLRDGEIQR LK SALCALRD G S S D K VRTALTIEAR E EARKLFDAERLKLLQEIAD LK TAK K Q 165
Cdd:COG5022 814 SYLACIIKL Q KTIKREKKLRETEEVEFS LK AEVLIQKF G R S L K AKKRFSLLKK E TIYLQSAQRVELAERQLQE LK IDV K S 893
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 166 VDEA ------------- L SNMIQA D K I KAGDLRS E HQSHQE ais K IKWESERDIRRLDLRRNQKRIAE L NATIR KL EDRN 232
Cdd:COG5022 894 ISSL klvnleleseiie L KKSLSS D L I ENLEFKT E LIARLK --- K LLNNIDLEEGPSIEYVKLPELNK L HEVES KL KETS 970
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 233 TLLG D ERNELLKR VRE TE K QCKP L LERN K C LA KRNDELMV --- S LQRME E KLKA V TKEN S EMREKITSHPP L KK LK S L ND 309
Cdd:COG5022 971 EEYE D LLKKSTIL VRE GN K ANSE L KNFK K E LA ELSKQYGA lqe S TKQLK E LPVE V AELQ S ASKIISSESTE L SI LK P L QK 1050
330
....*....|.
gi 1952702176 310 L DQANEEQETE 320
Cdd:COG5022 1051 L KGLLLLENNQ 1061
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
10-321
1.02e-04
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 45.82
E-value: 1.02e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 10 E K PEAL I VALQAAN E D L RT K LTDIQIELHQEKSKVSK L ER E KTQEAKRIRE L EQRKH tvlvt EL KAKLHE ek MKELQ A VR 89
Cdd:TIGR02168 666 A K TNSS I LERRREI E E L EE K IEELEEKIAELEKALAE L RK E LEELEEELEQ L RKELE ----- EL SRQISA -- LRKDL A RL 738
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 90 E NLIK Q H E QEMSR tvkl RDG E IQR L KSALCA L RD gssdkvrtal TI E AR EE ARKLFD AE RLK L LQE I AD LK TAK K QVD EA 169
Cdd:TIGR02168 739 E AEVE Q L E ERIAQ ---- LSK E LTE L EAEIEE L EE ---------- RL E EA EE ELAEAE AE IEE L EAQ I EQ LK EEL K ALR EA 804
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 170 L SNM --- IQADKIK A GD LR SEHQ S HQEA I SKIKWES E RD irrldlrrn QKR I A EL NAT I RK L EDRNTL L GDERN EL LKRV 246
Cdd:TIGR02168 805 L DEL rae LTLLNEE A AN LR ERLE S LERR I AATERRL E DL --------- EEQ I E EL SED I ES L AAEIEE L EELIE EL ESEL 875
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1952702176 247 RETEKQCKP L L E RNKC L AKRND EL MVS L QRM E E K LKAVTK E NS E M REK ITSHPPLK --- KLKSL N DLDQAN EE QETEF 321
Cdd:TIGR02168 876 EALLNERAS L E E ALAL L RSELE EL SEE L REL E S K RSELRR E LE E L REK LAQLELRL egl EVRID N LQERLS EE YSLTL 953
EnvC
COG4942
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, ...
17-229
1.50e-04
Septal ring factor EnvC, activator of murein hydrolases AmiA and AmiB [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 443969 [Multi-domain]
Cd Length: 377
Bit Score: 44.75
E-value: 1.50e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 17 V A LQA A NEDLRTK L TDI Q I E LHQEKSKVSK L ER E KTQEA K RIRE LE Q R ---- KHTVLVT E LKAKLH E EKMK EL QAVREN L 92
Cdd:COG4942 16 A A QAD A AAEAEAE L EQL Q Q E IAELEKELAA L KK E EKALL K QLAA LE R R iaal ARRIRAL E QELAAL E AELA EL EKEIAE L 95
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 93 IKQH E QEMSRTVK L ----- R D G EIQR L KSA L CA lr DGSS D K VR TALTIEAREE AR KLFDA E RLKL L Q E I A D L KTAKKQVD 167
Cdd:COG4942 96 RAEL E AQKEELAE L lraly R L G RQPP L ALL L SP -- EDFL D A VR RLQYLKYLAP AR REQAE E LRAD L A E L A A L RAELEAER 173
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1952702176 168 EA L SNMIQADKIKAGD L RSEHQSH Q EAISKI kw E S E RDIRRLD L RRN Q KRIA EL N A T I RK LE 229
Cdd:COG4942 174 AE L EALLAELEEERAA L EALKAER Q KLLARL -- E K E LAELAAE L AEL Q QEAE EL E A L I AR LE 233
SMC_prok_A
TIGR02169
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of ...
13-283
4.80e-04
chromosome segregation protein SMC, primarily archaeal type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274009 [Multi-domain]
Cd Length: 1164
Bit Score: 43.90
E-value: 4.80e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 13 E ALIVA L QAANE D LRT K LTD I QI E LH Q EKSKVS KL EREKTQEAKRIRE LEQ RK htvlv TEL K AK L h E E KMKELQAVR E N L 92
Cdd:TIGR02169 701 E NRLDE L SQELS D ASR K IGE I EK E IE Q LEQEEE KL KERLEELEEDLSS LEQ EI ----- ENV K SE L - K E LEARIEELE E D L 774
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 93 I K QH E QEMSRTVK L RDGE I QRLKSA L CA L RDGS S DKVRTALT IE AREEAR ---- KLFDA E RLK L LQEIA DLK TAK K QVDE 168
Cdd:TIGR02169 775 H K LE E ALNDLEAR L SHSR I PEIQAE L SK L EEEV S RIEARLRE IE QKLNRL tlek EYLEK E IQE L QEQRI DLK EQI K SIEK 854
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 169 A lsnm I QADKI K AGD L RS E HQSHQE A I ----- SKIKWES ERD IRRLD LR RNQKR I A EL N A T I R K LED R NTL L GDERNE L L 243
Cdd:TIGR02169 855 E ---- I ENLNG K KEE L EE E LEELEA A L rdles RLGDLKK ERD ELEAQ LR ELERK I E EL E A Q I E K KRK R LSE L KAKLEA L E 930
250 260 270 280
....*....|....*....|....*....|....*....|
gi 1952702176 244 KRVR E T E KQCKPLL E RNKCLAKRN D e LMVS LQR M EE KLK A 283
Cdd:TIGR02169 931 EELS E I E DPKGEDE E IPEEELSLE D - VQAE LQR V EE EIR A 969
SMC_prok_B
TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
402-733
6.42e-04
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain]
Cd Length: 1179
Bit Score: 43.51
E-value: 6.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 402 EE S E LRFRQ L T K EYQA L QRAYAL L Q E QTGGIIDAER E AKA Q - EQ L QAEVL R YK A KI E D LE ATL AQ KGQDSHWV E D - KQLF 479
Cdd:TIGR02168 687 EE L E EKIAE L E K ALAE L RKELEE L E E ELEQLRKELE E LSR Q i SA L RKDLA R LE A EV E Q LE ERI AQ LSKELTEL E A e IEEL 766
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 480 IK R NQ E LL E KIEKQ EAE NHR L QQELQDARDQNEL L EFRNL EL EERERR sppfn L QIH p FSDGVSA L QIYCMKEGVKDVNI 559
Cdd:TIGR02168 767 EE R LE E AE E ELAEA EAE IEE L EAQIEQLKEELKA L REALD EL RAELTL ----- L NEE - AANLRER L ESLERRIAATERRL 840
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 560 P DL IK Q L ----- DI LGDNGNL r N E EQVA I IQASTV L SLAEKWIQQI E G A E A A L HQKMM EL ESDMEQFCKIKGY L EE EL DY 634
Cdd:TIGR02168 841 E DL EE Q I eelse DI ESLAAEI - E E LEEL I EELESE L EALLNERASL E E A L A L L RSELE EL SEELRELESKRSE L RR EL EE 919
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 635 RKQA L D Q AYM R IQE LE ATLY N ALQQ etvikfgel LSE KQQEE L RT A VE klrrqm L RKSR E Y D CQILQE R MEL L QQAHQRI 714
Cdd:TIGR02168 920 LREK L A Q LEL R LEG LE VRID N LQER --------- LSE EYSLT L EE A EA ------ L ENKI E D D EEEARR R LKR L ENKIKEL 984
330 340 350
....*....|....*....|....*....|
gi 1952702176 715 ----------- RD L EDKT D IQKR Q IK DL E E 733
Cdd:TIGR02168 985 gpvnlaaieey EE L KERY D FLTA Q KE DL T E 1014
PTZ00121
PTZ00121
MAEBL; Provisional
39-527
2.63e-03
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 41.28
E-value: 2.63e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 39 Q E KS K VSKLERE K TQE AK RIR E LEQRKHTVLVT E LK A KLH E EKMK E LQ A VRENLI K QHEQEMSRTV K LRDG E IQRLKS A l 118
Cdd:PTZ00121 1318 D E AK K KAEEAKK K ADA AK KKA E EAKKAAEAAKA E AE A AAD E AEAA E EK A EAAEKK K EEAKKKADAA K KKAE E KKKADE A - 1396
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 119 cal RDGSSDKVRT A LTIEAREE A R K LF D a E RL K LLQ E IADLKT AKK QVD EA L snmi Q AD KI K A gdl RS E HQSHQ E AIS K i 198
Cdd:PTZ00121 1397 --- KKKAEEDKKK A DELKKAAA A K K KA D - E AK K KAE E KKKADE AKK KAE EA K ---- K AD EA K K --- KA E EAKKA E EAK K - 1464
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 199 K W E SERDIRRLDLRRNQKRI A E lnatirkledrntllgdern E LL K RVR E TE K QCKPL le RNKCL AK RNDE lmv SLQRM E 278
Cdd:PTZ00121 1465 K A E EAKKADEAKKKAEEAKK A D -------------------- E AK K KAE E AK K KADEA -- KKAAE AK KKAD --- EAKKA E 1519
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 279 E KL KA VTKENS E MRE K ITSHPPLKKL K SLND L DQ A N E EQET E fl KLQVI E QQNIID E ltrd REKLIR R RKHRRSSKPIK R 358
Cdd:PTZ00121 1520 E AK KA DEAKKA E EAK K ADEAKKAEEK K KADE L KK A E E LKKA E -- EKKKA E EAKKAE E ---- DKNMAL R KAEEAKKAEEA R 1593
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 359 PVLDPFIGYD E DS M DS E tssmasfrtdrtpatpdddld E SLA AEE SELRFRQ L T K EYQALQRAYA L LQEQTGGIID AE RE 438
Cdd:PTZ00121 1594 IEEVMKLYEE E KK M KA E --------------------- E AKK AEE AKIKAEE L K K AEEEKKKVEQ L KKKEAEEKKK AE EL 1652
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 439 A KA Q E Q -- LQ A EVLRY KA KIEDLE A TL A Q K GQDSHWVEDKQ L FIKRNQ - ELL E KIE K Q EAE NHRLQQ EL QD A RDQ N ELLE 515
Cdd:PTZ00121 1653 K KA E E E nk IK A AEEAK KA EEDKKK A EE A K K AEEDEKKAAEA L KKEAEE a KKA E ELK K K EAE EKKKAE EL KK A EEE N KIKA 1732
490
....*....|..
gi 1952702176 516 FRNLELE E RERR 527
Cdd:PTZ00121 1733 EEAKKEA E EDKK 1744
DUF5401
pfam17380
Family of unknown function (DUF5401); This is a family of unknown function found in ...
34-297
4.50e-03
Family of unknown function (DUF5401); This is a family of unknown function found in Chromadorea.
Pssm-ID: 375164 [Multi-domain]
Cd Length: 722
Bit Score: 40.49
E-value: 4.50e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 34 Q I E LHQEKSKVSKL ER --- E KTQ E AK RIR EL E QRKHTVLVTELKAKLHEEK M K EL QAV ----- RE N LIKQH E Q E MS R T VK 105
Cdd:pfam17380 326 Q A E MDRQAAIYAEQ ER mam E RER E LE RIR QE E RKRELERIRQEEIAMEISR M R EL ERL qmerq QK N ERVRQ E L E AA R K VK 405
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 106 LRDG E I QR LKSA lcal RDGSSDKV R TALT i EAR EEARKLFDA ER LKLLQEIADLKTAKK Q VD E A L SNMIQAD K I K AGD L R 185
Cdd:pfam17380 406 ILEE E R QR KIQQ ---- QKVEMEQI R AEQE - EAR QREVRRLEE ER AREMERVRLEEQERQ Q QV E R L RQQEEER K R K KLE L E 480
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 186 S E HQSHQE A ISK ---- IKW E S E RDIRRLDLRRNQKRIA E lnati RKL E D R NTLLGD E RN ell K R VR E T E KQCKPLL E RN k 261
Cdd:pfam17380 481 K E KRDRKR A EEQ rrki LEK E L E ERKQAMIEEERKRKLL E ----- KEM E E R QKAIYE E ER --- R R EA E E E RRKQQEM E ER - 551
250 260 270
....*....|....*....|....*....|....*.
gi 1952702176 262 cla K R ND E L M VSLQRMEEK L K A VTK E NSE MR EKIT S 297
Cdd:pfam17380 552 --- R R IQ E Q M RKATEERSR L E A MER E REM MR QIVE S 584
hsdR
PRK11448
type I restriction enzyme EcoKI subunit R; Provisional
388-507
5.54e-03
type I restriction enzyme EcoKI subunit R; Provisional
Pssm-ID: 236912 [Multi-domain]
Cd Length: 1123
Bit Score: 40.32
E-value: 5.54e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 388 P ATPDDD L DES L AA E ESE L R f R QL TKEYQALQRAY AL LQE Q TGGIIDA E RE A KAQ E QL Q A E VL rykaki ED LE ATLAQKG 467
Cdd:PRK11448 136 P PEDPEN L LHA L QQ E VLT L K - Q QL ELQAREKAQSQ AL AEA Q QQELVAL E GL A AEL E EK Q Q E LE ------ AQ LE QLQEKAA 208
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 1952702176 468 QD S H wv E D KQ LFIKRNQELLEKI E KQ E A E NHR L - Q Q E L QD A 507
Cdd:PRK11448 209 ET S Q -- E R KQ KRKEITDQAAKRL E LS E E E TRI L i D Q Q L RK A 247
PTZ00121
PTZ00121
MAEBL; Provisional
39-309
8.99e-03
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain]
Cd Length: 2084
Bit Score: 39.74
E-value: 8.99e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 39 Q E KS K VSKLER - E KTQE A KRIREL E QR K HTVLVTELKAKLHE E KM K ELQAVRENLI K QH E QEMSRT V KLRDG E IQRL K SA 117
Cdd:PTZ00121 1531 E E AK K ADEAKK a E EKKK A DELKKA E EL K KAEEKKKAEEAKKA E ED K NMALRKAEEA K KA E EARIEE V MKLYE E EKKM K AE 1610
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 118 LCALRDGSSD K --- VRT A LTIEARE E AR K LFD AE RL K LLQ E IADLKTAK K QVDEALSNMIQA DK I KA GDLRSEHQSHQE A 194
Cdd:PTZ00121 1611 EAKKAEEAKI K aee LKK A EEEKKKV E QL K KKE AE EK K KAE E LKKAEEEN K IKAAEEAKKAEE DK K KA EEAKKAEEDEKK A 1690
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1952702176 195 ISKI K W E S E RDIRRLD L RRNQKRIAELNATIR K L E DR N TLLGD E RNELLKRVRETEKQC K PLL E RN K CL A KRND E LMVSL 274
Cdd:PTZ00121 1691 AEAL K K E A E EAKKAEE L KKKEAEEKKKAEELK K A E EE N KIKAE E AKKEAEEDKKKAEEA K KDE E EK K KI A HLKK E EEKKA 1770
250 260 270
....*....|....*....|....*....|....*
gi 1952702176 275 QRMEEKLK AV TK E NSEMREKITSHPPL KK L K SLN D 309
Cdd:PTZ00121 1771 EEIRKEKE AV IE E ELDEEDEKRRMEVD KK I K DIF D 1805
Blast search parameters
Data Source:
Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd Low complexity filter: no Composition Based Adjustment: yes E-value threshold: 0.01