|
Name |
Accession |
Description |
Interval |
E-value |
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
3-491 |
2.55e-117 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 363.25 E-value: 2.55e-117
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 3 ELLVQKKKQLEAESHAAQL-------QILMEFLKVARRNKREQLEQIQKELSVLEEDIKRVEEMSGLYSpvSEDSTVPQF 75
Cdd:PLN00181 266 EFINEPRENLEEREAAMELrdrieeqELLLEFLFLIQQRKQEAADKLQDTISLLSSDIDQVVKRQLVLQ--QKGSDVRSF 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 76 EAPSPSHSSIIDSTEYSQPPGFSGTSQTKKQpwyNSTLAsRRKRLTAHFEDLEQCYFSTRMSRIS--------------- 140
Cdd:PLN00181 344 LASRKRIRQGAETLAAEEENDDNSSKLDDTL---ESTLL-ESSRLMRNLKKLESVYFATRYRQIKaaaaaekplaryysa 419
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 141 ------------------------DDSRTASQLDEFQECLSKFTRYNSVRPLATLSyASDLYNGSSIVSSIEFDRDCDYF 196
Cdd:PLN00181 420 lsengrssekssmsnpakppdfyiNDSRQGGWIDPFLEGLCKYLSFSKLRVKADLK-QGDLLNSSNLVCAIGFDRDGEFF 498
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 197 AIAGVTKKIKVYEYGTVIQDAVDIHYPENEMTCNSKISCISWSSYHKNLLASSDYEGTVILWDGFTGQRSKVYQEHEKRC 276
Cdd:PLN00181 499 ATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRV 578
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 277 WSVDFNLMDPKLLASGSDDAKVKLWSTNLDNSVASIEAKANVCCVKFSPSSRYHLAFGCADHCVHYYDLRNTKQPIMVFK 356
Cdd:PLN00181 579 WSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMI 658
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 357 GHRKAVSYAKFVSGEEIVSASTDSQLKLWNVGKPYC------LRSFKGHINEKNFVGLASNGDYIACGSENNSLYLYYKG 430
Cdd:PLN00181 659 GHSKTVSYVRFVDSSTLVSSSTDNTLKLWDLSMSISginetpLHSFMGHTNVKNFVGLSVSDGYIATGSETNEVFVYHKA 738
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720355450 431 LSKTLLTFKFDTVKSVldKDRKEDDTNEFVSAVCWRalsdGESNVLIAANSQGTIKVLELV 491
Cdd:PLN00181 739 FPMPVLSYKFKTIDPV--SGLEVDDASQFISSVCWR----GQSSTLVAANSTGNIKILEMV 793
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
181-487 |
3.86e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 146.33 E-value: 3.86e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 181 SSIVSSIEFDRDCDYFAIAGVTKKIKVYEYGTVIQDAVD-IHYpenemtcnSKISCISWSSYHKNLLASSdYEGTVILWD 259
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLkGHT--------GPVRDVAASADGTYLASGS-SDKTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 260 GFTGQRSKVYQEHEKRCWSVDFNlMDPKLLASGSDDAKVKLWSTNLDNSVASIEAKAN-VCCVKFSPSSRYhLAFGCADH 338
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFS-PDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwVNSVAFSPDGTF-VASSSQDG 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 339 CVHYYDLRNTKqPIMVFKGHRKAVSYAKFV-SGEEIVSASTDSQLKLWNVGKPYCLRSFKGHINEKNFVGLASNGDYIAC 417
Cdd:cd00200 158 TIKLWDLRTGK-CVATLTGHTGEVNSVAFSpDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLAS 236
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 418 GSENNSLYLYykglsktlltfkfDTVKSVLDKDRKEddTNEFVSAVCWralsDGESNVLIAANSQGTIKV 487
Cdd:cd00200 237 GSEDGTIRVW-------------DLRTGECVQTLSG--HTNSVTSLAW----SPDGKRLASGSADGTIRI 287
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
168-487 |
6.58e-35 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 134.65 E-value: 6.58e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 168 LATLSYASDLYNGSSIVSSIEFDRDCDYFAIAGVTKKIKVYEY--GTVIQdAVDIHypenemtcNSKISCISWSSyHKNL 245
Cdd:COG2319 107 LATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLatGKLLR-TLTGH--------SGAVTSVAFSP-DGKL 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 246 LASSDYEGTVILWDGFTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTNLDNSVASIEAKAN-VCCVKFS 324
Cdd:COG2319 177 LASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTGHSGsVRSVAFS 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 325 PSSRYhLAFGCADHCVHYYDLrNTKQPIMVFKGHRKAVSYAKFV-SGEEIVSASTDSQLKLWNVGKPYCLRSFKGHINEK 403
Cdd:COG2319 256 PDGRL-LASGSADGTVRLWDL-ATGELLRTLTGHSGGVNSVAFSpDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAV 333
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 404 NFVGLASNGDYIACGSENNSLYLYYKGLSKTLLTFKfdtvksvldkdrkedDTNEFVSAVCWRAlsDGesNVLIAANSQG 483
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---------------GHTGAVTSVAFSP--DG--RTLASGSADG 394
|
....
gi 1720355450 484 TIKV 487
Cdd:COG2319 395 TVRL 398
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
262-302 |
4.47e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 40.76 E-value: 4.47e-05
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1720355450 262 TGQRSKVYQEHEKRCWSVDFNlMDPKLLASGSDDAKVKLWS 302
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFS-PDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
263-302 |
2.93e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 2.93e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720355450 263 GQRSKVYQEHEKRCWSVDFNlMDPKLLASGSDDAKVKLWS 302
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFS-PDGKLLASGSDDGTVKVWD 39
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
5-59 |
3.05e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 40.43 E-value: 3.05e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 1720355450 5 LVQKKKQLEAESHAAQLQIlmEFLKVARRNKREQLEQIQKELSVLEEDIKRVEEM 59
Cdd:TIGR02168 251 AEEELEELTAELQELEEKL--EELRLEVSELEEEIEELQKELYALANEISRLEQQ 303
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PLN00181 |
PLN00181 |
protein SPA1-RELATED; Provisional |
3-491 |
2.55e-117 |
|
protein SPA1-RELATED; Provisional
Pssm-ID: 177776 [Multi-domain] Cd Length: 793 Bit Score: 363.25 E-value: 2.55e-117
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 3 ELLVQKKKQLEAESHAAQL-------QILMEFLKVARRNKREQLEQIQKELSVLEEDIKRVEEMSGLYSpvSEDSTVPQF 75
Cdd:PLN00181 266 EFINEPRENLEEREAAMELrdrieeqELLLEFLFLIQQRKQEAADKLQDTISLLSSDIDQVVKRQLVLQ--QKGSDVRSF 343
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 76 EAPSPSHSSIIDSTEYSQPPGFSGTSQTKKQpwyNSTLAsRRKRLTAHFEDLEQCYFSTRMSRIS--------------- 140
Cdd:PLN00181 344 LASRKRIRQGAETLAAEEENDDNSSKLDDTL---ESTLL-ESSRLMRNLKKLESVYFATRYRQIKaaaaaekplaryysa 419
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 141 ------------------------DDSRTASQLDEFQECLSKFTRYNSVRPLATLSyASDLYNGSSIVSSIEFDRDCDYF 196
Cdd:PLN00181 420 lsengrssekssmsnpakppdfyiNDSRQGGWIDPFLEGLCKYLSFSKLRVKADLK-QGDLLNSSNLVCAIGFDRDGEFF 498
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 197 AIAGVTKKIKVYEYGTVIQDAVDIHYPENEMTCNSKISCISWSSYHKNLLASSDYEGTVILWDGFTGQRSKVYQEHEKRC 276
Cdd:PLN00181 499 ATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRV 578
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 277 WSVDFNLMDPKLLASGSDDAKVKLWSTNLDNSVASIEAKANVCCVKFSPSSRYHLAFGCADHCVHYYDLRNTKQPIMVFK 356
Cdd:PLN00181 579 WSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMI 658
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 357 GHRKAVSYAKFVSGEEIVSASTDSQLKLWNVGKPYC------LRSFKGHINEKNFVGLASNGDYIACGSENNSLYLYYKG 430
Cdd:PLN00181 659 GHSKTVSYVRFVDSSTLVSSSTDNTLKLWDLSMSISginetpLHSFMGHTNVKNFVGLSVSDGYIATGSETNEVFVYHKA 738
|
490 500 510 520 530 540
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1720355450 431 LSKTLLTFKFDTVKSVldKDRKEDDTNEFVSAVCWRalsdGESNVLIAANSQGTIKVLELV 491
Cdd:PLN00181 739 FPMPVLSYKFKTIDPV--SGLEVDDASQFISSVCWR----GQSSTLVAANSTGNIKILEMV 793
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
181-487 |
3.86e-40 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 146.33 E-value: 3.86e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 181 SSIVSSIEFDRDCDYFAIAGVTKKIKVYEYGTVIQDAVD-IHYpenemtcnSKISCISWSSYHKNLLASSdYEGTVILWD 259
Cdd:cd00200 9 TGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLkGHT--------GPVRDVAASADGTYLASGS-SDKTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 260 GFTGQRSKVYQEHEKRCWSVDFNlMDPKLLASGSDDAKVKLWSTNLDNSVASIEAKAN-VCCVKFSPSSRYhLAFGCADH 338
Cdd:cd00200 80 LETGECVRTLTGHTSYVSSVAFS-PDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDwVNSVAFSPDGTF-VASSSQDG 157
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 339 CVHYYDLRNTKqPIMVFKGHRKAVSYAKFV-SGEEIVSASTDSQLKLWNVGKPYCLRSFKGHINEKNFVGLASNGDYIAC 417
Cdd:cd00200 158 TIKLWDLRTGK-CVATLTGHTGEVNSVAFSpDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLAS 236
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 418 GSENNSLYLYykglsktlltfkfDTVKSVLDKDRKEddTNEFVSAVCWralsDGESNVLIAANSQGTIKV 487
Cdd:cd00200 237 GSEDGTIRVW-------------DLRTGECVQTLSG--HTNSVTSLAW----SPDGKRLASGSADGTIRI 287
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
168-487 |
6.58e-35 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 134.65 E-value: 6.58e-35
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 168 LATLSYASDLYNGSSIVSSIEFDRDCDYFAIAGVTKKIKVYEY--GTVIQdAVDIHypenemtcNSKISCISWSSyHKNL 245
Cdd:COG2319 107 LATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLatGKLLR-TLTGH--------SGAVTSVAFSP-DGKL 176
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 246 LASSDYEGTVILWDGFTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTNLDNSVASIEAKAN-VCCVKFS 324
Cdd:COG2319 177 LASGSDDGTVRLWDLATGKLLRTLTGHTGAVRSVAFSP-DGKLLASGSADGTVRLWDLATGKLLRTLTGHSGsVRSVAFS 255
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 325 PSSRYhLAFGCADHCVHYYDLrNTKQPIMVFKGHRKAVSYAKFV-SGEEIVSASTDSQLKLWNVGKPYCLRSFKGHINEK 403
Cdd:COG2319 256 PDGRL-LASGSADGTVRLWDL-ATGELLRTLTGHSGGVNSVAFSpDGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGAV 333
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 404 NFVGLASNGDYIACGSENNSLYLYYKGLSKTLLTFKfdtvksvldkdrkedDTNEFVSAVCWRAlsDGesNVLIAANSQG 483
Cdd:COG2319 334 RSVAFSPDGKTLASGSDDGTVRLWDLATGELLRTLT---------------GHTGAVTSVAFSP--DG--RTLASGSADG 394
|
....
gi 1720355450 484 TIKV 487
Cdd:COG2319 395 TVRL 398
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
168-427 |
1.72e-32 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 125.53 E-value: 1.72e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 168 LATLSYASDLYNGSSIVSSIEFDRDCDYFAIAGVTKKIKVYEYGTViqdavdihYPENEMTC-NSKISCISWSSyHKNLL 246
Cdd:cd00200 38 LETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETG--------ECVRTLTGhTSYVSSVAFSP-DGRIL 108
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 247 ASSDYEGTVILWDGFTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTNLDNSVASIEA-KANVCCVKFSP 325
Cdd:cd00200 109 SSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSP-DGTFVASSSQDGTIKLWDLRTGKCVATLTGhTGEVNSVAFSP 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 326 SSRyHLAFGCADHCVHYYDLRnTKQPIMVFKGHRKAVSYAKF-VSGEEIVSASTDSQLKLWNVGKPYCLRSFKGHINEKN 404
Cdd:cd00200 188 DGE-KLLSSSSDGTIKLWDLS-TGKCLGTLRGHENGVNSVAFsPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVT 265
|
250 260
....*....|....*....|...
gi 1720355450 405 FVGLASNGDYIACGSENNSLYLY 427
Cdd:cd00200 266 SLAWSPDGKRLASGSADGTIRIW 288
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
166-427 |
3.53e-32 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 127.33 E-value: 3.53e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 166 RPLATLSyasdlyNGSSIVSSIEFDRDCDYFAIAGVTKKIKVYEY--GTVIQdAVDIHypenemtcNSKISCISWSSyHK 243
Cdd:COG2319 153 KLLRTLT------GHSGAVTSVAFSPDGKLLASGSDDGTVRLWDLatGKLLR-TLTGH--------TGAVRSVAFSP-DG 216
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 244 NLLASSDYEGTVILWDGFTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTNLDNSVASIEAKAN-VCCVK 322
Cdd:COG2319 217 KLLASGSADGTVRLWDLATGKLLRTLTGHSGSVRSVAFSP-DGRLLASGSADGTVRLWDLATGELLRTLTGHSGgVNSVA 295
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 323 FSPSSRYhLAFGCADHCVHYYDLrNTKQPIMVFKGHRKAVSYAKFVS-GEEIVSASTDSQLKLWNVGKPYCLRSFKGHIN 401
Cdd:COG2319 296 FSPDGKL-LASGSDDGTVRLWDL-ATGKLLRTLTGHTGAVRSVAFSPdGKTLASGSDDGTVRLWDLATGELLRTLTGHTG 373
|
250 260
....*....|....*....|....*.
gi 1720355450 402 EKNFVGLASNGDYIACGSENNSLYLY 427
Cdd:COG2319 374 AVTSVAFSPDGRTLASGSADGTVRLW 399
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
245-490 |
1.52e-28 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 116.93 E-value: 1.52e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 245 LLASSDYEGTVILWDGFTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTNLDNSVASIEAKAN-VCCVKF 323
Cdd:COG2319 92 LLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSP-DGKTLASGSADGTVRLWDLATGKLLRTLTGHSGaVTSVAF 170
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 324 SPSSRYhLAFGCADHCVHYYDLRnTKQPIMVFKGHRKAVSYAKFvS--GEEIVSASTDSQLKLWNVGKPYCLRSFKGHIN 401
Cdd:COG2319 171 SPDGKL-LASGSDDGTVRLWDLA-TGKLLRTLTGHTGAVRSVAF-SpdGKLLASGSADGTVRLWDLATGKLLRTLTGHSG 247
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 402 EKNFVGLASNGDYIACGSENNSLYLY--YKGLSKTLLTFKFDTVKSVldkdrkeddtnefvsavcwrALS-DGesNVLIA 478
Cdd:COG2319 248 SVRSVAFSPDGRLLASGSADGTVRLWdlATGELLRTLTGHSGGVNSV--------------------AFSpDG--KLLAS 305
|
250
....*....|..
gi 1720355450 479 ANSQGTIKVLEL 490
Cdd:COG2319 306 GSDDGTVRLWDL 317
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
181-387 |
5.23e-26 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 109.62 E-value: 5.23e-26
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 181 SSIVSSIEFDRDCDYFAIAGVTKKIKVYEYGT-VIQDAVDIHypenemtcNSKISCISWSSYHKnLLASSDYEGTVILWD 259
Cdd:COG2319 204 TGAVRSVAFSPDGKLLASGSADGTVRLWDLATgKLLRTLTGH--------SGSVRSVAFSPDGR-LLASGSADGTVRLWD 274
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 260 GFTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTNLDNSVASIEAKAN-VCCVKFSPSSRYhLAFGCADH 338
Cdd:COG2319 275 LATGELLRTLTGHSGGVNSVAFSP-DGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGaVRSVAFSPDGKT-LASGSDDG 352
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|
gi 1720355450 339 CVHYYDLrNTKQPIMVFKGHRKAVSYAKFVS-GEEIVSASTDSQLKLWNV 387
Cdd:COG2319 353 TVRLWDL-ATGELLRTLTGHTGAVTSVAFSPdGRTLASGSADGTVRLWDL 401
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
266-490 |
1.41e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 103.18 E-value: 1.41e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 266 SKVYQEHEKRCWSVDFNlMDPKLLASGSDDAKVKLWSTNLDNSVASIE-AKANVCCVKFSPSSRYhLAFGCADHCVHYYD 344
Cdd:cd00200 2 RRTLKGHTGGVTCVAFS-PDGKLLATGSGDGTIKVWDLETGELLRTLKgHTGPVRDVAASADGTY-LASGSSDKTIRLWD 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 345 LrNTKQPIMVFKGHRKAVSYAKFVSGEEIV-SASTDSQLKLWNVGKPYCLRSFKGHINEKNFVGLASNGDYIACGSENNS 423
Cdd:cd00200 80 L-ETGECVRTLTGHTSYVSSVAFSPDGRILsSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGT 158
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720355450 424 LYLYYKGLSKTLLTFKFDTvksvldkdrkeddtnEFVSAVCWralsDGESNVLIAANSQGTIKVLEL 490
Cdd:cd00200 159 IKLWDLRTGKCVATLTGHT---------------GEVNSVAF----SPDGEKLLSSSSDGTIKLWDL 206
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
184-386 |
6.76e-24 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 101.26 E-value: 6.76e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 184 VSSIEFDRDCDYFAIAGVTKKIKVYeygtviqdavDIHYPENEMTCNSK---ISCISWSSYHKnLLASSDYEGTVILWDG 260
Cdd:cd00200 96 VSSVAFSPDGRILSSSSRDKTIKVW----------DVETGKCLTTLRGHtdwVNSVAFSPDGT-FVASSSQDGTIKLWDL 164
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 261 FTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTNLDNSVASIEAKAN-VCCVKFSPSSRYhLAFGCADHC 339
Cdd:cd00200 165 RTGKCVATLTGHTGEVNSVAFSP-DGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENgVNSVAFSPDGYL-LASGSEDGT 242
|
170 180 190 200
....*....|....*....|....*....|....*....|....*...
gi 1720355450 340 VHYYDLRnTKQPIMVFKGHRKAVSYAKFV-SGEEIVSASTDSQLKLWN 386
Cdd:cd00200 243 IRVWDLR-TGECVQTLSGHTNSVTSLAWSpDGKRLASGSADGTIRIWD 289
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
245-487 |
4.19e-23 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 101.14 E-value: 4.19e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 245 LLASSDYEGTVILWDGFTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTNLDNSVASIEA-KANVCCVKF 323
Cdd:COG2319 50 RLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSP-DGRLLASASADGTVRLWDLATGLLLRTLTGhTGAVRSVAF 128
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 324 SPSSRYhLAFGCADHCVHYYDLRnTKQPIMVFKGHRKAVSYAKFVS-GEEIVSASTDSQLKLWNVGKPYCLRSFKGHINE 402
Cdd:COG2319 129 SPDGKT-LASGSADGTVRLWDLA-TGKLLRTLTGHSGAVTSVAFSPdGKLLASGSDDGTVRLWDLATGKLLRTLTGHTGA 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 403 KNFVGLASNGDYIACGSENNSLYLYYKGLSKTLLTFKFDTvksvldkdrkeddtnEFVSAVCWraLSDGEsnVLIAANSQ 482
Cdd:COG2319 207 VRSVAFSPDGKLLASGSADGTVRLWDLATGKLLRTLTGHS---------------GSVRSVAF--SPDGR--LLASGSAD 267
|
....*
gi 1720355450 483 GTIKV 487
Cdd:COG2319 268 GTVRL 272
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
181-304 |
1.88e-12 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 68.78 E-value: 1.88e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 181 SSIVSSIEFDRDCDYFAIAGVTKKIKVYEYGTviQDAVDIHYPENEMtcnskISCISWSSYHKnLLASSDYEGTVILWDG 260
Cdd:COG2319 288 SGGVNSVAFSPDGKLLASGSDDGTVRLWDLAT--GKLLRTLTGHTGA-----VRSVAFSPDGK-TLASGSDDGTVRLWDL 359
|
90 100 110 120
....*....|....*....|....*....|....*....|....
gi 1720355450 261 FTGQRSKVYQEHEKRCWSVDFNLmDPKLLASGSDDAKVKLWSTN 304
Cdd:COG2319 360 ATGELLRTLTGHTGAVTSVAFSP-DGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
285-487 |
8.29e-07 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 51.07 E-value: 8.29e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 285 DPKLLASGSDDAKVKLWSTNLDNSVASIEAKANVCCVKFSPSSRYHLAFGCADHCVHYYDLRNTKQPIMVFKGHRKAVSY 364
Cdd:COG2319 5 DGAALAAASADLALALLAAALGALLLLLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSV 84
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720355450 365 AKFVSGEEIVSASTDSQLKLWNVGKPYCLRSFKGHINEKNFVGLASNGDYIACGSENNSLYLYYKGLSKTLLTFKFDTvk 444
Cdd:COG2319 85 AFSPDGRLLASASADGTVRLWDLATGLLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHS-- 162
|
170 180 190 200
....*....|....*....|....*....|....*....|...
gi 1720355450 445 svldkdrkeddtnEFVSAVCWRAlsDGEsnVLIAANSQGTIKV 487
Cdd:COG2319 163 -------------GAVTSVAFSP--DGK--LLASGSDDGTVRL 188
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
262-302 |
4.47e-05 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 40.76 E-value: 4.47e-05
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 1720355450 262 TGQRSKVYQEHEKRCWSVDFNlMDPKLLASGSDDAKVKLWS 302
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFS-PDGKYLASGSDDGTIKLWD 40
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
263-302 |
2.93e-04 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 38.10 E-value: 2.93e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720355450 263 GQRSKVYQEHEKRCWSVDFNlMDPKLLASGSDDAKVKLWS 302
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFS-PDGKLLASGSDDGTVKVWD 39
|
|
| WD40 |
smart00320 |
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats ... |
348-386 |
3.74e-04 |
|
WD40 repeats; Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Pssm-ID: 197651 [Multi-domain] Cd Length: 40 Bit Score: 38.06 E-value: 3.74e-04
10 20 30 40
....*....|....*....|....*....|....*....|
gi 1720355450 348 TKQPIMVFKGHRKAVSYAKFV-SGEEIVSASTDSQLKLWN 386
Cdd:smart00320 1 SGELLKTLKGHTGPVTSVAFSpDGKYLASGSDDGTIKLWD 40
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
5-59 |
3.05e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 40.43 E-value: 3.05e-03
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*
gi 1720355450 5 LVQKKKQLEAESHAAQLQIlmEFLKVARRNKREQLEQIQKELSVLEEDIKRVEEM 59
Cdd:TIGR02168 251 AEEELEELTAELQELEEKL--EELRLEVSELEEEIEELQKELYALANEISRLEQQ 303
|
|
| WD40 |
pfam00400 |
WD domain, G-beta repeat; |
349-386 |
9.88e-03 |
|
WD domain, G-beta repeat;
Pssm-ID: 459801 [Multi-domain] Cd Length: 39 Bit Score: 33.86 E-value: 9.88e-03
10 20 30
....*....|....*....|....*....|....*....
gi 1720355450 349 KQPIMVFKGHRKAVSYAKF-VSGEEIVSASTDSQLKLWN 386
Cdd:pfam00400 1 GKLLKTLEGHTGSVTSLAFsPDGKLLASGSDDGTVKVWD 39
|
|
|