Conserved domains on  [gi|24041035|ref|NP_077719.2|]

neurogenic locus notch homolog protein 2 isoform 1 preproprotein


List of domain hits

Name Accession Description Interval E-value
ANK cd00204
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse ...
1938-2062 4.21e-31

ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin; repeats are stacked in a superhelical arrangement. This alignment contains 4 consecutive repeats.



Pssm-ID: 238125 [Multi-domain]  Cd Length: 126  Bit Score: 121.72  E-value: 4.21e-31
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1938 DARMNDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLA 2017
Cdd:cd00204    1 NARDEDGRTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAAKNGHLEIVKLLLEKGADVNARDKDGNTPLHLA 80
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 24041035 2018 AREGSYEAAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRLL 2062
Cdd:cd00204   81 ARNGNLDVVKLLLKHGADVNARDKDGRTPLHLAAKNGHLEVVKLL 125
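Each hit in this report carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value). As a minimal sketch, assuming the line always follows the layout seen in this report, it can be parsed into typed fields with a regular expression. The function name `parse_summary` and the dictionary keys are hypothetical and are not part of any NCBI tool or schema.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears in this report.
# The "[Multi-domain]" tag is optional: some hits carry it, others do not.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?P<multi>\s*\[Multi-domain\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict; the key names are our own choice."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not a recognised summary line: %r" % line)
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["multi"] is not None,
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "evalue": float(match["evalue"]),
    }


if __name__ == "__main__":
    line = ("Pssm-ID: 238125 [Multi-domain]  Cd Length: 126  "
            "Bit Score: 121.72  E-value: 4.21e-31")
    print(parse_summary(line))
```

Converting the E-value to a float makes it straightforward to sort or filter hits by significance, for example to keep only hits below a chosen threshold.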
ANK cd00204
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse ...
1872-1997 3.34e-30

ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin; repeats are stacked in a superhelical arrangement. This alignment contains 4 consecutive repeats.



Pssm-ID: 238125 [Multi-domain]  Cd Length: 126  Bit Score: 119.03  E-value: 3.34e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1872 QTDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIRNRVtDLDARMNDGTTPLILA 1951
Cdd:cd00204    2 ARDEDGRTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAAKNGHLEIVKLLLEKGA-DVNARDKDGNTPLHLA 80
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*.
gi 24041035 1952 ARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLL 1997
Cdd:cd00204   81 ARNGNLDVVKLLLKHGADVNARDKDGRTPLHLAAKNGHLEVVKLLL 126
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
182-218 4.02e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 55.34  E-value: 4.02e-09
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  182 DVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQYC 218
Cdd:cd00054    1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
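The EGF_CA description mentions six conserved core cysteines. The sketch below checks that statement against the alignment above by listing the query residue numbers at which both the query and the cd00054 row carry a cysteine. It assumes that `-` marks a gap and that letter case can be ignored; the function name `shared_cysteines` is hypothetical.

```python
def shared_cysteines(query, subject, query_start):
    """Return query residue numbers where both aligned rows have a cysteine.

    Assumptions: both rows have equal length, '-' marks a gap, and lowercase
    letters are compared case-insensitively.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    positions = []
    residue = query_start
    for q, s in zip(query, subject):
        if q == "-":
            continue  # a gap in the query does not consume a query residue
        if q.upper() == "C" and s.upper() == "C":
            positions.append(residue)
        residue += 1
    return positions


if __name__ == "__main__":
    # Rows copied from the hit at interval 182-218 above.
    query = "DVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQYC"
    subject = "DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC"
    print(shared_cysteines(query, subject, 182))
```

For this hit the function reports six positions (186, 192, 198, 207, 209 and 218), which agrees with the six core cysteines named in the domain description.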
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1026-1061 5.75e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 51.87  E-value: 5.75e-08
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035 1026 INECSS-HPCLNEGTCVDGLGTYRCSCPLGYTGKNCQ 1061
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
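The report gives a bit score and an E-value for each hit but no percent identity. A rough figure can be derived from the two aligned rows. This is only a sketch: it assumes that `-` marks a gap, that a column with a gap in either row is skipped, and that case is ignored. The function name `alignment_identity` is hypothetical.

```python
def alignment_identity(query, subject):
    """Return (identical, aligned) column counts for two aligned rows.

    Columns with a gap ('-') in either row are not counted as aligned.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            continue  # skip gap columns
        aligned += 1
        if q.upper() == s.upper():
            identical += 1
    return identical, aligned


if __name__ == "__main__":
    # Rows copied from the hit at interval 1026-1061 above.
    query = "INECSS-HPCLNEGTCVDGLGTYRCSCPLGYTGKNCQ"
    subject = "IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE"
    identical, aligned = alignment_identity(query, subject)
    print("%d/%d identical (%.1f%%)" % (identical, aligned, 100.0 * identical / aligned))
```

For this hit the counts are 24 identical residues over 36 aligned columns. Identity against a domain model row is a descriptive figure only; the E-value remains the significance measure reported by the search.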
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
873-909 5.81e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 51.87  E-value: 5.81e-08
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  873 DIDECISK-PCMNHGLCHNTQGSYMCECPPGFSGMDCE 909
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
757-793 2.01e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.33  E-value: 2.01e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  757 DKNECLS-NPCQNGGTCDNLVNGYRCTCKKGFKGYNCQ 793
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1151-1185 2.11e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.33  E-value: 2.11e-07
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 24041035 1151 DECAS-NPCQHGATCSDFIGGYRCECVPGYQGVNCE 1185
Cdd:cd00054    3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
456-492 2.73e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.94  E-value: 2.73e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  456 DINECHS-DPCQNDATCLDKIGGFTCLCMPGFKGVHCE 492
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
495-530 2.91e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.94  E-value: 2.91e-07
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  495 INECQS-NPCVNNGQCVDKVNRFQCLCPPGFTGPVCQ 530
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1225-1262 6.04e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.17  E-value: 6.04e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035 1225 NIDDCARGPHCLNGGQCMDRIGGYSCRCLPGFAGERCE 1262
Cdd:cd00054    1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
949-985 9.52e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 9.52e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  949 DMNECLSE-PCKNGGTCSDYVNSYTCKCQAGFDGVHCE 985
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
911-947 9.98e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 9.98e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  911 DIDDCL-ANPCQNGGSCMDGVNTFSCLCLPGFTGDKCQ 947
Cdd:cd00054    1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
415-454 1.51e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.02  E-value: 1.51e-06
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|
gi 24041035  415 DVDECAManSNPCEHAGKCVNTDGAFHCECLKGYAGPRCE 454
Cdd:cd00054    1 DIDECAS--GNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
682-717 1.56e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 1.56e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  682 DIDECAS-NPCRKGATCINGVNGFRCICPEGPHHPSC 717
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
608-643 2.11e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.11e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  608 IDECYS-SPCLNDGRCIDLVNGYQCNCQPGTSGVNCE 643
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
532-568 2.21e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.21e-06
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  532 DIDDCSST-PCLNGAKCIDHPNGYECQCATGFTGVLCE 568
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1188-1223 2.32e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.32e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035 1188 VDECQ-NQPCQNGGTCIDLVNHFKCSCPPGTRGLLCE 1223
Cdd:cd00054    2 IDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
987-1022 4.44e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.48  E-value: 4.44e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  987 NINECTESS-CFNGGTCVDGINSFSCLCPVGFTGSFC 1022
Cdd:cd00054    1 DIDECASGNpCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
795-831 8.36e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 45.71  E-value: 8.36e-06
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  795 NIDECAS-NPCLNQGTCFDDISGYTCHCVLPYTGKNCQ 831
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
260-296 5.87e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 5.87e-05
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  260 NIDDC-PNHRCQNGGVCVDGVNTYNCRCPPQWTGQFCT 296
Cdd:cd00054    1 DIDECaSGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
298-335 5.92e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 5.92e-05
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  298 DVDECLlQPNACQNGGTCANRNGGYGCVCVNGWSGDDC 335
Cdd:cd00054    1 DIDECA-SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
645-679 1.41e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 39.16  E-value: 1.41e-03
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  645 NFDDCAS-NPC-IHGICMDGINRYSCVCSPGFTGQRC 679
Cdd:cd00054    1 DIDECASgNPCqNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
570-604 2.23e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 38.39  E-value: 2.23e-03
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  570 NIDNCD-PDPC-HHGQCQDGIDSYTCICNPGYMGAIC 604
Cdd:cd00054    1 DIDECAsGNPCqNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1312-1343 2.86e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 38.00  E-value: 2.86e-03
                         10        20        30
                 ....*....|....*....|....*....|..
gi 24041035 1312 PCLNGGTCavaSNMPDGFICRCPPGFSGARCQ 1343
Cdd:cd00054   10 PCQNGGTC---VNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1117-1147 5.19e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 5.19e-03
                         10        20        30
                 ....*....|....*....|....*....|.
gi 24041035 1117 EHLCQHSGVCINAGNTHYCQCPLGYTGSYCE 1147
Cdd:cd00054    8 GNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1264-1302 6.30e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 6.30e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|
gi 24041035 1264 DINECLS-NPCSSEGSldCIQLTNDYLCVCRSAFTGRHCE 1302
Cdd:cd00054    1 DIDECASgNPCQNGGT--CVNTVGSYRCSCPPGYTGRNCE 38
NOD pfam06816
NOTCH protein; NOTCH signalling plays a fundamental role during a great number of ...
1539-1595 9.44e-26

NOTCH protein; NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. The role of the NOD domain remains to be elucidated.



Pssm-ID: 191614  Cd Length: 57  Bit Score: 103.85  E-value: 9.44e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 24041035   1539 PENLAEGTLVIVVLMPPEQLLQDARSFLRALGTLLHTNLRIKRDSQGELMVYPYYGE 1595
Cdd:pfam06816    1 PPKLAEGTLVIVVLIPPEELRNNSVQFLRELSHLLRTNVRFKKDANGQPMIFPWYGE 57
DUF3454 pfam11936
Domain of unknown function (DUF3454); This presumed domain is functionally uncharacterized. ...
2380-2445 1.22e-24

Domain of unknown function (DUF3454); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 60 amino acids in length. This domain is found associated with pfam00066, pfam00008, pfam06816, pfam07684, pfam00023.



Pssm-ID: 256741  Cd Length: 64  Bit Score: 101.07  E-value: 1.22e-24
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 24041035   2380 VGKYPTPPSQHSYaSSNAAERTPSHSGHLQgEHPYLTPSPESPDQWSSSSPHSASDWSDVTTSPTP 2445
Cdd:pfam11936    1 VEQYPTPPSQHSS-SSSSGDNTPQHQLQVP-DHPYLTPSPESPDQWSSSSPHSNSDWSEGISSPPT 64
NODP pfam07684
NOTCH protein; NOTCH signalling plays a fundamental role during a great number of ...
1617-1673 5.21e-14

NOTCH protein; NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. The role of the NOD and NODP domains remains to be elucidated.



Pssm-ID: 254356  Cd Length: 62  Bit Score: 70.01  E-value: 5.21e-14
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 24041035   1617 EVAGSKVFLEIDNRQCVQDSDHCFKNTDAAAALLASHAIQGTL--SYPLVSVVSESLTP 1673
Cdd:pfam07684    1 EVTGSVVYLEIDNRKCSQQSGECFWSAQSAAAFLAALAAKGGLdtPYPISSVRSEPDEP 59
Notch pfam00066
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1499-1534 3.87e-09

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.



Pssm-ID: 249555  Cd Length: 38  Bit Score: 55.38  E-value: 3.87e-09
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 24041035   1499 NSKTCKYDKYCADHFKDNHCDQGCNSEECGWDGLDC 1534
Cdd:pfam00066    2 PWKNCPKAQYCEKKFGDGVCDPECNNAECLFDGGDC 37
Notch pfam00066
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1420-1456 1.06e-08

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.



Pssm-ID: 249555  Cd Length: 38  Bit Score: 54.22  E-value: 1.06e-08
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 24041035   1420 TPPATC-LSQYCADKARDGVCDEACNSHACQWDGGDCS 1456
Cdd:pfam00066    1 SPWKNCpKAQYCEKKFGDGVCDPECNNAECLFDGGDCS 38
Chorion_3 super family cl05116
Chorion family 3; This family consists of several Drosophila chorion proteins S36 and S38. The ...
2292-2421 1.73e-03

Chorion family 3; This family consists of several Drosophila chorion proteins S36 and S38. The chorion genes of Drosophila are amplified in response to developmental signals in the follicle cells of the ovary.


The actual alignment was detected with superfamily member pfam05387:

Pssm-ID: 253174  Cd Length: 277  Bit Score: 41.63  E-value: 1.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2292 KHITTPREPLPPIVtfqlipkgsIAQPAGAPQPQSTCPPAVAGPLPTMYQIPemarlPSVAF--------PTAMmpqqdg 2363
Cdd:pfam05387  144 NHQVIATQPLPPII---------VKQPGAPPKVLVNGPPLVVKPAPVIYKIK-----PSVIYqqevinkvPTPL------ 203
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035   2364 QVAQTILPAYHPfPASVGKYPTPPSQHSYASsnaaertPSHSGHlQGEHPYLTPSPES 2421
Cdd:pfam05387  204 SLNPVYVKVYKP-GKKIEAPLVPEVQQVYSQ-------PSYGGS-EYSQPREQASPSS 252
Notch super family cl02419
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1459-1496 2.53e-03

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.


The actual alignment was detected with superfamily member smart00004:

Pssm-ID: 261277  Cd Length: 38  Bit Score: 38.46  E-value: 2.53e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1459 MENPWANCSSPlPCWDYINN-QCDELCNTVECLFDNFEC 1496
Cdd:smart00004    1 PQDPWSRCEDA-QCWDKFGDgVCDEECNNAECLWDGGDC 38
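Each hit in this list carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a rough sketch for pulling those fields out of the plain-text listing, the pattern and the `parse_summary` helper below are illustrative assumptions based only on the lines shown here, not an NCBI-provided format specification.

```python
import re

# Pattern inferred from the summary lines in this listing; the optional
# bracketed tag covers entries such as "[Multi-domain]".
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line: str) -> dict:
    """Return the numeric fields of one hit summary line as a dict."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "tag": m["tag"],  # None when no bracketed tag is present
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


hit = parse_summary(
    "Pssm-ID: 261277  Cd Length: 38  Bit Score: 38.46  E-value: 2.53e-03"
)
print(hit)
```

Parsed this way, hits can be sorted or filtered by E-value without re-reading the alignments.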
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1948-2040 8.26e-19

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 85.00  E-value: 8.26e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1948 LILAARLAVEGMVAELINCQADVNAVDDHgkSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAK 2027
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLEKGADVNLGDTD--TALHLAARNGNLEIVKLLLENGADVNAKDKDGNTALHLAARNGNLEIVK 78
                           90
                   ....*....|...
gi 24041035   2028 ILLDHFANRDITD 2040
Cdd:pfam12796   79 LLLEHGADINLKD 91
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1832-1940 7.19e-11

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 61.50  E-value: 7.19e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1832 LMLASLRGgssdlsdededaedsSANIITDLVYQGASLQAQTDRTgemALHLAARYSRADAAKRLLDAGADANAQDNMGR 1911
Cdd:pfam12796    1 LHLAAKNG---------------NLELVKLLLEKGADVNLGDTDT---ALHLAARNGNLEIVKLLLENGADVNAKDKDGN 62
                           90       100
                   ....*....|....*....|....*....
gi 24041035   1912 CPLHAAVAADAQGVFQILIRNRVtDLDAR 1940
Cdd:pfam12796   63 TALHLAARNGNLEIVKLLLEHGA-DINLK 90
DUF1421 pfam07223
Protein of unknown function (DUF1421); This family represents a conserved region approximately ...
2130-2422 2.36e-10

Protein of unknown function (DUF1421); This family represents a conserved region approximately 350 residues long within a number of plant proteins of unknown function.


Pssm-ID: 254110 [Multi-domain]  Cd Length: 357  Bit Score: 63.42  E-value: 2.36e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2130 RRKKSLSE-KVQLSESSVTLSPVDSLE--SPHTYVSDTTSSPMITSPGILQASPNPMLATAAPPAPvhAQHALSFSNLHE 2206
Cdd:pfam07223   12 RDKQEIAEtQKELSKLQLSHEEAQSSEahSFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAP--APQSPQPDQQQQ 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2207 MQ-PLAHGASTVLPsVSQLLSHHHIVSPgsgsagslsrlHPVPVPAdwmnrmevnetqynemfgmvlAPAEGTHPG---- 2281
Cdd:pfam07223   90 SQaPPSHQYPSQLP-PQQVQSVPQQPTP-----------QQEPYYP---------------------PPSQPQPPPaqqp 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2282 IAPQSRPPegkhittPREPLPPivTFQLIPKGSIAQPAGAPQPQSTCPPAVAGPLPTMYQIPEMArlPSVAFPTAMMPQq 2361
Cdd:pfam07223  137 QAQQPQPP-------PQVPQQQ--QYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYP--PNEPLPSSMAMQ- 204
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035   2362 dgqvaqtilPAYHPFPASVGKY-PTPPSQHSYasSNAAERTPSH--SGHL----QGEHPYLTPSPESP 2422
Cdd:pfam07223  205 ---------PPYSGAPPSQQFYgPPQPSPYMY--GGPGGRPNSGfpSGQQpppsQGQEGYGYSGPPPS 261
 
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
182-218 4.02e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 55.34  E-value: 4.02e-09
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  182 DVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQYC 218
Cdd:cd00054    1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1026-1061 5.75e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 51.87  E-value: 5.75e-08
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035 1026 INECSS-HPCLNEGTCVDGLGTYRCSCPLGYTGKNCQ 1061
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
873-909 5.81e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 51.87  E-value: 5.81e-08
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  873 DIDECISK-PCMNHGLCHNTQGSYMCECPPGFSGMDCE 909
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
757-793 2.01e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.33  E-value: 2.01e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  757 DKNECLS-NPCQNGGTCDNLVNGYRCTCKKGFKGYNCQ 793
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1151-1185 2.11e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.33  E-value: 2.11e-07
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 24041035 1151 DECAS-NPCQHGATCSDFIGGYRCECVPGYQGVNCE 1185
Cdd:cd00054    3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
456-492 2.73e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.94  E-value: 2.73e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  456 DINECHS-DPCQNDATCLDKIGGFTCLCMPGFKGVHCE 492
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
495-530 2.91e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.94  E-value: 2.91e-07
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  495 INECQS-NPCVNNGQCVDKVNRFQCLCPPGFTGPVCQ 530
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1225-1262 6.04e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.17  E-value: 6.04e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035 1225 NIDDCARGPHCLNGGQCMDRIGGYSCRCLPGFAGERCE 1262
Cdd:cd00054    1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
949-985 9.52e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 9.52e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  949 DMNECLSE-PCKNGGTCSDYVNSYTCKCQAGFDGVHCE 985
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
911-947 9.98e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 9.98e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  911 DIDDCL-ANPCQNGGSCMDGVNTFSCLCLPGFTGDKCQ 947
Cdd:cd00054    1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
415-454 1.51e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.02  E-value: 1.51e-06
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|
gi 24041035  415 DVDECAManSNPCEHAGKCVNTDGAFHCECLKGYAGPRCE 454
Cdd:cd00054    1 DIDECAS--GNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
682-717 1.56e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 1.56e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  682 DIDECAS-NPCRKGATCINGVNGFRCICPEGPHHPSC 717
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
608-643 2.11e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.11e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  608 IDECYS-SPCLNDGRCIDLVNGYQCNCQPGTSGVNCE 643
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
532-568 2.21e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.21e-06
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  532 DIDDCSST-PCLNGAKCIDHPNGYECQCATGFTGVLCE 568
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1188-1223 2.32e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.32e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035 1188 VDECQ-NQPCQNGGTCIDLVNHFKCSCPPGTRGLLCE 1223
Cdd:cd00054    2 IDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
987-1022 4.44e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.48  E-value: 4.44e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  987 NINECTESS-CFNGGTCVDGINSFSCLCPVGFTGSFC 1022
Cdd:cd00054    1 DIDECASGNpCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
795-831 8.36e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 45.71  E-value: 8.36e-06
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  795 NIDECAS-NPCLNQGTCFDDISGYTCHCVLPYTGKNCQ 831
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
260-296 5.87e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 5.87e-05
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  260 NIDDC-PNHRCQNGGVCVDGVNTYNCRCPPQWTGQFCT 296
Cdd:cd00054    1 DIDECaSGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
298-335 5.92e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 5.92e-05
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  298 DVDECLlQPNACQNGGTCANRNGGYGCVCVNGWSGDDC 335
Cdd:cd00054    1 DIDECA-SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
645-679 1.41e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 39.16  E-value: 1.41e-03
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  645 NFDDCAS-NPC-IHGICMDGINRYSCVCSPGFTGQRC 679
Cdd:cd00054    1 DIDECASgNPCqNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
570-604 2.23e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 38.39  E-value: 2.23e-03
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  570 NIDNCD-PDPC-HHGQCQDGIDSYTCICNPGYMGAIC 604
Cdd:cd00054    1 DIDECAsGNPCqNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1312-1343 2.86e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 38.00  E-value: 2.86e-03
                         10        20        30
                 ....*....|....*....|....*....|..
gi 24041035 1312 PCLNGGTCavaSNMPDGFICRCPPGFSGARCQ 1343
Cdd:cd00054   10 PCQNGGTC---VNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1117-1147 5.19e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 5.19e-03
                         10        20        30
                 ....*....|....*....|....*....|.
gi 24041035 1117 EHLCQHSGVCINAGNTHYCQCPLGYTGSYCE 1147
Cdd:cd00054    8 GNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1264-1302 6.30e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 6.30e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|
gi 24041035 1264 DINECLS-NPCSSEGSldCIQLTNDYLCVCRSAFTGRHCE 1302
Cdd:cd00054    1 DIDECASgNPCQNGGT--CVNTVGSYRCSCPPGYTGRNCE 38
NOD pfam06816
NOTCH protein; NOTCH signalling plays a fundamental role during a great number of ...
1539-1595 9.44e-26

NOTCH protein; NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. The role of the NOD domain remains to be elucidated.


Pssm-ID: 191614  Cd Length: 57  Bit Score: 103.85  E-value: 9.44e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 24041035   1539 PENLAEGTLVIVVLMPPEQLLQDARSFLRALGTLLHTNLRIKRDSQGELMVYPYYGE 1595
Cdd:pfam06816    1 PPKLAEGTLVIVVLIPPEELRNNSVQFLRELSHLLRTNVRFKKDANGQPMIFPWYGE 57
DUF3454 pfam11936
Domain of unknown function (DUF3454); This presumed domain is functionally uncharacterized. ...
2380-2445 1.22e-24

Domain of unknown function (DUF3454); This presumed domain is functionally uncharacterized. It is found in eukaryotes, is about 60 amino acids in length, and is found associated with pfam00066, pfam00008, pfam06816, pfam07684, and pfam00023.


Pssm-ID: 256741  Cd Length: 64  Bit Score: 101.07  E-value: 1.22e-24
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 24041035   2380 VGKYPTPPSQHSYaSSNAAERTPSHSGHLQgEHPYLTPSPESPDQWSSSSPHSASDWSDVTTSPTP 2445
Cdd:pfam11936    1 VEQYPTPPSQHSS-SSSSGDNTPQHQLQVP-DHPYLTPSPESPDQWSSSSPHSNSDWSEGISSPPT 64
NODP pfam07684
NOTCH protein; NOTCH signalling plays a fundamental role during a great number of ...
1617-1673 5.21e-14

NOTCH protein; NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. The role of the NOD and NODP domains remains to be elucidated.


Pssm-ID: 254356  Cd Length: 62  Bit Score: 70.01  E-value: 5.21e-14
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 24041035   1617 EVAGSKVFLEIDNRQCVQDSDHCFKNTDAAAALLASHAIQGTL--SYPLVSVVSESLTP 1673
Cdd:pfam07684    1 EVTGSVVYLEIDNRKCSQQSGECFWSAQSAAAFLAALAAKGGLdtPYPISSVRSEPDEP 59
Notch pfam00066
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1499-1534 3.87e-09

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.


Pssm-ID: 249555  Cd Length: 38  Bit Score: 55.38  E-value: 3.87e-09
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 24041035   1499 NSKTCKYDKYCADHFKDNHCDQGCNSEECGWDGLDC 1534
Cdd:pfam00066    2 PWKNCPKAQYCEKKFGDGVCDPECNNAECLFDGGDC 37
Notch pfam00066
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1420-1456 1.06e-08

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.


Pssm-ID: 249555  Cd Length: 38  Bit Score: 54.22  E-value: 1.06e-08
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 24041035   1420 TPPATC-LSQYCADKARDGVCDEACNSHACQWDGGDCS 1456
Cdd:pfam00066    1 SPWKNCpKAQYCEKKFGDGVCDPECNNAECLFDGGDCS 38
NL smart00004
Domain found in Notch and Lin-12; The Notch protein is essential for the proper ...
1418-1455 2.46e-08

Domain found in Notch and Lin-12; The Notch protein is essential for the proper differentiation of the Drosophila ectoderm. This protein contains 3 NL domains.


Pssm-ID: 197463  Cd Length: 38  Bit Score: 53.10  E-value: 2.46e-08
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 24041035    1418 PSTPPATCLSQYCADKARDGVCDEACNSHACQWDGGDC 1455
Cdd:smart00004    1 PQDPWSRCEDAQCWDKFGDGVCDEECNNAECLWDGGDC 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
182-219 3.37e-08

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 52.63  E-value: 3.37e-08
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     182 DVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFT-GQYCD 219
Cdd:smart00179    1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
873-909 1.63e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 50.71  E-value: 1.63e-07
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     873 DIDECISK-PCMNHGLCHNTQGSYMCECPPGFS-GMDCE 909
Cdd:smart00179    1 DIDECASGnPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
NL smart00004
Domain found in Notch and Lin-12; The Notch protein is essential for the proper ...
1496-1534 9.93e-07

Domain found in Notch and Lin-12; The Notch protein is essential for the proper differentiation of the Drosophila ectoderm. This protein contains 3 NL domains.


Pssm-ID: 197463  Cd Length: 38  Bit Score: 48.48  E-value: 9.93e-07
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1496 CQGNSKTCKyDKYCADHFKDNHCDQGCNSEECGWDGLDC 1534
Cdd:smart00004    1 PQDPWSRCE-DAQCWDKFGDGVCDEECNNAECLWDGGDC 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
1025-1061 2.46e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 47.24  E-value: 2.46e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1025 EINECSS-HPCLNEGTCVDGLGTYRCSCPLGYT-GKNCQ 1061
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
1151-1185 2.61e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 47.24  E-value: 2.61e-06
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 24041035    1151 DECAS-NPCQHGATCSDFIGGYRCECVPGYQ-GVNCE 1185
Cdd:smart00179    3 DECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
682-711 3.73e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 46.86  E-value: 3.73e-06
                            10        20        30
                    ....*....|....*....|....*....|.
gi 24041035     682 DIDECAS-NPCRKGATCINGVNGFRCICPEG 711
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPG 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
456-492 4.30e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 46.47  E-value: 4.30e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     456 DINECHS-DPCQNDATCLDKIGGFTCLCMPGFK-GVHCE 492
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
757-793 5.29e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 46.09  E-value: 5.29e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     757 DKNECLS-NPCQNGGTCDNLVNGYRCTCKKGFK-GYNCQ 793
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
494-530 9.49e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 45.70  E-value: 9.49e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     494 EINECQS-NPCVNNGQCVDKVNRFQCLCPPGFT-GPVCQ 530
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
949-985 1.00e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 45.32  E-value: 1.00e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     949 DMNECLSE-PCKNGGTCSDYVNSYTCKCQAGF-DGVHCE 985
Cdd:smart00179    1 DIDECASGnPCQNGGTCVNTVGSYRCECPPGYtDGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
415-454 1.22e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 45.32  E-value: 1.22e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 24041035     415 DVDECAManSNPCEHAGKCVNTDGAFHCECLKGY-AGPRCE 454
Cdd:smart00179    1 DIDECAS--GNPCQNGGTCVNTVGSYRCECPPGYtDGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
1225-1262 1.38e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 44.93  E-value: 1.38e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1225 NIDDCARGPHCLNGGQCMDRIGGYSCRCLPGF-AGERCE 1262
Cdd:smart00179    1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYtDGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
1187-1223 2.02e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 44.55  E-value: 2.02e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1187 EVDECQ-NQPCQNGGTCIDLVNHFKCSCPPG-TRGLLCE 1223
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGyTDGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
608-643 3.08e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 44.16  E-value: 3.08e-05
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 24041035     608 IDECYS-SPCLNDGRCIDLVNGYQCNCQPG-TSGVNCE 643
Cdd:smart00179    2 IDECASgNPCQNGGTCVNTVGSYRCECPPGyTDGRNCE 39
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
1877-1908 4.16e-05

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 249517  Cd Length: 33  Bit Score: 43.71  E-value: 4.16e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 24041035   1877 GEMALHLAARYSRADAAKRLLDAGADANAQDN 1908
Cdd:pfam00023    2 GNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
911-947 6.34e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 43.00  E-value: 6.34e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     911 DIDDCL-ANPCQNGGSCMDGVNTFSCLCLPGFT-GDKCQ 947
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
532-568 6.53e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 43.00  E-value: 6.53e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     532 DIDDCSST-PCLNGAKCIDHPNGYECQCATGFT-GVLCE 568
Cdd:smart00179    1 DIDECASGnPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
1976-2008 7.39e-05

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 249517  Cd Length: 33  Bit Score: 42.94  E-value: 7.39e-05
                           10        20        30
                   ....*....|....*....|....*....|...
gi 24041035   1976 HGKSALHWAAAVNNVEATLLLLKNGANRDMQDN 2008
Cdd:pfam00023    1 DGNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
987-1022 7.51e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 43.00  E-value: 7.51e-05
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 24041035     987 NINECTESS-CFNGGTCVDGINSFSCLCPVGFT-GSFC 1022
Cdd:smart00179    1 DIDECASGNpCQNGGTCVNTVGSYRCECPPGYTdGRNC 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
795-831 2.95e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 41.08  E-value: 2.95e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     795 NIDECAS-NPCLNQGTCFDDISGYTCHCVLPYT-GKNCQ 831
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
186-217 5.55e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 249503  Cd Length: 32  Bit Score: 40.10  E-value: 5.55e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 24041035    186 CDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQY 217
Cdd:pfam00008    1 CSPNNPCSNGGTCVDTPGGYTCECPPGYTGKR 32
EGF_CA smart00179
Calcium-binding EGF-like domain;
298-335 9.93e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 39.54  E-value: 9.93e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     298 DVDECLlQPNACQNGGTCANRNGGYGCVCVNGWS-GDDC 335
Cdd:smart00179    1 DIDECA-SGNPCQNGGTCVNTVGSYRCECPPGYTdGRNC 38
Chorion_3 pfam05387
Chorion family 3; This family consists of several Drosophila chorion proteins S36 and S38. The ...
2292-2421 1.73e-03

Chorion family 3; This family consists of several Drosophila chorion proteins S36 and S38. The chorion genes of Drosophila are amplified in response to developmental signals in the follicle cells of the ovary.


Pssm-ID: 253174  Cd Length: 277  Bit Score: 41.63  E-value: 1.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2292 KHITTPREPLPPIVtfqlipkgsIAQPAGAPQPQSTCPPAVAGPLPTMYQIPemarlPSVAF--------PTAMmpqqdg 2363
Cdd:pfam05387  144 NHQVIATQPLPPII---------VKQPGAPPKVLVNGPPLVVKPAPVIYKIK-----PSVIYqqevinkvPTPL------ 203
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035   2364 QVAQTILPAYHPfPASVGKYPTPPSQHSYASsnaaertPSHSGHlQGEHPYLTPSPES 2421
Cdd:pfam05387  204 SLNPVYVKVYKP-GKKIEAPLVPEVQQVYSQ-------PSYGGS-EYSQPREQASPSS 252
PHA02887 PHA02887
EGF-like protein; Provisional
629-681 2.01e-03

EGF-like protein; Provisional


Pssm-ID: 165214  Cd Length: 126  Bit Score: 39.14  E-value: 2.01e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035   629 YQCNCQPGTSGVNCEINFDDCAS---NPCIHGICMDGIN--RYSCVCSPGFTGQRCNI 681
Cdd:PHA02887   66 YKENANAQNFKRKNSMFFEKCKNdfnDFCINGECMNIIDldEKFCICNKGYTGIRCDE 123
EGF_CA smart00179
Calcium-binding EGF-like domain;
260-296 2.25e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 38.38  E-value: 2.25e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     260 NIDDC-PNHRCQNGGVCVDGVNTYNCRCPPQWT-GQFCT 296
Cdd:smart00179    1 DIDECaSGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1032-1059 2.43e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 249503  Cd Length: 32  Bit Score: 38.18  E-value: 2.43e-03
                           10        20
                   ....*....|....*....|....*...
gi 24041035   1032 HPCLNEGTCVDGLGTYRCSCPLGYTGKN 1059
Cdd:pfam00008    5 NPCSNGGTCVDTPGGYTCECPPGYTGKR 32
NL smart00004
Domain found in Notch and Lin-12; The Notch protein is essential for the proper ...
1459-1496 2.53e-03

Domain found in Notch and Lin-12; The Notch protein is essential for the proper differentiation of the Drosophila ectoderm. This protein contains 3 NL domains.


Pssm-ID: 197463  Cd Length: 38  Bit Score: 38.46  E-value: 2.53e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1459 MENPWANCSSPlPCWDYINN-QCDELCNTVECLFDNFEC 1496
Cdd:smart00004    1 PQDPWSRCEDA-QCWDKFGDgVCDEECNNAECLWDGGDC 38
Notch pfam00066
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1461-1497 3.68e-03

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.


Pssm-ID: 249555  Cd Length: 38  Bit Score: 38.04  E-value: 3.68e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 24041035   1461 NPWANCSSPLPCWDYINN-QCDELCNTVECLFDNFECQ 1497
Cdd:pfam00066    1 SPWKNCPKAQYCEKKFGDgVCDPECNNAECLFDGGDCS 38
EGF_CA pfam07645
Calcium-binding EGF domain;
298-330 8.50e-03

Calcium-binding EGF domain;


Pssm-ID: 254326  Cd Length: 42  Bit Score: 36.94  E-value: 8.50e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 24041035    298 DVDECLLQPNACQNGGTCANRNGGYGCVCVNGW 330
Cdd:pfam07645    1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGY 33
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1948-2040 8.26e-19

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 85.00  E-value: 8.26e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1948 LILAARLAVEGMVAELINCQADVNAVDDHgkSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAK 2027
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLEKGADVNLGDTD--TALHLAARNGNLEIVKLLLENGADVNAKDKDGNTALHLAARNGNLEIVK 78
                           90
                   ....*....|...
gi 24041035   2028 ILLDHFANRDITD 2040
Cdd:pfam12796   79 LLLEHGADINLKD 91
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1832-1940 7.19e-11

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 61.50  E-value: 7.19e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1832 LMLASLRGgssdlsdededaedsSANIITDLVYQGASLQAQTDRTgemALHLAARYSRADAAKRLLDAGADANAQDNMGR 1911
Cdd:pfam12796    1 LHLAAKNG---------------NLELVKLLLEKGADVNLGDTDT---ALHLAARNGNLEIVKLLLENGADVNAKDKDGN 62
                           90       100
                   ....*....|....*....|....*....
gi 24041035   1912 CPLHAAVAADAQGVFQILIRNRVtDLDAR 1940
Cdd:pfam12796   63 TALHLAARNGNLEIVKLLLEHGA-DINLK 90
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
1879-2066 1.23e-15

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 223738 [Multi-domain]  Cd Length: 235  Bit Score: 78.71  E-value: 1.23e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1879 MALHLAARYSRADAAKRLLDAGADANAQDNM----GRCPLHAAVAADAQGVFQILIRNRVTD--LDARMNDGTTPLILAA 1952
Cdd:COG0666    2 KPSLSALLLINKCFLDLLLVALLLLLSLDLSnpsdKKLNLYLELALLPAASLSELLLKLIVDrhLAARDLDGRLPLHSAA 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1953 RLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNN-----VEATLLLLKNGANRD---MQDNKEETPLFLAAREGSYE 2024
Cdd:COG0666   82 SKGDDKIVKLLLASGADVNAKDADGDTPLHLAALNGNppegnIEVAKLLLEAGADLDvnnLRDEDGNTPLHWAALNGDAD 161
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|..
gi 24041035 2025 AAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRLLDEYN 2066
Cdd:COG0666  162 IVELLLEAGADPNSRNSYGVTALDPAAKNGRIELVKLLLDKG 203
PHA03095 PHA03095
ankyrin-like protein; Provisional
1794-2030 5.45e-15

ankyrin-like protein; Provisional


Pssm-ID: 222980 [Multi-domain]  Cd Length: 471  Bit Score: 78.91  E-value: 5.45e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1794 LEA-ADIRRTPSLALTPP----QAEQEVDVL--------DVNVRGPDGCTPLMlASLRGGSSDlsdededaedssANIIT 1860
Cdd:PHA03095   70 LEAgADVNAPERCGFTPLhlylYNATTLDVIkllikagaDVNAKDKVGRTPLH-VYLSGFNIN------------PKVIR 136
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1861 DLVYQGASLQAqTDRTGEMALHLAARYSRADAA--KRLLDAGADANAQDNMGRCPLH--AAVAADAQGVFQILIRnRVTD 1936
Cdd:PHA03095  137 LLLRKGADVNA-LDLYGMTPLAVLLKSRNANVEllRLLIDAGADVYAVDDRFRSLLHhhLQSFKPRARIVRELIR-AGCD 214
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1937 LDARMNDGTTPLILAARLAV--EGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPL 2014
Cdd:PHA03095  215 PAATDMLGNTPLHSMATGSSckRSLVLPLLIAGISINARNRYGQTPLHYAAVFNNPRACRRLIALGADINAVSSDGNTPL 294
                         250
                  ....*....|....*.
gi 24041035  2015 FLAAREGSYEAAKILL 2030
Cdd:PHA03095  295 SLMVRNNNGRAVRAAL 310
PHA03100 PHA03100
ankyrin repeat protein; Provisional
1820-2008 1.56e-11

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 67.77  E-value: 1.56e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1820 DVNVRGPDGCTPLMLASLRggssdlsdededaEDSSANIITDLVYQGASLQAQTDRtGEMALHLAARYSRADA--AKRLL 1897
Cdd:PHA03100   98 NVNAPDNNGITPLLYAISK-------------KSNSYSIVEYLLDNGANVNIKNSD-GENLLHLYLESNKIDLkiLKLLI 163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1898 DAGADANAQDNMgrcplhaavaadaqgvfQILIRNRVtDLDARMNDGTTPLILAARLAVEGMVAELINCQADVNAVDDHG 1977
Cdd:PHA03100  164 DKGVDINAKNRV-----------------NYLLSYGV-PINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYG 225
                         170       180       190
                  ....*....|....*....|....*....|.
gi 24041035  1978 KSALHWAAAVNNVEATLLLLKNGANRDMQDN 2008
Cdd:PHA03100  226 DTPLHIAILNNNKEIFKLLLNNGPSIKTIIE 256
DUF1421 pfam07223
Protein of unknown function (DUF1421); This family represents a conserved region approximately ...
2130-2422 2.36e-10

Protein of unknown function (DUF1421); This family represents a conserved region approximately 350 residues long within a number of plant proteins of unknown function.


Pssm-ID: 254110 [Multi-domain]  Cd Length: 357  Bit Score: 63.42  E-value: 2.36e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2130 RRKKSLSE-KVQLSESSVTLSPVDSLE--SPHTYVSDTTSSPMITSPGILQASPNPMLATAAPPAPvhAQHALSFSNLHE 2206
Cdd:pfam07223   12 RDKQEIAEtQKELSKLQLSHEEAQSSEahSFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAP--APQSPQPDQQQQ 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2207 MQ-PLAHGASTVLPsVSQLLSHHHIVSPgsgsagslsrlHPVPVPAdwmnrmevnetqynemfgmvlAPAEGTHPG---- 2281
Cdd:pfam07223   90 SQaPPSHQYPSQLP-PQQVQSVPQQPTP-----------QQEPYYP---------------------PPSQPQPPPaqqp 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2282 IAPQSRPPegkhittPREPLPPivTFQLIPKGSIAQPAGAPQPQSTCPPAVAGPLPTMYQIPEMArlPSVAFPTAMMPQq 2361
Cdd:pfam07223  137 QAQQPQPP-------PQVPQQQ--QYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYP--PNEPLPSSMAMQ- 204
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035   2362 dgqvaqtilPAYHPFPASVGKY-PTPPSQHSYasSNAAERTPSH--SGHL----QGEHPYLTPSPESP 2422
Cdd:pfam07223  205 ---------PPYSGAPPSQQFYgPPQPSPYMY--GGPGGRPNSGfpSGQQpppsQGQEGYGYSGPPPS 261
PHA03247 PHA03247
large tegument protein UL36; Provisional
2068-2422 7.57e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.34  E-value: 7.57e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2068 TPSPPGTvltSALSPVICGPnrsflslkhTPMGKKSRRPSAKSTMPTSLPNLAKEAKDAKGSRRKKSLSEKVQLSESS-- 2145
Cdd:PHA03247 2615 SPLPPDT---HAPDPPPPSP---------SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqr 2682
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2146 -------VTLSPVDSLESPHTYVSDTTSSPMITSPGI---------LQASPNPMLATAAPPAP----VHAQHALSFSNLH 2205
Cdd:PHA03247 2683 prrraarPTVGSLTSLADPPPPPPTPEPAPHALVSATplppgpaaaRQASPALPAAPAPPAVPagpaTPGGPARPARPPT 2762
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2206 EMQPLAHGASTVLPSVSQLLSHHHIVSPGSGSAGSL-SRLHPVPVPADWMNRmevnetqyNEMFGMVLAPAEGTHPGIAP 2284
Cdd:PHA03247 2763 TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLpSPWDPADPPAAVLAP--------AAALPPAASPAGPLPPPTSA 2834
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2285 QSRPPegkhiTTPREPLPPIVTFQ--LIPKGSIAQ--PAGAPQPQSTCP----------PAVAGPLPTMYQIP-EMARLP 2349
Cdd:PHA03247 2835 QPTAP-----PPPPGPPPPSLPLGgsVAPGGDVRRrpPSRSPAAKPAAParppvrrlarPAVSRSTESFALPPdQPERPP 2909
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035  2350 SVAFPTAMMPQQDGQVAQTILPAYHPFPASvgKYPTPPSQHSYASSNAAERTPS-HSGHL-QGEHP---YLTPSPESP 2422
Cdd:PHA03247 2910 QPQAPPPPQPQPQPPPPPQPQPPPPPPPRP--QPPLAPTTDPAGAGEPSGAVPQpWLGALvPGRVAvprFRVPQPAPS 2985
PLN03192 PLN03192
Voltage-dependent potassium channel; Provisional
1873-2011 1.24e-08

Voltage-dependent potassium channel; Provisional


Pssm-ID: 215625 [Multi-domain]  Cd Length: 823  Bit Score: 59.11  E-value: 1.24e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1873 TDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIR-NRVTDLDArmndGTTPLILA 1951
Cdd:PLN03192  554 GDSKGRTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWNAISAKHHKIFRILYHfASISDPHA----AGDLLCTA 629
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1952 ARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEE 2011
Cdd:PLN03192  630 AKRNDLTAMKELLKQGLNVDSEDHQGATALQVAMAEDHVDMVRLLIMNGADVDKANTDDD 689
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
1819-1931 2.76e-08

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 223738 [Multi-domain]  Cd Length: 235  Bit Score: 55.60  E-value: 2.76e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1819 LDVNVRGPDGCTPLMLASLRG----GSSDLSDEDEDAEDSSANIItdlvyqgaslqaQTDRTGEMALHLAARYSRADAAK 1894
Cdd:COG0666   97 ADVNAKDADGDTPLHLAALNGnppeGNIEVAKLLLEAGADLDVNN------------LRDEDGNTPLHWAALNGDADIVE 164
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 24041035 1895 RLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIR 1931
Cdd:COG0666  165 LLLEAGADPNSRNSYGVTALDPAAKNGRIELVKLLLD 201
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
1963-2032 1.39e-05

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 49.13  E-value: 1.39e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1963 LINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDH 2032
Cdd:PTZ00322  101 LLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVVQLLSRH 170
PRK10263 PRK10263
DNA translocase FtsK; Provisional
2296-2422 1.05e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.62  E-value: 1.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2296 TPREPLP--------PIVTFQLIPKGSIAQPAGAPQPQSTCP-PAVAGPLPTM---YQIPEMARLPSVAFPTAMMPQQDG 2363
Cdd:PRK10263  341 TQTPPVAsvdvppaqPTVAWQPVPGPQTGEPVIAPAPEGYPQqSQYAQPAVQYnepLQQPVQPQQPYYAPAAEQPAQQPY 420
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 24041035  2364 QVAQTILPAYHPFPASVgkyPTPPSQHSYASSNAAERTPSHSGHLQGEHPYLTPSPESP 2422
Cdd:PRK10263  421 YAPAPEQPAQQPYYAPA---PEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEP 476
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
1766-1942 1.96e-03

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 42.19  E-value: 1.96e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1766 PKKVKAEDEALLSEEDDPIDrrpwtqQHLEAadIRRTPSLALTPPQAEQEVDVLDvnvrgpdgctPLMLASLrggSSDLS 1845
Cdd:PTZ00322   29 AKPISFERMAAIQEEIARID------THLEA--LEATENKDATPDHNLTTEEVID----------PVVAHML---TVELC 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1846 DEDEDAEDSSANIitdLVYQGASLQAQtDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGV 1925
Cdd:PTZ00322   88 QLAASGDAVGARI---LLTGGADPNCR-DYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREV 163
                         170
                  ....*....|....*..
gi 24041035  1926 FQILIRNRVTDLDARMN 1942
Cdd:PTZ00322  164 VQLLSRHSQCHFELGAN 180
Plasmod_Pvs28 pfam06247
Plasmodium ookinete surface protein Pvs28; This family consists of several ookinete surface ...
777-938 3.01e-03

Plasmodium ookinete surface protein Pvs28; This family consists of several ookinete surface proteins (Pvs28) from several species of Plasmodium. Pvs25 and Pvs28 are expressed on the surface of ookinetes. These proteins are potential vaccine candidates and induce antibodies that block the infectivity of Plasmodium vivax in immunised animals.


Pssm-ID: 253638 [Multi-domain]  Cd Length: 196  Bit Score: 40.11  E-value: 3.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035    777 NGYRCTCKKGF--KGYNCQVNIDECAS-----NPCLNQGTCFDDISG-----YTCHCVLPYTGKNCQTVLAPCSPNPCEN 844
Cdd:pfam06247   18 NHFECKCNEGYvlKNENTCEEKVKCDKlenvnKVCGEYATCINQANKaeekaLKCGCINGYTLSQGVCVPNKCNNKVCGS 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035    845 AAVCKESPNFESYTCLCAPGW---QGQRCTIDIDECISKPCMNHGLCHNTQGSYMCECPPGFSGmdceeDIDDCLANPCQ 921
Cdd:pfam06247   98 GKCIVDPANPNNTTCSCNIGKvpdQNGKCTKTGETKCSLKCKENEECKLVGGYYECVCKEGFPG-----DGGGTGSGGPP 172
                          170
                   ....*....|....*..
gi 24041035    922 NGGSCMDGVNTFSCLCL 938
Cdd:pfam06247  173 TSSSVMNGMSIFSILNL 189
 
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
182-218 4.02e-09

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 55.34  E-value: 4.02e-09
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  182 DVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQYC 218
Cdd:cd00054    1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1026-1061 5.75e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 51.87  E-value: 5.75e-08
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035 1026 INECSS-HPCLNEGTCVDGLGTYRCSCPLGYTGKNCQ 1061
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
873-909 5.81e-08

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 51.87  E-value: 5.81e-08
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  873 DIDECISK-PCMNHGLCHNTQGSYMCECPPGFSGMDCE 909
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
757-793 2.01e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.33  E-value: 2.01e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  757 DKNECLS-NPCQNGGTCDNLVNGYRCTCKKGFKGYNCQ 793
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1151-1185 2.11e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 50.33  E-value: 2.11e-07
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 24041035 1151 DECAS-NPCQHGATCSDFIGGYRCECVPGYQGVNCE 1185
Cdd:cd00054    3 DECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
456-492 2.73e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.94  E-value: 2.73e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  456 DINECHS-DPCQNDATCLDKIGGFTCLCMPGFKGVHCE 492
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
495-530 2.91e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.94  E-value: 2.91e-07
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  495 INECQS-NPCVNNGQCVDKVNRFQCLCPPGFTGPVCQ 530
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1225-1262 6.04e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 49.17  E-value: 6.04e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035 1225 NIDDCARGPHCLNGGQCMDRIGGYSCRCLPGFAGERCE 1262
Cdd:cd00054    1 DIDECASGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
949-985 9.52e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 9.52e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  949 DMNECLSE-PCKNGGTCSDYVNSYTCKCQAGFDGVHCE 985
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
911-947 9.98e-07

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.40  E-value: 9.98e-07
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  911 DIDDCL-ANPCQNGGSCMDGVNTFSCLCLPGFTGDKCQ 947
Cdd:cd00054    1 DIDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
415-454 1.51e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 48.02  E-value: 1.51e-06
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|
gi 24041035  415 DVDECAManSNPCEHAGKCVNTDGAFHCECLKGYAGPRCE 454
Cdd:cd00054    1 DIDECAS--GNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
682-717 1.56e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.63  E-value: 1.56e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  682 DIDECAS-NPCRKGATCINGVNGFRCICPEGPHHPSC 717
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
608-643 2.11e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.11e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  608 IDECYS-SPCLNDGRCIDLVNGYQCNCQPGTSGVNCE 643
Cdd:cd00054    2 IDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
532-568 2.21e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.21e-06
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  532 DIDDCSST-PCLNGAKCIDHPNGYECQCATGFTGVLCE 568
Cdd:cd00054    1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1188-1223 2.32e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 47.25  E-value: 2.32e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035 1188 VDECQ-NQPCQNGGTCIDLVNHFKCSCPPGTRGLLCE 1223
Cdd:cd00054    2 IDECAsGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
987-1022 4.44e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 46.48  E-value: 4.44e-06
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  987 NINECTESS-CFNGGTCVDGINSFSCLCPVGFTGSFC 1022
Cdd:cd00054    1 DIDECASGNpCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
795-831 8.36e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 45.71  E-value: 8.36e-06
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  795 NIDECAS-NPCLNQGTCFDDISGYTCHCVLPYTGKNCQ 831
Cdd:cd00054    1 DIDECASgNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
260-296 5.87e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 5.87e-05
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  260 NIDDC-PNHRCQNGGVCVDGVNTYNCRCPPQWTGQFCT 296
Cdd:cd00054    1 DIDECaSGNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
298-335 5.92e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 43.01  E-value: 5.92e-05
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 24041035  298 DVDECLlQPNACQNGGTCANRNGGYGCVCVNGWSGDDC 335
Cdd:cd00054    1 DIDECA-SGNPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
645-679 1.41e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 39.16  E-value: 1.41e-03
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  645 NFDDCAS-NPC-IHGICMDGINRYSCVCSPGFTGQRC 679
Cdd:cd00054    1 DIDECASgNPCqNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
570-604 2.23e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 38.39  E-value: 2.23e-03
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 24041035  570 NIDNCD-PDPC-HHGQCQDGIDSYTCICNPGYMGAIC 604
Cdd:cd00054    1 DIDECAsGNPCqNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1312-1343 2.86e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 38.00  E-value: 2.86e-03
                         10        20        30
                 ....*....|....*....|....*....|..
gi 24041035 1312 PCLNGGTCavaSNMPDGFICRCPPGFSGARCQ 1343
Cdd:cd00054   10 PCQNGGTC---VNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1117-1147 5.19e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 5.19e-03
                         10        20        30
                 ....*....|....*....|....*....|.
gi 24041035 1117 EHLCQHSGVCINAGNTHYCQCPLGYTGSYCE 1147
Cdd:cd00054    8 GNPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
1264-1302 6.30e-03

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 37.23  E-value: 6.30e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|
gi 24041035 1264 DINECLS-NPCSSEGSldCIQLTNDYLCVCRSAFTGRHCE 1302
Cdd:cd00054    1 DIDECASgNPCQNGGT--CVNTVGSYRCSCPPGYTGRNCE 38
ANK cd00204
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse ...
1904-2030 1.30e-29

ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.


Pssm-ID: 238125 [Multi-domain]  Cd Length: 126  Bit Score: 117.48  E-value: 1.30e-29
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1904 NAQDNMGRCPLHAAVAADAQGVFQILIRNRVtDLDARMNDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSALHW 1983
Cdd:cd00204    1 NARDEDGRTPLHLAASNGHLEVVKLLLENGA-DVNAKDNDGRTPLHLAAKNGHLEIVKLLLEKGADVNARDKDGNTPLHL 79
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*..
gi 24041035 1984 AAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILL 2030
Cdd:cd00204   80 AARNGNLDVVKLLLKHGADVNARDKDGRTPLHLAAKNGHLEVVKLLL 126
NOD pfam06816
NOTCH protein; NOTCH signalling plays a fundamental role during a great number of ...
1539-1595 9.44e-26

NOTCH protein; NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. The role of the NOD domain remains to be elucidated.


Pssm-ID: 191614  Cd Length: 57  Bit Score: 103.85  E-value: 9.44e-26
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 24041035   1539 PENLAEGTLVIVVLMPPEQLLQDARSFLRALGTLLHTNLRIKRDSQGELMVYPYYGE 1595
Cdd:pfam06816    1 PPKLAEGTLVIVVLIPPEELRNNSVQFLRELSHLLRTNVRFKKDANGQPMIFPWYGE 57
DUF3454 pfam11936
Domain of unknown function (DUF3454); This presumed domain is functionally uncharacterized. ...
2380-2445 1.22e-24

Domain of unknown function (DUF3454); This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 60 amino acids in length. This domain is found associated with pfam00066, pfam00008, pfam06816, pfam07684, pfam00023.


Pssm-ID: 256741  Cd Length: 64  Bit Score: 101.07  E-value: 1.22e-24
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 24041035   2380 VGKYPTPPSQHSYaSSNAAERTPSHSGHLQgEHPYLTPSPESPDQWSSSSPHSASDWSDVTTSPTP 2445
Cdd:pfam11936    1 VEQYPTPPSQHSS-SSSSGDNTPQHQLQVP-DHPYLTPSPESPDQWSSSSPHSNSDWSEGISSPPT 64
ANK cd00204
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse ...
1971-2066 3.10e-21

ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.


Pssm-ID: 238125 [Multi-domain]  Cd Length: 126  Bit Score: 92.83  E-value: 3.10e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1971 NAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDHFANRDITDHMDRLPRDVA 2050
Cdd:cd00204    1 NARDEDGRTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAAKNGHLEIVKLLLEKGADVNARDKDGNTPLHLA 80
                         90
                 ....*....|....*.
gi 24041035 2051 RDRMHHDIVRLLDEYN 2066
Cdd:cd00204   81 ARNGNLDVVKLLLKHG 96
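
Long alignments such as the one above are split into blocks, each row giving a label, a start coordinate, the residues, and an end coordinate. The following sketch parses one row and checks that the number of residues (letters, ignoring gap dashes) agrees with the stated coordinates. The names `AlignRow`, `parse_row`, and `residues_consistent` are hypothetical, and the row layout is assumed from the rows visible on this page.

```python
import re
from typing import NamedTuple, Optional


class AlignRow(NamedTuple):
    """One alignment row: label, start coordinate, residues, end coordinate."""
    label: str
    start: int
    seq: str
    end: int


# Row layout assumed from this page: "<label> <start> <residues> <end>",
# where the label is either "gi <number>" or "Cdd:<accession>".
_ROW_RE = re.compile(r"^\s*(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)\s*$")


def parse_row(line: str) -> Optional[AlignRow]:
    """Parse a single alignment row; return None for rulers and other lines."""
    m = _ROW_RE.match(line)
    if m is None:
        return None
    return AlignRow(m.group(1), int(m.group(2)), m.group(3), int(m.group(4)))


def residues_consistent(row: AlignRow) -> bool:
    """True if the letter count matches end - start + 1 (gaps are skipped)."""
    letters = sum(1 for c in row.seq if c.isalpha())
    return letters == row.end - row.start + 1


if __name__ == "__main__":
    row = parse_row("gi 24041035 2051 RDRMHHDIVRLLDEYN 2066")
    print(row, residues_consistent(row))
```

A check like this is a cheap way to catch rows that were truncated or shifted when the page was copied as text.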
ANK cd00204
ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse ...
1820-1930 1.16e-17

ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.


Pssm-ID: 238125 [Multi-domain]  Cd Length: 126  Bit Score: 82.43  E-value: 1.16e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1820 DVNVRGPDGCTPLMLASLRGgssdlsdededaedsSANIITDLVYQGASLQAQtDRTGEMALHLAARYSRADAAKRLLDA 1899
Cdd:cd00204   32 DVNAKDNDGRTPLHLAAKNG---------------HLEIVKLLLEKGADVNAR-DKDGNTPLHLAARNGNLDVVKLLLKH 95
                         90       100       110
                 ....*....|....*....|....*....|.
gi 24041035 1900 GADANAQDNMGRCPLHAAVAADAQGVFQILI 1930
Cdd:cd00204   96 GADVNARDKDGRTPLHLAAKNGHLEVVKLLL 126
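
The alignment above mixes uppercase residues, lowercase residues, and dashes. A rough percent-identity figure can be computed by comparing only columns where both rows carry an uppercase letter. This sketch assumes that lowercase letters and dashes mark columns outside the aligned core, which is how the display appears to be used here but is not stated on the page. The function name `identity_counts` is hypothetical.

```python
from typing import Tuple


def identity_counts(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, compared) over columns where both rows are uppercase.

    Columns holding a gap ('-') or a lowercase residue in either row are
    skipped. Treating lowercase as unaligned is an assumption.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    compared = 0
    for q, s in zip(query, subject):
        if not (q.isupper() and s.isupper()):
            continue  # gap or unaligned column
        compared += 1
        if q == s:
            identical += 1
    return identical, compared


if __name__ == "__main__":
    q = "GADANAQDNMGRCPLHAAVAADAQGVFQILI"
    s = "GADVNARDKDGRTPLHLAAKNGHLEVVKLLL"
    same, total = identity_counts(q, s)
    print(f"{same}/{total} identical columns")
```

For a multi-block alignment, the residue strings of each block would need to be concatenated per row before calling the function.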
NODP pfam07684
NOTCH protein; NOTCH signalling plays a fundamental role during a great number of ...
1617-1673 5.21e-14

NOTCH protein; NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. The role of the NOD and NODP domains remains to be elucidated.


Pssm-ID: 254356  Cd Length: 62  Bit Score: 70.01  E-value: 5.21e-14
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 24041035   1617 EVAGSKVFLEIDNRQCVQDSDHCFKNTDAAAALLASHAIQGTL--SYPLVSVVSESLTP 1673
Cdd:pfam07684    1 EVTGSVVYLEIDNRKCSQQSGECFWSAQSAAAFLAALAAKGGLdtPYPISSVRSEPDEP 59
Notch pfam00066
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1499-1534 3.87e-09

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.


Pssm-ID: 249555  Cd Length: 38  Bit Score: 55.38  E-value: 3.87e-09
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 24041035   1499 NSKTCKYDKYCADHFKDNHCDQGCNSEECGWDGLDC 1534
Cdd:pfam00066    2 PWKNCPKAQYCEKKFGDGVCDPECNNAECLFDGGDC 37
Notch pfam00066
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1420-1456 1.06e-08

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.


Pssm-ID: 249555  Cd Length: 38  Bit Score: 54.22  E-value: 1.06e-08
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 24041035   1420 TPPATC-LSQYCADKARDGVCDEACNSHACQWDGGDCS 1456
Cdd:pfam00066    1 SPWKNCpKAQYCEKKFGDGVCDPECNNAECLFDGGDCS 38
NL smart00004
Domain found in Notch and Lin-12; The Notch protein is essential for the proper ...
1418-1455 2.46e-08

Domain found in Notch and Lin-12; The Notch protein is essential for the proper differentiation of the Drosophila ectoderm. This protein contains 3 NL domains.


Pssm-ID: 197463  Cd Length: 38  Bit Score: 53.10  E-value: 2.46e-08
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 24041035    1418 PSTPPATCLSQYCADKARDGVCDEACNSHACQWDGGDC 1455
Cdd:smart00004    1 PQDPWSRCEDAQCWDKFGDGVCDEECNNAECLWDGGDC 38
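
Several hits in this list cover nearly the same stretch of the query under different models, for example the pfam00066 hit at 1420-1456 and the smart00004 hit at 1418-1455. A small sketch for measuring how many residues two inclusive intervals share follows. The function name `overlap_length` is hypothetical, and the intervals in the example are copied from the hit list above.

```python
from typing import Tuple

Interval = Tuple[int, int]  # inclusive (start, end) residue coordinates


def overlap_length(a: Interval, b: Interval) -> int:
    """Number of residues shared by two inclusive intervals (0 if disjoint)."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


if __name__ == "__main__":
    lnr_pfam = (1420, 1456)    # Notch pfam00066 hit
    nl_smart = (1418, 1455)    # NL smart00004 hit
    other_lnr = (1499, 1534)   # second Notch pfam00066 hit
    print(overlap_length(lnr_pfam, nl_smart))
    print(overlap_length(lnr_pfam, other_lnr))
```

Grouping hits by overlap in this way is one possible route to collapsing redundant models that describe the same region.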
EGF_CA smart00179
Calcium-binding EGF-like domain;
182-219 3.37e-08

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 52.63  E-value: 3.37e-08
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     182 DVNECDIPGHCQHGGTCLNLPGSYQCQCPQGFT-GQYCD 219
Cdd:smart00179    1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
873-909 1.63e-07

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 50.71  E-value: 1.63e-07
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     873 DIDECISK-PCMNHGLCHNTQGSYMCECPPGFS-GMDCE 909
Cdd:smart00179    1 DIDECASGnPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
NL smart00004
Domain found in Notch and Lin-12; The Notch protein is essential for the proper ...
1496-1534 9.93e-07

Domain found in Notch and Lin-12; The Notch protein is essential for the proper differentiation of the Drosophila ectoderm. This protein contains 3 NL domains.


Pssm-ID: 197463  Cd Length: 38  Bit Score: 48.48  E-value: 9.93e-07
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1496 CQGNSKTCKyDKYCADHFKDNHCDQGCNSEECGWDGLDC 1534
Cdd:smart00004    1 PQDPWSRCE-DAQCWDKFGDGVCDEECNNAECLWDGGDC 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
1025-1061 2.46e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 47.24  E-value: 2.46e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1025 EINECSS-HPCLNEGTCVDGLGTYRCSCPLGYT-GKNCQ 1061
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
1151-1185 2.61e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 47.24  E-value: 2.61e-06
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 24041035    1151 DECAS-NPCQHGATCSDFIGGYRCECVPGYQ-GVNCE 1185
Cdd:smart00179    3 DECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
682-711 3.73e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 46.86  E-value: 3.73e-06
                            10        20        30
                    ....*....|....*....|....*....|.
gi 24041035     682 DIDECAS-NPCRKGATCINGVNGFRCICPEG 711
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPG 31
EGF_CA smart00179
Calcium-binding EGF-like domain;
456-492 4.30e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 46.47  E-value: 4.30e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     456 DINECHS-DPCQNDATCLDKIGGFTCLCMPGFK-GVHCE 492
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
757-793 5.29e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 46.09  E-value: 5.29e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     757 DKNECLS-NPCQNGGTCDNLVNGYRCTCKKGFK-GYNCQ 793
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
494-530 9.49e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 45.70  E-value: 9.49e-06
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     494 EINECQS-NPCVNNGQCVDKVNRFQCLCPPGFT-GPVCQ 530
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
949-985 1.00e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 45.32  E-value: 1.00e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     949 DMNECLSE-PCKNGGTCSDYVNSYTCKCQAGF-DGVHCE 985
Cdd:smart00179    1 DIDECASGnPCQNGGTCVNTVGSYRCECPPGYtDGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
415-454 1.22e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 45.32  E-value: 1.22e-05
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 24041035     415 DVDECAManSNPCEHAGKCVNTDGAFHCECLKGY-AGPRCE 454
Cdd:smart00179    1 DIDECAS--GNPCQNGGTCVNTVGSYRCECPPGYtDGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
1225-1262 1.38e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 44.93  E-value: 1.38e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1225 NIDDCARGPHCLNGGQCMDRIGGYSCRCLPGF-AGERCE 1262
Cdd:smart00179    1 DIDECASGNPCQNGGTCVNTVGSYRCECPPGYtDGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
1187-1223 2.02e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 44.55  E-value: 2.02e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1187 EVDECQ-NQPCQNGGTCIDLVNHFKCSCPPG-TRGLLCE 1223
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGyTDGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
608-643 3.08e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 44.16  E-value: 3.08e-05
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 24041035     608 IDECYS-SPCLNDGRCIDLVNGYQCNCQPG-TSGVNCE 643
Cdd:smart00179    2 IDECASgNPCQNGGTCVNTVGSYRCECPPGyTDGRNCE 39
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
185-216 3.58e-05

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 44.00  E-value: 3.58e-05
                         10        20        30
                 ....*....|....*....|....*....|..
gi 24041035  185 ECDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQ 216
Cdd:cd00053    1 ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
1877-1908 4.16e-05

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 249517  Cd Length: 33  Bit Score: 43.71  E-value: 4.16e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 24041035   1877 GEMALHLAARYSRADAAKRLLDAGADANAQDN 1908
Cdd:pfam00023    2 GNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
911-947 6.34e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 43.00  E-value: 6.34e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     911 DIDDCL-ANPCQNGGSCMDGVNTFSCLCLPGFT-GDKCQ 947
Cdd:smart00179    1 DIDECAsGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF_CA smart00179
Calcium-binding EGF-like domain;
532-568 6.53e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 43.00  E-value: 6.53e-05
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     532 DIDDCSST-PCLNGAKCIDHPNGYECQCATGFT-GVLCE 568
Cdd:smart00179    1 DIDECASGnPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
1976-2008 7.39e-05

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 249517  Cd Length: 33  Bit Score: 42.94  E-value: 7.39e-05
                           10        20        30
                   ....*....|....*....|....*....|...
gi 24041035   1976 HGKSALHWAAAVNNVEATLLLLKNGANRDMQDN 2008
Cdd:pfam00023    1 DGNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
EGF_CA smart00179
Calcium-binding EGF-like domain;
987-1022 7.51e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 43.00  E-value: 7.51e-05
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 24041035     987 NINECTESS-CFNGGTCVDGINSFSCLCPVGFT-GSFC 1022
Cdd:smart00179    1 DIDECASGNpCQNGGTCVNTVGSYRCECPPGYTdGRNC 38
EGF_CA smart00179
Calcium-binding EGF-like domain;
795-831 2.95e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 41.08  E-value: 2.95e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     795 NIDECAS-NPCLNQGTCFDDISGYTCHCVLPYT-GKNCQ 831
Cdd:smart00179    1 DIDECASgNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
186-217 5.55e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 249503  Cd Length: 32  Bit Score: 40.10  E-value: 5.55e-04
                           10        20        30
                   ....*....|....*....|....*....|..
gi 24041035    186 CDIPGHCQHGGTCLNLPGSYQCQCPQGFTGQY 217
Cdd:pfam00008    1 CSPNNPCSNGGTCVDTPGGYTCECPPGYTGKR 32
EGF_CA smart00179
Calcium-binding EGF-like domain;
298-335 9.93e-04

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 39.54  E-value: 9.93e-04
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     298 DVDECLlQPNACQNGGTCANRNGGYGCVCVNGWS-GDDC 335
Cdd:smart00179    1 DIDECA-SGNPCQNGGTCVNTVGSYRCECPPGYTdGRNC 38
EGF_CA pfam07645
Calcium-binding EGF domain;
182-213 1.07e-03

Calcium-binding EGF domain;


Pssm-ID: 254326  Cd Length: 42  Bit Score: 39.64  E-value: 1.07e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 24041035    182 DVNECDIPGH-CQHGGTCLNLPGSYQCQCPQGF 213
Cdd:pfam07645    1 DVDECADGTHnCPANTVCVNTIGSFECVCPDGY 33
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
2011-2041 1.12e-03

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 249517  Cd Length: 33  Bit Score: 39.48  E-value: 1.12e-03
                           10        20        30
                   ....*....|....*....|....*....|.
gi 24041035   2011 ETPLFLAAREGSYEAAKILLDHFANRDITDH 2041
Cdd:pfam00023    3 NTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
1152-1185 1.53e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 39.00  E-value: 1.53e-03
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 24041035 1152 ECA-SNPCQHGATCSDFIGGYRCECVPGYQGV-NCE 1185
Cdd:cd00053    1 ECAaSNPCSNGGTCVNTPGSYRCVCPPGYTGDrSCE 36
Chorion_3 pfam05387
Chorion family 3; This family consists of several Drosophila chorion proteins S36 and S38. The ...
2292-2421 1.73e-03

Chorion family 3; This family consists of several Drosophila chorion proteins S36 and S38. The chorion genes of Drosophila are amplified in response to developmental signals in the follicle cells of the ovary.


Pssm-ID: 253174  Cd Length: 277  Bit Score: 41.63  E-value: 1.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2292 KHITTPREPLPPIVtfqlipkgsIAQPAGAPQPQSTCPPAVAGPLPTMYQIPemarlPSVAF--------PTAMmpqqdg 2363
Cdd:pfam05387  144 NHQVIATQPLPPII---------VKQPGAPPKVLVNGPPLVVKPAPVIYKIK-----PSVIYqqevinkvPTPL------ 203
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035   2364 QVAQTILPAYHPfPASVGKYPTPPSQHSYASsnaaertPSHSGHlQGEHPYLTPSPES 2421
Cdd:pfam05387  204 SLNPVYVKVYKP-GKKIEAPLVPEVQQVYSQ-------PSYGGS-EYSQPREQASPSS 252
PHA02887 PHA02887
EGF-like protein; Provisional
629-681 2.01e-03

EGF-like protein; Provisional


Pssm-ID: 165214  Cd Length: 126  Bit Score: 39.14  E-value: 2.01e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035   629 YQCNCQPGTSGVNCEINFDDCAS---NPCIHGICMDGIN--RYSCVCSPGFTGQRCNI 681
Cdd:PHA02887   66 YKENANAQNFKRKNSMFFEKCKNdfnDFCINGECMNIIDldEKFCICNKGYTGIRCDE 123
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
1028-1061 2.03e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 38.61  E-value: 2.03e-03
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 24041035 1028 ECS-SHPCLNEGTCVDGLGTYRCSCPLGYTG-KNCQ 1061
Cdd:cd00053    1 ECAaSNPCSNGGTCVNTPGSYRCVCPPGYTGdRSCE 36
EGF_CA smart00179
Calcium-binding EGF-like domain;
260-296 2.25e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542  Cd Length: 39  Bit Score: 38.38  E-value: 2.25e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035     260 NIDDC-PNHRCQNGGVCVDGVNTYNCRCPPQWT-GQFCT 296
Cdd:smart00179    1 DIDECaSGNPCQNGGTCVNTVGSYRCECPPGYTdGRNCE 39
Ank pfam00023
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
1943-1975 2.27e-03

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities. Repeats 13-24 are especially active, with known sites of interaction for the Na/K ATPase, Cl/HCO(3) anion exchanger, voltage-gated sodium channel, clathrin heavy chain and L1 family cell adhesion molecules. The ANK repeats are found to form a contiguous spiral stack such that ion transporters like the anion exchanger associate in a large central cavity formed by the ANK repeat spiral, while clathrin and cell adhesion molecules associate with specific regions outside this cavity.


Pssm-ID: 249517  Cd Length: 33  Bit Score: 38.32  E-value: 2.27e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 24041035   1943 DGTTPLILAARLAVEGMVAELINCQADVNAVDD 1975
Cdd:pfam00023    1 DGNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
1032-1059 2.43e-03

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminal regions of the Ca2+-binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different subtypes of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 249503  Cd Length: 32  Bit Score: 38.18  E-value: 2.43e-03
                           10        20
                   ....*....|....*....|....*...
gi 24041035   1032 HPCLNEGTCVDGLGTYRCSCPLGYTGKN 1059
Cdd:pfam00008    5 NPCSNGGTCVDTPGGYTCECPPGYTGKR 32
NL smart00004
Domain found in Notch and Lin-12; The Notch protein is essential for the proper ...
1459-1496 2.53e-03

Domain found in Notch and Lin-12; The Notch protein is essential for the proper differentiation of the Drosophila ectoderm. This protein contains 3 NL domains.


Pssm-ID: 197463  Cd Length: 38  Bit Score: 38.46  E-value: 2.53e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 24041035    1459 MENPWANCSSPlPCWDYINN-QCDELCNTVECLFDNFEC 1496
Cdd:smart00004    1 PQDPWSRCEDA-QCWDKFGDgVCDEECNNAECLWDGGDC 38
Notch pfam00066
LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch ...
1461-1497 3.68e-03

LNR domain; The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.


Pssm-ID: 249555  Cd Length: 38  Bit Score: 38.04  E-value: 3.68e-03
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 24041035   1461 NPWANCSSPLPCWDYINN-QCDELCNTVECLFDNFECQ 1497
Cdd:pfam00066    1 SPWKNCPKAQYCEKKFGDgVCDPECNNAECLFDGGDCS 38
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
876-905 5.00e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.46  E-value: 5.00e-03
                         10        20        30
                 ....*....|....*....|....*....|.
gi 24041035  876 EC-ISKPCMNHGLCHNTQGSYMCECPPGFSG 905
Cdd:cd00053    1 ECaASNPCSNGGTCVNTPGSYRCVCPPGYTG 31
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
760-789 5.14e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.46  E-value: 5.14e-03
                         10        20        30
                 ....*....|....*....|....*....|.
gi 24041035  760 EC-LSNPCQNGGTCDNLVNGYRCTCKKGFKG 789
Cdd:cd00053    1 ECaASNPCSNGGTCVNTPGSYRCVCPPGYTG 31
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
497-527 5.80e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.46  E-value: 5.80e-03
                         10        20        30
                 ....*....|....*....|....*....|..
gi 24041035  497 ECQ-SNPCVNNGQCVDKVNRFQCLCPPGFTGP 527
Cdd:cd00053    1 ECAaSNPCSNGGTCVNTPGSYRCVCPPGYTGD 32
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
1228-1260 5.96e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.07  E-value: 5.96e-03
                         10        20        30
                 ....*....|....*....|....*....|...
gi 24041035 1228 DCARGPHCLNGGQCMDRIGGYSCRCLPGFAGER 1260
Cdd:cd00053    1 ECAASNPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
Ank_3 pfam13606
Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the ...
1877-1905 7.03e-03

Ankyrin repeat; Ankyrins are multifunctional adaptors that link specific proteins to the membrane-associated, spectrin-actin cytoskeleton. This repeat-domain is a 'membrane-binding' domain of up to 24 repeated units, and it mediates most of the protein's binding activities.


Pssm-ID: 257920  Cd Length: 30  Bit Score: 36.84  E-value: 7.03e-03
                           10        20
                   ....*....|....*....|....*....
gi 24041035   1877 GEMALHLAARYSRADAAKRLLDAGADANA 1905
Cdd:pfam13606    2 GNTPLHLAARNGNLELVKLLLENGADINA 30
EGF smart00181
Epidermal growth factor-like domain;
185-216 7.21e-03

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 37.11  E-value: 7.21e-03
                            10        20        30
                    ....*....|....*....|....*....|..
gi 24041035     185 ECDIPGHCQHGgTCLNLPGSYQCQCPQGFTGQ 216
Cdd:smart00181    1 ECASGGPCSNG-TCINTPGSYTCSCPPGYTGD 31
EGF_CA pfam07645
Calcium-binding EGF domain;
298-330 8.50e-03

Calcium-binding EGF domain;


Pssm-ID: 254326  Cd Length: 42  Bit Score: 36.94  E-value: 8.50e-03
                           10        20        30
                   ....*....|....*....|....*....|...
gi 24041035    298 DVDECLLQPNACQNGGTCANRNGGYGCVCVNGW 330
Cdd:pfam07645    1 DVDECADGTHNCPANTVCVNTIGSFECVCPDGY 33
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1948-2040 8.26e-19

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 85.00  E-value: 8.26e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1948 LILAARLAVEGMVAELINCQADVNAVDDHgkSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAK 2027
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLEKGADVNLGDTD--TALHLAARNGNLEIVKLLLENGADVNAKDKDGNTALHLAARNGNLEIVK 78
                           90
                   ....*....|...
gi 24041035   2028 ILLDHFANRDITD 2040
Cdd:pfam12796   79 LLLEHGADINLKD 91
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1832-1940 7.19e-11

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 61.50  E-value: 7.19e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1832 LMLASLRGgssdlsdededaedsSANIITDLVYQGASLQAQTDRTgemALHLAARYSRADAAKRLLDAGADANAQDNMGR 1911
Cdd:pfam12796    1 LHLAAKNG---------------NLELVKLLLEKGADVNLGDTDT---ALHLAARNGNLEIVKLLLENGADVNAKDKDGN 62
                           90       100
                   ....*....|....*....|....*....
gi 24041035   1912 CPLHAAVAADAQGVFQILIRNRVtDLDAR 1940
Cdd:pfam12796   63 TALHLAARNGNLEIVKLLLEHGA-DINLK 90
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1881-2007 3.10e-16

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 77.30  E-value: 3.10e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1881 LHLAARYSRADAAKRLLDAGADANAQDNmgrcplhaavaadaqgvfqilirnrvtdldarmndgTTPLILAARLAVEGMV 1960
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLEKGADVNLGDT------------------------------------DTALHLAARNGNLEIV 44
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 24041035   1961 AELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQD 2007
Cdd:pfam12796   45 KLLLENGADVNAKDKDGNTALHLAARNGNLEIVKLLLEHGADINLKD 91
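
In the alignment above, a run of lowercase query residues sits opposite a run of dashes in the model row. This appears to be how the page marks residues that are not aligned to the domain model. The following Python sketch tallies the column types for the first block of this hit under that reading. The function name classify_columns and the bucket names are illustrative only.

```python
def classify_columns(query, model):
    """Tally column types for two equal-length alignment rows.

    Assumed convention: uppercase in both rows = aligned pair,
    a letter opposite '-' = residue present in only one of the rows.
    """
    if len(query) != len(model):
        raise ValueError("rows must have the same number of columns")
    counts = {"aligned": 0, "query_only": 0, "model_only": 0, "other": 0}
    for q, m in zip(query, model):
        if q.isupper() and m.isupper():
            counts["aligned"] += 1
        elif q.isalpha() and m == "-":
            counts["query_only"] += 1
        elif q == "-" and m.isalpha():
            counts["model_only"] += 1
        else:
            counts["other"] += 1
    return counts


# First block of the Ank_2 hit at 1881-2007 (query residues 1881-1960).
query = (
    "LHLAARYSRADAAKRLLDAGADANAQDN"
    "mgrcplhaavaadaqgvfqilirnrvtdldarmndg"
    "TTPLILAARLAVEGMV"
)
model = "LHLAAKNGNLELVKLLLEKGADVNLGDT" + "-" * 36 + "DTALHLAARNGNLEIV"

print(classify_columns(query, model))
```

The aligned count for this block equals the model end position printed on the Cdd row (44), which is consistent with the dashes consuming no model positions.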
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
1879-2066 1.23e-15

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 223738 [Multi-domain]  Cd Length: 235  Bit Score: 78.71  E-value: 1.23e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1879 MALHLAARYSRADAAKRLLDAGADANAQDNM----GRCPLHAAVAADAQGVFQILIRNRVTD--LDARMNDGTTPLILAA 1952
Cdd:COG0666    2 KPSLSALLLINKCFLDLLLVALLLLLSLDLSnpsdKKLNLYLELALLPAASLSELLLKLIVDrhLAARDLDGRLPLHSAA 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1953 RLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNN-----VEATLLLLKNGANRD---MQDNKEETPLFLAAREGSYE 2024
Cdd:COG0666   82 SKGDDKIVKLLLASGADVNAKDADGDTPLHLAALNGNppegnIEVAKLLLEAGADLDvnnLRDEDGNTPLHWAALNGDAD 161
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|..
gi 24041035 2025 AAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRLLDEYN 2066
Cdd:COG0666  162 IVELLLEAGADPNSRNSYGVTALDPAAKNGRIELVKLLLDKG 203
PHA03095 PHA03095
ankyrin-like protein; Provisional
1794-2030 5.45e-15

ankyrin-like protein; Provisional


Pssm-ID: 222980 [Multi-domain]  Cd Length: 471  Bit Score: 78.91  E-value: 5.45e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1794 LEA-ADIRRTPSLALTPP----QAEQEVDVL--------DVNVRGPDGCTPLMlASLRGGSSDlsdededaedssANIIT 1860
Cdd:PHA03095   70 LEAgADVNAPERCGFTPLhlylYNATTLDVIkllikagaDVNAKDKVGRTPLH-VYLSGFNIN------------PKVIR 136
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1861 DLVYQGASLQAqTDRTGEMALHLAARYSRADAA--KRLLDAGADANAQDNMGRCPLH--AAVAADAQGVFQILIRnRVTD 1936
Cdd:PHA03095  137 LLLRKGADVNA-LDLYGMTPLAVLLKSRNANVEllRLLIDAGADVYAVDDRFRSLLHhhLQSFKPRARIVRELIR-AGCD 214
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1937 LDARMNDGTTPLILAARLAV--EGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPL 2014
Cdd:PHA03095  215 PAATDMLGNTPLHSMATGSSckRSLVLPLLIAGISINARNRYGQTPLHYAAVFNNPRACRRLIALGADINAVSSDGNTPL 294
                         250
                  ....*....|....*.
gi 24041035  2015 FLAAREGSYEAAKILL 2030
Cdd:PHA03095  295 SLMVRNNNGRAVRAAL 310
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
1869-2014 6.19e-14

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 223738 [Multi-domain]  Cd Length: 235  Bit Score: 73.32  E-value: 6.19e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1869 LQAQTDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQG-----VFQILIRN--RVTDLDARM 1941
Cdd:COG0666   65 HLAARDLDGRLPLHSAASKGDDKIVKLLLASGADVNAKDADGDTPLHLAALNGNPPegnieVAKLLLEAgaDLDVNNLRD 144
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1942 NDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLK---------------NGANRDMQ 2006
Cdd:COG0666  145 EDGNTPLHWAALNGDADIVELLLEAGADPNSRNSYGVTALDPAAKNGRIELVKLLLDkglhlsllkfnlegvANANVSKR 224

                 ....*...
gi 24041035 2007 DNKEETPL 2014
Cdd:COG0666  225 NILNLTSL 232
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
1874-2042 3.05e-13

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 223738 [Multi-domain]  Cd Length: 235  Bit Score: 71.01  E-value: 3.05e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1874 DRTGEMALHLAARYSRADAAKRLLDAGADaNAQDNMGRCPLHAAVAADAQGVFQILIRNRVtDLDARMNDGTTPLILAAR 1953
Cdd:COG0666   38 KLNLYLELALLPAASLSELLLKLIVDRHL-AARDLDGRLPLHSAASKGDDKIVKLLLASGA-DVNAKDADGDTPLHLAAL 115
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1954 -----LAVEGMVAELI---NCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEA 2025
Cdd:COG0666  116 ngnppEGNIEVAKLLLeagADLDVNNLRDEDGNTPLHWAALNGDADIVELLLEAGADPNSRNSYGVTALDPAAKNGRIEL 195
                        170
                 ....*....|....*..
gi 24041035 2026 AKILLDHFANRDITDHM 2042
Cdd:COG0666  196 VKLLLDKGLHLSLLKFN 212
PHA02875 PHA02875
ankyrin repeat protein; Provisional
1877-2037 5.07e-13

ankyrin repeat protein; Provisional


Pssm-ID: 165206 [Multi-domain]  Cd Length: 413  Bit Score: 72.33  E-value: 5.07e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1877 GEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVA-ADAQGVFQILIRNRVTDlDARMNDGTTPLILAARLA 1955
Cdd:PHA02875   35 GISPIKLAMKFRDSEAIKLLMKHGAIPDVKYPDIESELHDAVEeGDVKAVEELLDLGKFAD-DVFYKDGMTPLHLATILK 113
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1956 VEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDHFAN 2035
Cdd:PHA02875  114 KLDIMKLLIARGADPDIPNTDKFSPLHLAVMMGDIKGIELLIDHKACLDIEDCCGCTPLIIAMAKGDIAICKMLLDSGAN 193

                  ..
gi 24041035  2036 RD 2037
Cdd:PHA02875  194 ID 195
PHA02878 PHA02878
ankyrin repeat protein; Provisional
1841-2038 1.23e-12

ankyrin repeat protein; Provisional


Pssm-ID: 222939 [Multi-domain]  Cd Length: 477  Bit Score: 71.45  E-value: 1.23e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1841 SSDLSDEDEDAEDSS--ANIITDLVYQGASLQAQTDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAV 1918
Cdd:PHA02878  130 TIDLVYIDKKSKDDIieAEITKLLLSYGADINMKDRHKGNTALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAV 209
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1919 AADAQGVFQILIRNRvTDLDARMNDGTTPL-ILAARLAVEGMVAELINCQADVNAVDD-HGKSALHwaAAVNNVEATLLL 1996
Cdd:PHA02878  210 KHYNKPIVHILLENG-ASTDARDKCGNTPLhISVGYCKDYDILKLLLEHGVDVNAKSYiLGLTALH--SSIKSERKLKLL 286
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 24041035  1997 LKNGANRDMQDNKEETPLFLAAREGS-YEAAKILLDH-----FANRDI 2038
Cdd:PHA02878  287 LEYGADINSLNSYKLTPLSSAVKQYLcINIGRILISNicllkRIKPDI 334
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1981-2062 1.30e-12

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 66.90  E-value: 1.30e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1981 LHWAAAVNNVEATLLLLKNGAnrDMQDNKEETPLFLAAREGSYEAAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVR 2060
Cdd:pfam12796    1 LHLAAKNGNLELVKLLLEKGA--DVNLGDTDTALHLAARNGNLEIVKLLLENGADVNAKDKDGNTALHLAARNGNLEIVK 78

                   ..
gi 24041035   2061 LL 2062
Cdd:pfam12796   79 LL 80
PHA03095 PHA03095
ankyrin-like protein; Provisional
1889-2035 6.21e-12

ankyrin-like protein; Provisional


Pssm-ID: 222980 [Multi-domain]  Cd Length: 471  Bit Score: 69.28  E-value: 6.21e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1889 RADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQ---GVFQILIRNRVtDLDARMNDGTTPLILAARLA-VEGMVAELI 1964
Cdd:PHA03095   26 TVEEVRRLLAAGADVNFRGEYGKTPLHLYLHYSSEkvkDIVRLLLEAGA-DVNAPERCGFTPLHLYLYNAtTLDVIKLLI 104
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 24041035  1965 NCQADVNAVDDHGKSALHWAAAVNNVEATL--LLLKNGANRDMQDNKEETPL--FLAAREGSYEAAKILLDHFAN 2035
Cdd:PHA03095  105 KAGADVNAKDKVGRTPLHVYLSGFNINPKVirLLLRKGADVNALDLYGMTPLavLLKSRNANVELLRLLIDAGAD 179
PHA03100 PHA03100
ankyrin repeat protein; Provisional
1820-2008 1.56e-11

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 67.77  E-value: 1.56e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1820 DVNVRGPDGCTPLMLASLRggssdlsdededaEDSSANIITDLVYQGASLQAQTDRtGEMALHLAARYSRADA--AKRLL 1897
Cdd:PHA03100   98 NVNAPDNNGITPLLYAISK-------------KSNSYSIVEYLLDNGANVNIKNSD-GENLLHLYLESNKIDLkiLKLLI 163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1898 DAGADANAQDNMgrcplhaavaadaqgvfQILIRNRVtDLDARMNDGTTPLILAARLAVEGMVAELINCQADVNAVDDHG 1977
Cdd:PHA03100  164 DKGVDINAKNRV-----------------NYLLSYGV-PINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYG 225
                         170       180       190
                  ....*....|....*....|....*....|.
gi 24041035  1978 KSALHWAAAVNNVEATLLLLKNGANRDMQDN 2008
Cdd:PHA03100  226 DTPLHIAILNNNKEIFKLLLNNGPSIKTIIE 256
PHA03100 PHA03100
ankyrin repeat protein; Provisional
1879-2062 1.68e-11

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 67.77  E-value: 1.68e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1879 MALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLH-----AAVAADAQGVFQILIrNRVTDLDARMNDGTTPLILAAR 1953
Cdd:PHA03100   37 LPLYLAKEARNIDVVKILLDNGADINSSTKNNSTPLHylsniKYNLTDVKEIVKLLL-EYGANVNAPDNNGITPLLYAIS 115
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1954 LAVEG--MVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLL------------------LLKNGANRDMQDNKEETP 2013
Cdd:PHA03100  116 KKSNSysIVEYLLDNGANVNIKNSDGENLLHLYLESNKIDLKILkllidkgvdinaknrvnyLLSYGVPINIKDVYGFTP 195
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 24041035  2014 LFLAAREGSYEAAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRLL 2062
Cdd:PHA03100  196 LHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLL 244
PHA02876 PHA02876
ankyrin repeat protein; Provisional
1801-2058 1.12e-10

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 65.86  E-value: 1.12e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1801 RTPSLA-LTPPQAEQEVDVLDVNVRGPdgcTPLMLASLRGgssdlsdedEDAEDssaniITDLVYQGASLQAqTDRTGEM 1879
Cdd:PHA02876  282 QAPSLSrLVPKLLERGADVNAKNIKGE---TPLYLMAKNG---------YDTEN-----IRTLIMLGADVNA-ADRLYIT 343
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1880 ALHLAARYSR-ADAAKRLLDAGADANAQDNMGRCPLHAAVAADAqgvfqILIRNRVTDLDARMNDGTTPLILAARLAVEG 1958
Cdd:PHA02876  344 PLHQASTLDRnKDIVITLLELGANVNARDYCDKTPIHYAAVRNN-----VVIINTLLDYGADIEALSQKIGTALHFALCG 418
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1959 M-----VAELINCQADVNAVDDHGKSALHWAAAVN-NVEATLLLLKNGANRDMQDNKEETPLFLAAreGSYEAAKILLDH 2032
Cdd:PHA02876  419 TnpymsVKTLIDRGANVNSKNKDLSTPLHYACKKNcKLDVIEMLLDNGADVNAINIQNQYPLLIAL--EYHGIVNILLHY 496
                         250       260       270
                  ....*....|....*....|....*....|....*...
gi 24041035  2033 FA--------NRDITDHMDRLPRDVA----RDRMHHDI 2058
Cdd:PHA02876  497 GAelrdsrvlHKSLNDNMFSFRYIIAhiciQDFIRHDI 534
PHA02874 PHA02874
ankyrin repeat protein; Provisional
1874-2046 2.15e-10

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 64.21  E-value: 2.15e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1874 DRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIRNrVTDLDARMNDGTTPL---IL 1950
Cdd:PHA02874  154 DDNGCYPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDH-GNHIMNKCKNGFTPLhnaII 232
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1951 AARLAVEgmvaELINcQADVNAVDDHGKSALHWAAAVN-NVEATLLLLKNGANRDMQDNKEETPLFLAARegSYEAAKIL 2029
Cdd:PHA02874  233 HNRSAIE----LLIN-NASINDQDIDGSTPLHHAINPPcDIDIIDILLYHKADISIKDNKGENPIDTAFK--YINKDPVI 305
                         170
                  ....*....|....*..
gi 24041035  2030 LDHFANRDITDHMDRLP 2046
Cdd:PHA02874  306 KDIIANAVLIKEADKLK 322
DUF1421 pfam07223
Protein of unknown function (DUF1421); This family represents a conserved region approximately ...
2130-2422 2.36e-10

Protein of unknown function (DUF1421); This family represents a conserved region approximately 350 residues long within a number of plant proteins of unknown function.


Pssm-ID: 254110 [Multi-domain]  Cd Length: 357  Bit Score: 63.42  E-value: 2.36e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2130 RRKKSLSE-KVQLSESSVTLSPVDSLE--SPHTYVSDTTSSPMITSPGILQASPNPMLATAAPPAPvhAQHALSFSNLHE 2206
Cdd:pfam07223   12 RDKQEIAEtQKELSKLQLSHEEAQSSEahSFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAP--APQSPQPDQQQQ 89
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2207 MQ-PLAHGASTVLPsVSQLLSHHHIVSPgsgsagslsrlHPVPVPAdwmnrmevnetqynemfgmvlAPAEGTHPG---- 2281
Cdd:pfam07223   90 SQaPPSHQYPSQLP-PQQVQSVPQQPTP-----------QQEPYYP---------------------PPSQPQPPPaqqp 136
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2282 IAPQSRPPegkhittPREPLPPivTFQLIPKGSIAQPAGAPQPQSTCPPAVAGPLPTMYQIPEMArlPSVAFPTAMMPQq 2361
Cdd:pfam07223  137 QAQQPQPP-------PQVPQQQ--QYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQPQSYP--PNEPLPSSMAMQ- 204
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035   2362 dgqvaqtilPAYHPFPASVGKY-PTPPSQHSYasSNAAERTPSH--SGHL----QGEHPYLTPSPESP 2422
Cdd:pfam07223  205 ---------PPYSGAPPSQQFYgPPQPSPYMY--GGPGGRPNSGfpSGQQpppsQGQEGYGYSGPPPS 261
PHA02876 PHA02876
ankyrin repeat protein; Provisional
1878-2035 5.03e-10

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 63.54  E-value: 5.03e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1878 EMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIRNRVTDLDARMNDGTTPLILAARLAVE 1957
Cdd:PHA02876  241 DLSLLKAIRNEDLETSLLLYDAGFSVNSIDDCKNTPLHHASQAPSLSRLVPKLLERGADVNAKNIKGETPLYLMAKNGYD 320
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1958 GM-VAELINCQADVNAVDDHGKSALHWAAAVN-NVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDHFAN 2035
Cdd:PHA02876  321 TEnIRTLIMLGADVNAADRLYITPLHQASTLDrNKDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGAD 400
PHA03095 PHA03095
ankyrin-like protein; Provisional
1820-1998 1.50e-09

ankyrin-like protein; Provisional


Pssm-ID: 222980 [Multi-domain]  Cd Length: 471  Bit Score: 61.58  E-value: 1.50e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1820 DVNVRGPDGCTPLMlASLRggSSDLSDEdedaedssanIITDLVYQGASLQAQTDRtGEMALHLAARYSRADAA--KRLL 1897
Cdd:PHA03095  144 DVNALDLYGMTPLA-VLLK--SRNANVE----------LLRLLIDAGADVYAVDDR-FRSLLHHHLQSFKPRARivRELI 209
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1898 DAGADANAQDNMGRCPLHAAVAADAQG---VFQILIRNrvTDLDARMNDGTTPLILAARLAVEGMVAELINCQADVNAVD 1974
Cdd:PHA03095  210 RAGCDPAATDMLGNTPLHSMATGSSCKrslVLPLLIAG--ISINARNRYGQTPLHYAAVFNNPRACRRLIALGADINAVS 287
                         170       180
                  ....*....|....*....|....
gi 24041035  1975 DHGKSALHWAAAVNNVEATLLLLK 1998
Cdd:PHA03095  288 SDGNTPLSLMVRNNNGRAVRAALA 311
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
2144-2422 2.73e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 251763 [Multi-domain]  Cd Length: 979  Bit Score: 61.24  E-value: 2.73e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2144 SSVTLSPvDSLESPHTYVSDTTSS-----PMITSPGILQAS---PNPMLATAAPPAPVHAQHALSfsNLHEMQPLAHGAS 2215
Cdd:pfam03154  228 SAPSLHP-QRLPSPHPPLQPQTASqqspqPPAPSSRHPQSShhgPGPPMPHALQQGPVFLQHPSS--NPPQPFGLAQSQV 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2216 TVLPSVSQllSHHHIVSPGSGSA---GSLSRLHPVPvPADWMNRME--------VNETQYNEMFGMVLAPAE-GTHPGIA 2283
Cdd:pfam03154  305 PPLPLPSQ--AQPHSHTPPSQSAlqpQQPPREQPLP-PAPSMPHIKpppttpipQLPNQSHKHPPHLQGPSPfPQMPSNL 381
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2284 PQsrPPEGKHITT------PREPLPPIvtfQLIPKG-----SIAQPAGAPQPQSTCPPAVAGPLPTMYQIPEMARLPSVA 2352
Cdd:pfam03154  382 PP--PPALKPLSSlpthhpPSAHPPPL---QLMPQSqplqsVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHP 456
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2353 FPTAmmpqqdgqvaqtILPAYHPFPASVGKYPTPPSQHSYASSNAAERTPShSGHLQGEHPYLTP------------SPE 2420
Cdd:pfam03154  457 FTSG------------GLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPS-SGGCAGPGPPLPPiqikeepldeaeEPE 523

                   ..
gi 24041035   2421 SP 2422
Cdd:pfam03154  524 SP 525
PHA02874 PHA02874
ankyrin repeat protein; Provisional
1884-2035 6.12e-09

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 59.59  E-value: 6.12e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1884 AARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIRNRV----------------------TDLDARM 1941
Cdd:PHA02874   42 AIRSGDAKIVELFIKHGADINHINTKIPHPLLTAIKIGAHDIIKLLIDNGVdtsilpipciekdmiktildcgIDVNIKD 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1942 NDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREG 2021
Cdd:PHA02874  122 AELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYG 201
                         170
                  ....*....|....
gi 24041035  2022 SYEAAKILLDHFAN 2035
Cdd:PHA02874  202 DYACIKLLIDHGNH 215
PHA03247 PHA03247
large tegument protein UL36; Provisional
2068-2422 7.57e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.34  E-value: 7.57e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2068 TPSPPGTvltSALSPVICGPnrsflslkhTPMGKKSRRPSAKSTMPTSLPNLAKEAKDAKGSRRKKSLSEKVQLSESS-- 2145
Cdd:PHA03247 2615 SPLPPDT---HAPDPPPPSP---------SPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPqr 2682
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2146 -------VTLSPVDSLESPHTYVSDTTSSPMITSPGI---------LQASPNPMLATAAPPAP----VHAQHALSFSNLH 2205
Cdd:PHA03247 2683 prrraarPTVGSLTSLADPPPPPPTPEPAPHALVSATplppgpaaaRQASPALPAAPAPPAVPagpaTPGGPARPARPPT 2762
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2206 EMQPLAHGASTVLPSVSQLLSHHHIVSPGSGSAGSL-SRLHPVPVPADWMNRmevnetqyNEMFGMVLAPAEGTHPGIAP 2284
Cdd:PHA03247 2763 TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLpSPWDPADPPAAVLAP--------AAALPPAASPAGPLPPPTSA 2834
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2285 QSRPPegkhiTTPREPLPPIVTFQ--LIPKGSIAQ--PAGAPQPQSTCP----------PAVAGPLPTMYQIP-EMARLP 2349
Cdd:PHA03247 2835 QPTAP-----PPPPGPPPPSLPLGgsVAPGGDVRRrpPSRSPAAKPAAParppvrrlarPAVSRSTESFALPPdQPERPP 2909
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035  2350 SVAFPTAMMPQQDGQVAQTILPAYHPFPASvgKYPTPPSQHSYASSNAAERTPS-HSGHL-QGEHP---YLTPSPESP 2422
Cdd:PHA03247 2910 QPQAPPPPQPQPQPPPPPQPQPPPPPPPRP--QPPLAPTTDPAGAGEPSGAVPQpWLGALvPGRVAvprFRVPQPAPS 2985
PLN03192 PLN03192
Voltage-dependent potassium channel; Provisional
1873-2011 1.24e-08

Voltage-dependent potassium channel; Provisional


Pssm-ID: 215625 [Multi-domain]  Cd Length: 823  Bit Score: 59.11  E-value: 1.24e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1873 TDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIR-NRVTDLDArmndGTTPLILA 1951
Cdd:PLN03192  554 GDSKGRTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWNAISAKHHKIFRILYHfASISDPHA----AGDLLCTA 629
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1952 ARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEE 2011
Cdd:PLN03192  630 AKRNDLTAMKELLKQGLNVDSEDHQGATALQVAMAEDHVDMVRLLIMNGADVDKANTDDD 689
Ank_4 pfam13637
Ankyrin repeats (many copies);
1979-2030 2.42e-08

Ankyrin repeats (many copies);


Pssm-ID: 257947 [Multi-domain]  Cd Length: 54  Bit Score: 53.41  E-value: 2.42e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 24041035   1979 SALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILL 2030
Cdd:pfam13637    3 TALHKAAISGRLELVKYLLEKGVDINRTDSNGNTALHIAALNGNVEVLKLLL 54
PHA02874 PHA02874
ankyrin repeat protein; Provisional
1874-2059 2.60e-08

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 57.28  E-value: 2.60e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1874 DRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIRNRVTdLDARMNDGTTPLILAAR 1953
Cdd:PHA02874  121 DAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAY-ANVKDNNGESPLHNAAE 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1954 LAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNnvEATLLLLKNGANRDMQDNKEETPLFLAAR-EGSYEAAKILLDH 2032
Cdd:PHA02874  200 YGDYACIKLLIDHGNHIMNKCKNGFTPLHNAIIHN--RSAIELLINNASINDQDIDGSTPLHHAINpPCDIDIIDILLYH 277
                         170       180
                  ....*....|....*....|....*..
gi 24041035  2033 FANRDITDHMDRLPRDVARDRMHHDIV 2059
Cdd:PHA02874  278 KADISIKDNKGENPIDTAFKYINKDPV 304
ANKYR COG0666
Ankyrin repeat [Signal transduction mechanisms];
1819-1931 2.76e-08

Ankyrin repeat [Signal transduction mechanisms];


Pssm-ID: 223738 [Multi-domain]  Cd Length: 235  Bit Score: 55.60  E-value: 2.76e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035 1819 LDVNVRGPDGCTPLMLASLRG----GSSDLSDEDEDAEDSSANIItdlvyqgaslqaQTDRTGEMALHLAARYSRADAAK 1894
Cdd:COG0666   97 ADVNAKDADGDTPLHLAALNGnppeGNIEVAKLLLEAGADLDVNN------------LRDEDGNTPLHWAALNGDADIVE 164
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 24041035 1895 RLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIR 1931
Cdd:COG0666  165 LLLEAGADPNSRNSYGVTALDPAAKNGRIELVKLLLD 201
Ank_2 pfam12796
Ankyrin repeats (3 copies);
1821-1907 3.87e-08

Ankyrin repeats (3 copies);


Pssm-ID: 257303 [Multi-domain]  Cd Length: 91  Bit Score: 53.41  E-value: 3.87e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   1821 VNVRGPDGCTPLMLASLRGgssdlsdededaedsSANIITDLVYQGASLQAQtDRTGEMALHLAARYSRADAAKRLLDAG 1900
Cdd:pfam12796   21 ADVNLGDTDTALHLAARNG---------------NLEIVKLLLENGADVNAK-DKDGNTALHLAARNGNLEIVKLLLEHG 84

                   ....*..
gi 24041035   1901 ADANAQD 1907
Cdd:pfam12796   85 ADINLKD 91
Atrophin-1 pfam03154
2140-2442 4.87e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. The expansion produces an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats thought to be pathogenic is with the short glutamine repeat in the transcriptional coactivator CREB binding protein (CBP). This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders, which interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 251763 [Multi-domain]  Cd Length: 979  Bit Score: 57.39  E-value: 4.87e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2140 QLSESSVTLSPVDSLESPHTYVSDTTSSPM-ITSPgilQASPNPMLATAAPPAPVHAQHALSFSNLHemQPLAHGASTVL 2218
Cdd:pfam03154  206 QGSPIAAQPAPQPQQPSPLSLISAPSLHPQrLPSP---HPPLQPQTASQQSPQPPAPSSRHPQSSHH--GPGPPMPHALQ 280
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2219 PSVSQLlsHHHIVSPGSGSAGSLSRLHPVPVPADWMNRMEVNETQynemfgmvlaPAEGthPGIAPQSRP----PEGKHI 2294
Cdd:pfam03154  281 QGPVFL--QHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQ----------SALQ--PQQPPREQPlppaPSMPHI 346
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2295 TTPrePLPPIvtfQLIPKGSIAQPA--GAPQP----QSTCPPAVA----GPLPTMYqiPEMARLPsvafPTAMMPQqdGQ 2364
Cdd:pfam03154  347 KPP--PTTPI---PQLPNQSHKHPPhlQGPSPfpqmPSNLPPPPAlkplSSLPTHH--PPSAHPP----PLQLMPQ--SQ 413
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2365 VAQTIlPAYHPFPASVGKYPTPPSQHSYASSNAAERTPSHSGH--LQGEHPYLTPSPESPDQWSS----SSPHSASDWSD 2438
Cdd:pfam03154  414 PLQSV-PAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHpfTSGGLPAIGPPPSLPTSTPAapprASSGSQPPGSA 492

                   ....
gi 24041035   2439 VTTS 2442
Cdd:pfam03154  493 LPSS 496
PHA03247 PHA03247
large tegument protein UL36; Provisional
2068-2407 6.49e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 57.26  E-value: 6.49e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2068 TPSPPGTVLTSALsPVICGPNRSFLSLKHTPMGKKSR------------RPSAKSTMPTSLPNLAKEAKDAKGSRRKKSL 2135
Cdd:PHA03247 2707 TPEPAPHALVSAT-PLPPGPAAARQASPALPAAPAPPavpagpatpggpARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2136 SEKVQLSESSVTL-SPVDSLESPHTYVSDTTSSPMITSPGILQASPNPMLATAAPPAPVHAQhalsfsnlhemQPLAHGA 2214
Cdd:PHA03247 2786 PAVASLSESRESLpSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP-----------PSLPLGG 2854
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2215 StvlpsvsqllshhhiVSPGsgsaGSLSRLHPVPVPADwmnrmevnetqynemfgmvlAPAEGTHPGIAPQSRPPEGKhi 2294
Cdd:PHA03247 2855 S---------------VAPG----GDVRRRPPSRSPAA--------------------KPAAPARPPVRRLARPAVSR-- 2893
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2295 TTPREPLPPivtfqlIPKGSIAQPAGAPQPQSTcPPAVAGPLPTMyQIPEMARLPSVAFP---TAMMPQQDGQVAQTILP 2371
Cdd:PHA03247 2894 STESFALPP------DQPERPPQPQAPPPPQPQ-PQPPPPPQPQP-PPPPPPRPQPPLAPttdPAGAGEPSGAVPQPWLG 2965
                         330       340       350
                  ....*....|....*....|....*....|....*.
gi 24041035  2372 AYHPFPASVGKYPTPPSQHSYASSnaAERTPSHSGH 2407
Cdd:PHA03247 2966 ALVPGRVAVPRFRVPQPAPSREAP--ASSTPPLTGH 2999
PHA03100 PHA03100
ankyrin repeat protein; Provisional
1857-2044 7.56e-08

ankyrin repeat protein; Provisional


Pssm-ID: 222984 [Multi-domain]  Cd Length: 422  Bit Score: 55.83  E-value: 7.56e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1857 NIITDLVYQGASLQAQTDRtGEMALHLAARYSRADAA--KRLLDAGADANAQDNMGRCPLHAAVAA--DAQGVFQILIRN 1932
Cdd:PHA03100   87 EIVKLLLEYGANVNAPDNN-GITPLLYAISKKSNSYSivEYLLDNGANVNIKNSDGENLLHLYLESnkIDLKILKLLIDK 165
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1933 RVtDLDARMNdgttplilaarlavegmVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEET 2012
Cdd:PHA03100  166 GV-DINAKNR-----------------VNYLLSYGVPINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDT 227
                         170       180       190
                  ....*....|....*....|....*....|..
gi 24041035  2013 PLFLAAREGSYEAAKILLDHFANrdiTDHMDR 2044
Cdd:PHA03100  228 PLHIAILNNNKEIFKLLLNNGPS---IKTIIE 256
PHA03379 PHA03379
EBNA-3A; Provisional
2100-2395 2.47e-07

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 55.06  E-value: 2.47e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2100 GKKSRRPsakstmPTSLPNLAKE--AKDAKGSRRKKSLSEKVQLSESSVTLSPVDsLESPHTYVSDTTSspmiTSPGILQ 2177
Cdd:PHA03379  373 GTKRKRP------PIFLRRLHRLllMRAGKLTERAREALEKASEPTYGTPRPPVE-KPRPEVPQSLETA----TSHGSAQ 441
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2178 aSPNPMLATAAPPAPVHAQHALsfsnlhEMQPLAHGASTVLPSvsqllshhhiVSPGSGSAGSLS--RLHPVPVPADWMN 2255
Cdd:PHA03379  442 -VPEPPPVHDLEPGPLHDQHSM------APCPVAQLPPGPLQD----------LEPGDQLPGVVQdgRPACAPVPAPAGP 504
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2256 RMEVNETQYNEMFGMVLAPAEGTHPGIAPQSRPPEGkhITTPREPLPPIVTFQlipkgSIAQPAGAPQPQSTCPPAVAGP 2335
Cdd:PHA03379  505 IVRPWEASLSQVPGVAFAPVMPQPMPVEPVPVPTVA--LERPVCPAPPLIAMQ-----GPGETSGIVRVRERWRPAPWTP 577
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 24041035  2336 LP--TMYQIP---EMARLPSVAFP-TAMMPQQDGQVAQtiLPAYHPFpasvgKYPTPPSQHSYASS 2395
Cdd:PHA03379  578 NPprSPSQMSvrdRLARLRAEAQPyQASVEVQPPQLTQ--VSPQQPM-----EYPLEPEQQMFPGS 636
Ank_5 pfam13857
Ankyrin repeats (many copies);
1963-2017 2.92e-07

Ankyrin repeats (many copies);


Pssm-ID: 258127 [Multi-domain]  Cd Length: 56  Bit Score: 50.44  E-value: 2.92e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 24041035   1963 LINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLA 2017
Cdd:pfam13857    2 LEHGPIDLNATDGEGNTPLHLAAKYGALELVRLLLKPGVDLNLRDSDGLTALDLA 56
PHA02876 PHA02876
ankyrin repeat protein; Provisional
1881-2035 3.39e-07

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 54.30  E-value: 3.39e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1881 LHLAARY-SRADAAKRLLDAGADANAQDNMGRCPLH--AAVAADAQGVFQILIRNrvTDLDARMNDGTTPLILAARL-AV 1956
Cdd:PHA02876  277 LHHASQApSLSRLVPKLLERGADVNAKNIKGETPLYlmAKNGYDTENIRTLIMLG--ADVNAADRLYITPLHQASTLdRN 354
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1957 EGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPL-FLAAREGSYEAAKILLDHFAN 2035
Cdd:PHA02876  355 KDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEALSQKIGTALhFALCGTNPYMSVKTLIDRGAN 434
PHA02876 PHA02876
ankyrin repeat protein; Provisional
1862-2063 4.16e-07

ankyrin repeat protein; Provisional


Pssm-ID: 165207 [Multi-domain]  Cd Length: 682  Bit Score: 53.91  E-value: 4.16e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1862 LVYQGASLQAQtDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILIRNRvtdldARM 1941
Cdd:PHA02876  164 LLEGGADVNAK-DIYCITPIHYAAERGNAKMVNLLLSYGADVNIIALDDLSVLECAVDSKNIDTIKAIIDNR-----SNI 237
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1942 NDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATL-LLLKNGANRDMQDNKEETPLFLAARE 2020
Cdd:PHA02876  238 NKNDLSLLKAIRNEDLETSLLLYDAGFSVNSIDDCKNTPLHHASQAPSLSRLVpKLLERGADVNAKNIKGETPLYLMAKN 317
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 24041035  2021 G-SYEAAKILLDHFANRDITDHMDRLPRDVAR--DRMHHDIVRLLD 2063
Cdd:PHA02876  318 GyDTENIRTLIMLGADVNAADRLYITPLHQAStlDRNKDIVITLLE 363
PHA02878 PHA02878
ankyrin repeat protein; Provisional
1968-2096 7.32e-07

ankyrin repeat protein; Provisional


Pssm-ID: 222939 [Multi-domain]  Cd Length: 477  Bit Score: 52.96  E-value: 7.32e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1968 ADVNAVDDH-GKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDHFANRDITDHMDRLP 2046
Cdd:PHA02878  158 ADINMKDRHkGNTALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGNTP 237
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 24041035  2047 RDVARDR-MHHDIVRLLDEYNVTPSPPGTVLT-SALSPVICGPNRSFLSLKH 2096
Cdd:PHA02878  238 LHISVGYcKDYDILKLLLEHGVDVNAKSYILGlTALHSSIKSERKLKLLLEY 289
Ank_5 pfam13857
Ankyrin repeats (many copies);
1896-1951 9.02e-07

Ankyrin repeats (many copies);


Pssm-ID: 258127 [Multi-domain]  Cd Length: 56  Bit Score: 48.89  E-value: 9.02e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 24041035   1896 LLDAG-ADANAQDNMGRCPLHAAVAADAQGVFQILIRNRVtDLDARMNDGTTPLILA 1951
Cdd:pfam13857    1 LLEHGpIDLNATDGEGNTPLHLAAKYGALELVRLLLKPGV-DLNLRDSDGLTALDLA 56
PHA02874 PHA02874
ankyrin repeat protein; Provisional
1927-2064 1.37e-06

ankyrin repeat protein; Provisional


Pssm-ID: 165205 [Multi-domain]  Cd Length: 434  Bit Score: 51.89  E-value: 1.37e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1927 QILIRNRVTDLDARMNDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGA----- 2001
Cdd:PHA02874   18 EKIIKNKGNCINISVDETTTPLIDAIRSGDAKIVELFIKHGADINHINTKIPHPLLTAIKIGAHDIIKLLIDNGVdtsil 97
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2002 -----NRDM-------------QDNKEETPLFLAAREGSYEAAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRLLD 2063
Cdd:PHA02874   98 pipciEKDMiktildcgidvniKDAELKTFLHYAIKKGDLESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLL 177

                  .
gi 24041035  2064 E 2064
Cdd:PHA02874  178 E 178
Ank_5 pfam13857
Ankyrin repeats (many copies);
1862-1917 1.77e-06

Ankyrin repeats (many copies);


Pssm-ID: 258127 [Multi-domain]  Cd Length: 56  Bit Score: 48.12  E-value: 1.77e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 24041035   1862 LVYQGASLQAQTDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAA 1917
Cdd:pfam13857    1 LLEHGPIDLNATDGEGNTPLHLAAKYGALELVRLLLKPGVDLNLRDSDGLTALDLA 56
Ank_4 pfam13637
Ankyrin repeats (many copies);
1880-1930 2.75e-06

Ankyrin repeats (many copies);


Pssm-ID: 257947 [Multi-domain]  Cd Length: 54  Bit Score: 47.25  E-value: 2.75e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 24041035   1880 ALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQILI 1930
Cdd:pfam13637    4 ALHKAAISGRLELVKYLLEKGVDINRTDSNGNTALHIAALNGNVEVLKLLL 54
PHA02875 PHA02875
ankyrin repeat protein; Provisional
1822-2014 5.12e-06

ankyrin repeat protein; Provisional


Pssm-ID: 165206 [Multi-domain]  Cd Length: 413  Bit Score: 49.99  E-value: 5.12e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1822 NVRGPDGCTPLMLASLRGGSSDLsdedEDAEDSSaNIITDLVYqgaslqaqtdRTGEMALHLAARYSRADAAKRLLDAGA 1901
Cdd:PHA02875   62 DVKYPDIESELHDAVEEGDVKAV----EELLDLG-KFADDVFY----------KDGMTPLHLATILKKLDIMKLLIARGA 126
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1902 DANAQDNMGRCPLHAAVAADAQGVFQILIRNRVTdLDARMNDGTTPLILAARLAVEGMVAELINCQADVNAVDDHGKSAL 1981
Cdd:PHA02875  127 DPDIPNTDKFSPLHLAVMMGDIKGIELLIDHKAC-LDIEDCCGCTPLIIAMAKGDIAICKMLLDSGANIDYFGKNGCVAA 205
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 24041035  1982 HWAAAVNN-VEATLLLLKNGANRD---MQDNKEETPL 2014
Cdd:PHA02875  206 LCYAIENNkIDIVRLFIKRGADCNimfMIEGEECTIL 242
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
1963-2032 1.39e-05

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 49.13  E-value: 1.39e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1963 LINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDH 2032
Cdd:PTZ00322  101 LLTGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREVVQLLSRH 170
PAT1 pfam09770
2269-2394 1.57e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 255543 [Multi-domain]  Cd Length: 806  Bit Score: 49.00  E-value: 1.57e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2269 GMVLAPAEGTHPGIAPQSRPPEGKH-ITTPREPLPPIVTFQLIPKGSIAQPAGAPQPQStcPPAVAGPLptmyQIPEMAR 2347
Cdd:pfam09770  177 QQVLPQGMPPRQAAFPQQGPPEQPPgYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQ--LPQQPPPL----QQPQFPG 250
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 24041035   2348 LPSVAFPTAMMPQQdgQVAQTILPAYHPFPASVGKYPTPPSQHSYAS 2394
Cdd:pfam09770  251 LSQQMPPPPPQPPQ--QQQQPPQPQAQPPPQNQPTPHPGLPQGQNAP 295
PHA02946 PHA02946
ankyrin-like protein; Provisional
1859-2016 1.76e-05

ankyrin-like protein; Provisional


Pssm-ID: 165256 [Multi-domain]  Cd Length: 446  Bit Score: 48.51  E-value: 1.76e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1859 ITDLVYQGASlQAQTDRTGEMALHLAARYSRADAAKRLLDAGADANAQDNMGRCPLHAAVAADAQGVFQIlirNRVTDLD 1938
Cdd:PHA02946   55 VEELLHRGYS-PNETDDDGNYPLHIASKINNNRIVAMLLTHGADPNACDKQHKTPLYYLSGTDDEVIERI---NLLVQYG 130
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1939 ARMN-----DGTTPLiLAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLL--LLKNGANRDMQDNKEE 2011
Cdd:PHA02946  131 AKINnsvdeEGCGPL-LACTDPSERVFKKIMSIGFEARIVDKFGKNHIHRHLMSDNPKASTIswMMKLGISPSKPDHDGN 209

                  ....*
gi 24041035  2012 TPLFL 2016
Cdd:PHA02946  210 TPLHI 214
PHA03247 PHA03247
large tegument protein UL36; Provisional
2098-2403 2.46e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 2.46e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2098 PMGKKSRRPSAKSTMPTSLPnLAKEAKDAKGSRRKKSLSEkvqlSESSVTLSPVDSLESPhtyvsdttsspmitsPGILQ 2177
Cdd:PHA03247 2554 PLPPAAPPAAPDRSVPPPRP-APRPSEPAVTSRARRPDAP----PQSARPRAPVDDRGDP---------------RGPAP 2613
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2178 ASPNPMLATAA-PPAPVHAQHALSFSNLHEMQPLAHGASTVLPSVSQLLSHHHIVSPGS--------------------G 2236
Cdd:PHA03247 2614 PSPLPPDTHAPdPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRaaqassppqrprrraarptvG 2693
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2237 SAGSLSRLHPVPVPADWMNRMEVNETQynemfgmvLAPAegthPGIAPQSRPPEGKHITTPREPLPPIVTFQLIPKGSIA 2316
Cdd:PHA03247 2694 SLTSLADPPPPPPTPEPAPHALVSATP--------LPPG----PAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2317 QPAGAPQPqsTCPPAVAGPLPTMYQIPEMARLpSVAFPTAMMPQQDGQVAQTILPAYHPFPAS---VGKYPTPPSQHSYA 2393
Cdd:PHA03247 2762 TTAGPPAP--APPAAPAAGPPRRLTRPAVASL-SESRESLPSPWDPADPPAAVLAPAAALPPAaspAGPLPPPTSAQPTA 2838
                         330
                  ....*....|
gi 24041035  2394 SSNAAERTPS 2403
Cdd:PHA03247 2839 PPPPPGPPPP 2848
PLN03192 PLN03192
Voltage-dependent potassium channel; Provisional
1948-2062 3.37e-05

Voltage-dependent potassium channel; Provisional


Pssm-ID: 215625 [Multi-domain]  Cd Length: 823  Bit Score: 47.94  E-value: 3.37e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1948 LILAARLAVEGMVAELINCQADVNAVDDHGKSALHWAAAVNNVEATLLLLKNGANRDMQDNKEETPLF------------ 2015
Cdd:PLN03192  529 LLTVASTGNAALLEELLKAKLDPDIGDSKGRTPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWnaisakhhkifr 608
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 24041035  2016 -------------------LAAREGSYEAAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRLL 2062
Cdd:PLN03192  609 ilyhfasisdphaagdllcTAAKRNDLTAMKELLKQGLNVDSEDHQGATALQVAMAEDHVDMVRLL 674
Atrophin-1 pfam03154
2095-2422 3.91e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. The expansion produces an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly by altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats thought to be pathogenic is with the short glutamine repeat in the transcriptional coactivator CREB binding protein (CBP). This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders, which interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 251763 [Multi-domain]  Cd Length: 979  Bit Score: 47.76  E-value: 3.91e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2095 KHTPMGKKSRRP-----SAKSTMPTSLPNLAKEAKDAKGSRRKKSLSEkvqlSESSVTLSPVDSLESPHTYVSDTTSSPM 2169
Cdd:pfam03154    2 KHSMRTRRSRGSmstlrSGRKKQTASPDGRASPTNEDQRSSGRNSPSA----ASTSSNDSKAESTKKPNKKIKEEATSPL 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2170 ITSPGILQA----SPNPMLATAAPPAPVHAQHALSFSNlHEMQPLAHGASTVLPSVSQLLSHhhivSPGSGSAGSLSRLH 2245
Cdd:pfam03154   78 KSTKRQREKpasdTEEPERVTAKKSKTQELSRPNSPSE-GEGEGEGEGESSDSRSVNEEGSS----DPKDIDQDNRSSSP 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2246 PVPVPADwmNRMEVNETQYNEMFGMVLAPAEGTHPGIAPQSRPPegkhittpreplPPIVTFQLIPkgsiaqPAGAPQPQ 2325
Cdd:pfam03154  153 SIPSPQD--NESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAP------------PPTPSAQAVP------PQGSPIAA 212
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2326 STCPPAVAGPLPTMYQIPEM--ARLPSVAFPTAMMPQQDGQVAQTILPAYHPFPASVGkyPTPPSQHSYASSNAAERTPs 2403
Cdd:pfam03154  213 QPAPQPQQPSPLSLISAPSLhpQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHG--PGPPMPHALQQGPVFLQHP- 289
                          330
                   ....*....|....*....
gi 24041035   2404 HSGHLQGEHPYLTPSPESP 2422
Cdd:pfam03154  290 SSNPPQPFGLAQSQVPPLP 308
PHA03378 PHA03378
EBNA-3B; Provisional
2272-2424 9.63e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.60  E-value: 9.63e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2272 LAPAEGTHPG-IAPQSRPPEGkhiTTPRE---PLPPIVTFQLIPKGSIAQPAGAPQPQSTCPPAVAGPLPTMYQIPEMAR 2347
Cdd:PHA03378  598 PVPHPSQTPEpPTTQSHIPET---SAPRQwpmPLRPIPMRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPY 674
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 24041035  2348 LPSVAFPTAMMPQQDG-QVAQTilPAYHPFPASVGKYPTPPSQHSYASSNAAERTPSHSGHLQgeHPYLTPSPESPDQ 2424
Cdd:PHA03378  675 QPSPTGANTMLPIQWApGTMQP--PPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRAR--PPAAAPGRARPPA 748
Herpes_BLLF1 pfam05109
2077-2390 9.88e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralizing antibodies in vivo.


Pssm-ID: 253014 [Multi-domain]  Cd Length: 830  Bit Score: 46.31  E-value: 9.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2077 TSALSPVICGPNRSFLSLKH-TPMGKKSRRPSaKSTMPTSLPNLAKEAKDAKGSRRKKSLSEKVqlseSSVTLSPVDSLE 2155
Cdd:pfam05109  452 TPSLPPASTGPTVSTADPTSgTPTGTTSSTLP-EDTSPTSRTTSATPNATSPTPAVTTPNATSP----TTQKTSDTPNAT 526
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2156 SPHTYVSDTTSSpmitspgilqaSPNPMLATAAPPAPVHAQHALSFSNLHEMQPLAHGASTVLPSVSQLLSHHHIVSPGS 2235
Cdd:pfam05109  527 SPTPIVIGVTTT-----------ATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTS 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2236 GSAGSLSRLHPVP--------------VPADWMNRMEVNETQYNEMFGMVLAPAEG-------THPGIAPQSRPPEGKHI 2294
Cdd:pfam05109  596 QQPGIPSSSHSTPrsnststtplltsaHPTGGENITEETPSVPSTTHVSTLSPGPGpgttsqvSGPGNSSTSRYPGEVHV 675
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2295 TT----PREPLPPIVTFQLIPKGSIAQPAGAPQPQSTCPPAVAGPLPTMYQIPEMARLPSVAFPTAMMPQQDGQVAQTIL 2370
Cdd:pfam05109  676 TEgmpnPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRW 755
                          330       340
                   ....*....|....*....|.
gi 24041035   2371 PAYHPFPASVG-KYPTPPSQH 2390
Cdd:pfam05109  756 TFTSPPVTTKQaTVPVPPTQH 776
PRK10263 PRK10263
DNA translocase FtsK; Provisional
2296-2422 1.05e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.62  E-value: 1.05e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2296 TPREPLP--------PIVTFQLIPKGSIAQPAGAPQPQSTCP-PAVAGPLPTM---YQIPEMARLPSVAFPTAMMPQQDG 2363
Cdd:PRK10263  341 TQTPPVAsvdvppaqPTVAWQPVPGPQTGEPVIAPAPEGYPQqSQYAQPAVQYnepLQQPVQPQQPYYAPAAEQPAQQPY 420
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 24041035  2364 QVAQTILPAYHPFPASVgkyPTPPSQHSYASSNAAERTPSHSGHLQGEHPYLTPSPESP 2422
Cdd:PRK10263  421 YAPAPEQPAQQPYYAPA---PEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEP 476
PRK12323 PRK12323
DNA polymerase III subunits gamma and tau; Provisional
2273-2444 1.29e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.02  E-value: 1.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2273 APAEGThPGIAPQSRPPEGKHITTPREPLPPIVTFQLIPKGSIAQPAGAPQPQST--------CPPAVAGPLPTMYQIPE 2344
Cdd:PRK12323  400 AAPPAA-PAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPApaaapaaaARPAAAGPRPVAAAAAA 478
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  2345 MARLPSVAFPTAmmPQQDGQVAQTILPAYHPFPASVGKYPTPPSQHSYASSNAAERTPSHSGHLQGEHPYLTPSPESPDQ 2424
Cdd:PRK12323  479 APARAAPAAAPA--PADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAA 556
                         170       180
                  ....*....|....*....|
gi 24041035  2425 WSSSSPHSASDWSDVTTSPT 2444
Cdd:PRK12323  557 TEPVVAPRPPRASASGLPDM 576
DUF1421 pfam07223
2204-2446 1.41e-04

Protein of unknown function (DUF1421); This family represents a conserved region approximately 350 residues long within a number of plant proteins of unknown function.


Pssm-ID: 254110 [Multi-domain]  Cd Length: 357  Bit Score: 45.32  E-value: 1.41e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2204 LHEMQPLAHGASTVLPSVSQLLSHHHIVSPGSGSAGSLSRLHPVPVPADWMNRMEVNETQYNEMFgMVLAPAEGTHPG-I 2282
Cdd:pfam07223   24 LSKLQLSHEEAQSSEAHSFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQ-QSQAPPSHQYPSqL 102
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2283 APQSRPPEGKHITTPREPL-PPivtfqliPKGSIAQPAGAPQPQSTcPPAVAGPLPTMYQIPemarlpsvafptammPQQ 2361
Cdd:pfam07223  103 PPQQVQSVPQQPTPQQEPYyPP-------PSQPQPPPAQQPQAQQP-QPPPQVPQQQQYQSP---------------PQQ 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035   2362 DgQVAQTILPAYHPFPASVGKYPTPPS--QHSYASSnaaERTPSHsghLQGEHPYltpSPESPDQWSSSSPhsasdwsdv 2439
Cdd:pfam07223  160 P-QYQQNPPPQAQSAPQVSGLYPEESPyqPQSYPPN---EPLPSS---MAMQPPY---SGAPPSQQFYGPP--------- 220

                   ....*..
gi 24041035   2440 ttSPTPG 2446
Cdd:pfam07223  221 --QPSPY 225
PTZ00322 PTZ00322
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional
1982-2070 2.39e-04

6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional


Pssm-ID: 140343 [Multi-domain]  Cd Length: 664  Bit Score: 44.89  E-value: 2.39e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 24041035  1982 HWAAAVNNVEATLLLlKNGANRDMQDNKEETPLFLAAREGSYEAAKILLDHFANRDITDHMDRLPRDVARDRMHHDIVRL 2061
Cdd:PTZ00322   88 QLAASGDAVGARILL-TGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTLLDKDGKTPLELAEENGFREV