NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|145553997|ref|NP_001078824|]
View 

AT-rich interactive domain-containing protein 1B [Mus musculus]

Protein Classification

ARID_ARID1B and BAF250_C domain-containing protein( domain architecture ID 13013841)

protein containing domains PHA03247, ARID_ARID1B, Med15, and BAF250_C

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
BAF250_C pfam12031
SWI/SNF-like complex subunit BAF250/Osa; This entry represents the mammalian BAF250a/b and its ...
1935-2190 5.35e-163

SWI/SNF-like complex subunit BAF250/Osa; This entry represents the mammalian BAF250a/b and its homolog osa from fruit flies. They are part of the SWI/SNF-like ATP-dependent chromatin remodelling complex that regulates gene expression through regulating nucleosome remodelling. In humans there are two BAF250 isoforms, BAF250a/ARID1a and BAF250b/ARID1b. BAF250a/b may be E3 ubiquitin ligases that target histone H2B.


:

Pssm-ID: 463439  Cd Length: 257  Bit Score: 501.09  E-value: 5.35e-163
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1935 SLAKRCICVSNIVRSLSFVPGNDAEMSKHPGLVLILGKLILLHHEHPERKRAPQTYEKEEDEDKGVACSKDEWWWDCLEV 2014
Cdd:pfam12031    1 SLARRCLCVSNILRSLSFVPGNDLEMAKHPGLLLILGRLLLLHHEHPERSSAPRTYDKEEDEDFGLSCSRDEWWWDCLEV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  2015 LRDNTLVTLANISGQLDLSAYTESICLPILDGLLHWMVCPSAEAQDPFPTVGPNSVLSPQRLVLETLCKLSIQDNNVDLI 2094
Cdd:pfam12031   81 LRENALVTLANISGQLDLSPYPESICLPILDGLLHWAVCPSAEAQDPFPSAGPNSPLSPQRLVLEALCKLSVIDNNVDLI 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  2095 LATPPFSRQEKFYATLVRYVGDRKNPVCREMSMALLSNLAQGDTLAARAIAVQKGSIGNLISFLEDGVTMAQYQQSQHNL 2174
Cdd:pfam12031  161 LATPPFSRLEKLFHILVRLLGRREEQVCREFAVVLLSYLAQADSAAARAIAMQKPCISNLISFIEDAEQNAQYVQSQHGL 240
                          250
                   ....*....|....*..
gi 145553997  2175 MHMQ-PPPLEPPSVDMM 2190
Cdd:pfam12031  241 NALRdNPELMGTSVDML 257
ARID_ARID1B cd16877
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 1B (ARID1B) ...
1062-1154 2.90e-64

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 1B (ARID1B) and similar proteins; ARID1B, also called BRG1-associated factor 250b (BAF250B), BRG1-binding protein ELD/OSA1, Osa homolog 2 (Osa2), or p250R, is the largest subunit of ATP-dependent SWItch/sucrose nonfermentable (SWI/SNF) chromatin remodeling complex, which plays a critical role in transcriptional control and gene expression. ARID1B exhibits tumour-suppressor activities in pancreatic cancer cell lines. Mutations in the ARID1B gene cause Coffin-Siris syndrome, exhibiting developmental defects, and haplo-insufficiency of ARID1B is a frequent cause of intellectual disability. Moreover, mutations in the ARID1B gene have been found in many cancers. ARID1B contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), which binds DNA in a non-sequence-specific manner similar to ARID1A.


:

Pssm-ID: 350641  Cd Length: 93  Bit Score: 212.93  E-value: 2.90e-64
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1062 PERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 1141
Cdd:cd16877     1 PERKLWVDRYLTFMEERGTPVASLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 80
                          90
                  ....*....|...
gi 145553997 1142 QYLFAFECKTERG 1154
Cdd:cd16877    81 QYLFAFECKIERG 93
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1175-1555 8.28e-08

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 57.71  E-value: 8.28e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1175 PPSPANSGSLQGPQTPQSTGSNSMAEvpGDLKPPTPASTPHGQMTPMQSGRSSTVSVHDPFSDVSDSAYPKRNSMTPNAP 1254
Cdd:pfam09606  109 PMGQQMGGPGTASNLLASLGRPQMPM--GGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAG 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1255 YQQGMGMPDMMGRMPYEPnkdpFSGMRKVPGSSEPFMTQGQVPNSGMQDM-----YNQSPSGAMSNLGMGQRQQFPYGTS 1329
Cdd:pfam09606  187 GMNGGQQGPMGGQMPPQM----GVPGMPGPADAGAQMGQQAQANGGMNPQqmggaPNQVAMQQQQPQQQGQQSQLGMGIN 262
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1330 YDRRHEAYGQQYPGQGPPTGQPPYGGHQPG-----LYPQQPNYKRHMDGMygPPAKRHEGdmyNMQYGSQQQEMYNQYGG 1404
Cdd:pfam09606  263 QMQQMPQGVGGGAGQGGPGQPMGPPGQQPGampnvMSIGDQNNYQQQQTR--QQQQQQGG---NHPAAHQQQMNQSVGQG 337
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1405 SYSGPDRRPIQGQYPYPYNRERMQGPGQMQPhgiPPQMMGGPMQSSSSEGPQQNmwATRNDMPYPYQSRQGPGGPAQAPP 1484
Cdd:pfam09606  338 GQVVALGGLNHLETWNPGNFGGLGANPMQRG---QPGMMSSPSPVPGQQVRQVT--PNQFMRQSPQPSVPSPQGPGSQPP 412
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 145553997  1485 YPGMNRtddmMVPEqrinhesqwPSHVSQRQPYMSSSASMQpiTRPPQSSYQTPPSLPNHISrAPSPASFQ 1555
Cdd:pfam09606  413 QSHPGG----MIPS---------PALIPSPSPQMSQQPAQQ--RTIGQDSPGGSLNTPGQSA-VNSPLNPQ 467
PHA03247 super family cl33720
large tegument protein UL36; Provisional
451-863 4.02e-06

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 4.02e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  451 PAMSPGTPGPTMGRSQGSPMDPMVMKRPQLYGMGTHPHSQPQQSSPY-PGGSYG-----------PPGAQRYPLGMQGRA 518
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRaPVDDRGdprgpappsplPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  519 PGA--LGGLQYPQQQMPPQYGQQAVSGYCQQGQQPYYNQQPQPSHLPPQaQYLQPAAAQSQQRYQPQQDMSQEGYGTRSQ 596
Cdd:PHA03247 2633 PAAnePDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPPTPEPA 2711
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  597 P-------PLAPGKSNHEDLNLIQQERPSSLPDLSGSIDDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPH 669
Cdd:PHA03247 2712 PhalvsatPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  670 LSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPGSQSESSSHPALSQSPMPQERGFMTGtqrnpqmsqyGP 749
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG----------GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  750 -QQTGPSMSPHPSPGGQMHPGISNFQQsnssgtygPQMSQygpqgnysRTPTYSGVPSASYSGPGPGMGINANNQMHGQG 828
Cdd:PHA03247 2862 vRRRPPSRSPAAKPAAPARPPVRRLAR--------PAVSR--------STESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
                         410       420       430
                  ....*....|....*....|....*....|....*
gi 145553997  829 PAQPCGAMPLGRMPSAGMQNRPFPGTMSSVTPSSP 863
Cdd:PHA03247 2926 PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960
 
Name Accession Description Interval E-value
BAF250_C pfam12031
SWI/SNF-like complex subunit BAF250/Osa; This entry represents the mammalian BAF250a/b and its ...
1935-2190 5.35e-163

SWI/SNF-like complex subunit BAF250/Osa; This entry represents the mammalian BAF250a/b and its homolog osa from fruit flies. They are part of the SWI/SNF-like ATP-dependent chromatin remodelling complex that regulates gene expression through regulating nucleosome remodelling. In humans there are two BAF250 isoforms, BAF250a/ARID1a and BAF250b/ARID1b. BAF250a/b may be E3 ubiquitin ligases that target histone H2B.


Pssm-ID: 463439  Cd Length: 257  Bit Score: 501.09  E-value: 5.35e-163
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1935 SLAKRCICVSNIVRSLSFVPGNDAEMSKHPGLVLILGKLILLHHEHPERKRAPQTYEKEEDEDKGVACSKDEWWWDCLEV 2014
Cdd:pfam12031    1 SLARRCLCVSNILRSLSFVPGNDLEMAKHPGLLLILGRLLLLHHEHPERSSAPRTYDKEEDEDFGLSCSRDEWWWDCLEV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  2015 LRDNTLVTLANISGQLDLSAYTESICLPILDGLLHWMVCPSAEAQDPFPTVGPNSVLSPQRLVLETLCKLSIQDNNVDLI 2094
Cdd:pfam12031   81 LRENALVTLANISGQLDLSPYPESICLPILDGLLHWAVCPSAEAQDPFPSAGPNSPLSPQRLVLEALCKLSVIDNNVDLI 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  2095 LATPPFSRQEKFYATLVRYVGDRKNPVCREMSMALLSNLAQGDTLAARAIAVQKGSIGNLISFLEDGVTMAQYQQSQHNL 2174
Cdd:pfam12031  161 LATPPFSRLEKLFHILVRLLGRREEQVCREFAVVLLSYLAQADSAAARAIAMQKPCISNLISFIEDAEQNAQYVQSQHGL 240
                          250
                   ....*....|....*..
gi 145553997  2175 MHMQ-PPPLEPPSVDMM 2190
Cdd:pfam12031  241 NALRdNPELMGTSVDML 257
ARID_ARID1B cd16877
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 1B (ARID1B) ...
1062-1154 2.90e-64

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 1B (ARID1B) and similar proteins; ARID1B, also called BRG1-associated factor 250b (BAF250B), BRG1-binding protein ELD/OSA1, Osa homolog 2 (Osa2), or p250R, is the largest subunit of ATP-dependent SWItch/sucrose nonfermentable (SWI/SNF) chromatin remodeling complex, which plays a critical role in transcriptional control and gene expression. ARID1B exhibits tumour-suppressor activities in pancreatic cancer cell lines. Mutations in the ARID1B gene cause Coffin-Siris syndrome, exhibiting developmental defects, and haplo-insufficiency of ARID1B is a frequent cause of intellectual disability. Moreover, mutations in the ARID1B gene have been found in many cancers. ARID1B contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), which binds DNA in a non-sequence-specific manner similar to ARID1A.


Pssm-ID: 350641  Cd Length: 93  Bit Score: 212.93  E-value: 2.90e-64
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1062 PERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 1141
Cdd:cd16877     1 PERKLWVDRYLTFMEERGTPVASLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 80
                          90
                  ....*....|...
gi 145553997 1142 QYLFAFECKTERG 1154
Cdd:cd16877    81 QYLFAFECKIERG 93
BRIGHT smart00501
BRIGHT, ARID (A/T-rich interaction domain) domain; DNA-binding domain containing a ...
1062-1149 8.30e-31

BRIGHT, ARID (A/T-rich interaction domain) domain; DNA-binding domain containing a helix-turn-helix structure


Pssm-ID: 128777 [Multi-domain]  Cd Length: 93  Bit Score: 117.38  E-value: 8.30e-31
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   1062 PERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVG-TSSSAASSLKKQY 1140
Cdd:smart00501    1 RERVLFLDRLYKFMEERGSPLKKIPVIGGKPLDLYRLYRLVQERGGYDQVTKDKKWKEIARELGIPdTSTSAASSLRKHY 80

                    ....*....
gi 145553997   1141 IQYLFAFEC 1149
Cdd:smart00501   81 ERYLLPYER 89
ARID pfam01388
ARID/BRIGHT DNA binding domain; This domain is know as ARID for AT-Rich Interaction Domain, ...
1063-1148 9.09e-25

ARID/BRIGHT DNA binding domain; This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.


Pssm-ID: 460187  Cd Length: 87  Bit Score: 100.01  E-value: 9.09e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1063 ERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVG-TSSSAASSLKKQYI 1141
Cdd:pfam01388    1 EKELFLKSLRKFHEKRGTPLKQIPVIGGKPVDLYKLYKAVQKLGGYDKVTEKNLWREVAEKLGFPpSAASAATQLKQIYE 80

                   ....*..
gi 145553997  1142 QYLFAFE 1148
Cdd:pfam01388   81 KYLLPYE 87
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1175-1555 8.28e-08

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 57.71  E-value: 8.28e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1175 PPSPANSGSLQGPQTPQSTGSNSMAEvpGDLKPPTPASTPHGQMTPMQSGRSSTVSVHDPFSDVSDSAYPKRNSMTPNAP 1254
Cdd:pfam09606  109 PMGQQMGGPGTASNLLASLGRPQMPM--GGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAG 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1255 YQQGMGMPDMMGRMPYEPnkdpFSGMRKVPGSSEPFMTQGQVPNSGMQDM-----YNQSPSGAMSNLGMGQRQQFPYGTS 1329
Cdd:pfam09606  187 GMNGGQQGPMGGQMPPQM----GVPGMPGPADAGAQMGQQAQANGGMNPQqmggaPNQVAMQQQQPQQQGQQSQLGMGIN 262
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1330 YDRRHEAYGQQYPGQGPPTGQPPYGGHQPG-----LYPQQPNYKRHMDGMygPPAKRHEGdmyNMQYGSQQQEMYNQYGG 1404
Cdd:pfam09606  263 QMQQMPQGVGGGAGQGGPGQPMGPPGQQPGampnvMSIGDQNNYQQQQTR--QQQQQQGG---NHPAAHQQQMNQSVGQG 337
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1405 SYSGPDRRPIQGQYPYPYNRERMQGPGQMQPhgiPPQMMGGPMQSSSSEGPQQNmwATRNDMPYPYQSRQGPGGPAQAPP 1484
Cdd:pfam09606  338 GQVVALGGLNHLETWNPGNFGGLGANPMQRG---QPGMMSSPSPVPGQQVRQVT--PNQFMRQSPQPSVPSPQGPGSQPP 412
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 145553997  1485 YPGMNRtddmMVPEqrinhesqwPSHVSQRQPYMSSSASMQpiTRPPQSSYQTPPSLPNHISrAPSPASFQ 1555
Cdd:pfam09606  413 QSHPGG----MIPS---------PALIPSPSPQMSQQPAQQ--RTIGQDSPGGSLNTPGQSA-VNSPLNPQ 467
PHA03247 PHA03247
large tegument protein UL36; Provisional
451-863 4.02e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 4.02e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  451 PAMSPGTPGPTMGRSQGSPMDPMVMKRPQLYGMGTHPHSQPQQSSPY-PGGSYG-----------PPGAQRYPLGMQGRA 518
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRaPVDDRGdprgpappsplPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  519 PGA--LGGLQYPQQQMPPQYGQQAVSGYCQQGQQPYYNQQPQPSHLPPQaQYLQPAAAQSQQRYQPQQDMSQEGYGTRSQ 596
Cdd:PHA03247 2633 PAAnePDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPPTPEPA 2711
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  597 P-------PLAPGKSNHEDLNLIQQERPSSLPDLSGSIDDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPH 669
Cdd:PHA03247 2712 PhalvsatPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  670 LSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPGSQSESSSHPALSQSPMPQERGFMTGtqrnpqmsqyGP 749
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG----------GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  750 -QQTGPSMSPHPSPGGQMHPGISNFQQsnssgtygPQMSQygpqgnysRTPTYSGVPSASYSGPGPGMGINANNQMHGQG 828
Cdd:PHA03247 2862 vRRRPPSRSPAAKPAAPARPPVRRLAR--------PAVSR--------STESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
                         410       420       430
                  ....*....|....*....|....*....|....*
gi 145553997  829 PAQPCGAMPLGRMPSAGMQNRPFPGTMSSVTPSSP 863
Cdd:PHA03247 2926 PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960
Enamelin pfam15362
Enamelin; ENAMELIN is involved in the mineralization and structural organization of enamel. It ...
632-832 1.80e-05

Enamelin; ENAMELIN is involved in the mineralization and structural organization of enamel. It is necessary for the extension of enamel during the secretory stage of dental enamel formation. The proteins are expressed in teeth, particularly in odontoblasts, ameloblasts and cementoblasts.


Pssm-ID: 464672 [Multi-domain]  Cd Length: 907  Bit Score: 50.21  E-value: 1.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   632 TGTEATLSSAVSASGSTssqgdQSNPAQSPFSPHASPHLSSIPGgpsPSPVGSPVGSNQSRSGP---ISPASIPGSQMPP 708
Cdd:pfam15362   26 PATEPSANSTVPETNST-----QPNPGGSQGGNDTSPTGNSAPG---PNTGSNPTAQNGVFPPPavnVSGQGVPRSQIPW 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   709 QPpgsqsessSHPAL-SQSPMPQERGFMTGTQRNPQMSQYGPQQTGPSMSphpspggqmhpgisnfQQSNSSGTYGPQMS 787
Cdd:pfam15362   98 GP--------SQPNIrENYPNPNIRNFPSGRQWSPTGTAMGHRQNGPFYR----------------NQQVQRGPRWNSFA 153
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 145553997   788 QYGPQGNYSRTPTY-SGVPSASYSGPGPGMGINANNQMHGQGPAQP 832
Cdd:pfam15362  154 WEGKQAARPGNPTYrKAYPSTSRGNYPNYAGNPANFRRKPQGPNKH 199
dnaA PRK14086
chromosomal replication initiator protein DnaA;
1342-1552 5.08e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 45.20  E-value: 5.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1342 PGQGPPTGQPPYGGHQPGlyPQQPNY-KRHMDGMYGPPAKRHEgdmynmQYGSQQQEMynqyggsysgPDRRPIQGQY-- 1418
Cdd:PRK14086   90 PSAGEPAPPPPHARRTSE--PELPRPgRRPYEGYGGPRADDRP------PGLPRQDQL----------PTARPAYPAYqq 151
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1419 ---PYPYNRERMQGPGQMQPHGIPPqmmGGPMQSSSSEGPQQNMWAT-----RNDMPYPYQSRQGPGGPAQAPpypgmnR 1490
Cdd:PRK14086  152 rpePGAWPRAADDYGWQQQRLGFPP---RAPYASPASYAPEQERDREpydagRPEYDQRRRDYDHPRPDWDRP------R 222
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 145553997 1491 TDDMMVPeqrinhesqWPSHVSQRQPYMSSSASMQPITRPPQSSYQTPPSLPNHISRAPSPA 1552
Cdd:PRK14086  223 RDRTDRP---------EPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPG 275
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
1386-1525 3.14e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.49  E-value: 3.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1386 MYNMQYGSQQQEMYNQYggSYSGPDRRPIQGQYPYPYNRERMQGPGQMQPHGIPPQMMGGPMQSSSSEGPQQNMWATRND 1465
Cdd:TIGR01628  382 MRQLPMGSPMGGAMGQP--PYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPM 459
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1466 MPYPYQsrqgpggpaqaPPYPGMNRTDDMMVPEQRINHESQWPSHVSQRQpymSSSASMQ 1525
Cdd:TIGR01628  460 QPVMYP-----------PNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLA---SATPQMQ 505
 
Name Accession Description Interval E-value
BAF250_C pfam12031
SWI/SNF-like complex subunit BAF250/Osa; This entry represents the mammalian BAF250a/b and its ...
1935-2190 5.35e-163

SWI/SNF-like complex subunit BAF250/Osa; This entry represents the mammalian BAF250a/b and its homolog osa from fruit flies. They are part of the SWI/SNF-like ATP-dependent chromatin remodelling complex that regulates gene expression through regulating nucleosome remodelling. In humans there are two BAF250 isoforms, BAF250a/ARID1a and BAF250b/ARID1b. BAF250a/b may be E3 ubiquitin ligases that target histone H2B.


Pssm-ID: 463439  Cd Length: 257  Bit Score: 501.09  E-value: 5.35e-163
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1935 SLAKRCICVSNIVRSLSFVPGNDAEMSKHPGLVLILGKLILLHHEHPERKRAPQTYEKEEDEDKGVACSKDEWWWDCLEV 2014
Cdd:pfam12031    1 SLARRCLCVSNILRSLSFVPGNDLEMAKHPGLLLILGRLLLLHHEHPERSSAPRTYDKEEDEDFGLSCSRDEWWWDCLEV 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  2015 LRDNTLVTLANISGQLDLSAYTESICLPILDGLLHWMVCPSAEAQDPFPTVGPNSVLSPQRLVLETLCKLSIQDNNVDLI 2094
Cdd:pfam12031   81 LRENALVTLANISGQLDLSPYPESICLPILDGLLHWAVCPSAEAQDPFPSAGPNSPLSPQRLVLEALCKLSVIDNNVDLI 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  2095 LATPPFSRQEKFYATLVRYVGDRKNPVCREMSMALLSNLAQGDTLAARAIAVQKGSIGNLISFLEDGVTMAQYQQSQHNL 2174
Cdd:pfam12031  161 LATPPFSRLEKLFHILVRLLGRREEQVCREFAVVLLSYLAQADSAAARAIAMQKPCISNLISFIEDAEQNAQYVQSQHGL 240
                          250
                   ....*....|....*..
gi 145553997  2175 MHMQ-PPPLEPPSVDMM 2190
Cdd:pfam12031  241 NALRdNPELMGTSVDML 257
ARID_ARID1B cd16877
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 1B (ARID1B) ...
1062-1154 2.90e-64

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 1B (ARID1B) and similar proteins; ARID1B, also called BRG1-associated factor 250b (BAF250B), BRG1-binding protein ELD/OSA1, Osa homolog 2 (Osa2), or p250R, is the largest subunit of ATP-dependent SWItch/sucrose nonfermentable (SWI/SNF) chromatin remodeling complex, which plays a critical role in transcriptional control and gene expression. ARID1B exhibits tumour-suppressor activities in pancreatic cancer cell lines. Mutations in the ARID1B gene cause Coffin-Siris syndrome, exhibiting developmental defects, and haplo-insufficiency of ARID1B is a frequent cause of intellectual disability. Moreover, mutations in the ARID1B gene have been found in many cancers. ARID1B contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), which binds DNA in a non-sequence-specific manner similar to ARID1A.


Pssm-ID: 350641  Cd Length: 93  Bit Score: 212.93  E-value: 2.90e-64
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1062 PERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 1141
Cdd:cd16877     1 PERKLWVDRYLTFMEERGTPVASLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 80
                          90
                  ....*....|...
gi 145553997 1142 QYLFAFECKTERG 1154
Cdd:cd16877    81 QYLFAFECKIERG 93
ARID_ARID1A cd16876
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 1A (ARID1A) ...
1062-1154 1.03e-47

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 1A (ARID1A) and similar proteins; ARID1A, also called B120, BRG1-associated factor 250a (BAF250A), Osa homolog 1(OSA1), SWI-like protein, SWI/SNF complex protein p270, or SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin subfamily F member 1 (SWI1), has been identified as a novel tumor suppressor in various tumor types. It interacts with BRG1 adenosine triphosphatase to form a SWItch/Sucrose NonFermentable (SWI/SNF) chromatin remodeling protein complex, which plays a critical role in transcriptional control and gene expression. ARID1A contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), and Eld/Osa homology domains (EHD) 1 and 2 within the C-terminus. The ARID in ARID1A binds nonspecific DNA in general and plays an important role in targeting SWI/SNF to chromatin. The EHD1 may be capable of mediating an intramolecular association with EHD2, and/or an intermolecular association resulting in homo- or hetero-dimerization. The EHD2 binds Swi2/Brahma homologue Brahma-related gene 1 (BRG1, also known as Snf2b), a human homologue of yeast Swi2.


Pssm-ID: 350640  Cd Length: 93  Bit Score: 165.61  E-value: 1.03e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1062 PERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 1141
Cdd:cd16876     1 PERKMWVDRYLAFTEEKAMGMTNLPAVGRKPLDLYRLYVSVKEIGGLTQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 80
                          90
                  ....*....|...
gi 145553997 1142 QYLFAFECKTERG 1154
Cdd:cd16876    81 QCLYAFECKIERG 93
ARID_ARID1A-like cd16865
ARID/BRIGHT DNA binding domain found in AT-rich interactive domain-containing proteins ARID1A, ...
1062-1154 5.41e-47

ARID/BRIGHT DNA binding domain found in AT-rich interactive domain-containing proteins ARID1A, ARID1B and similar proteins; This subfamily contains ARID1A and its paralog ARID1B. They are mutually exclusive components of human SWItch/Sucrose NonFermentable (SWI/SNF) chromatin remodeling protein complexes, but display different functions in development and cell-cycle control. SWI/SNF complexes containing ARID1A have an antiproliferative function, whereas the one harboring ARID1B shows a pro-proliferative function. ARID1A functions as an important tumor suppressor in various tumor types. It has been implicated in cell-cycle arrest, as well as in the interactions with p53 and BRG1/BRM and with topoisomerase II alpha. ARID1B may be considered as a potential therapeutic target for ARID1A-mutant cancers. Moreover, mutations in the ARID1B gene cause Coffin-Siris syndrome, exhibiting developmental defects, and haplo-insufficiency of ARID1B is a frequent cause of intellectual disability. Mutations in the ARID1B gene also have been found in many cancers. Both ARID1A and ARID1B contain an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), which binds DNA in a non-sequence-specific manner.


Pssm-ID: 350629  Cd Length: 93  Bit Score: 163.60  E-value: 5.41e-47
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1062 PERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYI 1141
Cdd:cd16865     1 PERRPFLDRLLRFMEERGSPITNCPQISKQPLDLFRLYVTVKERGGVAEVTKNKKWKEICTELNIGASSSAAFTLRKNYI 80
                          90
                  ....*....|...
gi 145553997 1142 QYLFAFECKTERG 1154
Cdd:cd16865    81 KYLLAYECRFDRG 93
BRIGHT smart00501
BRIGHT, ARID (A/T-rich interaction domain) domain; DNA-binding domain containing a ...
1062-1149 8.30e-31

BRIGHT, ARID (A/T-rich interaction domain) domain; DNA-binding domain containing a helix-turn-helix structure


Pssm-ID: 128777 [Multi-domain]  Cd Length: 93  Bit Score: 117.38  E-value: 8.30e-31
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   1062 PERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVG-TSSSAASSLKKQY 1140
Cdd:smart00501    1 RERVLFLDRLYKFMEERGSPLKKIPVIGGKPLDLYRLYRLVQERGGYDQVTKDKKWKEIARELGIPdTSTSAASSLRKHY 80

                    ....*....
gi 145553997   1141 IQYLFAFEC 1149
Cdd:smart00501   81 ERYLLPYER 89
ARID smart01014
ARID/BRIGHT DNA binding domain; Members of the recently discovered ARID (AT-rich interaction ...
1062-1148 1.17e-26

ARID/BRIGHT DNA binding domain; Members of the recently discovered ARID (AT-rich interaction domain) family of DNA-binding proteins are found in fungi and invertebrate and vertebrate metazoans. ARID-encoding genes are involved in a variety of biological processes including embryonic development, cell lineage gene regulation and cell cycle control. Although the specific roles of this domain and of ARID-containing proteins in transcriptional regulation are yet to be elucidated, they include both positive and negative transcriptional regulation and a likely involvement in the modification of chromatin structure. The basic structure of the ARID domain domain appears to be a series of six alpha-helices separated by beta-strands, loops, or turns, but the structured region may extend to an additional helix at either or both ends of the basic six. Based on primary sequence homology, they can be partitioned into three structural classes: Minimal ARID proteins that consist of a core domain formed by six alpha helices; ARID proteins that supplement the core domain with an N-terminal alpha-helix; and Extended-ARID proteins, which contain the core domain and additional alpha-helices at their N- and C-termini.


Pssm-ID: 198082 [Multi-domain]  Cd Length: 88  Bit Score: 105.39  E-value: 1.17e-26
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   1062 PERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNV-GTSSSAASSLKKQY 1140
Cdd:smart01014    1 RERELFLDRLRKFMEKRGTPLDKIPVIGGKPLDLYRLYRAVQKRGGFDKVTKKKKWKQVARELGIpPSATSAGTSLRKHY 80

                    ....*...
gi 145553997   1141 IQYLFAFE 1148
Cdd:smart01014   81 EKYLLPYE 88
ARID_ARID3 cd16867
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing proteins ARID3A, ...
1054-1157 2.17e-25

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing proteins ARID3A, ARID3B, ARID3C, dead ringer (Dri) from Drosophila melanogaster, and similar proteins; The ARID3 subfamily includes AT-rich interactive domain (ARID, also known as BRIGHT)-containing proteins ARID3A, ARID3B and ARID3C, which are the most direct mammalian counterparts of the Drosophila "dead ringer" protein Dri. They consist of an acidic N-terminal region of unknown function, the central ARID matrix association (or attachment) region (MAR)-DNA binding domain, a SUMO-I conjugation (SUMO) motif, and a multifunctional homomerization/nuclear export REKLES domain in the C-terminal third of the molecule. The ARID domain in this subfamily has been described as the "extended" or e-ARID due to additional conserved sequences at both the N and C termini of the core ARID region. The REKLES domain is found only in the ARID3 subfamily. It has co-evolved with and regulates functional properties of the ARID DNA-binding domain.


Pssm-ID: 350631  Cd Length: 118  Bit Score: 102.95  E-value: 2.17e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1054 KVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAA 1133
Cdd:cd16867     3 QLYELSDDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLYELYRLVVEKGGLVEVINKKIWREITKGLNLPSSITSA 82
                          90       100
                  ....*....|....*....|....*
gi 145553997 1134 S-SLKKQYIQYLFAFECKTERGEEP 1157
Cdd:cd16867    83 AfTLRTQYMKYLYPYECEKEKLSSP 107
ARID pfam01388
ARID/BRIGHT DNA binding domain; This domain is know as ARID for AT-Rich Interaction Domain, ...
1063-1148 9.09e-25

ARID/BRIGHT DNA binding domain; This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.


Pssm-ID: 460187  Cd Length: 87  Bit Score: 100.01  E-value: 9.09e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1063 ERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVG-TSSSAASSLKKQYI 1141
Cdd:pfam01388    1 EKELFLKSLRKFHEKRGTPLKQIPVIGGKPVDLYKLYKAVQKLGGYDKVTEKNLWREVAEKLGFPpSAASAATQLKQIYE 80

                   ....*..
gi 145553997  1142 QYLFAFE 1148
Cdd:pfam01388   81 KYLLPYE 87
ARID_ARID3A cd16878
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 3A (ARID3A) ...
1050-1170 1.34e-23

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 3A (ARID3A) and similar proteins; ARID3A, also called B-cell regulator of IgH transcription (Bright), dead ringer-like protein 1 (Dril1), or E2F-binding protein 1 (E2FBP1), is an ubiquitously expressed DNA-binding protein that has been implicated in embryonic patterning, cell lineage gene regulation, and cell cycle control, chromatin remodeling and transcriptional regulation. It was originally identified as a B cell-specific trans-activator of immunoglobulin heavy-chain (IgH) transcription, which increases immunoglobulin transcription in antigen-activated B cells and plays regulatory roles in hematopoiesis. It also functions as an E2F transcription regulator, inducing promyelocytic leukemia protein (PML) reduction and suppressing the formation of PML-nuclear bodies. It antagonizes the p16(INK4A)-Rb tumor suppressor machinery by regulating PML stability. ARID3A transcriptional activity can be modulated by SUMO (Small Ubiquitin-related Modifier) modification through the interaction with the SUMO-conjugating enzyme Ubc9. ARID3A also plays an important role in marginal zone B lymphocyte development and autoantibody production. Furthermore, ARID3A is a direct p53 target gene. It controls cell growth in a p53-dependent manner. ARID3A contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a SUMO-I conjugation (SUMO) motif and a multifunctional homomerization/nuclear export REKLES domain, which consists of two subdomains: a modestly conserved N-terminal REKLES alpha and a highly conserved (among ARID3 orthologous proteins) C-terminal REKLES beta.


Pssm-ID: 350642  Cd Length: 133  Bit Score: 98.21  E-value: 1.34e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1050 EKITKVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTS 1129
Cdd:cd16878     9 EQFKQLYELDGDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLYMLYVLVTEKGGLVEVINKKLWREITKGLNLPTS 88
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 145553997 1130 SSAAS-SLKKQYIQYLFAFECKtERGEEPPPEVFSTGDSKKQ 1170
Cdd:cd16878    89 ITSAAfTLRTQYMKYLYPYECE-KRGLSNPNELQAAIDSNRR 129
ARID_ARID3B cd16879
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 3B (ARID3B) ...
1054-1175 2.54e-23

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 3B (ARID3B) and similar proteins; ARID3B, also called Bright and dead ringer protein, or Bright-Dri-like protein (Bdp), is a DNA binding protein involved in cellular immortalization, epithelial-mesenchymal transition (EMT), and tumorigenesis. Its expression is differentially regulated in normal and malignant tissues. It is required for heart development by regulating the motility and differentiation of heart progenitors. ARID3B is overexpressed in neuroblastoma and ovarian cancer. It acts as a novel target with roles in cell motility in breast cancer cells, promotes migration of mouse embryo fibroblasts (MEFs) and breast cancer cells, and induces tumor necrosis factor alpha (TNFalpha)-mediated apoptosis. ARID3B contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a SUMO-I conjugation (SUMO) motif and a multifunctional homomerization/nuclear export REKLES domain, which consists of two subdomains: a modestly conserved N-terminal REKLES alpha and a highly conserved (among ARID3 orthologous proteins) C-terminal REKLES beta.


Pssm-ID: 350643  Cd Length: 126  Bit Score: 97.39  E-value: 2.54e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1054 KVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAA 1133
Cdd:cd16879     5 KLYELDNDPKRKEFLDDLFAFMQKRGTPINRIPIMAKQVLDLYMLYKLVTEKGGLVEVINKKIWREITKGLNLPTSITSA 84
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|...
gi 145553997 1134 S-SLKKQYIQYLFAFECKtERGEEPPPEVFSTGDSKKQPKLQP 1175
Cdd:cd16879    85 AfTLRTQYMKYLYPYECE-KKSLSSPAELQAAIDGNRREGRRP 126
ARID_ARID3C cd16880
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 3C (ARID3C) ...
1050-1170 1.92e-22

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 3C (ARID3C) and similar proteins; ARID3C, also called Brightlike, is a new ARID3 family transcription factor that co-activates ARID3A-mediated immunoglobulin gene transcription. It also functions as a potential regulator of early events in B cell antigen receptor (BCR) signaling. ARID3C contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a SUMO-I conjugation (SUMO) motif and a multifunctional homomerization/nuclear export REKLES domain, which consists of two subdomains: a modestly conserved N-terminal REKLES alpha and a highly conserved (among ARID3 orthologous proteins) C-terminal REKLES beta.


Pssm-ID: 350644  Cd Length: 127  Bit Score: 94.71  E-value: 1.92e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1050 EKITKVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTS 1129
Cdd:cd16880     2 EQFKQLYELDDDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLYTLYRLVTDKGGLVEVINKKIWREITKGLSLPTS 81
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|..
gi 145553997 1130 SSAAS-SLKKQYIQYLFAFECKTeRGEEPPPEVFSTGDSKKQ 1170
Cdd:cd16880    82 ITSAAfTLRTQYMKYLYPYECEK-RALSSPGELQAAIDSNRR 122
ARID cd16100
ARID/BRIGHT DNA binding domain family; The AT-rich interaction domain (ARID) family of ...
1063-1148 1.22e-21

ARID/BRIGHT DNA binding domain family; The AT-rich interaction domain (ARID) family of transcription factors, found in a broad array of organisms from fungi to mammals, is characterized by a highly conserved, helix-turn-helix DNA binding domain that binds to the major groove of DNA. The ARID domain, also called BRIGHT, was first identified in the mouse B-cell-specific transcription factor Bright and in the product of the dead ringer (dri) gene of Drosophila melanogaster. ARID family members are implicated in normal development, differentiation, cell cycle regulation, transcriptional activation and chromatin remodeling. Different family members exhibit different DNA-binding properties. Drosophila Dri, mammalian ARID3A/3B/3C and ARID5A/5B, selectively bind AT-rich sites. However, ARID1A/1B, Drosophila Osa, yeast SWI1, ARID2, ARID4A/4B, JARID1A/1B/1C/1D, and JARID2, bind DNA without sequence specificity.


Pssm-ID: 350627  Cd Length: 87  Bit Score: 90.88  E-value: 1.22e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1063 ERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSA-ASSLKKQYI 1141
Cdd:cd16100     1 EREEFLEQLRAFLESRGTPLLKPPTIGGKPLDLYKLYRAVVSRGGYEKVTEKKLWKEVARKLGLPTSSTSaAQALKRIYE 80

                  ....*..
gi 145553997 1142 QYLFAFE 1148
Cdd:cd16100    81 KYLLPFE 87
ARID_Dri-like cd16881
ARID/BRIGHT DNA binding domain of dead ringer (Dri) from Drosophila melanogaster and similar ...
1050-1157 2.48e-20

ARID/BRIGHT DNA binding domain of dead ringer (Dri) from Drosophila melanogaster and similar proteins; Dri, also termed retained (retn), is a nuclear protein with a sequence-specific DNA-binding domain termed AT-rich DNA-interacting domain (ARID, also known as BRIGHT). It is a founding member of the ARID family. Sequence comparison shows that DRI belongs to the "extended" or e-ARID subfamily, which exhibits an extended region of similarity either side of the ARID. Dri plays an important role in embryogenesis. It functions as an essential transcription factor involved in aspects of dorsal/ventral and anterior/posterior axis patterning, as well as myogenesis and hindgut development.


Pssm-ID: 350645  Cd Length: 125  Bit Score: 88.80  E-value: 2.48e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1050 EKITKVYELGNEPERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTS 1129
Cdd:cd16881     6 EQFKQLYEISDDPKRKEFLDDLFSFMQKRGTPVNRIPIMAKQVLDLYELYNLVVARGGLVEVINKKLWREITKGLHLPSS 85
                          90       100
                  ....*....|....*....|....*....
gi 145553997 1130 SSAAS-SLKKQYIQYLFAFECKTERGEEP 1157
Cdd:cd16881    86 ITSAAfTLRTQYMKYLYPYECEKLKLSTP 114
ARID_ARID4 cd16868
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing proteins ARID4A, ...
1063-1148 8.94e-17

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing proteins ARID4A, ARID4B and similar proteins; This subfamily contains ARID4A and its paralog ARID4B, both of which are retinoblastoma (Rb)-binding proteins that function as coactivators to enhance the androgen receptor (AR) and Rb transcriptional activity, and play important roles in the AR and Rb pathways to control male fertility. They also act as the leukemia and tumor suppressors involved in epigenetic regulation in leukemia and Prader-Willi/Angelman syndrome. Moreover, they associate with the mSIN3A histone deacetylase (HDAC) chromatin remodeling complex through the interaction with each other, as well as with the breast cancer associated tumor suppressor ING1 and the breast cancer metastasis suppressor BRMS1. Both ARID4A and ARID4B contain a Tudor domain, a PWWP domain (also known as HATH domain or RBB1NT domain), an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a chromobarrel domain, and a C-terminal R2 domain.


Pssm-ID: 350632  Cd Length: 87  Bit Score: 77.04  E-value: 8.94e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1063 ERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAAS-SLKKQYI 1141
Cdd:cd16868     1 EKENFLEQLYKFMEDRGTPINKPPVLGYKDLDLFKLYKLVQELGGMERVSQGAKWRSIYQQLGIPVLNSAAShNIKQAYK 80

                  ....*..
gi 145553997 1142 QYLFAFE 1148
Cdd:cd16868    81 KYLYAFE 87
ARID_ARID2 cd16866
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 2 (ARID2) and ...
1071-1148 2.47e-11

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 2 (ARID2) and similar proteins; ARID2, also called BRG1-associated factor 200 (BAF200) or zinc finger protein with activation potential (Zipzap/p200), is a novel serum response factor (SRF)-binding protein with multiple conserved domains, including an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), RFX DNA-binding domain, a glutamine-rich domain, and two C2H2 zinc fingers. It binds DNA without sequence specificity. ARID2 is an intrinsic subunit of PBAF (SWI/SNF-B) remodeling complex, which needs ARID2 to play an essential role in promoting osteoblast differentiation, maintaining cellular identity and activating tissue-specific gene expression. Moreover, ARID2 may function as a tumor suppressor in many cancers. It may also serve as a transcription co-activator for the regulation of cardiac gene expression, and is required for heart morphogenesis and coronary artery development.


Pssm-ID: 350630  Cd Length: 88  Bit Score: 61.90  E-value: 2.47e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1071 YLTFMEE-------RGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNV-GTSSSAASSLKKQYIQ 1142
Cdd:cd16866     2 YDAFLNElrqfhasRGTPFKKIPVVGGKELDLYLLYSKVTALGGWAKVTDKNKWEEILEDFNFpRGCSNAAFALKQIYLR 81

                  ....*.
gi 145553997 1143 YLFAFE 1148
Cdd:cd16866    82 YLEAYE 87
ARID_HMGB9-like cd16872
ARID/BRIGHT DNA binding domain of Arabidopsis thaliana high mobility group B proteins HMGB9, ...
1063-1148 2.90e-11

ARID/BRIGHT DNA binding domain of Arabidopsis thaliana high mobility group B proteins HMGB9, HMGB10, HMGB11, HMGB15 and similar proteins; This subfamily includes a group of conserved plant DNA-binding proteins, including HMGB9 (or ARID-HMG1), HMGB10 (or ARID-HMG2), HMGB11, and HMGB15. They have been termed ARID-HMG proteins, due to containing two DNA-binding domains, an N-terminal AT-rich DNA-interacting domain (ARID, also known as BRIGHT), and a C-terminal high mobility group (HMG)-box domain. They are widely expressed in Arabidopsis and localize primarily to the nucleus. HMGB9/ARID-HMG1 binds specifically to A/T-rich DNA. HMGB15 is a transcription factor predominantly expressed in mature pollen grains and pollen tubes. It may work in the form of a homodimer, or interact with HMGB9, HMGB10 and HMGB11 to form heteromultimers in plant cells. HMGB15 is required for pollen tube growth in Arabidopsis and is involved in transcriptional regulation through the interaction with AGL66 and AGL104.


Pssm-ID: 350636  Cd Length: 86  Bit Score: 61.51  E-value: 2.90e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1063 ERKLWVDRYLTFMEERGSPvSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVG-TSSSAASSLKKQYI 1141
Cdd:cd16872     1 DPDLFWETLRKFHESLGTK-FRIPIVGGKELDLHRLYKEVTSRGGLEKVIKDRKWKEVAAVFNFPpTITNASFVLRKYYL 79

                  ....*..
gi 145553997 1142 QYLFAFE 1148
Cdd:cd16872    80 SLLHHYE 86
ARID_Swi1p-like cd16871
ARID/BRIGHT DNA binding domain of yeast SWI/SNF chromatin-remodeling complex subunit Swi1p and ...
1074-1148 1.45e-10

ARID/BRIGHT DNA binding domain of yeast SWI/SNF chromatin-remodeling complex subunit Swi1p and similar proteins; Saccharomyces cerevisiae Swi1p, also called SWI/SNF chromatin-remodeling complex subunit SWI1, regulatory protein GAM3, or transcription regulatory protein ADR6, is a transcription regulatory protein that is a subunit of the SWI/SNF complex, which plays critical roles in the regulation of gene transcription and expression. It can exist as a prion, [SWI(+)], which demonstrates a link between prionogenesis and global transcriptional regulation. Swi1p contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT) that binds DNA nonspecifically. This subfamily also includes Schizosaccharomyces pombe SWI/SNF chromatin-remodeling complex subunit sol1 (sol1p, also known as switch one-like protein). sol1p is a homolog of S. cerevisiae Swi1p and is also a part of SWI/SNF chromatin-remodeling complex.


Pssm-ID: 350635  Cd Length: 90  Bit Score: 59.57  E-value: 1.45e-10
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 145553997 1074 FMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSA--ASSLKKQYIQYLFAFE 1148
Cdd:cd16871    13 FMAKRGTPIEQQPVIGGRPVNLFRLYQLVQKLGGSRQVTQNNQWPRVAQKLGFPPEQNPqvAQQLAQIYQRYLLPYE 89
ARID_ARID4B cd16883
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 4B (ARID4B) ...
1063-1151 8.12e-10

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 4B (ARID4B) and similar proteins; ARID4B, also called 180 kDa Sin3-associated polypeptide (p180), breast cancer-associated antigen BRCAA1, histone deacetylase complex subunit SAP180, or retinoblastoma-binding protein 1-like 1 (RBP1L1, or RBBP1L1), is a tumor suppressor involved in epigenetic regulation in leukemia and Prader-Willi/Angelman syndrome. It associates with the mSIN3A histone deacetylase (HDAC) chromatin remodeling complex through the interaction with the breast cancer associated tumor suppressor ING1, the breast cancer metastasis suppressor BRMS1, and the ARID4 family homolog ARID4A ( also known as RBP1). ARID4B plays a causative role in metastatic progression of breast cancer. It may also be associated with regulating cell cycle. ARID4B contains a Tudor domain, a PWWP domain (also known as HATH domain or RBB1NT domain), an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a chromobarrel domain, and a C-terminal R2 domain.


Pssm-ID: 350647  Cd Length: 92  Bit Score: 57.67  E-value: 8.12e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1063 ERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAAS-SLKKQYI 1141
Cdd:cd16883     1 ERENFLQQLYKFMEDRGTPINKRPVLGYRNLNLFKLFRLVHKLGGFDNIESGAVWKQVYQDLGIPVLNSAAGyNVKCAYR 80
                          90
                  ....*....|..
gi 145553997 1142 QYLFAFE--CKT 1151
Cdd:cd16883    81 KYLYGFEeyCTS 92
ARID_JARID cd16864
ARID/BRIGHT DNA binding domain of JARID proteins; The JARID subfamily within the JmjC protein ...
1074-1148 4.22e-09

ARID/BRIGHT DNA binding domain of JARID proteins; The JARID subfamily within the JmjC protein family includes lysine-specific demethylase KDM5A, KDM5B, KDM5C, KDM5D and a Drosophila homolog, protein little imaginal discs (Lid). KDM5A was originally identified as a retinoblastoma protein (Rb)-binding partner and its inactivation may be important for Rb to promote differentiation. It is involved in transcription through interacting with TBP, p107, nuclear receptors, Myc, Sin3/HDAC, Mad1, RBP-J, CLOCK and BMAL1. KDM5B has a restricted expression pattern in the testis, ovary, and transiently in the mammary gland of the pregnant female and has been shown to be upregulated in breast cancer, prostate cancer, and lung cancer, suggesting a potential role in tumorigenesis. Both KDM5A and KDM5B function as trimethylated histone H3 lysine 4 (H3K4me3) demethylases. KDM5C is a H3K4 trimethyl-histone demethylase that catalyzes demethylation of H3K4me3 and H3K4me2 to H3K4me1. It plays a role in neuronal survival and dendrite development. KDM5C defects are associated with X-linked mental retardation (XLMR). KDM5D is a male-specific antigen that shows a demethylase activity specific for di- and tri-methylated histone H3K4 (H3K4me3 and H3K4me2), and has a male-specific function as a histone H3K4 demethylase by recruiting a meiosis-regulatory protein, MSH5, to condensed DNA. KDM5D directly interacts with a polycomb-like protein Ring6a/MBLR, and plays a role in regulation of transcriptional initiation through H3K4 demethylation. The family also includes Drosophila melanogaster protein little imaginal discs (Lid) that functions as a JmjC-dependent trimethyl histone H3K4 (H3K4me3) demethylase, which is required for dMyc-induced cell growth. It positively regulates Hox gene expression in S2 cells. Members of this subfamily contain the catalytic JmjC domain, JmjN, the AT-rich domain interacting domain (ARID)/BRIGHT domain, a C5HC2 zinc finger, as well as two or three plant homeodomain (PHD) fingers.


Pssm-ID: 350628  Cd Length: 87  Bit Score: 55.39  E-value: 4.22e-09
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 145553997 1074 FMEERGSPVSsLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFAFE 1148
Cdd:cd16864    14 FWELQGSSLK-IPNVERKALDLFTLHKIVQEEGGFEEVTKERKWSKVARRLGYPPGKGVGSLLRGHYERILYPYD 87
ARID_ARID5 cd16869
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing proteins ARID5A, ...
1074-1120 2.47e-08

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing proteins ARID5A, ARID5B, and similar proteins; This subfamily contains ARID5A and its paralog ARID5B. ARID5A, also called modulator recognition factor 1 (MRF-1), is an estrogen receptor alpha (ER alpha)-interacting protein that is expressed abundantly in cardiovascular tissues and suppresses ER alpha-induced transactivation. It also plays an important role in the promotion of inflammatory processes and autoimmune diseases. ARID5B, also called MRF1-like protein or modulator recognition factor 2 (MRF-2), is a DNA-binding protein that directly interacts with plant homeodomain (PHD) finger 2 (PHF2) to form a protein kinase A (PKA)-dependent PHF2-ARID5B histone H3K9Me2 demethylase complex. It also functions as a transcriptional co-regulator for the transcription factor sex determining region Y (SRY)-box protein 9 (Sox9) and promotes chondrogenesis through histone modification. Moreover, ARID5B is highly expressed in the cardiovascular system and may play essential roles in the phenotypic change of smooth muscle cells (SMCs) through its regulation of SMC differentiation. Both ARID5A and ARID5B contain an AT-rich DNA-interacting domain (ARID, also known as BRIGHT).


Pssm-ID: 350633  Cd Length: 87  Bit Score: 53.07  E-value: 2.47e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 145553997 1074 FMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWREL 1120
Cdd:cd16869    12 FMKDRGTPIERIPHLGFKQIDLYTFFKLVQKLGGYEQVTAKRLWKHV 58
ARID_ARID4A cd16882
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 4A (ARID4A) ...
1063-1148 5.13e-08

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 4A (ARID4A) and similar proteins; ARID4A, also called retinoblastoma-binding protein 1 (RBBP1, or RBP1), is a leukemia and tumor suppressor involved in epigenetic regulation in leukemia and Prader-Willi/Angelman syndrome. It associates with the mSIN3A histone deacetylase (HDAC) chromatin remodeling complex through the interaction with the breast cancer associated tumor suppressor ING1, the breast cancer metastasis suppressor BRMS1, and the ARID4 family homolog ARID4B (also known as RBP1L1). ARID4A specifically interacts with retinoblastoma protein (pRb) and shows both HDAC-dependent and -independent repression activities. It is also involved in the pocket domain of pRb-mediated repression of E2F-dependent transcription and cellular proliferation. Moreover, it acts as a Runx2 coactivator and is involved in the regulation of osteoblastic differentiation in Runx2-osterix transcriptional cascade. ARID4A contains a Tudor domain, a PWWP domain (also known as HATH domain or RBB1NT domain), an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a chromobarrel domain, and a C-terminal R2 domain. The ARID and R2 domains are responsible for the repression activities. The Tudor, PWWP, and chromobarrel domains are all Royal Family domains, but only chromobarrel domain of ARID4A is responsible for recognizing both dsDNA and methylated histone tails, particularly H4K20me3, in chromatin remodeling and epigenetic regulation.


Pssm-ID: 350646  Cd Length: 87  Bit Score: 52.28  E-value: 5.13e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1063 ERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGT-SSSAASSLKKQYI 1141
Cdd:cd16882     1 ERDNFLQQLYKFMEDRGTPINKPPVLGYKDLNLFKLFRLVYQQGGCDNIESGSVWKQIYMDLGIPIlNSAASYNVKTAYR 80

                  ....*..
gi 145553997 1142 QYLFAFE 1148
Cdd:cd16882    81 KYLYGFE 87
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1175-1555 8.28e-08

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 57.71  E-value: 8.28e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1175 PPSPANSGSLQGPQTPQSTGSNSMAEvpGDLKPPTPASTPHGQMTPMQSGRSSTVSVHDPFSDVSDSAYPKRNSMTPNAP 1254
Cdd:pfam09606  109 PMGQQMGGPGTASNLLASLGRPQMPM--GGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAG 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1255 YQQGMGMPDMMGRMPYEPnkdpFSGMRKVPGSSEPFMTQGQVPNSGMQDM-----YNQSPSGAMSNLGMGQRQQFPYGTS 1329
Cdd:pfam09606  187 GMNGGQQGPMGGQMPPQM----GVPGMPGPADAGAQMGQQAQANGGMNPQqmggaPNQVAMQQQQPQQQGQQSQLGMGIN 262
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1330 YDRRHEAYGQQYPGQGPPTGQPPYGGHQPG-----LYPQQPNYKRHMDGMygPPAKRHEGdmyNMQYGSQQQEMYNQYGG 1404
Cdd:pfam09606  263 QMQQMPQGVGGGAGQGGPGQPMGPPGQQPGampnvMSIGDQNNYQQQQTR--QQQQQQGG---NHPAAHQQQMNQSVGQG 337
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1405 SYSGPDRRPIQGQYPYPYNRERMQGPGQMQPhgiPPQMMGGPMQSSSSEGPQQNmwATRNDMPYPYQSRQGPGGPAQAPP 1484
Cdd:pfam09606  338 GQVVALGGLNHLETWNPGNFGGLGANPMQRG---QPGMMSSPSPVPGQQVRQVT--PNQFMRQSPQPSVPSPQGPGSQPP 412
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 145553997  1485 YPGMNRtddmMVPEqrinhesqwPSHVSQRQPYMSSSASMQpiTRPPQSSYQTPPSLPNHISrAPSPASFQ 1555
Cdd:pfam09606  413 QSHPGG----MIPS---------PALIPSPSPQMSQQPAQQ--RTIGQDSPGGSLNTPGQSA-VNSPLNPQ 467
PHA03247 PHA03247
large tegument protein UL36; Provisional
451-863 4.02e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.63  E-value: 4.02e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  451 PAMSPGTPGPTMGRSQGSPMDPMVMKRPQLYGMGTHPHSQPQQSSPY-PGGSYG-----------PPGAQRYPLGMQGRA 518
Cdd:PHA03247 2553 PPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRaPVDDRGdprgpappsplPPDTHAPDPPPPSPS 2632
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  519 PGA--LGGLQYPQQQMPPQYGQQAVSGYCQQGQQPYYNQQPQPSHLPPQaQYLQPAAAQSQQRYQPQQDMSQEGYGTRSQ 596
Cdd:PHA03247 2633 PAAnePDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPPTPEPA 2711
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  597 P-------PLAPGKSNHEDLNLIQQERPSSLPDLSGSIDDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPH 669
Cdd:PHA03247 2712 PhalvsatPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  670 LSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPGSQSESSSHPALSQSPMPQERGFMTGtqrnpqmsqyGP 749
Cdd:PHA03247 2792 SESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG----------GD 2861
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  750 -QQTGPSMSPHPSPGGQMHPGISNFQQsnssgtygPQMSQygpqgnysRTPTYSGVPSASYSGPGPGMGINANNQMHGQG 828
Cdd:PHA03247 2862 vRRRPPSRSPAAKPAAPARPPVRRLAR--------PAVSR--------STESFALPPDQPERPPQPQAPPPPQPQPQPPP 2925
                         410       420       430
                  ....*....|....*....|....*....|....*
gi 145553997  829 PAQPCGAMPLGRMPSAGMQNRPFPGTMSSVTPSSP 863
Cdd:PHA03247 2926 PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVP 2960
ARID_ARID5A cd16884
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 5A (ARID5A) ...
1063-1124 1.72e-05

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 5A (ARID5A) and similar proteins; ARID5A, also called modulator recognition factor 1 (MRF-1), is an estrogen receptor alpha (ER alpha)-interacting protein that is expressed abundantly in cardiovascular tissues and suppresses ER alpha-induced transactivation. It also associates with thyroid receptor alpha (TR alpha) and retinoid X receptor alpha (RXR alpha) in a ligand-dependent manner, and with ER beta, androgen receptor (AR), and the retinoic acid receptor (RAR) in a ligand-independent manner. ARID5A functions as a negative regulator of RORgamma-induced Th17 cell differentiation and may be involved in the pathogenesis of rheumatoid arthritis (RA). Moreover, it is an important transcriptional partner of the transcription factor sex determining region Y (SRY)-box protein 9 (Sox9) in stimulation of chondrocyte-specific transcription. Meanwhile, ARID5A plays an important role in promotion of inflammatory processes and autoimmune diseases. It works as a unique RNA binding protein, which stabilizes interleukin-6 (IL-6) but not tumor necrosis factor-alpha (TNF-alpha) mRNA through binding to the 3' untranslated region (UTR) of IL-6 mRNA, and inhibits the destabilizing effect of regnase-1 on IL-6 mRNA. ARID5A contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT).


Pssm-ID: 350648  Cd Length: 87  Bit Score: 44.99  E-value: 1.72e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 145553997 1063 ERKLWVDRYLTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNL 1124
Cdd:cd16884     1 EEQAFLVNLYKFMKERNTPIERIPHLGFKQINLWKIYKAVEKLGGYELVTARRLWKNVYDEL 62
Enamelin pfam15362
Enamelin; ENAMELIN is involved in the mineralization and structural organization of enamel. It ...
632-832 1.80e-05

Enamelin; ENAMELIN is involved in the mineralization and structural organization of enamel. It is necessary for the extension of enamel during the secretory stage of dental enamel formation. The proteins are expressed in teeth, particularly in odontoblasts, ameloblasts and cementoblasts.


Pssm-ID: 464672 [Multi-domain]  Cd Length: 907  Bit Score: 50.21  E-value: 1.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   632 TGTEATLSSAVSASGSTssqgdQSNPAQSPFSPHASPHLSSIPGgpsPSPVGSPVGSNQSRSGP---ISPASIPGSQMPP 708
Cdd:pfam15362   26 PATEPSANSTVPETNST-----QPNPGGSQGGNDTSPTGNSAPG---PNTGSNPTAQNGVFPPPavnVSGQGVPRSQIPW 97
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   709 QPpgsqsessSHPAL-SQSPMPQERGFMTGTQRNPQMSQYGPQQTGPSMSphpspggqmhpgisnfQQSNSSGTYGPQMS 787
Cdd:pfam15362   98 GP--------SQPNIrENYPNPNIRNFPSGRQWSPTGTAMGHRQNGPFYR----------------NQQVQRGPRWNSFA 153
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*.
gi 145553997   788 QYGPQGNYSRTPTY-SGVPSASYSGPGPGMGINANNQMHGQGPAQP 832
Cdd:pfam15362  154 WEGKQAARPGNPTYrKAYPSTSRGNYPNYAGNPANFRRKPQGPNKH 199
ARID_JARD2 cd16870
ARID/BRIGHT DNA binding domain of Jumonji/ARID domain-containing protein 2 (JARID2) and ...
1075-1147 1.83e-05

ARID/BRIGHT DNA binding domain of Jumonji/ARID domain-containing protein 2 (JARID2) and similar proteins; JARID2, also called protein Jumonji, is a DNA-binding protein that contains both the Jumonji C (JmjC) domain and AT-rich DNA-interacting domain (ARID, also known as BRIGHT). It is an interacting component of Polycomb repressive complex-2 (PRC2) that catalyzes methylation of lysine 27 of histone H3 (H3K27) and regulates important gene expression patterns during development. It exhibits nucleosome-binding activity that contributes to PRC2 stimulation. However, unlike other JmjC domain-containing proteins, JARID2 is catalytically inactive due to the lack of conserved residues essential for histone demethylase activity. JARID2 is also involved in transforming growth factor-beta (TGF-beta)-induced epithelial-mesenchymal transition (EMT) of lung and colon cancer cell lines through the modulation of histone H3K27 methylation. Moreover, JARID2 is a part of GLP- and G9a-containing protein complex that promotes lysine 9 on histone H3 (H3K9) methylation on the cyclin D1 promoter and silences the expression of cyclin D1 and other cell cycle genes. It functions as a transcriptional repressor that plays critical roles in embryonic development including heart development in mice, and regulates cardiomyocyte proliferation via interaction with retinoblastoma protein (Rb), one of the master regulatory genes of the cell cycle. Furthermore, JARID2 acts as a transcriptional repressor of target genes, including Notch1. It directly binds to SETDB1 (SET domain, bifurcated 1) to form a complex that plays an important role in a novel process involving the modification of H3K9 methylation during heart development. Meanwhile, JARID2 is a key transcriptional repressor that plays a role in invariant natural killer T (iNKT) cell maturation. It regulates promyelocytic leukemia zinc finger (PLZF) expression by linking T-cell receptor (TCR) signaling to H3K9me3. JARID2 polymorphisms are associated with non-syndromic orofacial clefts (NSOC) susceptibility.


Pssm-ID: 350634  Cd Length: 112  Bit Score: 45.68  E-value: 1.83e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 145553997 1075 MEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNV-GTSSSAASSLKKQYIQYLFAF 1147
Cdd:cd16870    16 LESQGINLTPPPLIGGCELDLPRLYHLVQELGGMQQVTDKKKWNKVADHLNIpKTAQDRPSKLQDAYCKYLLSY 89
ARID_KDM5A cd16873
ARID/BRIGHT DNA binding domain of lysine-specific demethylase 5A (KDM5A); KDM5A, also called ...
1067-1148 6.27e-05

ARID/BRIGHT DNA binding domain of lysine-specific demethylase 5A (KDM5A); KDM5A, also called histone demethylase JARID1A, Jumonji/ARID domain-containing protein 1A, or Retinoblastoma-binding protein 2 (RBBP-2 or RBP2), was originally identified as a retinoblastoma protein (Rb)-binding partner; its inactivation may be important for Rb to promote differentiation. It is involved in transcription through interacting with TBP, p107, nuclear receptors, Myc, Sin3/HDAC, Mad1, RBP-J, CLOCK and BMAL1. KDM5A functions as the trimethylated histone H3 lysine 4 (H3K4me3) demethylase that belongs to the JARID subfamily within the JmjC proteins. It also displays DNA-binding activities that can recognize the specific DNA sequence CCGCCC. KDM5A contains the catalytic JmjC domain, a JmjN domain, an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a C5HC2 zinc finger, as well as three plant homeodomain (PHD) fingers.


Pssm-ID: 350637  Cd Length: 92  Bit Score: 43.71  E-value: 6.27e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1067 WVDRYLTFMEERGSPVSsLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFA 1146
Cdd:cd16873     7 FLDQLAKFWELQGSTLK-IPVVERKILDLYALSKIVASEGGFEMVTKEKKWSKVGSRMGYLPGKGTGSLLKSHYERILYP 85

                  ..
gi 145553997 1147 FE 1148
Cdd:cd16873    86 YE 87
ARID_KDM5C_5D cd16875
ARID/BRIGHT DNA binding domain of lysine-specific demethylase KDM5C and KDM5D; This group ...
1067-1148 7.26e-05

ARID/BRIGHT DNA binding domain of lysine-specific demethylase KDM5C and KDM5D; This group includes KDM5C and KDM5D, both of which belong to the JARID subfamily within the JmjC proteins. KDM5C, also called histone demethylase JARID1C, Jumonji/ARID domain-containing protein 1C, protein SmcX, or protein Xe169, is a H3K4 trimethyl-histone demethylase that catalyzes demethylation of H3K4me3 and H3K4me2 to H3K4me1. It plays a role in neuronal survival and dendrite development. KDM5C defects are associated with X-linked mental retardation (XLMR). KDM5D, also called histocompatibility Y antigen (H-Y), histone demethylase JARID1D, Jumonji/ARID domain-containing protein 1D, or protein SmcY, is a male-specific antigen that shows a demethylase activity specific for di- and tri-methylated histone H3K4 (H3K4me3 and H3K4me2), and has a male-specific function as a histone H3K4 demethylase by recruiting a meiosis-regulatory protein, MSH5, to condensed DNA. KDM5D directly interacts with a polycomb-like protein Ring6a/MBLR, and plays a role in regulation of transcriptional initiation through H3K4 demethylation. Both KDM5C and KDM5D contain the catalytic JmjC domain, a JmjN domain, an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a C5HC2 zinc finger, as well as two plant homeodomain (PHD) fingers.


Pssm-ID: 350639  Cd Length: 92  Bit Score: 43.37  E-value: 7.26e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1067 WVDRYLTFMEERGSPVSsLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFA 1146
Cdd:cd16875     7 YLDQIAKFWEIQGSSLK-IPNVERRILDLYSLSKIVQEEGGYEAICKDRRWARVAQRLGYPPGKNIGSLLRSHYERIIYP 85

                  ..
gi 145553997 1147 FE 1148
Cdd:cd16875    86 YE 87
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
563-860 8.17e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 48.22  E-value: 8.17e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   563 PPQAQYLQPAAAQSQQRYQPQQDMSQEGYGTRSQPPLAPgKSNHEDLNLIQQERPSSLPDLSGSIDDLPTGTEATLSSAV 642
Cdd:pfam03154  183 PPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQT-QSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQV 261
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   643 SASGSTSSQGDQSNPAQsPFSPHASPHLSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPgSQSESSSHPA 722
Cdd:pfam03154  262 SPQPLPQPSLHGQMPPM-PHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPP-SQSQLQSQQP 339
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   723 LSQSPMPQERGFMTGTQ----------RNPQMSQYGPQQTGPS-------------------------MSPHPSP----- 762
Cdd:pfam03154  340 PREQPLPPAPLSMPHIKpppttpipqlPNPQSHKHPPHLSGPSpfqmnsnlppppalkplsslsthhpPSAHPPPlqlmp 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   763 -GGQMHPGISNFQQSNSSGTYGPQMSQYGPQGNYSRTPTYSGVPSASYSGPGPGMGINANNQMHGQGPAQPCGAMPLGRM 841
Cdd:pfam03154  420 qSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSAS 499
                          330
                   ....*....|....*....
gi 145553997   842 PSAGMqnrPFPGTMSSVTP 860
Cdd:pfam03154  500 VSSSG---PVPAAVSCPLP 515
ARID_KDM5B cd16874
ARID/BRIGHT DNA binding domain of lysine-specific demethylase 5B (KDM5B); KDM5B, also called ...
1067-1147 2.41e-04

ARID/BRIGHT DNA binding domain of lysine-specific demethylase 5B (KDM5B); KDM5B, also called cancer/testis antigen 31 (CT31), histone demethylase JARID1B, Jumonji/ARID domain-containing protein 1B (JARID1B), PLU-1, or retinoblastoma-binding protein 2 homolog 1 (RBP2-H1 or RBBP2H1A), is a member of the JARID subfamily within the JmjC proteins. It has a restricted expression pattern in the testis, ovary, and transiently in the mammary gland of the pregnant female and has been shown to be upregulated in breast cancer, prostate cancer, and lung cancer, suggesting a potential role in tumorigenesis. KDM5B acts as a histone demethylase that catalyzes the removal of trimethylation of lysine 4 on histone H3 (H3K4me3), induced by polychlorinated biphenyls (PCBs). It also mediates demethylation of H3K4me2 and H3K4me1. Moreover, KDM5B functions as a negative regulator of hematopoietic stem cell (HSC) self-renewal and progenitor cell activity. KDM5B has also been shown to interact with the DNA binding transcription factors BF-1 and PAX9, as well as TIEG1/KLF10 (transforming growth factor-beta inducible earlygene-1/Kruppel-like transcription factor 10), and possibly function as a transcriptional corepressor. KDM5B contains the catalytic JmjC domain, a JmjN domain, an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), a C5HC2 zinc finger, as well as three plant homeodomain (PHD) fingers.


Pssm-ID: 350638  Cd Length: 90  Bit Score: 41.85  E-value: 2.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1067 WVDRYLTFMEERGSPVSsLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNLNVGTSSSAASSLKKQYIQYLFA 1146
Cdd:cd16874    10 FLDQIAKFWELQGCTLK-IPHVERKILDLFQLNKLVAEEGGFDLVCKERKWTKIATKMGFAPGKAVGSHIRAHYERILYP 88

                  .
gi 145553997 1147 F 1147
Cdd:cd16874    89 Y 89
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
648-961 2.60e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 46.15  E-value: 2.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   648 TSSQGDQSNPAQSPFSPHASPHLSSIPGGPSPSPVGSPVGSNQSRSGPispasiPGSQMPPQ-----PPGSQSESSSHPA 722
Cdd:pfam09606  148 RMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQAGGMNGGQQGP------MGGQMPPQmgvpgMPGPADAGAQMGQ 221
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   723 LSQ--SPMPQERGFMTGTQRNPQMSQYGPQQTGPSMSPHPSPGGQMHPGISNFQQSNSSGTYGPQMSQ----YGPQGNYS 796
Cdd:pfam09606  222 QAQanGGMNPQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQqpgaMPNVMSIG 301
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   797 RTPTYSGVPSASYSGPGPGMGINA--NNQMHGQGPAQPCGAMPLGRMPSAGMQNRPFPGTMSSVTPSSPGMSQQGGPGMG 874
Cdd:pfam09606  302 DQNNYQQQQTRQQQQQQGGNHPAAhqQQMNQSVGQGGQVVALGGLNHLETWNPGNFGGLGANPMQRGQPGMMSSPSPVPG 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   875 PPMPTVNRKAQEAAAAVMQAAANSAQSRQGSFPGMNQS----GLVASSSPYSQSMNNNSSLMSTQAQPYSMTptmvnsst 950
Cdd:pfam09606  382 QQVRQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPGGMipspALIPSPSPQMSQQPAQQRTIGQDSPGGSLN-------- 453
                          330
                   ....*....|.
gi 145553997   951 aSMGLADMMSP 961
Cdd:pfam09606  454 -TPGQSAVNSP 463
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
630-932 5.00e-04

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.95  E-value: 5.00e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   630 LPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPHL-SSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPP 708
Cdd:pfam17823  130 LPAAIAALPSEAFSAPRAAACRANASAAPRAAIAAASAPHAaSPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPA 209
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   709 QPPGSQSESSSHPALSQ--------SPMPQERGFMTGTQRNPQM-----------SQYGPQQTGPSMSPHPSPGGQMHpg 769
Cdd:pfam17823  210 RGISTAATATGHPAAGTalaavgnsSPAAGTVTAAVGTVTPAALatlaaaagtvaSAAGTINMGDPHARRLSPAKHMP-- 287
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   770 iSNFQQSNSSGTYGPQMSqyGP-------QGNYSRTPTYSGVPSASYSGPGPGMGINANN-------QMHGQGP-AQPCG 834
Cdd:pfam17823  288 -SDTMARNPAAPMGAQAQ--GPiiqvstdQPVHNTAGEPTPSPSNTTLEPNTPKSVASTNlavvtttKAQAKEPsASPVP 364
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   835 AMPLGRMP-----SAGMQNRPFPGTMSSVTPSSPGMSQQGGPGMGPPMPTVNRKAQEAAAAVMQAAANSAQSRQGSFpgm 909
Cdd:pfam17823  365 VLHTSMIPeveatSPTTQPSPLLPTQGAAGPGILLAPEQVATEATAGTASAGPTPRSSGDPKTLAMASCQLSTQGQY--- 441
                          330       340
                   ....*....|....*....|...
gi 145553997   910 nqsgLVASSSPYSQSMNNNSSLM 932
Cdd:pfam17823  442 ----LVVTTDPLTPALVDKMFLL 460
dnaA PRK14086
chromosomal replication initiator protein DnaA;
1342-1552 5.08e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 45.20  E-value: 5.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1342 PGQGPPTGQPPYGGHQPGlyPQQPNY-KRHMDGMYGPPAKRHEgdmynmQYGSQQQEMynqyggsysgPDRRPIQGQY-- 1418
Cdd:PRK14086   90 PSAGEPAPPPPHARRTSE--PELPRPgRRPYEGYGGPRADDRP------PGLPRQDQL----------PTARPAYPAYqq 151
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1419 ---PYPYNRERMQGPGQMQPHGIPPqmmGGPMQSSSSEGPQQNMWAT-----RNDMPYPYQSRQGPGGPAQAPpypgmnR 1490
Cdd:PRK14086  152 rpePGAWPRAADDYGWQQQRLGFPP---RAPYASPASYAPEQERDREpydagRPEYDQRRRDYDHPRPDWDRP------R 222
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 145553997 1491 TDDMMVPeqrinhesqWPSHVSQRQPYMSSSASMQPITRPPQSSYQTPPSLPNHISRAPSPA 1552
Cdd:PRK14086  223 RDRTDRP---------EPPPGAGHVHRGGPGPPERDDAPVVPIRPSAPGPLAAQPAPAPGPG 275
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
623-828 1.02e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 44.18  E-value: 1.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   623 LSGSIDDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPhaSPHLSSIPGGPSPSPVGSPvgsnqSRSGPISPASI- 701
Cdd:pfam17823  241 VTAAVGTVTPAALATLAAAAGTVASAAGTINMGDPHARRLSP--AKHMPSDTMARNPAAPMGA-----QAQGPIIQVSTd 313
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   702 -PGSQMPPQPPGSQSESSSHPALSQSPMPQERGFMTGTQrnpqmsqygPQQTGPSMSPHPSPGGQMHPGISNFQQSN--- 777
Cdd:pfam17823  314 qPVHNTAGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTK---------AQAKEPSASPVPVLHTSMIPEVEATSPTTqps 384
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 145553997   778 ----SSGTYGPQMSQYGPQGNYSRTP-TYSGVPSASYSGpGPGMGINANNQMHGQG 828
Cdd:pfam17823  385 pllpTQGAAGPGILLAPEQVATEATAgTASAGPTPRSSG-DPKTLAMASCQLSTQG 439
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
652-1005 1.25e-03

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 44.23  E-value: 1.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   652 GDQSNPAQSPFSPHASPHLSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPGSQSESSSHPALSQSPMPQE 731
Cdd:pfam09606  105 GPGGPMGQQMGGPGTASNLLASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGQ 184
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   732 RGFMTGTQRNPQMSQYGPQQTGPSMSPHPSPGGQMHpgisnfQQSNSSGTYGPQMSQygpqGNYSRTPTYSGVPSASYSG 811
Cdd:pfam09606  185 AGGMNGGQQGPMGGQMPPQMGVPGMPGPADAGAQMG------QQAQANGGMNPQQMG----GAPNQVAMQQQQPQQQGQQ 254
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   812 PGPGMGINANNQM-HGQGPAQPCGaMPLGRMPSAGMQNRPFPGTMSSVTP-------SSPGMSQQGGPGMGPPMPTVNRK 883
Cdd:pfam09606  255 SQLGMGINQMQQMpQGVGGGAGQG-GPGQPMGPPGQQPGAMPNVMSIGDQnnyqqqqTRQQQQQQGGNHPAAHQQQMNQS 333
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   884 AQEAAAAVMQAAANSAQSR-QGSFPGMN-------QSGLVASSSPYS----QSMNNNSSLMSTQAQ--PYSMTPTMVNSS 949
Cdd:pfam09606  334 VGQGGQVVALGGLNHLETWnPGNFGGLGanpmqrgQPGMMSSPSPVPgqqvRQVTPNQFMRQSPQPsvPSPQGPGSQPPQ 413
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 145553997   950 TASMGladmMSPSESKLSVPlkadgKEEGVSQPESKSkdSYGSQGISQP-PTPGNLP 1005
Cdd:pfam09606  414 SHPGG----MIPSPALIPSP-----SPQMSQQPAQQR--TIGQDSPGGSlNTPGQSA 459
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
1170-1516 2.43e-03

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 43.01  E-value: 2.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1170 QPKLQPPSPANSGSLQGPQTPQSTGSNSMAEVPGDLKPPTPASTPHGQMTPMQSGRSStvsvHDPFSDVSDSAYPKRNSM 1249
Cdd:pfam03157  279 QQGYYPTSLQQPGQGQSGYYPTSQQQAGQLQQEQQLGQEQQDQQPGQGRQGQQPGQGQ----QGQQPAQGQQPGQGQPGY 354
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1250 TPNAPYQQGMGMPDMMGRMPYEPNKDPF-----SGMRKVPGSSEPFMTQGQVPNSGMQDMYNQSPSGA---------MSN 1315
Cdd:pfam03157  355 YPTSPQQPGQGQPGYYPTSQQQPQQGQQpeqgqQGQQQGQGQQGQQPGQGQQPGQGQPGYYPTSPQQSgqgqpgyypTSP 434
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1316 LGMGQRQQFPYGTSYDRRHEAYGQQyPGQGPPTGQPPYGGH-------QPGLYP---QQPNYKRHMDGMYGPPAKRHEGD 1385
Cdd:pfam03157  435 QQSGQGQQPGQGQQPGQEQPGQGQQ-PGQGQQGQQPGQPEQgqqpgqgQPGYYPtspQQSGQGQQLGQWQQQGQGQPGYY 513
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1386 MYNMQYGSQQQEMYNQYGGSYSGPDRRPIQGQYPYPYNRERMQGPGQMQPHGIPPQMMGGPMQSSSSEGPQQNMWATRND 1465
Cdd:pfam03157  514 PTSPLQPGQGQPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQ 593
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|...
gi 145553997  1466 MPYPYQSRQGPGGPAQAPPY--PGMNRTDDMMVPEQRINHESQWPSHVSQRQP 1516
Cdd:pfam03157  594 PGYYPTSPQQSGQGQQPGQWqqPGQGQPGYYPTSSLQLGQGQQGYYPTSPQQP 646
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
1386-1525 3.14e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 42.49  E-value: 3.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1386 MYNMQYGSQQQEMYNQYggSYSGPDRRPIQGQYPYPYNRERMQGPGQMQPHGIPPQMMGGPMQSSSSEGPQQNMWATRND 1465
Cdd:TIGR01628  382 MRQLPMGSPMGGAMGQP--PYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAPSRNAQNAAQKPPM 459
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1466 MPYPYQsrqgpggpaqaPPYPGMNRTDDMMVPEQRINHESQWPSHVSQRQpymSSSASMQ 1525
Cdd:TIGR01628  460 QPVMYP-----------PNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLA---SATPQMQ 505
ARID_ARID5B cd16885
ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 5B (ARID5B) ...
1063-1156 4.15e-03

ARID/BRIGHT DNA binding domain of AT-rich interactive domain-containing protein 5B (ARID5B) and similar proteins; ARID5B, also called MRF1-like protein or modulator recognition factor 2 (MRF-2), is a DNA-binding protein that directly interacts with plant homeodomain (PHD) finger 2 (PHF2) to form a protein kinase A (PKA)-dependent PHF2-ARID5B histone H3K9Me2 demethylase complex, which is a signal-sensing modulator of histone methylation and gene transcription. It also functions as a transcriptional co-regulator for the transcription factor sex determining region Y (SRY)-box protein 9 (Sox9) and promotes chondrogenesis through histone modification. Moreover, ARID5B is highly expressed in the cardiovascular system and may play essential roles in the phenotypic change of smooth muscle cells (SMCs) through its regulation of SMC differentiation. Its polymorphism has been associated with risk for pediatric acute lymphoblastic leukemia (ALL). ARID5B contains an AT-rich DNA-interacting domain (ARID, also known as BRIGHT), which can bind both the major and minor grooves of its target sequences.


Pssm-ID: 350649  Cd Length: 95  Bit Score: 38.52  E-value: 4.15e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1063 ERKLWVDRYlTFMEERGSPVSSLPAVGKKPLDLFRLYVCVKEIGGLAQVNKNKKWRELATNL--NVGTSSSAASSlKKQY 1140
Cdd:cd16885     2 EQAFLVALY-KYMKERKTPIERIPYLGFKQINLWTMFQAAQKLGGYETITARRQWKHIYDELggNPGSTSAATCT-RRHY 79
                          90
                  ....*....|....*.
gi 145553997 1141 IQYLFAFEcKTERGEE 1156
Cdd:cd16885    80 ERLILPYE-RFIKGEE 94
PRK12495 PRK12495
hypothetical protein; Provisional
632-722 4.31e-03

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 41.01  E-value: 4.31e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  632 TGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPHASPH----LSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMP 707
Cdd:PRK12495   78 AGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPeassTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSP 157
                          90
                  ....*....|....*...
gi 145553997  708 ---PQPPGSQSESSSHPA 722
Cdd:PRK12495  158 rqrPPVSGEPPTPSTPDA 175
dnaA PRK14086
chromosomal replication initiator protein DnaA;
658-863 4.63e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.12  E-value: 4.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  658 AQSPFSPHASPHLSSIPGGPSPSPVGSPvGSNQSRSGPISPASIPGSQMPPQPPGsqsesssHPALSQSPMPqergfmtG 737
Cdd:PRK14086   92 AGEPAPPPPHARRTSEPELPRPGRRPYE-GYGGPRADDRPPGLPRQDQLPTARPA-------YPAYQQRPEP-------G 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  738 TQRNPQMSqYGPQQTG---PSMSPHPSPgGQMHPGISNFQQSnssgtYGPQMSQYG-PQGNYSRTPTYSGVPSASYS--- 810
Cdd:PRK14086  157 AWPRAADD-YGWQQQRlgfPPRAPYASP-ASYAPEQERDREP-----YDAGRPEYDqRRRDYDHPRPDWDRPRRDRTdrp 229
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....
gi 145553997  811 GPGPGMGinaNNQMHGQGPAQPCGAMPLGRMPSAGMqnrPFPGTMSSVT-PSSP 863
Cdd:PRK14086  230 EPPPGAG---HVHRGGPGPPERDDAPVVPIRPSAPG---PLAAQPAPAPgPGEP 277
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
615-792 4.84e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 4.84e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  615 ERPSSLPDLSGSIDDLPTGTEATLSSAVSASGSTSSQGDQSNPAQSPFSPhASPHLSSIPGGPSPSPVGSPVGSNQSRSG 694
Cdd:PRK07764  599 GPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVA-APEHHPKHVAVPDASDGGDGWPAKAGGAA 677
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  695 PISPASIPGSQMPPQPPGsQSESSSHPALSQSPMPQERGFMTGTQRNPQMSQYGPQQTG-----PSMSPHPSPGGQMHPG 769
Cdd:PRK07764  678 PAAPPPAPAPAAPAAPAG-AAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAAddpvpLPPEPDDPPDPAGAPA 756
                         170       180
                  ....*....|....*....|...
gi 145553997  770 ISNFQQSNSSGTYGPQMSQYGPQ 792
Cdd:PRK07764  757 QPPPPPAPAPAAAPAAAPPPSPP 779
SAP130_C pfam16014
Histone deacetylase complex subunit SAP130 C-terminus;
642-771 5.45e-03

Histone deacetylase complex subunit SAP130 C-terminus;


Pssm-ID: 464973 [Multi-domain]  Cd Length: 371  Bit Score: 41.46  E-value: 5.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997   642 VSASGSTSSQGDQSNPAQSPFSPHASPHLSSI---PGGPS-PSPVGSPVGSNQSRSGPIsPASIPGSQMPPQPPGSQSES 717
Cdd:pfam16014   36 VAVEALPGQNSEQQTASASPPSQHPAQAIPTIlapAAPPSqPSVVLSTLPAAMAVTPPI-PASMANVVAPPTQPAASSTA 114
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 145553997   718 S--SHPALSQSPMPQERGFMTGTQRNPQMSQYGPQQTGPSMSPHPS-PGGQMHPGIS 771
Cdd:pfam16014  115 AcaVSSVLPEIKIKQEAEPMDTSQSVPPLTPTSISPALTSLANNLSvPAGDLLPGAS 171
PHA03377 PHA03377
EBNA-3C; Provisional
1144-1421 6.18e-03

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 41.96  E-value: 6.18e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1144 LFAFECKTERGEEPPPEVF-STGDSKKQPKLQPPSPANSGSLQGPQTPQSTGSNSMAEVPgdlkpptpaSTPHGQMTPMQ 1222
Cdd:PHA03377  632 MFLRERLLEQSTGPKPKSFwEMRAGRDGSGIQQEPSSRRQPATQSTPPRPSWLPSVFVLP---------SVDAGRAQPSE 702
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1223 SGRSSTVSVHDPFSDVSDSAYpkRNSMTPNAPYQQGMGMPDMMGRMPYEPNKDPFSGMRKVPGSSEPfmTQGQVPNSGMQ 1302
Cdd:PHA03377  703 ESHLSSMSPTQPISHEEQPRY--EDPDDPLDLSLHPDQAPPPSHQAPYSGHEEPQAQQAPYPGYWEP--RPPQAPYLGYQ 778
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997 1303 DmynQSPSGAMSNLGMGQRQQFPYGTSYDR-RHE-AYGQQYPGQGPPTGqpPYGGHQPGLYPQQPNYKRH-MDGMYGPPA 1379
Cdd:PHA03377  779 E---PQAQGVQVSSYPGYAGPWGLRAQHPRyRHSwAYWSQYPGHGHPQG--PWAPRPPHLPPQWDGSAGHgQDQVSQFPH 853
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 145553997 1380 KRHEGDMYNMQYGSQQQEMYNQYGGSYSGPDR---------RPIQGQYPYP 1421
Cdd:PHA03377  854 LQSETGPPRLQLSQVPQLPYSQTLVSSSAPSWsspqprapiRPIPTRFPPP 904
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
639-764 6.82e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 41.79  E-value: 6.82e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  639 SSAVSASGSTSSQGDQSNPAQSPFSPHASPHLSSIPGGPSPSPVGSPVGSNQSRSGPISPASIPGSQMPPQPPGSQSESS 718
Cdd:PRK12323  372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPA 451
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*.
gi 145553997  719 SHPAlsQSPMPQERGFMTGTQRNPQMSQYGPQQTGPSMSPHPSPGG 764
Cdd:PRK12323  452 PAPA--AAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDD 495
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
1331-1484 8.02e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 41.33  E-value: 8.02e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 145553997  1331 DRRHEAYGQQYPGQGPPTGQPPYGGHQPGLYPQQPNYKRHMDGMYGPPAKRHEGdmyNMQYGSQQQEMynqyggsysgpd 1410
Cdd:TIGR01628  365 EQRRAHLQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPR---MSMMPTPMGPG------------ 429
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 145553997  1411 rRPIQGQYPYPYNRERMQGPGQMQPHGIPPQMmggPMQSSSSEGPQQNMWATRNDMPYPYQSRQGPGGPAQ---APP 1484
Cdd:TIGR01628  430 -GPLRPNGLAPMNAVRAPSRNAQNAAQKPPMQ---PVMYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVlasATP 502
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH