Comparative analysis of the protein sequences encoded in the genomes of three families of large DNA viruses that replicate, completely or partly, in the cytoplasm of eukaryotic cells (poxviruses, asfarviruses, and iridoviruses) and phycodnaviruses that replicate in the nucleus reveals 9 genes that are shared by all of these viruses and 22 more genes that are present in at least three of the four compared viral families. Although orthologous proteins from different viral families typically show weak sequence similarity, because of which some of them have not been identified previously, at least five of the conserved genes appear to be synapomorphies (shared derived characters) that unite these four viral families, to the exclusion of all other known viruses and cellular life forms. Cladistic analysis with the genes shared by at least two viral families as evolutionary characters supports the monophyly of poxviruses, asfarviruses, iridoviruses, and phycodnaviruses. The results of genome comparison allow a tentative reconstruction of the ancestral viral genome and suggest that the common ancestor of all of these viral families was a nucleocytoplasmic virus with an icosahedral capsid, which encoded complex systems for DNA replication and transcription, a redox protein involved in disulfide bond formation in virion membrane proteins, and probably inhibitors of apoptosis. The conservation of the disulfide-oxidoreductase, a major capsid protein, and two virion membrane proteins indicates that the odd-shaped virions of poxviruses have evolved from the more common icosahedral virion seen in asfarviruses, iridoviruses, and phycodnaviruses. |
------------------------------KILAN------------------------------------------------------------------------------
#
KilAN+ kilAC
-----------------------------------------------------
125404 KILA_BPP1 kilA N 1-128 + 143-266 (kilAC)
# kilA N + Bro-aC
----------------------------------------------------
9964424 AMV110
9964414 AMV100
9964341 AMV027
9964550 AMV236
9964470 AMV156
15078718 CIV006L 231-352 (CIV029)
9634794 FPV124 N1R/p28 gene family prote. + BROC (217-271)
2407299 17K ORF [Heliothis armigera. + BROC.
9964426 AMV112 + BROC
9964338 AMV024 + BROC
# kilAN+ Orf11D3
----------------------------
9635595 Orf11 [Pseudomonas phage D3] >gi|889...
13559861 unknown [Bacteriophage HK620] >gi|1.. + Orf``D3 C-terminus (105-161)
# kilAN + T5orf172
------------------------------
15079027 CIV315L + Bro-E Cterminus 96-201
# kilA N (the Cs are distinct and not conserved
-----------
11281012 hypothetical protein NMB0900 + C- nothing
11345968 phage-related protein XF2294 + C-nothing
11290039 hypothetical protein NMA1544 [imported] .... C distinct but not conserved
9634745 ORF FPV075 N1R/p28 gene family prote..... C-distinct but not conserved
9634833 C- distinct but not conserved FPV163
9634829 ORF FPV159 N1R/p28 gene family prote.. C-distinct but not conserved
9634825 ORF FPV155 N1R/p28 gene family prote.... C distinct but not conserved... low complexity
9634906 ORF FPV236 N1R/p28 gene family prote..C distinct but not conserved... low complexity
9634918 ORF FPV248 N1R/p28 gene family prote..--- little extension.. nothing
9634831 ORF FPV161 N1R/p28 gene family prote..----little extension.. nothing
1777419 ORF4 [Fowlpox virus].........little extension.. nothing
9964446 AMV132- C matches low complexity region C -terminal to MSV199-T5orf172
# kilaN + CIV029R/BROC
-----------------
15079025 313L 10-130 (kilAN) + 130-237 (MSV199) 237-353 (CIV029R/BROC)
#P63C + kilAN
------------
9634189 Gp73 [Bacteriophage HK97] >gi|690161.. kilaN inserted into 9632501 (or 4499795) 933W orf12
#kilAN + RING
---------------
9634827 ORF FPV157 N1R/p28 gene family prote.... 2 RINGS or 1 zinc + RING
9634820 ORF FPV150 N1R/p28 gene family prote..
9633890 gp143R [Rabbit fibroma virus] >gi|46..
12085126 143R protein [Yaba-like disease vir..
9633779 m143R [Myxoma virus] >gi|6523998|gb|..
6682986 Yb-C4R [Yaba monkey tumor ..
----------------------------------MSV199 like--------------------------------------------------------------------------------------------------------------------------
# MSV199like solo
------------
9631447 MSV199 fragment of MSV198
CIV200R fragment
# MSV199like motif + UVRC
--------------------------------
15078859 CIV146R 1-118 (UVRC domain) 143-243 (MSV199like motif)
# N-MSV199like motif + CIV029R/BROC like
--------------------
15079179 CIV468L 1-177 + C 177-376 (CIV029R/BROC)
15079099 CIV388R 1-175 + C 226-344 (CIV029)
15078923 CIV211L 1-180 + C 259-381 (CIV029R/BROC)
15078924 CIV212L 30-155 + C 238-360 (CIV029R/BROC)
15078950 CIV238R N+ 86-230 (MSV199like) + C 315-436 (29R/BROC)
15078732 CIV019R N + 136-274 (MSV199like)
15078861 148R _CIV 61-191 (MSV199like???) + C _ 265-330 (414) + BROC (330-414)
#
MSV199+C Bro-e C terminus
--------------------------
9964508 AMV194 N? + 66-215 (MSV199like) + C Bro-e (252-358
9631448 MSV198 1-155 (MSV199like) + C Bro-e (192-292)
9631453 MSV191 1-120 (MSV199like) + C
9964523 AMV209 N + 72-215 (MSV199like) + C Bro-e 257-356
9964521 AMV207 77-228 + C 270-369 Bro-eC
15079131 CIV420R 1-154 + C Bro-e (234-327) +
9631537 52-144(T5orf172 + MSV021 (MSV199like) 105-250 +
-------------------------------T5orf172---------------------------------------------------------------------------------------------------------------------------------
# BRO-e C-terminus Looks like a uvrC nuclease to me
-------------------------------------------------
9631041 Ld-bro-f [Lymantria dispar nucleopol.. 10-129 (solo)
281258 hypothetical protein - phage T5 >gi|579090.. 65-158 probably solo
93750 hypothetical protein 172 - phage T5 65-158 probably solo
7474985 hypothetical protein yeeC - Bacillus subt.. N--nothing + 265-374
14194257 hypothetical pro.. N--nothing + 137-233 (Unidentified bacterium)
8346568| phage P27 N--nothing + 266-376
11345564 hypothetical protein NMB1132, NMB1170 [i.. 2-73 + C low complexity
15079171 460R CIV 4-88
--------------------------------- BRON-------------------------------------------------------------------------------------------------------------------------
# P22ANT-N + BroA like N-terminus + P22ARC
------------------------------------------
9635550 P22-ant 130-207 + -199-272
# Bro-A N + T5orf172
--------------------------
15079001 CIV 289L N-BroN (1-120) 186-299 Bro-e C
15078913 CIV 201R N-Bro N (1-188) 188-301 Bro-eC
13751084 (AJ309235) Bro-I protein [Bombyx mor.. 1-111 (BRON) + 111-220
9630900 BRO-b [Bombyx mori nuclear polyhedro.. 1-111 (BRON) + 111-218
9630956 BRO-e [Bombyx mori nuclear polyhedro.. 1-111 (BRON) + 111-220
9631082 Ld-bro-k [Lymantria dispar nucleopol.. 1-108 (BRON) + 108-217
9631117 Ld-bro-m [Lymantria dispar nucleopol. . 1-102 (BRON) + 137-233
9631452 ORF MSV194 ALI motif gene family pro.. 1-100 (BRON) + 175-290
9631535 ORF MSV023 ALI motif gene family pro.. ~1-100 (BRON) + 145-257
12597544 Heliocoverpa armigera nucleopo.. 1-145 BRON+ 145-243
9635380 ORF130 [Xestia c-nigrum granulovirus.. 1-84 BRON 84-197 C
9631042 Ld-bro-g [Lymantria dispar nucleopol... 17-100 (Bro-aN) + 100-222
13242588 Esv-1-117
# BroA-N + kilAC
---------------------------
13095813 bIL309 BRON + 137-247(kilAC)
1395130 LL-H _ BRON + 152-258 (kilAC)
1362213 2-139 (BRON) + 139-247(kilAC)..
1251473 prophage CP-933N 1-123 (BRON) + 117-229 (kilAC)
14246624 1-139 (BRON) + 139-252(kilAC)
14251162 BK5-T 1-138 (BRON) + 140-256 (kilAC)
9635686 phiPV83 1-143 (BRON) + 143-256 (kilAC)
1353522 ORF5_r1t 1-137 (BRON) + 139-255 (kilAC)
13622137 putative antirepressor - p... 6-92 + kilAC
# BRON+ P63
--------------
>gi|15320633 p63 Bacteriophage Mx8 : Myxococcus xanthus
#BroA like N-terminus + P22ARC
------------------------------------------------------------
12514734 putative antire - 4-122+ 191-264
1175791 HI1418 11-124 + - 137-194
# Bro-aN
--------------------------------------------------------------------------------------------------
9964369 AMV055 [Amsacta moorei entomopoxviru... solo (BRON)
9631040 Ld-bro-e [Lymantria dispar nucleopol... 1-82 (solo)
9631451 ORF MSV195 ALI motif gene family pro... solo (BRON)
9631397 ORF MSV226 hypothetical protein [Mel... 3-95 (solo)
1395127 putative [Bacteriophage LL-H] solo -- truncated?
12697190 putative antirepressor [N... 5-90 + C or probably solo
6599316 Broa-N solo
13623111 hypothetical protein - pha... 8-89 + C -- nothing
7480004 othetical protein SCGD3.15 - Streptomy... 17-120 + C nothing
11349554 othetical protein PA2423 [imported] -... N-nothing 99-251 (bro) + C nothing
11349113 othetical protein PA1153 [imported] -... 1-139
9964576 AMV262 [Amsacta moorei entomopoxviru... 1-100 (BRON) + C nothing
9635312 ORF62 [Xestia c-nigrum granulovirus]... 28-121 + C nothing
AMV055
#BRON duplication
------------------------
11068085 PxORF82 peptide [Plutella xylostell... 120-200(BRON), 250-325 (BRON)
13160526 F274292) unknown [Culex nigripalpus... 36-100(BRON), 149-256 (BRON) + C nothing
# Bro-aN + BROC
-----
9799895 hypothetical protein [Antica... 1-112 + C \
93042 othetical protein ORF2, ptp-region [impo... 1-113 + C |
10442572 38.7 kD-like pr. +295-390 (b |
347406 24 kDa ORF [Autographa califor... 45-139 + C + 147-206 \
9627755 AcOrf-13 peptide [Autograph +215-326 |
5565846 AcMNPV ORF13 homo. +107-207 |
9627744 baculovirus repeated ORF [Autographa... 1-113 + C 133- |
9629950 unknown [Orgyia pseudotsuga BRON ? +214-316 |
9635364 ORF114 [Xestia c-nigrum granulovirus... N-nothing 211-384 + C..427
9635326 ORF76 [Xestia c-nigrum granulovirus]... N + 143-236
9635409 ORF159 [Xestia c-nigrum granulovirus. Broa N 48-153 + 281-392 (BROC)
9631120 Ld-bro-n [Lymantria dispar nucleopol... 1-116 + C |
9631081 Ld-bro-j [Lymantria dispar nucleopol... 1-113 + C /
9631128 Ld-bro-p [Lymantria dispar nucleopol. BRON 1-134 + 63-178 (BROC)
9630998 Ld-bro-a [Lymantria dispar nucleopol. Broa N 1-102 +216-327 (BROC)
9631113 Ld-bro-l [Lymantria dispar nucleopol. Broa N 1-108 +222-333 (BROC)
9631121 Ld-bro-o [Lymantria dispar nucleopol. Broa N 1-112 +205-316 (BROC)
9630999 Ld-bro-b [Lymantria dispar nucleopol. Broa N (1-113) +202-313 (BROC)
12597545 bro [Heliocoverpa armigera nucleopo. 1-107 BRON + BRON (183-284)+391-502 (BROC)- duplication of BRON
13751087 (AJ309236) Bro-II protein [Bombyx mo. BRON (1-115) +197-306 (BROC)
9630821 AcMNPV orf13 [Bombyx mori n 49-143 (BRON)+ 219-330 |
13751089 Bro-III protein [Bombyx m... 1-114 + C |
9630901 BRO-c [Bombyx mori nuclear polyhedro. BRON +195-304 (BROC)
9630839 BRO-a [Bombyx mori nuclear polyhedro. BRON (1-115) +195-304 (BROC)
9630955 BRO-d=AcMNPV orf2 [Bombyx mori nucle... 1-115 + C |
9635359 ORF109 [Xestia c-nigrum granulovirus. BRON 1-82 +189-296 (BROC)
7672865 bro-a [Spodoptera. BRON( 1-113) +200-309 (BROC)
12597608 38.7kd [Heliocoverpa armig + 292-382 /
15213135 unknown [Epiphyas postvitt... 83 2e-15
9634234 ORF13 38.7kD [Spodoptera exigua nucl... |
9964371 AMV057 210-290 -(CIV029R/BROC)
9964491 AMV177 217-297 - (CIV029R/BROC)
9964489 AMV175 203-283- (CIV029R/BROC)
3510491 orf6Heliolithis BRON + C 190-271 (CIV029R/BROC)
*BROC solo
----------------
9630054 Orgyia pseudotsugata single. C-terminus solo-- NO BRO
15213228 unknown [Epiphyas postvitt... 146 9e-35 NO BRO-JUST THE C-terminal region and even that may be fragmented
9631089 LdOrf-122 peptide [Lymantria + 97-176--solo
2760643 CIV029R and Bro-a C can be unified
15078742
11931724 DpAV4 (59-181)
11931708 DpAV4 (91-194)
11931709 DpAV4 (15-97)
Xylella specific family
BRON-BRON-BRON + C XF1559 (that has no bro)
------------------------------
11362500 phage-related protein XF2524 [imported] ... 32-122, 166-253, 283-371 (BRON) + C
11362477 phage-related protein XF0684 [imported] ... 6-96 (BRON) 140-226 (BRON) 256-344 (BRON) + C
11362484 phage-related protein XF1663 [imported] ... 12-120, 130-237 Only 2 Bro domains
11362478 phage-related protein XF0704 [imported] ... 17-104 + C
11362483 phage-related protein XF1645 [imported] ... N + 122-210 + C C XF0704
11362060 hypothetical protein XF2506 [imported] -... N 1-74 XF2129/XF1645 + 189-279 (BRON) + C XF0704
XF0704 + XF2129 = XF1645 and XF2506
XF1559+
#BRON- gp30 like
-----------------------
9633590 P43 [Bacteriophage APSE-1] >gi|61180... 1-96 + C
9630500 gp30 [Bacteriophage N15] >gi|7521545... 7-101 (BRON)+ C
# Broa-N -- C synapomorphic to this group
-----------------------------------------
9635310 ORF60 [Xestia c-nigrum granulovirus]... 1-114 + C + C
10442560 Orf60-like proti... 12-110 + C+C
12597590 bro [Heliocoverpa armigera nucleopo... 12-110 + C + C
9635381 ORF131 [Xestia c-nigrum granulovirus... 1-93 + C +C
9631038 Ld-bro-c [Lymantria dispar nucleopol... 1-112 + C +C
9631039 Ld-bro-d [Lymantria dispar nucleopol... 1-112 + C
9631080 Ld-bro-i [Lymantria dispar nucleopol... 1-112 + C missing the middle domain
# Broa-NC
-----------------------------------------
9635363 ORF113 [Xestia c-nigrum granulovirus... N + 146-239+ C
14602336 ORF99 similar to XcGV ORF113 [Cydia... 150-240 (BRON) + BRON
# Broa-N + C- SinR like HTH
------------------
13623110 hypothetical protein - pha... N + 88-182 + C - HTH
# BRON + Vsr
---------------
15078782 069L [Chilo iridescent virus] >gi|7...
9631450 ORF MSV196 4-75 BRON + C vsr nuclease (101-196)
9631533 ORF MSV026 1-69 (BRON) + C vsr nuclease (101-196)
9631534 ORF MSV024 8-79 + C vsr nuclease
9631444 ORF MSV204 ALI motif gene family pro.1-91 + C vsr nuclease
#VSR nuclease
-------------
9631531 ORF MSV028 hypothetical protein [Mel.. (solo VSR).
9964571 AMV257 [Amsacta moorei entomopoxviru...(solo VSR)
9631399 ORF MSV229 leucine rich repeat gene ...
-----------------------------------------phi31orf238N----------------------------------------------------------------------------------------------------------------------------------------------------------------
#phi31orf238N + kilAC
-------
2897108 tr thermophilus phageTP-J34 + 112-236 (238) -- N - phi31orf238N 1-111
9632967 Strthe_orf287_Sfi21 - + N--phi31orf238N 46-116
13622110 Spy Mgas_ 116-240 (242) --- + N phi31orf238N 1-116
14247767, 13701788, 8918426, 1370718, 9635199 Sa 132-350 -- + N phi31orf238N 10-124 ORF11_phi ETA_131-246 (250) - -- N phi31orf238N 10-123
5823644 A118_136-260-- N 9-108 N phi31orf238N 1-111
13701788 anti repressor [Staphyloc 10-123 + C kilAC
#kilA middle + kilAC Ant1/2 : Originally may have had a RHAdomain N-terminal to it as it seems to have a region C-terminus in RHAat its N (the DELE motif)
------------------
137444, 15742 kilA middle + 225-321 (kilAC)
12514711 kila middle + kilAC
# phi31orf238N domain + shares a C-terminal domain with orf6_BPbIL285
--------------
gi|7239197, 12724417, 13095749, 13487806 Orf238 1-103 + C
# phi31orf238N + P22ARC
-----------------------------------------
9632546, 9633476, 1175795 hypothetical protein [Bacteriophage 8-114 + C unknown (191-264)- CP-933N like antipreressor C
11354036 NMA1293 4-128 + C
11138338 wonder if this is truncated shares a small domain wiht phi31orf238N proteins + KilAC
#phi31orf238N + phiSLT orf 81a
---------
15024925 Cab 1-98 + C- phiSLT orf 81a (which is a solo) -12719400
------------------------------------------RHA-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
#RHA+ kilAC
------------------
9630225 SpbC2 1-109 + 133-244 (kilAC)
13559853 Roi_HK620 23-116 + 143-233 (kilAC)
1197729 Roi_HK022 24-117 + 118-234 (kilAC) -- check extreme C-terminus, 9634208 (RHAidentical)
9632502 Roi_BP-933W 48-118 + 119-235 (kilAC) 4499797: orf14_933W ilAN 48-118 + 119-235 (kilAC) gi:2668765 Roi_H-19B kilA N 23-116 + 102-233 (kilAC) gi:9633432 roi_VT2-Sa kilA N 26-118 + 119-235 (kilAC) gi:13360660 coli kilA N 24-116 + 117-233 (kilAC), 13360660 coli kilA N 24-116 + 117-233 (kilAC)
9634208 RHA+ KilA
# RHA domain
-------------
12719399 antirepressor [Staphylococcus aureu 2-108 --probably solo
2120256 rha protein - phage phi-80 >gi|1019108|gb 28-124---probably solo
12722192 unknown [Pasteurella multo 26-125 + C-nothing
13095661 Orf3 [bacteriophage bIL311] >gi|127 14-100 \subfamily
14972611 hypothetical protein [Stre 3-102 /
13095876 anti-repressor [bacteriophage bIL31 8-127 RHA+ a C-terminal region specific to 14972611 and this protein
9633589 P42 [Bacteriophage APSE-1] >gi|61180 12-101 + kilA middle domain
#RHA+ P4ASH-N
------------
421263 hypothetical protein 179 - Shigella flexne 1-106 (P4ASH only part N)+ 106-168 (RHA)
#RHA+ Orf11D3
--------------
1175786 HYPOTHETICAL PROTEIN HI1412 >gi| 10-114 + Orf11D3 114-173
9635533 unknown protein [Enterobacteria phag 39-139 + Orf11D3 144-197
--------------------------------------------------------------------------------------------------------------------------------------
-----------------------------------------ALIGNMENTS-----------------------------------------------------------------------------------------------
1. kilA N-terminus
PHD Sec. Structure -EEEEE--------EEE-------------HHHHHHHHH--------HHHHH----------------------------------------EEEEEE------------HHHHHHHHHHHHH------HH--HHHHHHHHHHHHHH--
NMA1544_Nm_11290039 LIPRVESG---EIIPQRMSD-------GYINATALCKSVG----KSYSDYRQLQSTNHFLNELKAQTG---------------------LSEQQLIQQRIGGEPSL--QGSWVHPYLAINLAQ------WLSPAFAVKVSTWVHEWMSG \
NMB0900_Nm_11281012 NVSVLNFG---NTPVSFRQD-------GFLNATAIASHFG----KLPKDYLKSEQTQQYISALAENLSVRRKIL---------------TEANQIVIVKRGGSE----QGTWLHPKLAIHFAR------WLNPKFAVWCDEQIEILLNG /kilAN solo
AMV132_AMV_9964446 NYWCLHIN---DFNLIYNKKL------NLYNASRVCDIYE----KNIHIWLE-ENYDYTIKYLKIKEI---------------------NDHVSIINNNKESSL----NGLYVSEHILLGISI------WISEECYYKCINIILHNHDI-- has a C-terminus which matches the C-terminus of MSV-T5orf172
KILA_BPP1_125404 STTLPVIC---GVEITTDRA-------GRYNLNALHRASGLGAHKAPAQWLRTLSAKQLIEELEKET----------------------MQNCIVSFEGRGG-------GTFAHELLAVEYAG------WISPAFRLKVNQTFIDYRTG | kilAN + kilAC
HKBK_BPhk620_13559861 -MKAITLF---NTPIRVDES-------GMICLTDMWKASGKSESESPYHYLRNKQTKEFLAELEKN-----------------------HESVVFTERGVHG-------GTYGGKFVAYDYAA------WLNPGFKYAAYKVLDDYFTG |kilAN + ORF11D3-C
Orf11_D3__9635595 NVIPFHYQ---GKPVRFNSD-------GWINATDIAAAHG----MRLDNWLRNKETEAYIEALARHLNTSD------------------SRDLIRGQRGRGG-------GTWLHPKLAVAFAR------WISPDFAVWADLHIDALLRG |kilAN + ORF11D3-C
XF2294_Xf_11345968 TTQQLAIN---SLPIR-EQD-------GLYSLNDFHKASGGAVRHRPSEFLRLDKTKALVVELTNSPEFVSSIKGGA------------PHLFVRKEKGRAG-------STFACRELAIAYAS------WISPAFQLKVIRVFLASVVV |C-terminus
Gp73_HK97_9634189 NIIPIDFE---GHPMRFSDD-------GWFDATAAADKFN----KEPAQWLRLPETVRYIEALKSRYGNIT------------------YVKTSRARKDRGG-------GTWLHPKLAVRFAR------WLSVDFEIWCDEQIDAIIQG |Fused to p63C
-EEEEEEE---EEEEEEE-------------HHHHHHH--------HHHHHHHHHHHHHHHHHHHH-------------------------EEEEEE---------EEEEE---HHHHHHHHH------HHEEEEE----EEEEEEEE-
CIV315L_CIV_15079027 NFYYGLFR---DFKLVVDKNT------ECFNATKLCNSGG----KQFRQWTRLEKSKKLMEYYSRRG----------------------SQQMYEIKGDNKDQLVTQTTGTYAPIDFFEDIKR------WIQLPKASSASGVVYVVTTS kilAN + Bro-eC
FPV124_FPV_9634794 RFCYIKYD---KFDLIMMKEN------RFINATKLCKLGG----KDFHRWKRLDGSKELMIKVNEMN-EMWKSAPPPPDL---------GGIIIEVNG-SNQYTEYDIAGSYVHQDLIPHIAS------WISPLFALKVSKIISCYVSG \
AMV112_AMV_9964426 SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLSG----KRFRNWIRLDRSKQLLKYMENYRSSYV------------------SVGFYEVKGDNNNKTSKEITGQYVPKEVILDISS------WISVEFYLKCNDIIINYYNN |
AMV024_AMV__9964338 SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLGG----KKFKQWKRLEKSQELIDYIKNNRGGDP------------------HPGFYETKGDNKDENVKKITGCYVPKEVILDISS------WISVEFYLKCNDIIINYYNT |
CIV006L_CIV_15078718 TFYKGLFG---DFPLIVDKKT------GCFNATKLCVLGG----KRFVDWNKTLRSKKLIQYYETRCDIKT------------------ESLLYEIKGDNNDEITKQITGTYLPKEFILDIAS------WISVEFYDKCNNIIINYFVN |
CIV313L-CIV_15079025 NFYYGLFG---DFKLVVDKNT------ECFNATKLCNSGG----KRFRDWTKLEKSKKLMEYYKGRRDDHRG-----------------GSNFYEVKGDNKDDEVSKTTGQYVKKELILDIAS------WISTEFYDKCNQIVIDFFVV | kilAN + Broa-c
AMV110_AMV__9964424 SYYYGLFG---DFKLVIDKTT------GCFNATKLCNLGG----KQYRDWKRLEKSKELIKTLINVRRENS------------------RVWEYNIISNNNHEIHKQYTGYYVSKDLILDIAS------WIAPEFYLKCNDIIINYYNN |
AMV100_AMV__9964414 TFYSAHIN---SYQLVIDKKT------GFFNASYVCIKNY----RKINNWLNNKKTIKLIKYYMNLLNNKNNN----------------NNKIKYKIVDKYDNIN----GIYLHPILLNHLLD------WINIKINNKYN--IIDYIIL /
FPV161_FPV_9634831 GFLILYYD---SIEIIVMSCN------HFINISALLAKKN----KDFNEWLKIESFREIIDTLDKIN--YDLGQRYCEEPYGASHSSVIIEVKASNLIDDRTA------GFYVHKDLIPYILT------CISIPFSLKVVRVLDTYIGE \
FPV236_FPV_9634906 YFMSMKLL---DVEVVIMRSN------GFVNITRLCNLEG----KDFNDWKQLESSRRLLNTLKDNN--KLHDP-----------------IINIRHTRIKIN------GEYVSQLLLDYVIP------WISPYVATRVSILMRYYRRC |
FPV155_FPV_9634825 EFCYIQYS---GFHLVMMISN------CYINASKLCDT------KDFKKWLRLDSSLSLLQEIENTN---FPSEKKFSIKNSK------SVIILEKYYHEEVE------GYYIHPDILPHIVG------WLSPTFAISMSKFINGYISN | C-nothing
FPV159_FPV_9634829 KFSYIIYD---KIKIIIMKSN------NYVNATRLCELRG----RKFTNWKKLSESKILVDNVKKIN---DKTNQLKTDMI--------IYVKDIDHKGRDTC------GYYVHQDLVSSISN------WISPLFAVKVNKIINYYICN |
FPV248_FPV_9634918 NFCKLSYE---DIEIIMMKEN------EYINATRLCSSRG----RDILDWMSKESSVELINELDRIN---RSCNDYYDY----------RGIVLNVVSDSETS------ELYVHRDLILHISH------WISPLFSLKVVKFINSYIQD /
FPV163_FPV_9634833 HFCYIKYD---GITLTMMKDN------GYINATQLCMLGN----KDFKEWIKLDHSIELIKEIEKNI--NKETTKYVKAVISV------RSDYYNSETSNDIK------GFYIHGNIMPHICA------WISSKFAIKVSNIVHNYLND
FPV075_FPV_9634745 NFCFINYA---NIEVIMLKYN------GYINATKICDLGN----KNFRQWCRLESSKKLIKTLNYKN---GIYNKAVLE----------IGLASNSAYKYELV------GTYVHIDLVPHIIC------WVFPSIALNFSKILNSYLSN
FPV157_FPV_9634827 SFDSIKYR---DIKVIIMKNN------GYVNCSKLCKMRN----KYFSRWLRLSTSKALLDIYNNKS---VDNA---------------IVKVYGKGKKLIIT------GFYLKQNMIRYVIE------WIGDDFTNDIYKMINFYNAL \ + RING
FPV150_FPV_9634820 EYRVIEDN---GFSIILLKHT------EYINVTKLCKIHN----KEFYRWKRLISAGRIIETVSRDISNQGFESPL---------------VYVNRKGNKEFY------GFYAHPQLALYIAK------WISEDIFNKIKHLINSYTIS |
p28_Ectro_1360841 LQYIDEPN---DIRLPVCIIRNINNITYFINITKINPDLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSKL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYIA |
D6R_VAR_885801 LQYIDEPN---DIRLTVCIIQNINNITYYINITKINPHLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSNL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYID |
YH22_VV_140731 LQYIDEPN---DIRLTVCIIRNINNITYYINITKINTHLA----NQFRAWKKRIAGRDYMTNLSRDT--GIQQSKL-------------TETIRNCQKNRNIY------GLYIHYNLVINVVI-----DWITDVIVQSILRGLVNWYIA -- no RING- truncated
PHDSec Str -EEEEE------EEEEEE----------EEEEHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHH----------------------EEEEEEEE----EEE------EEEHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHH
1MB1_Sc_3402004 VDVYEFIH---STGSIMKRKK-----DDWVNATHILKA------ANFAKAKRTR----ILEKEVLK----------------------ETHEKV--QGGFGKY-----QGTWVPLNIAKQLAEKFSVYDQLKPLFDFTQTDGSASP
MBP1_Kla_729994 VDVYEFIH---PTGSIMKRKA-----DNWVNATHILKA------AKFPKAKRTR----ILEKEVIT----------------------DTHEKV--QGGFGKY-----QGTWIPLELASKLAEKFEVLDELKPLFDFTQQEGSASP
PCT1_Sp_11346262 VEVYECFI---KGVSVMRRRR-----DSWLNATQILKV------ADFDKPQRTR----VLERQVQI----------------------GAHEKV--QGGYGKY-----QGTWVPFQRGVDLATKYKVDGIMSPILSLDIDEGKAIA
SCT1_Sp_464742 VEVFEYTI---NGFPLMKRCH-----DNWLNATQILKI------AELDKPRRTR----ILEKFAQK----------------------GLHEKI--QGGCGKY-----QGTWVPSERAVELAHEYNVFDLIQPLIEYS---GSAFM
SWI4_Sc_666106 TDVYECYIRGFETKIVMRRTK-----DDWINITQVFKI------AQFSKTKRTK----ILEKESND----------------------MQHEKV--QGGYGRF-----QGTWIPLDSAKFLVNKYEIIDPVVNSILTFQFDPNNPP
CC10_Sp_115906 MKYMELSC---GDNVALRRCP-----DSYFNISQILRL------AGTSSSENAK----ELDDIIES----------------------GDYENV--DSKHPQI-----DGVWVPYDRAISIAKRYGVYEILQPLISFNLDLFPKFS
EFG1_Cal_1169477 TLCYQVDA---NNVSVVRRAD-----NNMINGTKLLNV------AQMTRGRRDG----ILKSE-------------------------KVRHVV--KIGSMHL-----KGVWIPFERALAMAQREQIVDMLYPLFVRDIKRVIQTG
Sok2_Sc__6323658 TLCYQVEA---NGISVVRRAD-----NDMVNGTKLLNV------TKMTRGRRDG----ILKAE-------------------------KIRHVV--KIGSMHL-----KGVWIPFERALAIAQREKIADYLYPLFIRDIQSVLKQN
Phd1-_Sc_6322808 TICYQVEA---NGISVVRRAD-----NNMINGTKLLNV------TKMTRGRRDG----ILRSE-------------------------KVREVV--KIGSMHL-----KGVWIPFERAYILAQREQILDHLYPLFVKDIESIVDAR
MGF-1_Yaly_5139660 TLCFQVEA---RGICVARRED-----NDMINGTKLLNV------AGMTRGRRDG----ILKGE-------------------------KLRHVV--KAGAMHL-----KGVWIPYDRALEFANKEKIIDLLFPLFVRDIKSVLYHP
AM1_Nc_1517923 SLCFQVEA---RGICVARRED-----NAMINGTKLLNV------AGMTRGRRDG----ILKSE-------------------------KVRHVV--KIGPMHL-----KGVWIPFERALDFANKEKITELLYPLFVHNIGALLYHP
StuA_Eni_549002 SLCYQVEA---KGVCVARRED-----NGMINGTKLLNV------AGMTRGRRDG----ILKSE-------------------------KVRNVV--KIGPMHL-----KGVWIPFDRALEFANKEKITDLLYPLFVQHISNLLYHP
SPBC19C7_10_Sp_7491471 LKCTNPES--KVPHFLMRMAK-----DSSISATSMFRS------AFPKATQEEE----DLEMRW------------------------IRDNLN--PIEDKRV-----AGLWVPPADALALAKDYSMTPFINALLEASSTPSTYAT
G6G8_4_Nc__12802359 SGIFKSSP---PSYFLMRRSQ-----DGYISATGMFKA------TFPYASQEEE----EAERKY------------------------IKSIPT--TSSEETA-----GNVWIPPEQALILAEEYQITPWIRALLDPSDIAVTATD
1MB1 Sec structure EEEEEEEE-----EEEEEEE---------EEEHHHHHH----------HHHHHH----HHHHH----------------------------EEE---------------EEEE-HHHHHHHHHH---HH---HH------------
2. kilAC-terminus roi1 roi2
--------------- * *
PHD Sec Str. ------------HHH-HHH-----EEEHHHHHHHHH----HHHHHHHHHHHH--EEE-----------HHHH---EEEEEEEEEE-----EEEEEEEE----HHHHHHHHHHH------
SA1801_SaN315_13701788 LETKIERDKPKIVFADAVATTKTSILVGELAKIIKQNGINIGQRRLFEWLRQNGFLIKRKGVDYNMPTQYSMERELFEIKETSITHSDGHTSISKTPKVTGKGQQYFVNKFLGEKQTS \
ORF11_BPETA_8918426 LETKIERDKPKIVFADAVATTKTSILVGELAKIIKQNGINIGQRRLFEWLRQNGFLIKRKGVDYNMPTQYSMERELFEIKETSITHSDGHTSISKTPKVTGKGQQYFVNKFLGETQTT |
orf238_BPTP-J34_2897108 LEAQIEADRPKVLFADAVSASKSSCLIGELAKILKQNGINIGQNKLFQWLRSNGYLISRRGDSWNQPTQKSMQLGLFELKKTNINHADGHTTTNTTTKVTGKGQQYFINKFLNQERLT |
orf287_BPSfi21_9632967 LEAQIEADRPKVLFADAVSASKSSCLIGELAKILKQNGINIGRNKLFQWLRSNGYLISRRGDSWNQPTQKSMQLGLFELKKTNINHADGHTTTNTTTKVTGKGQQYFINKFLNQERLT | phi31orf238N + kilAC
SPy0946_Spy_13622110 LEAQIEADRPKVLFADAVSASHTSILVGELAKLLKQNGVNIGATRLFTWLRKHGYLIKRNGRDWNMPTQKSVELGLIRVKETSITHSDGHITVSKTPLVTGKGQQYFINKFLNQEYLP |
ORF42_A118_5823644 ALNQIEEQKPKVIFADAVQTSENTVLVKDLATILKQNGLDIGQNRLFEWLRGSGYLLN-KGTYYNKPSQKAMNLGLFEQKTHIHTDRNGLMVTTYTPRVTGKGQVYLLNKLLEEHGLV /
ORF169a_BPmv4_11138338 LTLQLEESNKKASYLDIILGTPDLLATTQIAAD-----YGYSARTFNQLLKEVGIQH--KVNGQWILYKAYMGKGYVQSKSFAFKDRKGHDRSKPSTYWTQKGRKLIYDVLKENGTLP |shares a small domain with phi31orf238N
KILA_BPP1_125404 LEQKMLMDAPKVEFAERVATASG-VLIGNYAKV-----LGLGQNYLFTWLRDNGILIA-TGERRNVPKQEYISRGYFTLKETVIDTSNG-SRISFTTRITGKGQQWLMKRLLDAGVLV \+kilAN
yoqD_BPSPBC2_9630225 LQEQLTLAEPKVEKYDRFLNTDGLMKIGQVAKAIGI--KGMGQNNLFRFLRENKVLI--DGTNKNAPYQKYVERGFFQVKTQETS-----VGIKTITLVTPKGADFIVDLLKKHGHKR \
ROI_HK022_1197729 LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGSRRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKFVDNGMLK |
orf14_BP933W_4499797 LEKQLALAAPKVEFADRVGEASG-ILIGNFAKVV-----GIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK |
ROI_BP933W_9632502 LEKQLALAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK | RHA+ KilAC
ROI_BPH-19B_2668765 LEKQLALAAPKVEFADRVGEASG-ILIGNFAKVV-----GIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMDRGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK |
ROI_BPHK97_9634208 LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGARRNVPMQEYMERGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK |
ROI_BPHK620_13559853 LENQLAIAAPKVEFADRVGEASG-ILIGNFAKV-----VGIGPNKLFAWMRDHKILIA-SGSRRNVPMQEYMERGYFTVKETAVNTNHG-IQISFTTKITGRGQQWLTRKLLDNGMLK /
Ant1_BPP1_137444 LEQQLVAAAPKVDFADRVSVANG-ILIGNFAKV-----VGLKQNALFSWLRQNGILMA-FGARKNVPRQQYINAGYFTVKEVVLDDENG-YQIRLTPN-------------------- \
ANT2_BPP1_15742 LEQQLVAAAPKVDFADRVSVANG-ILIGNFAKVV-----GLKQNALFSWLRQNGILMA-FGARKNVPRQQYINAGYFTVKEVVLDDENG-YQIRLTPN-------------------- | truncated? kila middle + kilAC
Z1797_Ec_12514711 LENQLAIAAPKAEFVDNYVEASGLMGFREVAKLL-----GIKETDFRLFLLENGIMYR--LAGKMTPYSHHLDAGRFSVKTGEA----GNGHAFTQVKFTPKGVQWIAGLLAAWRATA /
ORF23_BPRLT_1353540 SITYVPIEK-K-----NIILSNQEISYSEFIELLELNNIKMSKIMFLKFMRDRRITIDEKGKFYNFPTAFSIEMGIMLLSSTTKENVQ-----KYIPKITIEGQKYFIEKFHYMIEDK | kilAC solo
ORF5_BPRLT_1353522 LNIELAAATEKTTYLDLILESPDDILITQIAQD-----YGFSAVKFNRILNELRIQR--KVNKQWVLYSRYMGKGYIGSRTQNYVDSKGQERTSITTTWKQKGRKFLYETLKKHGYLP \
ORF38_BPBK5-T_14251162 LNLELAAATEKTTYLDLILEIPDDILITQIAQD-----YGFSAVKLNRILNELRIQR--KVNKQWVLYSRYMGKGYIGSRTQNYVDSKGQERTSITTTWKQKGRKFLYETLKKHGYLP |
SAV0855_Sa_14246624 LQQEIGELKPKADYVDEILKSTGTLATTQIAADY-----GISAQKLNKLLHEARLQRK-VNKQWVLYSEHM-GKSYTDSDTITIVRSDGREDTVLQTRWTQKGRLKIHEIMTEFGYEA | + Broa-N
orf8_BPbIL309_13095813 LAVENQIMQPKAQYFDDLVERNLLTSFRDTAKML-----KVGQKQLIDWLLENKYIYR-DKKNKLMPYAQY-NNDLFEIKESKGATN---SWKGAQTLITPVGRETFNLLLN-EYKAS |
ORF291_BPLL-H_1395130 AEQKLSEAKPKLDYVDKILASKKTILTTHLATDY-----GCSAVAFNRMLCDKKIQRK-VRDTYVLYSQYQ-GHGWTHTFARAIKTKHG-QEIKEQMEWTQKGNIGLYELLKDRFGLL |
orf9_BPphiPV83_9635686 LQQQVEVNKPKVLFADSVAGSDNSILVGELAKILKQNGVDIGQNRLFKWLRNNGYLIKKSGESYNLPTQKSMDLKILDIKKRIINNPDGSSKVSRTPKVTGKGQQYFVNKFLGETQTT |
SPy0980_Strpy_13622137 LSVENMVMKPKADYFDDLVDRNLLTSFRETAKQL-----KVKERRFIQFLLDKKYVYR-DKKGKLMPFADK-NNGLFEVKESVNEKTN---WAGTQTLITPKGRETFRLLFI------ /
onsensus/85% Lp.bl.....Kh.ahD.l..sp..h.h.phAp.......sh....h..hhpp..hbb......b....p.....shhp.+p....p.p.......ps.hp.cG...h...h.......
3. ORF11CD3
--------------
PHD Sec. Str. ---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH----
HI1412_Hi_1175786 GYSLMHKYNELCIEHKAKKAFASLCGKGLREW-KGDKPVLEATLKLFEDKMQIELPIK |+ RHA
orf201_BPP22_9635533 SESYEAERNAIMLEYMKEKDVASMSGRLLNRWGKIKKPQLLARIGRLEQHGQTVIPGL |+ RHA
orf11_BPD3_9635595 ELTEKQAFDRACKQLEDGRQLASLHGKGLADW-KFKKPMLEHRVDEMRDRLQMVLGLE |+ kilAN
hkbK_HK620_13559861 RNSLSAQLNMKCHEFDQKKDMASFCGQGLAAW-RYTKPVLVAEINSLANQLQITIPGL |+ kilAN
ANT_BP7888_10799917 RMSVMEELNQACADMKRDKNIASVFATGLNEW-KQVKSAHVSKIRTLINEANLLIDFV | + P22 ANT-N- near identical or identical, 9632512, 13361651, 12516389
ANT_BP933W_9632512 KMSVMEELNQACADMKRDKNIASVFATGLNEW-KQVKAAHVSKIRTLVNEANMLIDFV || + P22 ANT-N- near identical or identical, 9632512, 13361651, 12516389
consensus/100% ..o....ts..h.p....+.hASh.sp.L..W.+..Ks....pl..h.pp.p..ls..
4. P22 AR-N
------------------
ANT_BP7888_10799917 MNMMTVPFHGDSLYVVNHNGEPYVPMKPVVAGMGLAWQSQLAKL-RQRFASTITEIVMVAEDGKRRNMVSLPLRKLAGWLQTINPNKVKPEIRGKVIQYQEECDDVLYEYWTKGFVVNPR|P22ANT-N + ORF11D3C
ANT_BPP22_9635550 VNTSYVPFNGQHVLTAMVAGVAYVAMKPVVDNIGLSWSSQVQKLLKMKDKFNYVDIDMVAGDMKKRLMGCIPLKKLNGWLFSINPEKVRADIRDKLIKYQEECFTVLYDYWTKGKAENPR| P22 ANT-N + Bro-AN+ P22ANTC
5. RHA roi3
---------------- *
PHD Sec. Str. ----------------------EE--HHHHHHHHHHHHHHHHHHHHHHHH---------EEE----------------------EEEEE--------EEEEEEEE----EEEEEEEEE----HHHHHHHHHHHHHHHHHH--
Orf3_BPbIL311_13095661 MKGLAFLTSP------DLSKAEVVTNHVVIAEYAGIERKSVRRLINNHKNDF------ENFGRL----------------RF-EITTLPDSR---GQKVKIYQLNRNQAMLMITYLDNTEVVRNFKIALVKRFDEMEKELYA \
SP1134_Spn__14972611 M-ELVY----------MDGKKEPYTTSEIIAECAEVQHHTITRLIRENKADF------EELGIL----------------GF-K-IHKLDTR---GQPKKSYILNEQQATFLITYLKNTETVRQFKLNLVKAFFEMREELS- /
orf14_bIL310_13095876 MNEITV------SLDVIIKNKNVIVSSLSVAKAFDRQHSHILRSIEDIKRDWDSLIQSKNGLNRNIIPLKSQGNKQVIFTDYFKESEYIAEN---GRLVKFFEMNRNGFMLLANSFNGKRIL-PIKLAFIERFDELE----- | shares a C-terminal domain with 14972611
PM1774_Pm__12722192 LQGAESAVNNAVFPKVFHKETVAMTDSLKVAHYFGKRHDNLLNTIKNLGC-------SDEFRLL----------------NF-KESYYLNEQ---NKKQPMFYMTQDGFTLLVMGFTGKKAM-QFKEQYIKEFNEMKKRLAT |solo
RHA_BPphi-80_2120256 MNNPSVIPAFDFREMVTTLDNKIITTSLKVADYFGKRHKDVLRAIRNLKC-------SDDFTQR----------------NF-APIDFIDKN---GDVQPMYNITRDGCMMLVMGFTGKTAA-AVKECYINAFNWMAEQLN- |solo
orf179_Sf_421263 MATILTLSHP----DATIENGRAVTTSVAVAEFFRKMHKNVIQKIETLEC-------SPEFNRL----------------NF-KPVTYTDAK---AKNAQCTKSPKTASFSW------------------------------ + ASH-N
orf182_BPphiSLT_12719399 MQAL----------QIVEQNETHYVDSREVAEMIGKRHDNLVRDIKGYIKVLED---SSKLSSH----------------NFFEESTYVNSQ---NKVQPCYLLTKKGCDIVANKMTGSKGI-LFTATYVDAFHKMDEYIKQ |solo
ORF201_9BPP22_9635533 MNELIANHDFDFRQLVTAAEGQPVTDTFQIAKAFGKRHADVLRALKNCHC-------SEDFRRA----------------HF-CVSEKINNLGIFDKKQIYYRMDFSGFVMLVMGFNGAKAD-AVKEAYINAFNWMSAELR- \ + ORF11D3-C
P42_BPAPSE-1__9633589 MQNLIT-------------FQSLTMSSLEIAELVNKRHDNVKRTIETLAK-------SEIIQLP----------------QS-EKVENKQSNSPNR-FTEVFIFEGEQGKRDSIIVVAQLC-PEFTACLVDRWQELEQKLNT|+ kilA middle
yoqD_BPSPBc2_9630225 -MESYL--------TVIEQNGQLLVDSREVAEMVGKRHTDLLRSIDGYVAILL----NAKLRS----------------VEFFLESTYKDAT---GRSLKHFHLTRKGCDMVANKMTGAKGV-LFTAQYVSKFEEMEKALKA \
hkbC_BPHK620_13559853 MNELIN-------------GNAIKMTSIEIAELVGKRHDNVKRTIETLAK-------NGVIRLP----------------QI-EVSERINNLGFNV-QYEHYVFEGEQGKRDSIVVVAQLS-PEFTARLVDRWRELEETAVN |
Roi_BPH-19B_2668765 MNELIN-------------GNAIKMTSIEIAELVGKRHDNVKRTIETLAK-------NGVIRLP----------------QI-EVSERINNLGFNV-QYEHYVFEGEQGKRDSIVVVAQLS-PEFTARLVDRWRELEGATAK |
ROI_BPVT2-Sa_9633432 MNELIN-------------SNAIKMTSIEIAELVGSRHDKVKQSIERLAV-------RGVIRNP----------------PM-VVFEKINNLGLLR-GVEAYVFEGEQGKRDSIIVVAQLS-PEFTARLVDRWRELEGATAK | +kilAC
ROI_BP933W_9632502 MNELIN-------------SNAIKMTSIEIAELVGSQHGNVRISIERLAK-------RGVIQLP----------------SM-QKVENKQTISPNK-FTSVYIFEGEQGKRGSIIVVAQLS-PEFTARLVDRWRELEGATAK |
Roi_BPHK022_1197729 MNELIN-------------GNAIKMTSIEIAELVESRHSNVKVSIDRLVK-------RGVIKPP----------------AL-QHTNIINDLGVITGKRDFYVFEGEQGKRDSIIVVAQLS-PEFTARLVDRWRELEEAAVN |
ROI_BPHK97__9634208 MNELIN-------------GNAIKMTSIEIAELVESRHSNVKVSIDRLVK-------RGVIKPP----------------AL-QHTNIINDLGVITGKRDFYVFEGEKGKRDSIIVVAQLS-PEFTARLVDRWRELEEAAV- /
consensus/100% .........................sp..lAchh..b+.pl...lp.......................................................a.bp.p...b....h.s......hp..blp.a..b......
http://www.bmm.icnet.uk/servers/3dpssm/output/1f3d9c24585b2b85.job_summary.html
6. orf6N
-------------
PHD Sec. Str. -----EE-----EEEEHHHHHHH----HHHHHHH------------EEEEE----HHHHH--------EEE-----------------------EEEE---HHHHH---
N.orf6_BPbIL285_13095686 MNELQITELNGQRVLTTQQIAEGYGTDSASITKNFNNNKSRFKEGKHFFLLQGADLKEFK---NNIQNLDV----------------VGNRAPKLYLWTEKGALLHAKS \ +ORF6C
N.ORF6_BPTP901-1_13786537 MNELQITELNGQRVLTTQQIAEGYGTDSASITKNFNNNKSRFKEGKHFFLLQGADLKEFK---NNIQNLDV----------------VGNRAPKLYLWTEKGALLHAKS /
orf13_GMSE-1_12276103 NTQLPVIEYQGQRVITTELLAQGYGAEVKSIHMNFTRNKSRFEETKHYFLLQGEELKAFI---NYPTNCGL----------------VDKRSPSLVLWTGRGS------ \
H0107_Ec_7649857 VETLSPITHNQIPVITTELLAQLYGTEPVRIRQNHHENKVRFVEGKHFFKVVGNDLKELRVALNYSQNLRVTLSNSQNLQPSLRGLQISPKARSLILWTERGAARHAKR /solo
ANT_BPVT2-Sa_9633431 VETLSPITHNQIPVITTELLAQLYGTEPVRIRQNHHENKVRFVEGKHFFKVVGNDLKELRVALNYSQNLRVTLSNSQNLQPSLRGLQISPKARSLILWTERGAARHAKM | + P22ARC
7. ORF6C : the proteins are almost identical but at least two of them are fusedto orf6N
-------------
PHD Sec. Str. -------HHHHHHHHHH------HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHEEEEE----HHHHHHHHHHHHHHHHHHHHHH---EEEEE------HHHHHHHHHHHH-----EEEEE------------
C.orf6_BPbIL285_13095686 EKQLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE \ + orf6N
C.ORF6_BPTP901-1_13786537 EKQLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE /
ORF55_BPpi3_12724417 QQLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEA \
ORF6_BPTuc2009_13487806 QQLLPQTPEQQIALLARGNVNLNKKVERIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEE | + phi31orf238N
orf6_BPbIL286_13095749 QHLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTDRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTTLEIRGLNSQTSFDFEA |
orf238_BPphi31.1_7239197 QQLLPQTPEQQIALLAQGNVNLNKKVEQIENSVLDLTVRFGLPSNKAKVLQKKVASKVYMFTGGKYSNAHKKLGAKVFREFYKDLNNRFDVVKYSDIPLSRYDEATEYLDMWQPSFNTMLEIRGLNSQTSLSNYQ /
8. phi31orf238N
---------------------
PHD Sec. Str. --EEEEEEE------EE--HHHHHH------HHHHHHHHHHH----------EEEEEEEE-----------------EEEEEHHHHHHHHHHH----HHHHHHHHHHHHH--
orf238_BPphi31_1_7239197 MNQLITITQNENNEQVVSGRELHQFLGV-KTRYNDWFED-MVKYG-FTENVDFIGFTEKRV-KPQG-----GRPSVDHALKLDMAKEISMIQRNEKGKQARQYFIEVEKELK\ + ORF6C
orf6_BPTuc2009_13487806 MNQLITITQNENNDQVVSGRELHEFLGV-KTRYNDWFED-MVKYG-FTENVDFIGFTEKRV-KPQG-----GRPSVDHALKLDMAKEISMIQRNEKGKQARQYFIEVEKELK |
orf6_BPbIL286_13095749 MNQLITITQNENNDQVVSGRELHEFLDI-TERYSTWFER-MLKYG-FVENIDFVGC--KVF-NTLA-----KQELQDHALKIDMAKEISMIQRNEKGKQARQYFIEVEKELK |
P55_BPpi3_12724417 MNQLITITQNENNDQVVSGRELHEFLDI-TERYSTWFER-MLKYG-FVENIDFVGC--KVF-NTLA-----KQELQDHALKIDMAKEISMIQRNEKGKQARQYFIEVEKELK /
orf238_BPTP-J34_2897108 MNELINITLNENQEPVVSGRQLHKALGV-KTAYKDWFPR-MTEYG-FTDGEDFSSFLSKSTG---------GRPSQDHIIKLDMAKEIAMIQRTDKGKEVRQYFIQVEKDFN \ + kilAC
SPy0946_Spy_13622110 MNQLINVTLNENQEPVVSGRDLHKVLEI-KTQYTKWLER-MSEYG-FVENEDFMAISQKRL-TAQGN----QTEYTDHVLKLDMAKEIAMLQRNEKSKEVRKYFIQVEKDFN |
SA1801_Sa_13701788 IGEMFNIQEKENGEIAISGRELHQALEV-KTPYKKWFER-MSDYG-FEENIDYIVTDIFVH-NPLGG----RQNQTDHALTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN |
ORF11_BPphiETA_8918426 IGEMFNIQEKENGEIAISGRELHQALEV-KTAYKDWFPR-MLKYG-FEENTDYTAIAQKRA-TAQGN----MTHYIDHALTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN |
orf34_BPphiPVL_9635199 IGEMFNIQEKENGEIAISARELYKALEV-KKRFSAWAEI-NLKH--FKENRDFTSVLTSTV-VNNGA----VRQLEDYALTLDVAKHVAMMSGTEKGFDFREYFIQVEKAWN |
ORF42_BPA118_5823644 ANEMLPVLENEKGEKFVNARTLHEKLMT-TTKFADWIKRRIRQYG-FVENEDFFSLLKNEK-RAIG-----GTTSIDYIFTLDSGKELAMVENTEQGRAIRKYFIEVEKQAR |
SAV1994_Sa_14247767 IGEMFNIQEKENGEIAISGRELHQALEV-STRYDKWFER-MTEYG-FENGIDFISQVEKVH-GQKRAR---TYEQVNHILTLDTAKEIAMIQRSEPGKRARQYFIQVEKAWN |
orf287_BPsfi21_9632967 MNELINVTLDKNNEPIVSARQLHKTLEV-KTRFSQWVEQ-NFKI--FKENEDFSSVVTTTQQNQYGG----TKELQDYAVTIRMAEHLAMMSKTNKGHEVREYFIKVEKDFN /
L0142_BP933W_9632546 RIPVFNGTIANETTLLVNARDLHTFLGV-GKRFASWITERIEEYG-FVENQDYIAISQKREIGY-------GRGKKDYHLTLDTAKETAMVERNEKGRQIRRYFIECEKKLR \ + P22ARC
orf80_BPVT2-Sa_9633476 LIPVFNGTIANETTLLVNARDLHTFLGV-GKRFASWITERIEEYG-FVENQDYIAISQKREIGY-------GRGKKDYHLTLDTAKETAMVERNEKGRQIRRYFIECEKKLR |
HI1422_Hi_1175795 LIPVFNGLIQNQPVQLCNARELHAFVES-KQQYTDWIKNRINEYG-FIQDEDYLVITERTN----------GRPRKEYHITLDMGKELGMVERNERGRQIRQYFIRCERTLK |
NMA1293_Nm_11354036 LIPTVSGQLDNQTQALVDAHDLHKFLGV-ETPFSKWIQRRIEEYG-FTQALDFIGVDKIVR-TEAGFFGQRDKTVQGYYLSLDMAKELCMVERNDKGRQARRYFIEMEKQAK /
CAC1945_Cab_15024925 MENLIRIS----DKGLVSAKELYLGLGLNKTNWSRWYPKNIQSNEFFKENIDWIGVRHNDE----------GNETMDFAISIEFAKHIAMMAKTEKSHEYRNYFIKCENKLK + phi-SLT -orf81a
consensus/100% ................hss+pLH..l....p.a..W......p...F.ps.Da.s......................a..plc.scc.sM...sp.s..hRpYFIbhEp..p
9. P22ARC
--------------------
PHD Sec. Str. -------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------EE--HHHHHHHHHHHHHHHHHHHH---
L0142_BP933W_9632546 QAEPQQQFTDEEIILLCYMQLWMEKAQDLSKHLYPIMKELNSSYTNKLYDIAFETIYMVTKNRDVLLREAARLD \
orf80_BPVT2-Sa_9633476 QAEPQQQFTDEEIILLCYMQLWMEKAQDLSKHLYPIMKELNSSYTNKLYDIAFETIYMVTKNRDVLLREAARLD |
HI1422_Hi_1175795 PEKFTHEFTEFEIETLVWLLIGHHQMNTLLGQLEKPLDAIGSNLHPAVYSYWKEYGRQYKDALPTIKRLMAPFK | + phi31orf238N
HI1418_Hi_1175791 EKKFSFEFTEYELQQLVWLWFAFMRGIVTFQHIEKAFKALGSNMSGDIYGQAYEYLSVYAQQTKS--------- /
Z1818_Ec_12514734 QEKSTNELSAKEANSLVWLWDYANRSQALFRELYPALKQIQSNYSGRCYDYGHEFSYVIGMARDVLINHTRDVD | + Bro-a N
ANT_BPP22_9635550 QEKKTNDLSAKEANSLVWLWDYANRSQALFRELYPAMRQIQSNYSGKCYDYGHEFSYIIGIARDVLINHTRDVD |P22ANTN + Broa-N + P22ARC
ANT_BPVT2-Sa_9633431 QEKKLNGLSAKETDSLVWLWDYANRSQALFRELYPALKLIQSGYSGICHDYGYEFSYIIGRARGVLINHTRDID | +orf6N at N
12514711 1-102 start from here (kilAC)
10. ASH
-------------------
ECs2630_Ec_13362098 TQKNRLPCRNRSGYISAAPHKTGAGILNPIQSKAHNRASGFFVRTV------------LPRLFRVRIMAGRTGPTSVGPDSLLSGVENPVRLASP-RFSTL-DGELFL--------------------------------------------------------------------- + C low complexity
ash_BPP4_75898 ------------------------------MVWCVVSRADGIPCIL-----------PASAHYAAESMVAQAGQPPGWPVSCEAGILTPVWAIAI-ERENS-GDSVICYSQEAAIMATTLTPSHPEFVFVFAAVRRADRHPRICMLRTVAGDERSARRSLVRDYVLSLAARLPVVEV
ASH_BPphi-R73_93828 ------------------------------MVWRVVCRAGMIL--------------FAIACYATESMVAQAGQPPGWPVFFEAGIPTPVWAIAI-ERRNS-GDSSYLLLEGDGLMATTLTPSHPEFVFVFAAVRRADRHPRICMLRTVAGDERSARRSLVRDYVLSLAARLPVVEV
Z0337_Ec_12513051 YLYSGLLTVVISRYSFSAVAKSAAGIGVPYNLLATIDAPCVFFYVVAQAQPFSGLWCLCLHHGSIEIMVVRAGQPSGWPVSNKAGYANPVRAATS-EIGVS-GGSNNRYLLEAAIMATILTPSHPQYVFVFAAIRRADTHPRICMLRTVSCDERSARRLLVRDYVLSLSARLPAGEV
Orf179_Sf_421263 MLNVAIENQNGWNYSAPAPHKTGAGIATPTMTTAHNRAQAVF---------------LCVKHSHIQIMVGRAGQPQGWPVSVVTGCSNPVRLTTH-EIATS-GGESFKLTIEAAIMATILTLSHPD--------------------------------------------------- + RHA, the other paralogs dont seem to have this.. I wonder if this is an artificial fusion
ORF199_Sf_312621 MLNVAIENQNGWNYSASAPHKTGAGRGNPNVTRAHSRAEAVF---------------LCVMHSSIQIMVGCAGQSQDWPGSRVTGISTPVRLTTL-MVVENLGGELINLSLEDAIMATIPALSHPD---------------------------------------------------
gp32_BPN15_9630502 -----------------------------------------------------------------------MGPTSVGPVSSCTGVENPVWATTPIEILNS-GGSTLYKIGM-----------HTMFKFKFAAVVRTDKKSHIHRLSTIASSEREARRQFASRFVLVLSARIPVSEV
Reconstruction:
ancestor of Ecs2630/Z0337 + P4ASH protein insert = Z0337,
Eca2630 secondary loss of C
C-terminal loss = ORF199
+ RHA= ORF179
gp32, secondary loss of linker region
or P4ASH plus extension = ECs2630
11. BRON
-----------------------------------
EEEEEEEEEEE-------------------------HHHHHHHHHHHHHH--H-HHHH---------------------------HHHHHHHHHHHHHHHHH--------------------------------EEEEEE---HHHHHHH-HHH-------HHHHHHHHHHHHHHHHHHHHHHHHHH
MSV226_MSV_9631397 EKVP----FVI------------------KK-DNETWYNMLDII-KILGYKKKL-HLHA---------------------------SLLNKNNK-KKFYQLLTK---------NTLKNKYFKY------TNVQKNRIFINEVALFYILLSSKKE--------NAII--CKNYVFG--NLFKLENLNL |solo
AMV055_AMV_9964369 TFNEI---FKFN------NKSIDVI----GT-LNNPWFCGKDVL-NILEYEKSSFKKIL---------------------------QRLKESYK-KSYREILYKV-----------GDNLSP-----TLNGNNSKIIYINDSGLYTLIMNFNLN--------NAIV--FKEYVI------------- |
AMV262_AMV_9964576 TIIKQ--IYISDT-----KEKYNIYIYVDIK-TKLSYFISNDIL-KILTESTDN-IY-----------------------------KYCEKSD---IFKWINIHN---------------------NIPSNISDETILINKNGLNNIISKLNNE--------KSNH--FRKWLND--IDINIIIKNE |solo
SCGD3_15_Scoe_7480004 -DVSD---FVYA------ATGARVR-RLTMP-GGSHWFPAADVC-KELGYTTTR-KALL---------------------------DHVPEEHR-DSLETVTG------------SHSLSIPAG-----RKWRRDLQLIDLQGLILLVNACTKP--------ACAP--FKQWVA---EVVETVQREG |Clong but nothing significant
ORF1_Nm_12697190 -MNEI---FNFH------GQEVRTL---T-I-DDEPWFVGKDVA-DILGYSKAR-NAIA---------------------------LHVDEEDA-LK-QGI--------------------------PTSGGTQDMLIINESGLYSLILSSKLP--------QARE--FKRWVTS--EVLPAIRKQ- |solo
ANT_LcBPA2_6599316 NELQH---FDFK------GRQVRTV---V-V-DNEPMFVGKDIA-EVLGYSKPA-NAVN---------------------------KYVPDKFK-GVTKL---------------------------MTPGGKQDFVVIAEPGLYKLVFKSDMP--------NADE--FTDWVAE--KVLPSIRKHG | fragment of kilAC
PA2423_Pa_11349554 -QLAP--HYFFRQ-----QRLLRA----LLI-DDQAWFVLDDFA-RLIEHSQPE-QMLA----------------------------RLDDDQARR--ESL-------------------------RSERGEDQAQWLISESGAYAALIYQQRG--------DGGE--LRRWLSG--EVVPELRSAT |N-nothing + Bro + C nothing
PA1153_Pa_11349113 TLLQPS-RFTHH------HRVLRA---VL-L-DEEGWFVLSDLV-RLLGRYLGG-RAPAALCDEAPWPLATAEQRERLFALCHALERHLDTDQWRLAWL---------------------------HDERHGPRQDCLVSESGLYALLWLAAPG--------AARG--LRRWVSG--SVLPRLRSQS
SPy2128_Spy_13623111 NKTE-----TWN------GYTIRF----VEH-QGEWWAVLADIA-KALDL-NPK--FIK---------------------------QRLGDE----------------------VVSNNHV-----TDSLGRQQEMLIVNEFGIYETIFSSRKK--------EAKT--FKLWVFE--TIKQLRQSTG | solo
ORF5_BPRLT_1353522 KELQN---FNFN------NLPVRTV---L-I-NDEPWFVGKDVA-IAIGYKNFR-DALK---------------------------SHVKDKYK-RESRI---------------------------TTPSGVQSVTVISEPGLYQLAGESKLP--------SAEP--FQDWVYE--EVLPTIRSTE \
orf8_BPbIL309_13095813 KELQN---FT--------NGIFNLD--VKVD-GENILFSAEQAA-KAMGITQVK-NGK----------------------------EYV----K---WERVNSYL-----------PNS---------P--EVGKGSFISEPMVYKLAFKANNA--------VSEK--FTDWLAV--EVLPTIRKHG |
orf9_BPphiPV_9635686 QALQT---FNFK------ELPVRTV----EI-ENEPYFVGKDIA-EILGYARTD-NAIR---------------------------NHVDSEDK-LTHQF---------------------------SASGQNRNMIIINESGLYSLIFDASKQSKNEKIRETARK--FKRWVTS--DVLPAIRKHG | + kilAC
ORF38_BK5T_14251162 NELQN---FNFN------NLPVRTV---L-I-NDEPWFVGKDVA-IAIGYKNFR-DALK---------------------------SHVKDKYK-RESRI---------------------------TTPSGVQSVTVISEPGLYQLAGESKLP--------SAEP--FQDWVYE--EVLPTIRKHG |
ORF291_BPLLH_1395130 NEVQI---FENN------GRGISLP--VKEV-GGQVYFEAEAAA-IGLGITT-----------------------------------EVNGDTY-VRWPRINSYL-------------GFATSG------KKIKKGDWITEPQFYKLAFKASND--------VAEK--FQDWVAS--EVLPSIRKHG |
SPy0980_Spy_13622137 MELQV---FTNEQ-----FGEVRT----ATI-NNQIYFNLNDCC-QILELSNPR-KTIE---------------------------R-LNKDG--VTTSDII-------------------------DSLGRTQQANFINESNFYKLVFQSRKP--------EAEK--FADWVTS--EVLPSIRKH- |
SAV0855_Sa_14246624 QALQT---FNFE------ELPVRTL---E-V-DGEPYFIGKDVA-DILGYANGR-DALS---------------------------KHVDEDDK-KVLTSRNTTL-----------------------ENLPNRGLTAVNESGLYSLIFSSKLE--------SAKR--FKRWVTS--DVLPAIRKYG /
Z1818_Ec_12514734 NDFTI---FKFG------DSEIRVI----NK-CGEPWFVAKDVC-DALALTNSR-KALT---------------------------ALDDDE-KGVTLSY----------------------------TLGGEQNLSIVSESGMYTLVLRCRDA---VNKGSVPHK--FRKWVTA--EVLPSIRKHG | + p22ARC
gp30_BPN15_9630500 KALSV---FSFQE-----SHPIRVV---L-V-GGDPWFVALDIC-AALNIANPS-DALR---------------------------K-LDHDEK-LTLGLTEAQ-----------------------KLDRMAREVNVVSESGLYTIILRCRDA---VKQGTTAWR--FRKWVTN--EVLPAIRKNG \N15 gp30 like BRON
P43_BPAPSE1_9633590 --MTT---LVFR------NTVLET----ISH-NGQIWFTSSVLA-KALQYSSSK-SV-------------------------------TDLYHK--NSDEFADHM--------SKVVDST-------TLGKSRNKTRIFSLRGAHLIAIFSRTP--------VAKE--FRKWVLD--ILDKQTVNQT /
XF2506_Xf_11362060 QSIIP---FDFH------SHAVRVV---M-R-DGNPWFVATDVC-TALGYRNPS-KAVA---------------------------DHLDDDEK-SNQSLGLA-----------------------------GKPVIIISESGLYALVLRSRKP--------EARK--FSKWVTS--EVLPSIRKTC \
M_XF2524_Xf_11362500 NAITP---FQFE------SHAVRT---VVDD-HGEVWFVGKDVA-DVLGYTNHN-KALG---------------------------DHCRGVTK-CYPIL---------------------------DSLGRSRETRIISEPDMLRLIVSSKLP--------AAER--FERWVFE--ELLPTLRKTG |
C_XF2524_11362500 NAITP---FQFE------SKDVRIQ---LDE-ASAPWFNANDVC-AVLEFGNPH-QAIE---------------------------SHVDADDL-QKLEVI--------------------------DALGRTQRANHINESGLYALIMGSTKP--------AAKR--FKRWVTS--EVLPTLRKTG | Xylella specific fusion
N_XF2524_Xf_11362500 MNAPSEFTLQFE------SHAVRVQ---VDE-AGTPWFNANDIC-TAVELLNPC-AALA---------------------------QHVGARNV--SKRKII-------------------------DTIGRTQRANYLNEPGMLTLLIGSTKE--------AAKR--LRRWLIS--EALPAAAVQK |
XF0684_Xf_11362477 NAITP---FQFE------SHAVRT---VVDD-HGEVWFVGKDVA-DVLGYTNHN-KALG---------------------------DHCKGVPK-RYPL----------------------------QTPGGIQEIRIISEPDMLRLIVSSKLP--------AAER--FERWVTS--EVLPTIHKT- |
XF1663_Xf_11362484 NAITP---FHFE------SQAVRT---VVDD-HGEVWFVGKDVA-DVLGYANHN-DALG---------------------------AHCKGVAK-RYPLP---------------------------DSLGRLQYFRIISEPDMFRLIAGSKLP--------AAER--FERWVFE--GVLPTIHKTG |
XF1645_Xf_11362483 TLPAS---VDFS------DVSLTI----IDH-DGIPYLTAADLA-RALGYKDAS-AVLR---------------------------IYSRHTDE-FTSEM-------------SLTVNLTVKG---FGCGNSEKPVRLFSPRGCHLVAMFARTS--------VAAA--FRRWVLDVLEVLPSIRKTG |
XF0704_Xf_11362478 TQLPA--AVCFS------GKSLS----IIDR-DGVPHLTAADLA-RALGYKDTS-AVLR---------------------------IYSRHTDE-FTYQM-------------SLVVNLTVKG---FGSGNSEKPVRLFSPRGCHLVAMFARTS--------VAAA--FRRWVLDVLEVLPSIRKTG /
MSV194_MSV_9631452 MDLDN---LIFN------NKKIHIA----IY-ENKPYFKGKDIA-EILEYKDTN-DAIK---------------------------KHVDDDDK-SKYEDLINR--------PGILP----------SLTYNEKNTIYISESGLYSLILSSKKS--------EAKI--FKKWITN--EVLPNIRKHG \ + T5orf172
BROE_BMNV_9630956 VKIGK---FKFG------EDTFTLR-YVLGG-EQPVRFVARDIA-NKLKFKNTK-KAIR---------------------------DHVDGKYK-CTFEQACI----------NISKEKHVKQG---NPLYLQTQTILLDKIGVIQLFMRSKMT--------NAAE--LQNWFYE--HVLPQCTARQ |
orf117_ESV_13242588 DILQT---FVFN------NTRHKVV-ILRDE-NDDPLFKASDIG-KILSIKNIH-TSMI---------------------------D-LHDDDK--AIRTA--------------------------STPGGEQKTVFVTEKGVYKLIMRSRKP--------VAKP--FQDWVF---EVLKTIRKRG |
BROM_LdNV_9631117 MALTK---VNFV------SGPLEVF-TVQDD-EQENWMAANPFA-ETLKYNNCN-KAIR---------------------------IHVSANNQ-KTLEELNID-----------------KSQ--VLPRNVQAKTKFINMNGVIELLLASQMQ--------QAKE--FRYWMTN--VKFAETSADP |
BROK_LDNV_9631082 VKIGQ---FRFG------EDAFTLR-YVLAA-EQPVKFVAKDIA-RSLKYEKPA-NAIA---------------------------KHVDDKYK-SAFEQLCF-------------DDLRVKQG---DPLYLHKSTILIDKIGVIQLFMRSKLH--------NAAE--LQNWFYE--RVLPQCTARQ |
BROI_BmNV_13751084 VKIGE---FKFG------EDTFTLR-YVLDA-EQQVKFVAKDIA-SSLKYVNCK-QAVI---------------------------VNVDNKYK-TTYEQACI----------NISKENRVKQG---DPLYLQSQTILLDKIGVIQLFMRSKMT--------NAAE--LQNWFYE--HVLPQCTARQ |
Brob_BMNV_9630900 VKIGQ---FKFG------QDEFTLR-YVLGD-EQPVKFVAKDIA-RSLKYVNYE-KAVR---------------------------VHVDVKYK-TTYEQACI----------NISKENRVKHG---DPLYLSPQTILLDKIGVIQLFMRSKMH--------NAAE--LQNWFYE--HVLPQCTASA |
BROG_LdNV_9631042 THLQH---FEASL---DDGVKFECW-GVVTP-DGKVACKLKEFM-DFLGYKEVN-SAYK----------------------------MIPKEWK-VYWHKLQDDL----------CVDS---------SVDLHPRNVFVYEPGMYAFMTRSGSP--------LAKW--CMGFLYD--VVVPTLKKNQ |
ORF130_XnNV_9635380 --------------------------------MDKLLYTGHGVA-ESLGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKKL---------TFFNEAL-------LPSNWQPNTVFITEAGVYALINKSKLA--------GAEI--FREWLFD--TIIPQMRRAK |
bro_HaNV_12597544 MSLTK---IQFG------DKEVET--YTVDF-NGEKWMVANPFA-EALNYSRAN-KAIL---------------------------EKVSDGNQ-KTFDQIKPYR--------IVHDGTGESSV---IPRNMKPNTKFINRAGVFELIMSSQME--------YARQ--FRYWLSS--VKLNTTVETD |
201R_CIV_15078913 YMTIT---INGN------EHQIKLA----GI-IEDPYFCGKDVC-TILGYKDKE-QALR---------------------------KRVKSKHK-KSLSELFEKK----LPVVTTGNFFLGTQN---ELSYHEGKSIYINEPGLYNLIMSSEAP--------FAEQ--FQDMVYE--KILPSIRKYG |
289L_CIV_15079001 YMTIT---FCNQ------EHQIKLA----GT-VDTPYFCGKDVC-KVLGYKDIK-DALK---------------------------KHVDREDK-LPLSEIKKVG-------GTAPPTFLGQTY--AYLSHNDGRAVYISEGGLYSLIMSSEAP--------FAKD--FRRLVCN--VILPSIRKFG |
MSV023_MSV_9631535 DLIS----------------KINI----ITY-NNCSYYKAKDIA-DILNYKSVD-YFIK---------------------------KYVKNEHK-INYE-----------------------------------STIYVNNSGLYYIMFKSKKH--------EAEK--FQNWIKE--ENLPEIENNK /
AMV175_AMV_9964489 TFNEI---FNYN------DVKIKVI----GT-INNPWFCGKNIL-KALEYSDDSHNKIL---------------------------NRLDDKFK-DNMYNILSSV-----------RDNLS------MTKNNKNKAIYLNEPGIYYIILHCTKD--------SAKG--FQDFILF--DLLPTIRKRT \
AMV057_AMV_9964371 NFNNI---FKFN------NISINII----GS-LDNPWFKGKDILIDGLEYTDQSAKCVL---------------------------KRLNTSFK-KSYNDIISVE-------GNLPP-----------TKNNDNKAIYVNEAGLYYIILHCTKD--------SAKG--FQNYILF--DLLPSIRKRA |
orf6_HaEPV_3510491 --MKS---FKYK------NINIDVL----GD-INYPWFNGKNILIDGLQYTEQSAKCVL---------------------------KRLESKFK-NKLSDIICVG------GNLPPTGNLDKIS--NITRHNDGKAIYINEAGLYYIIIHCTKE--------SAKP--FQDYILF--DLLPSIRKLA |
AMV177_AMV_9964491 NFNKI---FKFK------DTDIKIN----GT-IDQPWFCLKDIIIYGFGYTKESYKSIL---------------------------KELNNSYK-KSLYDIIVEG---------------GKTP---PTKNNENKAIYVNESGLYYIVFQCTKD--------SAKD--FQKYILD--ELLPSIRKLA |
BROA_LDNV_9630998 MALSK---VEFV------NGPLEVF-TVQDD-KQENWMAANPFA-ETLKYLNVN-RAIR---------------------------VHVSKHNQ-KTLDELQSD------------RNGL-------ITSSLHPQTKFINRAGVFELISASEMP--------AAKR--FKQWNAN--DLLPSLCREG | BROC
BROL_LDNV_9631113 MALSK---VEFV------NGPLEVF-TVQDE-NQEKWMVANPFA-EALGYTRLN-YAVT---------------------------QHVSVVNQ-KTYEEFKSQG-------STATDDS---SL---LPRNIQAKTKFINQAGVFELIGASEMP--------AAKR--FKTWNTN--DLLPTLCAEG |
BroO_LdNV_9631121 MALTK---VEFV------NGPLEVF-TVQDE-NQEKWMVANPFA-ESLKYAIPH-IAIS---------------------------KFVSTVNQ-KTYEELRSMR---ITSRITSTDDS---SL---LPRNVQAKTKFINRAGVFELISASEMP--------AAKR--FKTWNTN--DLLPTLCAEG |
BRO_HaNV_12597545 MSLTK---IQFG------DKEVETY--TVDF-NGEKWMVANPFA-EALSYSNVN-RAIR---------------------------VHVSEKNQ-QNYEEFKSDR--------VGLTDSV--TS---LPRNIQAKTKFINRAGVFELINASDMP--------GAKR--FQAWNNN--DLLPSLCQEG |
ORF109_XnNV_9635359 -------------------------------------MVANPFA-EALNYSNVN-RAIR---------------------------VHVSNQNQ-KCMEELRSDR-------CGLTDDS---SC---LPRNIQAKTKFINRAGVFELINASEMP--------AAKR--FKAWNSN--DLLPTLCTDG |
ORF159_XnNV_9635409 ARKQK---FLYC------NEELNVI-TQVDE-FGEPWMVANPFA-TVLQYYKPN-DAVR---------------------------KHVSEWNV-KSYEDFRSRR------IGADDSSHWVDE----ITSSLHPKTKFINRAGLFELIQSSRMP--------KAQE--FKNWVNS--DLLPKLCQEG |
BRO-d_BmNPV_9630955 VKIGQ---FKFG------QDTFTLR-YVLEQGNPQVKFVAKDIA-SSLKYGNCK-DAVS---------------------------RHVDKKYK-YTYSESGARL-------PPSAPNSVAKQG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT- | Essential for lytic infection
Bro-III_BmNPV_13751089 VKIGE---FKFG------EDTFTLR-YVLEQGNQQVKFVAKDIA-ISLKYASYE-KAVR---------------------------VHVDGKYK-STFEHAG-QI-------GHHAPNSVAKQG---DPLYLHPRTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT- |
ORF2_AcNPV_9627744 VKIGE---FKFG------EDTFNLR-YVLER-DQQVRFVAKDVA-NSLKYTVCD-KAIR---------------------------VHVDNKYK-SLFEQTI-QN-------GGPTSNSVVKRG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT- |
AntgemNPV_9799895 VKIGQ---FKFG------EDVFTLR-YVLDR--DIVKFVAKDIA-NSLKHTNAA-EAVR---------------------------NHVDIKYK-TTYEQGE-TV-------SHPASTSLVKRG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAVE--LQEWLLE--EVIPQVLCT- |
ORF153_LdNPV_9631120 VKIGE---FKFG------EDTFTLR-YVLEK-DQQVKFVARDVA-VSLRYERPA-DAVS---------------------------KHVDIKYK-STYAELGRQI-----ADPTLNVKLIVKKG---DPLYLQPHTVLITKSGVIQLIMKSKLP--------YAVE--LQEWLLE--EVIPQVLCT- |
BRO-c__BmNPV_9630901 VKIGE---FKFG------EDTFTLR-YVLGD-EQPVRFVAKDIA-SSLKYVNCE-RAIR---------------------------VHVDGKYK-STFEHAD-QI-------QHHAPDSVAKQG---DPLYLHPHTVLITKSGVIQLIMKSKLP--------YAIE--LQEWLLE--EVIPQVLCT- |
BRO-a_BMNV_9630839 VKIGE---FKFG------EDTFTLR-YVLEQGNLQVKFVAKDIA-SSLKYVNCK-QAVI---------------------------VNVDKKYK-TTYSESGSIP-------YTPAPDNVVKQG---DPLYLQPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG |
bro-a_SpLNV_7672865 VKIGE---FKFG------EDTFSLR-YVLER-DQPLKFVAKDVA-ASLKYQDAK-RAIK---------------------------IHVDDKYR-STFEHGG-QI-------APLVSNALAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG |
BROB_LDNV_9630999 VKIGQ---FKFG------EEEFTLR-YVLER-DQSIKFVAKDVA-ASLKYVDCK-QAVR---------------------------INVDDKYK-FTFEQGCVP--------HTLASDSVAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG |
BROP_LDNV_9631128 VKMGE---FRFG------EDVFRLR-YVL---NDPVKFVAKDVA-GSLKYQDAK-RAIR---------------------------IHVDDKYK-STFEHGE-IR-------SHLASNALAKQG---DPLYLHPHTVLITKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG |
ORF114_XnNV_9635364 RKQV----ILFQ------NEPVEVVFSDKTGPDGLVYYF-FEVT-PFARLMNVD-NPL----------------------------SKIDSQHV-IVVEEPVTA----------ADTNNW-------AVRNNTRSTTLVSEAGLYQLMFTGKPV--------TVRQGMVRNWLFD--IVLPTVKQFT |
BROJ_LDNV_9631081 VKIGQ---FKFG------QDTFTLR-YVLGG-EQQVKFVAKDIA-SNLKHANCA-EAVR---------------------------KHVDGKYK-STFEHGE-IR-------SHLASNALAKQG---DPLYLHPHTVLVTKEGVIQLIMKSKLP--------YAVE--LQAWLLE--EVIPQVLCTG |
ORF13_SeNV_9634234 -KTKR---LQFDD-----QFSFTVD-YIF---NDEVWIAGNKLA-EGLGFREPQ-TAID---------------------------EFVDGKYK-RTINELVFN-----------------------NSVDDTNGLVCVNKHGVLQLIDRLDFK--------NKAE--FTAWIIE--EVYVELENKF |
38_7kd_HzNV_10442572 LERKR---INFDD-----QFSFTVR-HLTR--NQQMWMIGSDFA-SGIGFDEPE-FVVD---------------------------NYVSNHNK-ICLETLIFG-----------------KRV--EIENDDVKRSMCINRDGCLQLLNHIEFA--------NKSE--FIAWLVT--YAFDKLYSHM /
MSV195_MSV_9631451 MNLDN---LIFN------NKKIHI---VIDN-NNKVLFKAKNCA-EILKYTNPL-KAIR---------------------------DHVRQKHQ-ISFKNINMN-------------DSF-------ILNNIHPDTIFITESDFYSLISK------------------------------------- | solo
MSV196_MSV_9631450 ---------------------MNI--YVAIF-NNKSYFRAKDCA-SILEFKHTK-DAIR---------------------------HYVSNGNK-IKFKNINIR-----------------------SKKYIHPHTVFINNFGLIELILKHKSI--------VHHN-IIDKLICK-FDLNVDLNITP \ + vsr nuclease
MSV024_MSV_9631534 MDLMQ---------------GI----HVINY-NDNLYFKAIDIA-KLLKHKNIY-RAIK---------------------------YKISDCNK-TLYKNISNT-----------------------NLSYKKNKMVYINKLGLIELIKESTTI--------VSPM-VINGLINKFNLNLDLPIKFI |
MSV026_MSV_9631533 DIISN----------------IKT----INV-NNCLYFKGEDCA-KILKYKNTY-GAIR---------------------------NNVSKNNK-IKFE---------------------------------KNNDIYINKLGLSELIIKHKSI--------VSTN-TINTLIHNFNLNLDLFEKKK |
MSV204_MSV_9631444 MEI-----INYN------NNQIHL---LYTT-IGEVYYKGKDIA-KILRYIDTK-KVIR---------------------------NNVLSTNK-VNYSTLIKNV----------SING--------QLHKTPHHTIFINTKGLKNLFDMP------------VKR--LSN--KEINDLIEFLNSHN |
069L_CIV_15078782 GGLRA--IFNLD------GVTLDTP--IMGT-WDKPVFFGKEIA-EFLGFKKPK-DALQ---------------------------KHVKPKYK-TTLSKVLEKK---------LDTEPV---------SYNEGKRVLLYKEGVVELIKKTRLV--------GIEN--KIDALIE--AFELNLNVVH /
ORF62_XnNV_9635312 IVKKT---FTSD------KKKWELY-NITSC-PYHFYYEAYPIA-KLLCNKHPE-LAIK---------------------------NYVDRSCC-KIYEELKRWFRPYCIFQSVGSPCSPGPNN---QPIHWQSNTLFINKDGIISLINNSTLP--------VAHE--FKRWFLA--QRHDEAEVFK |solo
ORF60like_HzNV_10442560 LVNRK---CKLG----------EVW-ITEIE-ENRFLCSGHGVA-EALGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKGVL------NQHSLVTSSDSIE---MPLNWQPNTLFITEAGIYALIMRSKLP--------AAEE--FQSWLFE--EVLPELRRTG \
BRO_HzNV_12597590 LVNRK---CKLG----------EVW-ITEIE-ENRFLCSGHGVA-EALGYKCPR-RALY---------------------------DHVKPQWR-KTWAEIKGVL------NQHSLVTSSDSIE---MPLNWQPNTLFITEAGIYALIMRSKLP--------AAEE--FQSWLFE--EVLPELRRTG |
ORF60_XnNV_9635310 LVNRK---CNMG------GINADIW-LTQME-MDKFLYMGHSIA-KSVGYANPQ-KAIR---------------------------DHVRPEWR-KTWSEIVDGT------NRSPLVTSFNDSH---LPANWQPNTVFITEAGVWALIIKSKLP--------AAEK--FQKWLFE--EVLPELRRTG |
ORF131_XnNV_9635381 -----------------------------ME-MDKFLYMGHSIA-KSVGYANPQ-KAIR---------------------------DHVRPEWR-KTWSEIVDGT------NRSPLVTSFNDSH---LPANWQPNTVFITEAGVWALIIKSKLP--------AAEK--FQKWLFE--EVLPELRRTG | BRON + C synapomorphy shared by this group
BROD_LdNV_9631039 MALQR---FEFPMSADEDESKFECW-GIVMP-DGSVAVKLKELA-EFLNYEDVK-KAYK----------------------------LVPDEWK-ITWNILQNKL-------EPSRPHLVAPST---TPANWQPETLFVLEPGVYALMARSTKP--------MAKE--KMKYVYE--TILPTIRKTG |
BROI_LdNV_9631080 MALQR---FVFPMSADEDGAKFECW-GVVMP-DGDVAVKLKELA-LFLGYADVK-MSYK----------------------------HVPDEWK-ITWKNLQNKL-------ASKRHQLVAPPT---TPANWHPETLFVLEPGVYALLARSNKP--------LAKE--RMKFVYE--TILPTIRKTG |
BROC_LdNV_9631038 MALQR---FEFPMSADEDESKFECW-GVVMP-DGSVAVKLKELA-LFLGYADVK-MSYK----------------------------LIPEEWK-ITWKNLQNKL-------ASKRHQLVAPPT---TPANWHPETLFVLEPGVYALMARSTKP--------MAKE--KMKFVYE--TILPTIRKTG /
CnBV__13160526 FQLQN---WDVD------DKSVVLRLYIHPI-TNEPWVVAADLA-RCLGYEKYR-QTHT----------------------------RILAAFKRKLSDLVHTEP-----FSGTVESEVARLEGAPVELSSRERDIVVVNEGGIHQMLIGSRLP--------NVQK--YKELVFG--KILPAARARG \BRON duplication
PxORF82_PxNV_11068085 VGCSV--GILFD----------KLH--YIVI-DGVVWFKLNQIC-KYFD--------IP---------------------------KQCPD-YNIITWYTLSKRL----------------KSN-----ITWKLNTIMISDMGVYKLLIIKNEI--------IAEE--FYH------KRLHELRSTG /
ORF99_CpGV_14602336 ESVDSVCGVL--------PSNIEF----FSV-NERTYFKGLDVA-RHLKCSPS--YTIN---------------------------KYVADTDM-VLWGDLRRYV-----------HDKYVWTN---CKNHWKDNTIFLKETGVKQLCIATQGD-------DKLYQ-EMMDGVYNYDSGDEQVVYAK |bro duplication
SPy2127_Spy__13623110 QVITT---TNFH------GQPLDIY----GD-IQEPLFLARAVA-EMIDYTKTS-QGYY---------------------DVQAMLRKVDEDEK---LKGMAL--------------EGTTKN------FRSGQKVWFLTEHGLYEVLMRSNKP--------KAKE--FRKAVK---NILKEIRLNG | SinR like HTH
p63_BPMx8_15320633 PTPEMPKPFLFEGS----TRIRVVVDE-----AGEPWFVAQDIA-HALEYRMAS-D-LT---------------------------RLLKPHHL-RTHAV---------------------------RTNRGERSATIISEPAMYRAVFLSKSK--------KAEP--FQEWVTS--DVLRSIRKTG |p63C
consensus/85% ........h............hp.............bh..pshh...L.h.p....sh..............................l..p.b........................................p..hlsc.Ghh.lh..sp...........s....hb.hh.....hls.h....
12. T5orf172
------------------------
PHD Sec Str. ------EE---------HHHHHHHH----------EEEEEEEE---------HHHHHHHHHHHH-------------------------HHHHHHHHHHHH
orf172_BPT5__93750 PAWKNQYKIGMSQN---PKERLAQYQTYSPY-RD--YKLEHWS-FWF---DKRKGEKLIHQYFKDLK--------------EHEWFSINSRDLSKYLERINSSSD \
yeeC_Bs_7474985 SSIKNLYKIGFTTG--SVENRIRNAENQSTYLYAPVEIVTTYQVFNM---NASKFETAIHHALENNNLDVSILGANGKMLVPKEWFVVTLEDLQAVIDEIVMMVH |
orf240SM63E2_14194257 MRSAKRYKIGKSNS---PSRRYREVRLDLP---DA-TILVHTI-PTD---DPSGIEAYWHRRFADKRV------------RDTEFFNLTASDVTAFKRRKYQ--- |solos or with insignificant extensions
CIV460R_CIV_15079171 YEPLDIYKIGCTKD---INRRLKTMNASRI-SFDK-FFIVNQI-QTF---HYFKLEQGLHKLLKKYRL-------------NNEFFQCNVNIIEKAISDYANNNV |
BROF_LdNV_9631041 YRDRRIYKIGRTAS---PADRLCALNTGRA-DDF--LYFEHVS-PDLGHEASVRVERLMHDSLAPLR-------------MHGDSFN------------------ |
NMB1170_Nm_11345564 TVIKGVYKIGISDV-SNFEGRMRHLENNGYANVAG-LERILAV-KTD---NYKEKENLLHEIFSKSRI------------GDTELFAVDENLVKRLFLSLRGEIV |
ORF1_BPP27_8346568 SFGENVYKVGMTRR-LEPMDRVKELGDASV-PFD--FDVHAMI-SCD---DAPALEKALHDYLERYRV--------NKVNLRKEFFRVELEKIIEVVKHHHGNIE /
CIV315L_CIV_15079027 LQVHNVFKIGYTKN---FEERLKTFNDYRH-SLEPQFFAVAIY-DTD---NAKKLETTIHKKLKDFRS-------------EGEFFQVELSVIKEAFLKEDCCLK | + kilAN
AMV209_AMV_9964523 YASINNFKVGKTDN---LSSRQSNFNSSHI-DQDE-FYICFYQ-KVY---NMSKTENLIHDLLEDFR-----------DKKRKEIFIIHYTYLLDIINLVIKNIN \
AMV207_AMV_9964521 YAMINNFKVGKTDN---LSSRQSNFNSSHN-TEDE-FYICYYE-KVF---NISKTENLIHDLLDNFR-----------DKKRKEIFVIHYKYLLDMVNLVIKNIN |
MSV198_MSV_9631448 YAKLNTFKIGKTDN--LISKRQSQLNNSHT-SFDK-IYICYYE-AVY---NPNKVEQIIHDVLESFR-----------DSSNNEFFILHYKYLLNIVKLIIKNIN | +MSV199
AMV194_AMV_9964508 YAAQNRFKIGGVENNNLIKPRLSTYNSRSA-EGDE-WYYTYIK-NIN---NYKHFENRFWSVMSSFR-----------DKKDKEIIVLYYNDLINIFNFISENYN |
CIV420R_CIV_15079131 YAAQHRFKVGGVEGRRRLRGRLSDYNGRSA-SGDE-WYFCHLI-DVA---DFRKAEGRIEDIIGKFR-----------DKKDKEIYIMPYRKLLKVIELICQNYT |
MSV021_MSV_9631537 YKEKNIYKIGYTND---VVGKLVKMNSNRL-KFEQ-FYYVKIY-KVN---NIFSIQNYIYKKLYPYI-------------LNYPYLNCD----LNVITNAMENID /
orf117_ESV_13242588 EDNSVLVKIGSTKN---IRARTTGLVNEFG------SMAIFRIFECD---RYEEFEKSLHKHNDIKRY---RFKKPINGKRSMEVFNMTKEELQRAVNIAGSNVC \
BROM_LdNV_9631117 LQTVDAYKIGYTHD---LHDRIAELNVASP--LD--FKPVFVY-DTA---TPRRLEQQLHNYFLDKR-------------IKREFYKLDKEDLLMLPVVCNKLCA |
ORF59_HaNV_12597544 LQMIDAYKIGYTFD---LTARLNELNVASP--LD--FKSVFVR-ESS---NPYDLEQKLHRHFHESRI-------------KREFFKLTEEDLALLPLICDNLLA |
CIV289L_CIV_15079001 YQQQHKFKVGGVQTFDLLKSRLTQYNSGES-DSEA-HFFIYIR-KTV---NYRSIEHAIKGLLSGFR-----------ENQSNELYIMHYDWLVKFVDAIMDGNA |
CIV201R_CIV_15078913 YQQHHKFKVGGVQSFKDLKSRLTQYNSGES-NSEA-HFFIYVR-KTV---SYRSIEHIIKGLLSGFR-----------ENQSNELYIMHCDWLVKFLDAIMDGNA |
MSV194_MSV_9631452 DLSKNIFKIGKTNI-NSIKNRLSTYNTGAS---DP-YYYVFYK-EVY---DATKIEKDFNTLMNRYN---INVTSPNKTKLNNELYKLYYLDLEYVLNAVIDSND |
MSV023_MSV_9631535 DLSNNIFKIGKTNI-NSIKNRLSTYNTGAS---DP-YYYVFYK-NVY---DGNKIEKEFNYLMNRYN------VLLNNNKINVELYKLYFPDLEYVLNAVIDSND | +BRON
BROB_BmNV__9630900 YAERNLFKIGQTTN---LTRRLATLNCGRA-DDDQMQYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLPHCS |
BROI_BmNV_13751084 YAERNLFKIGQTTN---LTRRLATLNCGRA-DDDQMQYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLPRCS |
BROK_LdNV_9631082 YAERNLFKIGQTTN---LTRRLAALNCGRA-DDDQMRYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALESCLRRCS |
BROE_BmNV_9630956 -AERNLFKIGQTTN---LTRRLVSLNCGRA-DDDQMRYVLQTE-PTV---HHTLLEKLMKQELRPYR-------------NSGEVYCTDFEHIKRALETCLPHCS |
ORF130_XnGV_9635380 YKSKHIYKIGTSRS---PAKRVRQLNCGRP-YDL--LILDHCQAAAD---QGFIVEALMLNEYKTQQ-------------LHGEWVQFADNKQYQSAKKKLDEFI |
BROG_LdNV_9631042 NRERNLYRIGRTAS---PTALLCFLNEDRH-EDR--FYLDYVS-PDVSREGSVRAERMIREHIESLQ-------------THGDFYQFATKEALDLMREAIVKIQ /
consensus/90% ....p.aKlG.s........R...bps..........bh..h...s....p...hE..b...b...p..............p.-hb.h.......h...h.....
http://www.bmm.icnet.uk/servers/3dpssm/output/a47ba25a21f354b7.job_summary.html
13. MSV199
----------------
PHD Sec. Str. HHHHHH------HHHHHHHHHHHHHHHHHH------------EEEHHHHHHHH------------------HHH------------------------------------------------HHHHHHHHHHH----EEEEE----------HHHHHHHHHHH---------EEEEE---HHHHHHHHH----HHHHHHHHHHHHHHH-
CIV146R_CIV_15078859 LIDIFIEEEQN-----------FGTILNE--MTCQ-------HKIYISKKLLKWIGYEG--------------DYKK------------------------------------------------QRDSFKKLLKRHNIDFEELKSNDIECEN--YPEIKVDMANL-SNGVISQSKWLILNIYNFKYI------------------------ |+UVRC
MSV199_MSV_9631447 MLNIFEFIEQN-NFEINLG-SWFNEIWLP--LFNK-------TELLITLNILHFIHYGTSKSVLDGNTT---LNYRE------------------------------------------------LKRDFEKILNNNKIKYKKIKYEEIVNNKNYYELVKNEIKNI-TPNNLNKSTWFILDVLQFKMLIMRLSTNVAKEICEYYVTLENILH |solo
MSV198_MSV_9631448 MLNIFEFIEQN-NFDIKLG-PWFNEIWIP--LFNE-------TELLITLNILNFIHYGTSNVVLDYHPM---RNNTN------------------------------------------------LKRDFEKILNNNKINYKKIKYNDIINNEDYYNKVKEEIENI-RPCNLEKSTWFILSVDEFKMLIMRLSTNVAIEVREYFILLEKILF \
AMV209_AMV_9964523 FVDIFTFITNN-DYDFKLG-SWFKDIWYP--LFEE-------KDVLITNDILTFIYYFPEG---SQPPP---EMFKG------------------------------------------------YKKNLIDSLNNYNIKFIEIDYKHEYVLT--NKKLKNEIKFI-TPNNILRKRWIILSVENFKLLIMRLNTKSAHYIREYYLFIEGLLY |
AMV194_AMV_9964508 FVDIFTFIKNN-NYEFKLG-EWFIDIWYP--LFER-------KDVLITNKILYFIHYGISGG-DTHPPL---EKYRL------------------------------------------------MRKDLEKILKNYNINYIKIKYYKNIDID--YNFLIDEIKNI-TPNNIIQKTWIKLSVKNFKKLILKIRTAIADDIRDYYITLEEILY |
AMV207_AMV_9964521 LMDVSTFITYN-NYDIELG-SWFKDIWFP--LFNK-------KNVVITNEILNFIYNFQVGKCFPTYNL---DNYIQ------------------------------------------------YKKDYRSFLKKNNIEYNIIKYDENILNK--YNILKSELKLY-DKHALVQKTWLILSVDDFKESIMMMNNNNSKMIRKYYIKIEKILF | +T5orf172
MSV021_MSV_9631537 VNNIFSIQNY---IYKKLYPYILNYPYLNCDLNVITN-----AMENIDKSLLSNNLY---------------TEYQN------------------------------------------------LKDNFETILTTNRIKFKKLKYHEMSDEN--REMLNSEVIKL-SMSELANTTWCILKTSDFKNLILQINTLPVEEIREYYLLIEKILL |
MSV191_MSV_9631453 MEHIYEYIENKQNENIIMN-PWIKDICLP--MYNK-------SNVLITSSILKFLYFGPKIPINDSPGYIYVDEYKKNEIYAIYYSKDNIEFPCKIKINVDDMVLVKNYLCYKLSEYKYGDSGELFKCDFDIILRAMEIPYN-----NELLTKNLEKVLIEKNITY-SKSEYNDSFRLIVHIDQFKLLINKLN---IDILSKPYEAVEKIIQ |
CIV420R_CIV_15079131 MTDLFTYIKDK-NIAIDLNSKWFQELWYP--LSKK-------TGSIITTRLLEWMGYSG--------------EYKL------------------------------------------------QRQNFKRLLDNNNIPYEEIYHNDDRFLE--HPSMIYEIEQT-DKKQIKQKRWITLEMRNFKKAILRLNTKNAEVIRDYYLNLEEACF /
CIV468L_CIV_15079179 LLDIFKFIEIT-NFDLD--PIMTNWFWQV--MVNN-------HSTHLGRVVLEWFGYEG--------------EDSN------------------------------------------------QKQKFIDMLKRNKIPYKQLKHTDNEIEL--YPSIKEEMTLLPHKGAIASSKWLVMEPFNIKMAMLRLNTKNADIIKRYYIKMEELIR \
CIV238R_CIV_15078950 ILD-----SAMNESKIKLDISWFFDNYMDQELTNVMNYFDGEEPIHINTVVLEWFGYEG--------------DLRT------------------------------------------------QKRKFIDMLKRNSIPYKELTSKE-EIEL--YPTIKEEILSLPHKGAIACSKWLVMKPYDIKIAMLRLNTKNSQIIKQYYIKMEELVR |
CIV212L_CIV_15078924 LMDLETFIDTT-GFEKD--PIMNDYFWQI--MVTK-------QRTHLSAMLLQCLGYEG--------------EFRV------------------------------------------------QQQHFKRFLKSNNIHPLELTSSDPDIKN--YPTIQDEMKLL-KPNVISNRKWLIVEPREFKKVIMKLNTKHGDRIREYYLCLEEL-- |
CIV388R_CIV_15079099 YLEIETFMDVI-GFVKD--PVMTDYFWHI--MVDN-------HCRHLATVLLECLGYEG--------------TYNK------------------------------------------------QQYAIKRFLKSNRINYSELSSDDPQIDL--YPTIKEEMKNM-KPNAIACRKWLIIEPREFKKVIMKLNTKNGDNIREYYIRLEELIK | +BROC
CIV019R_CIV_15078732 KLEINEFIDLFIG---------EENKWNK--MFDSDL-----SGIHISSLILNQLGYEG--------------EFKN------------------------------------------------QQTCFKRFLKRNNIIIQEFSSSNPELKL--YPSIQEEMKNM-KTNVIANRKWLIANPRDLKKIIMKLNTKNGDAIREYYICMDELVQ |
CIV211L_CIV_15078923 LLDIPSFMKVA-GIEFD--PIMFNHFWQV--LVDNGD-----RLPHVGETTLNWLGYEG--------------VFTK------------------------------------------------QKEKFINMLKRNQISFKELSYQDNEIQL--YPSIQKEMLLLPNESAKTKSKWLLMNPDDFKMAIMGLKTKNSEKIKRYYVTLEKTMK |
CIV148R_CIV_15078861 IVDIIKFVEIT-NFDID--PFMIDKFWHT--MYDN-------SLLYISRDILEWMGYTG--------------EFGE------------------------------------------------QRKAFKKLLKRKNINFTELSNNDPTKHL--YPEIQKDSLLL-SNAVVSQSKWIIMNSDDFKDSILMLNTKNSGKIRKYYRSFEKLLK /
consensus/85% h.pl.phbp....b.b.....h...ba....h.pp.......p...ls..lLphh.Y................php.................................................bbpphbphLpp.pI.a.blp.pc.......b..lbp-hb.h.p.s.l.pppWhlhp..phKbhhhblps..s..lpcYY..hEphh.
http://www.bmm.icnet.uk/servers/3dpssm/output/e8c2fb22ee6b2ed4.job_summary.html
14. CIV029R / BROC
----------------------------------------
PHD Sec. Str. -----------------------HHHHHHHHHHHHHHHHHHHHHH---------------------HHHHHHHHHHHHHHHHHHHHH--------------HHH----------------------EEEEEE-----------EEEEE---------------------EEEEE--------HHHHHHHHHHHH----
029R_CIV_15078742 -------------------------------------------------------------------------------*MVERLGI--------------AVED-------RSPK-LRKQAIRERFVLFKKNTERVE--KYEYYAIRGQSIYINGRLSKLQSERYPKMIILLDIFCQPNPRNLFLRFKERIDGKSEW \
ORF17_DpAV4_11931709 ------------------------------------------------------------------MGVQLDETNEQLNEMNNKLDV--------------AVED-------RAPI-PEDQSKVERFVFLKRPNE-----NYPYYAIRAQAASTKTAIRK-QQKEFGAIELLLDFETHPNTKTYYNRIKWR*------ | solos
ORF116_OpNV_9630054 ---------------------------------------------------------------------MLEDKDRRIQELYASLLE---------------MSE-------RAVQYPAKGHQTP-MLCVARE-------FNCLRAITGQKVHVTKMKREL-TD---AAELVIDA-MRPNPQVDLNNFVNRV*----- |
ORF103_EpNV_15213228 --MSPPDLTPMEKLLE-----SIENQIKIK-DEQLRKNNEMLERYIML----------------------LEEKNKRIEELYRSLME---------------MTD-------RVVQYPAKSYQTP-MLCMTRE-------FNCLRAITGQKVHVNKMKRDL-TT---AAEIIIDS-VRPNPQVDFNNIVNYVESEFKE |
ORF3_DpAV4_11931724 GHGDHPQSQHCIG-------YEPETGGEGIGGGEIREQRRLLNRLA-------------------QMGIQLNETNEQLNEMNNKLDV--------------ACED-------RAPI-PDDCSKVERFVFLKRRTS-----DYPYYAIRAKRRARRRPIRK-QQNEFGVINILLDFETHPNTKTYYNRINWALNKRGVK |
ORF122_LdNV_9631089 TDSGYDEDYEEEEDEE----QNAILAHLRATNASIREIQQKLQTLEKI---------GGILNRADADADADDDLSFL------DEPD--------------VEPD-KPPVGATVKF-PRDATKHPWLTVLAKEVRREGAVATEIAFATSRAA-ASARKRKYS-----DMSLIYQG-VHPNPQLAVCCITEEWQERGLS |
P20_LsNV_2760643 FFVNTKYFVDMEAH-------IETQQYLIK-----SIADKDVIIQHK-------------DAQIAELLNAILLANSQCMSLSKRLVD--------------IVQD-------VVVKPQNCQLLHA-LAVCELS-------CNKFAFLRTQLRSLKRSIKRLQRAEQHEPTIIYQSEYVPNSINILNKIKEQLPKDKFT /
ORF10_EpNV_15213135 LAIKTDKGYDCDD-------VRDNIKTVLKHIKTLNVNSDKFINAHKLFENQVCARFEQLEQRLETLERVPDA--------PTMP---------------------------GVIF-PRDVNKHQHLAVFVNQERG----NTQIGFARGQEEYFRKRKLEFEEE---DMHKMLET-VHPNPQMAVQCIKDRFISNGYK \
ORF12_OpNV_9629950 LAVGANKDHDRDN---LLDKIEAVLNHVKTLNTNSDKFISAHKSFKLEVGARFE-QFEQRLQTLDTKLNALQCA----APTRTAP---------------------------GVVF-PRDVTKHPHLAVFMGRVEDRG--VTQIAFARGQEEHFRKRKLEFEE----GMDVDVRG-RAPNPLLAVHCIKEEFANGGHK |
ORF13_ACNV_9627755 --LHIQTEGERDDLRDKIESVLKHVKKLNANSEKFMVTHETFKNEVGN-------RFEQFELRLHELDAKLNML-QSAEKLKTAVVAE-------------SKNG-------TVTF-PRDITKHQHLAVFSERIDD----RIKLAFVLGQERHFRKRKMRFED----DMEVLYDG-VHPNPLLAIQCINEKLYDKHYK |solos more closely related to the ones with BRON.. maybe truncated or maybe an ancestral solo
orf13_BmNV_9630821 --LHLQTEGERDDLRDKIESVLKHVKKLNTNSEKFMVTHETFKNDVGN-------RFEQFELRLNELDAKLNML-QSAEKLKTAIVTE-------------SKNG-------TVTF-PRDITKHQHLAIFSERIDD----RIKLAFVLGQERHFRKRKMRFED----DMEVLYDG-VHPNPLLAIQCINEKLYDKHYK |
ORF13H_MbNV_5565846 ----------------------QVIEKFDAFDRRVAELNDKMNMYEN----------VDDLYRRLREHHRTLERPQHMSF--LSSSNTIN-----------DDHDQRCIRFDTVRF-PRDTSKHPRLSVFVKPVEEG---GTKVAFVAGQQRRICALKRKYS-----DMEMIYDS-VHPNPQLAMQCINEELDLKNLD /
BROA_BmNV_9630998 DDIIVEKDKIIVAK-------TEQNQQLAS---ALQEANQNLIEANKG---------------LMTAFNMINDARKETAQLANRMAD--------------IAQD-------VITKPSDPRLCHS-LAVCSLG-------GDQYAFLRPQKRNLKRSLDRLSVD---NREIVYKSEYVPNAMNVLNKVKESLPRDKFK \
BROL_LdNV_9631113 KEIICKKDEIIAVK-------EDENKKLTI---SLQETNQNLIIANKG--------LLQAFEIINEARKDSENARKETAQLANRMAD--------------IAQD-------VITKPSDPRLCHS-LAVCSLG-------GDQYAFLRPQKRNMKRSLDRLSVD---SREIVYKSEYVPNAMNVLNKVKENLPRDKFK |
ORF109_XnGV_9635359 KQKIVEKDTIIAVK-------DEENKKLTV---ALQDANQNLIEANKG---------------LLQAFNIINEARKETAQLANRMAD--------------IAQD-------VIAKPSDPQLLHS-LAVCAMG-------GDQYAFVRPQKRSL----DRLSVD---EKDIVYRSDYVPNAMNVLNKVKEALPKEKYK |
ORF60_HaNV_12597545 KLMLSHKDELLAVK-------DKENEALTV---ALQNANHNLAVANQG---------------LLKAFDVVNDARKETAEIAKRMAD--------------IAQD-------VIAKPSDPQLLHS-LAVCSMG-------GDQYAFLRPQKRSLKRSLDRLSVD---EKDIVYKSDYVPNSMNVLNKVKERLPKEKYK |
BROP_LdNV_9631128 IAEESILRNEIVAK-------TEENKQLAT---ALIEANGKIILFAGA--------LVEANAGLLLANKNLHDANQTIGQMANRMAD--------------IAQD-------VIAKPSNPNLCHS-LAVCALG-------GDQYAFLRPQKRNMKRSLDRLSVD---NREIVFKREYVPNAINVLNKVKESLPRDKFK |
BroO_LdNV_9631121 AVHVATNEGREAPW-------MKDLEEFKV---VLAEKDRKIDKLTNA--------LIQSNEKNNTLTQALIAVTERTDKLANRIID--------------LAQD-------VVTKPSNPNLCHS-LAVCALG-------GDQYAFLRPQKRNMKRSLDRLSVD---NREIVFKSEYVPNAMNVLNKVKENLPRDKFK |
ORF159_XnGV_9635409 TVALQESNQKLVIT-------TEKLTDANE---KLTETNNKLVTLATA--------LVSANEGLIKANTMLNDARVETAQLANRMAD--------------VAQD-------VIAKPSDPQLLHS-LAVCSMG-------GDQYAFLRPQKRSLKRSLNRLSVD---DSQILFKSDYVPNSMNVLNKVKENLPKDKFK |
38_7_HaNV_12597608 RADHHSANENMHK---------SILGKVGDIENRLSELDHKISAIEK----------IDVLYNHLKNYHRLQTNNSN------DTALY-------------SEED---NFVNGFRL-PRDSSKHPHLGVLVRSVDQH---NTEIEFLTGQRNYYQTRKRKLK-----SGDLIYDA-VHPNPQVAVHRFNEELDMKNLS |
BROA_BmNV_9630839 ------PAVKMDTN-YGVI--EELNKKLAFASESLAEANEKIIHFANA--------LVTANAGLVQANTMLNEARRETAQLANRMAD--------------IAQD-------VIAKPNNPQLLHS-LAVCALG-------GEKYAFLRAQKRSLNRSIKRLG-----SSDVVFSSDYVPNAMNVLNKVKETLPRNQYK |
BROA_SlNV_7672865 ------PAVEMDAN-YGAI--EELNKKLTFASESLAKANEKIIHFANALVTANT-GLVQANAMLNEARKDCENARRETAQLANRMAD--------------IAQD-------VIAKPDNPQLLHS-LAVCALG-------GEEYAFLRAQKRSLNRSIKRLG-----SSDVVFSSDYVPNAMNVLNKVKETLPRNQYK | + BRON
BROC_BmNV_9630901 ------PAVEMDTN-DVIAKIDDLTQKLTVANADLAEANRSLILFANE--------MIVARRDAETARQDCENARRETAQLANRMAD--------------IAQD-------VIAKPSNPQLCHS-LAVCDVG-------NNEFAFLRPQKRSLGRSLKRLG-----SNDVIFSSDYVPNSMNVLNKVKEAIPRNKFK |
BROII_BmNV_13751087 ------PAVEMDTN-NDIAKIDDLTQKLTVANADLAEANRSLILFANE--------MIVARRDAETARKDCENARRETAQLANRMAD--------------IAQD-------VIAKPSNPQLCHS-LAVCDVG-------NNEFAFLRPQKRSLGRSLKRLG-----SNDVIFSSDYVPNSMNVLNKVKEAIPRNKFK |
BROB_LdNV_9630999 ------PAVKMDTS-GALVKIDDLTAKLTEANANLMEANKSLIVFANEMIVARR-DAETARQDCEAARQDCEAARRETAQLANRMAD--------------IAQD-------VIAKPADPRLRHT-LAVCEIG-------QNEYAFLRPQKRNFRQSLNRLSVD---DRNVVFKSEYVPNAMNVLNKVKESLPRDKFK |
BRON_LdNV_9631120 ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEIMQQKDAQVTELV-------AKVVD---------------LSE-------RAVQYPADERKHP-VLCVARD-------GTTFMAIAGQKSYVRSQKHKRNID---AASVVAEA-TRPNPTVDWNNATHRLPAKKTK |
BRO_AcNV_9627744 ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQK----KDEIMQKKDAQVTDLV-------AKVVD---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVENQKHKRNIN---VANIVVEN-IRPNPTVDWNNATDRLQAKRSK |
ORF2_AcNV_93042 ELVKKQEFIERIVA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQK----KDEIMQKKDAQVTDLV-------AKVVD---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQLTYVEMQLHLRMIM---VANIVVEN-IRPNPTVDWNNATDRLQAKRSK |
BROD_BmNV_9630955 ELFKKQEFIERIIA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEMMHKKDELLQVKDTQVSNLIAKMID---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVESQKHKRNID---AANIVVEN-IRPNPTVDWNNATDRLQSKRSK |
BROIII_BmNV_13751089 ELFKKQEFIERIIA-------IKDKQIEAK-DLQVTRVMTDLNRMYTGFQETMQR----KDEMMHKKDELLQVKDTQVSNLIAKMID---------------LSD-------RAVQYPADKRKHP-VLCVTRD-------GTTFTAITGQKTYVESQKHKRNID---AANIVVEN-IRPNPTVDWNNATDRLQSKRSK |
BROJ_LdNV_9631081 -----EQFQETMQK-----KDEQFKETIQKKDEQFKETIQKKDEQFQE----------IIQKKDAQLQETIQRKDEQIARLIDAAMD---------------LSS-------RAVQYPADERKHP-VLCVARD-------GTTFHGIAGQRRYVQSQKRKLGVK---DDDLVLET-RRPNPALDWTNATHTTSAVKRSK |
ORF114_XnGV_9635364 -VYRERELESKTNQ------LANKEKQLKNALSLIEFKENQLSEVISL-------TQKKDIQLEQQFTMLSSLMGKHIKKIE--ISD---------------SDD------------ELPQNHDT-VLMIVREN------NTTFKGIAAKRRYVDQQKQKLRYH---ESMIVVHS-KRPDPKRDWNAAMDIVVELGVK |
AMV175_AMV_9964489 RKRTQKKYIDIINN------KQDKIDILSIKLDNISKQNNELLTQNQ-----------LALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI |
AMV177_AMV_9964491 ------QKCKIDELFN---QNKKIISQNNELINKTEYQNNEILKLNKQ--------NQLALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI |
AMV057_AMV_9964371 MDIISNQKDKIDD-------LFKKIDNQSLEINNISKQNNELLTQNQ-----------LALNKLQELGINLIETKEEIKDVKDKLNV--------------VIED-------RNVK-PKEVKLQHKYLLLKNKII-----NNEYKFIRAQDQYIKTNKSNWLE----KHNVIIDEKYNPNPIDMCSRLKSKIYELDKI |
orf6_HaEV_3510491 LDIINNKQDKID-------ILTQDLEEIKNQNILTIEQNNKLLQQNQ-----------LALNKLQELGINLIESKEEIKSINNRIDT--------------IIVD-------RNIK-PSNPKLHHKYLLLKNKN------KNEYKFIRAQDKYIKNNKSLWLE----KYNTIIEEKYNPNPIDLCSRLKDKIAKLNPQ |
ORF13_SeNV_9634234 ----------------------QVIERFEWFNVQISELNKKMSTLNN----------VDELYRRLQDYHKSNNINTTMFSNNASSSTALSSSSSYENNLMGGIVDNEHTRYETVRF-PRDTSKHPRLSVFVKPSEE----GTDIAFITAQQRRHNALKRKFN-----DMEMIYDS-VHPNPQLAMHCINEELDIKQFN |
38_7l_HzNV_10442572 RADHHSANENMHK---------SILGKVGDIENRLSELDHKISAIEK----------IDVLYNHLKNYHRLQTNNSN------DTALY-------------SEED---NFVNGFRL-PRDSSKHPHLGVLVRSVDQH---NTEIEFLTGQRNYYQTRKRKLK-----SGDLIYDA-VHPNPQVAVHRFNEELDMKNLS /
468L_CIV_15079179 MKITNHRQENMLIE------SHNMLRSMGVEIKDIRHENNDLLDQNN------------------ELLERVDDVLQKVNTVQKKLDI--------------SVED-------RAPQ-PDKNTRRERFLLLKRNND-----TFPYYTIRAQEINARKALKR-QRNMYTDVTVLLDIVCHPNTKTFYVRIKDDLKSKGVE \ + MSV199
238R_CIV_15078950 QEEERKLDRLMLTE------SRNMLQTMGIEIKTVKYNNNNLIDQNN------------------ELLERVDEVLHKVDVVQTKLNI--------------SVED-------RAPQ-PDKNKRRERFLLLKRNDE-----NYPYYTIRAQDINAKKALKR-QKDMFSDVTILLDLICHPNTKTFYVRIKDDLKKKGVE |
211L_CIV_15078923 RQYMQKMGITLED-------TREEVKKVNIQNKDIKAQNEEIKAQN-------------------------EDLAFDLSDVRDRLIE--------------AAED-------RSPK-LETKPLRERFVIIKRKDS-----SFPYYAIRGQDVYVKGRLTHFKNTRYPELKIIFDTNYQPNPRNLYIRFKELKDERFII |
148R_CIV_15078861 RLERKQSEERSIK-------QEQLLLSIGYNLKELQEQKEEDTQKID-----------VLIDQNEDLKQNIEETNDKLDSVVEKLGI--------------AVED-------RAPR-LKRASIRERFVLFKKNNSTNE--IYQYYAIRGQSVYVNGRLSKLQSEKYPDMIILIDIICQPNPRNLFLRFKERIDGKPEW |
388R_CIV_15079099 --EYSLYFKEREAQ-----------IEKQKSQFHIETLEKKLDEMK-----------LEAEKRHDELLDKVEEVQYDLNVVGEKLDI--------------AVED-------RAPK-VKAELLRERFVVLNRNDKRA---SCQYYVMRGQDHYINGKIFSYK-NLHPNLKIIFDISCQPNPRNLFVRFKELKDNRFKV |
212L_CIV_15078924 ----CVYFKEREAKLQIT-TLEQKLEQMNITMIEMKEEMNLSMEEHAD-------KLDTLVDQNEELKLDVSEANEKLETVTHKLGI--------------AVED-------RSPR-LEQKPLRERFVLFKRNVKNA---RFQYYAIRGQSIYVNGRLTL-YNERYPNLEIIIDIFCQPNPRNLFLRFKNYVKDDERF /
313L_CIV_15079025 IYASKRQEQMLLE-------SHNLLKSMGIEVKDIKEQNNELLNEVG-----------ELREDNNELQEQVENVQEQIQKVQVKLEI--------------SVED-------RAPQ-PDKRGKKERFILLKRNDE-----HYPYYTIRAQDINAKKAVKR-QQGKYEEVLILLDLVCHPNTKTFYVRIKDDLKKKGVK \
006L_CIV_15078718 QEKNDKIDELILFSKRMEEDRKKDREMMIKQEKMLRELGIHLEDVSSQ--------NNELIEKVDEQVEQNAVLNFKIDNIQNKLEI--------------AVED-------RAPQ-PKQNLKRERFILLKRNDD-----YYPYYTIRAQDINARSALKR-QKNLYNEVSVLLDLTCHPNSKTLYVRVKDELKQKGVV |
AMV112_AMV_9964426 TNIIEENEITIKQK-------DDKIDELIQINKRIEEQNIKLLKLAE-----------KQNIKLDEISDELDETNYKLDTLTQTVEEN-------------ILPD-------RNIQ-PNDINLKHNLVIY-KKI------NNIIKITRAQNKYINKIKIS-------EDNIIIKE-YVPNPIDFINRMKLYCIDLNKK | kilAN
AMV110_AMV_9964424 IKQKDDKIDELNNKLD---IIITTNKILEQKSTNLENINNKLLKLAE-----------KQNIKLDEISDELDETNYKLDTLTQTVEEN-------------ILPD-------RNIQ-PNDINLKHNLVIY-KKI------NNIIKITRAQNKYINKIKIS-------EDNIIIKE-YVPNPIDFINRMKLYCIDLNKK |
AMV024_AMV_9964338 INIVEDKELEINDLNKKLSDIINQNNKILESNKNLENQNKKLLKLAE-----------KQNIKLDEIGDELDETNFKLDTLTQTVEEN-------------ILPD-------RNIS-PKDVNLKHNLVIY-KN-------NNEIKIIRAQNKYINKIKIL-------DENIIIKE-YVPNPIDFINRMKLYCVDINKK |
FPV124_FPV_9634794 HKFNNKYDKDTLE-------LKELYREQRKEAKSLRKINERIEEKYDK-------DTRELKQGLKELKDENKELKFEL----KKIEER--------------LRD-------KVIN-PFSPNKHHRLVILQKKID-----NNSFKTLRLQAERLNQEMNKY------KTNILYFL-MHTNLTQYPVLIG*-------- /
consensus/95% ...........................................................................................................................p..h.hh.............h..h.st.......t............hlhp....PNs...h.ph..........
consensus/90% ............................................................................p......................................t.......p..hhhh.............h..h.sQp..hp..t.p.........pllhp....PNs...h.php..h......
consensus/85% ..................................h....pph..................................p......ph...................s..........t.s....tc..hhhh.............h.hlpsQp..hp..t.p.........pllhc..h.PNst..h.phpp.h......
http://www.bmm.icnet.uk/servers/3dpssm/output/d454e5b32aaf504f.job_summary.html
#P63C
-----------------------
p63_BPMx8_15320633 QAVAERFLGVGLAPYAKRFPTPFYEGIFRLRGWPWHGPGTP--RPGVIAYWTNDLVYERLAP-ELLRLLRERNPMDKDTGRRAAKHHQLLSEDIGHPALAVAAKVDALNLPLEQNQVRVVFNWLQ | + BRON
orf12_BP933W_4499795 --ILEAFVAKEIQPYITTFPADYYEELFRLRGLE-YPPENPRFRPQYFGVLTNDIVYKRLAP-NILEELKKQNV----KASKGTKLFQGLTPNIGYQKL \ + kilAN
Gp73_BPHK97_9634189 ------FLLDKSQPWEKRFSDPFYSAMFKMSGLPRHRPGR---RPSLFGMISAKWVYGPVLPPEVYAEVKRR-------LAAGDKIHQHLKPD /
PHAGES: HOST
phiPV83, phi ETA, phiSLT S aureus
bIL285, bIL286, bIL311 pi3, BPphi31_1, TP901-1, RLT, BK5-T LL-H Tuc2009 Lactococcus
LcBPA2 Lactobacillus
TP-J34 Sfi21 Streptococcus thermophilus
A118: Listeria monocytogenes
HK97, N15 , HK620 , HK022 933W, VT2-Sa, phi-R73 P22 H-19B, P1 Escherichia coli
D3 Pseudomonas
APSE-1, GMSE-1 Acyrthosiphon pisum Endosymbionts
Mx8 Myxococcus xanthus
XF2506_Xf_11362060
M_XF2524_Xf_11362500
C_XF2524_11362500
N_XF2524_Xf_11362500
XF0684_Xf_11362477
XF1663_Xf_11362484
XF1645_Xf_11362483
XF0704_Xf_11362478