Score E
Sequences producing significant alignments: (bits) Value
sp|Q45493|YKQC_BACSU HYPOTHETICAL 61.5 KD PROTEIN IN ADEC-P... 392 e-108
emb|CAB13551| (Z99112) similar to hypothetical proteins [Ba... 362 4e-99
sp|P47385|Y139_MYCGE HYPOTHETICAL PROTEIN MG139 >gi|1361567... 280 3e-74
gb|AAF30984.1|AE002155_6 (AE002155) conserved hypothetical ... 279 4e-74
sp|P75497|Y139_MYCPN HYPOTHETICAL PROTEIN MG139 HOMOLOG >gi... 279 4e-74
emb|CAB73696.1| (AL139079) hypothetical protein Cj1710c [Ca... 270 3e-71
emb|CAA15548| (AL008967) hypothetical protein Rv2752c [Myco... 268 7e-71
sp|P56185|YE30_HELPY HYPOTHETICAL PROTEIN HP1430 >gi|231460... 267 1e-70
gi|4155934 (AE001555) putative [Helicobacter pylori J99] 265 6e-70
emb|CAA20296| (AL031260) hypothetical protein SC9A10.09 [St... 263 4e-69
sp|P54122|YOR4_CORGL HYPOTHETICAL 69.1 KD PROTEIN (ORF4) >g... 248 1e-64
gi|4104709 (AF039028) unknown [Streptomyces toyocaensis] 242 6e-63
sp|P54123|Y551_SYNY3 HYPOTHETICAL 70.4 KD PROTEIN SLR0551 >... 230 2e-59
emb|CAA14898| (AJ235271) unknown [Rickettsia prowazekii] 189 5e-47
dbj|BAA30170| (AP000004) 450aa long hypothetical protein [P... 157 2e-37
emb|CAB49829.1| (AJ248285) hypothetical protein [Pyrococcus... 155 7e-37
gi|2621085 (AE000797) conserved protein [Methanobacterium t... 137 3e-31
gb|AAF30921.1|AE002149_6 (AE002149) conserved hypothetical ... 135 1e-30
sp|Q58271|Y861_METJA HYPOTHETICAL PROTEIN MJ0861 >gi|212807... 134 2e-30
sp|P47662|Y423_MYCGE HYPOTHETICAL PROTEIN MG423 >gi|1361690... 129 6e-29
sp|P75174|Y423_MYCPN HYPOTHETICAL PROTEIN MG423 HOMOLOG >gi... 111 2e-23
gb|AAD25735.1|AF100324_3 (AF100324) unknown [Mycoplasma fer... 107 2e-22
emb|CAB49663.1| (AJ248285) mRNA 3'-end processing factor, p... 58 2e-07
dbj|BAA30510| (AP000006) 651aa long hypothetical protein [P... 58 2e-07
dbj|BAA79093.1| (AP000058) 420aa long hypothetical cleavage... 55 2e-06
dbj|BAA11296| (D78193) yycJ [Bacillus subtilis] >gi|2636584... 55 2e-06
sp|Q60355|Y047_METJA HYPOTHETICAL PROTEIN MJ0047 >gi|282623... 50 4e-05
gi|2650146 (AE001071) mRNA 3'-end processing factor, putati... 50 4e-05
pir||G64305 hypothetical protein YLR277c homolog - Methanoc... 50 4e-05
gi|2622312 (AE000888) cleavage and polyadenylation specific... 50 7e-05
emb|CAB57542.1| (Y18930) mRNA 3'-end polyadenylation factor... 49 9e-05
dbj|BAA29553| (AP000002) 514aa long hypothetical protein [P... 49 1e-04
sp|Q58633|YC36_METJA HYPOTHETICAL PROTEIN MJ1236 >gi|212807... 48 2e-04
emb|CAB54223.1| (Z48334) F10B5.8 [Caenorhabditis elegans] 45 0.002
gi|2650088 (AE001067) mRNA 3'-end processing factor, putati... 45 0.002
sp|Q10568|CPSB_BOVIN CLEAVAGE AND POLYADENYLATION SPECIFICI... 43 0.007
sp|Q45493|YKQC_BACSU HYPOTHETICAL 61.5 KD PROTEIN IN ADEC-PDHA INTERGENIC REGION
>gi|2633824|emb|CAB13326| (Z99111) similar to
hypothetical proteins [Bacillus subtilis] >gi|3282138
(AF012285) unknown [Bacillus subtilis]
Length = 555
Score = 392 bits (996), Expect = e-108
Identities = 204/544 (37%), Positives = 322/544 (58%), Gaps = 4/544 (0%)
Query: 9 LGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAGIFLT 68
LGG+ E GKN Y V+ +++I ++DAG+K+ E + LG+D+VIPD TYL++N D++ G+F+T
Sbjct: 14 LGGLGEIGKNTYAVQFQDEIVLIDAGIKFPEDELLGIDYVIPDYTYLVKNEDKIKGLFIT 73
Query: 69 HGHADSIGALPYIVSELKVPVFGSELTIELAKINVKNYADSRKFNDFHVVTEDTEIDFGK 128
HGH D IG +PY++ ++ +PV+G +L I L + ++ + R+ +++ ED + F K
Sbjct: 74 HGHEDHIGGIPYLLRQVNIPVYGGKLAIGLLRNKLEEHGLLRQ-TKLNIIGEDDIVKFRK 132
Query: 129 AVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEIGNSGVXX 188
+SFF TTH+IP+S GIV+ T GNIV+TGDF+FD G N+ ++AEIG GV
Sbjct: 133 TAVSFFRTTHSIPDSYGIVVKTPPGNIVHTGDFKFD-FTPVGEPANLTKMAEIGKEGVLC 191
Query: 189 XXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDAAIKLGRR 248
+ SE + E I+D +GR+I A A N+ R+QQ I+AA++ GR+
Sbjct: 192 LLSDSTNSENPEFTMSERRVGESIHDIFRKVDGRIIFATFASNIHRLQQVIEAAVQNGRK 251
Query: 249 VAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGEPLKSLGD 308
VA G M+ IE L + K + I+ EI + N++ IL TG GEP+ +L
Sbjct: 252 VAVFGRSMESAIEIGQTLGYINC-PKNTFIEHNEINRMPANKVTILCTGSQGEPMAALSR 310
Query: 309 MAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAGG-VMKMLASDLKISGHAN 367
+A+ H+ + I GD V+ +SP +++R N++Y+AG V+ +D+ SGH
Sbjct: 311 IANGTHRQISINPGDTVVFSSSPIPGNTISVSRTINQLYRAGAEVIHGPLNDIHTSGHGG 370
Query: 368 ARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVSLENGDMI 427
+ + +L + +PK +PI GEYR H LA + I E+ FI GE ++L+ +
Sbjct: 371 QEEQKLMLRLIKPKFFMPIHGEYRMQKMHVKLATDCGIPEENCFIMDNGEVLALKGDEAS 430
Query: 428 PSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSKSRVHTRG 487
+G I + + LRDR++LSE+G+ I V++I + KI + + +RG
Sbjct: 431 VAGKIPSGSVYIDGSGIGDIGNIVLRDRRILSEEGLVIVVVSIDMDDFKISAGPDLISRG 490
Query: 488 FVYVKTSRDLMREAGELVNETVDKYLSGKEFDWAEIKGSIRDALGKFLYEQTKRKPVILP 547
FVY++ S DL+ +A EL++ + K + K W+EIK I D L FLYE+TKR+P+ILP
Sbjct: 491 FVYMRESGDLINDAQELISNHLQKVMERKTTQWSEIKNEITDTLAPFLYEKTKRRPMILP 550
Query: 548 VVME 551
++ME
Sbjct: 551 IIME 554
emb|CAB13551| (Z99112) similar to hypothetical proteins [Bacillus subtilis]
Length = 515
Score = 362 bits (920), Expect = 4e-99
Identities = 191/520 (36%), Positives = 306/520 (58%), Gaps = 12/520 (2%)
Query: 37 YAEMDQLGVDFVIPDITYLLENADRVAGIFLTHGHADSIGALPYIVSELKVPVFGSELTI 96
+ E + LG+D VIPDI+YL+E ADRV IFLTHGH ++IG + Y++++L VPV+G++LT+
Sbjct: 2 HPENEMLGIDVVIPDISYLIERADRVKAIFLTHGHDENIGGVFYLLNKLSVPVYGTKLTL 61
Query: 97 ELAKINVKNYADSRKFNDFHVVTEDTEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIV 156
L + +K Y +RK D + + I F +SFF T H+IP+S+G+ T+ G+IV
Sbjct: 62 ALLREKLKQYGHNRK-TDLREIHSKSVITFQSTKVSFFRTIHSIPDSVGVSFKTSLGSIV 120
Query: 157 YTGDFRFDPAVAKGYRTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYI 216
GDF+FD A ++ +A+IGNSGV + SE + +I D +
Sbjct: 121 SAGDFKFDQTPALNQTCDIGEIAKIGNSGVLALLSDSANVERPGYTPSEAAVSGEISDAL 180
Query: 217 DDNEGRVIIACNAGNLGRIQQTIDAAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKS 276
+++ RVIIA A N+ RIQQ I AA + GR++A G+++ +++ A +L ++ D +
Sbjct: 181 YNSQNRVIIAVFASNINRIQQVIHAAAQNGRKIAVAGKNLQSVLQLARKLGYIE-ADDEL 239
Query: 277 IIKPAEIKKYADNELVILETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYE 336
I ++KKY E+ I+ G GEPL +L MA++ HK + I++GD V+ ++P E
Sbjct: 240 FISVQDVKKYPKREVAIITAGSQGEPLAALTRMANKAHKQLNIEEGDTVVIASTPIPGQE 299
Query: 337 TTIARIENEIYKAGGVMKMLASDLKISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAH 396
++ + + +AG + + +SGH + +L+ ++++ +PK LIP+ GEYR AH
Sbjct: 300 LIYSKTVDLLARAGAQVIFAQKRVHVSGHGSQEELKLMINLLKPKYLIPVNGEYRMQKAH 359
Query: 397 ADLAMEMDILPEHIFIAKRGETVSLEN-----GDMIPSGVIQAENXXXXXXXXXXXXXXX 451
+ +A E + IF+ ++G+ V GD +P G N
Sbjct: 360 SKIAEETGMKRSDIFLIEKGDVVEFRGQNVKIGDKVPYG-----NILIDGLGVGDIGNIV 414
Query: 452 LRDRKVLSEDGIFIAVITISKTERKIVSKSRVHTRGFVYVKTSRDLMREAGELVNETVDK 511
LRDR++LS+DGI I VIT+ K ++ +VS + TRGFVYV+ S L+ +A ELV V +
Sbjct: 415 LRDRRLLSQDGILIVVITLDKQKKHLVSGPEIITRGFVYVRESEGLIVQATELVRSIVTE 474
Query: 512 YLSGKEFDWAEIKGSIRDALGKFLYEQTKRKPVILPVVME 551
+W+ +K ++RDAL +FLYE+TKRKP+I+P++ME
Sbjct: 475 ATETSNVEWSTLKQAMRDALNQFLYEKTKRKPMIIPIIME 514
sp|P47385|Y139_MYCGE HYPOTHETICAL PROTEIN MG139 >gi|1361567|pir||D64215 hypothetical
protein homolog MG139 - Mycoplasma genitalium (SGC3)
>gi|3844731 (U39694) conserved hypothetical protein
[Mycoplasma genitalium]
Length = 569
Score = 280 bits (708), Expect = 3e-74
Identities = 163/554 (29%), Positives = 283/554 (50%), Gaps = 13/554 (2%)
Query: 5 KITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAG 64
KI GG++E GKNMY +E +++I ++D G+K+A D LG++ +IP +L+EN +V
Sbjct: 17 KIYAFGGIQEVGKNMYGIEYDDEIIIIDCGIKFASDDLLGINGIIPSFEHLIENQSKVKA 76
Query: 65 IFLTHGHADSIGALPYIVSELKVPV-FGSELTIELAKINVKNYADSRKFNDFHVVTEDTE 123
+F+THGH D IG +PY++ ++ +PV + + L V + D+ K N + +E
Sbjct: 77 LFITHGHEDHIGGVPYLLKQVDIPVIYAPRIAASLILKKVNEHKDA-KLNKIVTFDDFSE 135
Query: 124 IDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEIGN 183
I F+ H+IP++ GI + T +GNIV +GD+RFD A A ++ ++ +I
Sbjct: 136 FQTKHFKIDFYRVNHSIPDAFGICVQTPNGNIVQSGDYRFDFA-AGSEMLDVHKVVKIAE 194
Query: 184 SGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDAAI 243
V S SE IY I + + GRVI+ A N+ RI + I+ A+
Sbjct: 195 RNVHVFMSESTNAEVPGFSQSEKLIYRNIQKILKEARGRVILTTFASNITRINEIIEIAL 254
Query: 244 KLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGEPL 303
R++ G+ MD + + ++ L +D I++ +IK Y D ++IL TG GE
Sbjct: 255 NNKRKICLLGKSMDVNVNISRKIG-LMAIDSNDIVEVRDIKNYPDRNILILCTGSQGEEA 313
Query: 304 KSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAGGVMKMLASDLKI- 362
+L MA +H +V +K D ++ ++P + + NE+ K G + +S LK+
Sbjct: 314 AALNTMARGKHNWVSLKSTDTIIMSSNPIPGNYAAVENLLNELSKFGVAIYENSSQLKLH 373
Query: 363 -SGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVSL 421
SGHA ++LQ +L++ PK LIPI GE++ + ++A E I E + + G+ + L
Sbjct: 374 ASGHATQQELQLMLNLMFPKYLIPIHGEFKMMRTIKNIANECGIKSEDVALLSNGQVMYL 433
Query: 422 ENGDMIPSG-VIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSK 480
+ ++ S +I A+ ++ R++LS DG+F AVI + I+
Sbjct: 434 IDEELYYSNEIINADPIYIESHNSSPDLARIIKQRQILSRDGMF-AVIVVFDKNNNIIGI 492
Query: 481 SRVHTRGFVYVKTSRDLMREAGELVNETVDKYLSGKEFD-----WAEIKGSIRDALGKFL 535
+ TRG + S LM + V T++ + K+F+ E+K ++ + F+
Sbjct: 493 PTLITRGCFFALDSNPLMTKIAHSVKRTLESVIQSKKFNSHEQLTKELKRVCKETVSYFI 552
Query: 536 YEQTKRKPVILPVV 549
++ R P+I V+
Sbjct: 553 WKNKNRNPLISTVL 566
gb|AAF30984.1|AE002155_6 (AE002155) conserved hypothetical [Ureaplasma urealyticum]
Length = 596
Score = 279 bits (706), Expect = 4e-74
Identities = 170/577 (29%), Positives = 297/577 (51%), Gaps = 18/577 (3%)
Query: 1 MSNIKITP-----LGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYL 55
M N K +P LGG+ E GKN Y+VE E++I ++DAG+K+A G D + + YL
Sbjct: 1 MENTKKSPTYVYALGGLEEIGKNTYVVEHEDEIILIDAGIKFANASLPGFDGTVANFDYL 60
Query: 56 LENADRVAGIFLTHGHADSIGALPYIVSELKV-PVFGSELTIELAKINVKNYADSRKFND 114
++N ++ + +THGH D IG +P+I+ +K+ ++ L +L + + Y D +
Sbjct: 61 IKNNHKIHSLVVTHGHEDHIGGIPHILRHVKIKTIYAPTLAAKLIERRLSEYKDIKP--P 118
Query: 115 FHVVTEDTEIDFGKAV-ISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRT 173
+V ED + K + F+ H+IP+S GI + T +G IV TGDFRFD A A G T
Sbjct: 119 RIIVFEDESMYKTKYFEVDFYRVCHSIPDSFGICVKTPNGYIVTTGDFRFDFATA-GDET 177
Query: 174 NMRRLAEIGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLG 233
N+ ++++I N G+ S SE + + I DY+ + +GR I+ A NLG
Sbjct: 178 NLAKISQISNRGISVLMCESTSAEIPGFSESERYVIDNIRDYMVNIKGRTFISTFASNLG 237
Query: 234 RIQQTIDAAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVI 293
R+++ I A+ L +++ G+ M+ I+T+ +L L V + S I E+ Y D+E+V+
Sbjct: 238 RVEEIIAIAVGLNKKICIIGKSMEANIKTSRKLGYLN-VPESSFITHKELPFYKDHEIVV 296
Query: 294 LETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAGGVM 353
+ TG GE + +L MA+ H + +K D ++ ++P + + N++YK G +
Sbjct: 297 ILTGSQGEKMAALNVMANNNHSKITLKPSDTIILSSNPIPGNYAQVEAMVNKLYKLGLTV 356
Query: 354 KMLASDLKI--SGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIF 411
+ + KI SGHA + Q ++ P L PI GEY+ A A++ +H+
Sbjct: 357 YENSPNKKIHASGHATRSEHQLMIKAINPSYLFPIHGEYKMFRALKQNAVDQGFDKDHVI 416
Query: 412 IAKRGETVSLENGDMIPSGV-IQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITI 470
IA G+ + L +G + + + + AE L +R VLS DGI V+
Sbjct: 417 IAPNGQKLQLLDGVLSHTNIYVDAEPKFINGYEISSKISKLLSERVVLSSDGILNLVLNA 476
Query: 471 SKTERKIVSKSRVHTRGFVYVKTSRDLMREAGELVNETVDKYLSGKEFDWAEIKGSIRDA 530
+ K+ S + TRG + K S +L+ + + ++++ L+ KEFD ++K +
Sbjct: 477 DFKKAKLNSAVSISTRGCFFAKESTNLINKISNVAKSSLEEALAKKEFDEKKLKEIVSGN 536
Query: 531 LGKFLYEQTKRKPVILPVVMEARQPQDLNKRYTKKNH 567
+ +++ K+ P+I ++ DL +++ K N+
Sbjct: 537 VKSIVWKWRKKNPIINVTIIN----NDLVEQFRKDNN 569
sp|P75497|Y139_MYCPN HYPOTHETICAL PROTEIN MG139 HOMOLOG >gi|2146372|pir||S73881 MG139
homolog A65_orf569 - Mycoplasma pneumoniae (SGC3) (ATCC
29342) >gi|1674253 (AE000054) Mycoplasma pneumoniae,
MG139 homolog, from M. genitalium [Mycoplasma
pneumoniae]
Length = 569
Score = 279 bits (706), Expect = 4e-74
Identities = 166/554 (29%), Positives = 284/554 (50%), Gaps = 13/554 (2%)
Query: 5 KITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAG 64
KI GG++E GKNMY +E +++I ++D G+K+A D LG+D +IP YL+EN +V
Sbjct: 17 KIFAFGGIQEVGKNMYGIEYDDEIIIIDCGIKFASDDLLGIDGIIPSFEYLIENQAKVKA 76
Query: 65 IFLTHGHADSIGALPYIVSELKVPV-FGSELTIELAKINVKNYADSRKFNDFHVVTEDTE 123
+F+THGH D IG +PY++ ++ VPV + + L V + D+ K N V + +
Sbjct: 77 LFITHGHEDHIGGVPYLLKQVDVPVIYAPRIAASLILKKVNEHKDA-KLNKVVVYDDFSN 135
Query: 124 IDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEIGN 183
+ I F+ H+IP++ G+ + T +GNIV +GDFRFD A A G ++ ++ +I
Sbjct: 136 FETKHFKIDFYRVNHSIPDAFGVCVQTPNGNIVESGDFRFDFA-AGGEMLDVHKVVKIAE 194
Query: 184 SGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDAAI 243
V S SE IY I I + GRVI+ A N+ RI + I+ A+
Sbjct: 195 RNVHVFMCETTNAEIPGFSQSEKLIYRNINKIIKEARGRVILTTFASNITRINEIIEIAV 254
Query: 244 KLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGEPL 303
R+V G+ MD + + ++ L +D I++ +IK Y D ++IL TG GE
Sbjct: 255 NNKRKVCLLGKSMDVNVNISRKIG-LMDIDSNDIVEVRDIKNYPDRSILILCTGSQGEDS 313
Query: 304 KSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAGGVMKMLASDLKI- 362
+L MA +H +V +K D ++ ++P + + NE+ K G + + ++K+
Sbjct: 314 AALNTMARGKHNWVSLKSTDTIIMSSNPIPGNYAAVENLLNELSKYGVTIFENSPNMKLH 373
Query: 363 -SGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVSL 421
SGHA ++LQ +L++ P+ LIPI GEY+ + ++A E I + + + G+ + L
Sbjct: 374 ASGHATQQELQLMLNLVFPRYLIPIHGEYKMMRTIKNIAQECGINGDDVGLLANGQVMYL 433
Query: 422 ENGDMIPSG-VIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSK 480
+G + SG VI A+ ++ R++LS +G+F AVI + I+
Sbjct: 434 IDGKLYYSGEVINADPIYIESRNSSPDLARVIKQRQILSREGMF-AVIVVFDKNNNILGM 492
Query: 481 SRVHTRGFVYVKTSRDLMREAGELVNETVDKYLSGKEFDW-----AEIKGSIRDALGKFL 535
+ TRG + S LM + + ++ + K F+ E+K ++ + F+
Sbjct: 493 PTLITRGCFFALDSSPLMTKITHSIKRGLENVIQNKRFNTREQMIKELKRVCKETVSYFI 552
Query: 536 YEQTKRKPVILPVV 549
++ R P+I V+
Sbjct: 553 WKNKSRNPLISTVL 566
emb|CAB73696.1| (AL139079) hypothetical protein Cj1710c [Campylobacter jejuni]
Length = 664
Score = 270 bits (682), Expect = 3e-71
Identities = 154/551 (27%), Positives = 271/551 (48%), Gaps = 10/551 (1%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVA 63
I+ITPLGG+ E G N+ + E + ++D G+ + + GVD +IPD Y+ + D++
Sbjct: 116 IRITPLGGLGEIGGNISVFETNKDAIIVDIGMSFPDGTMHGVDIIIPDFDYVRKIKDKIR 175
Query: 64 GIFLTHGHADSIGALPYIVSELKVPVFGSELTIELAKINVKNYADSRKFNDFHVVTEDTE 123
GI +TH H D IGA+PY E + P++ + L + + + + + F V +
Sbjct: 176 GIVITHAHEDHIGAVPYFFKEFQFPIYATPLALGMISNKFEEHGLKAERKWFRPVEKRRV 235
Query: 124 IDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEIGN 183
+ G+ I + TH+I ++ + I T G I++TGDF+ D GY T++ RLA G
Sbjct: 236 YEIGEFDIEWIHITHSIIDASALAIKTKAGTIIHTGDFKIDQTPIDGYPTDLGRLAHYGE 295
Query: 184 SGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDAAI 243
GV + SE + +GRVI++ + N+ R+ Q I +
Sbjct: 296 EGVLCLLSDSTNSYKEGYTKSESSVGPTFDQIFARTKGRVIMSTFSSNIHRVYQAITYGL 355
Query: 244 KLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGEPL 303
K GR+V G M++ + T L +++ D+K I E+ KY DNE++I+ TG GE +
Sbjct: 356 KYGRKVCVIGRSMERNLYTTMELGYIKL-DRKIFIDADEVSKYKDNEVLIVTTGSQGETM 414
Query: 304 KSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAGG-VMKMLASDLKI 362
+L MA HK++KIK D V+ E +++ + + + KAG V S++ +
Sbjct: 415 SALYRMATDEHKFIKIKPTDQVIISAKAIPGNEASVSAVLDYLLKAGAKVAYQEFSEIHV 474
Query: 363 SGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVSLE 422
SGHA+ + + +L + +PK +P+ GEY ++ H + AM+ I +I++ G+ V L
Sbjct: 475 SGHASIEEQKLMLTLTKPKFFLPVHGEYNHITKHKETAMKCGIPERNIYLMSDGDQVELC 534
Query: 423 NGDMIPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSKSR 482
+ ++ + DR+ L++ GI + + I K + +++K R
Sbjct: 535 QKYVKRIKTVKTGKVFVDNQINKQIADDVVIDRQKLADSGIVVIIAQIDKATKTLINKPR 594
Query: 483 VHTRGFVYVK----TSRDLMREAGELVNETVDKYLSGKEFDWAEIKGSIRDALGKFLYEQ 538
V + G V K S+D+ G+ D+ L+ F ++ IR L K ++ +
Sbjct: 595 VFSYGLVADKHDHAFSKDMAEVLGQFFINVKDEVLNDPRF----LENQIRQVLRKHIFRK 650
Query: 539 TKRKPVILPVV 549
K+ P I+P +
Sbjct: 651 IKKYPTIVPTI 661
emb|CAA15548| (AL008967) hypothetical protein Rv2752c [Mycobacterium
tuberculosis]
Length = 558
Score = 268 bits (679), Expect = 7e-71
Identities = 160/550 (29%), Positives = 282/550 (51%), Gaps = 10/550 (1%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVA 63
+++T LGG+ E G+NM + E ++ ++D G+ + D+ GVD ++PD+ ++ + D +
Sbjct: 16 LRVTALGGINEIGRNMTVFEHLGRLLIIDCGVLFPGHDEPGVDLILPDMRHVEDRLDDIE 75
Query: 64 GIFLTHGHADSIGALPYIVS-ELKVPVFGSELTIELAKINVKNYADSRKFNDFHVVTEDT 122
+ LTHGH D IGA+P+++ +PV GS+ T+ L + Y + F + V E
Sbjct: 76 ALVLTHGHEDHIGAIPFLLKLRPDIPVVGSKFTLALVAEKCREYRITPVFVE---VREGQ 132
Query: 123 EIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEIG 182
G +F+ H+ P++L I + T G I++TGD +FD G T++ ++ +G
Sbjct: 133 STRHGVFECEYFAVNHSTPDALAIAVYTGAGTILHTGDIKFDQLPPDGRPTDLPGMSRLG 192
Query: 183 NSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDAA 242
++GV SE E+ ++ I +GRVI+AC A N+ R+QQ IDAA
Sbjct: 193 DTGVDLLLCDSTNAEIPGVGPSESEVGPTLHRLIRGADGRVIVACFASNVDRVQQIIDAA 252
Query: 243 IKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGEP 302
+ LGRRV+F G M + + A +L L++ D +I A + A +++V++ TG GEP
Sbjct: 253 VALGRRVSFVGRSMVRNMRVARQLGFLRVAD-SDLIDIAAAETMAPDQVVLITTGTQGEP 311
Query: 303 LKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAGG-VMKMLASDLK 361
+ +L M+ H+ + + GDL++ +S E + + + + K G V+ + +
Sbjct: 312 MSALSRMSRGEHRSITLTAGDLIVLSSSLIPGNEEAVFGVIDALSKIGARVVTNAQARVH 371
Query: 362 ISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVSL 421
+SGHA A +L FL + RP+N++P+ G +R L A+A LA + E I +A+ G +V L
Sbjct: 372 VSGHAYAGELLFLYNGVRPRNVMPVHGTWRMLRANAKLAASTGVPQESILLAENGVSVDL 431
Query: 422 ENGDMIPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSKS 481
G SG + L +R +LS + + V+ T + + +
Sbjct: 432 VAGKASISGAVPVGKMFVDGLIAGDVGDITLGERLILSSGFVAVTVVVRRGTGQPLAA-P 490
Query: 482 RVHTRGFVYVKTSRDLMREAGELVNETVDKYLSGKEFDWAEIKGSIRDALGKFLYEQTKR 541
+H+RGF + A V ++ ++ D I +R +GK++ E +R
Sbjct: 491 HLHSRGF---SEDPKALEPAVRKVEAELESLVAANVTDPIRIAQGVRRTVGKWVGETYRR 547
Query: 542 KPVILPVVME 551
+P+I+P V+E
Sbjct: 548 QPMIVPTVIE 557
sp|P56185|YE30_HELPY HYPOTHETICAL PROTEIN HP1430 >gi|2314602|gb|AAD08469.1| (AE000643)
conserved hypothetical ATP-binding protein [Helicobacter
pylori 26695]
Length = 689
Score = 267 bits (676), Expect = 1e-70
Identities = 151/550 (27%), Positives = 282/550 (50%), Gaps = 10/550 (1%)
Query: 2 SNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADR 61
+++KITPLGG+ E G NM ++E + V+DAG+ + + GVD +IPD +YL + D+
Sbjct: 139 ASVKITPLGGLGEIGGNMMVIETPKSAIVIDAGMSFPKEGLFGVDILIPDFSYLHQIKDK 198
Query: 62 VAGIFLTHGHADSIGALPYIVSELKVPVFGSELTIELAKINVKNYADSRKFNDFHVVTED 121
+AGI +TH H D IGA PY+ EL+ P++G+ L++ L + + + F +V +
Sbjct: 199 IAGIIITHAHEDHIGATPYLFKELQFPLYGTPLSLGLIGSKFDEHGLKKYRSYFKIVEKR 258
Query: 122 TEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEI 181
I G+ +I + TH+I +S + I T G I++TGDF+ D T++ RLA
Sbjct: 259 CPISVGEFIIEWIHITHSIIDSSALAIQTKAGTIIHTGDFKIDHTPVDNLPTDLYRLAHY 318
Query: 182 GNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDA 241
G GV + + SE I + +GRVI++ + N+ R+ Q I
Sbjct: 319 GEKGVMLLLSDSTNSHKSGTTPSESTIAPAFDTLFKEAQGRVIMSTFSSNIHRVYQAIQY 378
Query: 242 AIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGE 301
IK R++A G M++ ++ A L + + +S I+ E+ KY DNE++I+ TG GE
Sbjct: 379 GIKYNRKIAVIGRSMEKNLDIARELGYIHL-PYQSFIEANEVAKYPDNEILIVTTGSQGE 437
Query: 302 PLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIEN-EIYKAGGVMKMLASDL 360
+ +L MA H+++ IK DLV+ E +++ + N I K V ++
Sbjct: 438 TMSALYRMATDEHRHISIKPNDLVIISAKAIPGNEASVSAVLNFLIKKEAKVAYQEFDNI 497
Query: 361 KISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVS 420
+SGHA + + +L + +PK +P+ GEY ++ H A+ + ++I++ + G+ V
Sbjct: 498 HVSGHAAQEEQKLMLRLIKPKFFLPVHGEYNHVARHKQTAISCGVPEKNIYLMEDGDQVE 557
Query: 421 LENGDMIPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSK 480
+ + G I++ ++ R+ ++ G+F+A I ++K ++ ++
Sbjct: 558 VGPAFIKKVGTIKSGKSYVDNQSNLSIDTSIVQQREEVASAGVFVATIFVNKNKQALLES 617
Query: 481 SRVHTRGFVYVKTSRDLMRE----AGELVNETVDKYLSGKEFDWAEIKGSIRDALGKFLY 536
S+ + G V K + L++E L+ + + L+ + +++ R+ + K L+
Sbjct: 618 SQFSSLGLVGFKDEKPLIKEIQGGLEVLLKSSNAEILNNPK----KLEDHTRNFIRKALF 673
Query: 537 EQTKRKPVIL 546
++ ++ P I+
Sbjct: 674 KKFRKYPAII 683
gi|4155934 (AE001555) putative [Helicobacter pylori J99]
Length = 692
Score = 265 bits (671), Expect = 6e-70
Identities = 151/550 (27%), Positives = 281/550 (50%), Gaps = 10/550 (1%)
Query: 2 SNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADR 61
+++KITPLGG+ E G NM ++E + V+DAG+ + + GVD +IPD +YL + D+
Sbjct: 142 ASVKITPLGGLGEIGGNMMVIETPKSAIVIDAGMSFPKEGLFGVDILIPDFSYLHQIKDK 201
Query: 62 VAGIFLTHGHADSIGALPYIVSELKVPVFGSELTIELAKINVKNYADSRKFNDFHVVTED 121
+AGI +TH H D IGA PY+ EL+ P++G+ L++ L + + + F +V +
Sbjct: 202 IAGIIITHAHEDHIGATPYLFKELQFPLYGTPLSLGLIGSKFDEHGLKKYRSYFKIVEKR 261
Query: 122 TEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEI 181
I G+ +I + TH+I +S + I T G I++TGDF+ D T++ RLA
Sbjct: 262 CPISVGEFIIEWIHITHSIIDSSALAIQTKAGTIIHTGDFKIDHTPVDNLPTDLYRLAHY 321
Query: 182 GNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDA 241
G GV + + SE I + +GRVI++ + N+ R+ Q I
Sbjct: 322 GEKGVMLLLSDSTNSHKSGTTPSESTIAPAFDTLFKEAQGRVIMSTFSSNIHRVHQAIQY 381
Query: 242 AIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGE 301
IK R++A G M++ ++ A L + + +S I+ E+ KY DNE++I+ TG GE
Sbjct: 382 GIKYNRKIAVIGRSMEKNLDIARELGYIHL-PYQSFIEANEVAKYPDNEVLIVTTGSQGE 440
Query: 302 PLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIEN-EIYKAGGVMKMLASDL 360
+ +L MA H+++ IK DLV+ E +++ + N I K V ++
Sbjct: 441 TMSALYRMATDEHRHISIKPNDLVIISAKAIPGNEASVSAVLNFLIKKEAKVAYQEFDNI 500
Query: 361 KISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVS 420
+SGHA + + +L + +PK +P+ GEY ++ H A+ + ++I++ + G+ V
Sbjct: 501 HVSGHAAQEEQKLMLRLIKPKFFLPVHGEYNHVARHKQTAIACGVPEKNIYLMEDGDQVE 560
Query: 421 LENGDMIPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSK 480
+ + G I++ ++ R+ ++ G+F A I ++K ++ ++
Sbjct: 561 VGPAFIKKVGTIKSGKSYVDNQSNLSIDTSIVQQREEVASAGVFAATIFVNKNKQALLES 620
Query: 481 SRVHTRGFVYVKTSRDLMRE----AGELVNETVDKYLSGKEFDWAEIKGSIRDALGKFLY 536
S+ + G V K + L++E L+ + + L+ + +++ R+ + K L+
Sbjct: 621 SQFSSLGLVGFKDEKHLIKEIQGGLEVLLKSSNAEILNNPK----KLEDHTRNFIRKALF 676
Query: 537 EQTKRKPVIL 546
++ ++ P I+
Sbjct: 677 KKFRKYPAII 686
emb|CAA20296| (AL031260) hypothetical protein SC9A10.09 [Streptomyces coelicolor]
Length = 561
Score = 263 bits (664), Expect = 4e-69
Identities = 164/551 (29%), Positives = 283/551 (50%), Gaps = 12/551 (2%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVA 63
+++TPLGG+ E G+NM + E ++ ++D G+ + E +Q G+D ++PD T + + D +
Sbjct: 19 LRVTPLGGLGEIGRNMTVFEYGGRLLIVDCGVLFPEEEQPGIDLILPDFTSIRDRLDDIE 78
Query: 64 GIFLTHGHADSIGALPYIVSEL-KVPVFGSELTIELAKINVKNYADSRKFNDFHV-VTED 121
GI LTHGH D IG +P+++ E +P+ GS+LT+ L + ++ + + + + V E
Sbjct: 79 GIVLTHGHEDHIGGVPFLLREKPDIPLIGSKLTLALIEAKLQEH----RIRPYTLEVAEG 134
Query: 122 TEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEI 181
G F + H+IP++L + I T G +V+TGDF+ D G T++ A +
Sbjct: 135 HRERVGPFDCEFVAVNHSIPDALAVAIRTPAGMVVHTGDFKMDQLPLDGRLTDLHAFARL 194
Query: 182 GNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDA 241
G+ E +I + + R+I+A A ++ RIQQ +DA
Sbjct: 195 SEEGIDLLLADSTNAEVPGFVPPERDISNVLRQVFANARKRIIVASFASHVHRIQQILDA 254
Query: 242 AIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGE 301
A + GRRVAF G M + + A L L+ V ++ + D+E+V++ TG GE
Sbjct: 255 AHEYGRRVAFVGRSMVRNMGIARDLGYLK-VPPGLVVDVKTLDDLPDSEVVLVCTGSQGE 313
Query: 302 PLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAG-GVMKMLASDL 360
P+ +L MA+R H+ ++I +GD V+ +S E + R+ N + + G V+ + +
Sbjct: 314 PMAALSRMANRDHQ-IRIVNGDTVILASSLIPGNENAVYRVINGLTRWGANVVHKGNAKV 372
Query: 361 KISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVS 420
+SGHA+A +L + +I RPKNL+P+ GE+R L A+A+L + + I IA+ G V
Sbjct: 373 HVSGHASAGELLYFYNICRPKNLMPVHGEWRHLRANAELGALTGVPHDRIVIAEDGVVVD 432
Query: 421 LENGDMIPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSK 480
L G +G +QA L+DRK+L ++GI + + + KI
Sbjct: 433 LVEGKAKITGKVQAGYVYVDGLSVGDVGEPALKDRKILGDEGIISVFVVMDSSTGKITGG 492
Query: 481 SRVHTRGFVYVKTSRDLMREAGELVNETVDKYLSGKEFDWAEIKGSIRDALGKFLYEQTK 540
V RG ++ V E +++ + +++ IR LGK++ + +
Sbjct: 493 PHVQARGSGIEDSA---FAAVLPKVTEALERSAQDGVVEPHQMQQLIRRTLGKWVSDTYR 549
Query: 541 RKPVILPVVME 551
R+P+ILPVV+E
Sbjct: 550 RRPMILPVVVE 560
sp|P54122|YOR4_CORGL HYPOTHETICAL 69.1 KD PROTEIN (ORF4) >gi|1200047|emb|CAA64951|
(X95649) function unknown [Corynebacterium glutamicum]
Length = 645
Score = 248 bits (626), Expect = 1e-64
Identities = 142/490 (28%), Positives = 248/490 (49%), Gaps = 8/490 (1%)
Query: 2 SNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADR 61
+ ++I LGG+ E G+NM + E ++ ++D G+ + + GVD ++PD + ++ R
Sbjct: 152 NGLRIYALGGISEIGRNMTVFEYNNRLLIVDCGVLFPSSGEPGVDLILPDFGPIEDHLHR 211
Query: 62 VAGIFLTHGHADSIGALPYIVSELK-VPVFGSELTIELAKINVKNYADSRKFNDFHVVTE 120
V + +THGH D IGA+P++++ +P+ S T+ L K + K + V E
Sbjct: 212 VDALVVTHGHEDHIGAIPWLLNVRNDIPILASRFTLALIAAKCKEHRQRPKLIE---VNE 268
Query: 121 DTEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAE 180
+ D G I F++ H+IP+ LG+ I T G +++TGD + D G T++ L+
Sbjct: 269 QSNEDRGPFNIRFWAVNHSIPDCLGLAIKTPAGLVIHTGDIKLDQTPPDGRPTDLPALSR 328
Query: 181 IGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTID 240
G+ GV S SE ++ + + D + RVI+A A N+ R+Q +D
Sbjct: 329 FGDEGVDLMLCDSTNATTPGVSGSEADVAPTLKRLVGDAKQRVILASFASNVYRVQAAVD 388
Query: 241 AAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMG 300
AA+ R+VAF G M + +E A +L L+ + +II + + A ++++++ TG G
Sbjct: 389 AAVASNRKVAFNGRSMIRNMEIAEKLGYLK-APRGTIISMDDASRMAPHKVMLITTGTQG 447
Query: 301 EPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAGGVMKMLASDL 360
EP+ +L MA R H+ + ++DGDL++ +S E + + N + + G + + D
Sbjct: 448 EPMAALSRMARREHRQITVRDGDLIILSSSLVPGNEEAVFGVINMLAQIGATV-VTGRDA 506
Query: 361 KI--SGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGET 418
K+ SGH + +L FL + RPKN +P+ GE+R L A+ +LA+ + +++ +A+ G
Sbjct: 507 KVHTSGHGYSGELLFLYNAARPKNAMPVHGEWRHLRANKELAISTGVNRDNVVLAQNGVV 566
Query: 419 VSLENGDMIPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIV 478
V + NG G I N L DR L E G+ I +++
Sbjct: 567 VDMVNGRAQVVGQIPVGNLYVDGVTMGDIDADILADRTSLGEGGLISITAVIDNRTGRLL 626
Query: 479 SKSRVHTRGF 488
+ V T GF
Sbjct: 627 ERPTVQTSGF 636
gi|4104709 (AF039028) unknown [Streptomyces toyocaensis]
Length = 528
Score = 242 bits (611), Expect = 6e-63
Identities = 155/536 (28%), Positives = 266/536 (48%), Gaps = 10/536 (1%)
Query: 19 MYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAGIFLTHGHADSIGAL 78
M + E ++ ++D G+ + E Q GVD ++PD T + + D V + LTHGH D IG +
Sbjct: 1 MTVFEHAGKLLIVDCGVLFPEETQPGVDVILPDFTSIRDRLDDVVAVVLTHGHEDHIGGV 60
Query: 79 PYIVSELK-VPVFGSELTIELAKINVKNYADSRKFNDFHVVTEDTEIDFGKAVISFFSTT 137
PY++ E + +PV GS+LT+ + +K + + V E G F +
Sbjct: 61 PYLLRERRDIPVVGSKLTLAFLEAKLKEHGIRPRTVR---VREGDRRGLGPFDCEFVAVN 117
Query: 138 HTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEIGNSGVXXXXXXXXXXX 197
H+IP+SL + I T G +++TGDF+ D T++R A +G GV
Sbjct: 118 HSIPDSLAVAIRTRAGMVLHTGDFKMDQFPLDDRITDLRAFARLGEEGVDLFLTDSTNAE 177
Query: 198 XTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDAAIKLGRRVAFTGEDMD 257
+ SE E+ I + RVI++ A ++ RIQQ +DAA + GR+VAF G M
Sbjct: 178 VPGFTTSERELNPAIEQVMRTAPRRVIVSSFASHVHRIQQVLDAAHQHGRKVAFVGRSMV 237
Query: 258 QIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGEPLKSLGDMAHRRHKYV 317
+ + A L L+ V ++ E++K D+++ ++ TG GEP+ +L MA+R H +
Sbjct: 238 RNMGIARDLGYLK-VPSGLVVSTKELEKLPDHKITLVCTGSQGEPMAALSRMANRDH-MI 295
Query: 318 KIKDGDLVLAVTSPSVSYETTIARIENEIYKAGG-VMKMLASDLKISGHANARDLQFLLD 376
+I GD VL +S E I R+ N + + G V+ + + +SGHA+A +L + +
Sbjct: 296 RIGKGDTVLLASSLIPGNENAIYRVINGLTRWGAHVVHKGNAKVHVSGHASAGELVYCYN 355
Query: 377 IFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVSLENGDMIPSGVIQAEN 436
I +P+N++P+ GE+R L A+ DLA+ + PE + IA+ V L +G +G + A N
Sbjct: 356 IVKPRNVMPVHGEWRHLRANGDLAIRTGVDPERVVIAEDSVIVDLVDGRASITGKVPAGN 415
Query: 437 XXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSKSRVHTRGFVYVKTSRD 496
L+DR L+ +G+ V + + RGFV+ +
Sbjct: 416 VYVDGMEVGGATEASLKDRLTLAAEGVVTVVAIVDADTGALAEAPDFLARGFVHDDAT-- 473
Query: 497 LMREAGELVNETVDKYLSGKEFDWAEIKGSIRDALGKFLYEQTKRKPVILPVVMEA 552
++ +T+ D +++ I A+ + + +RKP+I+PV+++A
Sbjct: 474 -FEPVVPVIEKTLATAAEEGVGDAHQLEQLIARAVANWAFRTYRRKPLIIPVIIDA 528
sp|P54123|Y551_SYNY3 HYPOTHETICAL 70.4 KD PROTEIN SLR0551 >gi|1001381|dbj|BAA10871|
(D64006) hypothetical protein [Synechocystis sp.]
Length = 640
Score = 230 bits (581), Expect = 2e-59
Identities = 154/581 (26%), Positives = 285/581 (48%), Gaps = 43/581 (7%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVA 63
+KI PLGG+ E GKN + E +++I +LDAGL + D GV+ V+PD+TYL EN +++
Sbjct: 10 LKILPLGGLHEIGKNTCVFEYDDEILLLDAGLAFPTDDMHGVNVVLPDMTYLRENREKIK 69
Query: 64 GIFLTHGHADSIGALPYIVSELKVP-VFGSELTIELAKINVKNYADSRKFNDFHVVTEDT 122
G+ +THGH D IG + Y + + +P ++G L + L + ++ A + + V+
Sbjct: 70 GMVVTHGHEDHIGGIAYHLKQFDIPIIYGPRLAMALLRDKLEE-AGMLERTNLQTVSPRE 128
Query: 123 EIDFGKA-VISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEI 181
+ GK+ V+ F TH+I +S + I T G ++++GDF+ D G +++++AE
Sbjct: 129 MVRLGKSFVVEFIRNTHSIADSYCLAIHTPLGVVMHSGDFKIDHTPIDGEFFDLQKVAEY 188
Query: 182 GNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDA 241
G GV + SE + + EGR+++ A ++ R+ +
Sbjct: 189 GEKGVLCLLSDSTNAEVPGITPSEASVIPNLDRVFSQAEGRLMVTTFASSVHRVNIILSL 248
Query: 242 AIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGE 301
A K R+VA G M +I A +L ++ D + A + D + +IL TG GE
Sbjct: 249 AQKHQRKVAVVGRSMLNVIAHARKLGYIKCPDNLFVPLKA-ARNLPDQQQLILTTGSQGE 307
Query: 302 PLKSLGDMAHRRHKYVKIKDGDLVLAVTSP----SVSYETTIARIENEIYKAGGVMKMLA 357
PL ++ +++ H +KI+ GD V+ +P +++ TI R+ + + V+
Sbjct: 308 PLAAMTRISNGEHPQIKIRQGDTVVFSANPIPGNTIAVVNTIDRL---MMQGANVIYGKH 364
Query: 358 SDLKISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGE 417
+ +SGHA+ + + LL + RPK +P+ GE+R L H+ +A I E+I I G+
Sbjct: 365 QGIHVSGHASQEEHKMLLALTRPKFFVPVHGEHRMLVKHSQMAQAQGIPSENIVIVNNGD 424
Query: 418 TVSLENGD------MIPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITIS 471
+ L GD +PSG+ + + +R+ ++EDG+ +S
Sbjct: 425 VIEL-TGDRIRVAGQVPSGIELVDQ-------AGIVHESTMAERQQMAEDGLVTVAAALS 476
Query: 472 KTERKIVSKSRVHTRGFVYVKTSRDLMREAGELVNETVDKYLSGK------------EFD 519
KT +++ VH RG V + L EL+ T++ +L+ + E
Sbjct: 477 KT-GTLLAYPEVHCRGVVMTIQPKLL----EELIVRTIENFLTERWSEFTHGSNGSTEVS 531
Query: 520 WAEIKGSIRDALGKFLYEQTKRKPVILPVVMEARQPQDLNK 560
W ++ + +L + + + + P++L ++++ P +L++
Sbjct: 532 WNALQKELESSLQRLIKRELQSSPMVL-LMLQTDTPIELDQ 571
emb|CAA14898| (AJ235271) unknown [Rickettsia prowazekii]
Length = 560
Score = 189 bits (475), Expect = 5e-47
Identities = 138/549 (25%), Positives = 245/549 (44%), Gaps = 11/549 (2%)
Query: 2 SNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADR 61
+++ PLGG E G N+ L + + ++D G +A+ GVD +I D +++ +
Sbjct: 11 NDLLFVPLGGSNEIGMNLNLYHYKGKWLMIDCGSGFADDYLPGVDMIIADSSFIEKYKKD 70
Query: 62 VAGIFLTHGHADSIGALPYIVSELKVPVFGSELTIELAKINVKNYADSRKFNDFHVVTED 121
+ G+ LTH H D +G + Y+ + LK P++ + T KI + Y D K H V
Sbjct: 71 LVGLILTHAHEDHLGGVQYLWNSLKCPIYTTTFTANFLKIRLNEY-DFAKNIKIHEVKPG 129
Query: 122 TEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEI 181
++I+ + TH+ PE I+I T+ GNI++TGD++FD G + + L
Sbjct: 130 SKINLEPFSLEMVPLTHSAPEMQAIMIRTDSGNILHTGDWKFDNDPILGKKVDEELLKSY 189
Query: 182 GNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDA 241
G+ GV S SE ++ + + D I V+++ A NL R+ + A
Sbjct: 190 GDEGVLALVCDSTNVFNKGSSGSEGDVRKSLIDIIAGCPQMVVVSTFASNLARLDTIMHA 249
Query: 242 AIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILETGRMGE 301
A GR+V TG + ++I A + D +I ++ ++ EL+++ TG GE
Sbjct: 250 ARLAGRKVVLTGRSLYRMIFAAQESGYFK--DLAPLISERDVSRFRRKELLVIATGCQGE 307
Query: 302 PLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAG-GVMKMLASDL 360
+ + +A H +K+ D ++ E I R+ N KAG VM +
Sbjct: 308 SMAATAKLASNSHPSIKLAPQDTMIFSAKIIPGNEKKIFRLFNVFVKAGVEVMTERDHFV 367
Query: 361 KISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETVS 420
+SGH + +LQ + + RP IP+ GE + H LA + I EH + G V
Sbjct: 368 HVSGHPSIDELQKMYSLIRPNICIPVHGEPVHIHEHVKLAKKNGI--EHAIEVENGSVVL 425
Query: 421 LENGDMIPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTERKIVSK 480
LE + ++ + R+ + E GI +A + I+K + ++S
Sbjct: 426 LEPNNAKVISKVENGYLAVDGNYLLPVESPIFKIRRRMRESGIVVASVVINK--KGLLSA 483
Query: 481 SRVHTR-GFVYVKTSRDLMREAGELVNE--TVDKYLSGKEFDWAEIKGSIRDALGKFLYE 537
+ + + G + K L+ + E T+ K + K ++ SI+ + K L +
Sbjct: 484 NPILSMPGLLDPKEDIALVNLIKNDIKELITIQKQRAKKVLSDEQVIESIKSTIRKTLKQ 543
Query: 538 QTKRKPVIL 546
+ + P I+
Sbjct: 544 EINKSPFII 552
dbj|BAA30170| (AP000004) 450aa long hypothetical protein [Pyrococcus horikoshii]
Length = 450
Score = 157 bits (393), Expect = 2e-37
Identities = 125/452 (27%), Positives = 209/452 (45%), Gaps = 48/452 (10%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKY----------------AEMDQLGVDF 47
IKI LGG E GKNM VE E ++ ++D G++ ++ +LG
Sbjct: 12 IKIYTLGGYEEVGKNMTAVEYEGEVVIIDMGIRLDRVLIHEDVEFQKMSSKDLRKLGA-- 69
Query: 48 VIPDITYLLENADRVAGIFLTHGHADSIGALPYIVSELK-VPVFGSELTIELAKINVKNY 106
IPD + + +V I L+HGH D IGA+ + VP++G+ TI LAK +K
Sbjct: 70 -IPDDRPIRDK--KVVAIALSHGHLDHIGAVGKLAPHYPDVPIYGTPYTIRLAKSEIKGE 126
Query: 107 ADSRKFNDFHVVTEDTEIDFGKAV-------ISFFSTTHTIPESLGIVISTNDGNIVYTG 159
F V E ++G+ V I F TH+IP S +VI T +G +VY
Sbjct: 127 ------EYFEVTNPLYETNYGEIVQVSENLAIEFVQVTHSIPHSSIVVIHTPEGAVVYAC 180
Query: 160 DFRFDPAVAKGYRTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASE---HEIYEKIYDYI 216
D++FD G + + +RL E+G GV ++ SE + E + Y
Sbjct: 181 DYKFDNNHPYGEKPDYKRLKELGKEGVKVLIAESTRVAEETKTPSEAVAKMLLEDFFLYE 240
Query: 217 DDNEGRVIIACNAGNLGRIQQTIDAAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKS 276
+I A ++ R+Q+ I+ A K+GR+ F G + + A +L +++ +
Sbjct: 241 GMEADGLIATTFASHIARLQELIEIANKMGRQAIFIGRSLAKYTGIAKQLGLIKMKGSRV 300
Query: 277 IIKPAEIKK------YADNELVILETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTS 330
+ P + K A +++ TG GEP L MA+ + +D + A
Sbjct: 301 LRSPNAVSKVLKEVSQARENYLLIVTGHQGEPGAILTRMANGELYDIGPRDTVVFSAGVI 360
Query: 331 PSVSYETTIARIENEIYKAGGVMKMLASDLKISGHANARDLQFLLDIFRPKNLIPIQGEY 390
P+ +E ++ K GV + DL +SGHA+ D ++L+ + P+ ++P GE+
Sbjct: 361 PNPLNIAQRYALETKL-KMRGV--RMIKDLHVSGHASKEDHRYLIRMLNPEYIVPAHGEF 417
Query: 391 RELSAHADLAMEMDI-LPEHIFIAKRGETVSL 421
R L+ +A+LA E + + +FI++ G V +
Sbjct: 418 RMLTHYAELAEEEGYRIGKDVFISRNGHIVEI 449
emb|CAB49829.1| (AJ248285) hypothetical protein [Pyrococcus abyssi]
Length = 451
Score = 155 bits (389), Expect = 7e-37
Identities = 123/452 (27%), Positives = 210/452 (46%), Gaps = 48/452 (10%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKY----------------AEMDQLGVDF 47
IKI LGG E GKNM VE ++ ++D G++ ++ +LG
Sbjct: 8 IKIYTLGGYEEVGKNMTAVEYNGEVVIVDMGIRLDRVLIHEDVEFQKMSSKDLRKLGA-- 65
Query: 48 VIPDITYLLENADRVAGIFLTHGHADSIGALPYIVSELK-VPVFGSELTIELAKINVKNY 106
IPD + +V I L+HGH D IGA+ + VP++G+ TI LAK +K
Sbjct: 66 -IPDDRPIRNK--KVVAIALSHGHLDHIGAVGKLAPHYPDVPIYGTPYTIRLAKSEIKGE 122
Query: 107 ADSRKFNDFHVVTEDTEIDFGKAV-------ISFFSTTHTIPESLGIVISTNDGNIVYTG 159
F V E ++G+ V I F TH+IP+S +VI T +G +VY
Sbjct: 123 ------EYFEVTNPLYETNYGEIVQVSENLAIEFVQITHSIPQSSIVVIHTPEGAVVYAC 176
Query: 160 DFRFDPAVAKGYRTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASE---HEIYEKIYDYI 216
D++FD G R + +RL E+G GV ++ SE + E + Y
Sbjct: 177 DYKFDNNHPYGERPDYKRLKELGKEGVKVLIAESTRVAEETKTPSEAVAKMLLEDFFLYE 236
Query: 217 DDNEGRVIIACNAGNLGRIQQTIDAAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKS 276
+I A ++ R+Q+ I+ A K+GR+ F G + + A +L +++ +
Sbjct: 237 GMEADGLIATTFASHIARLQELIEIANKMGRQAIFIGRSLAKYTGIAKQLGLIKMKGSRV 296
Query: 277 IIKPAEIKK------YADNELVILETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTS 330
+ P + K A +++ TG GEP L MA+ + +D + A
Sbjct: 297 LRSPNAVSKVLKEVSQARENYLLIVTGHQGEPGAILTRMANGELYDIGPRDTVVFSAGVI 356
Query: 331 PSVSYETTIARIENEIYKAGGVMKMLASDLKISGHANARDLQFLLDIFRPKNLIPIQGEY 390
P+ +E ++ G ++M+ +L +SGHA+ D ++L+ + P+ ++P GE+
Sbjct: 357 PNPLNVAQRYALETKLRMKG--VRMI-KNLHVSGHASKEDHRYLIRMLNPEYIVPAHGEF 413
Query: 391 RELSAHADLAMEMD-ILPEHIFIAKRGETVSL 421
R L+ +A+LA E ++ + +FI++ G V +
Sbjct: 414 RMLTHYAELAEEEGYMIGKEVFISRNGHVVEI 445
gi|2621085 (AE000797) conserved protein [Methanobacterium thermoautotrophicum]
Length = 450
Score = 137 bits (341), Expect = 3e-31
Identities = 112/455 (24%), Positives = 201/455 (43%), Gaps = 41/455 (9%)
Query: 3 NIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKY-----------AEMDQLGV--DFVI 49
++++ +GG E GKNM V+V + + + D G+ A M L + VI
Sbjct: 2 SVEVIAIGGYEEVGKNMSAVKVGDDVVIFDMGIHLDRVHIHEDTDIARMHSLDLIERGVI 61
Query: 50 PDITYLLENADRVAGIFLTHGHADSIGALPYIVSELKVPVFGSELTIELAKINVKNYADS 109
PD T + + +V I THGH D IGA+ + + P+ + TI L + +K
Sbjct: 62 PDDTLMKDVDGKVRAIVFTHGHLDHIGAVAKLAHRYQAPIIATPYTIALIERTIKAERKF 121
Query: 110 RKFNDFHVVTEDTEIDFGKAV-ISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVA 168
N V+ + + + F +TH+IP+S+ + T +G IVY DF+FD
Sbjct: 122 NVLNTLQVLNAGEKCQISPGITLEFIQSTHSIPQSVIAALHTPEGIIVYALDFKFDDHQK 181
Query: 169 KGYRTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASEHE-----IYEKIYDYIDDNEGRV 223
+ RL E+G GV + + E + E I + + +
Sbjct: 182 ISPPPDYHRLRELGRKGVLAMIVETTRANEKQEVKTHSEKVARIVLEDIMKNPLEEKTGM 241
Query: 224 IIACNAGNLGRIQQTIDAAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSI------ 277
I+ + ++ RIQ D A K R++ G M++ A + L++ + SI
Sbjct: 242 IVTTFSSHMERIQAISDIASKSDRQMLLLGRSMERYCGLAEAMGILKLPENASIYGSPKA 301
Query: 278 ----IKPAEIKKYADNELVILETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSV 333
+ AE K+ + +++ TG GEP L +A+ + ++ +I+ GD V+ +++P +
Sbjct: 302 VNRALARAEAKR---EDYLLITTGHQGEPDALLPRIANAKTQF-RIQRGDNVV-ISAPVI 356
Query: 334 SYETTIAR---IENEIYKAGGVMKMLASDLKISGHANARDLQFLLDIFRPKNLIPIQGEY 390
+A +E + +G + ++ +SGHA D + + + P ++IP G+
Sbjct: 357 PNPMNVANRNLMERRLASSGA---RIYTNAHVSGHAGREDHRDFIRMLNPMHIIPAHGDL 413
Query: 391 RELSAHADLAMEMDI-LPEHIFIAKRGETVSLENG 424
LSA+A++A E L I I + G+ G
Sbjct: 414 SMLSAYAEIAEEEGYKLGNDIHILRNGQAQVFNGG 448
gb|AAF30921.1|AE002149_6 (AE002149) conserved hypothetical ATP/GTP-binding protein
[Ureaplasma urealyticum]
Length = 556
Score = 135 bits (336), Expect = 1e-30
Identities = 123/569 (21%), Positives = 243/569 (42%), Gaps = 36/569 (6%)
Query: 1 MSNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENAD 60
M+ I LGG E GK+ +++EV + IF+ +AG K D GV+ ++ D +YL +NA
Sbjct: 1 MAKINFLSLGGQDERGKSCFVLEVNDDIFIFNAGAKIPTSDVFGVNMIVCDYSYLEKNAK 60
Query: 61 RVAGIFLTHGHADSIGALPYIVSEL--KVPVFGSELTIELAKINVKNYADSRKFNDFHVV 118
RV GIF+ +++ + ++ ++ K+P++ S + + K + +++K +++
Sbjct: 61 RVKGIFIGTPTFNNVMGIKLLLGQVGYKIPIYTSPIGAIVIKKIFEQKVNNKKIEP-NII 119
Query: 119 TEDTEID--FGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMR 176
D D G ++ F ++++P S G V+ T+DG IVY +F K + + +
Sbjct: 120 ELDPISDKKIGSIYVTSFKVSNSMPHSYGFVLKTSDGAIVYVDEFIISNDKNKTFDSQIN 179
Query: 177 RLAEIGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQ 236
L I + +A H+ + + + R+I+ C + + I
Sbjct: 180 LLNNITKNNTLALIVGMGQAGNPYFTAPNHKNKAFYEAALQNTKNRLIVGCYSNDAYSIF 239
Query: 237 QTIDAAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIK--PAEIKKYADNELVIL 294
A + R + I T + KL++ + K++I +EI + +V++
Sbjct: 240 TLATIAKQQNRPFIVYS---NNFINTFIGVLKLKLFNSKNLISLPVSEINNSKNAIIVVI 296
Query: 295 ETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEI------YK 348
E L + H KYV + D ++ + +E A + +E+ YK
Sbjct: 297 E--NQDTLFSKLNKILHNEDKYVNLSSDDQLILGVVITPGFEMLAAELSDEVGRLDIPYK 354
Query: 349 A--GGVMKMLASDLKISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDIL 406
A V+ M SD DL+ L++ +PK LIPI G Y+ + I
Sbjct: 355 ALPKTVLPMTQSD---------EDLKHLINFLQPKFLIPINGLYKTEVKFSSTVTTSWIK 405
Query: 407 PEHIFIAKRGETVSLENGDMIPS-GVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFI 465
+ I GE ++E+ + P +I+ E+ L +R + E+G+
Sbjct: 406 SDQIISVSNGELFTIEDKVLNPKPQIIELEDKYISSFDALDVGANILFERSQMGENGVIN 465
Query: 466 AVITISKTERKIVSKSRVHTRGFVYVKTSRDLMREAGELVNETVDK---YLSGKEFDWAE 522
++ K +K+ + G V + ++E E + + + Y K +
Sbjct: 466 LIVIFDKQFQKLFNYVEFDYCGVV---NNDAQIKEIEETFKKRMGECLVYDERKRLILKD 522
Query: 523 IKGSIRDALGKFLYEQTKRKPVILPVVME 551
K +++ L K ++ ++P++LP V++
Sbjct: 523 TKANLKRLLTKLFEKKFNKRPLVLPTVVD 551
sp|Q58271|Y861_METJA HYPOTHETICAL PROTEIN MJ0861 >gi|2128073|pir||E64407 hypothetical
protein MG423 homolog - Methanococcus jannaschii
>gi|1591546 (U67530) conserved hypothetical protein
[Methanococcus jannaschii]
Length = 448
Score = 134 bits (333), Expect = 2e-30
Identities = 117/449 (26%), Positives = 195/449 (43%), Gaps = 36/449 (8%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLK------YAEMD-------QLGVDFVIP 50
++I +GG E G+NM V V+ +I +LD G++ + + D +L +IP
Sbjct: 3 LEIIAIGGYEEVGRNMTAVNVDGEIIILDMGIRLDRVLIHEDTDISKLHSLELIEKGIIP 62
Query: 51 DITYLLENADRVAGIFLTHGHADSIGALPYIVSELKVPVFGSELTIELAKINVKNYADSR 110
+ T + V I L+HGH D IGA+P + P+ G+ TIEL K + +
Sbjct: 63 NDTVMKNIEGEVKAIVLSHGHLDHIGAVPKLAHRYNAPIIGTPYTIELVKREILSEKKFD 122
Query: 111 KFNDFHVVTEDTEIDFGKAV-ISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAK 169
N V+ ID + + F TH+IP+S+ V+ T G+IVY DF+FD
Sbjct: 123 VRNPLIVLNAGESIDLTPNITLEFIRITHSIPDSVLPVLHTPYGSIVYGNDFKFDNFPVV 182
Query: 170 GYRTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASE---HEIYEKIYDYIDDNEGRVIIA 226
G R + R + ++G +GV ++ E + + D+++ +I+
Sbjct: 183 GERPDYRAIKKVGKNGVLCFISETTRINHEGKTPPEIIASGLLKNDLLAADNDKHGIIVT 242
Query: 227 CNAGNLGRIQQTIDAAIKLGRRVAFTGEDMDQIIETATRL------NKLQIVDKKSIIKP 280
+ ++ RI+ D A K+GR G M + A + L+I S I+
Sbjct: 243 TFSSHIARIKSITDIAEKMGRTPVLLGRSMMRFCGIAQDIGLVKFPEDLRIYGDPSSIEM 302
Query: 281 A--EIKKYADNELVILETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETT 338
A I K + +I+ TG GE L MA + Y K + D V+ P +
Sbjct: 303 ALKNIVKEGKEKYLIIATGHQGEEGAVLSRMATNKTPY-KFEKYDCVVFSADPIPNPMNA 361
Query: 339 IARIENEIYKAGGVMKMLA----SDLKISGHANARDLQFLLDIFRPKNLIPIQGEYRELS 394
R Y +K+L +SGHA D + +L P+++IP G++ +
Sbjct: 362 AQR-----YMLESRLKLLGVRIFKGAHVSGHAAKEDHRDMLRWLNPEHIIPSHGDFNLTA 416
Query: 395 AHADLAMEMDI-LPEHIFIAKRGETVSLE 422
+ LA E L E + + + G+ +S E
Sbjct: 417 EYTKLAEEEGYRLGEDVHLLRNGQCLSFE 445
sp|P47662|Y423_MYCGE HYPOTHETICAL PROTEIN MG423 >gi|1361690|pir||G64246 hypothetical
protein MG423 - Mycoplasma genitalium (SGC3) >gi|3845013
(U39724) conserved hypothetical protein [Mycoplasma
genitalium]
Length = 561
Score = 129 bits (321), Expect = 6e-29
Identities = 122/565 (21%), Positives = 243/565 (42%), Gaps = 21/565 (3%)
Query: 1 MSNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENAD 60
M+ IK LGG E GKN Y++E++ +F+ + G LGV +IPD +++ EN
Sbjct: 1 MAKIKFFALGGQDERGKNCYVLEIDNDVFIFNVGSLTPTTAVLGVKKIIPDFSWIQENQA 60
Query: 61 RVAGIFLTHGHADSIGALPYIVSELK-VPVFGSEL--TIELAKINVKNYADSRKFNDFHV 117
RV GIF+ + +++G+L ++ + P++ S + +I +KIN +R + H
Sbjct: 61 RVKGIFIGNAITENLGSLEFLFHTVGFFPIYTSSIGASIIKSKINENKLNIARDKLEIHE 120
Query: 118 VTEDTEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRF--DPAVAKGYRTNM 175
+ I+ I+ F + ++P S G ++T++G IV+ DF D +A + N
Sbjct: 121 LKPLETIEISNHSITPFKVSSSLPSSFGFALNTDNGYIVFIDDFIVLNDKNIAFENQLN- 179
Query: 176 RRLAEIGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRI 235
+ + ++ ++ + + + +H+ E++ I +GR+ +AC N +
Sbjct: 180 QIIPKLSDNTLLLITGVGLVGRNSGFTTPKHKSLEQLNRIITPAKGRIFVACYDSNAYSV 239
Query: 236 QQTIDAAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNELVILE 295
A R + + T R KL + I EI + N +V+L
Sbjct: 240 MTLAQIARMQNRPFIIYSQSFVHLFNTIVR-QKLFNNTHLNTISIEEINN-STNSIVVL- 296
Query: 296 TGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKAGGVMKM 355
T + L + + ++ + D + +T YE A+I ++I +
Sbjct: 297 TSPPDKLYAKLFKIGMNEDERIRYRKSDTFIFMTPKVAGYEEIEAQILDDIARNEVSYYN 356
Query: 356 LASDLKISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEHIFIAKR 415
L ++ +S A+ D++FL+ +PK +IP G YR+ + + I I
Sbjct: 357 LGREI-LSIQASDEDMKFLVSSLKPKYIIPTGGLYRDFINFTMVLKQAGAEQNQILILFN 415
Query: 416 GETVSLENGDM-IPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVITISKTE 474
GE +++EN + ++ + +R +SE G+ I +I + +
Sbjct: 416 GEVLTIENKKLDSKKNELKLNPKCVDSAGLQEIGASIMFERDQMSESGVVIIIIYFDQKK 475
Query: 475 RKIVSKSR---------VHTRGFVYVKTSRDLMREAGELVNETVDKYLSGKEFDWAEIKG 525
+ +++ V + + K + ++ ++ + T K GKE E+K
Sbjct: 476 SEFLNEITYSFLGVSLDVPEKDKLKTKMEELIKKQINDIKDFTTIKKRIGKEIS-KELKV 534
Query: 526 SIRDALGKFLYEQTKRKPVILPVVM 550
SI+ A+ + T + P+IL ++
Sbjct: 535 SIKRAVMNLFTKMTSKAPLILSTII 559
sp|P75174|Y423_MYCPN HYPOTHETICAL PROTEIN MG423 HOMOLOG >gi|2146459|pir||S73547 MG423
homolog C12_orf561 - Mycoplasma pneumoniae (SGC3) (ATCC
29342) >gi|1673887 (AE000022) Mycoplasma pneumoniae,
MG423 homolog, from M. genitalium [Mycoplasma
pneumoniae]
Length = 561
Score = 111 bits (275), Expect = 2e-23
Identities = 120/571 (21%), Positives = 233/571 (40%), Gaps = 33/571 (5%)
Query: 1 MSNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENAD 60
M+ I GG E GKN +++E+ +F+ + G LGV +IPD +++ EN
Sbjct: 1 MAKINFFAFGGQDERGKNCFVLEINNDVFIFNVGSLTPTTAVLGVKKIIPDFSWIQENQA 60
Query: 61 RVAGIFLTHGHADSIGALPYIVSELK-VPVFGSELTIELAKINVKNYADSRKFN------ 113
R+ GIF+ + ++IG+L ++ + P++ T + + +K K N
Sbjct: 61 RIKGIFIGNPVTENIGSLEFLFHTVGFFPIY----TSTIGAVVIKTKIHENKLNIPHDEL 116
Query: 114 DFHVVTEDTEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRF--DPAVAKGY 171
+ H + + G I+ F + +IP S G + T+DG IVY DF D +A
Sbjct: 117 EIHELKPLETVKIGHHNITPFKVSSSIPSSFGFALHTDDGYIVYVDDFIVLNDKNIAFEN 176
Query: 172 RTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGN 231
+ N + + ++ N + T + +H+ E++ I +GRV AC N
Sbjct: 177 QLN-QIIPQVANKTLLLITGVGLVGRNTGFTTPKHKSLEQLNRIIASAKGRVFAACYDSN 235
Query: 232 LGRIQQTIDAAIKLGRRVAFTGEDMDQIIETATRLNKLQIVDKKSIIKPAEIKKYADNEL 291
+ A R + R KL + I EI + N +
Sbjct: 236 AYSVMTLAQIARMQNRPFVIYSHSFVHLFNAIVR-QKLFNNTHLNTISIEEINN-STNAI 293
Query: 292 VILET--GRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEIYKA 349
V+L ++ L +G R +Y K D + + YE A+I +++ +
Sbjct: 294 VVLTAPPDKLYAKLFKIGTNEDERVRYRKT---DSFIFMIPRIAGYEELEAQILDDVARN 350
Query: 350 GGVMKMLASDLKISGHANARDLQFLLDIFRPKNLIPIQGEYRELSAHADLAMEMDILPEH 409
L ++ +S +A+ D++FL+ +PK +IP G YR+ + + +
Sbjct: 351 EVSYYNLGREI-LSINASDEDMKFLVTSLKPKYIIPTSGLYRDFINFTMVMKQAGVEQSQ 409
Query: 410 IFIAKRGETVSLENGDM-IPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGIFIAVI 468
+ I GE +++ + + ++ + +R +SE G+ +I
Sbjct: 410 VLIPFNGEVLAINHKQIDNKKRELKLNPKCVDSAGLQEIGASIMFERDQMSEAGVVTIII 469
Query: 469 TISKTERKIVSKSRVHTRGF-------VYVKTSRD--LMREAGELVNETVDKYLSGKEFD 519
+ + +++ G V +KT + + ++ ++ + T K GK+
Sbjct: 470 YYDSKKSEFLNEITYSFLGVSLDSNNQVKLKTKMEELIRKQINDIKDFTTIKRRLGKDTS 529
Query: 520 WAEIKGSIRDALGKFLYEQTKRKPVILPVVM 550
E+K SI+ A+ + T + P+IL ++
Sbjct: 530 -KELKVSIKRAVMNLFTKMTAKAPLILSTII 559
gb|AAD25735.1|AF100324_3 (AF100324) unknown [Mycoplasma fermentans]
Length = 550
Score = 107 bits (265), Expect = 2e-22
Identities = 111/564 (19%), Positives = 234/564 (40%), Gaps = 38/564 (6%)
Query: 1 MSNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENAD 60
M++I I LGG+ E GKN Y+ E + I+++++G K GVD +IP YL ++ D
Sbjct: 1 MNHINIFALGGLDENGKNCYVFEFNDDIYIINSGTKIPINSNNGVDTLIPSFEYLEKHKD 60
Query: 61 RVAGIFLTHGHADSIGALPYIVSEL-KVPVFGSELTIELAKINVKNYADSRKFNDFHVVT 119
R+ GIF++ +S ALP+++ ++ + ++ S + + Y K V+
Sbjct: 61 RIKGIFISDVKNESFSALPWLLMKIPNLTIYTSAFNKIMIMDRLSKYKIPTKNYKVMVIN 120
Query: 120 EDTEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDF---------RFDPAVAKG 170
+ T K + ++P +G T DG+I++ +F + K
Sbjct: 121 KVTPFS-DKLFVKPIDLAGSMPGHIGFDFITPDGDILFMFNFVEGDLGIYGKLSFNELKQ 179
Query: 171 YRTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAG 230
T + LA + +SG ++ E ++ NE R+I+
Sbjct: 180 RFTKRKILALVVDSG------KANFGGKAIEKIGLPEGIRDVFLETKKNE-RIIVGAYDE 232
Query: 231 NLGRIQQTIDAAIKLGRRVAFTGEDMDQII----ETATRLNKLQIVDKKSIIKPAEIKKY 286
+ I + ++ A + R V G+ QI + L +++D ++ K
Sbjct: 233 EMVAIHKILELAQETSRPVVTYGKTYGQIFYLIKKAHPELKLPELIDYRNANKV------ 286
Query: 287 ADNELVILETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIARIENEI 346
VIL TG + ++K++ D V+ + P E+ A +EI
Sbjct: 287 --KNAVILVTGATERLYSRFIRITDNNDVFLKLQKSDTVVMIAPPINGLESLEALTLDEI 344
Query: 347 YKAGGVMKMLASDLKISGHANARDLQFLLDIFRPKNLIPIQGEYRELS-AHADLAMEMDI 405
+ + + ++ +DL L++ P+ +IP+QG YR L+ A ++ +
Sbjct: 345 ARITPKISDVNANEFYRCRPAKQDLIDLVNALNPEYVIPVQGLYRYLNEATIAISENTQV 404
Query: 406 LPEHIFIAKRGETVSLENGDM--IPSGVIQAENXXXXXXXXXXXXXXXLRDRKVLSEDGI 463
H + + G+ V +G M V + + + +R+ L +G+
Sbjct: 405 KQNHCLVMQNGKIVHFIDGKMSTTKGKVKEIGDTIIDGFGVGDISTEVISEREALGREGV 464
Query: 464 FIAVITISKTERKIVSKSRVHTRGFVYVKTSRDLMREAGELVNETVDKYLSGKEFDW-AE 522
+ + + + K +++ G + +++ EA +++ T+ K + + F+ +
Sbjct: 465 ILVSSQYNPKTKLLTGKLQINFVGVI----NKEEKNEASDIIKSTIVKLIETESFNGIKD 520
Query: 523 IKGSIRDALGKFLYEQTKRKPVIL 546
+ R A+ K +Y+ +++P+++
Sbjct: 521 FQNEARHAIRKRIYKTFEKEPIVI 544
emb|CAB49663.1| (AJ248285) mRNA 3'-end processing factor, putative [Pyrococcus
abyssi]
Length = 651
Score = 57.8 bits (137), Expect = 2e-07
Identities = 49/185 (26%), Positives = 83/185 (44%), Gaps = 24/185 (12%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMD---QLGVDFVIPDITYLLENAD 60
I+IT LGG RE G++ LV+ +E ++D G+ A ++ + F P+ Y+L+
Sbjct: 189 IRITGLGGFREVGRSALLVQTDESYVLVDFGVNVAALNDPYKAFPHFDAPEFQYVLKEG- 247
Query: 61 RVAGIFLTHGHADSIGALPYI--VSELKVPVFGSELTIELAKINVKNYADSRKFND---- 114
+ I +TH H D G LPY+ + P++ + T +L + K++ + ++ N
Sbjct: 248 LLDAIIITHAHLDHSGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFIEIQQSNGQDPL 307
Query: 115 ------FHVVTEDTEIDFGKA---VISFFSTTHTIPESLG-----IVISTNDGNIVYTGD 160
V+ +D+G+ T H LG + I NI TGD
Sbjct: 308 YRPRDIKEVIKHTITLDYGEVRDISPDIRLTLHNAGHILGSAIVHLHIGNGLHNIAITGD 367
Query: 161 FRFDP 165
F+F P
Sbjct: 368 FKFIP 372
dbj|BAA30510| (AP000006) 651aa long hypothetical protein [Pyrococcus horikoshii]
Length = 651
Score = 57.8 bits (137), Expect = 2e-07
Identities = 49/185 (26%), Positives = 82/185 (43%), Gaps = 24/185 (12%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMD---QLGVDFVIPDITYLLENAD 60
I+IT LGG RE G++ LV+ +E ++D G+ A ++ + F P+ Y+L
Sbjct: 189 IRITGLGGFREVGRSALLVQTDESFVLVDFGVNVAMLNDPYKAFPHFDAPEFQYVLREG- 247
Query: 61 RVAGIFLTHGHADSIGALPYI--VSELKVPVFGSELTIELAKINVKNYADSRKFND---- 114
+ I +TH H D G LPY+ + P++ + T +L + K++ + ++ N
Sbjct: 248 LLDAIIITHAHLDHCGMLPYLFRYNLFDGPIYTTPPTRDLMVLLQKDFIEIQQSNGQDPL 307
Query: 115 ------FHVVTEDTEIDFGKA---VISFFSTTHTIPESLG-----IVISTNDGNIVYTGD 160
V+ +D+G+ T H LG + I NI TGD
Sbjct: 308 YRPRDIKEVIKHTITLDYGEVRDISPDIRLTLHNAGHILGSAIVHLHIGNGLHNIAITGD 367
Query: 161 FRFDP 165
F+F P
Sbjct: 368 FKFIP 372
dbj|BAA79093.1| (AP000058) 420aa long hypothetical cleavage and polyadenylation
specificity factor subunit [Aeropyrum pernix]
Length = 420
Score = 55.0 bits (130), Expect = 2e-06
Identities = 59/238 (24%), Positives = 96/238 (39%), Gaps = 30/238 (12%)
Query: 1 MSNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENAD 60
M+ I+I LG RE G+ LVE + +LD G+ + E D+ P + D
Sbjct: 1 MARIRI--LGSGREVGRAAILVESGGRGLLLDYGVNFDENDR-------PVFPGDVRPRD 51
Query: 61 RVAGIFLTHGHADSIGALPYIVSELKVPVFGSELTIELAKINVKNYA---------DSRK 111
+ G+ LTH H D IGA PY+ VFG+ +T+ ++++ + + D R
Sbjct: 52 -LDGLVLTHSHLDHIGAAPYLYVSQGPKVFGTRVTLHVSRLLLYDMIKLNGAYLPYDERS 110
Query: 112 FNDF----HVVTEDTEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAV 167
D + E + G+ F + H IP S +++ + I+YT D V
Sbjct: 111 VEDMLGTAEYIDYGREYEAGRFAFKTFYSGH-IPGSTAVLVEVDGRRILYTSDVN----V 165
Query: 168 AKGYRTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVII 225
+ RL E + V +S SE Y + D + G V++
Sbjct: 166 IETKLVGPARL-EGAKADVVIVESTYGDSDHPPRSVSEERFYNAVMDVVSQG-GTVLV 221
dbj|BAA11296| (D78193) yycJ [Bacillus subtilis] >gi|2636584|emb|CAB16074|
(Z99124) yycJ [Bacillus subtilis]
Length = 268
Score = 54.7 bits (129), Expect = 2e-06
Identities = 35/127 (27%), Positives = 58/127 (45%), Gaps = 12/127 (9%)
Query: 18 NMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAGIFLTHGHADSIGA 77
N + +E E+ F++DAGL MD L + + D V GIF+TH H+D I
Sbjct: 15 NAFYLETEDHAFLVDAGLSGKAMDGL--------MAQIGRKLDDVDGIFVTHEHSDHIKG 66
Query: 78 LPYIVSELKVPVFGSELTIELAKINVKNYADSRKFNDFHVVTEDTEIDFGKAVISFFSTT 137
L + + K+P++ +E T + + + +KF V +T FG + F +
Sbjct: 67 LGVVARKYKLPIYANEKTWKAMENQIGKIDTDQKF----VFPMETVKSFGGLDVESFGVS 122
Query: 138 HTIPESL 144
H E +
Sbjct: 123 HDAAEPM 129
sp|Q60355|Y047_METJA HYPOTHETICAL PROTEIN MJ0047 >gi|2826239 (U67462) putative mRNA
3'-end processing factor 1 [Methanococcus jannaschii]
Length = 428
Score = 50.4 bits (118), Expect = 4e-05
Identities = 49/194 (25%), Positives = 81/194 (41%), Gaps = 57/194 (29%)
Query: 10 GGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAGIFLTH 69
G E G++ ++ ++ +LD G+K LG + P + + + D+V F++H
Sbjct: 7 GAALEVGRSCIEIKTDKSKILLDCGVK------LGKEIEYPILDNSIRDVDKV---FISH 57
Query: 70 GHADSIGALPYIV-SELKVPVFGSELTIELAKINVK------------------------ 104
H D GALP + ++ VPV +EL+ +L K+ +K
Sbjct: 58 AHLDHSGALPVLFHRKMDVPVITTELSKKLIKVLLKDMVKIAETENKKIPYNNHDVKEAI 117
Query: 105 ------NYADSRKFNDFHVVTEDTEIDFGKAVISFFSTTHTIPESLGIVIS-TNDGNIVY 157
NY D + + DF FS H IP S I+++ N+ I+Y
Sbjct: 118 RHTIPLNYNDKKYYKDFS--------------YELFSAGH-IPGSASILLNYQNNKTILY 162
Query: 158 TGDFRF-DPAVAKG 170
TGD + D + KG
Sbjct: 163 TGDVKLRDTRLTKG 176
gi|2650146 (AE001071) mRNA 3'-end processing factor, putative [Archaeoglobus
fulgidus]
Length = 632
Score = 50.4 bits (118), Expect = 4e-05
Identities = 39/180 (21%), Positives = 80/180 (43%), Gaps = 24/180 (13%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVA 63
+++T LGG RE G++ YL++ E ++D G+ + + +V P++ L D +
Sbjct: 178 VRVTFLGGSREVGRSCYLLQTPESRILIDCGVNVSNLSSTPYLYV-PEVQPL----DALD 232
Query: 64 GIFLTHGHADSIGALPYIVS-------ELKVPVFGSELTIELAKINVKNYADSRKFNDFH 116
+ +TH H D G +P + L P + ++L + V + +
Sbjct: 233 AVVITHAHLDHCGLVPLLYKFGYRGPIYLTPPTRDLMVLLQLDFLEVAGREGTNPPYSSN 292
Query: 117 VVTEDTE----IDFGKAV-------ISFFSTTHTIPESLG-IVISTNDGNIVYTGDFRFD 164
++ E + +D+G ++F++ H + ++ I NI +TGDF+F+
Sbjct: 293 LIREALKHTITLDYGVVTDISPDVRLTFYNAGHILGSAIAHFHIGEGHYNIAFTGDFKFE 352
pir||G64305 hypothetical protein YLR277c homolog - Methanococcus jannaschii
Length = 435
Score = 50.4 bits (118), Expect = 4e-05
Identities = 49/194 (25%), Positives = 81/194 (41%), Gaps = 57/194 (29%)
Query: 10 GGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAGIFLTH 69
G E G++ ++ ++ +LD G+K LG + P + + + D+V F++H
Sbjct: 14 GAALEVGRSCIEIKTDKSKILLDCGVK------LGKEIEYPILDNSIRDVDKV---FISH 64
Query: 70 GHADSIGALPYIV-SELKVPVFGSELTIELAKINVK------------------------ 104
H D GALP + ++ VPV +EL+ +L K+ +K
Sbjct: 65 AHLDHSGALPVLFHRKMDVPVITTELSKKLIKVLLKDMVKIAETENKKIPYNNHDVKEAI 124
Query: 105 ------NYADSRKFNDFHVVTEDTEIDFGKAVISFFSTTHTIPESLGIVIS-TNDGNIVY 157
NY D + + DF FS H IP S I+++ N+ I+Y
Sbjct: 125 RHTIPLNYNDKKYYKDFS--------------YELFSAGH-IPGSASILLNYQNNKTILY 169
Query: 158 TGDFRF-DPAVAKG 170
TGD + D + KG
Sbjct: 170 TGDVKLRDTRLTKG 183
gi|2622312 (AE000888) cleavage and polyadenylation specificity factor
[Methanobacterium thermoautotrophicum]
Length = 636
Score = 49.6 bits (116), Expect = 7e-05
Identities = 44/179 (24%), Positives = 80/179 (44%), Gaps = 22/179 (12%)
Query: 5 KITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAG 64
++T +GG RE G++ ++ +LD G+ A D + + L++ D V
Sbjct: 181 RLTAMGGFREVGRSCLYLQTPNSRVLLDCGVNVAGGDDKNSYPYLNVPEFTLDSLDAV-- 238
Query: 65 IFLTHGHADSIGALPYIVS-ELKVPVFGSELT------IELAKINVKNYADS-RKFNDFH 116
+TH H D G LPY+ PV+ + T ++L I++ + D FN H
Sbjct: 239 -IITHAHLDHSGFLPYLYHYGYDGPVYCTAPTRDLMTLLQLDHIDIAHREDEPLPFNVKH 297
Query: 117 V---VTEDTEIDFGKAV-------ISFFSTTHTIPESLG-IVISTNDGNIVYTGDFRFD 164
V V +D+G+ ++ + H + ++ + I N+VYTGDF+++
Sbjct: 298 VKKSVKHTITLDYGEVTDIAPDIRLTLHNAGHILGSAMAHLHIGDGQHNMVYTGDFKYE 356
emb|CAB57542.1| (Y18930) mRNA 3'-end polyadenylation factor [Sulfolobus
solfataricus]
Length = 639
Score = 49.2 bits (115), Expect = 9e-05
Identities = 43/183 (23%), Positives = 77/183 (41%), Gaps = 29/183 (15%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVA 63
++IT LGG E G++ LVE E +LD GL + G + + P + + +
Sbjct: 183 VRITALGGFLEVGRSAVLVETPESKVLLDVGLN-PSANMFG-EKLFPKLDIDQLKMEELD 240
Query: 64 GIFLTHGHADSIGALPYIVS-------ELKVPVFGSELTIELAKINVKN--------YAD 108
+ +TH H D G +P++ VP ++L ++V A
Sbjct: 241 AVVITHAHLDHCGMVPFLFKYGYEGPVYTTVPTRDIMALMQLDSLDVAEKEGKPIPYSAK 300
Query: 109 SRKFNDFHVVTEDTEIDFGKAV-------ISFFSTTHTIPESLG-IVISTNDGNIVYTGD 160
+ H +T +D+G+ ++F++ H + + + I NIVYTGD
Sbjct: 301 EVRKELLHTIT----LDYGEVTDIAPDIRLTFYNAGHILGSGMAHLHIGDGKHNIVYTGD 356
Query: 161 FRF 163
F++
Sbjct: 357 FKY 359
dbj|BAA29553| (AP000002) 514aa long hypothetical protein [Pyrococcus horikoshii]
Length = 514
Score = 48.8 bits (114), Expect = 1e-04
Identities = 72/323 (22%), Positives = 119/323 (36%), Gaps = 61/323 (18%)
Query: 131 ISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFDPAVAKGYRTNMRRLAEIGNSGVXXXX 190
+ F+ H+I + +I D ++ YTGDFR KG T R+ + S
Sbjct: 215 VKAFNVDHSIYGATAYIIE-GDVSLAYTGDFRLHGR--KGDET--RKFIKAAKSSSILIT 269
Query: 191 XXXXXXXXTMQSASEHEIYEKIYDYIDDNEGRVIIACNAGNLGRIQQTIDAAIKLGRRVA 250
+ SE E+YE +++ +G VI + N R++ A GR +
Sbjct: 270 EGTRVGRDEHGNVSEQEVYENALKIVEEAKGLVIADFSPRNFERLEIFKKIAENTGRELV 329
Query: 251 FTGEDMD-----QIIETATRLNKLQIV-------DK-----------KSIIKPAEIKKYA 287
T +D Q+++ RL L+I DK + I P EI+K
Sbjct: 330 ITAKDAYFLHALQLVDNVNRLKDLRIYGNLKATQDKWESIVVWGNYAEQYISPFEIRKNQ 389
Query: 288 DNELVILETGRMGEPLKSLGDMAHRRHKYVKIKDGDLVLAVTSPSVSYETTIA--RIENE 345
+N ++ S DM H + DG + +S + E + R+ N
Sbjct: 390 ENYILCF----------SFYDMPHLLD---IMPDGGTYIYSSSEAFGEEQVFSFLRLWNW 436
Query: 346 IYKAGGVMKMLASD----------LKISGHANARDLQFLLDIFRPKNLIPIQGEYRELSA 395
+ G + D SGH + +L+ ++D P +IP+ E EL
Sbjct: 437 LQYFGFEVHGFRVDKYGKPIFEKGFHASGHISREELRKVIDEIDPDYVIPVHTEKPELFK 496
Query: 396 HADLAMEMDILPEHIFIAKRGET 418
A E + + K GE+
Sbjct: 497 WA--------FGERVILLKNGES 511
sp|Q58633|YC36_METJA HYPOTHETICAL PROTEIN MJ1236 >gi|2128070|pir||C64454 hypothetical
protein L9328.4 homolog - Methanococcus jannaschii
>gi|1591868 (U67564) putative mRNA 3'-end processing
factor 2 [Methanococcus jannaschii]
Length = 634
Score = 48.0 bits (112), Expect = 2e-04
Identities = 46/180 (25%), Positives = 78/180 (42%), Gaps = 24/180 (13%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVA 63
I+++ LGG RE G++ V+ + ++D G+ A D+ F P+ + +E+ D
Sbjct: 180 IRVSFLGGAREVGRSCLYVQTPDTRVLIDCGINVACEDKAFPHFDAPE--FSIEDLD--- 234
Query: 64 GIFLTHGHADSIGALPYIVS-ELKVPVFGSELTIELAKINVKNYADSRKFNDFHV----- 117
+ +TH H D G +P + PV+ + T +L + K+Y + K V
Sbjct: 235 AVIVTHAHLDHCGFIPGLFRYGYDGPVYCTRPTRDLMTLLQKDYLEIAKKEGKEVPYTSK 294
Query: 118 -----VTEDTEIDFGKAV---ISFFSTTHTIPESLGIVIS---TNDG--NIVYTGDFRFD 164
V ID+G + T H LG I+ +G N+ YTGD +F+
Sbjct: 295 DIKTCVKHTIPIDYGVTTDISPTIKLTLHNAGHVLGSAIAHLHIGEGLYNLAYTGDIKFE 354
emb|CAB54223.1| (Z48334) F10B5.8 [Caenorhabditis elegans]
Length = 474
Score = 44.9 bits (104), Expect = 0.002
Identities = 94/456 (20%), Positives = 178/456 (38%), Gaps = 55/456 (12%)
Query: 4 IKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLEN---AD 60
IKI PLG ++ G++ L+ + + ++D G+ D D PD +Y+ D
Sbjct: 8 IKIVPLGAGQDVGRSCILITIGGKNIMVDCGMHMGYQD----DRRFPDFSYIGGGGRLTD 63
Query: 61 RVAGIFLTHGHADSIGALPYI--VSELKVPVFGSELTIELAKINVKNY----ADSRKFND 114
+ + ++H H D G+LP++ + P++ + T + + +++Y D + +
Sbjct: 64 YLDCVIISHFHLDHCGSLPHMSEIVGYDGPIYMTYPTKAICPVLLEDYRKVQCDIKGETN 123
Query: 115 FHVVTEDTEIDFGKAV---------------ISFFSTTHTIPESLGIVISTNDGNIVYTG 159
F ++D + K V I F H + ++ I D +++YTG
Sbjct: 124 F-FTSDDIKNCMKKVVGCALHEIIHVDNELSIRAFYAGHVLGAAM-FEIRLGDHSVLYTG 181
Query: 160 DFRFDPAVAKGYRTNMRRLAEIGNSGVXXXXXXXXXXXXTMQSASEHEIYEKIYDYIDDN 219
D+ P G R+ V + A E + K+++ +
Sbjct: 182 DYNMTPDRHLG----AARVLPGVRPTVLISESTYATTIRDSKRARERDFLRKVHECVMKG 237
Query: 220 EGRVIIACNAGNLGRIQQTIDAAIKLGRRVAFTGE--DMDQIIETATRLNKLQIVDKKSI 277
G+VII A LGR Q+ R+A + E A + +L I
Sbjct: 238 -GKVIIPVFA--LGRAQELCILLESYWERMALNVPIYFSQGLAERANQYYRLFISWTNEN 294
Query: 278 IKPAEIKK--YADNELVILETGRMGEP----LKSLGDMAH--RRHKYVKIKDGDLVLAVT 329
IK +++ + + +E G +P L S M H + K K D + +
Sbjct: 295 IKKTFVERNMFEFKHIKPMEKGCEDQPGPQVLFSTPGMLHGGQSLKVFKKWCSDPLNMII 354
Query: 330 SPSVSYETTI-ARIEN-----EIYKAGGVMKMLASDLKISGHANARDLQFLLDIFRPKNL 383
P T+ AR+ N EI + +++ + S HA+A+ + L+ P+++
Sbjct: 355 MPGYCVAGTVGARVINGEKKIEIDQKMHEIRLGVEYMSFSAHADAKGIMQLIRQCEPQHV 414
Query: 384 IPIQGEYRELSAHADLAMEMDILPEHIFIAKRGETV 419
+ + GE ++ + +P H + GETV
Sbjct: 415 MFVHGEASKMEFLKGKVEKEYKVPVH--MPANGETV 448
gi|2650088 (AE001067) mRNA 3'-end processing factor, putative [Archaeoglobus
fulgidus]
Length = 407
Score = 44.9 bits (104), Expect = 0.002
Identities = 44/167 (26%), Positives = 77/167 (45%), Gaps = 29/167 (17%)
Query: 9 LGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADRVAGIFLT 68
LGG RE G++ +V+ ++D G+K ++ + ++ + P + L+
Sbjct: 7 LGGCREVGRSAVMVDG----IMIDYGVKPSDPPEFPLNGLSP------------RAVILS 50
Query: 69 HGHADSIGALP---YIVSELKVPVFGSELTIELAKINVK-------NYADSRKF-NDFHV 117
HGH D IG P Y E+ + EL++ L + ++K + R+F ++
Sbjct: 51 HGHLDHIGVAPNLMYYDPEVILTPPSHELSMILLRDSMKIMHPPPFTKRELRQFESNIRE 110
Query: 118 VTEDTEIDFGKAVISFFSTTHTIPESLGIVISTNDGNIVYTGDFRFD 164
V + I G + FF+ H IP S I + D NI+Y+GD R +
Sbjct: 111 VEYEEPITVGDYEVEFFNAGH-IPGSASIHM-RGDVNILYSGDIRLE 155
sp|Q10568|CPSB_BOVIN CLEAVAGE AND POLYADENYLATION SPECIFICITY FACTOR, 100 KD SUBUNIT
(CPSF 100 KD SUBUNIT) >gi|1363022|pir||A56351 cleavage
and polyadenylation specificity factor 100K chain -
bovine >gi|599683|emb|CAA53535| (X75931) Cleavage and
Polyadenylation specificity factor (CPSF) 100kD subunit
[Bos taurus]
Length = 782
Score = 43.0 bits (99), Expect = 0.007
Identities = 46/182 (25%), Positives = 79/182 (43%), Gaps = 30/182 (16%)
Query: 2 SNIKITPLGGVREFGKNMYLVEVEEQIFVLDAGLKYAEMDQLGVDFVIPDITYLLENADR 61
S IK+T L GV+E YL++V+E F+LD G D+ F + I L ++ +
Sbjct: 3 SIIKLTTLSGVQEESALCYLLQVDEFRFLLDCG-----WDE---HFSMDIIDSLRKHVHQ 54
Query: 62 VAGIFLTHGHADSIGALPYIVSE--LKVPVFGSELTIELAKINVKNYADSR-KFNDFHVV 118
+ + L+H +GALPY V + L ++ + ++ ++ + + SR DF +
Sbjct: 55 IDAVLLSHPDPLHLGALPYAVGKLGLNCAIYATIPVYKMGQMFMYDLYQSRHNTEDFTLF 114
Query: 119 T-EDTEIDFGKAVISFFS------------------TTHTIPESLGIVISTNDGNIVYTG 159
T +D + F K FS H I ++ ++ + IVY
Sbjct: 115 TLDDVDAAFDKIQQLKFSQIVNLKGKGHGLSITPLPAGHMIGGTIWKIVKDGEEEIVYAV 174
Query: 160 DF 161
DF
Sbjct: 175 DF 176
Posted date: Mar 2, 2000 12:24 PM
Number of letters in database: 140,871,481
Number of sequences in database: 457,798
Lambda K H
0.318 0.137 0.380
Gapped
Lambda K H
0.270 0.0470 0.230
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 139722682
Number of Sequences: 457798
Number of extensions: 5573767
Number of successful extensions: 13240
Number of sequences better than 1.0e-02: 36
Number of HSP's better than 0.0 without gapping: 22
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 13107
Number of HSP's gapped (non-prelim): 39
length of query: 570
length of database: 140,871,481
effective HSP length: 59
effective length of query: 511
effective length of database: 113,861,399
effective search space: 58183174889
effective search space used: 58183174889
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 41 (21.7 bits)
S2: 98 (42.6 bits)