BLASTP 2.0.11 [Jan-20-2000]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= yidG (838 kb reverse) BIO15.01
(185 letters)
457,798 sequences; 140,871,481 total letters
Searching...................................................done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAA63514| (X92946) hypothetical protein [Lactococcus la... 370 e-102
sp|Q50239|YI6A_MYCMY INSERTION ELEMENT IS1296 HYPOTHETICAL ... 64 8e-10
sp|Q48585|YI3A_LACJO INSERTION ELEMENT IS1223 HYPOTHETICAL ... 63 1e-09
sp|Q47309|YI9A_ECOLI INSERTION ELEMENT IS1397 HYPOTHETICAL ... 60 1e-08
sp|Q57066|YH20_HAEIN HYPOTHETICAL PROTEIN HI1720 >gi|107491... 57 7e-08
emb|CAA71078| (Y09946) first of two overlapping orfs with s... 53 1e-06
pir||B30868 hypothetical protein 1 (insertion sequence IS86... 51 6e-06
gi|790865 (L36381) orfA; putative [Neisseria gonorrhoeae] 49 3e-05
sp|P19768|YI5A_ECOLI INSERTION ELEMENT IS150 HYPOTHETICAL 1... 44 7e-04
emb|CAA63514| (X92946) hypothetical protein [Lactococcus lactis]
Length = 185
Score = 370 bits (939), Expect = e-102
Identities = 184/185 (99%), Positives = 185/185 (99%)
Query: 1 MVKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNKN 60
MVKYSIELKQR+IQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNKN
Sbjct: 1 MVKYSIELKQRVIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNKN 60
Query: 61 YSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKL 120
YSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKL
Sbjct: 61 YSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKL 120
Query: 121 ESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNKNQDSSTVSVKPS 180
ESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNKNQDSSTVSVKPS
Sbjct: 121 ESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNKNQDSSTVSVKPS 180
Query: 181 NSKKS 185
NSKKS
Sbjct: 181 NSKKS 185
sp|Q50239|YI6A_MYCMY INSERTION ELEMENT IS1296 HYPOTHETICAL 21.4 KD PROTEIN (ORFA)
>gi|1679640 (U61140) ORFA [Mycoplasma mycoides mycoides
SC]
Length = 180
Score = 64.0 bits (153), Expect = 8e-10
Identities = 50/171 (29%), Positives = 85/171 (49%), Gaps = 14/171 (8%)
Query: 1 MVKYSIELKQRIIQDYLS-GKGGSTYLAKLHNVGSSS--QVRHWIRNYRAEGLPTAHSKV 57
M K ++E K +I+++ STYLA +++ + + + +R EGL K
Sbjct: 1 MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFRIEGLINKEKK- 59
Query: 58 NKNYSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRR 117
YS +LK V L T+ +Y+ VA+KF I + +A WV ++ YG + ++ GR
Sbjct: 60 -PYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWVKKYREYGFLGLNNNIGRP 118
Query: 118 KKL--------ESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRL 160
KK+ I S + ND Q+IKEL++++ Y ++E + K L
Sbjct: 119 KKIMKNPNKKPAKIKKSQVKINND-QQIKELKEQVEYYKLEAEFWKKFHTL 168
sp|Q48585|YI3A_LACJO INSERTION ELEMENT IS1223 HYPOTHETICAL 20.7 KD PROTEIN (ORFA)
>gi|495495 (U09558) ORFA, putative Helix-Turn-Helix
motif from amino acid 21 through 42 and from amino acid
78 through 99. [Lactobacillus johnsonii]
Length = 177
Score = 63.2 bits (151), Expect = 1e-09
Identities = 48/163 (29%), Positives = 78/163 (47%), Gaps = 10/163 (6%)
Query: 1 MVKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNK- 59
M KYS ELK I+ YL+ + LAK +N+ + +R W+ + +GL K K
Sbjct: 1 MTKYSTELKIEIVSKYLNHEDSIKGLAKQYNI-HWTLIRRWVDKAKCQGLAALSVKHTKT 59
Query: 60 NYSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVP-ISKKRGRRK 118
YS + K N V+ YLT + VA KF I++ + + +W F G + K++GR +
Sbjct: 60 TYSSDFKLNVVRYYLTHSIGVSKVAAKFNISD-SQVYNWAKKFNEEGYAGLLPKQKGRPR 118
Query: 119 KLESIASSMT------QNPNDSQRIKELEQELRYAQIEVAYLK 155
K+ + T + ++I + E EL ++E LK
Sbjct: 119 KVPKKSKKTTKKLELSEKQKYEEKILKQEAELERLRVENLVLK 161
sp|Q47309|YI9A_ECOLI INSERTION ELEMENT IS1397 HYPOTHETICAL 20.1 KD PROTEIN (ORFA)
>gi|1064899|emb|CAA63546| (X92970) orfA [Escherichia
coli]
Length = 173
Score = 59.7 bits (142), Expect = 1e-08
Identities = 45/170 (26%), Positives = 77/170 (44%), Gaps = 7/170 (4%)
Query: 2 VKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNKNY 61
+K+S E+K + YL+G G AKL + +S + HWI + G + ++Y
Sbjct: 1 MKHSFEVKLAAVNHYLAGHAGIISTAKLFQLSHTS-LSHWINLFLLHGPRALDCRHKRSY 59
Query: 62 SMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKLE 121
S E K V L + VA +F I + + +W+ ++ G + RK+
Sbjct: 60 SPEDKLCVVLYALGHSESLPRVAARFNIPSHNTVKNWIKGYRKSGNEAFIR---CRKEKS 116
Query: 122 SIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNKNQD 171
S T + +E++ ELRY + E AYLK ++++ L K Q+
Sbjct: 117 MTRSDDTHENEANMTPEEMKNELRYLRAENAYLKA---MQEHLLEKKRQE 163
sp|Q57066|YH20_HAEIN HYPOTHETICAL PROTEIN HI1720 >gi|1074910|pir||D64176 hypothetical
protein HI1720 - Haemophilus influenzae (strain Rd KW20)
>gi|1574576 (U32845) conserved hypothetical protein
[Haemophilus influenzae Rd]
Length = 188
Score = 57.4 bits (136), Expect = 7e-08
Identities = 48/172 (27%), Positives = 73/172 (41%), Gaps = 10/172 (5%)
Query: 1 MVKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLP-TAHSKVNK 59
M KY+ KQ++I+ YL S+ L + H + + + WI + G+ A +
Sbjct: 20 MTKYNFLFKQQVIEFYLQNDKNSS-LTRRHFQLAETTLERWINQFNHSGINGLALLGKKR 78
Query: 60 NYSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFK---IYGEVPISKKRGR 116
NYS E K N +Q + EA F I N +++ W+ F+ I G +P K R
Sbjct: 79 NYSPEFKLNVIQAVKNGKFSAEAACLHFGIANSGVVSQWLQAFEKQGINGLIPKPKGRPT 138
Query: 117 RKKLESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNK 168
K P R +ELE E + E A LK L+ L + + K
Sbjct: 139 MK-----LQYPKMPPKPKTREEELELENLRLRAENAILKKLQELNQQKMQKK 185
emb|CAA71078| (Y09946) first of two overlapping orfs with similarity to IS3 genes
[Bacillus thuringiensis]
Length = 185
Score = 53.5 bits (126), Expect = 1e-06
Identities = 34/97 (35%), Positives = 54/97 (55%), Gaps = 9/97 (9%)
Query: 61 YSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKL 120
YS ELK AVQ Y+ + +Y + K++I + T L +WV +K GE I+ RG+ +
Sbjct: 90 YSEELKLAAVQSYVNGEGSYNMIKEKYQIISSTQLKNWVKKYKESGE--ITDTRGKNGGM 147
Query: 121 ESIASSMTQNPNDSQRI--KELEQELRYAQIEVAYLK 155
+ I+ NP +R+ K +E+E Y + +V YLK
Sbjct: 148 KGIS-----NPLKGKRVHFKTIEEERDYYKAQVEYLK 179
Score = 41.8 bits (96), Expect = 0.004
Identities = 15/48 (31%), Positives = 31/48 (64%)
Query: 2 VKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEG 49
+ YS ELK +Q Y++G+G + + + + SS+Q+++W++ Y+ G
Sbjct: 88 ITYSEELKLAAVQSYVNGEGSYNMIKEKYQIISSTQLKNWVKKYKESG 135
pir||B30868 hypothetical protein 1 (insertion sequence IS861) - Streptococcus
agalactiae >gi|1196921|gb|AAA88574.1| (M22449) unknown
protein [Streptococcus agalactiae]
Length = 141
Score = 51.2 bits (120), Expect = 6e-06
Identities = 33/115 (28%), Positives = 58/115 (49%), Gaps = 5/115 (4%)
Query: 58 NKNYSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRR 117
N+ Y ELK+ + L + +V+ + ++N ++L +W++ FK G + K RGR
Sbjct: 18 NEYYPPELKQEMIDKVLIHGCSQLSVSLDYALSNCSILTNWLSQFKKNGYTIVEKTRGRP 77
Query: 118 KKLESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLR--RLEKNALMNKNQ 170
K+ + + +R++E + LR E A+LK LR RL AL ++ Q
Sbjct: 78 SKMGRKRKKTWEEMTELERLQEENERLR---TENAFLKKLRDLRLRDEALQSERQ 129
gi|790865 (L36381) orfA; putative [Neisseria gonorrhoeae]
Length = 172
Score = 48.8 bits (114), Expect = 3e-05
Identities = 37/141 (26%), Positives = 65/141 (45%), Gaps = 3/141 (2%)
Query: 34 SSSQVRHWIRNYRAEGLP-TAHSKVNKNYSMELKENAVQCYLTTDLTYEAVARKFEITNF 92
S S VR W+ YR G K YS+E K A++ ++ +A A + +
Sbjct: 32 SDSLVRKWLARYRLHGESGIKRRKHTTKYSVEYKLEAIRPVTGQGMSQKAAADQLNLPEC 91
Query: 93 TLLASWVNHFKIYGEVPISKKRGRRKKLESIASSMTQNPNDSQRIKELEQELRYAQIEVA 152
++L W+ +++ G + K RK ++ T+ + + +EL EL Y + E A
Sbjct: 92 SVLPQWLRLYRLNGINGLKPKPKGRKPVKKQYPPQTKKADYLKTKEELFAELAYLKAEAA 151
Query: 153 YLKGLRRLEKNALMNKNQDSS 173
LK L L++ + K ++SS
Sbjct: 152 VLKKLDALKE--VRQKERNSS 170
sp|P19768|YI5A_ECOLI INSERTION ELEMENT IS150 HYPOTHETICAL 19.7 KD PROTEIN (ORFA)
>gi|1073485|pir||S47778 hypothetical protein A
(insertion sequence IS150) - Escherichia coli
>gi|48676|emb|CAA30085| (X07037) ORF A [Escherichia
coli] >gi|466695 (U00039) orfA in IS150 [Escherichia
coli] >gi|1789980 (AE000433) IS150 hypothetical protein
[Escherichia coli] >gi|226098|prf||1410311A insertion
element IS150 ORF A [Escherichia coli]
Length = 173
Score = 44.1 bits (102), Expect = 7e-04
Identities = 35/164 (21%), Positives = 70/164 (42%), Gaps = 7/164 (4%)
Query: 3 KYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNK-NY 61
KY E + ++ Y + G ++ V +QVR W+ Y G K +
Sbjct: 5 KYPFEKRLEVVNHYFTTDDGYRIISARFGV-PRTQVRTWVALYEKHGEKGLIPKPKGVSA 63
Query: 62 SMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGE-----VPISKKRGR 116
EL+ V+ + ++ A F + +A W+ ++ GE + I KR
Sbjct: 64 DPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARWLKVYEERGEAGLRALKIGTKRNI 123
Query: 117 RKKLESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRL 160
++ ++ + +RI++LE+++R+ + + YLK L+ L
Sbjct: 124 AISVDPEKAASALELSKDRRIEDLERQVRFLETRLMYLKKLKAL 167
Posted date: Mar 2, 2000 12:24 PM
Number of letters in database: 140,871,481
Number of sequences in database: 457,798
Lambda K H
0.312 0.127 0.348
Gapped
Lambda K H
0.270 0.0470 0.230
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 48280012
Number of Sequences: 457798
Number of extensions: 1783155
Number of successful extensions: 4814
Number of sequences better than 1.0e-02: 9
Number of HSP's better than 0.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 4797
Number of HSP's gapped (non-prelim): 10
length of query: 185
length of database: 140,871,481
effective HSP length: 61
effective length of query: 124
effective length of database: 112,945,803
effective search space: 14005279572
effective search space used: 14005279572
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.8 bits)
X3: 64 (24.9 bits)
S1: 42 (21.9 bits)
S2: 93 (40.6 bits)