BLASTP 2.0.11 [Jan-20-2000]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= yidG (838 kb reverse) BIO15.01
         (185 letters)

           457,798 sequences; 140,871,481 total letters

Searching...................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAA63514|  (X92946) hypothetical protein [Lactococcus la...   370  e-102
sp|Q50239|YI6A_MYCMY  INSERTION ELEMENT IS1296 HYPOTHETICAL ...    64  8e-10
sp|Q48585|YI3A_LACJO  INSERTION ELEMENT IS1223 HYPOTHETICAL ...    63  1e-09
sp|Q47309|YI9A_ECOLI  INSERTION ELEMENT IS1397 HYPOTHETICAL ...    60  1e-08
sp|Q57066|YH20_HAEIN  HYPOTHETICAL PROTEIN HI1720 >gi|107491...    57  7e-08
emb|CAA71078|  (Y09946) first of two overlapping orfs with s...    53  1e-06
pir||B30868  hypothetical protein 1 (insertion sequence IS86...    51  6e-06
gi|790865  (L36381) orfA; putative [Neisseria gonorrhoeae]         49  3e-05
sp|P19768|YI5A_ECOLI  INSERTION ELEMENT IS150 HYPOTHETICAL 1...    44  7e-04

 emb|CAA63514| (X92946) hypothetical protein [Lactococcus lactis]
           Length = 185
           
 Score =  370 bits (939), Expect = e-102
 Identities = 184/185 (99%), Positives = 185/185 (99%)

Query: 1   MVKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNKN 60
           MVKYSIELKQR+IQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNKN
Sbjct: 1   MVKYSIELKQRVIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNKN 60

Query: 61  YSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKL 120
           YSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKL
Sbjct: 61  YSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKL 120

Query: 121 ESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNKNQDSSTVSVKPS 180
           ESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNKNQDSSTVSVKPS
Sbjct: 121 ESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNKNQDSSTVSVKPS 180

Query: 181 NSKKS 185
           NSKKS
Sbjct: 181 NSKKS 185
 sp|Q50239|YI6A_MYCMY INSERTION ELEMENT IS1296 HYPOTHETICAL 21.4 KD PROTEIN (ORFA)
           >gi|1679640 (U61140) ORFA [Mycoplasma mycoides mycoides
           SC]
           Length = 180
           
 Score = 64.0 bits (153), Expect = 8e-10
 Identities = 50/171 (29%), Positives = 85/171 (49%), Gaps = 14/171 (8%)

Query: 1   MVKYSIELKQRIIQDYLS-GKGGSTYLAKLHNVGSSS--QVRHWIRNYRAEGLPTAHSKV 57
           M K ++E K +I+++        STYLA  +++   +   + +    +R EGL     K 
Sbjct: 1   MSKLNLEKKLKIVKEAKKLNIKKSTYLANKYDISVDTVESLVNRFEAFRIEGLINKEKK- 59

Query: 58  NKNYSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRR 117
              YS +LK   V   L T+ +Y+ VA+KF I   + +A WV  ++ YG + ++   GR 
Sbjct: 60  -PYYSAKLKLKIVLYKLETNHSYDEVAKKFNIIYSSTIAGWVKKYREYGFLGLNNNIGRP 118

Query: 118 KKL--------ESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRL 160
           KK+          I  S  +  ND Q+IKEL++++ Y ++E  + K    L
Sbjct: 119 KKIMKNPNKKPAKIKKSQVKINND-QQIKELKEQVEYYKLEAEFWKKFHTL 168
 sp|Q48585|YI3A_LACJO INSERTION ELEMENT IS1223 HYPOTHETICAL 20.7 KD PROTEIN (ORFA)
           >gi|495495 (U09558) ORFA, putative Helix-Turn-Helix
           motif from amino acid 21 through 42 and from amino acid
           78 through 99. [Lactobacillus johnsonii]
           Length = 177
           
 Score = 63.2 bits (151), Expect = 1e-09
 Identities = 48/163 (29%), Positives = 78/163 (47%), Gaps = 10/163 (6%)

Query: 1   MVKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNK- 59
           M KYS ELK  I+  YL+ +     LAK +N+   + +R W+   + +GL     K  K 
Sbjct: 1   MTKYSTELKIEIVSKYLNHEDSIKGLAKQYNI-HWTLIRRWVDKAKCQGLAALSVKHTKT 59

Query: 60  NYSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVP-ISKKRGRRK 118
            YS + K N V+ YLT  +    VA KF I++ + + +W   F   G    + K++GR +
Sbjct: 60  TYSSDFKLNVVRYYLTHSIGVSKVAAKFNISD-SQVYNWAKKFNEEGYAGLLPKQKGRPR 118

Query: 119 KLESIASSMT------QNPNDSQRIKELEQELRYAQIEVAYLK 155
           K+   +   T      +     ++I + E EL   ++E   LK
Sbjct: 119 KVPKKSKKTTKKLELSEKQKYEEKILKQEAELERLRVENLVLK 161
 sp|Q47309|YI9A_ECOLI INSERTION ELEMENT IS1397 HYPOTHETICAL 20.1 KD PROTEIN (ORFA)
           >gi|1064899|emb|CAA63546| (X92970) orfA [Escherichia
           coli]
           Length = 173
           
 Score = 59.7 bits (142), Expect = 1e-08
 Identities = 45/170 (26%), Positives = 77/170 (44%), Gaps = 7/170 (4%)

Query: 2   VKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNKNY 61
           +K+S E+K   +  YL+G  G    AKL  +  +S + HWI  +   G      +  ++Y
Sbjct: 1   MKHSFEVKLAAVNHYLAGHAGIISTAKLFQLSHTS-LSHWINLFLLHGPRALDCRHKRSY 59

Query: 62  SMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKLE 121
           S E K   V   L    +   VA +F I +   + +W+  ++  G     +    RK+  
Sbjct: 60  SPEDKLCVVLYALGHSESLPRVAARFNIPSHNTVKNWIKGYRKSGNEAFIR---CRKEKS 116

Query: 122 SIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNKNQD 171
              S  T     +   +E++ ELRY + E AYLK    ++++ L  K Q+
Sbjct: 117 MTRSDDTHENEANMTPEEMKNELRYLRAENAYLKA---MQEHLLEKKRQE 163
 sp|Q57066|YH20_HAEIN HYPOTHETICAL PROTEIN HI1720 >gi|1074910|pir||D64176 hypothetical
           protein HI1720 - Haemophilus influenzae (strain Rd KW20)
           >gi|1574576 (U32845) conserved hypothetical protein
           [Haemophilus influenzae Rd]
           Length = 188
           
 Score = 57.4 bits (136), Expect = 7e-08
 Identities = 48/172 (27%), Positives = 73/172 (41%), Gaps = 10/172 (5%)

Query: 1   MVKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLP-TAHSKVNK 59
           M KY+   KQ++I+ YL     S+ L + H   + + +  WI  +   G+   A     +
Sbjct: 20  MTKYNFLFKQQVIEFYLQNDKNSS-LTRRHFQLAETTLERWINQFNHSGINGLALLGKKR 78

Query: 60  NYSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFK---IYGEVPISKKRGR 116
           NYS E K N +Q       + EA    F I N  +++ W+  F+   I G +P  K R  
Sbjct: 79  NYSPEFKLNVIQAVKNGKFSAEAACLHFGIANSGVVSQWLQAFEKQGINGLIPKPKGRPT 138

Query: 117 RKKLESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRLEKNALMNK 168
            K            P    R +ELE E    + E A LK L+ L +  +  K
Sbjct: 139 MK-----LQYPKMPPKPKTREEELELENLRLRAENAILKKLQELNQQKMQKK 185
 emb|CAA71078| (Y09946) first of two overlapping orfs with similarity to IS3 genes
           [Bacillus thuringiensis]
           Length = 185
           
 Score = 53.5 bits (126), Expect = 1e-06
 Identities = 34/97 (35%), Positives = 54/97 (55%), Gaps = 9/97 (9%)

Query: 61  YSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRRKKL 120
           YS ELK  AVQ Y+  + +Y  +  K++I + T L +WV  +K  GE  I+  RG+   +
Sbjct: 90  YSEELKLAAVQSYVNGEGSYNMIKEKYQIISSTQLKNWVKKYKESGE--ITDTRGKNGGM 147

Query: 121 ESIASSMTQNPNDSQRI--KELEQELRYAQIEVAYLK 155
           + I+     NP   +R+  K +E+E  Y + +V YLK
Sbjct: 148 KGIS-----NPLKGKRVHFKTIEEERDYYKAQVEYLK 179
 Score = 41.8 bits (96), Expect = 0.004
 Identities = 15/48 (31%), Positives = 31/48 (64%)

Query: 2   VKYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEG 49
           + YS ELK   +Q Y++G+G    + + + + SS+Q+++W++ Y+  G
Sbjct: 88  ITYSEELKLAAVQSYVNGEGSYNMIKEKYQIISSTQLKNWVKKYKESG 135
 pir||B30868 hypothetical protein 1 (insertion sequence IS861) - Streptococcus
           agalactiae >gi|1196921|gb|AAA88574.1| (M22449) unknown
           protein [Streptococcus agalactiae]
           Length = 141
           
 Score = 51.2 bits (120), Expect = 6e-06
 Identities = 33/115 (28%), Positives = 58/115 (49%), Gaps = 5/115 (4%)

Query: 58  NKNYSMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGEVPISKKRGRR 117
           N+ Y  ELK+  +   L    +  +V+  + ++N ++L +W++ FK  G   + K RGR 
Sbjct: 18  NEYYPPELKQEMIDKVLIHGCSQLSVSLDYALSNCSILTNWLSQFKKNGYTIVEKTRGRP 77

Query: 118 KKLESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLR--RLEKNALMNKNQ 170
            K+        +   + +R++E  + LR    E A+LK LR  RL   AL ++ Q
Sbjct: 78  SKMGRKRKKTWEEMTELERLQEENERLR---TENAFLKKLRDLRLRDEALQSERQ 129
 gi|790865 (L36381) orfA; putative [Neisseria gonorrhoeae]
           Length = 172
           
 Score = 48.8 bits (114), Expect = 3e-05
 Identities = 37/141 (26%), Positives = 65/141 (45%), Gaps = 3/141 (2%)

Query: 34  SSSQVRHWIRNYRAEGLP-TAHSKVNKNYSMELKENAVQCYLTTDLTYEAVARKFEITNF 92
           S S VR W+  YR  G       K    YS+E K  A++      ++ +A A +  +   
Sbjct: 32  SDSLVRKWLARYRLHGESGIKRRKHTTKYSVEYKLEAIRPVTGQGMSQKAAADQLNLPEC 91

Query: 93  TLLASWVNHFKIYGEVPISKKRGRRKKLESIASSMTQNPNDSQRIKELEQELRYAQIEVA 152
           ++L  W+  +++ G   +  K   RK ++      T+  +  +  +EL  EL Y + E A
Sbjct: 92  SVLPQWLRLYRLNGINGLKPKPKGRKPVKKQYPPQTKKADYLKTKEELFAELAYLKAEAA 151

Query: 153 YLKGLRRLEKNALMNKNQDSS 173
            LK L  L++  +  K ++SS
Sbjct: 152 VLKKLDALKE--VRQKERNSS 170
 sp|P19768|YI5A_ECOLI INSERTION ELEMENT IS150 HYPOTHETICAL 19.7 KD PROTEIN (ORFA)
           >gi|1073485|pir||S47778 hypothetical protein A
           (insertion sequence IS150) - Escherichia coli
           >gi|48676|emb|CAA30085| (X07037) ORF A [Escherichia
           coli] >gi|466695 (U00039) orfA in IS150 [Escherichia
           coli] >gi|1789980 (AE000433) IS150 hypothetical protein
           [Escherichia coli] >gi|226098|prf||1410311A insertion
           element IS150 ORF A [Escherichia coli]
           Length = 173
           
 Score = 44.1 bits (102), Expect = 7e-04
 Identities = 35/164 (21%), Positives = 70/164 (42%), Gaps = 7/164 (4%)

Query: 3   KYSIELKQRIIQDYLSGKGGSTYLAKLHNVGSSSQVRHWIRNYRAEGLPTAHSKVNK-NY 61
           KY  E +  ++  Y +   G   ++    V   +QVR W+  Y   G      K    + 
Sbjct: 5   KYPFEKRLEVVNHYFTTDDGYRIISARFGV-PRTQVRTWVALYEKHGEKGLIPKPKGVSA 63

Query: 62  SMELKENAVQCYLTTDLTYEAVARKFEITNFTLLASWVNHFKIYGE-----VPISKKRGR 116
             EL+   V+  +   ++    A  F +     +A W+  ++  GE     + I  KR  
Sbjct: 64  DPELRIKVVKAVIEQHMSLNQAAAHFMLAGSGSVARWLKVYEERGEAGLRALKIGTKRNI 123

Query: 117 RKKLESIASSMTQNPNDSQRIKELEQELRYAQIEVAYLKGLRRL 160
              ++   ++     +  +RI++LE+++R+ +  + YLK L+ L
Sbjct: 124 AISVDPEKAASALELSKDRRIEDLERQVRFLETRLMYLKKLKAL 167
Posted date: Mar 2, 2000 12:24 PM Number of letters in database: 140,871,481 Number of sequences in database: 457,798 Lambda K H 0.312 0.127 0.348 Gapped Lambda K H 0.270 0.0470 0.230 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 48280012 Number of Sequences: 457798 Number of extensions: 1783155 Number of successful extensions: 4814 Number of sequences better than 1.0e-02: 9 Number of HSP's better than 0.0 without gapping: 4 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 4797 Number of HSP's gapped (non-prelim): 10 length of query: 185 length of database: 140,871,481 effective HSP length: 61 effective length of query: 124 effective length of database: 112,945,803 effective search space: 14005279572 effective search space used: 14005279572 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.8 bits) X3: 64 (24.9 bits) S1: 42 (21.9 bits) S2: 93 (40.6 bits)