National Center for Biotechnology Information (NCBI)

welcome to the blast network service.


PEPTIDE SEQUENCE DATABASES

 nr        Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
 month     All new or revised GenBank CDS translation+PDB+SwissProt+PIR
           sequences released in the last 30 days
 pdb       PDB protein sequences
 yeast     Yeast (Saccharomyces cerevisiae) protein sequences.
 kabat     Kabat Sequences of Proteins of Immunological Interest
 alu *     Translations of Select Alu Repeats from REPBASE
 swissprot SwissProt sequences


NUCLEOTIDE SEQUENCE DATABASES

 nr       Non-redundant GenBank+EMBL+DDBJ+PDB sequences
          (but no EST, STS, GSS, or HTGS sequences) 
 month    All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
          the last 30 days
 yeast    Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
 est +    Non-redundant Database of GenBank+EMBL+DDBJ EST Division
 sts +    Non-redundant Database of GenBank+EMBL+DDBJ STS Division
 htgs     High Throughput Genomic Sequences 
 pdb      PDB nucleotide sequences
 vector   Vector subset of GenBank
 mito *   Database of mitochondrial sequences, Rel. 1.0, July 1995
 gss      Genome Survey Sequences (includes single-pass genomic data, exon-
          trapped sequences, and Alu PCR primers.
 kabat    Kabat Sequences of Nucleic Acid of Immunological Interest
 epd      Eukaryotic Promotor Database
 alu *+   Select Alu Repeats from REPBASE



BLASTP 2.2.3 [Apr-24-2002]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= orf1 [1-28]
         (27 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
           1,028,106 sequences; 324,535,921 total letters

Searching done
  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF
    Posted date:  Jul 12, 2002 12:04 AM
  Number of letters in database: 324,535,921
  Number of sequences in database:  1,028,106
  
Lambda     K      H
   0.321    0.140    0.000 

Gapped
Lambda     K      H
   0.267   0.0410    0.000 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 21,062,022
Number of Sequences: 1028106
Number of extensions: 378603
Number of successful extensions: 459
Number of sequences better than 5.0e-02: 0
Number of HSP's better than  0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 459
Number of HSP's gapped (non-prelim): 0
length of query: 27
length of database: 324,535,921
effective HSP length: 3
effective length of query: 24
effective length of database: 321,451,603
effective search space: 7714838472
effective search space used: 7714838472
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf2 [41-279]
         (237 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
           1,028,106 sequences; 324,535,921 total letters

Searching done
  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF
    Posted date:  Jul 12, 2002 12:04 AM
  Number of letters in database: 324,535,921
  Number of sequences in database:  1,028,106
  
Lambda     K      H
   0.319    0.133    0.000 

Gapped
Lambda     K      H
   0.267   0.0410    0.000 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,298,903
Number of Sequences: 1028106
Number of extensions: 3022093
Number of successful extensions: 8164
Number of sequences better than 5.0e-02: 0
Number of HSP's better than  0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8164
Number of HSP's gapped (non-prelim): 0
length of query: 237
length of database: 324,535,921
effective HSP length: 118
effective length of query: 119
effective length of database: 203,219,413
effective search space: 24183110147
effective search space used: 24183110147
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 89 (38.9 bits)
Query= orf3 [273-947]
         (673 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
           1,028,106 sequences; 324,535,921 total letters

Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAA89640.1|  (AB036666) similar to terminase large subun...   210  5e-053
dbj|BAA89621.1|  (AB036665) similar to terminase large subun...   206  1e-051
ref|NP_518974.1|  (NC_003295) PROBABLE BACTERIOPHAGE-RELATED...   205  2e-051
ref|NP_297798.1|  (NC_002488) phage-related terminase large ...   123  6e-027
ref|NP_461544.1|  (NC_003197) Gifsy-1 prophage: similar to t...   112  1e-023
sp|P36693|TERL_BPP21  TERMINASE LARGE SUBUNIT (GP2) >gi|5407...   102  1e-020
ref|NP_310206.1|  (NC_002695) putative terminase large subun...    97  6e-019
ref|NP_287368.1|  (NC_002655) putative DNA packaging protein...    96  1e-018
ref|NP_040581.1|  (NC_001416) DNA packaging protein [Bacteri...    95  2e-018
ref|NP_046897.1|  (NC_001901) gp2 [Bacteriophage N15] >gi|74...    93  9e-018
emb|CAA26343.1|  (X02501) gp 2 (aa 1-309) [Bacteriophage 21]...    73  1e-011
>dbj|BAA89640.1| (AB036666) similar to terminase large subunit of phage lambda
           [Wolbachia sp. wKue]
          Length = 609

 Score =  210 bits (534), Expect = 5e-053
 Identities = 172/615 (27%), Positives = 276/615 (43%), Gaps = 80/615 (13%)

Query: 29  LTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
           L  S+WA + R L    +  PG +  +  PY +E++D   P SP   V   KGAQ+G T 
Sbjct: 17  LKVSEWANEYRVLAPTAASEPGKWRTERTPYLKEIMDSLSPSSPTEKVVFMKGAQIGGTE 76

Query: 89  SVLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGA 148
           +   N IGY ID      M+      ++ KR       P++ +    +S ++       +
Sbjct: 77  AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPCLKSKVK----DPRS 130

Query: 149 KRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTE 208
           +  G + +  E+ GG V+   GA S   +RS PV +L  DE+  +P   G +GDP+ L+ 
Sbjct: 131 RDSGNTVQSKEFPGGIVVIT-GANSSVALRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSI 189

Query: 209 TRTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQRYYEVPCKHCGEYQRLYFRGNAKNG 268
            RTN+FA  RKI  +STP + G   I + FE  D+RY+ VPC HC  YQ L +       
Sbjct: 190 ARTNTFAR-RKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW------- 241

Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIMGEGNWVPTAKPTRPDFRSYHLS 328
             ++ WE         +  YVC  C G++ N  K  ++  G W PT +  + + + +HLS
Sbjct: 242 -SQIKWENSDS----RTAHYVCTECSGKIENHQKTEMLDRGEWRPTNR-VKGEKKGFHLS 295

Query: 329 AMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREVY 388
           ++Y+P    SWQ+  + ++   + E+        L+V+ N  LGE +  K      ++++
Sbjct: 296 SLYSPVGWYSWQQAVEDFLHAKESEQ-------LLKVWINTTLGETWVDKGEVPDWKQLF 348

Query: 389 AHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYIID 448
             R  +  G VP    +         +T  VDVQ   L V  + W  S +      + ID
Sbjct: 349 NRREFFQIGTVPRREVV---------LTAGVDVQKDRLEVEVVAWGKSRE-----SWSID 394

Query: 449 YEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDASELTDVVYEYC--NDW 506
           Y+  EGD  T  GE W KL+E+++       G  Y I+ + +DA   T  VY +   +  
Sbjct: 395 YQVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452

Query: 507 GENVMPIRG-------------RDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRWSP 553
              VM ++G              D+ + G ++K        +G++   V V + K     
Sbjct: 453 SGRVMAVKGVNKALVPLSSPSRVDITVGGQKLK--------RGIKLWPVGVSILKSELFQ 504

Query: 554 TLRKEWSGQGEMPRGMLSAPVDMPDKHLKELTVEYKREERDPDTNKII----RQVWHRPG 609
            L      +G+   G    P   P+ + K+LT E          +K++    +Q W +  
Sbjct: 505 LLNILKEEEGKALPGYCHFPEYAPE-YFKQLTAEQL-------VSKVVKGYTKQEWQKV- 555

Query: 610 GSRNELWDTLIYNTA 624
             RNE+ D  IY  A
Sbjct: 556 RERNEVLDCRIYARA 570
>dbj|BAA89621.1| (AB036665) similar to terminase large subunit of lambda
           [bacteriophage WO]
          Length = 610

 Score =  206 bits (523), Expect = 1e-051
 Identities = 170/613 (27%), Positives = 275/613 (44%), Gaps = 78/613 (12%)

Query: 29  LTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
           L  S+WA + R L    +  PG +  +  PY +E++D   P SP   V   KGAQ+G T 
Sbjct: 17  LKVSEWANEYRVLAATAASEPGKWRTERTPYLKEIMDSLSPSSPAEKVIFMKGAQIGGTE 76

Query: 89  SVLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGA 148
           +   N IGY ID      M+      ++ KR       P++ +    +S ++       +
Sbjct: 77  AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPCLKSKVK----DPRS 130

Query: 149 KRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTE 208
           +  G + +  E+ GG V+   GA S   +RS PV +L  DE+  +P   G +GDP+ L+ 
Sbjct: 131 RDSGNTVQSKEFPGGIVVIT-GANSSVGLRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSI 189

Query: 209 TRTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQRYYEVPCKHCGEYQRLYFRGNAKNG 268
            RTN+FA  RKI  +STP + G   I + FE  D+RY+ VPC HC  YQ L +       
Sbjct: 190 ARTNTFAR-RKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW------- 241

Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIMGEGNWVPTAKPTRPDFRSYHLS 328
             ++ WE +     P++  Y+C  CG ++ N  K  ++  G W  T +    + + +HLS
Sbjct: 242 -SQIKWEDKN----PNTAHYICIECGKKIENHQKTEMLDRGEWRAT-EAKEGEKKGFHLS 295

Query: 329 AMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREVY 388
           ++Y+P    SW +  + ++   + E+        L+V+ N  LGE +  K      ++++
Sbjct: 296 SLYSPVGWYSWTQAVEDFLHAKESEQ-------LLKVWINTTLGETWVDKGEVPDWKQLF 348

Query: 389 AHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYIID 448
             R  +  G VP    +         +T  VDVQ   L V  + W  S +      + ID
Sbjct: 349 NRREFFPVGTVPRREVV---------LTAGVDVQEDRLEVEVVAWGKSREN-----WSID 394

Query: 449 YEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDASELTDVVYEYC--NDW 506
           Y   EGD  T  GE W KL+E+++       G  Y I+ + +DA   T  VY +   +  
Sbjct: 395 YRVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452

Query: 507 GENVMPIRG-----------RDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRWSPTL 555
              VM ++G            D+ + G ++K        +G++   V V + K      L
Sbjct: 453 SGRVMAVKGVALVPLSSPSRVDITVGGQKLK--------RGIKLWPVGVSILKSELFQLL 504

Query: 556 RKEWSGQGEMPRGMLSAPVDMPDKHLKELTVEYKREERDPDTNKII----RQVWHRPGGS 611
                 +G+   G    P   P+ + K+LT E          +K++    +Q W +    
Sbjct: 505 NVLKEEEGKALPGYCHFPEYAPE-YFKQLTAEQL-------VSKVVKGYTKQEWQKV-RE 555

Query: 612 RNELWDTLIYNTA 624
           RNE+ D  IY  A
Sbjct: 556 RNEVLDCRIYARA 568
>ref|NP_518974.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD14555.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 660

 Score =  205 bits (521), Expect = 2e-051
 Identities = 176/620 (28%), Positives = 275/620 (43%), Gaps = 57/620 (9%)

Query: 29  LTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
           LT S+W+++ R L  + S  PG +     PY R ++DC  P SP+  V   K AQ+GAT 
Sbjct: 26  LTVSEWSDRHRMLSSKASAEPGRWRTGRTPYLRAIMDCLSPTSPIERVVFMKAAQLGAT- 84

Query: 89  SVLENLIGYGIDYVKTASM-IFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKG 147
            +  N IGY I +     M ++ TV  ++ KR     I P++       S + A   +  
Sbjct: 85  EMGSNWIGYVIHHAPGPMMAVWPTV--EMAKRNSKQRIDPLI-----EESAVLAERIAPA 137

Query: 148 AKRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLT 207
             R   +    +   GGVL   GA S   +RS PV +L  DEV G+P  V  +GD + L 
Sbjct: 138 RSRDSGNTILAKEFRGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLA 197

Query: 208 ETRTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQRYYEVPCKHCGEYQRLYFRGNAKN 267
           E RT +FA  RKI  +STP +AG  TI + ++  DQR Y VPC HC   Q L F      
Sbjct: 198 EARTRTFAR-RKIFIVSTPTIAGASTIEREYDASDQRRYFVPCPHCDHRQWLRFE----- 251

Query: 268 GQGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIMGEGNWVPTAKPTRPDFRSYHL 327
              +L W T+G    P++  Y+C  C   +    K  ++ +G W    + T      +HL
Sbjct: 252 ---QLRW-TKG---EPETAAYICEACFEPIHEHHKAWMLAQGEWRAMVE-TGGRTAGFHL 303

Query: 328 SAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREV 387
           S++Y+P   RSW+EIA AW      E   A     ++ F N +LGE +  +      + +
Sbjct: 304 SSLYSPVGWRSWREIAAAWESAVSKESGSAA---AIKTFRNTELGETWVEEGEAPDWQRL 360

Query: 388 YAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYII 447
              R DY  G VP    +   G  ++   + V V           WA      G   +++
Sbjct: 361 LERREDYPIGTVPAGGLLLSAGADVQKDRIEVSV-----------WAFG---RGKEAWLV 406

Query: 448 DYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDASELTDVVYEYCNDWG 507
           ++  + GD  T     W++L E++  +   + G    +AR+ +D    T   Y +    G
Sbjct: 407 EHRVLMGD--TARDAVWKRLAELVEEQWTHASGATMPLARLALDTGFATQEAYAFVRACG 464

Query: 508 E-NVMPIRG--RDLPIKGAQIKY---FNRQVNEKGVEYLSVTVDLYKDRWSPTLRK--EW 559
           +  VM ++G  R   + G         N +   +G++  +V V + K  +   LRK  + 
Sbjct: 465 DARVMAVKGTARGAALIGTPTAVDVTRNGKKLRRGIKVFTVAVGIAKLEFYNNLRKAADV 524

Query: 560 SGQG---EMPRGMLSAPVDMPDKHLKELTVEYKREERDPDTNKIIRQVWHRPGGSRNELW 616
           +  G     P G +  P  +  + L++L  E     RD     I  + W +    RNE  
Sbjct: 525 AKDGATIAFPTGFVHLP-KIDAEFLQQLCAEQLITRRDRSGFPI--RAWQKM-RERNEAL 580

Query: 617 DTLIYNTAIFESMVLEACED 636
           D  +Y  A   +  L+  E+
Sbjct: 581 DCYVYARAAASAAGLDRFEE 600
>ref|NP_297798.1| (NC_002488) phage-related terminase large subunit [Xylella
           fastidiosa 9a5c]
 pir||D82798 phage-related terminase large subunit XF0508 [imported] - Xylella
           fastidiosa (strain 9a5c)
 gb|AAF83318.1|AE003899_10 (AE003899) phage-related terminase large subunit [Xylella
           fastidiosa 9a5c]
          Length = 699

 Score =  123 bits (309), Expect = 6e-027
 Identities = 160/647 (24%), Positives = 237/647 (35%), Gaps = 122/647 (18%)

Query: 32  SQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVL 91
           SQWA+  R + +     PG +  D  P  RE+++C   ++PV  V+  K  Q+GAT  + 
Sbjct: 35  SQWADAYRKIARGSGAEPGQWRTDRHPPLREIMNCLSDHTPVQQVSFMKSGQIGAT-EIG 93

Query: 92  ENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGAKRR 151
            N + Y ID    + ++   V D      V  F   +L        L   N F K     
Sbjct: 94  INWVCYVIDRGIDSMIVTQPVKDLARTWTVAKFDPGVLDMPPLLNKLTTNNTFEKQYP-- 151

Query: 152 GASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTETRT 211
                      GG L+   A S  ++R     +   DE+  +P+++   G   +    R 
Sbjct: 152 -----------GGTLFVKWANSSSQLRQITACYAFLDEIDEYPRNLNNQGTADQQIAARI 200

Query: 212 NSFANSRKILDLSTPLLAGTDTITQRFEKGDQRYYEVPCKHCGEYQRLYFRGNAKNGQGR 271
            S     KI    TP +AG   I   F  GDQR+Y + C HCG  Q L            
Sbjct: 201 MSHGERGKIYRACTPTVAGGSAIEASFLDGDQRHYHIHCPHCGSEQILDLEH-------- 252

Query: 272 LIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIM------GEGNWVPTAKPTRPDFRSY 325
                    L PD   + C   G  +    K TI+      G   W P      PD RSY
Sbjct: 253 ---------LQPDGT-FACAVNGCIIQEHHKNTILKERGTGGTAFWHPHNPSAPPDHRSY 302

Query: 326 HLSAMYAPY-YARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKP 384
           +L A YAP     SW++IA  W E         +D  +L  F N  LG  ++ +      
Sbjct: 303 YLWAAYAPLGLGLSWKQIADKWAEA-------KRDPSKLPGFTNLILGLPFQGERDARNA 355

Query: 385 REVYA-HRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYA 443
            EV         RG VP        GG +  +   VD+ H    V  I     A   G  
Sbjct: 356 HEVATLAEPGVYRGLVPR-------GGLV--LAAGVDLAHDRAEVHLI-----ATGRGQR 401

Query: 444 PYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDASELTDVVYEYC 503
            YI+DY  ++ D   +  E +  L   +     T+ G    I+ V ID    T+ V ++ 
Sbjct: 402 RYIVDYAVIDLDPTVL--ESYADLDAYLRGTWKTASGIDMPISAVAIDGGNWTETVAQFV 459

Query: 504 ND---WG--------------ENVMPIRGR-DLPIKGAQIKYFNRQVNEKGVEYLSVTVD 545
                W               + +  +RGR ++    A  +     V+E+G + L+  V 
Sbjct: 460 KQHVGWSGQSRILDTPQGFLKQTLYLVRGRAEVKSDRAVYRPSKTTVDERG-KTLARDVG 518

Query: 546 LY---------------------KDRWSPT----------LRKEWSGQGEMPRGMLSAPV 574
           ++                     KD  + T          LR       ++P  +   P 
Sbjct: 519 VWGVGTSVLKHMIYGWLGAALTAKDNAAQTGTVEDLSARMLRFPGGRGDDIPDPLHPDPG 578

Query: 575 DMPDKHLKELTVEYKREERDPDTNKIIRQVWHRPGGSRNELWDTLIY 621
            +P+ +   LT EY     D D  +     W +P G RNE  DT +Y
Sbjct: 579 ALPEHYFAGLTAEY----FDKDAGR-----WIKPRGVRNEPLDTAVY 616
>ref|NP_461544.1| (NC_003197) Gifsy-1 prophage: similar to terminase large chain gp2
           of N15 [Salmonella typhimurium LT2]
 gb|AAL21503.1| (AE008818) Gifsy-1 prophage: similar to terminase large chain gp2
           of N15 [Salmonella typhimurium LT2]
          Length = 643

 Score =  112 bits (280), Expect = 1e-023
 Identities = 157/653 (24%), Positives = 266/653 (40%), Gaps = 100/653 (15%)

Query: 9   AESAERVIEGLLATTDEREFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFD 68
           A +A R I   L        LT  QWA++  +LP++ S  PG   ++  P+   +++   
Sbjct: 8   ANNANRAITNGLIALHIPVPLTTVQWADEYYYLPKESSYTPG--KWETLPFQVAIMNAMG 65

Query: 69  PYSPVHFVAVKKGAQVGATVSVLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPM 128
            Y  +  V + K A+VG T  +L  + GY I++    S++F   D    +  + + + P 
Sbjct: 66  -YELIRVVNLIKSARVGYTKMLL-GVEGYFIEHKSRNSLLFQPTDSSA-EDFMKSHVEPT 122

Query: 129 LRAAQASRSLIQANDFSKGAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRD 188
           +R       L  A  F  G K R  +     ++ G   +  G  +    R   V  +  D
Sbjct: 123 IRDVPVLLEL--APWF--GRKHRDNTLTLKRFSSGVGFWCLGGAAAKNYREKSVDVVCYD 178

Query: 189 EVSGWPQSVGKDGDPMKLTETRTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQ--RYY 246
           E+S +   V K+G P  L + R       + I   STP + G+  I +   +     R+Y
Sbjct: 179 ELSSFEPDVEKEGSPTLLGDKRIEGSVWPKSIRG-STPKVKGSCQIEKAANESAHFMRFY 237

Query: 247 EVPCKHCGEYQRLYFRGNAKNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIM 306
            VPC HCGE Q L F G+     G L WE       P++V Y+C H  G +I + ++   
Sbjct: 238 -VPCPHCGEEQYLKF-GDGSTPFG-LKWEKS----KPETVYYLCEH-NGCVIRQSELD-Q 288

Query: 307 GEGNWV--PTAKPTR---------------PDFRSYHLSAMYAPYYARSWQEIAQAWIEC 349
             G W+   T   TR               P   ++H+   Y+P+   +W +I   W++ 
Sbjct: 289 KAGRWICDNTGMWTRDGLAYFSASGEEVPPPRSITFHIWTAYSPF--TTWIQIIYDWLDA 346

Query: 350 WDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTG 409
                   KD + ++ F N  LGE YE   +     E+   +  +    VP         
Sbjct: 347 -------LKDPNGVKTFINTTLGEPYEEAVAEKLSHELLLEKVIHYAAPVPER------- 392

Query: 410 GPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTE 469
             +  +T  +D Q     +   GWAP     G   ++ID + + G  +  + +  Q++  
Sbjct: 393 --VVYLTAGIDSQRNRYEMYVWGWAP-----GEEAFLIDKQIIMG--RHDDEDTLQRVDA 443

Query: 470 IISTRQYTSGGRVYDIARV-----GIDASELTDVVYEYCNDWG-ENVMPIRGRDLPIKGA 523
           +I+ +   + G    I+R+     GIDA    ++VY+     G   V+P++G    + G 
Sbjct: 444 VINKKYRHADGTDISISRICWDIGGIDA----EIVYKRSKKHGIFRVLPVKGAS--VYGK 497

Query: 524 QIKYFNRQVNEKGVEYLSVTVDLYKDRWSPTLRKEWSGQGEMPRGMLSAPVD-------- 575
            +    ++ N+ GV    +  D  K+     +            G ++AP D        
Sbjct: 498 PVITMPKKRNQSGVFLCEIGTDTAKEMLYARM------------GAVTAPADEATPYAIR 545

Query: 576 MPDKHLKELTVEYKREERDPDTNKII----RQVWHRPGGSRNELWDTLIYNTA 624
            PD       VE K+   +    K++    R +W    G RNE  D L+Y +A
Sbjct: 546 FPDNPDVFTEVEAKQLVAEELVEKLVNGKFRLLWDAK-GRRNEALDCLVYASA 597
>sp|P36693|TERL_BPP21 TERMINASE LARGE SUBUNIT (GP2)
 pir||B49849 terminase large chain gp2 - phage 21
 gb|AAA32340.1| (M81255) terminase large subunit [Bacteriophage 21]
          Length = 642

 Score =  102 bits (255), Expect = 1e-020
 Identities = 153/645 (23%), Positives = 259/645 (39%), Gaps = 75/645 (11%)

Query: 1   MYSASDYYAESAERVIEGLLATTDEREFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYW 60
           M S +   A +A  +  GLL+       LT  QWA K  +LP++ S  PG   ++  P+ 
Sbjct: 1   MISDAQKAANAAGAIATGLLSLIIPVP-LTTVQWANKHYYLPKESSYTPG--RWETLPFQ 57

Query: 61  REVVDCFDPYSPVHFVAVKKGAQVGATVSVLENLIGYGIDYVKTASMIFATVDDDVTKRR 120
             +++C      +  V + K A+VG T  +L  +  Y I++    S++F   D    +  
Sbjct: 58  VGIMNCMGN-DLIRTVNLIKSARVGYTKMLL-GVEAYFIEHKSRNSLLFQPTDS-AAEDF 114

Query: 121 VTNFILPMLRAAQASRSLIQANDFSKGAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSF 180
           + + + P +R   A   L  A  F  G K R  +     ++ G   +  G  +    R  
Sbjct: 115 MKSHVEPTIRDVPAMLEL--APWF--GRKHRDNTLTLKRFSSGVGFWCLGGAAAKNYREK 170

Query: 181 PVPWLLRDEVSGWPQSVGKDGDPMKLTETRTNSFANSRKILDLSTPLLAGTDTITQRFEK 240
            V  +  DE+S +   V K+G P  L + R       + I   STP + G+  I +   +
Sbjct: 171 SVDVVCYDELSSFEPDVEKEGSPTLLGDKRIEGSVWPKSIRG-STPKIKGSCQIEKAANE 229

Query: 241 GDQ--RYYEVPCKHCGEYQRLYFRGNAKNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMI 298
                R+Y VPC HCGE Q L F  +A      L WE       P+SV Y+C H  G +I
Sbjct: 230 SAHFMRFY-VPCPHCGEEQYLKFGDDA--SPFGLKWEKN----KPESVFYLCEH-HGCVI 281

Query: 299 NEDKVTIMGEGNWV--PTAKPTR---------------PDFRSYHLSAMYAPYYARSWQE 341
           ++ ++     G W+   T   TR               P   ++H+   Y+P+   +W +
Sbjct: 282 HQSELD-QSNGRWICENTGMWTRDGLMFFSARGDEIPPPRSITFHIWTAYSPF--TTWVQ 338

Query: 342 IAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREVYAHRRDYLRGEVPN 401
           I   W++         KD + L+ F N  LGE +E         +V   +       VP 
Sbjct: 339 IVYDWLDA-------LKDPNGLKTFVNTTLGETWEEAVGEKLDHQVLMDKVVRYTAAVP- 390

Query: 402 EHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYIIDYEHVEGDCKTIEG 461
                     +  +T  +D Q     +   GWAP     G   +++D   + G  +  E 
Sbjct: 391 --------ARVVYLTAGIDSQRNRFEMYVWGWAP-----GEEAFLVDKIIIMG--RPDEE 435

Query: 462 EGWQKLTEIISTRQYTSGGRVYDIARVGIDASELT-DVVYEYCNDWGE-NVMPIRGRDLP 519
           E   ++   I+ +   + G    I+RV  D   +  ++VY+     G   V+P++G    
Sbjct: 436 ETLLRVDAAINKKYRHADGTEMTISRVCWDIGGIDGEIVYQRSKKHGVFRVLPVKGAS-- 493

Query: 520 IKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRWSPTLRKEWSGQGEMPRGMLSAPVD---M 576
           + G  +    +  N++GV    V  D  K+     ++ + +   E     +  P D    
Sbjct: 494 VYGKPVITMPKTRNQRGVYLCEVGTDTAKEILYARMKADPTPVDEATSYAIRFPDDPEIF 553

Query: 577 PDKHLKELTVEYKREERDPDTNKIIRQVWHRPGGSRNELWDTLIY 621
                ++L  E   E+ +      +R +W      RNE  D L+Y
Sbjct: 554 SQTEAQQLVAEELVEKWEKGK---MRLLWDNK-KRRNEALDCLVY 594
>ref|NP_310206.1| (NC_002695) putative terminase large subunit [Escherichia coli
           O157:H7]
 dbj|BAB35602.1| (AP002557) putative terminase large subunit [Escherichia coli
           O157:H7]
          Length = 641

 Score = 97.1 bits (240), Expect = 6e-019
 Identities = 143/647 (22%), Positives = 254/647 (39%), Gaps = 71/647 (10%)

Query: 30  TPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVS 89
           T  +WA+   +LP++ +   G   ++  P+ R +++       +  V V K A+VG +  
Sbjct: 29  TAVEWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKM 85

Query: 90  VLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGAK 149
           +L  +  Y I++ +  ++I+   D D      T+ + P +R   +  SL        G K
Sbjct: 86  LL-GVYAYFIEHKQRNTLIWLPTDGDAENFMKTH-VEPTIRDIPSLLSLAPWY----GKK 139

Query: 150 RRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTET 209
            R  +     +  G   +  G ++    R   V     DE++ + + + ++G P  L + 
Sbjct: 140 HRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDK 199

Query: 210 RTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQ-RYYEVPCKHCGEYQRLYFRGNAKNG 268
           R       + I   STP + GT  I +   +      + V C HCGE Q L F G+ +  
Sbjct: 200 RIEGSVWPKSIRG-STPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF-GDKETP 257

Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINED------KVTIMGEGNWVPTA------- 315
            G L W  +     P SV Y+C H    +  ++      +      G W           
Sbjct: 258 FG-LKWTPDD----PSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSS 312

Query: 316 --KPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGE 373
             +   PD  ++H+   Y+P+   +W +I + W++   D   +       + F N  LGE
Sbjct: 313 GEEIEPPDSVTFHIWTAYSPF--TTWVQIVKDWMKTKGDTGKR-------KTFVNTTLGE 363

Query: 374 AYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGW 433
            +E K       EV A R+++    VP+          +  +T  +D Q     +   GW
Sbjct: 364 TWEAKIGERPDAEVMAERKEHYSASVPDR---------VAYLTAGIDSQLDRYEMRVWGW 414

Query: 434 APSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDAS 493
            P     G   ++ID + + G  +  + +   ++ E I+       G    ++R+  D  
Sbjct: 415 GP-----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTG 467

Query: 494 EL-TDVVYEYCNDWG-ENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRW 551
            +   +VYE     G   V+PI+G    + G  +    R+ N+ GV    +  D  K++ 
Sbjct: 468 GIDPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQI 525

Query: 552 SPTLRKEWSGQGEMPRGMLSAPVDMPD----KHLKELTVEYKREERDPDTNKIIRQVWHR 607
                    G   +P G +  P + PD       ++LT E + E+      KI+     R
Sbjct: 526 YNRFTLTPEGDEPLP-GAVHFP-NNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR 583

Query: 608 PGGSRNELWDTLIYNTAIFESMVLEACEDIVGLEALVWPEFWAIAGK 654
               RNE  D  +Y  A     +     D+  L A +  E  A   K
Sbjct: 584 ----RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_287368.1| (NC_002655) putative DNA packaging protein of prophage CP-933X
           [Escherichia coli O157:H7 EDL933]
 ref|NP_309657.1| (NC_002695) terminase large subunit [Escherichia coli O157:H7]
 gb|AAG55980.1|AE005330_12 (AE005330) putative DNA packaging protein of prophage CP-933X
           [Escherichia coli O157:H7 EDL933]
 dbj|BAB35053.1| (AP002555) terminase large subunit [Escherichia coli O157:H7]
          Length = 641

 Score = 95.9 bits (237), Expect = 1e-018
 Identities = 142/647 (21%), Positives = 254/647 (38%), Gaps = 71/647 (10%)

Query: 30  TPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVS 89
           T  +WA+   +LP++ +   G   ++  P+ R +++       +  V V K A+VG +  
Sbjct: 29  TAVEWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKM 85

Query: 90  VLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGAK 149
           +L  +  Y I++ +  ++I+   D D      T+ + P +R   +  +L        G K
Sbjct: 86  LL-GVYAYFIEHKQRNTLIWLPTDGDAENFMKTH-VEPTIRDIPSLLALAPWY----GKK 139

Query: 150 RRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTET 209
            R  +     +  G   +  G ++    R   V     DE++ + + + ++G P  L + 
Sbjct: 140 HRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDK 199

Query: 210 RTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQ-RYYEVPCKHCGEYQRLYFRGNAKNG 268
           R       + I   STP + GT  I +   +      + V C HCGE Q L F G+ +  
Sbjct: 200 RIEGSVWPKSIRG-STPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF-GDKETP 257

Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINED------KVTIMGEGNWVPTA------- 315
            G L W  +     P SV Y+C H    +  ++      +      G W           
Sbjct: 258 FG-LKWTPDD----PSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSS 312

Query: 316 --KPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGE 373
             +   PD  ++H+   Y+P+   +W +I + W++   D   +       + F N  LGE
Sbjct: 313 GEEIEPPDSVTFHIWTAYSPF--TTWVQIVKDWMKTKGDTGKR-------KTFVNTTLGE 363

Query: 374 AYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGW 433
            +E K       EV A R+++    VP+          +  +T  +D Q     +   GW
Sbjct: 364 TWEAKIGERPDAEVMAERKEHYSASVPDR---------VAYLTAGIDSQLDRYEMRVWGW 414

Query: 434 APSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDAS 493
            P     G   ++ID + + G  +  + +   ++ E I+       G    ++R+  D  
Sbjct: 415 GP-----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTG 467

Query: 494 EL-TDVVYEYCNDWG-ENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRW 551
            +   +VYE     G   V+PI+G    + G  +    R+ N+ GV    +  D  K++ 
Sbjct: 468 GIDPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQI 525

Query: 552 SPTLRKEWSGQGEMPRGMLSAPVDMPD----KHLKELTVEYKREERDPDTNKIIRQVWHR 607
                    G   +P G +  P + PD       ++LT E + E+      KI+     R
Sbjct: 526 YNRFTLTPEGDEPLP-GAVHFP-NNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR 583

Query: 608 PGGSRNELWDTLIYNTAIFESMVLEACEDIVGLEALVWPEFWAIAGK 654
               RNE  D  +Y  A     +     D+  L A +  E  A   K
Sbjct: 584 ----RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_040581.1| (NC_001416) DNA packaging protein [Bacteriophage lambda]
 sp|P03708|TERL_LAMBD TERMINASE LARGE SUBUNIT (DNA PACKAGING PROTEIN A) (GPA)
 pir||JVBPAL DNA-packaging protein A - phage lambda
 gb|AAA96534.1| (J02459) A (DNA packaging;641) [bacteriophage lambda]
          Length = 641

 Score = 95.1 bits (235), Expect = 2e-018
 Identities = 143/647 (22%), Positives = 253/647 (39%), Gaps = 71/647 (10%)

Query: 30  TPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVS 89
           T  +WA+   +LP++ +   G   ++  P+ R +++       +  V V K A+VG +  
Sbjct: 29  TAVEWADANYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKM 85

Query: 90  VLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGAK 149
           +L  +  Y I++ +  ++I+   D D      T+ + P +R   +  +L        G K
Sbjct: 86  LL-GVYAYFIEHKQRNTLIWLPTDGDAENFMKTH-VEPTIRDIPSLLALAPWY----GKK 139

Query: 150 RRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTET 209
            R  +     +  G   +  G ++    R   V     DE++ +   + ++G P  L + 
Sbjct: 140 HRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEQEGSPTFLGDK 199

Query: 210 RTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQ-RYYEVPCKHCGEYQRLYFRGNAKNG 268
           R       + I   STP + GT  I +   +      + V C HCGE Q L F G+ +  
Sbjct: 200 RIEGSVWPKSIRG-STPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF-GDKETP 257

Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINED------KVTIMGEGNWVPTA------- 315
            G L W  +     P SV Y+C H    +  ++      +      G W           
Sbjct: 258 FG-LKWTPDD----PSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSS 312

Query: 316 --KPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGE 373
             +   PD  ++H+   Y+P+   +W +I + W++   D   +       + F N  LGE
Sbjct: 313 GEEIEPPDSVTFHIWTAYSPF--TTWVQIVKDWMKTKGDTGKR-------KTFVNTTLGE 363

Query: 374 AYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGW 433
            +E K       EV A R+++    VP+          +  +T  +D Q     +   GW
Sbjct: 364 TWEAKIGERPDAEVMAERKEHYSAPVPDR---------VAYLTAGIDSQLDRYEMRVWGW 414

Query: 434 APSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDAS 493
            P     G   ++ID + + G  +  + +   ++ E I+       G    I+R+  D  
Sbjct: 415 GP-----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTG 467

Query: 494 EL-TDVVYEYCNDWG-ENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRW 551
            +   +VYE     G   V+PI+G    + G  +    R+ N+ GV    +  D  K++ 
Sbjct: 468 GIDPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQI 525

Query: 552 SPTLRKEWSGQGEMPRGMLSAPVDMPD----KHLKELTVEYKREERDPDTNKIIRQVWHR 607
                    G   +P G +  P + PD       ++LT E + E+      KI+     R
Sbjct: 526 YNRFTLTPEGDEPLP-GAVHFP-NNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR 583

Query: 608 PGGSRNELWDTLIYNTAIFESMVLEACEDIVGLEALVWPEFWAIAGK 654
               RNE  D  +Y  A     +     D+  L A +  E  A   K
Sbjct: 584 ----RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_046897.1| (NC_001901) gp2 [Bacteriophage N15]
 pir||T13088 DNA-packaging protein - phage N15
 gb|AAC19038.1| (AF064539) gp2 [Bacteriophage N15]
          Length = 640

 Score = 93.2 bits (230), Expect = 9e-018
 Identities = 142/624 (22%), Positives = 245/624 (38%), Gaps = 79/624 (12%)

Query: 27  EFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGA 86
           E +T  +WA+   +LP++ +   G   ++  P+ R +++       +  V V K A+VG 
Sbjct: 26  EPMTAVEWADAHYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIRIVNVIKSARVGY 82

Query: 87  TVSVLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSK 146
           +  +L  +I Y I++ +   +++   D D     + + + P +R   +  SL        
Sbjct: 83  SKMLL-GVIAYFIEHKQRNELLWLPTDGDADNF-MKSHVEPTIRDVPSLLSLAPWY---- 136

Query: 147 GAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKL 206
           G K R  +     +  G   +  G ++    R   V  +  DE++ +   + K+G P  L
Sbjct: 137 GKKHRDNTLSMKRFTNGRGFWCLGGKAAKNYREKSVDVVGYDELAAFDADIEKEGSPTFL 196

Query: 207 TETRTNSFANSRKILDLSTPLLAGTDTITQRF-EKGDQRYYEVPCKHCGEYQRLYFRGNA 265
            + R       + I   STP L GT  I +   E G    + V C HCGE Q L F G+ 
Sbjct: 197 GDKRIEGSVWPKSIRG-STPKLRGTCQIERAAKESGHFMRFHVACPHCGEEQYLKF-GDR 254

Query: 266 KNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIMGE-------GNWV------ 312
               G   WE E      ++V Y+C H    +I + ++            G W       
Sbjct: 255 DTPFG-FKWEPEQA----ETVYYLCEH-NACVIKQHELDFSNARYICELTGIWTRDGLRW 308

Query: 313 ---PTAKPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNN 369
                A+   P+  ++H+   Y+P+   +W +I + W +   D   +       + F N 
Sbjct: 309 FSSSNAEIDPPESVTFHIWTAYSPF--TTWVQIVKDWFKTKGDTGKR-------KTFVNT 359

Query: 370 DLGEAYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVC 429
            LGE +E K       +V A R+++    VP           +  +T  +D Q     + 
Sbjct: 360 TLGETWEAKIGDRPDADVLAERKEHFDAAVPER---------VAYLTAGIDSQLDRYEMR 410

Query: 430 TIGWAPSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVG 489
             GW P     G   ++ID + + G  +  +     ++ E I+       G    I+R+ 
Sbjct: 411 VWGWGP-----GEESWLIDRQIIMG--RHDDESTLARVDEAINKTYTRRNGVEMSISRIC 463

Query: 490 IDASEL-TDVVYEYCNDWG-ENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLY 547
            D   +   +VY      G   V+PI+G    + G  +    R+ N+ GV    V  D  
Sbjct: 464 WDIGGIDPTIVYNRSKKHGLFRVIPIKGAS--VYGKPVANMPRKRNKNGVYLTEVGTDTA 521

Query: 548 KDRWSPTLRKEWSGQGEMPRGMLSAPVDMPD-------KHLKELTVEYKREERDPDTNKI 600
           K++     R     +G+ P   L+  V  P+          ++LT E   E+      KI
Sbjct: 522 KEQIYN--RFTLIVEGDEP---LAGAVHFPNNPDIYDLSEAQQLTAEELVEKWVDGKRKI 576

Query: 601 IRQVWHRPGGSRNELWDTLIYNTA 624
           I     R    RNE  D  +Y  A
Sbjct: 577 IWDSKKR----RNEALDCFVYALA 596
>emb|CAA26343.1| (X02501) gp 2 (aa 1-309) [Bacteriophage 21]
 gb|AAA32338.1| (M23775) glycoprotein 2 [Bacteriophage 21]
          Length = 309

 Score = 73.2 bits (178), Expect = 1e-011
 Identities = 86/314 (27%), Positives = 138/314 (43%), Gaps = 22/314 (7%)

Query: 1   MYSASDYYAESAERVIEGLLATTDEREFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYW 60
           M S +   A +A  +  GLL+       LT  QWA K  +LP++ S  PG   ++  P+ 
Sbjct: 1   MISDAQKAANAAGAIATGLLSLIIPVP-LTTVQWANKHYYLPKESSYTPG--RWETLPFQ 57

Query: 61  REVVDCFDPYSPVHFVAVKKGAQVGATVSVLENLIGYGIDYVKTASMIFATVDDDVTKRR 120
             +++C      +  V + K A+VG T  +L  +  Y I++    S++F   D    +  
Sbjct: 58  VGIMNCMGN-DLIRTVNLIKSARVGYTKMLL-GVEAYFIEHKSRNSLLFQPTDS-AAEDF 114

Query: 121 VTNFILPMLRAAQASRSLIQANDFSKGAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSF 180
           + + + P +R   A   L  A  F  G K R  +     ++ G   +  G  +    R  
Sbjct: 115 MKSHVEPTIRDVPALLEL--APWF--GRKHRDNTLTLKRFSSGVGFWCLGGAAAKNYREK 170

Query: 181 PVPWLLRDEVSGWPQSVGKDGDPMKLTETRTNSFANSRKILDLSTPLLAGTDTITQRFEK 240
            V  +  DE+S +   V K+G P  L + R       + I   STP + G+  I +   +
Sbjct: 171 SVDVVCYDELSSFEPDVEKEGSPTLLGDKRIEGSVWPKSIRG-STPKIKGSCQIEKAANE 229

Query: 241 GDQ--RYYEVPCKHCGEYQRLYFRGNAKNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMI 298
                R+Y VPC HCGE Q L F  +A      L WE       P+SV Y+C H  G +I
Sbjct: 230 SAHFMRFY-VPCPHCGEEQYLKFGDDA--SPFGLKWEKN----KPESVFYLCEH-HGCVI 281

Query: 299 NEDKVTIMGEGNWV 312
           ++ ++     G W+
Sbjct: 282 HQSELD-QSNGRWI 294
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.136 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 482,511,288 Number of Sequences: 1028106 Number of extensions: 22114366 Number of successful extensions: 47607 Number of sequences better than 5.0e-02: 21 Number of HSP's better than 0.1 without gapping: 5 Number of HSP's successfully gapped in prelim test: 16 Number of HSP's that attempted gapping in prelim test: 47521 Number of HSP's gapped (non-prelim): 34 length of query: 673 length of database: 324,535,921 effective HSP length: 128 effective length of query: 545 effective length of database: 192,938,353 effective search space: 105151402385 effective search space used: 105151402385 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 95 (41.2 bits) Query= orf4 [961-1054] (92 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.320 0.136 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 52,111,724 Number of Sequences: 1028106 Number of extensions: 1659105 Number of successful extensions: 3987 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3987 Number of HSP's gapped (non-prelim): 0 length of query: 92 length of database: 324,535,921 effective HSP length: 68 effective length of query: 24 effective length of database: 254,624,713 effective search space: 6110993112 effective search space used: 6110993112 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 84 (37.0 bits) Query= orf5 [1075-1621] (545 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_040583.1|  (NC_001416) capsid component [Bacteriophag...    82  2e-014
ref|NP_287370.1|  (NC_002655) putative capsid structural pro...    81  4e-014
ref|NP_046899.1|  (NC_001901) gp4 [Bacteriophage N15] >gi|74...    80  5e-014
ref|NP_518978.1|  (NC_003295) PROBABLE BACTERIOPHAGE-RELATED...    79  1e-013
sp|P36272|VG04_BPP21  PORTAL PROTEIN (HEAD PROTEIN GP4) >gi|...    77  4e-013
ref|NP_287794.1|  (NC_002655) putative capsid protein of pro...    75  2e-012
ref|NP_461542.1|  (NC_003197) Gifsy-1 prophage: similar to h...    74  6e-012
ref|NP_459888.1|  (NC_003197) Fels-1 prophage [Salmonella ty...    72  2e-011
ref|NP_460007.1|  (NC_003197) Gifsy-2 prophage [Salmonella t...    69  1e-010
emb|CAC83156.1|  (AJ304858) putative portal protein [Escheri...    65  2e-009
ref|NP_288458.1|  (NC_002655) putative head-tail preconnecto...    65  2e-009
>ref|NP_040583.1| (NC_001416) capsid component [Bacteriophage lambda]
 sp|P03710|VMCB_LAMBD PORTAL PROTEIN (GPB) (MINOR CAPSID PROTEIN B)
 pir||VHBPBL minor capsid protein B - phage lambda
 gb|AAA96536.1| (J02459) B (capsid component;533) [bacteriophage lambda]
          Length = 533

 Score = 81.6 bits (200), Expect = 2e-014
 Identities = 103/489 (21%), Positives = 192/489 (39%), Gaps = 30/489 (6%)

Query: 1   MPVVADPSGPVMSVENTSRFNGRGDLAHAGYRYTRHDGEKYEGGLGPIEILYTDYWSQRR 60
           +P +  P G + S+   + ++G G       R      E  +  L P      ++     
Sbjct: 6   IPTLLGPDG-MTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLP------NFTRGNA 58

Query: 61  RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
           R++   + N YA   I+    ++V +   L+  P+   L I E+ A  ++  V   ++ +
Sbjct: 59  RADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEF 118

Query: 121 GELPEVC-DFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGL---PTYELIDGDMVQ 176
            E    C D ++  TF    R   A     G++ V    ++++       + ++    + 
Sbjct: 119 AEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRIS 178

Query: 177 TPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKA-----TGANGRRIAWL 231
            P    N  G S     GV++   G  + ++V ++G+     +          G R +++
Sbjct: 179 NP----NNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFI 234

Query: 232 YYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTK 291
           +        + RG  +   VM+ +  +D +++   + A + A     I+ ++  ++A   
Sbjct: 235 HVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSA--M 292

Query: 292 PFSSGAQRRVQDTDFVGDGTERRTNIAETPF---GMMVEGLAEGEEIKAFTNDTTDEKFG 348
            F  GA  + Q     G   E     A  P    G  V  L  G+ +   T   TD  + 
Sbjct: 293 DFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYS 352

Query: 349 EFEEAILRAVAWALGMPYSVLAMQF-DSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVY 407
            FE+++LR +A  LG+ Y  L+  +   SYS +R    E  A     R   A      ++
Sbjct: 353 VFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMF 412

Query: 408 RSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQI 467
             WL   ++ R +  P   + AR   + A  +AW    W G  + A+D  KE +     I
Sbjct: 413 LCWLEEAIVRRVVTLP---SKARFSFQEAR-SAWGNCDWIGSGRMAIDGLKEVQEAVMLI 468

Query: 468 KLGALTRTK 476
           + G  T  K
Sbjct: 469 EAGLSTYEK 477
>ref|NP_287370.1| (NC_002655) putative capsid structural protein of prophage CP-933X
           [Escherichia coli O157:H7 EDL933]
 ref|NP_309659.1| (NC_002695) portal protein [Escherichia coli O157:H7]
 ref|NP_310204.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
 gb|AAG55982.1|AE005330_14 (AE005330) putative capsid structural protein of prophage CP-933X
           [Escherichia coli O157:H7 EDL933]
 dbj|BAB35055.1| (AP002555) portal protein [Escherichia coli O157:H7]
 dbj|BAB35600.1| (AP002557) putative portal protein [Escherichia coli O157:H7]
          Length = 533

 Score = 80.9 bits (198), Expect = 4e-014
 Identities = 104/489 (21%), Positives = 190/489 (38%), Gaps = 30/489 (6%)

Query: 1   MPVVADPSGPVMSVENTSRFNGRGDLAHAGYRYTRHDGEKYEGGLGPIEILYTDYWSQRR 60
           +P +  P G + S+   + ++G G       R     GE  +  L P      ++     
Sbjct: 6   IPTLLGPDG-MTSLREYAGYHGGGSGFGGQLRAWNPPGESVDAALLP------NFTRGNA 58

Query: 61  RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
           R++   + N YA   I+    ++V +   L+  P+   L I E+ A  ++  V   ++ +
Sbjct: 59  RADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEF 118

Query: 121 GELPEVC-DFKKLYTFGQWQRVAKAEALIAGDVLVV---EHHNSATGLPTYELIDGDMVQ 176
            E    C D ++  TF    R   A     G++ V    +   S      + ++    + 
Sbjct: 119 AEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTRPSRLFRTQFRMVSPKRIS 178

Query: 177 TPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKA-----TGANGRRIAWL 231
            P    N    S     GV++   G  + ++V ++G+     +          G R +++
Sbjct: 179 NP----NNTSDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFI 234

Query: 232 YYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTK 291
           +        + RG  +   VM+ +  +D +++   + A + A     I+ ++  ++A   
Sbjct: 235 HVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSA--M 292

Query: 292 PFSSGAQRRVQDTDFVGDGTERRTNIAETPF---GMMVEGLAEGEEIKAFTNDTTDEKFG 348
            F  GA  + Q     G   E     A  P    G  V  L  G+ +   T   TD  + 
Sbjct: 293 DFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYS 352

Query: 349 EFEEAILRAVAWALGMPYSVLAMQF-DSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVY 407
            FE+++LR +A  LG+ Y  L+  +   SYS +R    E  A     R   A      ++
Sbjct: 353 VFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMF 412

Query: 408 RSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQI 467
             WL   ++ R +  P   + AR   + A  +AW    W G  + A+D  KE +     I
Sbjct: 413 LCWLEEAIVRRVVTLP---SKARFSFQEAR-SAWGNCDWIGSGRMAIDGLKEVQEAVMLI 468

Query: 468 KLGALTRTK 476
           + G  T  K
Sbjct: 469 EAGLSTYEK 477
>ref|NP_046899.1| (NC_001901) gp4 [Bacteriophage N15]
 pir||T13090 probable minor capsid protein gp4 - phage N15
 gb|AAC19040.1| (AF064539) gp4 [Bacteriophage N15]
          Length = 530

 Score = 80.5 bits (197), Expect = 5e-014
 Identities = 98/442 (22%), Positives = 180/442 (40%), Gaps = 29/442 (6%)

Query: 51  LYTDYWSQRRRSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWA 110
           L  +Y     R++   + N YA   ++    ++V +   L+  P+   L I+E+ +  ++
Sbjct: 46  LLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFS 105

Query: 111 ENVNDLFEAWGELPE--VC--DFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNS-ATGL- 164
            +V     AW E  E   C  D ++  TF    R   A     G++ V    +S +T L 
Sbjct: 106 RDVE---AAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTRLF 162

Query: 165 -PTYELIDGDMVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGF-----KHKRI 218
              ++++    V  P    N IG +     GV++   G  + ++V  +G+     ++   
Sbjct: 163 RTQFKMVSPKRVSNP----NNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTY 218

Query: 219 KATGANGRRIAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGF 278
                 G R ++++        + RG      VM+ +  +D +++   + A + A     
Sbjct: 219 IPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAAT 278

Query: 279 IQRQIGGKTAGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPF---GMMVEGLAEGEEI 335
           I+ ++  ++A    F  GA  + Q +   G   E     +  P    G  V  L  G+ +
Sbjct: 279 IESELDTQSA--MDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSL 336

Query: 336 KAFTNDTTDEKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEFEAVIHEM 394
              +   TD  +  FE+++LR +A  LG+ Y  L+  +   SYS +R    E  A     
Sbjct: 337 NLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGR 396

Query: 395 RDRDADTLLRPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAV 454
           R   A      ++  WL   ++ R +  P   + AR   + A   AW  + W G  + A+
Sbjct: 397 RKFVASRQACQMFLCWLEEAIVRRVVTLP---SKARFSFQEAR-TAWGNANWIGSGRMAI 452

Query: 455 DLEKETRGRGEQIKLGALTRTK 476
           D  KE +     I+ G  T  K
Sbjct: 453 DGLKEVQEAVMLIEAGLSTYEK 474
>ref|NP_518978.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD14559.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 508

 Score = 79.3 bits (194), Expect = 1e-013
 Identities = 104/454 (22%), Positives = 168/454 (36%), Gaps = 62/454 (13%)

Query: 43  GGLGPIEILYTDYWSQRRRSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPID 102
           G  G +  L       R +S    +RN +A+  I   V N V TG          + P  
Sbjct: 41  GNPGAVAALLASGEDLRIKSRDLVRRNAWAQAGIEAFVANAVGTG----------IKPQS 90

Query: 103 EDAADTWAENVNDLFEAWGELPEVCDFKKLYTFGQWQRVAKAEALIAGDVLV-VEHHNSA 161
               D +  +V  L+  W    +       Y     Q +A    L  G+ L+ +      
Sbjct: 91  LSTDDAFKADVQALWRDWTAEADAAGQTDFYGL---QALACRAMLEGGECLIRLRPRRPE 147

Query: 162 TGLPT---YELIDGDMVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRI 218
            GL      +L++ + +  P      +   + + +G+E    G  VA+ + +      R+
Sbjct: 148 DGLTVPLQLQLLEAEHL--PMTLNVDLPSGNAVRSGIEFDGLGRRVAYHLYRSHPDDGRL 205

Query: 219 KATGANG---------RRIAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKA 269
                 G           I  LY     R  + RGEP L+  +  + E+D+  DA   + 
Sbjct: 206 APMSGQGGLDTVRVDASEIIHLYRVL--RPGQIRGEPWLSRALVKLNELDQYDDAELVRK 263

Query: 270 SLGAQIVGFIQRQIGGKTAGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPFGM-MVEG 328
              A   GF+ RQ                    + + +G+G      I+        ++ 
Sbjct: 264 KTAAMFAGFVTRQ------------------SPEDNLMGEGLPDEAGISLVGLEPGTLQI 305

Query: 329 LAEGEEIKAFTNDTTDEKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEF 387
           L  GE+IK          +GEF     RAVA ALG+ Y  L       +YS+ R  + EF
Sbjct: 306 LEPGEDIKFSDPADVGGSYGEFLRTQFRAVAAALGITYEQLTGDLTGVNYSSIRAGLLEF 365

Query: 388 ----EAVIHEMRDRDADTLLRPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTR 443
               E V H +       + RPV+ +W++  VL+  + APGF       R       W  
Sbjct: 366 RRRCEMVQHSVL---VHQMCRPVWAAWMKQAVLSGALVAPGFARGGAARRRQYLQVKWIP 422

Query: 444 SQWYGMVKEAVDLEKETRGRGEQIKLGALTRTKA 477
             W     + VD EKE +     I+ G ++R++A
Sbjct: 423 QGW-----QWVDPEKEFKAMLLAIRAGLMSRSEA 451
>sp|P36272|VG04_BPP21 PORTAL PROTEIN (HEAD PROTEIN GP4)
 pir||JN0538 head protein gp4 - phage 21
 gb|AAA32342.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
          Length = 530

 Score = 77.4 bits (189), Expect = 4e-013
 Identities = 101/468 (21%), Positives = 193/468 (40%), Gaps = 39/468 (8%)

Query: 61  RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
           R++   + N  A   +     ++V    +++  PN   L + E AA ++ + V     AW
Sbjct: 56  RADDLVRNNGIAANAVALHKDHIVGHMFLISYRPNWRWLGMRETAAKSFVDEVE---AAW 112

Query: 121 GELPE----VCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGL---PTYELIDGD 173
            E  E      D +   TF ++ R         G++ V    ++ T       ++ +   
Sbjct: 113 SEYAEGMFGEIDVEGKRTFTEFIREGVGVHAFNGEIFVQPVWDTETTQLFRTRFKAVSPK 172

Query: 174 MVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGF------KHKRIKATGANGRR 227
            V TP         +  +  GVE+ + G  VA+ + ++ F      + +RI      GR 
Sbjct: 173 RVDTPGHGMG----NRFLRAGVEVDRYGRAVAYHICEDDFPFSGSGRWERIPRELPTGRP 228

Query: 228 IAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKT 287
            A L+        + RG      VM+ +  +D ++    + A + A     I+ ++  + 
Sbjct: 229 -AMLHIFEPVEDGQTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESELDTEK 287

Query: 288 AGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPF----GMMVEGLAEGEEIKAFTNDTT 343
           A    + +GA +  +D   +    E+ ++  +T      G+ +  L  G+++K  T   +
Sbjct: 288 AFE--YIAGAPQEQKDNPLINI-LEKFSSWYDTNNVTLGGVKIPHLFPGDDLKLQTAQDS 344

Query: 344 DEKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEFEAVIHEMRDRDADTL 402
           D  F   E+A+LR +A  LG+ Y  L+  +   SYS++R    E        R   A  L
Sbjct: 345 DNGFSALEQALLRYIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARL 404

Query: 403 LRPVYRSWLRAMVLNRRIDAPGFLAAARNPREF-ATYAAWTRSQWYGMVKEAVDLEKETR 461
              ++  WL   +L       G +   R   +F    +AW+R++W G  + A+D  KE +
Sbjct: 405 ATQMFSCWLEEALLR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQ 458

Query: 462 GRGEQIKLGALTRTKAARMLTGTSFKTNMRKLAKEN--QMLADAMRPI 507
               +I+ G  T  K   ++ G  ++   R+  +E+  +  A   RP+
Sbjct: 459 ESVMRIEAGLSTYEKGLALM-GEDYQDIFRQQVRESAERQKAGLSRPV 505
>ref|NP_287794.1| (NC_002655) putative capsid protein of prophage CP-933R
           [Escherichia coli O157:H7 EDL933]
 ref|NP_309134.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
 ref|NP_310760.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
 gb|AAG56408.1|AE005368_14 (AE005368) putative capsid protein of prophage CP-933R [Escherichia
           coli O157:H7 EDL933]
 dbj|BAB34530.1| (AP002554) putative portal protein [Escherichia coli O157:H7]
 dbj|BAB36156.1| (AP002559) putative portal protein [Escherichia coli O157:H7]
          Length = 530

 Score = 75.1 bits (183), Expect = 2e-012
 Identities = 101/468 (21%), Positives = 192/468 (40%), Gaps = 39/468 (8%)

Query: 61  RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
           R++   + N  A   +     ++V    +++  PN   L + E AA ++ + V     AW
Sbjct: 56  RADDLVRNNGIAANAVALHKDHIVGHMFLISYRPNWRWLGMRETAAKSFVDEVE---AAW 112

Query: 121 GELPE----VCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGL---PTYELIDGD 173
            E  E      D +   TF ++ R         G++ V    ++ T       ++ +   
Sbjct: 113 SEYAEGMSGEIDVEGKRTFTEFIREGVGVHAFNGEIFVQPVWDTETTQLFRTRFKAVSPK 172

Query: 174 MVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGF------KHKRIKATGANGRR 227
            V TP         +  +  GVE+ + G  VA+ + ++ F      + +RI      GR 
Sbjct: 173 RVDTPGHGMG----NRFLRAGVEVDRYGRAVAYHICEDDFPRSGSGRWERIPRELPTGRP 228

Query: 228 IAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKT 287
            A L+        + RG      VM+ +  +D ++    + A + A     I+  +  + 
Sbjct: 229 -AMLHIFEPVEDGQTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESDLDTEK 287

Query: 288 AGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPF----GMMVEGLAEGEEIKAFTNDTT 343
           A    + +GA +  +D   +    E+ ++  +T      G+ +  L  G+++K  T   +
Sbjct: 288 AFE--YIAGAPQGQKDNPLINI-LEKFSSWYDTNNVTLGGVKIPHLFPGDDLKLQTAQDS 344

Query: 344 DEKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEFEAVIHEMRDRDADTL 402
           D  F   E+A+LR +A  LG+ Y  L+  +   SYS++R    E        R   A  L
Sbjct: 345 DNGFSALEQALLRYIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARL 404

Query: 403 LRPVYRSWLRAMVLNRRIDAPGFLAAARNPREF-ATYAAWTRSQWYGMVKEAVDLEKETR 461
              ++  WL   +L       G +   R   +F    +AW+R++W G  + A+D  KE +
Sbjct: 405 ATQMFSYWLEEALLR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQ 458

Query: 462 GRGEQIKLGALTRTKAARMLTGTSFKTNMRKLAKEN--QMLADAMRPI 507
               +I+ G  T  K   ++ G  ++   R+  +E+  +  A   RP+
Sbjct: 459 ESVMRIEAGLSTYEKEPALM-GEDYQDIFRQQVRESAERQKAGLSRPV 505
>ref|NP_461542.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp4
           of phage 21 [Salmonella typhimurium LT2]
 gb|AAL21501.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp4
           of phage 21 [Salmonella typhimurium LT2]
          Length = 526

 Score = 73.6 bits (179), Expect = 6e-012
 Identities = 100/466 (21%), Positives = 186/466 (39%), Gaps = 37/466 (7%)

Query: 61  RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
           R++   + N  A   +     ++V    +++  PN   L + E AA ++ + V     AW
Sbjct: 56  RADDLVRNNGIAANAVALHKDHIVGHLFLISYRPNWRYLGMRESAAKSFVDEVE---AAW 112

Query: 121 GELPE----VCDFKKLYTFGQWQRVAKAEALIAGDVL---VVEHHNSATGLPTYELIDGD 173
            E  +      D +   TF ++ R         G++    V +   +      ++ +   
Sbjct: 113 TEYCDGIFGEMDAEGKRTFTEFIREGVGVHAFNGEIFLQPVWDAETTQVFRTRFKAVSPK 172

Query: 174 MVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAF------WVVQEGFKHKRIKATGANGRR 227
            V TP  A+     + ++  GVE  ++G+ +A+      W V  G +  RI     +GR 
Sbjct: 173 RVDTPGYARG----NRQLRAGVETDRNGKALAYHVCDDDWPVAGGERWTRIPRFLPSGRP 228

Query: 228 IAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKT 287
            A L+        + RG      VM+ +  +D ++    + A + A     I+ ++  + 
Sbjct: 229 -AMLHIFEPVEDGQTRGANQFYSVMERLKMLDTLQATQLQSAIVKAMYAATIESELDSE- 286

Query: 288 AGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPF---GMMVEGLAEGEEIKAFTNDTTD 344
              K F        +DT  V          +       G+ +  L  G+E+   T   +D
Sbjct: 287 ---KAFEYITAADNKDTPLVNMLANYARYYSTNSIKLGGVKIPHLYPGDELNLQTAQDSD 343

Query: 345 EKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEFEAVIHEMRDRDADTLL 403
             F   E+A+LR +A  LG+ Y  L+  +   SYS++R    E        R   A  L 
Sbjct: 344 NGFSALEQALLRYIAAGLGVSYEQLSRDYSQVSYSSARASANESWRYFLGRRRFIAGRLA 403

Query: 404 RPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGR 463
             ++  WL   ++   I AP    +    R     ++W+RS+W G  + A+D  KE +  
Sbjct: 404 TQMFSCWLEEALIRGVIRAPRARFSFWEAR-----SSWSRSEWIGAGRMAIDGLKEVQES 458

Query: 464 GEQIKLGALTRTKAARMLTGTSFKTNMRKLAKENQ--MLADAMRPI 507
             +I+ G  T  K   ++ G  ++   R+  +E++    A   RP+
Sbjct: 459 VMRIEAGLSTYEKELAIM-GEDYQEIFRQQVRESEERRAAGLSRPV 503
>ref|NP_459888.1| (NC_003197) Fels-1 prophage [Salmonella typhimurium LT2]
 gb|AAL19847.1| (AE008738) Fels-1 prophage [Salmonella typhimurium LT2]
          Length = 511

 Score = 72.0 bits (175), Expect = 2e-011
 Identities = 66/262 (25%), Positives = 106/262 (40%), Gaps = 30/262 (11%)

Query: 237 KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTKPFSSG 296
           +R  +ARG  LL  V+  + ++ +  D+ +  A + A +  FI++Q  G      P    
Sbjct: 233 RRLNQARGASLLAPVIIRLMDLKEYEDSERIAARIAASLGMFIKKQDVGTDGYVAP---- 288

Query: 297 AQRRVQDTDFVGDGTERRTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 356
                          E+R      P GM+ +GL  GE+I    +D  +     F    LR
Sbjct: 289 ---------------EKRKETQIQP-GMLFDGLNPGEDIGMIKSDRPNAGLESFRMGQLR 332

Query: 357 AVAWALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVYRSWLRAMVL 416
           AVA  L   +S +A  +D +YSA R E+ E +     ++D       RP+YR WL A V 
Sbjct: 333 AVAAGLRGSFSSIARNYDGTYSAQRQELVEAQEGYSILQDSFIAAFTRPLYRRWLAAAVA 392

Query: 417 NRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQIKLGALTRTK 476
           +  I+ P               ++   + + G V   +D  KE       I+ GA T   
Sbjct: 393 SGAIEVPA----------GTDMSSLFNAVYSGPVMPWIDPLKEANAWRVLIRGGAATEGD 442

Query: 477 AARMLTGTSFKTNMRKLAKENQ 498
             R   G       R+ A+ ++
Sbjct: 443 WVRARGGAPADVKRRRKAETDE 464
>ref|NP_460007.1| (NC_003197) Gifsy-2 prophage [Salmonella typhimurium LT2]
 gb|AAL19966.1| (AE008744) Gifsy-2 prophage [Salmonella typhimurium LT2]
          Length = 437

 Score = 68.9 bits (167), Expect = 1e-010
 Identities = 69/266 (25%), Positives = 112/266 (41%), Gaps = 29/266 (10%)

Query: 237 KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTKPFSSG 296
           +R  +ARG  LL  V+  + ++ +  D+ +  A + A    FI+R               
Sbjct: 155 RRLNQARGVTLLAPVIIRLLDLKEYEDSERLAARISAAFAMFIRR--------------- 199

Query: 297 AQRRVQDTDFVGDGTERRTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 356
           +   VQD D   D  ++  ++   P G +++ L  GE+I    +D  +     F    LR
Sbjct: 200 SDAMVQDGD-APDYADKDRDLDIEP-GTILKDLLPGEDIGTIKSDRPNANLESFRMGQLR 257

Query: 357 AVAWALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVYRSWLRAMVL 416
           AVA  +   +S +A  +D +YSA R E+ E +     ++D     + RPVYR WL   + 
Sbjct: 258 AVAAGVRGSFSSIARNYDGTYSAQRQELVEAQEGYAILQDNFIAAVSRPVYRRWLATAIT 317

Query: 417 NRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQIKLGALTRTK 476
              ID P          + AT      + + G V   +D  KE       I+ GA T + 
Sbjct: 318 AGVIDVP-------TDTDMATL---FNAVYSGPVMPWIDPLKEANAWRILIRGGAATESD 367

Query: 477 AARMLTGTSFKTNMRKLAK--ENQML 500
             R   G   +   R+ A+  EN+ L
Sbjct: 368 WVRARGGAPAEVKRRRKAEIDENRKL 393
>emb|CAC83156.1| (AJ304858) putative portal protein [Escherichia coli]
          Length = 500

 Score = 65.1 bits (157), Expect = 2e-009
 Identities = 92/381 (24%), Positives = 151/381 (39%), Gaps = 51/381 (13%)

Query: 110 AENVNDLFEAWGELPEVCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGLPT--- 166
           AE   D+   W E     D    YT    +R+     L  G+V       +  GL     
Sbjct: 104 AELAADIRRLWAEWSVSPDVTGQYTRPVLERLLLRTWLRDGEVFAQMVSGAGNGLERTAG 163

Query: 167 ----YELIDGDMVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKATG 222
                E ++ D V    D      +S  ++ GV + + G    + V    +K+  +    
Sbjct: 164 VPFWLEAMEPDFVPMRTD------ESAGLNQGVFLDEWGRPKKYLV----YKNYPVSGRQ 213

Query: 223 ANGRRIA---WLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFI 279
           ++ + IA    ++    +R  + RG  +L+ V+  I+ + +  DA    A + A +  +I
Sbjct: 214 SDTKEIAAGKMIHLKFTRRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYI 273

Query: 280 QRQIGGKTAGTKPFSSGAQRRVQDTDFVG-DGTERRTNIAETPFGMMVEGLAEGEEIKAF 338
           ++  G                 QD +  G   TER  +I  TP G++ + L +GE+I   
Sbjct: 274 RKGDG-----------------QDYEDPGIKETEREVHI--TP-GIIYDDLRKGEDIGMV 313

Query: 339 TNDTTDEKFGEFEEAILRAVAWALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRD 398
            +D  +     F    LRAVA    + +S  A  ++ +YSA R E+ E       ++D  
Sbjct: 314 KSDRPNPNLETFRNGQLRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDCF 373

Query: 399 ADTLLRPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEK 458
              + RPVYR+WL  +V      A G L    +      Y A     + G V   +D  K
Sbjct: 374 IGAVTRPVYRTWLNMVV------AAGLLKIPADVEMKTLYNA----TYSGPVMPWIDPVK 423

Query: 459 ETRGRGEQIKLGALTRTKAAR 479
           E      QI+ GA T +   R
Sbjct: 424 EAEAWRIQIRGGAATESDWVR 444
>ref|NP_288458.1| (NC_002655) putative head-tail preconnector protein of prophage
           CP-933U [Escherichia coli O157:H7 EDL933]
 ref|NP_310988.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
 dbj|BAB19579.1| (AP000400) portal protein [Escherichia coli O157:H7]
 gb|AAG57012.1|AE005420_17 (AE005420) putative head-tail preconnector protein of prophage
           CP-933U [Escherichia coli O157:H7 EDL933]
 dbj|BAB36384.1| (AP002560) putative portal protein [Escherichia coli O157:H7]
          Length = 500

 Score = 65.1 bits (157), Expect = 2e-009
 Identities = 92/381 (24%), Positives = 151/381 (39%), Gaps = 51/381 (13%)

Query: 110 AENVNDLFEAWGELPEVCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGLPT--- 166
           AE   D+   W E     D    YT    +R+     L  G+V       +  GL     
Sbjct: 104 AELAADIRRLWAEWSVSPDVTGQYTRPVLERLLLRTWLRDGEVFAQMVSGAGNGLERTAG 163

Query: 167 ----YELIDGDMVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKATG 222
                E ++ D V    D      +S  ++ GV + + G    + V    +K+  +    
Sbjct: 164 VPFWLEAMEPDFVPMRTD------ESAGLNQGVFLDEWGRPKKYLV----YKNYPVSGRQ 213

Query: 223 ANGRRIA---WLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFI 279
           ++ + IA    ++    +R  + RG  +L+ V+  I+ + +  DA    A + A +  +I
Sbjct: 214 SDTKEIAAGKMIHLKFTRRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYI 273

Query: 280 QRQIGGKTAGTKPFSSGAQRRVQDTDFVG-DGTERRTNIAETPFGMMVEGLAEGEEIKAF 338
           ++  G                 QD +  G   TER  +I  TP G++ + L +GE+I   
Sbjct: 274 RKGDG-----------------QDYEDPGIKETEREVHI--TP-GIIYDDLRKGEDIGMV 313

Query: 339 TNDTTDEKFGEFEEAILRAVAWALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRD 398
            +D  +     F    LRAVA    + +S  A  ++ +YSA R E+ E       ++D  
Sbjct: 314 KSDRPNPNLETFRNGQLRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDCF 373

Query: 399 ADTLLRPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEK 458
              + RPVYR+WL  +V      A G L    +      Y A     + G V   +D  K
Sbjct: 374 IGAVTRPVYRTWLNMVV------AAGLLKIPADVEMKTLYNA----TYSGPVMPWIDPVK 423

Query: 459 ETRGRGEQIKLGALTRTKAAR 479
           E      QI+ GA T +   R
Sbjct: 424 EAEAWRIQIRGGAATESDWVR 444
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.317 0.134 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 355,839,259 Number of Sequences: 1028106 Number of extensions: 15123247 Number of successful extensions: 32458 Number of sequences better than 5.0e-02: 15 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 9 Number of HSP's that attempted gapping in prelim test: 32431 Number of HSP's gapped (non-prelim): 17 length of query: 545 length of database: 324,535,921 effective HSP length: 126 effective length of query: 419 effective length of database: 194,994,565 effective search space: 81702722735 effective search space used: 81702722735 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 94 (40.8 bits) Query= orf6 [1616-2012] (395 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAB65791.1|  (AF009900) Z2Z3 protein [Vibrio shilonii]          94  4e-018
dbj|BAA89626.1|  (AB036665) similar to GPC of lambda [bacter...    65  2e-009
gb|AAC24140.1|  (AF071201) unknown [bacteriophage Felix 01]        60  6e-008
dbj|BAA89643.1|  (AB036666) similar to minor capsid protein ...    59  1e-007
ref|NP_536361.1|  (NC_003309) gp5 [Bacteriophage phiE125] >g...    56  8e-007
ref|NP_518979.1|  (NC_003295) PROBABLE BACTERIOPHAGE-RELATED...    53  5e-006
pir||T03571  probable peptidase - Rhodobacter capsulatus >gi...    52  1e-005
ref|NP_208226.1|  (NC_000915) protease IV (PspA) [Helicobact...    51  2e-005
ref|NP_224046.1|  (NC_000921) putative PROTEASE IV [Helicoba...    50  4e-005
ref|NP_540710.1|  (NC_003317) PUTATIVE PROTEASE IV [Brucella...    50  6e-005
ref|NP_470963.1|  (NC_003212) similar to proteases [Listeria...    49  1e-004
ref|NP_461541.1|  (NC_003197) Gifsy-1 prophage: similar to h...    48  2e-004
sp|P36273|VG05_BPP21  HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CO...    45  0.001
ref|NP_287793.1|  (NC_002655) putative capsid assembly prote...    44  0.003
ref|NP_275945.1|  (NC_000916) protease IV [Methanothermobact...    44  0.004
>gb|AAB65791.1| (AF009900) Z2Z3 protein [Vibrio shilonii]
          Length = 330

 Score = 93.6 bits (231), Expect = 4e-018
 Identities = 67/214 (31%), Positives = 106/214 (49%), Gaps = 8/214 (3%)

Query: 54  ARINMQGVMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGA 113
           A I++ GV++   + F  + GG +     L +  +AA N+ S+K +   +DSPGGEA G 
Sbjct: 40  AIIHVNGVISRYANLFHAVCGGVSTEV--LAKEFNAALNESSVKAVILNVDSPGGEASGI 97

Query: 114 IKLGDKIR--NAKKPXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLS-MRRPS 170
            +L + I     KKP                    + V     +RVG+IG V+S ++RP 
Sbjct: 98  HELSEMIHASRGKKPVRAYVGGDGCSAAYWITTACDRVTMDATARVGSIGTVVSFVKRPD 157

Query: 171 T--SVYVDVTSTDAPNKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSDVNQN 228
              +   +  S+ +PNKR DPESE+G+  I+ Q LD +  +F   VA   G T   V  +
Sbjct: 158 AEGAKRFEFVSSQSPNKRLDPESEQGQTAIQTQ-LDAMAEVFISRVARNMGVTSDKVKSD 216

Query: 229 FGRGGSLFAETALKMGMIDKILTAETESATSTGG 262
           FG+GG    + A+  GM  ++ + E   A+ + G
Sbjct: 217 FGQGGVKIGQQAVDAGMAHELGSLEALIASLSNG 250
>dbj|BAA89626.1| (AB036665) similar to GPC of lambda [bacteriophage WO]
          Length = 353

 Score = 64.7 bits (156), Expect = 2e-009
 Identities = 58/209 (27%), Positives = 90/209 (42%), Gaps = 15/209 (7%)

Query: 53  TARINMQGVMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEG 112
           TA I + G++T  P  F    G  +  Y  + E I+ A ++  I+ I   IDSPGGE  G
Sbjct: 47  TAVIAINGILTKKPGAFDDFLGMTS--YEKIQEEIEEALSNKDIETILLDIDSPGGEVNG 104

Query: 113 AIKLGDKI--RNAKKPXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLS---MR 167
              L D I    AKK                     E V  SR S VG+IG + S     
Sbjct: 105 IFDLADFIYESRAKKRIIAIANDDAYSAAYAIASSAEKVFVSRTSGVGSIGVIASHIDQS 164

Query: 168 RPSTSVYVDVTSTDAPNKRPD-----PESEEGRRVIREQSLDPLHNMFAQAVADGRGTTL 222
           R      +  T+  A +++ D     P + E    ++++ +  L+ MF Q +A  RG ++
Sbjct: 165 RFDEKQGIKYTTIFAGSRKNDLNPHEPMTSESLESLQKE-VGRLYEMFLQLIARNRGLSI 223

Query: 223 SDVNQNFGRGGSLFAETALKMGMIDKILT 251
             +       G  F E A+++G+ D + T
Sbjct: 224 EKIRST--EAGLYFGEKAVEIGLADGVTT 250
>gb|AAC24140.1| (AF071201) unknown [bacteriophage Felix 01]
          Length = 442

 Score = 59.7 bits (143), Expect = 6e-008
 Identities = 72/290 (24%), Positives = 120/290 (40%), Gaps = 26/290 (8%)

Query: 39  PTANADRVMTVAGDTARINMQGVMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKR 98
           P   A  ++ V  D A I + G +T    F   +  GG   Y  L    D A  D+SIK 
Sbjct: 48  PKGEARSLLKVKDDIAIIPIMGGLTHRMTFIDAMCTGGLSSYEGLRRGFDEALADESIKT 107

Query: 99  IEYAIDSPGGEAEGAIKLGDKI---RNAKKPXXXXXXXXXXXXXXXXXXXDEIVAASRVS 155
           I   IDS GGEA G  +L   I   R  KK                    +EI+ AS  +
Sbjct: 108 IVLHIDSGGGEASGCFELARHIMASRGQKKIIAYVDEFACSAAYALASSAEEII-ASPDA 166

Query: 156 RVGNIGAV-----LSMRRPSTSVYVDVTSTDAPNKRPDP---ESEEGRRVIREQSLDPLH 207
            VG+IG +     L+       V ++V           P    SEE +  ++++ ++  +
Sbjct: 167 DVGSIGVIMVHQELTKAFEKNGVTINVIKAGEFKGMGSPFQALSEESKERLQKR-INDTY 225

Query: 208 NMFAQAVADGRGTTLSDVNQNFGRGGSLFAETALKMGMIDKILTAETESATSTGGAEA-- 265
             F   VA+ R   LS+            A+ AL++G+I+ I++ +       G  EA  
Sbjct: 226 ATFTGFVAESR--NLSEEAVKNTEANVYSAQEALELGLINSIMSQDDFLNYLQGSEEAPV 283

Query: 266 -----NAQDDTTGVQTMDLEKLKSEHPAVYAQAIEEGKQIGLTEERNRVA 310
                N+ ++ T  +  +LE L+ +     AQ   + ++  L++  N+++
Sbjct: 284 SLNVNNSGEEMTEQEKQELEALRLQ----VAQMKAKEQEAALSDLTNKIS 329
>dbj|BAA89643.1| (AB036666) similar to minor capsid protein GPC of phage lambda
           [Wolbachia sp. wKue]
          Length = 350

 Score = 58.9 bits (141), Expect = 1e-007
 Identities = 54/212 (25%), Positives = 88/212 (41%), Gaps = 21/212 (9%)

Query: 53  TARINMQGVMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEG 112
           TA I + G++T  P+ F    G  +  Y  + E I+ A  D+ ++ I   IDSPGGE  G
Sbjct: 49  TAIIPIHGILTKKPEAFDDFLGMTS--YEKMQEEIEEALEDEKVEIILLDIDSPGGEVNG 106

Query: 113 AIKLGDKIRNA--KKPXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRRPS 170
              L D I NA  KK                     E +  +R S VG+IG + S    S
Sbjct: 107 IFDLADFIYNARSKKRIIAIANDDAYSAAYAIASSAEKIFLTRTSGVGSIGVIASHIDQS 166

Query: 171 ----------TSVYVDVTSTDA-PNKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRG 219
                     ++++      D  P++    ES E  +      ++ L+ M  + +A  R 
Sbjct: 167 GFDEKQGIKYSTIFAGSRKNDLNPHEPITSESLENLK----SEVNRLYGMLVELIARNRS 222

Query: 220 TTLSDVNQNFGRGGSLFAETALKMGMIDKILT 251
            ++  +       G  F E A+++G+ D + T
Sbjct: 223 LSIEAIKST--EAGLYFGEKAVEIGLADGVTT 252
>ref|NP_536361.1| (NC_003309) gp5 [Bacteriophage phiE125]
 gb|AAL40278.1|AF447491_5 (AF447491) gp5 [Bacteriophage phiE125]
          Length = 301

 Score = 55.8 bits (133), Expect = 8e-007
 Identities = 52/186 (27%), Positives = 78/186 (40%), Gaps = 17/186 (9%)

Query: 80  YCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK--KPXXXXXXXXXXX 137
           Y  L  A++ A  D +++ I   IDS GG A GA +L D IR A   KP           
Sbjct: 82  YEGLRSAVNQAVADPAVEHIVLDIDSNGGSATGAFELADDIRAASLVKPITAIVNFSAFS 141

Query: 138 XXXXXXXXDEIVAASRVSRVGNIGAV---LSMRRPSTSVYVDVTSTDAPNKRPDPESEEG 194
                      V  SR S VG+IG +   L + +      + VTS  A + + D    E 
Sbjct: 142 GGYLIAAAASKVIVSRTSGVGSIGVIANHLDVSKRDEQQGIKVTSVFAGDHKNDLTPHE- 200

Query: 195 RRVIREQSLDPLHNM-------FAQAVADGRGTTLSDVNQNFGRGGSLFAETALKMGMID 247
              + +QSL  L +M       F  A+A+ RG +   V     + G  F +  ++ G+ D
Sbjct: 201 --PLSDQSLTFLTSMVQNSYKQFVDAIANFRGLSTQAVKDT--QAGIFFGQKGVEAGLAD 256

Query: 248 KILTAE 253
            + T +
Sbjct: 257 SVETPQ 262
>ref|NP_518979.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD14560.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 416

 Score = 53.1 bits (126), Expect = 5e-006
 Identities = 61/252 (24%), Positives = 93/252 (36%), Gaps = 25/252 (9%)

Query: 76  GNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK--KPXXXXXXX 133
           G   Y  + E +DAA  D  I  I   +DSPGGE+ G   L D+IR A   KP       
Sbjct: 80  GLASYTAIGEQLDAALADPGIAAILLDVDSPGGESGGVFDLADRIRVAAAIKPVWAAAND 139

Query: 134 XXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRRPS----------TSVYVDVTSTDAP 183
                          V  SR   VG+IG +      S          T+V+      D  
Sbjct: 140 MAFSAAYALACAASRVFVSRTGGVGSIGVIAMHVDQSVKDAKDGVRYTAVFAGARKNDLN 199

Query: 184 NKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSDVNQNFGRGGSLFAETALKM 243
              P  +  + +    +  +  ++ +F   V   RG +   V       G  F + A+  
Sbjct: 200 PHEPITDEAQAQ---LQAEVSRIYRLFVATVGSYRGLSAEAVTAT--EAGLFFGQDAVAA 254

Query: 244 GMIDKILT-----AETESATSTGGAEANAQDDTTGVQTMDLEKLKSEHPAVYAQAIEEGK 298
           G+ D + T     A+  ++ S     A+A+  +  +Q MD        PA  A ++    
Sbjct: 255 GLADAVGTFDDALAQLTASLSVPAPVASARTPSLNLQ-MDCSMNTQPDPA--AVSVPAAD 311

Query: 299 QIGLTEERNRVA 310
            IG T +   VA
Sbjct: 312 AIGTTAQPPVVA 323
>pir||T03571 probable peptidase - Rhodobacter capsulatus
 gb|AAC16224.1| (AF010496) potential peptidase [Rhodobacter capsulatus]
          Length = 303

 Score = 52.0 bits (123), Expect = 1e-005
 Identities = 54/191 (28%), Positives = 79/191 (41%), Gaps = 16/191 (8%)

Query: 76  GNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRN--AKKPXXXXXXX 133
           G   Y  + EA      DD++  +    D+PGG   G       +R+  A KP       
Sbjct: 77  GWATYRGIAEACADLAADDAVSAVVVEFDTPGGYTLGIEDAVTALRDLAAVKPVHALAAP 136

Query: 134 XXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRR---PSTSVYV---DVTSTDAPNKRP 187
                          +  +  + VG+IG  ++      P  +  V   D TS  A  K P
Sbjct: 137 LAASAGYWLASQARELVMTPRAAVGSIGVAVTAAANVAPGAASGVQLFDFTSRHARAKWP 196

Query: 188 DPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSD------VNQNFGRGGSLF-AETA 240
           DP +E+G+  IR + LD     F  AVA+GRG    D      V+ +   GG++F A  A
Sbjct: 197 DPATEDGKAEIR-RGLDRTEARFHAAVAEGRGIAPEDLAARLSVSDDPEDGGAVFEAAEA 255

Query: 241 LKMGMIDKILT 251
           L  G+ D+I T
Sbjct: 256 LARGLADRIET 266
>ref|NP_208226.1| (NC_000915) protease IV (PspA) [Helicobacter pylori 26695]
 pir||C64699 proteinase IV (EC 3.4.-.-) - Helicobacter pylori (strain 26695)
 gb|AAD08477.1| (AE000644) protease IV (PspA) [Helicobacter pylori 26695]
          Length = 292

 Score = 51.2 bits (121), Expect = 2e-005
 Identities = 52/198 (26%), Positives = 76/198 (38%), Gaps = 11/198 (5%)

Query: 64  STPDFFAMLFGGGNCLYCDLFEA-IDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRN 122
           S P   A L+  G     + F+  +D      SIK +   IDSPGG    +++L +KI +
Sbjct: 45  SAPPNLAKLYLNGAIFSTEDFDKEVDKILKTPSIKGVLLLIDSPGGAVSASVELSEKIAD 104

Query: 123 AKK--PXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRRPSTSVYVDVTST 180
            K+  P                      V AS+ S +G+IG + S       +     +T
Sbjct: 105 LKQKMPVLAYARGVMASGSYYAGMQASEVYASKASLIGSIGVIFSGANVENLLNKVGVAT 164

Query: 181 DAPNKRPDPESEEGRRVIREQSLDPLHN-------MFAQAVADGRGTTLSDVNQNFGRGG 233
              +     E     R  +    D L N       MF   VA  R     D  ++F  G 
Sbjct: 165 QGVHAGEYKEIGTFTRAWKPNEKDFLQNLVNEQYQMFVNDVAKARKLDAKDY-KDFAEGK 223

Query: 234 SLFAETALKMGMIDKILT 251
              A+ ALK+ +IDKI T
Sbjct: 224 VFSAQKALKLKLIDKIST 241
>ref|NP_224046.1| (NC_000921) putative PROTEASE IV [Helicobacter pylori J99]
 pir||F71819 probable proteinase IV - Helicobacter pylori (strain J99)
 gb|AAD06917.1| (AE001556) putative PROTEASE IV [Helicobacter pylori J99]
          Length = 292

 Score = 50.4 bits (119), Expect = 4e-005
 Identities = 51/198 (25%), Positives = 76/198 (37%), Gaps = 11/198 (5%)

Query: 64  STPDFFAMLFGGGNCLYCDLFEA-IDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRN 122
           S P   A L+  G     + F+  +D      SIK +   IDSPGG    +++L +KI +
Sbjct: 45  SEPPNLAKLYLNGAIFSTEDFDKEVDKILKTPSIKGVLLLIDSPGGAVSASVELSEKIAD 104

Query: 123 AKK--PXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRRPSTSVYVDVTST 180
            K+  P                      V AS+ S +G+IG + S       +     +T
Sbjct: 105 LKQKMPVLAYARGVMASGSYYAGMQASEVYASKASLIGSIGVIFSSANVENLLNKVGVAT 164

Query: 181 DAPNKRPDPESEEGRRVIREQSLDPLHN-------MFAQAVADGRGTTLSDVNQNFGRGG 233
              +     E     R  +    + L N       MF   VA  R     D  ++F  G 
Sbjct: 165 QGVHAGEYKEIGTFTRAWKPNEKEFLQNLVNEQYQMFVDDVAKARKLNAKDY-KDFAEGK 223

Query: 234 SLFAETALKMGMIDKILT 251
              A+ ALK+ +IDKI T
Sbjct: 224 VFSAQKALKLKLIDKIST 241
>ref|NP_540710.1| (NC_003317) PUTATIVE PROTEASE IV [Brucella melitensis]
 gb|AAL52974.1| (AE009612) PUTATIVE PROTEASE IV [Brucella melitensis]
          Length = 301

 Score = 49.7 bits (117), Expect = 6e-005
 Identities = 38/150 (25%), Positives = 62/150 (41%), Gaps = 11/150 (7%)

Query: 82  DLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRN--AKKPXXXXXXXXXXXXX 139
           +L + +D    DD++K +   +DSPGG   G   + D +R    KKP             
Sbjct: 39  ELLKRLDKIAGDDAVKGVILLLDSPGGTTVGGEAIYDAVRKIATKKPVVTQVGTLAASAG 98

Query: 140 XXXXXXDEIVAASRVSRVGNIGAV-----LSMRRPSTSVYVDVTSTDAPNKRPD---PES 191
                  + + A + S VG+IG +     LS    +  V V+   +      P+   P S
Sbjct: 99  YMIASASDHIVARQTSIVGSIGVLFQYPDLSKLLDTLGVKVETIKSSPLKAEPNYFSPAS 158

Query: 192 EEGRRVIREQSLDPLHNMFAQAVADGRGTT 221
           EE + +IR   +D  ++ F   V + R  T
Sbjct: 159 EEAKNMIRSMIMDS-YDWFVGIVQERRAFT 187
>ref|NP_470963.1| (NC_003212) similar to proteases [Listeria innocua]
 emb|CAC96858.1| (AL596169) similar to proteases [Listeria innocua]
          Length = 337

 Score = 48.5 bits (114), Expect = 1e-004
 Identities = 59/262 (22%), Positives = 103/262 (38%), Gaps = 32/262 (12%)

Query: 10  LQLMADAPDLTQEQVDKCMASSPLFLGNGPTANADRVMTV---AGDT--ARINMQGVMTS 64
           L +++     T  Q+     SSP F+ +      +   TV    GD   A +++ G +  
Sbjct: 15  LLIVSALAKFTSSQIASTEESSPTFVESLFADTGELTETVIEEGGDDTIAVLSVDGTIQD 74

Query: 65  TPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK 124
           T D    LFG     +    + ++   NDD I+ +   ++SPGG    + ++ DKI   +
Sbjct: 75  TGDS-GSLFGDAGYDHSFFMQQLEQVRNDDYIQGVLLYVNSPGGGVMESAQIRDKILQIQ 133

Query: 125 K----PXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLS-------MRRPSTS- 172
           K    P                    + + AS+ +  G++G ++        M++   S 
Sbjct: 134 KERNIPFYVSMGSMAASGGYYISAPADKIFASKETLTGSLGVIMQGYDYSELMKKLGVSD 193

Query: 173 ------VYVDVTSTDAPNKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSDVN 226
                  Y D+ S      RP  E E   + I +  +D  +N F + VA GRG +   V 
Sbjct: 194 NTIKSGEYKDIMS----GTRPMTEDE---KKIMQSMIDDSYNEFVKVVAKGRGMSAEKV- 245

Query: 227 QNFGRGGSLFAETALKMGMIDK 248
           +    G       A + G+ID+
Sbjct: 246 RKIADGRIYDGRQAKENGLIDE 267
>ref|NP_461541.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp5
           of phage 21 [Salmonella typhimurium LT2]
 gb|AAL21500.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp5
           of phage 21 [Salmonella typhimurium LT2]
          Length = 498

 Score = 47.8 bits (112), Expect = 2e-004
 Identities = 48/203 (23%), Positives = 79/203 (38%), Gaps = 17/203 (8%)

Query: 87  IDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKI--RNAKKPXXXXXXXXXXXXXXXXXX 144
           +  A +D  +K I   IDSPGG+A GA    D I     +KP                  
Sbjct: 110 LQQAVSDPEVKGILLDIDSPGGQAAGAFDCADMIYRMREQKPVWALANETACSAAMLLAA 169

Query: 145 XDEIVAASRVSRVGNIGAVLS-------MRRPSTSVYVDVTSTDAPNKRPDPESEEGRRV 197
                  ++ SR+G+IG V++       +++    + +  +     +  P  +  E    
Sbjct: 170 ACSHRLVTQTSRMGSIGVVMAHTSYAEKLKQEGIDITLIYSGAHKADLTPSQKLPESVYA 229

Query: 198 IREQSLDPLHNMFAQAVADGRGTTLSDV--NQNFGRGGSLFAETALKMGMIDKI----LT 251
             +Q +D    MFA+ VA   G ++  V   +     G     T L  GM++      + 
Sbjct: 230 DYQQRMDEARKMFAEKVARYTGLSVDAVMATEAAVYDGQAIITTGLADGMVNAADAIGVM 289

Query: 252 AETESATSTGGA--EANAQDDTT 272
           AE  ++  TGG   E +A D  T
Sbjct: 290 AEAINSNKTGGTMPELSAADAVT 312
>sp|P36273|VG05_BPP21 HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CONTAINS: SCAFFOLD PROTEIN
           (HEAD PROTEIN GP6)]
 pir||JN0539 head protein gp5 - phage 21
 gb|AAA32343.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
          Length = 501

 Score = 45.1 bits (105), Expect = 0.001
 Identities = 61/271 (22%), Positives = 101/271 (36%), Gaps = 25/271 (9%)

Query: 11  QLMADAPDLTQEQVDKCMASSPLFLGNGPTANADRVMTVAGDTARINMQGVMTSTPDFFA 70
           Q+  DAP +  E  D+ MA      G  P     RV  V    A + + G +        
Sbjct: 46  QVQFDAPGMLAE-TDEYMAG-----GKRPA----RVYRVVNGIAVLPVTGTLVHR--LGG 93

Query: 71  MLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK--KPXX 128
           M    G   Y  +   +  A  D  ++ +   IDSPGG+A GA    D I   +  KP  
Sbjct: 94  MRPFSGMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVW 153

Query: 129 XXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVL---SMRRPSTSVYVDVTSTDAPNK 185
                                  ++ SR+G+IG ++   S         VD+T   +   
Sbjct: 154 ALCNDTACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAH 213

Query: 186 RPDPESEEGR----RVIREQSLDPLHNMFAQAVADGRGTTLSDVNQNFGRGGSLF-AETA 240
           + D    E      R   +Q +D    MFA+ VA   G ++  V    G   ++F  ++ 
Sbjct: 214 KVDGNQFEALPAEVRQDMQQRIDAARRMFAEKVAMYTGLSVDAVT---GTEAAVFEGQSG 270

Query: 241 LKMGMIDKILTAETESATSTGGAEANAQDDT 271
           ++ G+ D+++ A    +       +N +  T
Sbjct: 271 IEAGLADELINASDAISVMATALNSNVRGGT 301
>ref|NP_287793.1| (NC_002655) putative capsid assembly protein of prophage CP-933R
           [Escherichia coli O157:H7 EDL933]
 ref|NP_309135.1| (NC_002695) putative head-tail preconnector protein [Escherichia
           coli O157:H7]
 ref|NP_310759.1| (NC_002695) putative head-tail preconnector protein [Escherichia
           coli O157:H7]
 gb|AAG56407.1|AE005368_13 (AE005368) putative capsid assembly protein of prophage CP-933R
           [Escherichia coli O157:H7 EDL933]
 dbj|BAB34531.1| (AP002554) putative head-tail preconnector protein [Escherichia
           coli O157:H7]
 dbj|BAB36155.1| (AP002559) putative head-tail preconnector protein [Escherichia
           coli O157:H7]
          Length = 501

 Score = 44.3 bits (103), Expect = 0.003
 Identities = 61/271 (22%), Positives = 100/271 (36%), Gaps = 25/271 (9%)

Query: 11  QLMADAPDLTQEQVDKCMASSPLFLGNGPTANADRVMTVAGDTARINMQGVMTSTPDFFA 70
           Q+  DAP +  E  D+ MA      G  P     RV  V    A + + G +        
Sbjct: 46  QVQLDAPGMLAE-TDEYMAG-----GKRPA----RVYRVVNGIAVLPVTGTLVHR--LGG 93

Query: 71  MLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK--KPXX 128
           M    G   Y  +   +  A  D  ++ +   IDSPGG+A GA    D I   +  KP  
Sbjct: 94  MRPFSGMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVW 153

Query: 129 XXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVL---SMRRPSTSVYVDVTSTDAPNK 185
                                  ++ SR+G+IG ++   S         VD+T   +   
Sbjct: 154 ALCNDTACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAH 213

Query: 186 RPDPESEEGR----RVIREQSLDPLHNMFAQAVADGRGTTLSDVNQNFGRGGSLF-AETA 240
           + D    E      R   +Q +D    MFA+ VA   G ++  V    G   ++F  ++ 
Sbjct: 214 KVDGNQFEALPAEVRQNMQQRIDAARRMFAEKVAMFTGLSVDAVT---GTEAAVFEGQSG 270

Query: 241 LKMGMIDKILTAETESATSTGGAEANAQDDT 271
           +  G+ D+++ A    +       +N +  T
Sbjct: 271 IDAGLADELVNASDAISVMATALNSNVRGGT 301
>ref|NP_275945.1| (NC_000916) protease IV [Methanothermobacter thermautotrophicus]
           [Methanothermobacter thermautotrophicus str. Delta H]
 pir||F69207 proteinase IV - Methanobacterium thermoautotrophicum (strain Delta
           H)
 gb|AAB85306.1| (AE000858) protease IV [Methanothermobacter thermautotrophicus str.
           Delta H]
          Length = 287

 Score = 43.5 bits (101), Expect = 0.004
 Identities = 25/85 (29%), Positives = 39/85 (45%)

Query: 82  DLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAKKPXXXXXXXXXXXXXXX 141
           D+ E I  A +D S+K I   I+SPGG    + +L D I  ++KP               
Sbjct: 59  DIKELIAEANSDPSVKAIVLDINSPGGTPVASEELMDAINKSEKPVVSWISDSGTSGAYL 118

Query: 142 XXXXDEIVAASRVSRVGNIGAVLSM 166
                + + AS  + VG+IG +L +
Sbjct: 119 AASASDRIVASPSAWVGSIGVILDL 143
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.312 0.129 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 221,328,797 Number of Sequences: 1028106 Number of extensions: 8485333 Number of successful extensions: 19869 Number of sequences better than 5.0e-02: 27 Number of HSP's better than 0.1 without gapping: 2 Number of HSP's successfully gapped in prelim test: 25 Number of HSP's that attempted gapping in prelim test: 19859 Number of HSP's gapped (non-prelim): 28 length of query: 395 length of database: 324,535,921 effective HSP length: 123 effective length of query: 272 effective length of database: 198,078,883 effective search space: 53877456176 effective search space used: 53877456176 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 92 (40.0 bits) Query= orf7 [2017-2153] (135 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_518980.1|  (NC_003295) PROBABLE BACTERIOPHAGE-RELATED...    41  0.003
pir||S34243  16.5K structural protein - actinophage VWB >gi|...    38  0.022
>ref|NP_518980.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
          solanacearum]
 emb|CAD14561.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
          solanacearum]
          Length = 125

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 19/42 (45%), Positives = 29/42 (68%)

Query: 32 AAAETWPAGAVLGRVTATGRYVRYNPAASDGSQVPSAILTEA 73
          AA ++   G VLG VTATG+  + +P+A+DGSQ  + +L +A
Sbjct: 30 AAGQSLVLGTVLGMVTATGKVKQLDPSATDGSQYAAGVLMQA 71
>pir||S34243 16.5K structural protein - actinophage VWB
 emb|CAA50984.1| (X72092) ORFp14 [Bacteriophage VWB]
          Length = 129

 Score = 37.7 bits (86), Expect = 0.022
 Identities = 20/50 (40%), Positives = 29/50 (58%)

Query: 39 AGAVLGRVTATGRYVRYNPAASDGSQVPSAILTEAVTQSAAGNITYSVAV 88
          +G  LG++TA+G+Y  YN AASDG++     L  A+   A   I  S A+
Sbjct: 50 SGIPLGKITASGKYGPYNNAASDGTETLVGFLYSALQAPAVNTIDPSGAM 99
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.133 0.000 Gapped Lambda K H 0.267 0.0410 9.75e+199 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 69,411,284 Number of Sequences: 1028106 Number of extensions: 2206067 Number of successful extensions: 4372 Number of sequences better than 5.0e-02: 2 Number of HSP's better than 0.1 without gapping: 2 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4370 Number of HSP's gapped (non-prelim): 2 length of query: 135 length of database: 324,535,921 effective HSP length: 111 effective length of query: 24 effective length of database: 210,416,155 effective search space: 5049987720 effective search space used: 5049987720 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 83 (36.6 bits) Query= orf8 [2157-2522] (364 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_056680.1|  (NC_002214) gp348 [Streptococcus thermophi...    43  0.005
pir||T13320  hypothetical protein 31 - Streptococcus phage p...    41  0.019
>ref|NP_056680.1| (NC_002214) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
 pir||T13625 hypothetical protein gp348 - Streptococcus phage phi-Sfi11
 gb|AAC34402.1| (AF158600) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
          Length = 348

 Score = 43.1 bits (100), Expect = 0.005
 Identities = 41/167 (24%), Positives = 68/167 (40%), Gaps = 16/167 (9%)

Query: 37  GKTVEIDVKRFSENVAVVITKLSGPNFNDASLISTKEFEPPEYGEAFATDVDDLLQRLIG 96
           G++V +    F  NV V           D       + + P + EA     +D  Q  + 
Sbjct: 51  GQSVALKAAAFDTNVTV----------RDRVSAEMHDEQMPFFKEAMLVKENDRQQLNL- 99

Query: 97  VNPYDDANIAYSSKLVGRLMDYFMEANDMIMRGVEIQASQILQTGRLNLLDRDGETAYEI 156
               D  N A  + +V  + +  +   +     +E    Q+L TG++     DG    +I
Sbjct: 100 --VKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTS-DGVNK-DI 155

Query: 157 DYAPKATHFPTVTTAWSDDGADPIDDLRALFEVIRADGKVNPDMIIM 203
           DY  K  H   V+ +W++ GA P+ DL    E  R  G +NP+  +M
Sbjct: 156 DYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELG-LNPERAVM 201
>pir||T13320 hypothetical protein 31 - Streptococcus phage phi-O1205
 gb|AAC79546.1| (U88974) ORF31 [Streptococcus thermophilus temperate bacteriophage
           O1205]
          Length = 348

 Score = 41.2 bits (95), Expect = 0.019
 Identities = 25/74 (33%), Positives = 38/74 (50%), Gaps = 3/74 (4%)

Query: 130 VEIQASQILQTGRLNLLDRDGETAYEIDYAPKATHFPTVTTAWSDDGADPIDDLRALFEV 189
           +E    Q+L TG++     DG    +IDY  K  H   V+ +W++ GA P+ DL    E 
Sbjct: 131 LEAMRMQVLATGKIAFTS-DGVNK-DIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIET 188

Query: 190 IRADGKVNPDMIIM 203
            R  G +NP+  +M
Sbjct: 189 ARELG-LNPERAVM 201
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.319 0.137 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 239,165,556 Number of Sequences: 1028106 Number of extensions: 10169464 Number of successful extensions: 26545 Number of sequences better than 5.0e-02: 2 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 26544 Number of HSP's gapped (non-prelim): 3 length of query: 364 length of database: 324,535,921 effective HSP length: 122 effective length of query: 242 effective length of database: 199,106,989 effective search space: 48183891338 effective search space used: 48183891338 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 92 (40.0 bits) Query= orf9 [2651-2682] (30 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.313 0.143 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 23,737,498 Number of Sequences: 1028106 Number of extensions: 438023 Number of successful extensions: 682 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 682 Number of HSP's gapped (non-prelim): 0 length of query: 30 length of database: 324,535,921 effective HSP length: 6 effective length of query: 24 effective length of database: 318,367,285 effective search space: 7640814840 effective search space used: 7640814840 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 85 (37.4 bits) Query= orf10 [2682-2811] (128 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.313 0.131 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 62,409,598 Number of Sequences: 1028106 Number of extensions: 2086195 Number of successful extensions: 3050 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3050 Number of HSP's gapped (non-prelim): 0 length of query: 128 length of database: 324,535,921 effective HSP length: 104 effective length of query: 24 effective length of database: 217,612,897 effective search space: 5222709528 effective search space used: 5222709528 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 84 (37.0 bits) Query= orf11 [2808-3038] (229 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.132 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 134,864,601 Number of Sequences: 1028106 Number of extensions: 5071371 Number of successful extensions: 10485 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 10485 Number of HSP's gapped (non-prelim): 0 length of query: 229 length of database: 324,535,921 effective HSP length: 117 effective length of query: 112 effective length of database: 204,247,519 effective search space: 22875722128 effective search space used: 22875722128 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 89 (38.9 bits) Query= orf12 [3042-3543] (500 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_050643.1|  (NC_000929) major tail subunit [Enterobact...   105  1e-021
ref|NP_404841.1|  (NC_003143) putative bacteriophage tail sh...   104  2e-021
ref|NP_284552.1|  (NC_003116) hypothetical protein [Neisseri...    97  3e-019
ref|NP_599043.1|  (NC_003444) tail sheath protein [Shigella ...    84  3e-015
ref|NP_313006.1|  (NC_002695) putative tail sheath protein [...    76  8e-013
ref|NP_543096.1|  (NC_003356) putative sheath protein [Bacte...    74  4e-012
ref|NP_274135.1|  (NC_003112) phage sheath protein [Neisseri...    74  5e-012
>ref|NP_050643.1| (NC_000929) major tail subunit [Enterobacteria phage Mu]
 sp|P79678|VPL_BPMU Tail sheath protein (gpL)
 dbj|BAA19195.1| (AB000833) sheath protein gpL [Enterobacteria phage Mu]
 gb|AAF01117.1|AF083977_36 (AF083977) L [Enterobacteria phage Mu]
          Length = 495

 Score =  105 bits (262), Expect = 1e-021
 Identities = 136/487 (27%), Positives = 208/487 (41%), Gaps = 37/487 (7%)

Query: 5   NAIDNSVRARVLGIKTEFRNFNTGRTFFLP-QHVALLGQ-GNTAATYELTPFRATSAAQV 62
           NAI + VR  +  I  EF N N       P Q V + GQ G+ A+     P R  S +Q 
Sbjct: 7   NAIPSDVRVPLTYI--EFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQA 64

Query: 63  GQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATV 122
              FG GS L L A   L  N    R   L   P G N  G AA G + LSGTA    ++
Sbjct: 65  SAAFGQGSMLALMADAFLNAN----RVAELWCIPQG-NGTGNAAVGEISLSGTAGENGSL 119

Query: 123 RVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTAAV------DGTNANEVDVT 176
              I  QR + V++  G T    A LLVA I+G P +P+TA V      D T+A+ V ++
Sbjct: 120 VTYIAGQRLA-VSVAAGATGAALADLLVARIKGQPDLPVTAEVRADSGDDDTHADVV-LS 177

Query: 177 AKWQG-LSGNDL---VVSIEGSIPGITVAITQPTGGAADPEVADTLALFGENVWYTQIVS 232
           AK+ G LS  D+     + E +  GI  A    +G   +P+++ ++A  G+ + Y  IV 
Sbjct: 178 AKFTGALSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGD-LQYKYIVM 236

Query: 233 CFNTANTDALDAFETFGEGRWDPILRRP---LAVFTGTNETDPNTLAAIGDARRAQRTNL 289
            +   +   L+   T  + RW P+ +     + V +GT       ++  G +R     + 
Sbjct: 237 PYT--DEPNLNLLRTELQERWGPVNQADGFAVTVLSGTY----GDISTFGVSRNDHLISC 290

Query: 290 VTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWTHTQRDLLV 349
           +  + GA                   + DPA     LTLPG  P     ++T ++R+ L+
Sbjct: 291 MG-IAGAPEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALL 349

Query: 350 KAGISTSIVRD-GVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLFNLDLIFNTRE 406
             GIST  V D G  +I   +T Y     G+ +P YL        S + ++L     T++
Sbjct: 350 FDGISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRI-TQK 408

Query: 407 WDGAPLIPD-DQPTTNARAKRPKDAVAVLSVLANNLGLDAIISDVPFTLENIRASINDQN 465
           +    L  D  +  T      P      L  L        ++ D     E +  + N  +
Sbjct: 409 FPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNKDD 468

Query: 466 PNRLDII 472
            +RLD++
Sbjct: 469 KDRLDVL 475
>ref|NP_404841.1| (NC_003143) putative bacteriophage tail sheath protein [Yersinia
           pestis]
 emb|CAC90076.1| (AJ414147) putative bacteriophage tail sheath protein [Yersinia
           pestis]
          Length = 502

 Score =  104 bits (260), Expect = 2e-021
 Identities = 103/371 (27%), Positives = 160/371 (42%), Gaps = 33/371 (8%)

Query: 21  EFRNFNTGRTFFLPQHVALLGQGNTAATYELT-PFRATSAAQVGQRFGFGSPLHLAALQL 79
           EF N +   T    Q   ++GQ    +T     P   +S A V    G GS LH      
Sbjct: 20  EFDN-SQANTASTTQRTLIIGQTRAGSTLPANIPLLVSSTATVAGLSGAGSMLHGQMAAY 78

Query: 80  LPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTG 139
           L ++  G     + + P+ D D+ VAA G + ++  AS T  + + IG  R     + T 
Sbjct: 79  LANDTAG----EIYLLPLSDADSMVAAIGKITINTAASETGVISLYIGGIRVQTTVVATD 134

Query: 140 TTAEQAAALLVAGIQGNPFMPMTAAVDGTNANE---VDVTAKWQGLSGN--DLVVSIEGS 194
           + +  AAAL  A I+  P +P+T    G   +E   V + AK +G  GN  DL ++  G+
Sbjct: 135 SVSTIAAAL-AAAIENQPELPVTVVHTGDMVSEGAVVVLEAKNKGAHGNNIDLRLNYLGN 193

Query: 195 IPG------ITVAITQPTGGAADPEVADTLALFGENVWYTQIVSCFNTANTDALDAFETF 248
             G      + + IT   GGA  PE+AD LA   +  +   I    +TA+ DA+  F + 
Sbjct: 194 AGGEITPESLVLMITPMAGGAGAPELADGLANLQDRTFDFIINPYTDTASLDAIKDFLSD 253

Query: 249 GEGRW---DPILRRPLAVFTGTNETDPNTLAAIGDARRAQRTNLVTPVPGAQNLPCEI-- 303
             GRW     +     A  +GT       L A G+ R  Q  +L+    G  N P     
Sbjct: 254 STGRWSYRQQLYGHSFAAQSGTY----GQLTAAGELRNDQHASLL----GIHNSPTPAHI 305

Query: 304 -XXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWTHTQRDLLVKAGIST-SIVRDG 361
                         NDP      L + G+     + ++T T+R+ L+ +GIST ++  D 
Sbjct: 306 WSAAYVGAIAQSLRNDPGRPLQTLAIRGVLAPPLSSRFTLTERNNLLHSGISTVTVADDS 365

Query: 362 VAEISDTVTTY 372
             ++ + +TTY
Sbjct: 366 TVQVENIITTY 376
>ref|NP_284552.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
 pir||E81810 hypothetical protein NMA1841 [imported] - Neisseria meningitidis
           (group A strain Z2491)
 emb|CAB71993.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
 emb|CAB85065.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
          Length = 475

 Score = 97.4 bits (241), Expect = 3e-019
 Identities = 105/369 (28%), Positives = 156/369 (41%), Gaps = 40/369 (10%)

Query: 19  KTEFRNFNTGRTFFLPQHVALLGQGNTAATYELTPFRAT-SAAQVGQRFGFGSPLHLAAL 77
           K   RN  T +     Q V ++ Q N  A  ELT      SAA    ++G GS  HL   
Sbjct: 27  KLAVRNLPTNK-----QRVLIVAQHNNPALGELTELENVFSAADAAAKYGAGSMAHLMVT 81

Query: 78  QLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIP 137
             +           L++  + DN AGVAA G + LSGTA+    +RV I +  +  + I 
Sbjct: 82  AAIK----AYAYADLSLITVADNKAGVAAGGKITLSGTANTQGVLRVSIANADTLTIGIG 137

Query: 138 TGTTAEQAAALLVAGIQGNPFMPMTAAVDGTNANEVDVTAKWQGLSGNDLVVSIEGSIPG 197
              TA   AA + A I   P +P+TA V       V +TAK +G +GN + +    +  G
Sbjct: 138 AEDTAATVAAAVKAAIDAVPDLPVTATVAEA---VVTLTAKNKGTAGNAIRIKTSNTAEG 194

Query: 198 ITVAITQPTGGAADPEVADTL-ALFGENVWYTQIVSCFNTANTDA------LDAFETFGE 250
           IT A+T  TGG A+P++A  L A+  E      I++C     T+       LD   +  E
Sbjct: 195 ITAAVTAMTGGDANPDIAAALNAVVAEG---HHIIACGINDETNLLKLRAHLDTVASPME 251

Query: 251 GRWDPILRRPLAVFTGTNETDPNTLA-AIGDARRAQRTNLVTP-VPGAQNLPCEIXXXXX 308
            RW       + V+  T      TLA A   A R    ++V+    G  +LPCE+     
Sbjct: 252 KRW------AICVYGQT-----GTLAQATTLAGRLNHGHIVSAWYRGTPSLPCEL--AAA 298

Query: 309 XXXXXXXNNDPASDFARLTLPGLTPGTDAQQWTHTQRDLLVKAGIS--TSIVRDGVAEIS 366
                    DPA     L L  +       +   T+++  +  G++   +      A+I 
Sbjct: 299 FAAVMASEEDPARPLNTLALNSIGVCESKDKTMRTEQENALYNGVTPIETSPAGTQAQIV 358

Query: 367 DTVTTYHPT 375
             +TTY  T
Sbjct: 359 RAITTYTKT 367
>ref|NP_599043.1| (NC_003444) tail sheath protein [Shigella flexneri bacteriophage V]
 gb|AAL89420.1| (U82619) tail sheath protein [Shigella flexneri bacteriophage V]
          Length = 498

 Score = 84.3 bits (207), Expect = 3e-015
 Identities = 92/388 (23%), Positives = 150/388 (37%), Gaps = 27/388 (6%)

Query: 103 GVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMT 162
           G AA   L ++G A+ + TV V +G  R     +  G      A+ +   I   P +P T
Sbjct: 97  GAAATVTLTVTGEATESGTVNVYVGRTRVQ-APVTNGDNVTTIASSIQDAINAVPTLPFT 155

Query: 163 AAVDGTNANEVDVTAKWQGLSGNDLVVSIE------GSI--PGITVAITQPTGGAADPEV 214
           A+   ++A  V +TA+ +GL GN++ VS+       G +   G+ +A+   T G   P +
Sbjct: 156 AS---SSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVL 212

Query: 215 ADTLALFGENVWYTQIVSCFNTANTDALDAFETFGEGRWDPILRRPLAVFTGTNETDPNT 274
              +A   +  +    +   +TA+ + L        GRW    +    V+T    T  + 
Sbjct: 213 TGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGT-LSE 271

Query: 275 LAAIGDARRAQRTNLVTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPG 334
           L   GD    Q   L       Q    E+             NDPA       L G+ P 
Sbjct: 272 LVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPA 331

Query: 335 TDAQQWTHTQRDLLVKAGISTSIVRDGVAEISDTVTTYHPT--GEVNPGYLFYKDTVKAS 392
              +++T T++  L+  G++T+ V  GV  I   VTTY     G  +  YL  +    ++
Sbjct: 332 PKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSA 391

Query: 393 NVLFNLDLIFNTR------EWDGAPLIPDDQPTTNARAKRPKDAVAVLSVLANNLGLDAI 446
            VL  L  +  ++        DG    P     T A  K        L      L    I
Sbjct: 392 YVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIK------GELLATYRQLERAGI 445

Query: 447 ISDVPFTLENIRASINDQNPNRLDIIFP 474
           + +     + +    +   PNRL+ +FP
Sbjct: 446 VENYELFKQYLVVERDASVPNRLNTLFP 473
>ref|NP_313006.1| (NC_002695) putative tail sheath protein [Escherichia coli O157:H7]
 dbj|BAB38402.1| (AP002567) putative tail sheath protein [Escherichia coli O157:H7]
          Length = 500

 Score = 76.3 bits (186), Expect = 8e-013
 Identities = 107/438 (24%), Positives = 176/438 (39%), Gaps = 33/438 (7%)

Query: 53  PFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDL 112
           P R T A+Q  + +G GS + L   + +  N D      L     G N  G A    ++L
Sbjct: 58  PVRITRASQATELWGRGSMIALMVAEFIAINPDA----ELYAIAQG-NGTGQATAAAMNL 112

Query: 113 SGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTA---AVDGTN 169
           +GT +    V   +G +R +L   P G   ++    L A I  +P  P TA   A  G N
Sbjct: 113 TGTVTRDGIVYACVGGRRYTLPA-PKGKKGKELTDELAALINADPDAPFTASSGAGSGDN 171

Query: 170 A----NEVDVTAKWQG-LSGNDLVVSI---EGSIPGITVAITQPTGGAADPEVADTLALF 221
                  + +TA++ G  S +D+ ++    E +  GI VAI  P   AA+P++  ++A  
Sbjct: 172 GAGLKGSLGITARFTGECSVHDVRLNYHDGEATPEGIQVAIAYPKQKAANPDITRSVAGM 231

Query: 222 GENVWYTQIVSCFNTANTDALDAFETFGEGRWDPILRRPLAVFTGTNETDPNTLAAIGDA 281
           G+  +   ++   + AN   L         RW P       ++     T    + A G A
Sbjct: 232 GDRQYNYVVMPYKDDAN---LKIISDELLKRWGPAKMSDGVLWLAHTGTF-GEVQAFG-A 286

Query: 282 RRAQRTNLVTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWT 341
           +R       T +P A                     DPA     L LP       A + T
Sbjct: 287 KRNDFLATCTSIPKAPEPDYLWAAAVCATCAPSLGADPARPLQTLALPSRMAPAPADRLT 346

Query: 342 HTQRDLLVKAGIST-SIVRDGVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLFNL 398
             +R+ L+  GI+T S+    V +I   VT Y     GE +P YL  +     S + ++L
Sbjct: 347 REERNSLLWGGIATVSVAAGDVVQIERQVTMYRQNVYGESDPSYLDVETIYTLSWLRYSL 406

Query: 399 DLIFNTREWDGAPLIPDDQPTTNARAKRPKDAVAVLSVLANNLGLD----AIISDVPFTL 454
              F T+ +    L  D  P    RA +      ++ +   +LG +     ++ ++    
Sbjct: 407 -RTFITQRFPRHKLADDGTP---VRAGQNIVTPEIMKLQLISLGEEWVELGLVENLDTFK 462

Query: 455 ENIRASINDQNPNRLDII 472
           +N+    N  + NRLD++
Sbjct: 463 KNLLVERNASDRNRLDVL 480
>ref|NP_543096.1| (NC_003356) putative sheath protein [Bacteriophage P27]
 emb|CAC83562.1| (AJ298298) putative sheath protein [Bacteriophage P27]
          Length = 498

 Score = 73.9 bits (180), Expect = 4e-012
 Identities = 90/389 (23%), Positives = 152/389 (38%), Gaps = 29/389 (7%)

Query: 103 GVAAEGVLDLSGTASATATVRVRIGSQRSSL-VTIPTGTTAEQAAALLVAGIQGNPFMPM 161
           G AA   + ++G A  + T+ + +G  RSS+ V +  G  A   A  +   + G   +P 
Sbjct: 97  GAAATVRVTVTGEAEESGTLSLYVG--RSSVQVPVVNGDDATAVATAIKEAVNGVITLPF 154

Query: 162 TAAVDGTNANEVDVTAKWQGLSGNDLVVSIE------GSI--PGITVAITQPTGGAADPE 213
            A+ D   A  V +TA+ +GL GN+L V +       G I   G+ V     T G+  P+
Sbjct: 155 AASSD---AGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPD 211

Query: 214 VADTLALFGENVWYTQIVSCFNTANTDALDAFETFGEGRWDPILRRPLAVFTGTNETDPN 273
           +   +A  G+  +    +   + A+ + +        GRW    +    V+T    T  +
Sbjct: 212 LTAAVAAMGDEAFDFIGLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGT-LS 270

Query: 274 TLAAIGDARRAQRTNLVTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTP 333
            L   GD    Q   L       Q+   E+             NDPA       L G+ P
Sbjct: 271 ELVNAGDMHNQQHITLAGYEKETQSPVDELVASRLAREAVFIRNDPARPTQTGELVGMLP 330

Query: 334 GTDAQQWTHTQRDLLVKAGISTSIVRDGVAEISDTVTTY--HPTGEVNPGYLFYKDTVKA 391
               +++  T++  L+  G++T+ V  G   I  +VTTY  +  G  +  YL  +    +
Sbjct: 331 APKGKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTS 390

Query: 392 SNVLFNLDLIFNTREW------DGAPLIPDDQPTTNARAKRPKDAVAVLSVLANNLGLDA 445
           + VL  L  +  ++        DG    P     T A  K        L      +    
Sbjct: 391 AYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIK------GELLATYRQMERAG 444

Query: 446 IISDVPFTLENIRASINDQNPNRLDIIFP 474
           I+ +     + +    +  NPNRL+ +FP
Sbjct: 445 IVENYDLFKQYLIVERDADNPNRLNTLFP 473
>ref|NP_274135.1| (NC_003112) phage sheath protein [Neisseria meningitidis MC58]
 pir||E81121 phage sheath protein NMB1104 [imported] - Neisseria meningitidis
           (group B strain MD58)
 gb|AAF41495.1| (AE002460) phage sheath protein [Neisseria meningitidis MC58]
          Length = 469

 Score = 73.6 bits (179), Expect = 5e-012
 Identities = 88/345 (25%), Positives = 131/345 (37%), Gaps = 27/345 (7%)

Query: 34  PQHVALLGQGNTAATYE-LTPFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPL 92
           PQ V ++    TA     L P +  S A+    FG GS  HL   Q   +N      + L
Sbjct: 35  PQKVLMVAPMLTAGIQPALEPVQLFSDAEAADLFGQGSLAHLMVRQAFANNP----YLDL 90

Query: 93  TIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAG 152
           T+  + D+ AGV A   + LSGTA+A   V + IG ++ S   + TG TA   A  L   
Sbjct: 91  TVIGIADHSAGVQATATVTLSGTATAPGVVEITIGGKQVS-TAVNTGETAATVADRLKTA 149

Query: 153 IQGNPFMPMTAAVDGTNANEVDVTAKWQGLSGNDLVVSIEGSIPGITVAITQPTGGAADP 212
           I       +T    G+ A  V +TAK +G  GN+  +++     G+T      TGGA + 
Sbjct: 150 ITA---ADVTVTASGSGA-AVTLTAKHKGEIGNESGLTVSTGNTGLTYQANAFTGGAKNA 205

Query: 213 EVADTLALFGENVWYTQIVSCFNTANTDALDAFETFGEGRWDP-----ILRRPLAVFTGT 267
           ++A  L+      ++       + AN  AL    T      +      +L    A+ T T
Sbjct: 206 DIATALSKVAGKHYHIICSPFSDDANAKALSNHITNVSNAIEQRGCIGVLGMSAALSTAT 265

Query: 268 NETDPNTLAAIGDARRAQRTNLVTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLT 327
             T       I D R           P                       DPA     L 
Sbjct: 266 TAT-----GEINDGRMTCAWYKGAVEPNG-------IIAAGYAAVLAFEEDPAKPLNTLE 313

Query: 328 LPGLTPGTDAQQWTHTQRDLLVKAGISTSIVRDGVAEISDTVTTY 372
           + GL    DAQ     + +  +  G++   V +   +I   V+TY
Sbjct: 314 IKGLAVTPDAQWPLFAECNNALYNGLTPLTVVNNRVQIMRAVSTY 358
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.317 0.134 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 316,395,816 Number of Sequences: 1028106 Number of extensions: 13381695 Number of successful extensions: 29261 Number of sequences better than 5.0e-02: 7 Number of HSP's better than 0.1 without gapping: 2 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 29247 Number of HSP's gapped (non-prelim): 8 length of query: 500 length of database: 324,535,921 effective HSP length: 125 effective length of query: 375 effective length of database: 196,022,671 effective search space: 73508501625 effective search space used: 73508501625 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 93 (40.4 bits) Query= orf13 [3605-3675] (69 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.307 0.125 0.000 Gapped Lambda K H 0.267 0.0410 2.50e+262 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 38,340,155 Number of Sequences: 1028106 Number of extensions: 1211910 Number of successful extensions: 4970 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4970 Number of HSP's gapped (non-prelim): 0 length of query: 69 length of database: 324,535,921 effective HSP length: 45 effective length of query: 24 effective length of database: 278,271,151 effective search space: 6678507624 effective search space used: 6678507624 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.6 bits) S2: 84 (37.0 bits) Query= orf14 [3703-3837] (133 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.135 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 84,081,426 Number of Sequences: 1028106 Number of extensions: 3350964 Number of successful extensions: 8801 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 8801 Number of HSP's gapped (non-prelim): 0 length of query: 133 length of database: 324,535,921 effective HSP length: 109 effective length of query: 24 effective length of database: 212,472,367 effective search space: 5099336808 effective search space used: 5099336808 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 83 (36.6 bits) Query= orf15 [3868-3903] (34 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.325 0.142 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 18,686,389 Number of Sequences: 1028106 Number of extensions: 313471 Number of successful extensions: 679 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 679 Number of HSP's gapped (non-prelim): 0 length of query: 34 length of database: 324,535,921 effective HSP length: 10 effective length of query: 24 effective length of database: 314,254,861 effective search space: 7542116664 effective search space used: 7542116664 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 85 (37.4 bits) Query= orf16 [3919-4523] (603 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_058453.1|  (NC_002321) orf 15 [Staphylococcus aureus ...   169  7e-041
ref|NP_061640.1|  (NC_002486) phi PVL ORF 15 and 16 homologu...   169  7e-041
gb|AAL82371.1|  (AF424783) tail length tape measure protein ...   169  1e-040
ref|NP_646712.1|  (NC_003923) hypothetical protein [Staphylo...   166  1e-039
ref|NP_310668.1|  (NC_002695) putative tail protein [Escheri...   163  5e-039
ref|NP_288365.1|  (NC_002655) putative tail fiber component ...   161  2e-038
ref|NP_108725.1|  (NC_002703) unknown [Bacteriophage Tuc2009...   157  5e-037
ref|NP_112708.1|  (NC_002747) TMP [Lactococcus lactis bacter...   156  8e-037
ref|NP_075512.1|  (NC_002661) orf1374 [Staphylococcus aureus...   145  1e-033
ref|NP_490625.1|  (NC_003278) orf25; similar to T gene of P2...   130  5e-029
ref|NP_046782.1|  (NC_001895) gpT [Enterobacteria phage P2] ...   114  3e-024
ref|NP_052273.1|  (NC_001317) G protein [Enterobacteria phag...   111  2e-023
ref|NP_471714.1|  (NC_003212) similar to hypothetical protei...   110  6e-023
gb|AAK85310.1|  (AY043263) phage-related tail protein [Wolba...    49  2e-004
>ref|NP_058453.1| (NC_002321) orf 15 [Staphylococcus aureus bacteriophage PVL]
 pir||T00148 hypothetical protein 15 - Staphylococcus aureus phage phi PVL
 dbj|BAA31888.1| (AB009866) orf 15 [Staphylococcus aureus bacteriophage PVL]
          Length = 694

 Score =  169 bits (429), Expect = 7e-041
 Identities = 114/384 (29%), Positives = 191/384 (49%), Gaps = 25/384 (6%)

Query: 59  SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
           ++ GIEF+  + +     G       + ++ L   A+E+G             ++++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234

Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
           G++++Q M  L  ++ LA A+G +L   +DI TD L AFGL  KDS  LA       DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHLA-------DVL 287

Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
           A+TS+ ANT +  + E  +  AP A A G +IE  S  IG+M++ GIK E AGTA++  F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347

Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
             L++P       + RLGI + DS G M     ++  L      + + Q+ +    +FG 
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407

Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
           E ++G L +INA  +   +   ++ S+ GA+ RMA  +   + G LRTLRS +E +A+  
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467

Query: 359 FELSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 417
           ++     ++ +V   +             V  ++  +  +I   V G  L   V   L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVVGFGLFVAVLGPLVF 515

Query: 418 AFNTIVTTITGAITLLN-LVIAAN 440
            F   ++ +  A+T+L  L+I  N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_061640.1| (NC_002486) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
           prophage phiPV83]
 dbj|BAA97857.1| (AB044554) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
           prophage phiPV83]
          Length = 1313

 Score =  169 bits (429), Expect = 7e-041
 Identities = 114/384 (29%), Positives = 191/384 (49%), Gaps = 25/384 (6%)

Query: 59  SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
           ++ GIEF+  + +     G       + ++ L   A+E+G             ++++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234

Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
           G++++Q M  L  ++ LA A+G +L   +DI TD L AFGL  KDS   A       DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHFA-------DVL 287

Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
           A+TS+ ANT +  + E  +  AP A A G +IE  S  IG+M++ GIK E AGTA++  F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347

Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
             L++P       + RLGI + DS G M     ++  L      + + Q+ +    +FG 
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407

Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
           E ++G L +INA  +   +   ++ S+ GA+ RMA  +   + G LRTLRS +E +A+  
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467

Query: 359 FELSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 417
           ++     ++ +V   +             V  ++  +  +I   V G  L A V   L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVIGFGLFAAVLGPLVF 515

Query: 418 AFNTIVTTITGAITLLN-LVIAAN 440
            F   ++ +  A+T+L  L+I  N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>gb|AAL82371.1| (AF424783) tail length tape measure protein [Staphylococcus aureus
           phage phi 13]
          Length = 1550

 Score =  169 bits (427), Expect = 1e-040
 Identities = 113/384 (29%), Positives = 191/384 (49%), Gaps = 25/384 (6%)

Query: 59  SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
           ++ GIEF+  + +     G       + ++ L   A+E+G             ++++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234

Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
           G++++Q M  L  ++ LA A+G +L   +DI TD L AFGL  KDS   A       D+L
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELEAVSDIVTDGLTAFGLKAKDSGHFA-------DIL 287

Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
           A+TS+ ANT +  + E  +  AP A A G +IE  S  IG+M++ GIK E AGTA++  F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347

Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
             L++P       + RLGI + DS G M     ++  L      + + Q+ +    +FG 
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407

Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
           E ++G L +INA  +   +   ++ S+ GA+ RMA  +   + G LRTLRS +E +A+  
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467

Query: 359 FELSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 417
           ++     ++ +V   +             V  ++  +  +I   V G  L A V   L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVVGFGLFAAVLGPLVF 515

Query: 418 AFNTIVTTITGAITLLN-LVIAAN 440
            F   ++ +  A+T+L  L+I  N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_646712.1| (NC_003923) hypothetical protein [Staphylococcus aureus subsp.
           aureus MW2]
 dbj|BAB95760.1| (AP004828) hypothetical protein [Staphylococcus aureus subsp.
           aureus MW2]
          Length = 1549

 Score =  166 bits (419), Expect = 1e-039
 Identities = 98/302 (32%), Positives = 160/302 (52%), Gaps = 11/302 (3%)

Query: 59  SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
           ++ GIEF+  + +     G       + ++ L   A+E+G             ++++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234

Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
           G++++Q M  L  ++ LA A+G +L   +DI TD L AFGL  KDS   A       DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHFA-------DVL 287

Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
           A+TS+ ANT +  + E  +  AP A A G +IE  S  IG+M++ GIK E AGTA++  F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347

Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
             L++P       + RLGI + DS G M     ++  L      + + Q+ +    +FG 
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407

Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
           E ++G L +INA  +   +   ++ S+ GA+ RMA  +   + G LRTLRS +E +A+  
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467

Query: 359 FE 360
           ++
Sbjct: 468 YD 469
>ref|NP_310668.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
 dbj|BAB36064.1| (AP002559) putative tail protein [Escherichia coli O157:H7]
          Length = 696

 Score =  163 bits (413), Expect = 5e-039
 Identities = 122/392 (31%), Positives = 187/392 (47%), Gaps = 33/392 (8%)

Query: 111 GIDFLAMAGFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 170
           G  FLAMAGF  +   AALP ++++A A G++L    DI ++ L  F L        A  
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299

Query: 171 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVA 230
           + RV D L    T  NT +  + ET++   P A   G S+E  +AM G++A+NG++   A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359

Query: 231 GTAVQNFFLRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLA 290
           GTA++    RLA+P   A   L+ LG+ VAD+ G MR   DV+ DL  A    G+  +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419

Query: 291 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 349
             + + G E   G    V  AG   L +    L  A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEEAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479

Query: 350 TVESVAIRFFELSGGPMRD-------VVDQATAWIRANRELIAQNV--AGFLNTIVENID 400
             E + IR  +L  GP+R        V+++ T+  +A+  L  Q +   G L  +   I 
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539

Query: 401 SIVRGIKLIATVFATLWAFNTIVT--------------TITGAITLLNLVIAANPXXXXX 446
           S+   I ++    ATL     I+T               +TG+++LL  VI A       
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVVRVLPALWGMVTGSVSLLGGVIGA--LFSPV 597

Query: 447 XXXXXXXXXXXXXXYLHWEPIKTFFTDLWDGI 478
                         + +W+PI+ FF  ++ GI
Sbjct: 598 GLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>ref|NP_288365.1| (NC_002655) putative tail fiber component of prophage CP-933T
           [Escherichia coli O157:H7 EDL933]
 gb|AAG56919.1|AE005413_13 (AE005413) putative tail fiber component of prophage CP-933T
           [Escherichia coli O157:H7 EDL933]
          Length = 696

 Score =  161 bits (407), Expect = 2e-038
 Identities = 121/392 (30%), Positives = 186/392 (46%), Gaps = 33/392 (8%)

Query: 111 GIDFLAMAGFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 170
           G  FLAMAGF  +   AALP ++++A A G++L    DI ++ L  F L        A  
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299

Query: 171 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVA 230
           + RV D L    T  NT +  + ET++   P A   G S+E  +AM G++A+NG++   A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359

Query: 231 GTAVQNFFLRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLA 290
           GTA++    RLA+P   A   L+ LG+ VAD+ G MR   DV+ DL  A    G+  +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419

Query: 291 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 349
             + + G     G    V  AG   L +    L  A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEXAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479

Query: 350 TVESVAIRFFELSGGPMRD-------VVDQATAWIRANRELIAQNV--AGFLNTIVENID 400
             E + IR  +L  GP+R        V+++ T+  +A+  L  Q +   G L  +   I 
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539

Query: 401 SIVRGIKLIATVFATLWAFNTIVT--------------TITGAITLLNLVIAANPXXXXX 446
           S+   I ++    ATL     I+T               +TG+++LL  VI A       
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVVRVLPALWGMVTGSVSLLGGVIGA--LFSPV 597

Query: 447 XXXXXXXXXXXXXXYLHWEPIKTFFTDLWDGI 478
                         + +W+PI+ FF  ++ GI
Sbjct: 598 GLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>ref|NP_108725.1| (NC_002703) unknown [Bacteriophage Tuc2009]
 gb|AAK19873.1|AF109874_46 (AF109874) unknown [Bacteriophage Tuc2009]
          Length = 1025

 Score =  157 bits (396), Expect = 5e-037
 Identities = 115/441 (26%), Positives = 199/441 (45%), Gaps = 23/441 (5%)

Query: 59  SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
           ++ G +FE  ++R     G   +   +  Q  +D+    G            G++ LA A
Sbjct: 69  AKIGGDFESQMSRVKAISGATGSSFEELRQQAIDL----GAKTAFSAKESASGMENLASA 124

Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
           GFNA++ M A+P ++ LA  +G D+  A++ A  +L  F L    S  +A       +V 
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVGLASENAATALRGFNLDASQSGHVA-------NVF 177

Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
           AK + + N  +  M E ++  AP A + G SIE VSA IG+M+  GIK   AGT+++   
Sbjct: 178 AKAAANTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237

Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
            RLA P    +  +  LG+   DS G M+   D +G L  A   +   Q+   +  ++G 
Sbjct: 238 SRLAKPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297

Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
           E L+G + +I+ G D L +   +L +++GAAD+MAK + D+M  SL  +   +ES AI  
Sbjct: 298 ESLSGMIALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGALESAAIVV 357

Query: 359 FELSGGPMRDVVDQATAWIR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 412
            ++    +R V D  +  +    +     Q +   +  IV  I  ++    + + ++  V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVLVLQRV 417

Query: 413 FATLWAFNTIVTTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYLHWEPIKTFFT 472
                A  + +  I G+ T ++L +                       Y +W+ I  F  
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471

Query: 473 DLWDGIGNAFDTFATNLANKW 493
           ++W  +       A  +  KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_112708.1| (NC_002747) TMP [Lactococcus lactis bacteriophage TP901-1]
 gb|AAG32164.1|AF252967_1 (AF252967) tape measure protein [Lactococcus lactis bacteriophage
           TP901-1]
 gb|AAK38062.1| (AF304433) TMP [Lactococcus lactis bacteriophage TP901-1]
          Length = 937

 Score =  156 bits (394), Expect = 8e-037
 Identities = 116/441 (26%), Positives = 197/441 (44%), Gaps = 23/441 (5%)

Query: 59  SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
           ++ G +FE  ++R     G   +   +  Q  +D+    G            G++ LA A
Sbjct: 69  AKIGGDFESQMSRVKAISGATGSSFEELRQQAIDL----GAKTAFSAKESASGMENLASA 124

Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
           GFNA++ M A+P ++ LA  +G D+A A++ A  +L  F L    S  +A       +V 
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVALASENAATALRGFNLDASQSGHVA-------NVF 177

Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
           AK +   N  +  M E ++  AP A + G SIE VSA IG+M+  GIK   AGT+++   
Sbjct: 178 AKAAADTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237

Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
            RLA P    +  +  LG+   DS G M+   D +G L  A   +   Q+   +  ++G 
Sbjct: 238 SRLADPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297

Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
           E L+G + +I+ G D L +   +L +++GAAD+MAK + D+M  SL  +    ES AI  
Sbjct: 298 ESLSGMMALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGAFESAAIVV 357

Query: 359 FELSGGPMRDVVDQATAWIR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 412
            ++    +R V D  +  +    +     Q +   +  IV  I  ++    + +  +  V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVVTLQRV 417

Query: 413 FATLWAFNTIVTTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYLHWEPIKTFFT 472
                A  + +  I G+ T ++L +                       Y +W+ I  F  
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471

Query: 473 DLWDGIGNAFDTFATNLANKW 493
           ++W  +       A  +  KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_075512.1| (NC_002661) orf1374 [Staphylococcus aureus temperate phage phiSLT]
 dbj|BAB21743.1| (AB045978) orf1374 [Staphylococcus aureus temperate phage phiSLT]
          Length = 1374

 Score =  145 bits (366), Expect = 1e-033
 Identities = 88/298 (29%), Positives = 153/298 (50%), Gaps = 18/298 (6%)

Query: 94  AKEVGXXXXXXXXXXXXGIDFLAMAGFNAEQSMAALPKIVSLATAAGIDLARATDIATDS 153
           A ++G            G++ LA  GFNA+Q+M A+P ++S A A+G ++A    +   +
Sbjct: 364 AVDLGAKTSKSANEVAKGMEELAALGFNAKQTMEAMPGVISAAEASGAEMATTATVMASA 423

Query: 154 LGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETV 213
           + +FGL   D+  +A       D+LA+++  +   I  M + ++ A   A A G SIE  
Sbjct: 424 INSFGLKASDANHVA-------DLLARSANDSAADIQYMGDALKYAGTPAKALGVSIEDT 476

Query: 214 SAMIGVMASNGIKAEVAGTAVQNFFLRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVV 273
           SA I V++++G++   AGTA++  F+RLA P+    K +++LGI ++D+ G      +++
Sbjct: 477 SAAIEVLSNSGLEGSQAGTALRASFIRLANPSKNTAKEMKKLGIHLSDAKGQFVGMGELI 536

Query: 274 GDLNGALATMGERQRLAVIQKVFGAEGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMA 333
                 +  M   Q+LA +  + G E  +G L +I AG D +  Y  +L ++ G + + A
Sbjct: 537 RQFQDNMKGMTREQKLATVATIVGTEAASGFLALIEAGPDKINSYSKSLKNSNGESKKAA 596

Query: 334 KRIGDDMLGSLRTLRSTVESVAIRFFELSGGPMRDVVDQATAWIRANRELIAQNVAGF 391
             + D++ G+L  L    ES+AI           +V    T  IRA  E + + V GF
Sbjct: 597 DLMKDNLKGALEQLGGAFESLAI-----------EVGKDLTPMIRAGAEGLTKLVDGF 643
>ref|NP_490625.1| (NC_003278) orf25; similar to T gene of P2 [Bacteriophage phi CTX]
           [bacteriophage phi CTX]
 dbj|BAA36253.1| (AB008550) orf25; similar to T gene of P2 [bacteriophage phi CTX]
          Length = 904

 Score =  130 bits (327), Expect = 5e-029
 Identities = 107/457 (23%), Positives = 202/457 (43%), Gaps = 54/457 (11%)

Query: 62  GIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMAGFN 121
           G EF+  ++ AT  +     K     Q L   A+ +             G  FL   G+N
Sbjct: 212 GYEFDATMS-ATQAVTRIERKDDPQMQALRQQARTLPLSSKFTDKEVAQGQYFLGRTGYN 270

Query: 122 AEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 181
           A+Q + A+P +++LA A  +DL    DIA++   A G+  +   Q+A       DVL   
Sbjct: 271 AKQILGAMPGMLNLAAAGDMDLGDTADIASNIQTAMGIPAEKMDQVA-------DVLTAA 323

Query: 182 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFLRL 241
            T  N  I M+ ++++ +A      GQS+ETV+    ++ + G++  +AGT++++   RL
Sbjct: 324 FTRNNVDIRMLGDSLKYSAGVGREYGQSLETVTTATALLGNAGVQGSMAGTSMRSVLTRL 383

Query: 242 AAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGAEGL 301
                   K + +LG+   D+ GNMRD  D++ D+N   A MG  QR A+ + + G   +
Sbjct: 384 GTS-----KAVAKLGVQTKDANGNMRDMLDILKDINKKTAGMGNVQRGAIFKDIAGQYAV 438

Query: 302 AGNLGVINAGKDALVE-YRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFE 360
                ++ A +    +  R +L ++EG A R+A    D++ G +  L + +E++++  F+
Sbjct: 439 TSFGTLMRAVEGGQFQTMRESLNNSEGEAARVAATQLDNLKGDMTMLHAALENISVELFD 498

Query: 361 LSGGPMRDVVDQAT-------AWIRANREL---IAQNVAGFLNTIVENIDSIVRGIK-LI 409
            +   +R++    +        +++AN ++   I   VA F + ++  + S+   +  ++
Sbjct: 499 KNSPWLRELAADLSHLLHNIGEFLKANPQVSKGIVITVAAF-SALMATVGSLAITLAGIL 557

Query: 410 ATVFATLWAFNTIVTTITGAITLLNLV----------------------------IAANP 441
             + A  +   TI   + G I LL L+                            +AANP
Sbjct: 558 GPMIAVRFMLGTIGIRLPGLIGLLKLLFAPIRMLAGLLIGPLVTALRVVSIALWGLAANP 617

Query: 442 XXXXXXXXXXXXXXXXXXXYLHWEPIKTFFTDLWDGI 478
                              Y +W+ +K +   +W+ I
Sbjct: 618 VVLAIAAVVAVLAGAAYLIYRNWDAVKAYLLGMWEEI 654
>ref|NP_046782.1| (NC_001895) gpT [Enterobacteria phage P2]
 gb|AAD03293.1| (AF063097) gpT [Enterobacteria phage P2]
          Length = 815

 Score =  114 bits (285), Expect = 3e-024
 Identities = 105/470 (22%), Positives = 179/470 (37%), Gaps = 42/470 (8%)

Query: 62  GIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMAGFN 121
           G EF    +     +G  V K +     L   A+++G                +A AG +
Sbjct: 212 GYEFAQKNSELQAVIG--VAKDSAEMAALRKQARQLGDNTAASADDAAGAQIIIAKAGGD 269

Query: 122 AEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 181
            +   AA P  +++A A    +     +      AF L           +A + DVL+ T
Sbjct: 270 VDAIQAATPVTLNMALANRRTMEENAALLMGMKSAFQLSND-------KVAHIGDVLSMT 322

Query: 182 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFLRL 241
                   D M + +  AAP A  AG SIE  +AM+G +    I   +AGT  +    RL
Sbjct: 323 MNKTAADFDGMSDALTYAAPVAKNAGVSIEETAAMVGALHDAKITGSMAGTGSRAVLSRL 382

Query: 242 AAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALA--TMGERQRLAVIQKVFGAE 299
            AP G+A   L+ LG+  +DS GN R  F ++ ++  +     +G  Q+   ++ +FG E
Sbjct: 383 QAPTGKAWDALKELGVKTSDSKGNTRPIFTILKEMQASFEKNRLGTAQQAEYMKTIFGEE 442

Query: 300 GLAGNLGVINAGKDALVEYRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
             +    ++ A     ++       A +G    +   + D++ G  +  +S  E+V    
Sbjct: 443 ASSAAAVLMTAASTGKLDKLTAAFKASDGKTAELVNIMQDNLGGDFKAFQSAYEAVGTDL 502

Query: 359 FELSGGPMRDVVDQAT-------AWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIA- 410
           F+   G +R +   AT        WI+ N+ L   +  G +      +  I+  I L+A 
Sbjct: 503 FDQQEGALRKLTQTATKYVLKLDGWIQKNKSL--ASTIGIIAGGALALTGIIGAIGLVAW 560

Query: 411 -------TVFATLWAFNTIVTTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYLH 463
                   + A   A   + TT+  A+     + A                        +
Sbjct: 561 PVITGINAIIAAAGAMGAVFTTVGSAV-----MTAIGAISWPVVAVVAAIVAGALLIRKY 615

Query: 464 WEPIKTFFTDLWDGIGNAFDTFATNLANKWEALIAPIRSSIQWLLEQADA 513
           WEP+  FF  + +G+  AF             L  P++    WL E+  A
Sbjct: 616 WEPVSAFFGGVVEGLKAAFAPVG--------ELFTPLKPVFDWLGEKLQA 657
>ref|NP_052273.1| (NC_001317) G protein [Enterobacteria phage 186]
 gb|AAC34170.1| (U32222) G protein [Enterobacteria phage 186]
          Length = 812

 Score =  111 bits (278), Expect = 2e-023
 Identities = 106/456 (23%), Positives = 180/456 (39%), Gaps = 51/456 (11%)

Query: 80  VTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMAGFNAEQSMAALPKIVSLATAA 139
           V K +   Q L   A+ VG                +A +G +A    A     +++A + 
Sbjct: 227 VDKQSPEMQALRKQARHVGDNTAASADDAASAQIIIAKSGGDAAAIQARRRVTLNMALSN 286

Query: 140 GIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFETIRKA 199
              +     + T    AF L           +A + DVL+ T        D + + +  A
Sbjct: 287 RRTMEENAALLTGMKSAFQLSND-------KIAHIGDVLSMTMNKTAADFDGLSDALTYA 339

Query: 200 APTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFLRLAAPAGEARKILRRLGIDV 259
           AP A  AG SIE  +AM+G +    I   +AGT  +    RL  P  +A + ++ LG+  
Sbjct: 340 APVAKNAGVSIEQTAAMVGALHDAKITGSMAGTGSRAILSRLQLPPEKAFEAIKELGVKT 399

Query: 260 ADSAGNMRDAFDVVGDLNGALA--TMGERQRLAVIQKVFGAEGLAGNLGVINAGKDALVE 317
           +DS GN R  F ++ ++  +     +G  QR   ++ +FG E  +    ++ A     ++
Sbjct: 400 SDSKGNTRPIFSILKEMQRSFEKNNLGTSQRGEYMKTIFGEEASSAAAVLMEAASSGKLD 459

Query: 318 YRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFELSGGPMRDVVDQAT-- 374
                  A +G  + + K + D++ G  +  +S  E+V    F+   G +R +   AT  
Sbjct: 460 RLTAAFKASDGKTEELVKVMQDNLGGDFKEFQSAYEAVGTDLFDQQEGSLRKLTQTATQY 519

Query: 375 -----AWIRANREL------IAQNVAGFLNTI----------VENIDSIVRGIKLIATVF 413
                 WI+ N+ L      IA      +  I          V  I++I+    ++ TVF
Sbjct: 520 VLKLDGWIQKNKGLATTIGIIAGGALALIGIIGGIGLVAWPVVMGINAIIAAAGVLGTVF 579

Query: 414 ATLWAFNTIVTTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYLHWEPIKTFFTD 473
                  + + T  GAIT   + + A                       +WEPI  FF+ 
Sbjct: 580 TVT---GSAIVTALGAITWPIVAVGA------------AIVAGALLIRKYWEPISAFFSG 624

Query: 474 LWDGIGNAFDTFA---TNLANKWEALIAPIRSSIQW 506
           + +GI +AF   A     LA  ++ L   +R   QW
Sbjct: 625 VIEGIMSAFAPVAEMFAPLAPIFDGLGEKLRGVWQW 660
>ref|NP_471714.1| (NC_003212) similar to hypothetical protein [Lactobacillus casei
           bacteriophage A2] [Listeria innocua]
 emb|CAC97610.1| (AL596172) similar to hypothetical protein [Lactobacillus casei
           bacteriophage A2] [Listeria innocua]
          Length = 1601

 Score =  110 bits (274), Expect = 6e-023
 Identities = 74/340 (21%), Positives = 153/340 (44%), Gaps = 25/340 (7%)

Query: 59  SRAGIEFEHVI---NRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFL 115
           ++A ++F++ +   N    + G++  +  D Y  + D A++              G+  +
Sbjct: 100 TKAYLDFDNEVTEVNSLLRESGESAKEFGDRYTQVFDYAQKASVKYGVASEQTMLGMKEM 159

Query: 116 AMAGFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVN 175
              G++  Q+MA++P I + A A+G D      + T +L  FG+++KD+ +      +V 
Sbjct: 160 VKKGYDINQTMASMPAIFNAARASGDDFETVMSVTTSTLEQFGMISKDTNKQMEYTNKVA 219

Query: 176 DVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQ 235
           DVL   +         M   +    P + + G S+   +A +G++++ GI+ + AGT ++
Sbjct: 220 DVLTYVADKTAAGFSDMGTAMNYVGPISHSLGYSLTDTAAAVGLLSNRGIEGQKAGTGLR 279

Query: 236 NFFLRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKV 295
                L  P+  A + +  +G+ + D+ GNM+    ++ D+N     M + Q+ + +  +
Sbjct: 280 GMLTSLLKPSKSAAEAMSAVGLTIEDNNGNMKTLPTLLDDINDKTKKMTKTQKNSFLTMI 339

Query: 296 FGAEGLAGNLGVINAGKDALVEYRNTLLSAEG----AADRMAK----------------- 334
           FG E L+    ++ AG D+L +Y      A G     AD M K                 
Sbjct: 340 FGREPLSAVNTLLEAGGDSLRKYSKGADEANGYTKQVADNMRKAGKFGVDQFKASLEVLE 399

Query: 335 -RIGDDMLGSLRTLRSTVESVAIRFFELSGGPMRDVVDQA 373
             +G  ++ +L  +      +  +F +LSG   ++++  A
Sbjct: 400 QNVGQKLMPALTPIIEWANKMIDKFNDLSGEQQQNIIKWA 439
>gb|AAK85310.1| (AY043263) phage-related tail protein [Wolbachia endosymbiont of
           Drosophila melanogaster]
          Length = 680

 Score = 48.9 bits (115), Expect = 2e-004
 Identities = 87/474 (18%), Positives = 163/474 (34%), Gaps = 58/474 (12%)

Query: 60  RAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMAG 119
           +  I+FE  +  A V    +  KGTD         KE+                     G
Sbjct: 110 KVAIDFESAM--ADVTKVVDFKKGTDEATKFAKKLKEMSRTIPLSAAELAQIAASGGQLG 167

Query: 120 FNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLA 179
              E        +  ++TA  +   +A D        +G+           +  V +V+ 
Sbjct: 168 IKKEDLFMFTETVAKMSTAFDMSAEQAGDSIAKLSNVYGIDVS-------KMEYVGNVIN 220

Query: 180 KTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFL 239
             S +       M E +     TA   G  I+  S+++    S G +   A TA+     
Sbjct: 221 HLSDNTAAKAKDMVEALAIVGGTAKQFGLDIKETSSLVNAFVSLGKQPAKAATAINALLS 280

Query: 240 RLAAP---AGEARKILRRLGIDVADSAGNMRD-AFDVVGDLNGALATMGERQRLAVIQKV 295
           +L       G+ +  L ++GI   +    + +   + +     AL  M  ++R  ++ K+
Sbjct: 281 KLQTAEEQGGDFKAALEQMGITAEEIVQRISENGEEALLYFFQALKKMDNQERSTILMKL 340

Query: 296 FGAE---GLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVE 352
           FG E    +A   G  N  +DA+    +T             R+ D     LR LR+ + 
Sbjct: 341 FGQEYQDDIALLAGSFNKYEDAIRLLSDTEEYKSSLQKEFQNRV-DTTASKLRLLRNAIA 399

Query: 353 SVAIRFFELSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIA-- 410
            V +    +    ++ + +      R+   L A+        I+  + +++  +K++A  
Sbjct: 400 EVGMNLGSVMLPTLKSIAEFLQEKTRSIA-LFAEKYPTLTKAIMGTVAALI-SLKVVAVG 457

Query: 411 ----------TVFAT----LWAFNTIVTTITGAI-TLLNLVIAANPXXXXXXXXXXXXXX 455
                     T+F+     L  F+ +  T+  A+ T L  V  ANP              
Sbjct: 458 LGYGFTLLGSTIFSLKANLLGVFSFLSATVFPAVVTGLRAVTLANPIGLLITGLVTGAAL 517

Query: 456 XXXXXYLHWEPIKTFFTDLWDGIGNAFDTFATNLANKWEALIAPIRSSIQWLLE 509
                  +W+ +K FF+                    W+++I PI  +  W+ E
Sbjct: 518 VIT----NWQKVKDFFSSF------------------WKSIIKPIEEAFSWIGE 549
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.319 0.133 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 269,999,329 Number of Sequences: 1028106 Number of extensions: 8713787 Number of successful extensions: 24707 Number of sequences better than 5.0e-02: 34 Number of HSP's better than 0.1 without gapping: 23 Number of HSP's successfully gapped in prelim test: 11 Number of HSP's that attempted gapping in prelim test: 24614 Number of HSP's gapped (non-prelim): 74 length of query: 603 length of database: 324,535,921 effective HSP length: 127 effective length of query: 476 effective length of database: 193,966,459 effective search space: 92328034484 effective search space used: 92328034484 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 94 (40.8 bits) Query= orf17 [4523-4941] (417 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_274140.1|  (NC_003112) phage virion protein, putative...    40  0.039
>ref|NP_274140.1| (NC_003112) phage virion protein, putative [Neisseria meningitidis
           MC58]
 pir||B81122 phage virion protein, probable NMB1109 [imported] - Neisseria
           meningitidis (group B strain MD58)
 gb|AAF41500.1| (AE002460) phage virion protein, putative [Neisseria meningitidis
           MC58]
          Length = 443

 Score = 40.4 bits (93), Expect = 0.039
 Identities = 33/129 (25%), Positives = 59/129 (45%), Gaps = 10/129 (7%)

Query: 3   WQTRLREAAYTPPSGNRLTFIYTGVSEEFDQKGGPFDFAGAQGTYVQFLGVTGRRYPMTI 62
           W T L++A+Y    G  +  +     +   +   PF     QG  ++ +G+TGR+  +  
Sbjct: 4   WHTLLQDASYKGV-GFDIEVVDESNGKALAEHARPF----VQGIDLEDMGMTGRQVQINA 58

Query: 63  IVSGDDYDLDAAAWMRALAEQGEAVLEHPAYGRL--TVAPVGTVKRSENFVNGAGQATIE 120
           +  G  Y       + AL + G  VL HP +GR+   +A   + +   ++V+ AG   I+
Sbjct: 59  VFWGKGYAGRLKKLLDALEQPGGGVLVHPVWGRMHNMIAASWSYRHEADYVDYAG---ID 115

Query: 121 VTLFETTGA 129
           +T  E   A
Sbjct: 116 ITFREAAEA 124
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.317 0.135 0.000 Gapped Lambda K H 0.267 0.0410 2.92e-152 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 200,742,413 Number of Sequences: 1028106 Number of extensions: 6748262 Number of successful extensions: 13026 Number of sequences better than 5.0e-02: 1 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 13026 Number of HSP's gapped (non-prelim): 1 length of query: 417 length of database: 324,535,921 effective HSP length: 124 effective length of query: 293 effective length of database: 197,050,777 effective search space: 57735877661 effective search space used: 57735877661 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 93 (40.4 bits) Query= orf18 [4938-5356] (417 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_439666.1|  (NC_000907) 43 kD tail protein [Haemophilu...    63  6e-009
ref|NP_050648.1|  (NC_000929) putative tail protein [Enterob...    62  2e-008
ref|NP_284542.1|  (NC_003116) hypothetical protein [Neisseri...    60  4e-008
ref|NP_313011.1|  (NC_002695) putative tail protein [Escheri...    60  4e-008
ref|NP_599048.1|  (NC_003444) tail protein [Shigella flexner...    51  3e-005
>ref|NP_439666.1| (NC_000907) 43 kD tail protein [Haemophilus influenzae Rd]
          Length = 420

 Score = 63.2 bits (152), Expect = 6e-009
 Identities = 46/188 (24%), Positives = 90/188 (47%), Gaps = 10/188 (5%)

Query: 68  TQDPDEVTLRINGLNFRFWRAVTITQHLDAVS-TVSLHAPFDPNDQQSRDAFRPYSYQPV 126
           T+  +++ L +NG     W+++ + + L+++S    L     P D  S  A    +  P+
Sbjct: 7   TETNNKIELYLNGKILSGWKSLNLQRSLESMSGRFDLGIAVRPEDDISVLA----AGSPL 62

Query: 127 AVDVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWDEATLQT 186
            + +GG+ + +G L   +   + N++T+  S       L DC    +++  Q+   T + 
Sbjct: 63  VLKMGGQTVITGYLDEIKQRVSGNDKTISVSGRDKTCDLVDCAIIHNSY--QFKNQTAKQ 120

Query: 187 IAADLCRPFGVQVLAPNGTGQTFERI---AVEPAEKVMAVIARLAAQRNLVVRSDEQGRL 243
           IA  +C+PFG+ V+      +  ERI    VEP E     ++++A  + ++V SD  G L
Sbjct: 121 IAEAICKPFGISVVWQVQAPEANERIPVWQVEPGETAFDNLSKIARHKGVLVTSDVDGNL 180

Query: 244 VLLRPDTQ 251
           +   P  +
Sbjct: 181 LFTEPSNK 188
>ref|NP_050648.1| (NC_000929) putative tail protein [Enterobacteria phage Mu]
 sp|P08558|VPP_BPMU 43 kDa tail protein (gpP)
 pir||ZPBPMU gene P protein - phage Mu
 emb|CAA29956.1| (X06796) 43 kDa tail protein [Enterobacteria phage Mu]
 gb|AAF01122.1|AF083977_41 (AF083977) P [Enterobacteria phage Mu]
          Length = 379

 Score = 61.6 bits (148), Expect = 2e-008
 Identities = 72/311 (23%), Positives = 124/311 (39%), Gaps = 32/311 (10%)

Query: 74  VTLRINGLNFRFWRAVTITQHLDAVSTVSLHAPFDPNDQQSRDAFRPYSYQPVAVDVGGE 133
           VTLR +G  F  W +V++T+ +++V+    +     N     D       +   +++GG+
Sbjct: 5   VTLRADGRLFTGWTSVSVTRSIESVAG---YFELGVNVPPGTDLSGLAPGKKFTLEIGGQ 61

Query: 134 RLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWDEATLQTIAADLCR 193
            + +G + + +    A+   +  +       L DC    S    QW   TL+ IA DLC 
Sbjct: 62  IVCTGYIDSRRRQMTADSMKITVAGRDKTADLIDCAAVYSGG--QWKNRTLEQIARDLCA 119

Query: 194 PFGVQV---LAPNGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLVLLRPDT 250
           P+GV V   L+   +   F    ++ +E V   + R +  R +++ S+  G LV  R  +
Sbjct: 120 PYGVTVRWELSDKESSAAFPGFTLDHSETVYEALVRASRARGVLMTSNAAGELVFSRAAS 179

Query: 251 Q-------GEPV--AEFIEGQQPPISVTPTFGNQDYYSHVTGI------TPTIVGLEGPQ 295
                   GE +   +F E  +   S     G    Y+   G         +IV  +G  
Sbjct: 180 TATDELVLGENLLTLDFEEDFRDRFSEYTVKG----YARANGAEGDDIDAKSIVSRKGTA 235

Query: 296 ATVRNPRLEGVLRPFVYNADDMDEADLVQA-VQSKAGRMFAAAATYDVPVPTWRNANGGL 354
                 R     RP +  AD    A   QA    +  R  A + T++  +  W   +G L
Sbjct: 236 TDSDVTR----YRPMIIIADSKITAKDAQARALREQRRRLAKSITFEAEIDGWTRKDGQL 291

Query: 355 WRVGDFVILEA 365
           W     V ++A
Sbjct: 292 WMPNLLVTIDA 302
>ref|NP_284542.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
 pir||C81809 hypothetical protein NMA1830 [imported] - Neisseria meningitidis
           (group A strain Z2491)
 emb|CAB72001.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
 emb|CAB85055.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
          Length = 379

 Score = 60.5 bits (145), Expect = 4e-008
 Identities = 79/310 (25%), Positives = 122/310 (38%), Gaps = 28/310 (9%)

Query: 71  PDE-VTLRINGLNFRFWRAVTITQHLDAVS---TVSLHAPFDPNDQQSRDAFRPYSYQPV 126
           PD  VTL ING     W    I   L   +   +V L  P D       DA R  +   V
Sbjct: 4   PDNTVTLLINGKTHGQWTNYDIVSDLLTPADDFSVMLGRPVDAKP----DAVR--AGDKV 57

Query: 127 AVDVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWDEATLQT 186
            V VGG+ + SG +   Q  T    +T+        GVL DC+ P         +  L  
Sbjct: 58  EVRVGGDTVLSGRIDRVQTVTEKGGKTLTIQGRDDAGVLLDCSAPL----FNAQDMDLNQ 113

Query: 187 IAADLCRPFGVQVLAPNG--TGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLV 244
           I   + +P G+  +  +   T +T  ++ +EP  +    +   A    L    +  G LV
Sbjct: 114 IIEKIVKPLGLAKIRIDAAKTDKT-HKVQIEPGSRAWDALLEYAEANGLWPWLEPDGTLV 172

Query: 245 LLRPDTQGEPVAEFI---EGQQPPISVTPTFGNQDY---YSHVTGITPTIVGLEGPQATV 298
           +  PD    PVAE +    GQ   I       N+D    YS VT +  +  G    +AT 
Sbjct: 173 VGGPDYTAAPVAELVLRTNGQNNNIKRLEV--NRDMAARYSEVTVLAQSHSGKNNIKATA 230

Query: 299 RNPRLEGVLRPFVYNADDMD-EADLVQAVQSKAGRMFAAAATYDVPVPTWRNANGGLWRV 357
           ++  ++ + RP +    D+D +A   +  + +         T    V   R  +G LW+ 
Sbjct: 231 KDESVK-LHRPLIVTEPDIDSQAQAQRKAKKRLADSRLEGLTITATVQGHRTDDGTLWQP 289

Query: 358 GDFV-ILEAP 366
           G  + +L  P
Sbjct: 290 GQRINVLSEP 299
>ref|NP_313011.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
 dbj|BAB38407.1| (AP002567) putative tail protein [Escherichia coli O157:H7]
          Length = 374

 Score = 60.5 bits (145), Expect = 4e-008
 Identities = 70/268 (26%), Positives = 105/268 (39%), Gaps = 21/268 (7%)

Query: 124 QPVAVDVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTP--PASAFPLQWDE 181
           Q + + + G+ + +G L   +   +A    +  S     G L DC+   P S    QW  
Sbjct: 52  QSLELRIDGQPVITGWLDQVRQRISATRFQITLSGRDKTGDLVDCSAIHPGS----QWRN 107

Query: 182 ATLQTIAADLCRPFGVQV---LAPNGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSD 238
            TL+ IA+DLC PFGV V   +      + F    +E +E V   + R A  R ++V S+
Sbjct: 108 RTLEHIASDLCAPFGVTVRWQVNDATAARPFSTFTLENSETVADALTRAARHRGVLVTSN 167

Query: 239 EQGRLVLLRPDTQ-------GEPVAEFIEGQQPPI--SVTPTFGNQDYYSHVTGITPTIV 289
             G LV  +  +Q       GE + +        +  S     G+     H  G   T  
Sbjct: 168 AAGELVFTQAGSQRGDTLTLGENLLDLDHNVDHRLRHSEYRVRGHGRGGGH-AGDALTAG 226

Query: 290 GLEGPQATVRNPRLEGVLRPFVYNADDMDEAD-LVQAVQSKAGRMFAAAATYDVPVPTWR 348
            L  P  TV +  +    RP +  AD   +AD   Q    +  R  A +      V  W 
Sbjct: 227 TLAAPVGTVTDSAIHR-YRPKIVLADHAVDADGARQRAVREMRRAVARSVRLTATVRHWF 285

Query: 349 NANGGLWRVGDFVILEAPGAQVYRRTLM 376
             NG LW +     + AP   V  R L+
Sbjct: 286 RENGQLWDINLLTAVTAPRTGVEERDLL 313
>ref|NP_599048.1| (NC_003444) tail protein [Shigella flexneri bacteriophage V]
 gb|AAL89425.1| (U82619) tail protein [Shigella flexneri bacteriophage V]
          Length = 359

 Score = 50.8 bits (120), Expect = 3e-005
 Identities = 81/367 (22%), Positives = 132/367 (35%), Gaps = 59/367 (16%)

Query: 72  DEVTLRINGLNFRFWRAVTITQHLDAVS---TVSLHAPFDPNDQQSRDAFRPYSYQPVAV 128
           D VTLR+NG  +  W +V I   ++ ++   +V +   +  ++  +    R  +   V V
Sbjct: 3   DNVTLRVNGREWNGWTSVRIGAGIERLARDFSVDITRQWPGDEGITTLQPRIKNGSKVEV 62

Query: 129 DVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWDEATLQTIA 188
            +G E + +G +        A   +   +  S    L DC    +A P Q++  +L  IA
Sbjct: 63  LIGDELVITGWVEATPVRYDARSVSTGIAGRSLTADLIDC----AAEPTQFNGRSLVQIA 118

Query: 189 ADLCRPFGVQVLAPNGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLV---- 244
             L  PFG++V+           +  +  E V+ VI ++  Q+  +   D  GRLV    
Sbjct: 119 QALAAPFGIEVVNNGAPSGVIPDVQPDHGETVIEVINKILGQQQALAYDDPHGRLVIGGI 178

Query: 245 --------------LLRPDTQGEPVAEF----IEGQQPPISVTPTFGNQDYYSHVTGITP 286
                         +L  DT+      F    + GQ+         GN D +   T    
Sbjct: 179 GSTRAHTALVLGENILSCDTEKSIRERFSVYQVAGQRA--------GNDDDFGEAT---- 226

Query: 287 TIVGLEGPQATVRNPRLEGVL----RP-FVYNADDMDEADLVQAVQSKAGRMFAAAATYD 341
                     T    R E       RP ++        A  +     +A +  A      
Sbjct: 227 ---------TTALRARTEDAFIARYRPMYIRQTGQATGAGCIARADFEARQRAARTDETT 277

Query: 342 VPVPTWRNANGGLWRVGDFVILEAPGAQVYRRTLMQIKTVRFTATPAERTAVLELIIPGS 401
             V  WR  NG LW+    VI+  P    +  T + +  V FT    ++   L  I  G 
Sbjct: 278 YVVQGWRQGNGTLWQPNQRVIVFDPVCG-FDNTELLVSEVTFT---QDQNGTLTEIRVGP 333

Query: 402 LSGQLPE 408
               LPE
Sbjct: 334 PDAYLPE 340
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.317 0.134 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 278,634,757 Number of Sequences: 1028106 Number of extensions: 11907888 Number of successful extensions: 25791 Number of sequences better than 5.0e-02: 6 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 25782 Number of HSP's gapped (non-prelim): 9 length of query: 417 length of database: 324,535,921 effective HSP length: 124 effective length of query: 293 effective length of database: 197,050,777 effective search space: 57735877661 effective search space used: 57735877661 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 93 (40.4 bits) Query= orf19 [5349-5564] (214 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.312 0.132 0.000 Gapped Lambda K H 0.267 0.0410 2.09e-115 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 139,201,419 Number of Sequences: 1028106 Number of extensions: 5630057 Number of successful extensions: 9531 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 9531 Number of HSP's gapped (non-prelim): 0 length of query: 214 length of database: 324,535,921 effective HSP length: 116 effective length of query: 98 effective length of database: 205,275,625 effective search space: 20117011250 effective search space used: 20117011250 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 89 (38.9 bits) Query= orf20 [5568-5704] (135 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_518999.1|  (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni...    47  3e-005
>ref|NP_518999.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
 emb|CAD14580.1| (AL646061) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 132

 Score = 47.4 bits (111), Expect = 3e-005
 Identities = 33/125 (26%), Positives = 51/125 (40%), Gaps = 3/125 (2%)

Query: 3   LAFSQNVRNQRLXXXXXXXXXGSGPALIRIYNXXX--XXXXXXXXXLLAQLEMSDPAFDA 60
           +  S  VRN RL         G+   L+R+Y+              LLA++ +  P   +
Sbjct: 2   ITLSVPVRNSRLVVIGQALDAGTAGGLLRLYSAPRPDVGQALAEQVLLAEVRLPQPCMGS 61

Query: 61  PANGTMTARAITPEGSTPIGGTATWFRITDSEGNFVLDGDVGLDGSNAELELGDVNFLSN 120
              G +    I  +      G   W R+ DS+G +V D D GL  S AE+EL  +   + 
Sbjct: 62  LEGGRLVFAPIG-QALCRRSGIVAWARLCDSDGRWVADLDAGLPDSGAEVELPKLQVFAG 120

Query: 121 QEVRI 125
             V +
Sbjct: 121 GAVNV 125
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.314 0.133 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 72,014,635 Number of Sequences: 1028106 Number of extensions: 2549127 Number of successful extensions: 3840 Number of sequences better than 5.0e-02: 1 Number of HSP's better than 0.1 without gapping: 1 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3838 Number of HSP's gapped (non-prelim): 1 length of query: 135 length of database: 324,535,921 effective HSP length: 111 effective length of query: 24 effective length of database: 210,416,155 effective search space: 5049987720 effective search space used: 5049987720 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 83 (36.6 bits) Query= orf21 [5705-5927] (221 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.316 0.138 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 133,914,063 Number of Sequences: 1028106 Number of extensions: 5487482 Number of successful extensions: 10382 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 10382 Number of HSP's gapped (non-prelim): 0 length of query: 221 length of database: 324,535,921 effective HSP length: 117 effective length of query: 104 effective length of database: 204,247,519 effective search space: 21241741976 effective search space used: 21241741976 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 89 (38.9 bits) Query= orf22 [5930-6320] (389 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_313014.1|  (NC_002695) hypothetical protein [Escheric...    65  2e-009
ref|NP_415670.1|  (NC_000913) orf, hypothetical protein [Esc...    58  2e-007
dbj|BAA35978.1|  (D90749) Hypothetical protein HI1520 [Esche...    58  2e-007
ref|NP_599051.1|  (NC_003444) tail protein [Shigella flexner...    57  5e-007
ref|NP_284072.1|  (NC_003116) hypothetical protein NMA1323 [...    56  8e-007
ref|NP_404849.1|  (NC_003143) putative bacteriophage protein...    49  1e-004
ref|NP_439669.1|  (NC_000907) conserved hypothetical protein...    48  2e-004
ref|NP_050651.1|  (NC_000929) Hypothetical protein [Enteroba...    47  3e-004
ref|NP_470623.1|  (NC_003212) similar to a B. subtilis PBSX ...    45  0.001
>ref|NP_313014.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
 dbj|BAB38410.1| (AP002567) hypothetical protein [Escherichia coli O157:H7]
          Length = 361

 Score = 64.7 bits (156), Expect = 2e-009
 Identities = 69/308 (22%), Positives = 116/308 (37%), Gaps = 26/308 (8%)

Query: 20  ISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQMFIRTASNTPVTINGQTVT 79
           IS +L    P   +     +A AL+G+    + +  ++  Q+    A         + + 
Sbjct: 20  ISQRLPGSQPGVNETTLNAIAYALAGLSAQEHEHLAWISRQIIPTEADEA------ELLK 73

Query: 80  PLAEWGRILGAGPQRAAVQTQLDVSVAVVTQGGVLASGEQLQGPNGIIYITTGDTLLNXX 139
             A WG I     ++ A +    V + + T  G+   G  LQ  +G++Y  TG       
Sbjct: 74  HCAFWGVI-----RKPASRADGPVQLMLTTDAGI-TEGVLLQRSDGVVYRITGSA----- 122

Query: 140 XXXXXXXXXXXXXXXXXXXXXXNLADGAVLNFVQPLAALQPTATVSSTARTG-IDAETTE 198
                                 N   G  L+F+ P A +  TATV+ T  TG  D ET  
Sbjct: 123 --TGKAGTLNVNVEAESAGRAGNTPTGTRLSFITPQAGINQTATVTGTGLTGGADVETVP 180

Query: 199 AYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAY-PYTGDTPGTVAVYIESSTEADGI 257
              SR+  R Q  P GG   D++ WA     V  A+        G+V V        D  
Sbjct: 181 ELLSRLVFRVQNPPSGGTQYDFERWAREVPGVTRAWCKPEWPEAGSVGVTFVQDNNPDIF 240

Query: 258 PTNGQLLEARTAI---NQSPNGRRNRAPVGTLVNTFPITRE--TYNVTVSGLNVDNPADV 312
           P  G +      I   +    G+    P+G  ++ F +T +   + + +     +N A V
Sbjct: 241 PGEGDVKRVADYIRSHDDPATGQPVGQPLGPTISVFKLTNKPVAFEIRIVPKTPENQAAV 300

Query: 313 RRDISNAL 320
           ++ +++ L
Sbjct: 301 KQALTDLL 308
>ref|NP_415670.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
 sp|P75981|YMFP_ECOLI Hypothetical protein ymfP in lambdoid prophage e14 region
 pir||E64860 hypothetical protein b1152 - Escherichia coli (strain K-12)
 gb|AAC74236.1| (AE000214) orf, hypothetical protein [Escherichia coli K12]
          Length = 263

 Score = 57.8 bits (138), Expect = 2e-007
 Identities = 48/164 (29%), Positives = 66/164 (39%), Gaps = 9/164 (5%)

Query: 108 VTQGGVLASGEQLQGPNGIIYITTGDTLLNXXXXXXXXXXXXXXXXXXXXXXXXNLADGA 167
           V+ G  + +G  +Q  + + Y TT D                            N  DG 
Sbjct: 9   VSDGLKVTAGSVIQRDDLVQYTTTDDA-------TSSGGVLRVPIACSSAGAVGNADDGT 61

Query: 168 VLNFVQPLAALQPTATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETS 227
            L  V P+  L P++ V+ T   G D E  E +R+R+ ER    P GGA  DY +WA+  
Sbjct: 62  ALILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEV 120

Query: 228 SAVLNAYPYTG-DTPGTVAVYIESSTEADGIPTNGQLLEARTAI 270
             +  A+ Y      GTV V I SS   + IP       AR  I
Sbjct: 121 PGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHI 164
>dbj|BAA35978.1| (D90749) Hypothetical protein HI1520 [Escherichia coli]
 dbj|BAA35990.1| (D90750) Hypothetical protein HI1520 [Escherichia coli]
          Length = 276

 Score = 57.8 bits (138), Expect = 2e-007
 Identities = 48/164 (29%), Positives = 66/164 (39%), Gaps = 9/164 (5%)

Query: 108 VTQGGVLASGEQLQGPNGIIYITTGDTLLNXXXXXXXXXXXXXXXXXXXXXXXXNLADGA 167
           V+ G  + +G  +Q  + + Y TT D                            N  DG 
Sbjct: 22  VSDGLKVTAGSVIQRDDLVQYTTTDDA-------TSSGGVLRVPIACSSAGAVGNADDGT 74

Query: 168 VLNFVQPLAALQPTATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETS 227
            L  V P+  L P++ V+ T   G D E  E +R+R+ ER    P GGA  DY +WA+  
Sbjct: 75  ALILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEV 133

Query: 228 SAVLNAYPYTG-DTPGTVAVYIESSTEADGIPTNGQLLEARTAI 270
             +  A+ Y      GTV V I SS   + IP       AR  I
Sbjct: 134 PGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHI 177
>ref|NP_599051.1| (NC_003444) tail protein [Shigella flexneri bacteriophage V]
 gb|AAL89428.1| (U82619) tail protein [Shigella flexneri bacteriophage V]
          Length = 352

 Score = 56.6 bits (135), Expect = 5e-007
 Identities = 40/110 (36%), Positives = 52/110 (46%), Gaps = 2/110 (1%)

Query: 162 NLADGAVLNFVQPLAALQPTATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQ 221
           N  DG  L  V P+  L P++ V+ T   G D E  E +R+R+ ER    P GGA  DY 
Sbjct: 145 NADDGTSLILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYV 203

Query: 222 LWAETSSAVLNAYPYT-GDTPGTVAVYIESSTEADGIPTNGQLLEARTAI 270
           +WA+    +  A+ Y      GTV V I SS   + IP       AR  I
Sbjct: 204 VWAKEVPGITRAWTYRHWMGTGTVGVMIASSDLINPIPEESTETAARQHI 253
>ref|NP_284072.1| (NC_003116) hypothetical protein NMA1323 [Neisseria meningitidis
           Z2491]
 pir||C81901 hypothetical protein NMA1323 [imported] - Neisseria meningitidis
           (group A strain Z2491)
 emb|CAB84574.1| (AL162755) hypothetical protein NMA1323 [Neisseria meningitidis
           Z2491]
          Length = 351

 Score = 55.8 bits (133), Expect = 8e-007
 Identities = 51/174 (29%), Positives = 82/174 (46%), Gaps = 14/174 (8%)

Query: 187 TARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPY-TGDTPGTVA 245
           T + G D E+  +  +R+ E  +R P GG   DY+ WA +   V +AY Y      GTV 
Sbjct: 162 TVQGGTDRESDASLLARLLEIIRRPPAGGNRYDYKNWALSVDGVTSAYVYPLRRGLGTVD 221

Query: 246 VYIESSTEADGIPTNGQLLEARTAINQ-SPNGRRNRAPVGTLVNTFPITRETYNVTVSGL 304
           + I   T ADG+P+   +   +  I++  P   +N   +   V   P+   T  V + G+
Sbjct: 222 IAI---TSADGVPSEETVRRVQAYIDEMRPVTAKNALVLKPTVTAVPV---TVQVKLDGI 275

Query: 305 NVDNPADVRRDISNALAEYFLQREPFITGVTTGVRRDQITQIAVGGVVEGIVTA 358
           ++D   + +R I  AL EYF   +  I G    V + +     V GV++  +TA
Sbjct: 276 DLD---EAKRRIRTALKEYF---DTLIPGDGLTVSQIEAAISNVDGVIDLRLTA 323
>ref|NP_404849.1| (NC_003143) putative bacteriophage protein [Yersinia pestis]
 emb|CAC90084.1| (AJ414147) putative bacteriophage protein [Yersinia pestis]
          Length = 378

 Score = 48.9 bits (115), Expect = 1e-004
 Identities = 58/246 (23%), Positives = 90/246 (36%), Gaps = 15/246 (6%)

Query: 7   PTTQSISDNIVAQISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQMFIRTA 66
           PT   +     + I S+L  G  +   +   V++ A +G+    Y Y  ++  Q      
Sbjct: 6   PTLSELRQRNQSYIQSELKTGGNLLRFSNIGVISDADAGMAHLHYGYLDYIARQA----- 60

Query: 67  SNTPVTINGQTVTPLAEWGRILGAGPQRAAVQTQLDVSVAVVTQGGVLASGEQLQGPNGI 126
             TP     +    LA WG +     + A   T  +V  + +  G V+ +G  L   +G 
Sbjct: 61  --TPYNATDEY---LAAWGALKDVFRKAANPATSNEVRFSGIA-GRVIPAGRLLNRADGY 114

Query: 127 IYITTGDTLLNXXXXXXXXXXXXXXXXXXXXXXXXNLAD---GAVLNFVQPLAALQPTAT 183
            Y    + ++                         N  +   G VL     +  +Q TAT
Sbjct: 115 QYQLNKEVIIAEQGSALGEITAILPSPLDDATGGGNRGNSPAGTVLTLDIAIDGVQATAT 174

Query: 184 VSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPYTG-DTPG 242
             +    G D E+ +A+RSR+    Q  P GG   DYQ WA     V   +        G
Sbjct: 175 ALTKISGGADIESEDAFRSRMLLAYQNVPQGGNDTDYQSWALAVPGVTRCWVKRRLMGAG 234

Query: 243 TVAVYI 248
           TV VYI
Sbjct: 235 TVGVYI 240
>ref|NP_439669.1| (NC_000907) conserved hypothetical protein [Haemophilus influenzae
           Rd]
 sp|P44240|VG47_HAEIN Mu-like prophage FluMu protein gp47
 pir||G64034 hypothetical protein HI1520 - Haemophilus influenzae (strain Rd
           KW20)
 gb|AAC23162.1| (U32827) conserved hypothetical protein [Haemophilus influenzae Rd]
          Length = 355

 Score = 48.1 bits (113), Expect = 2e-004
 Identities = 58/241 (24%), Positives = 87/241 (36%), Gaps = 23/241 (9%)

Query: 94  RAAVQTQLDVSVAVVT----QGGVLASGEQLQGPN-GIIYITTGDTLLNXXXXXXXXXXX 148
           +  V+ Q   +  +VT        + +G   +  N G+ +ITT +T++            
Sbjct: 74  KGIVRKQASTATGLVTVTAANDTTIPAGTVFEDTNTGLTFITTQETVVKAGTADIAVKCE 133

Query: 149 XXXXXXXXXXXXXNLADGAVLNFVQPLAALQPTATVSSTARTGIDAETTEAYRSRIDERT 208
                        NL  G  L+    +  L PTATV   +  G D E+     +R+  R 
Sbjct: 134 TTGVEG-------NLKAGTSLSLTSAILGLLPTATVKVMSG-GADIESLSRLLARLIYRV 185

Query: 209 QRRPLGGAPVDYQLWAETSSAVLNAY---PYTGDTPGTVAVYIESSTEADGIPTNGQLLE 265
           Q  P GGA  DY  WA     V  A+    Y G   GTV V        D +PT   +  
Sbjct: 186 QYPPAGGASHDYIRWATEVPGVTRAWCFERYYGG--GTVGVAFACDEREDILPTPEDIAR 243

Query: 266 ARTAINQSPN---GRRNRAPVGTLVNTFPITRETYNVTVSGLNVDNPADVRRDISNALAE 322
            R  I    N   G+    P    +  F    +  N  +      N   +R+ +  +LA 
Sbjct: 244 VRAYIEGHKNEVTGQFEGMPANVELYVFAPQFQAVNFKIR--LAPNTPTLRQAVRKSLAA 301

Query: 323 Y 323
           Y
Sbjct: 302 Y 302
>ref|NP_050651.1| (NC_000929) Hypothetical protein [Enterobacteria phage Mu]
 sp|Q9T1V2|VG47_BPMU Protein gp47
 gb|AAF01125.1|AF083977_44 (AF083977) gp47 [Enterobacteria phage Mu]
          Length = 360

 Score = 47.4 bits (111), Expect = 3e-004
 Identities = 55/247 (22%), Positives = 91/247 (36%), Gaps = 21/247 (8%)

Query: 2   VAPTTPTTQSISDNIVAQISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQM 61
           +A + PT  S+       I  +L    P   +     +A A +G+    + +  ++  Q+
Sbjct: 1   MAYSPPTLSSLIARTEQNIEQRLPGSWPQAREKTLSAIAYAQAGLAAGCHEHISWVGRQI 60

Query: 62  FIRTASNTPVTINGQTVTPLAEWGRILGAGPQRA-AVQTQLDVSVAVVTQGGVLASGEQL 120
              TA              L E  R  G   ++A A    L V+ +  T    + +G + 
Sbjct: 61  IPSTADEDE----------LLEHCRFWGVRRKQATAASGPLTVTTSAAT---TIPAGTRW 107

Query: 121 QGPNGIIYITTGDTLLNXXXXXXXXXXXXXXXXXXXXXXXXNLADGAVLNFVQPLAALQP 180
           Q  +G++Y +  DT++                         N  +  +L  + P+A +  
Sbjct: 108 QRADGVVY-SLADTIV-----IDRAGTTEITVTALAAGEAGNTGENTLLTLITPVACVVS 161

Query: 181 TATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPY-TGD 239
            A        G D E+     SR++ R Q  P GG   DY  WA   S V  A+ + T  
Sbjct: 162 DAITVKGFSGGADIESAAELLSRLEYRVQYPPFGGNQFDYVRWAREVSGVTRAWCFPTWK 221

Query: 240 TPGTVAV 246
             GTV V
Sbjct: 222 GGGTVGV 228
>ref|NP_470623.1| (NC_003212) similar to a B. subtilis PBSX prophage protein
           [Listeria innocua]
 emb|CAC96518.1| (AL596168) similar to a B. subtilis PBSX prophage protein [Listeria
           innocua]
          Length = 361

 Score = 45.4 bits (106), Expect = 0.001
 Identities = 47/194 (24%), Positives = 82/194 (42%), Gaps = 15/194 (7%)

Query: 191 GIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPY-TGDTPGTVAVYIE 249
           G D E  E +R R   + Q+    G    Y+LWAE    V  A  Y  G   GTV + I 
Sbjct: 166 GSDEEELETFRKRFFTKIQKDAFSGNKNHYKLWAEEVLGVGKAKIYPLGYGEGTVKIVI- 224

Query: 250 SSTEADGIPTNGQLLE-ARTAINQSPNGRRNRAPVGTLVNTFPITRETYNVTVSGLNVDN 308
             T+ +  P +  L+E  R  I+  P      AP+G  V       +T N++   +  D 
Sbjct: 225 --TDTNLHPASPTLVEQVRNYIDPLPQKGEGVAPIGAKVFIESAAWKTINISGQLVLKDG 282

Query: 309 PADVRRDISNALAEYFLQREPFITGVTTGVRRDQITQIAVGGVVEGIVTAAGGTLT--GV 366
                + +++ + E  +        +T     D++T + +  V+  I+ +A   L    +
Sbjct: 283 -----KSVNDVIEEITMDINKLFKSIT--FPEDELTSVKL-AVISNIIFSAPSVLDNFNI 334

Query: 367 SVTNEAGDAVVSLN 380
            + NEA + +++ N
Sbjct: 335 KLNNEAANIILAEN 348
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.315 0.132 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 219,559,145 Number of Sequences: 1028106 Number of extensions: 8329913 Number of successful extensions: 23385 Number of sequences better than 5.0e-02: 10 Number of HSP's better than 0.1 without gapping: 5 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 23379 Number of HSP's gapped (non-prelim): 10 length of query: 389 length of database: 324,535,921 effective HSP length: 123 effective length of query: 266 effective length of database: 198,078,883 effective search space: 52688982878 effective search space used: 52688982878 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 92 (40.0 bits) Query= orf23 [6324-6565] (240 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.319 0.139 0.000 Gapped Lambda K H 0.267 0.0410 1.51e-154 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 174,536,852 Number of Sequences: 1028106 Number of extensions: 8124398 Number of successful extensions: 23542 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 23542 Number of HSP's gapped (non-prelim): 0 length of query: 240 length of database: 324,535,921 effective HSP length: 118 effective length of query: 122 effective length of database: 203,219,413 effective search space: 24792768386 effective search space used: 24792768386 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 89 (38.9 bits) Query= orf24 [6569-7014] (444 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_519813.1|  (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni...    63  8e-009
ref|NP_520042.1|  (NC_003295) PUTATIVE TAIL FIBER-RELATED PR...    59  1e-007
gb|AAK77227.1|  (AY040839) endoglucanase C [Aspergillus niger]     53  6e-006
ref|NP_630626.1|  (NC_003888) putative secreted cellulase [S...    46  0.001
ref|NP_637119.1|  (NC_003902) cellulase [Xanthomonas campest...    45  0.002
ref|NP_518238.1|  (NC_003295) CONSERVED HYPOTHETICAL PROTEIN...    42  0.011
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
 emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 225

 Score = 62.8 bits (151), Expect = 8e-009
 Identities = 28/37 (75%), Positives = 30/37 (80%)

Query: 104 WLLANGQAVSRTTYSELFAVLGTRFGQGDGSTTFNLP 140
           WL  NG AVSRTTY+ LF V+GT FG GDGSTTFNLP
Sbjct: 80  WLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLP 116
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 554

 Score = 58.9 bits (141), Expect = 1e-007
 Identities = 26/37 (70%), Positives = 30/37 (80%)

Query: 104 WLLANGQAVSRTTYSELFAVLGTRFGQGDGSTTFNLP 140
           WL ANG AVSRTTY+ L+A +GT FG GDG+ TFNLP
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLP 451
>gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger]
          Length = 857

 Score = 53.1 bits (126), Expect = 6e-006
 Identities = 53/197 (26%), Positives = 82/197 (40%), Gaps = 19/197 (9%)

Query: 106 LANGQAVSRTTYSELFAVLGTRFGQ----GDGSTTFNLPADGASKEPGQEFVNLGTGTGL 161
           LA   +   + Y++  A   T  GQ     D  T   + +DGA +       N  T + +
Sbjct: 513 LALSSSFGESWYADYAASSSTATGQVALSADADTILLMNSDGAYRS-----ANSATLSAV 567

Query: 162 PTYNSEARFARAPNGNMYFVATGPTRLYRSTDNGLTWT-TTNIGSGLPANFDNPAIMVGQ 220
            +  S A  A     N YF     +  Y S+D   T+T TT +GS   AN       +  
Sbjct: 568 SSLPSGAVIASDKANNTYFYGASGSSFYLSSDTAATFTVTTTLGSSTTANAIRAQPSLAG 627

Query: 221 NNYVYFIDRTNADLRRSTDDGVSFTNVGVGGGLPSSIGAPFLASAPTTTGSNPEILYFID 280
           + +V     T+  L  ST+ G SFT +G G     + G  F    P++ G  P +  F  
Sbjct: 628 DVWV----STDTGLFHSTNYGKSFTQIGSG----CTEGWSFGFGKPSSDGDYPVLFGFFT 679

Query: 281 SANDT-LYVSTNEGINW 296
               T L+ + ++G+NW
Sbjct: 680 VDGVTGLFKTEDQGVNW 696
 Score = 45.8 bits (107), Expect = 0.001
 Identities = 65/252 (25%), Positives = 100/252 (38%), Gaps = 48/252 (19%)

Query: 188 LYRSTDNGLTWTTTN----IGSGLPANF--DNPAIMVGQNNYVYFIDRTNADLRRSTDDG 241
           + RSTD G TW  T     +G  +P     +  A+    N+ +YF  R+   L +STD G
Sbjct: 118 ILRSTDQGDTWEETKLPFKVGGNMPGRGVGERLAVDPNDNSILYFGARSGNGLWKSTDYG 177

Query: 242 VSFTNVGV--GGGLPSSIGAPFLASAP---------TTTGSN----PEILYFIDSANDTL 286
            +++NV      G      +    S P         +T+GS+    P I   +    +++
Sbjct: 178 ETWSNVTAFKWTGTYFQDSSSTYTSDPVGIAWVTFDSTSGSSGSPTPRIFVGVVDTGESV 237

Query: 287 YVSTNEGINWTTFGVGD---GLPP-----------------NGLGEMIGAPDGYFYGIDI 326
           +VS + G  WT    G+   G  P                 NG G   G  +G  +  +I
Sbjct: 238 FVSEDAGETWTWVS-GEPMYGFLPHKGILSPSEHTLYISYSNGAGPYDGT-NGTVHKYNI 295

Query: 327 VSDVLYRQVPPSTTWSSVGIGNGLPSTISSPSMSVDPDGTLYFIDGADRDRLYRSTDRGV 386
            S V     P S T +  G G GL   +  P   +      ++ D    + ++RSTD G 
Sbjct: 296 TSGVWTDISPTSMTDTYYGYG-GLAVDLQVPGTVMVAALNCWWPD----ELIWRSTDSGG 350

Query: 387 TWSSQFLGQGLP 398
           TWS  +   G P
Sbjct: 351 TWSPIWAWNGYP 362
>ref|NP_630626.1| (NC_003888) putative secreted cellulase [Streptomyces coelicolor
           A3(2)]
 pir||T35237 probable secreted cellulase - Streptomyces coelicolor
 emb|CAA20642.1| (AL031515) putative secreted cellulase [Streptomyces coelicolor
           A3(2)]
          Length = 890

 Score = 45.8 bits (107), Expect = 0.001
 Identities = 64/266 (24%), Positives = 93/266 (34%), Gaps = 56/266 (21%)

Query: 158 GTGLPTYNSEARFARAPNGN--MYFVATGPTRLYRSTDNGLTWTTTNIGSGLPANFDNPA 215
           G  +P      R A  P+ N  +Y  A     L+RSTD G+TW+           F NP 
Sbjct: 154 GGNMPGRGMGERLAVDPHDNDVLYLGAPSGHGLWRSTDAGVTWSEVTA-------FPNPG 206

Query: 216 IMVGQNNYVYFIDRTNADLRRSTDDGVSFTNV--GVGGGLPSSIGAPFLASAPTTTGSNP 273
                 NY    D  +     S + G+++       GGG                 G+  
Sbjct: 207 ------NYAQ--DPNDTSGYASDNQGITWVTFDESTGGG----------------AGTAT 242

Query: 274 EILYF-IDSANDTLYVSTNEGINWTTFGVGDGLPPNGLGE--MIGAPDGYFY-------- 322
             LY  +    + +Y ST+ G  W       G P   L    ++ A +GY Y        
Sbjct: 243 RTLYVGVADKENAVYRSTDAGATWERLA---GQPTGYLAHKGVLDAENGYLYLAYSDTGG 299

Query: 323 GIDIVSDVLYRQVPPSTTWSSVGIGNGLPSTISSPSMSVDPD------GTLYFIDGADRD 376
             D     LYR    + TW+ +       +      ++VD         T Y     D  
Sbjct: 300 PYDGGKGRLYRYATATGTWTDISPAAEADTYYGFSGLTVDRQRPGTVMATAYSSWWPDT- 358

Query: 377 RLYRSTDRGVTWSSQFLGQGLPDGGN 402
           +++RSTD G TWS  +     PD  N
Sbjct: 359 QIFRSTDSGATWSQAWSYTSYPDREN 384
>ref|NP_637119.1| (NC_003902) cellulase [Xanthomonas campestris pv. campestris str.
           ATCC 33913]
 gb|AAM41043.1| (AE012276) cellulase [Xanthomonas campestris pv. campestris str.
           ATCC 33913]
          Length = 751

 Score = 44.7 bits (104), Expect = 0.002
 Identities = 51/201 (25%), Positives = 81/201 (39%), Gaps = 41/201 (20%)

Query: 224 VYFIDRTNADLRRSTDDGV------SFTNVGVGG-------GLPSSIGAPFL---ASAPT 267
           V  +   +A L RS D G       SF +  + G       G   ++G  F+   A++  
Sbjct: 179 VLLLGSRDAGLWRSDDRGAHWAKVASFPDAALAGATARNHVGREQAVGIAFVVFDAASGN 238

Query: 268 TTGSNPEILYFIDSANDTLYVSTNEGINWTTF-GVGDGLPPNGLGEMIGAPDGYFY---- 322
           T    P I   + +   +LYVS + G +W    G   GL P+    M G  DG++Y    
Sbjct: 239 TGTPTPRIYVGVSTEQTSLYVSEDAGRSWAPVAGQPRGLRPS---HMAGGSDGHWYLSYG 295

Query: 323 ---GIDIVSD-VLYRQVPPSTTWSSVGIGNGLPSTISSP-----SMSVDPDGTLYFIDGA 373
              G D+++   L++  P    W  +   + +P   S       +++VDP      +   
Sbjct: 296 DQPGPDLMAGGALWKFTPAQGRWREI---SPIPQPASGDGFGWGAVAVDPQQPQVLLAST 352

Query: 374 -----DRDRLYRSTDRGVTWS 389
                 RD LYRS D G  W+
Sbjct: 353 FRRRTPRDELYRSVDGGKHWA 373
>ref|NP_518238.1| (NC_003295) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
 emb|CAD13645.1| (AL646057) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 177

 Score = 42.4 bits (98), Expect = 0.011
 Identities = 28/62 (45%), Positives = 35/62 (56%), Gaps = 7/62 (11%)

Query: 101 APQ-WLLANGQAVSRTTYSELFAVLGTRFGQGDGSTTFNLPADGASKEPGQEFVNLGTGT 159
           APQ WLL  GQ +  ++   L+A+LGT +G GDG TTF LP    +   GQ     G G 
Sbjct: 16  APQGWLLCQGQLIDISSNEILYALLGTTYG-GDGRTTFALPNLQGALPVGQ-----GQGP 69

Query: 160 GL 161
           GL
Sbjct: 70  GL 71
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.313 0.135 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 337,325,147 Number of Sequences: 1028106 Number of extensions: 17038337 Number of successful extensions: 27753 Number of sequences better than 5.0e-02: 11 Number of HSP's better than 0.1 without gapping: 3 Number of HSP's successfully gapped in prelim test: 8 Number of HSP's that attempted gapping in prelim test: 27714 Number of HSP's gapped (non-prelim): 31 length of query: 444 length of database: 324,535,921 effective HSP length: 124 effective length of query: 320 effective length of database: 197,050,777 effective search space: 63056248640 effective search space used: 63056248640 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 93 (40.4 bits) Query= orf25 [7015-7465] (449 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_519813.1|  (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni...    67  4e-010
ref|NP_520042.1|  (NC_003295) PUTATIVE TAIL FIBER-RELATED PR...    59  1e-007
ref|NP_298995.1|  (NC_002488) phage-related tail fiber prote...    57  6e-007
ref|NP_251770.1|  (NC_002516) hypothetical protein [Pseudomo...    47  5e-004
ref|NP_625406.1|  (NC_003888) conserved hypothetical protein...    45  0.002
ref|NP_640423.1|  (NC_003919) microcystin dependent protein ...    44  0.003
ref|NP_640422.1|  (NC_003919) microcystin dependent protein ...    44  0.004
gb|AAK77227.1|  (AY040839) endoglucanase C [Aspergillus niger]     44  0.004
ref|NP_440411.1|  (NC_000911) hypothetical protein [Synechoc...    44  0.005
ref|NP_105312.1|  (NC_002678) microcystin dependent protein ...    43  0.007
ref|NP_347555.1|  (NC_003030) Probably secreted sialidase; s...    43  0.009
gb|AAK06388.1|  (AF078042) glycosyl hydrolase 5 [Caldicellul...    43  0.009
emb|CAC02964.1|  (AJ292929) CEL6 protein [Agaricus bisporus]       41  0.033
ref|NP_642098.1|  (NC_003919) cellulase [Xanthomonas axonopo...    41  0.033
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
 emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 225

 Score = 67.0 bits (162), Expect = 4e-010
 Identities = 41/100 (41%), Positives = 52/100 (52%), Gaps = 9/100 (9%)

Query: 100 PEPQWLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQEPT-------PTPG 152
           P   WL+ +G AVSRT YA LF V+GT +GAGDGSTTFN+P+   + P           G
Sbjct: 76  PPAGWLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLPELRAEFPRGWDDGRGVDAG 135

Query: 153 ETFVEKQSDGYPRDVRSTAIAGLPDGTILGAFGRFSGTPI 192
             F   Q+         TAI    DG+ L  +G  + TPI
Sbjct: 136 RVFGSVQAQALSSHQHKTAIGF--DGSNLFGWGDGNATPI 173
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 554

 Score = 58.9 bits (141), Expect = 1e-007
 Identities = 47/141 (33%), Positives = 66/141 (46%), Gaps = 31/141 (21%)

Query: 104 WLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQEPTPTPGETFVEKQSDGY 163
           WL+ +G AVSRT YA L+A +GT +GAGDG+ TFN+P   G+         F+    DG 
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLPDLRGE---------FLRGWDDG- 464

Query: 164 PRDVRSTAIAGLPDGTILGAFGRFSGTPIRKSGDGG-DTWLSTGLGNGL----SSYNIPV 218
                     G+  G   G +   SG+P+     GG  ++  T LG+G     S+   P 
Sbjct: 465 ---------RGVDSGRGFGTWQ--SGSPVVHDDVGGIASFNITALGDGTNVAWSNIADPW 513

Query: 219 MAA-----NDSGTILLIDINN 234
           + A      DS     +D NN
Sbjct: 514 VGAFPLTMYDSSAATFVDANN 534
>ref|NP_298995.1| (NC_002488) phage-related tail fiber protein [Xylella fastidiosa
           9a5c]
 pir||C82649 phage-related tail fiber protein XF1706 [imported] - Xylella
           fastidiosa (strain 9a5c)
 gb|AAF84515.1|AE003994_14 (AE003994) phage-related tail fiber protein [Xylella fastidiosa
           9a5c]
          Length = 387

 Score = 56.6 bits (135), Expect = 6e-007
 Identities = 34/75 (45%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 66  NNTPDSVGNPQYLDALTAITAPRVGQTLRTYGVLPEPQWLRTDGRAVSRTEYADLFAVVG 125
           NNT   + N    +  TA++    GQ + T G    P  L  DGRAVSR  Y  LF  + 
Sbjct: 117 NNTDTPLQNEP--NRWTALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRAMYPRLFEEIN 174

Query: 126 TQYGAGDGSTTFNIP 140
           T YGAGDG +TFNIP
Sbjct: 175 TSYGAGDGVSTFNIP 189
>ref|NP_251770.1| (NC_002516) hypothetical protein [Pseudomonas aeruginosa]
 pir||D83260 hypothetical protein PA3080 [imported] - Pseudomonas aeruginosa
           (strain PAO1)
 gb|AAG06468.1|AE004732_2 (AE004732) hypothetical protein [Pseudomonas aeruginosa]
          Length = 365

 Score = 47.0 bits (110), Expect = 5e-004
 Identities = 52/240 (21%), Positives = 92/240 (37%), Gaps = 22/240 (9%)

Query: 195 SGDGGDTWLSTGLGNGLSSYNIPVMAANDSGTILLIDINNDSRPLFRSQDGGENWSXXXX 254
           S D G TW       G  +  + +  A++   + +         L  ++DGG++W     
Sbjct: 138 SDDAGSTWARQFEDLGREAPLLDIWFADEQHGLAV----GAYGALLETRDGGQHWEDVSE 193

Query: 255 XXXXXXXXX--XXAAAPNGDFYYINRDETIFRSTDNGENWAQVSGQLPSHSFAGITVDST 312
                        AA  +     +    ++FRS D G  W ++ G      F  I     
Sbjct: 194 RLDNEDQFHLNAIAAVKDNGLLVVGEAGSLFRSKDWGATWEKLEGPYEGSLFGAIGTADA 253

Query: 313 GAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEGDGSGLP-EVSNRSSLGAGRNGDLYI 371
           G V V+ +         ++RS + G +W  +  +     L   +S+ + L  GR     +
Sbjct: 254 GGVLVYGLRG------HLFRSADFGDSWEEIPLKAASGDLEFGLSDGALLADGR----IV 303

Query: 372 IAPDRNQGYRSVDNGASWSPVSGLIDDFSFAYVTSTPNGNIYVVAGTFSTGGVYASEGSG 431
           +         S D G S+S V    D  S A V++T +G++ +V      GGV+ +  +G
Sbjct: 304 VVGHGGSVLESTDGGRSFS-VFNRPDRLSLAGVSATGDGHLILV----GQGGVHLAAANG 358
>ref|NP_625406.1| (NC_003888) conserved hypothetical protein [Streptomyces coelicolor
           A3(2)]
 emb|CAC13065.1| (AL445503) conserved hypothetical protein [Streptomyces coelicolor
           A3(2)]
          Length = 384

 Score = 45.1 bits (105), Expect = 0.002
 Identities = 55/223 (24%), Positives = 88/223 (38%), Gaps = 39/223 (17%)

Query: 202 WLSTGLGNGLSSYNIPVMAANDSGTILLIDINNDSRPLFRSQDGGENWSXXXXXXXXXXX 261
           W+  G G GL +    +  A D G++    +   +  +FR+ DGG +W+           
Sbjct: 165 WVPGGGGEGLHTV---LTDARDPGSVT---VAVSTAGVFRTADGGASWAPSNSGVSAVFL 218

Query: 262 -------------XXXXAAAPNGDFYYINRDETIFRSTDNGENWAQVSGQLPS-HSFAGI 307
                            AA P  D  Y+     ++RS D G +W  +   LPS   FA  
Sbjct: 219 PDPDPEFGQCVHKVARDAATP--DRLYLQNHWGVYRSDDAGAHWTDIGEGLPSTFGFAVA 276

Query: 308 TVDSTG-AVYVFAV---FDVVPS--VTRVYRSVNNGANWTLVAEEGDGSGLPEVSN---- 357
                G   YVF +    D VP+    RV+R+ + G +W     E   +GLP   +    
Sbjct: 277 AHPHRGDTAYVFPINADADRVPADHRCRVFRTADAGKSW-----EPLSAGLPREDHYGTV 331

Query: 358 -RSSLGAGRNGDLYIIAPDRN-QGYRSVDNGASWSPVSGLIDD 398
            R +L         +   +RN + + S D+G SW  ++  + D
Sbjct: 332 LRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQLASHLPD 374
 Score = 42.4 bits (98), Expect = 0.011
 Identities = 32/126 (25%), Positives = 53/126 (41%), Gaps = 19/126 (15%)

Query: 192 IRKSGDGGDTWLSTGLGNGL-SSYNIPVMA---ANDSGTILLIDINNDSRP------LFR 241
           + +S D G  W  T +G GL S++   V A     D+  +  I+ + D  P      +FR
Sbjct: 250 VYRSDDAGAHW--TDIGEGLPSTFGFAVAAHPHRGDTAYVFPINADADRVPADHRCRVFR 307

Query: 242 SQDGGENWSXXXXXX-------XXXXXXXXXAAAPNGDFYYINRDETIFRSTDNGENWAQ 294
           + D G++W                         A     Y+ NR+  +F S D+G++W Q
Sbjct: 308 TADAGKSWEPLSAGLPREDHYGTVLRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQ 367

Query: 295 VSGQLP 300
           ++  LP
Sbjct: 368 LASHLP 373
>ref|NP_640423.1| (NC_003919) microcystin dependent protein [Xanthomonas axonopodis
           pv. citri str. 306]
 gb|AAM34959.1| (AE011629) microcystin dependent protein [Xanthomonas axonopodis
           pv. citri str. 306]
          Length = 174

 Score = 44.3 bits (103), Expect = 0.003
 Identities = 21/44 (47%), Positives = 28/44 (62%), Gaps = 1/44 (2%)

Query: 104 WLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQEP 147
           WL  DGR +  T    L +++GTQ+G GDG TTFN+P   G+ P
Sbjct: 16  WLPCDGRTLPITSNVALASLLGTQFG-GDGKTTFNLPDLRGRTP 58
>ref|NP_640422.1| (NC_003919) microcystin dependent protein [Xanthomonas axonopodis
           pv. citri str. 306]
 gb|AAM34958.1| (AE011629) microcystin dependent protein [Xanthomonas axonopodis
           pv. citri str. 306]
          Length = 183

 Score = 43.9 bits (102), Expect = 0.004
 Identities = 22/46 (47%), Positives = 29/46 (62%), Gaps = 2/46 (4%)

Query: 101 EPQ-WLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQ 145
           EPQ W   DGR +    Y  L++++GT YG GDG TTFN+P   G+
Sbjct: 16  EPQGWAFCDGRQLQINTYMALYSLIGTTYG-GDGRTTFNLPDLRGR 60
>gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger]
          Length = 857

 Score = 43.9 bits (102), Expect = 0.004
 Identities = 39/172 (22%), Positives = 67/172 (38%), Gaps = 10/172 (5%)

Query: 204 STGLGNGLSSYNIPVMAANDSGTILLIDINNDS-------RPLFRSQDGGENWSXXXXXX 256
           S G     +S  +  +++  SG ++  D  N++          + S D    ++      
Sbjct: 552 SDGAYRSANSATLSAVSSLPSGAVIASDKANNTYFYGASGSSFYLSSDTAATFTVTTTLG 611

Query: 257 XXXXXXXXXAAAPNGDFYYINRDETIFRSTDNGENWAQV-SGQLPSHSFAGITVDSTGAV 315
                    A        +++ D  +F ST+ G+++ Q+ SG     SF      S G  
Sbjct: 612 SSTTANAIRAQPSLAGDVWVSTDTGLFHSTNYGKSFTQIGSGCTEGWSFGFGKPSSDGDY 671

Query: 316 YVFAVFDVVPSVTRVYRSVNNGANWTLV--AEEGDGSGLPEVSNRSSLGAGR 365
            V   F  V  VT ++++ + G NW ++  AE G GS    V N      GR
Sbjct: 672 PVLFGFFTVDGVTGLFKTEDQGVNWQIISDAEHGFGSASANVVNGDLQNYGR 723
>ref|NP_440411.1| (NC_000911) hypothetical protein [Synechocystis sp. PCC 6803]
 sp|P73069|Y48L_SYNY3 Ycf48-like protein
 pir||S75177 hypothetical protein slr2034 - Synechocystis sp. (strain PCC 6803)
 dbj|BAA17091.1| (D90903) ORF_ID:slr2034~hypothetical protein [Synechocystis sp. PCC
           6803]
          Length = 342

 Score = 43.5 bits (101), Expect = 0.005
 Identities = 31/156 (19%), Positives = 60/156 (37%), Gaps = 18/156 (11%)

Query: 239 LFRSQDGGENWSXXXXXXXXXXXXXXXAAAPNGDFYYINRDETIFRSTDNGENWAQV--S 296
           +F + DGG+ W                 +    + +   +   +  +TD G+ WA++  S
Sbjct: 72  IFETTDGGDTWEQKLIDLGEEKASFSAVSFSGNEGWITGKPSILLHTTDGGQTWARIPLS 131

Query: 297 GQLPSHSFAGITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEGDGSGLPEVS 356
            +LP   ++ I +    A       +++  +  +Y++ N G NW  + E     G+    
Sbjct: 132 EKLPGAPYSIIALGPQTA-------EMITDLGAIYKTTNGGKNWKALVE--GAVGVARTI 182

Query: 357 NRSSLGAGRNGDLYIIAPDRNQGYRSVDNG-ASWSP 391
            RS+ G       Y+    R   Y +   G   W+P
Sbjct: 183 QRSTDGR------YVAVSARGNFYSTWAPGQTEWTP 212
>ref|NP_105312.1| (NC_002678) microcystin dependent protein MdpB [Mesorhizobium loti]
 dbj|BAB51098.1| (AP003004) microcystin dependent protein; MdpB [Mesorhizobium loti]
          Length = 169

 Score = 43.1 bits (100), Expect = 0.007
 Identities = 21/61 (34%), Positives = 33/61 (53%), Gaps = 1/61 (1%)

Query: 87  PRVGQTLRTYGVLPEPQWLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQE 146
           P +G+     G      W+  +G+ +  +EY  LF ++GT YG GDG +TF +P   G+ 
Sbjct: 4   PYIGEIRMFAGNFAPAGWMFCEGQLLPISEYETLFQLIGTTYG-GDGQSTFALPDLRGRV 62

Query: 147 P 147
           P
Sbjct: 63  P 63
>ref|NP_347555.1| (NC_003030) Probably secreted sialidase; several ASP-boxes and
           dockerin domain [Clostridium acetobutylicum]
 gb|AAK78895.1|AE007608_1 (AE007608) Probably secreted sialidase; several ASP-boxes and
           dockerin domain [Clostridium acetobutylicum]
          Length = 839

 Score = 42.7 bits (99), Expect = 0.009
 Identities = 51/198 (25%), Positives = 81/198 (40%), Gaps = 23/198 (11%)

Query: 242 SQDGGENWSXXXXXXXXXXXXXXXAAAPNGDFYYINRDE--TIFRSTDNGENWAQVSGQL 299
           S DGG+NW                AA  +      + +E      STDNG  W   SG L
Sbjct: 536 SYDGGKNWFSAGSNISGVYKAGTVAAGADAKTIVWSPEEGANAAYSTDNGNKWTPCSG-L 594

Query: 300 PSHSFAGITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEG---DGSGLPEVS 356
           P    A +  D       +   +      + Y S + GA +T  ++ G    G G+ +  
Sbjct: 595 PQG--AKVRSDRVNPKKFYGFLN-----GKFYISTDAGATFTQSSQTGLPTKGKGIFKTV 647

Query: 357 NRSSLGAGRNGDLYIIAPDRNQGYRSVDNGASWSPVSGL-IDDFSFAYVTSTPNG--NIY 413
                  G  GD++ IA  ++  + S D+GA+++ VSG+   D      + T +G   IY
Sbjct: 648 ------IGHEGDIW-IAGGKDGLWHSTDSGATFTKVSGVDASDTVGLGKSKTDDGYPAIY 700

Query: 414 VVAGTFSTGGVYASEGSG 431
           + A    T G++ S+  G
Sbjct: 701 MDATIDGTAGIFRSDDEG 718
>gb|AAK06388.1| (AF078042) glycosyl hydrolase 5 [Caldicellulosiruptor sp. Tok7B.1]
          Length = 996

 Score = 42.7 bits (99), Expect = 0.009
 Identities = 44/191 (23%), Positives = 72/191 (37%), Gaps = 19/191 (9%)

Query: 242 SQDGGENWSXXXXXXXXXXXXXXXAAAPNGD-FYYINRDETIFRSTDNGENWAQVSGQLP 300
           S DGG++W                AAA +G    +  +   +  STDNG  W + +  +P
Sbjct: 538 SYDGGKSWFQGNTEPQGTSEGGTVAAAADGSAVVWAPKGAKVCYSTDNGNKWVECA-NVP 596

Query: 301 SHSFAGITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEGDGSGLPEVSNRSS 360
           S +       +    Y F          + Y S + G  +     E   +GLP +S    
Sbjct: 597 SEAIVYSDRVNPNKFYAF-------KNGKFYISADKGKTFI----ESPAAGLP-ISGNFK 644

Query: 361 LGAGRNGDLYIIAPDRNQGYRSVDNGASWSPVSGLIDDFSFAYVTSTPNGN---IYVVAG 417
              G  GD++++    N  + S D G S+  +SG+ D  S  +           IY  A 
Sbjct: 645 TVPGIEGDIWLVG--NNGMWHSTDGGYSFVKISGVEDAASIGFGKPAEGETYPAIYTYAK 702

Query: 418 TFSTGGVYASE 428
                G++ S+
Sbjct: 703 INGVRGIFRSD 713
>emb|CAC02964.1| (AJ292929) CEL6 protein [Agaricus bisporus]
          Length = 806

 Score = 40.8 bits (94), Expect = 0.033
 Identities = 60/275 (21%), Positives = 99/275 (35%), Gaps = 27/275 (9%)

Query: 181 LGAFGRFSGTPIRKSGDGGDTWLST-GLGNGLSSYNIPVMAAND------SGTILLIDIN 233
           +G     SG  +  S D G TW    G  + +    + + A  D      +G  +++  N
Sbjct: 510 IGTGDSTSGKQVAISSDYGVTWNQHFGAPDNVQGGKVAISADADIILWRTNGNGVMVSRN 569

Query: 234 N---------DSRPLFRSQDGGENWSXXXXXXXXXXXXXXXAAAPNGDFYYINRDETIFR 284
                       +  + S D G+ +S                        + + D+ +F 
Sbjct: 570 QATFNIFYGASGKTFYVSTDNGKTFSAHGSLGSATSVFDITVHPSVSGDIWASTDKGLFH 629

Query: 285 STDNGENWAQVSGQLPSHSFA-GITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLV 343
           STD+G  ++ +SG   +   A G    + G   VFA  +    V   +RS + G NW  +
Sbjct: 630 STDSGATFSAISGITQAWGVALGAPRSTGGYPAVFAAANYGGEVA-YFRSDDRGVNWVKI 688

Query: 344 AEEGDGSGLPEVSNRSSLGAGRNGDLYIIAPDRNQGYRSVDNGASWSPVSGLIDDFSFAY 403
            +   G G    +N  +      G +YI    R   Y  V  G++  P S      S   
Sbjct: 689 NDAAHGFGAAS-ANCMAADPRVYGRVYIGTNGRGIFYGDV-AGSAPPPTSSATSTTS--- 743

Query: 404 VTSTPNGNIYVVAGTFSTGGVYASEGSGGPAPGTG 438
           V++T   +      T +T     S G+G P  G G
Sbjct: 744 VSTTTTRSTTTTRSTTTT----TSSGNGSPVSGFG 774
>ref|NP_642098.1| (NC_003919) cellulase [Xanthomonas axonopodis pv. citri str. 306]
 gb|AAM36634.1| (AE011809) cellulase [Xanthomonas axonopodis pv. citri str. 306]
          Length = 751

 Score = 40.8 bits (94), Expect = 0.033
 Identities = 49/178 (27%), Positives = 69/178 (38%), Gaps = 43/178 (24%)

Query: 277 NRDETIFRSTDNGENWAQVSGQLPSHSFAGIT----VDSTGAVYV-FAVFD--------- 322
           +RD  ++RS D G +WA+V G  P+ + AG T    V S  AV + F VFD         
Sbjct: 185 SRDAGLWRSDDRGAHWARVEG-FPADALAGATARNHVGSEQAVGIAFVVFDAASGHAGAA 243

Query: 323 -------VVPSVTRVYRSVNNGANWTLVAEEGDGSGLPEVSNRSSLGAGRNGDLYIIAPD 375
                  V  + T +Y S + G  W+ VA  G   GL     R S   G N   + ++  
Sbjct: 244 TPRIYVGVSTAQTSLYVSEDAGRTWSAVA--GQPKGL-----RPSHMVGHNAGQWYLSYG 296

Query: 376 RNQGYRSVDNGASWS--------------PVSGLIDDFSFAYVTSTPNGNIYVVAGTF 419
              G   +  GA W+              P     D F +  V   P     ++A TF
Sbjct: 297 DRPGPDLMAGGALWNYDATQGRWREISPIPQPATGDGFGWGAVAVDPQHPQVLLASTF 354
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.314 0.135 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 339,263,776 Number of Sequences: 1028106 Number of extensions: 16873702 Number of successful extensions: 30518 Number of sequences better than 5.0e-02: 17 Number of HSP's better than 0.1 without gapping: 3 Number of HSP's successfully gapped in prelim test: 16 Number of HSP's that attempted gapping in prelim test: 30426 Number of HSP's gapped (non-prelim): 80 length of query: 449 length of database: 324,535,921 effective HSP length: 124 effective length of query: 325 effective length of database: 197,050,777 effective search space: 64041502525 effective search space used: 64041502525 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 93 (40.4 bits) Query= orf26 [7519-7603] (83 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.143 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 67,709,073 Number of Sequences: 1028106 Number of extensions: 2777088 Number of successful extensions: 4718 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4718 Number of HSP's gapped (non-prelim): 0 length of query: 83 length of database: 324,535,921 effective HSP length: 59 effective length of query: 24 effective length of database: 263,877,667 effective search space: 6333064008 effective search space used: 6333064008 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 84 (37.0 bits) Query= orf27 [7603-7802] (198 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.321 0.136 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 119,367,929 Number of Sequences: 1028106 Number of extensions: 4571982 Number of successful extensions: 12946 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 12946 Number of HSP's gapped (non-prelim): 0 length of query: 198 length of database: 324,535,921 effective HSP length: 115 effective length of query: 83 effective length of database: 206,303,731 effective search space: 17123209673 effective search space used: 17123209673 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 88 (38.5 bits) Query= orf28 [7841-7919] (77 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.311 0.126 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 43,039,292 Number of Sequences: 1028106 Number of extensions: 1312202 Number of successful extensions: 2139 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2139 Number of HSP's gapped (non-prelim): 0 length of query: 77 length of database: 324,535,921 effective HSP length: 53 effective length of query: 24 effective length of database: 270,046,303 effective search space: 6481111272 effective search space used: 6481111272 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 84 (37.0 bits) Query= orf29 [7921-8761] (839 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_519008.1|  (NC_003295) PROBABLE HEMAGGLUTININ-RELATED...    44  0.011
ref|NP_522101.1|  (NC_003296) PROBABLE HEMAGGLUTININ-RELATED...    44  0.011
>ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD14589.1| (AL646061) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 3501

 Score = 43.5 bits (101), Expect = 0.011
 Identities = 37/128 (28%), Positives = 53/128 (40%), Gaps = 32/128 (25%)

Query: 443 ENKIGTDRGNIDFSGDL-----SQFSPANKSDYWTAVIASGSESIGGLTFNNGDRLIAVQ 497
           +  +G +  N+  SG L     S  S A+  D   +V+A GS S  G     G+  +   
Sbjct: 379 QQNLGINANNVASSGTLGAGVNSDGSFAHAGDL--SVVAGGSMSATGQNVAGGNATL--- 433

Query: 498 SNDVVPTTLNSPSWRVDPKTVPQASATTLGGVKISTTSALEINASTGNLDVKAATANQAG 557
                                 Q ++  L G + S    L +NA TGNLD+  ATA+  G
Sbjct: 434 ----------------------QGASVNLAGSQTSANGNLNLNAQTGNLDLTGATASAGG 471

Query: 558 ALKADAVG 565
           AL A+A G
Sbjct: 472 ALSANAQG 479
>ref|NP_522101.1| (NC_003296) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD17691.1| (AL646079) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 3552

 Score = 43.5 bits (101), Expect = 0.011
 Identities = 51/203 (25%), Positives = 75/203 (36%), Gaps = 67/203 (33%)

Query: 371 TKTADNQPLQVQGS-DNTGPNPGGPHRAVDYQ-------PLAYRQSLAVGDYELIFDELN 422
           T  + N  +  QG  DNTG   G                 LA +Q+LAV  + +     +
Sbjct: 335 TNASGNLSVSAQGGIDNTGTTYGRQSVTASTSGDLTNSGTLAAQQNLAVNAHNVA----S 390

Query: 423 NAPEGPGLNAEKVKGIEAFIENKIGTDRGNIDFSGDLSQFSPANKSDYWTAVIASGSESI 482
           +   G G+N++                 G++  +GDLS             V+A GS S 
Sbjct: 391 SGTLGAGVNSD-----------------GSLAHAGDLS-------------VVAGGSMSA 420

Query: 483 GGLTFNNGDRLIAVQSNDVVPTTLNSPSWRVDPKTVPQASATTLGGVKISTTSALEINAS 542
            G     G+  +                         Q ++  L G + S    L +NA 
Sbjct: 421 TGQNVAGGNATL-------------------------QGASVNLAGSQTSANGNLSLNAQ 455

Query: 543 TGNLDVKAATANQAGALKADAVG 565
           TGNLD+  ATA+  GAL A+A G
Sbjct: 456 TGNLDLTGATASAGGALSANAQG 478
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.314 0.133 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 527,885,665 Number of Sequences: 1028106 Number of extensions: 23443597 Number of successful extensions: 39838 Number of sequences better than 5.0e-02: 2 Number of HSP's better than 0.1 without gapping: 2 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 39829 Number of HSP's gapped (non-prelim): 7 length of query: 839 length of database: 324,535,921 effective HSP length: 130 effective length of query: 709 effective length of database: 190,882,141 effective search space: 135335437969 effective search space used: 135335437969 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 96 (41.6 bits) Query= orf30 [8762-9136] (373 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.311 0.128 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 182,819,854 Number of Sequences: 1028106 Number of extensions: 5879262 Number of successful extensions: 10720 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 10720 Number of HSP's gapped (non-prelim): 0 length of query: 373 length of database: 324,535,921 effective HSP length: 123 effective length of query: 250 effective length of database: 198,078,883 effective search space: 49519720750 effective search space used: 49519720750 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 92 (40.0 bits) Query= orf31 [9212-9372] (159 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.332 0.140 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 73,066,036 Number of Sequences: 1028106 Number of extensions: 2164437 Number of successful extensions: 8179 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 8179 Number of HSP's gapped (non-prelim): 0 length of query: 159 length of database: 324,535,921 effective HSP length: 111 effective length of query: 48 effective length of database: 210,416,155 effective search space: 10099975440 effective search space used: 10099975440 T: 11 A: 40 X1: 15 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (22.0 bits) S2: 86 (37.7 bits) Query= orf32 [9375-9464] (88 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.323 0.137 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 41,268,036 Number of Sequences: 1028106 Number of extensions: 1081384 Number of successful extensions: 2718 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2718 Number of HSP's gapped (non-prelim): 0 length of query: 88 length of database: 324,535,921 effective HSP length: 64 effective length of query: 24 effective length of database: 258,737,137 effective search space: 6209691288 effective search space used: 6209691288 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 84 (37.0 bits) Query= orf33 [9470-9574] (103 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.141 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 77,005,740 Number of Sequences: 1028106 Number of extensions: 3028339 Number of successful extensions: 4946 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4946 Number of HSP's gapped (non-prelim): 0 length of query: 103 length of database: 324,535,921 effective HSP length: 79 effective length of query: 24 effective length of database: 243,315,547 effective search space: 5839573128 effective search space used: 5839573128 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 84 (37.0 bits) Query= orf34 [9570-9645] (74 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.314 0.131 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 41,016,557 Number of Sequences: 1028106 Number of extensions: 1225595 Number of successful extensions: 2134 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2134 Number of HSP's gapped (non-prelim): 0 length of query: 74 length of database: 324,535,921 effective HSP length: 50 effective length of query: 24 effective length of database: 273,130,621 effective search space: 6555134904 effective search space used: 6555134904 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 84 (37.0 bits) Query= orf35 [9636-9823] (186 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.320 0.136 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 115,113,120 Number of Sequences: 1028106 Number of extensions: 4308382 Number of successful extensions: 7861 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 7861 Number of HSP's gapped (non-prelim): 0 length of query: 186 length of database: 324,535,921 effective HSP length: 114 effective length of query: 72 effective length of database: 207,331,837 effective search space: 14927892264 effective search space used: 14927892264 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 87 (38.1 bits) Query= orf36 [9820-9880] (59 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.312 0.133 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 44,848,135 Number of Sequences: 1028106 Number of extensions: 1497367 Number of successful extensions: 1995 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 1995 Number of HSP's gapped (non-prelim): 0 length of query: 59 length of database: 324,535,921 effective HSP length: 35 effective length of query: 24 effective length of database: 288,552,211 effective search space: 6925253064 effective search space used: 6925253064 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 85 (37.4 bits) Query= orf37 [9877-10062] (184 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.322 0.139 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 106,714,148 Number of Sequences: 1028106 Number of extensions: 4067607 Number of successful extensions: 8466 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 8466 Number of HSP's gapped (non-prelim): 0 length of query: 184 length of database: 324,535,921 effective HSP length: 114 effective length of query: 70 effective length of database: 207,331,837 effective search space: 14513228590 effective search space used: 14513228590 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 87 (38.1 bits) Query= orf38 [10064-10226] (161 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.135 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 100,547,735 Number of Sequences: 1028106 Number of extensions: 3769955 Number of successful extensions: 8101 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 8101 Number of HSP's gapped (non-prelim): 0 length of query: 161 length of database: 324,535,921 effective HSP length: 111 effective length of query: 50 effective length of database: 210,416,155 effective search space: 10520807750 effective search space used: 10520807750 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 86 (37.7 bits) Query= orf39 [10245-10932] (686 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

sp|P06225|DPOL_BPSP2  DNA POLYMERASE >gi|67052|pir||DJBPS2 D...   264  3e-069
gb|AAL82293.1|  (AF424782) DNA polymerase A domain [Staphylo...   253  5e-066
ref|NP_646242.1|  (NC_003923) DNA polymerase [Staphylococcus...   252  9e-066
ref|NP_051006.1|  (NC_000935) P45 [Acyrthosiphon pisum bacte...   159  1e-037
ref|NP_299569.1|  (NC_002488) phage-related protein [Xylella...    90  1e-016
gb|AAA23103.1|  (M94320) putative DNA polymerase [Citrus gre...    62  2e-008
gb|AAB25243.1|  DNA polymerase homolog [bacterium-like organ...    62  2e-008
sp|Q9S1G2|DPO1_RHILE  DNA polymerase I (POL I) >gi|5596366|g...    44  0.007
ref|NP_384281.1|  (NC_003047) PROBABLE DNA POLYMERASE I PROT...    42  0.019
ref|NP_254180.1|  (NC_002516) DNA polymerase I [Pseudomonas ...    42  0.019
ref|NP_353146.1|  (NC_003062) AGR_C_169p [Agrobacterium tume...    41  0.043
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
 pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
 gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
           SPO2]
 emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
          Length = 648

 Score =  264 bits (674), Expect = 3e-069
 Identities = 201/633 (31%), Positives = 296/633 (46%), Gaps = 83/633 (13%)

Query: 59  QCYCDMAKARRFSLPGSLDTAAKALR-TAEKDKKGKQLIGKLCRPVSATKARPEPRWTLY 117
           Q  C    A    LPG+LD  AKAL+ +A+KDK GK LI     P   TKA  +    L 
Sbjct: 94  QWRCTAVHATTLGLPGNLDGVAKALKLSAQKDKAGKALIRYFSVPCKPTKANGQRVRNLP 153

Query: 118 TA-PQDYADLYRYCDQDVKSEDHVSALVPDMTPAERE--TWLADQRINARGVLVDVESLD 174
              P+ +     YC QDV+ E  +   +    P E E   W  DQ IN RGV +DV+ + 
Sbjct: 154 EHDPEKWEKFKVYCIQDVEVERAIKNRISKFEPLESEHKLWALDQEINDRGVRIDVDLVK 213

Query: 175 NALDIVGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATVGCRMHNLKSETVAETLERD 234
           +A+    Q       E   +T   + + +  A+L  +L   G  + +L  + + E +E  
Sbjct: 214 HAIACDEQYQAGLIAEAKKLTG--LPNPNSTAQLKKWLEEKGLTISSLAKDKIEELIENT 271

Query: 235 DLNPTARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAG-TGRWSAGGVQLQ 293
           + + T  R+L +R+ +A  +VKK  +++  +  D R+R    F+GA  TGRW+   VQ+Q
Sbjct: 272 N-DETVHRVLRLRQEMAKTSVKKYLAMEKALCPDNRVRGLLQFYGASRTGRWAGRLVQVQ 330

Query: 294 NLTSKGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWALKDL 353
           NL                                          NK   +D  R  LK  
Sbjct: 331 NLPQ----------------------------------------NKIEDLDTARNLLKGG 350

Query: 354 KHRNLDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEV 413
            +  ++L+   +G    +LS  +R  FI + G +F   DFSAIEA V+A LA  EWR+EV
Sbjct: 351 HYEAIELL---YGQVPFVLSQLVRTAFIPSEGNEFYVSDFSAIEARVIAWLAGEEWRLEV 407

Query: 414 FRTHGKIYEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASGYGGWIGAWKAF 473
           F THGKIYE SA+++  VP E +           PLR+  GKVAELA GY G  GA    
Sbjct: 408 FNTHGKIYEASAAQMFKVPVESI-------TKGSPLRQK-GKVAELALGYQGGKGALIQM 459

Query: 474 GAEEHFEDDDAMKKAILGWRDASPEIVEFWGGQYRRDPYTRKWVREYYGLEGAVIMAILN 533
           GA      +  + + +  WR A+ +IV+FW                 Y +E A I A+  
Sbjct: 460 GALNMGLAEGELPELVKAWRTANKKIVKFW-----------------YDVEAAAIKAVKE 502

Query: 534 PGKCYAVGDITYAVFDDVLYCRLPSGRFLQYHQPKLIESDSWKGPEYQVTFMGYNTNSQK 593
                    +T+     +L+ +LPSGR L Y +PKL E D   G E  +T+ G   + + 
Sbjct: 503 RKPVKLQHGLTFLYESGILFVQLPSGRRLAYAKPKL-ELDERFGKE-ALTYEGKLESGK- 559

Query: 594 GPVGWVRMDTYGGRLAENVTQXXXXXXXXXXMKQCENNGYPVVMHTHDELIAEVPGDFGS 653
               W R++TYGG+L EN+ Q          + + +N GY  VMH HDE + +VP     
Sbjct: 560 ----WGRLNTYGGKLVENIVQATARDCLAITLMRLDNAGYKTVMHVHDEAVLDVPRGKNE 615

Query: 654 VEEMAALMTGREPWREWWPIRAAGWEDDRYQKD 686
           ++++ A+M     W +  P+ A G+  D Y+KD
Sbjct: 616 LDKVEAIMGEPISWAKGLPLTADGFVTDYYKKD 648
>gb|AAL82293.1| (AF424782) DNA polymerase A domain [Staphylococcus aureus phage phi
           12]
          Length = 650

 Score =  253 bits (646), Expect = 5e-066
 Identities = 197/638 (30%), Positives = 302/638 (46%), Gaps = 91/638 (14%)

Query: 58  EQCYCDMAKARRFSLPGSLDTAAKALRT-AEKDKKGKQLIGKL---CRP--VSATKARPE 111
           E+  C M  + R  LP SLD   + LR  ++KDK GK LI      C+P  V+  + R  
Sbjct: 95  EEWICTMVNSMRIGLPASLDKVGEVLRLQSQKDKAGKNLIRYFSIPCKPTKVNGGRTRNL 154

Query: 112 PRWTLYTAPQDYADLYRYCDQDVKSEDHVSALVPD--MTPAERETWLADQRINARGVLVD 169
           P   L    + +     YC +DV+ E  ++  + D  +T  E+  W+ DQ IN RG+ + 
Sbjct: 155 PEHDL----EKWQQFIDYCIRDVEVEMAIANKIKDFPVTAIEQTYWVFDQHINDRGIKLS 210

Query: 170 VESLDNALDIVGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATVGCRMHNLKSETVAE 229
            +SL    +++ + ++   ++ A    G     S    L       G  + NL+ +TV E
Sbjct: 211 -KSLMLGANVLDKQSKEELLKQAKHITGLENPNSPTQLLAWLKDEQGLDIPNLQKKTVQE 269

Query: 230 TLERDDLNPTARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAGTGRWSAGG 289
            L+  +    A+++LEIR  ++  +VKK   +   + SD R+R  + F+GAGTGRW+  G
Sbjct: 270 YLK--EATGKAKKMLEIRLQMSKTSVKKYNKMHDMMCSDERVRGLFQFYGAGTGRWAGRG 327

Query: 290 VQLQNLTSKGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWA 349
           VQLQNLT                                           D  ++  R  
Sbjct: 328 VQLQNLTKHYIS--------------------------------------DTELEIARDL 349

Query: 350 LKDLKHRNLDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEW 409
           +K+ +  +LDL++++   P  LLS  +R  F A  G +    DFSAIEA V+A  A+ +W
Sbjct: 350 IKEQRFDDLDLLLNV--HPQDLLSQLVRTTFTAEEGNELAVSDFSAIEARVIAWYAKEQW 407

Query: 410 RIEVFRTHGKIYEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASGYGGWIGA 469
           R++VF THGKIYE SAS++  VP E +       K D PLR+  GKV+ELA GY G  GA
Sbjct: 408 RLDVFNTHGKIYEASASQMFNVPVESI------TKGD-PLRQK-GKVSELALGYQGGAGA 459

Query: 470 WKAFGAEEHFEDDDAMKKAILGWRDASPEIVEFWGGQYRRDPYTRKWVREYYGLEGAVIM 529
            KA GA E   +++ ++  +  WR+A+P IV FW                    + A I 
Sbjct: 460 LKAMGALEMGIEENELQGLVDSWRNANPNIVNFWKA-----------------CQEAAIN 502

Query: 530 AILNPGKCYAVGDITYAVFDDVLYCRLPSGRFLQYHQPK-LIESDSWKGPEYQVTFMGYN 588
            + +  K +    + + +    L   LPSGR L Y  PK L+  +SW      V FMG +
Sbjct: 503 TVKSR-KTHHTHGLRFYMKKGFLMIELPSGRALAY--PKALVGENSWGSQV--VEFMGLD 557

Query: 589 TNSQKGPVGWVRMDTYGGRLAENVTQXXXXXXXXXXMKQCENNGYPVVMHTHDELIAEVP 648
            N +     W ++ TYGG+L EN+ Q          + + E  G+ +V H HDE+I E+P
Sbjct: 558 LNRK-----WSKLKTYGGKLVENIVQATARDLLAISIARLEALGFKIVGHVHDEVIVEIP 612

Query: 649 GDFGSVEEMAALMTGREPWREWWPIRAAGWEDDRYQKD 686
                ++E+  +M     W +   + + G+    Y KD
Sbjct: 613 RGSNGLKEIETIMNKPVDWAKGLNLNSDGFTSPFYMKD 650
>ref|NP_646242.1| (NC_003923) DNA polymerase [Staphylococcus aureus subsp. aureus
           MW2]
 dbj|BAB95290.1| (AP004827) DNA polymerase [Staphylococcus aureus subsp. aureus MW2]
          Length = 653

 Score =  252 bits (644), Expect = 9e-066
 Identities = 195/637 (30%), Positives = 299/637 (46%), Gaps = 89/637 (13%)

Query: 58  EQCYCDMAKARRFSLPGSLDTAAKALRTA-EKDKKGKQLIGKL---CRP--VSATKARPE 111
           E+  C M  + R  LP SLD   + LR   +KDK GK LI      C+P  V+  + R  
Sbjct: 98  EEWICTMVNSMRIGLPASLDKVGEVLRLQNQKDKAGKNLIRYFSIPCKPTKVNGGRTRNL 157

Query: 112 PRWTLYTAPQDYADLYRYCDQDVKSEDHVSALVPD--MTPAERETWLADQRINARGVLVD 169
           P   L    + +     YC +DV+ E  ++  + D  +T  E+  W+ DQ IN RG+ + 
Sbjct: 158 PEHDL----EKWQQFIDYCIRDVEVEMTIAHKIKDFPVTAIEQAYWVFDQHINDRGIKLS 213

Query: 170 VESLDNALDIVGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATVGCRMHNLKSETVAE 229
            +SL    +++ + ++   +  A    G     S    L       G  + NL+ +TV E
Sbjct: 214 -KSLMLGANVLDKQSKEELLNQAKHITGLENPNSPTQLLAWLKDDQGLDIPNLQKKTVQE 272

Query: 230 TLERDDLNPTARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAGTGRWSAGG 289
            L+  +    A+++LEIR  ++  +VKK   +   + SD R+R  + F+GAGTGRW+  G
Sbjct: 273 YLK--EATGKAKKMLEIRLQMSKTSVKKYNKMHDMMCSDERVRGLFQFYGAGTGRWAGRG 330

Query: 290 VQLQNLTSKGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWA 349
           VQLQNLT                                           D  ++  R  
Sbjct: 331 VQLQNLTKHYIS--------------------------------------DTELEIARDL 352

Query: 350 LKDLKHRNLDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEW 409
           +K+ +  +LDL++++   P  LLS  +R  F A  G +    DFSAIEA V+A  A+ +W
Sbjct: 353 IKEQRFDDLDLLLNV--HPQDLLSQLVRTTFTAEEGNELAVSDFSAIEARVIAWYAKEQW 410

Query: 410 RIEVFRTHGKIYEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASGYGGWIGA 469
           R++VF THGKIYE SAS++  VP E +       K D PLR+  GKV+ELA GY G  GA
Sbjct: 411 RLDVFNTHGKIYEASASQMFNVPVESI------TKGD-PLRQK-GKVSELALGYQGGAGA 462

Query: 470 WKAFGAEEHFEDDDAMKKAILGWRDASPEIVEFWGGQYRRDPYTRKWVREYYGLEGAVIM 529
            KA GA E   +++ ++  +  WR+A+P IV FW                    + A I 
Sbjct: 463 LKAMGALEMGIEENELQGLVDSWRNANPNIVNFW-----------------KACQEAAIN 505

Query: 530 AILNPGKCYAVGDITYAVFDDVLYCRLPSGRFLQYHQPKLIESDSWKGPEYQVTFMGYNT 589
            +    K +    + + +    L   LPSGR L Y +  + E +SW      V FMG + 
Sbjct: 506 TV-KSRKTHHTHGVRFYMKKGFLMIELPSGRALAYPKASVGE-NSW--GSQVVEFMGLDL 561

Query: 590 NSQKGPVGWVRMDTYGGRLAENVTQXXXXXXXXXXMKQCENNGYPVVMHTHDELIAEVPG 649
           N +     W ++ TYGG+L EN+ Q          + + E +G+ +V H HDE+I E+P 
Sbjct: 562 NRK-----WSKLKTYGGKLVENIVQATARDLLAISIARLEASGFKIVGHVHDEVIVEIPR 616

Query: 650 DFGSVEEMAALMTGREPWREWWPIRAAGWEDDRYQKD 686
               ++E+  +M     W +   + + G+    Y KD
Sbjct: 617 GSNGLKEIETIMNKPVDWAKGLNLNSDGFTSPFYMKD 653
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
 sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
 gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
          Length = 993

 Score =  159 bits (401), Expect = 1e-037
 Identities = 159/587 (27%), Positives = 246/587 (41%), Gaps = 102/587 (17%)

Query: 55  LQLEQCYCDMAKARRFSLPGSLDTAAKALRTAE---KDKKGKQLIGKLCRPVSATKARPE 111
           L L + +  + +A    LPG+L              KDK+GK LI  LC+P    K    
Sbjct: 89  LPLSRVHDTLVQALAHGLPGALGLLCDIFNVNSDKAKDKEGKALISLLCKP--RPKNSKI 146

Query: 112 PRWTLYTAPQDYADLYRYCDQDVKSEDHVSALVPD--MTPAERETWLADQRINARGVLVD 169
            R T  T  +++     Y   D+ +   +   +P+  M   E E W  DQ+IN RG+ +D
Sbjct: 147 QRATALTHAEEWQRFKDYAGSDILAMREIYQHLPNWNMNVHETELWQLDQKINRRGMCMD 206

Query: 170 VESLDNALDIVGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATV-GCRMHNLKSETVA 228
           VE   +AL  V    +R +     +T+ AV + ++   L   +A+  G  + ++++ T+ 
Sbjct: 207 VELAKSALTAVENEQKRLSTVTQQLTDNAVQNATQRDALLQHIASAFGITLPDMQASTLQ 266

Query: 229 ETLERDDLNPTARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAG-TGRWSA 287
             +   D+ P  R +L +R      +  K  +L   +S+DGRLR    F GA  TGRW+ 
Sbjct: 267 RRINDPDIPPALRELLSVRLQSCTTSTSKYKALLKSVSADGRLRGTKQFCGASRTGRWAG 326

Query: 288 GGVQLQNLTSKGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVR 347
              Q  NL                                P    N        T+D   
Sbjct: 327 RVFQPDNLPR------------------------------PTLDPN--------TIDN-- 346

Query: 348 WALKDLKHRNLDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARC 407
             ++ LK    +LI D   D + L S  LRG  IA PGKK V  D S IE  +LA LA  
Sbjct: 347 -GIEALKAGCAELICD---DIMQLTSSALRGCIIAPPGKKLVISDLSNIEGRMLAWLAGE 402

Query: 408 EWRIEVFR--THGK---IYEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASG 462
            W++  F    +GK   +Y+++ ++   +  E++   KE        ++ +GKV EL  G
Sbjct: 403 NWKVNAFSEFDNGKGDDLYKLAYARAFNLLPEDVT--KE--------QRQIGKVMELGLG 452

Query: 463 YGGWIGAWKAFGAEEHFEDDD-----------AMKKAILGWRDASPEIVEFWGGQYR--- 508
           YGG + A+  F      + D+            +K+  + W   S E  + +G   +   
Sbjct: 453 YGGGVAAFLTFALAYGLDLDELAEAALPNIPPGVKREAMRWYQKSVETDKTYGLSEKIFV 512

Query: 509 -RDPYTRKW-------VREYYGLEGAVIMAILNPGKCYAVGDITYAVFDDVLYCRLPSGR 560
             D   R W       V  +Y +E AV  AI +PG  +    ++       L   LPSGR
Sbjct: 513 TCDSLKRMWRNAHPQTVSFWYDIEDAVKQAIQSPGIAFKCRKLSVRRDKSWLRICLPSGR 572

Query: 561 FLQYHQPKLIESDSWKGPEYQVTFMGYNTNSQKGPVGWVRMDTYGGR 607
            L Y   ++           Q+T+MG N  S+K    W R+ TYGG+
Sbjct: 573 SLCYPSARIENG--------QITYMGTNPYSRK----WERLKTYGGK 607
 Score = 45.1 bits (105), Expect = 0.003
 Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 610 ENVTQXXXXXXXXXXMKQCENNGYPVVMHTHDELIAEVPG-DFGSVEEMAALMTGREPWR 668
           EN+ Q          M   E  GY +V+  HDE+I+E P     S E ++ L++    W 
Sbjct: 916 ENICQAAARDVLAYNMPPIEKAGYEIVLTVHDEIISEAPDTPQFSAEGLSKLLSFNSDWA 975

Query: 669 EWWPIRAAGWEDDRYQKD 686
              P+ A G+E  RY+K+
Sbjct: 976 WDLPLSANGFETYRYRKE 993
>ref|NP_299569.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
 pir||G82574 phage-related protein XF2290 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF85089.1|AE004041_1 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
          Length = 312

 Score = 89.7 bits (221), Expect = 1e-016
 Identities = 86/298 (28%), Positives = 122/298 (40%), Gaps = 48/298 (16%)

Query: 122 DYADLYRYCDQDVKSEDHVSALVP--DMTPAERETWLADQRINARGVLVDVESLDNALDI 179
           D+A    Y  +DV +   V   +P  +   AE   W  DQ IN RGVL+D + +  A+  
Sbjct: 2   DWAQFVEYARRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLIDTDLVQAAIGA 61

Query: 180 VGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATV-GCRMHNLKSETVAETLERDDLNP 238
           V +  +       A+T GAV + ++   L   L T  G  + +++  TVA  ++   L  
Sbjct: 62  VERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVARCVDDPALPE 121

Query: 239 TARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAG-TGRWSAGGVQLQNLTS 297
           T R +L IR   +  +  K  +L    S DGRLR    F GA  TGRW+    Q  NL  
Sbjct: 122 TVRELLSIRRQASSTSTAKYQALLHCTSPDGRLRGTLQFNGASRTGRWAGRLFQPHNL-- 179

Query: 298 KGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWALKDLKHRN 357
             P+             S  V  +G                    +DA++    DL    
Sbjct: 180 --PRPTL----------SQEVIAVG--------------------IDAMKAGCVDL---- 203

Query: 358 LDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFR 415
                 ++ D + L S  LR   IA   KK V  D S IE  VLA LA    ++  FR
Sbjct: 204 ------VFDDVMALTSSALRSCLIAPTHKKLVVADLSIIEGRVLAWLAGETPKLHAFR 255
>gb|AAA23103.1| (M94320) putative DNA polymerase [Citrus greening
           disease-associated bacterium]
          Length = 206

 Score = 62.4 bits (150), Expect = 2e-008
 Identities = 47/165 (28%), Positives = 73/165 (43%), Gaps = 24/165 (14%)

Query: 17  PTDLIDHIANGGPIA-AWNITFEFWIWNMVLAKSEGWPWLQLEQCYCDMAKARRFSLPGS 75
           P+DL+ ++ +   +  A N  FE  ++   L        +  ++  C    AR   LP S
Sbjct: 54  PSDLLQYLRDETVMCVAHNSLFERILFKKTLGID-----IPSKRWICTSVLARTNGLPSS 108

Query: 76  LDTAAKALRTAE---KDKKGKQLIGKLCR------PVSATKARPEPRWTLYTAPQDYADL 126
           L  A  ALR  E   K ++GK LI + C+      P   T+A     W L+         
Sbjct: 109 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFR-------- 160

Query: 127 YRYCDQDVKSEDHVSALVPDMTPAERETWLADQRINARGVLVDVE 171
             YC +DV++   +   +  + P ER  WL DQ IN RG  +D++
Sbjct: 161 -EYCKRDVEATRELFKSLTPLNPEERTLWLLDQEINDRGYRIDLD 204
>gb|AAB25243.1| DNA polymerase homolog [bacterium-like organism, citrus greening
           disease-associated, Peptide, 207 aa]
          Length = 207

 Score = 62.4 bits (150), Expect = 2e-008
 Identities = 47/165 (28%), Positives = 73/165 (43%), Gaps = 24/165 (14%)

Query: 17  PTDLIDHIANGGPIA-AWNITFEFWIWNMVLAKSEGWPWLQLEQCYCDMAKARRFSLPGS 75
           P+DL+ ++ +   +  A N  FE  ++   L        +  ++  C    AR   LP S
Sbjct: 55  PSDLLQYLRDETVMCVAHNSLFERILFKKTLGID-----IPSKRWICTSVLARTNGLPSS 109

Query: 76  LDTAAKALRTAE---KDKKGKQLIGKLCR------PVSATKARPEPRWTLYTAPQDYADL 126
           L  A  ALR  E   K ++GK LI + C+      P   T+A     W L+         
Sbjct: 110 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFR-------- 161

Query: 127 YRYCDQDVKSEDHVSALVPDMTPAERETWLADQRINARGVLVDVE 171
             YC +DV++   +   +  + P ER  WL DQ IN RG  +D++
Sbjct: 162 -EYCKRDVEATRELFKSLTPLNPEERTLWLLDQEINDRGYRIDLD 205
>sp|Q9S1G2|DPO1_RHILE DNA polymerase I (POL I)
 gb|AAD45559.1|U86403_1 (U86403) DNA polymerase I [Rhizobium leguminosarum]
          Length = 1016

 Score = 43.9 bits (102), Expect = 0.007
 Identities = 24/61 (39%), Positives = 34/61 (55%)

Query: 376 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKIYEMSASKISGVPFEE 435
           +R  FI+ PG K +  D+S IE  VLA +A      + F     I+ M+AS++ GVP E 
Sbjct: 771 IRTAFISTPGHKLISADYSQIELRVLAHVAEIPQLTKAFEDGVDIHAMTASEMFGVPVEG 830

Query: 436 M 436
           M
Sbjct: 831 M 831
>ref|NP_384281.1| (NC_003047) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
           meliloti]
 emb|CAC41562.1| (AL591782) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
           meliloti]
          Length = 1004

 Score = 42.4 bits (98), Expect = 0.019
 Identities = 24/61 (39%), Positives = 34/61 (55%)

Query: 376 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKIYEMSASKISGVPFEE 435
           +R  F+A PG K V  D+S IE  VLA +A      + F     I+ M+AS++ GVP + 
Sbjct: 761 IRTAFVATPGHKLVSADYSQIELRVLAHVADIPQLRQAFADGVDIHAMTASEMFGVPVDG 820

Query: 436 M 436
           M
Sbjct: 821 M 821
>ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas aeruginosa]
 pir||F82958 DNA polymerase I PA5493 [imported] - Pseudomonas aeruginosa
           (strain PAO1)
 gb|AAG08878.1|AE004962_2 (AE004962) DNA polymerase I [Pseudomonas aeruginosa]
          Length = 913

 Score = 42.4 bits (98), Expect = 0.019
 Identities = 19/61 (31%), Positives = 37/61 (60%)

Query: 376 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKIYEMSASKISGVPFEE 435
           +R  F+A  G K +  D+S IE  ++A LA+ +  ++ FR    ++  +A+++ GVP E+
Sbjct: 674 IRQAFVAPQGYKLLAADYSQIELRIMAHLAKDDGLLDAFRHDLDVHRATAAEVFGVPLED 733

Query: 436 M 436
           +
Sbjct: 734 V 734
>ref|NP_353146.1| (NC_003062) AGR_C_169p [Agrobacterium tumefaciens] [Agrobacterium
           tumefaciens str. C58 (Cereon)]
 ref|NP_530819.1| (NC_003304) DNA polymerase I [Agrobacterium tumefaciens str. C58
           (U. Washington)]
 gb|AAK85931.1| (AE007952) AGR_C_169p [Agrobacterium tumefaciens str. C58 (Cereon)]
 gb|AAL41135.1| (AE008985) DNA polymerase I [Agrobacterium tumefaciens str. C58 (U.
           Washington)]
          Length = 1031

 Score = 41.2 bits (95), Expect = 0.043
 Identities = 24/61 (39%), Positives = 33/61 (53%)

Query: 376 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKIYEMSASKISGVPFEE 435
           +R  FI+ PG K +  D+S IE  VLA +A        F     I+ M+AS++ GVP E 
Sbjct: 788 IRTAFISTPGHKLLSADYSQIELRVLAHVADIPQLRNAFENGIDIHAMTASEMFGVPVEG 847

Query: 436 M 436
           M
Sbjct: 848 M 848
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.319 0.137 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 473,031,019 Number of Sequences: 1028106 Number of extensions: 20509499 Number of successful extensions: 42979 Number of sequences better than 5.0e-02: 15 Number of HSP's better than 0.1 without gapping: 12 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 42901 Number of HSP's gapped (non-prelim): 36 length of query: 686 length of database: 324,535,921 effective HSP length: 128 effective length of query: 558 effective length of database: 192,938,353 effective search space: 107659600974 effective search space used: 107659600974 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 95 (41.2 bits) Query= orf40 [10952-11194] (241 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.137 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 164,092,175 Number of Sequences: 1028106 Number of extensions: 6878976 Number of successful extensions: 10673 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 10673 Number of HSP's gapped (non-prelim): 0 length of query: 241 length of database: 324,535,921 effective HSP length: 118 effective length of query: 123 effective length of database: 203,219,413 effective search space: 24995987799 effective search space used: 24995987799 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 89 (38.9 bits) Query= orf41 [11217-11573] (355 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.319 0.137 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 125,280,771 Number of Sequences: 1028106 Number of extensions: 4585833 Number of successful extensions: 5950 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 5950 Number of HSP's gapped (non-prelim): 0 length of query: 355 length of database: 324,535,921 effective HSP length: 122 effective length of query: 233 effective length of database: 199,106,989 effective search space: 46391928437 effective search space used: 46391928437 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 92 (40.0 bits) Query= orf42 [11577-11953] (375 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_297976.1|  (NC_002488) phage-related protein [Xylella...    46  8e-004
ref|NP_299800.1|  (NC_002488) phage-related protein [Xylella...    46  8e-004
ref|NP_051012.1|  (NC_000935) hypothetical protein [Acyrthos...    45  0.002
>ref|NP_297976.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
 pir||C82775 phage-related protein XF0686 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF83496.1|AE003912_8 (AE003912) phage-related protein [Xylella fastidiosa 9a5c]
          Length = 425

 Score = 45.8 bits (107), Expect = 8e-004
 Identities = 47/177 (26%), Positives = 73/177 (40%), Gaps = 9/177 (5%)

Query: 4   QHASIPPSGAPIWGHCSGYLGVMHKVADFDTDATRAGTASHWAFSEAVLNDLDAEDYIGH 63
           QHA + PS A  W HC   + +     D  +     GT +H   + A+    DA  Y+G 
Sbjct: 3   QHAMLSPSSAHRWLHCPASVPLTRTCKDDASPFADEGTVAHTVAANALRTGSDASAYVGE 62

Query: 64  A-DPDGTI--VDEGMAHGAQLMVDMVREDMEKHPGGQLYIEQRVYMPDIHPDNWGTLDYA 120
             + +G    V   MA   Q  VD VR        G   +EQ +++  I  +        
Sbjct: 63  RHEINGRTWEVTAEMAASVQEYVDYVRA-----IAGVRLVEQPLHIASITGEQGAKGTAD 117

Query: 121 YVLPAIKRVILGDYKHGH-LEVSPERNLQLVDYLKGLENLVGYTFDGWRIDLKVCQP 176
            V+ A   + + D K+G  ++V  E N QL  Y     +  G+  D  ++ L + QP
Sbjct: 118 VVILAGDALTIVDLKYGKGVKVFAEGNEQLQLYALAALSEFGWVEDFQQVRLVIVQP 174
>ref|NP_299800.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
 pir||H82547 phage-related protein XF2522 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF85320.1|AE004059_10 (AE004059) phage-related protein [Xylella fastidiosa 9a5c]
          Length = 425

 Score = 45.8 bits (107), Expect = 8e-004
 Identities = 47/177 (26%), Positives = 73/177 (40%), Gaps = 9/177 (5%)

Query: 4   QHASIPPSGAPIWGHCSGYLGVMHKVADFDTDATRAGTASHWAFSEAVLNDLDAEDYIGH 63
           QHA + PS A  W HC   + +     D  +     GT +H   + A+    DA  Y+G 
Sbjct: 3   QHAMLSPSSAHRWLHCPASVPLTRTCQDDASPFADEGTVAHTVAANALRTGSDASAYVGE 62

Query: 64  A-DPDGTI--VDEGMAHGAQLMVDMVREDMEKHPGGQLYIEQRVYMPDIHPDNWGTLDYA 120
             + +G    V   MA   Q  VD VR        G   +EQ +++  I  +        
Sbjct: 63  RHEINGRTWEVTAEMAASVQEYVDYVRA-----IAGVRLVEQPLHIASITGEQGAKGTAD 117

Query: 121 YVLPAIKRVILGDYKHGH-LEVSPERNLQLVDYLKGLENLVGYTFDGWRIDLKVCQP 176
            V+ A   + + D K+G  ++V  E N QL  Y     +  G+  D  ++ L + QP
Sbjct: 118 VVILAGDALTIVDLKYGKGVKVFAEGNEQLQLYALAALSEFGWVEDFQQVRLVIVQP 174
>ref|NP_051012.1| (NC_000935) hypothetical protein [Acyrthosiphon pisum bacteriophage
           APSE-1]
 sp|Q9T1P7|VP51_BPAPS Putative protein P51
 gb|AAF03994.1|AF157835_51 (AF157835) P51 [Acyrthosiphon pisum bacteriophage APSE-1]
          Length = 439

 Score = 44.7 bits (104), Expect = 0.002
 Identities = 50/193 (25%), Positives = 83/193 (42%), Gaps = 26/193 (13%)

Query: 4   QHASIPPSGAPIWGHCSGYLGVMHKVADFDTDATRAGTASHWAFSEAVLN---------- 53
           +HA + PS A  W  C G   +   + D  +     G+A+H A +E++L           
Sbjct: 10  EHAKLAPSSAHRWLKCPGSTALEATLPDKTSPFAEEGSAAH-ALAESILKMRQNPFSDGR 68

Query: 54  ----DLDAEDYIGH---ADPDGTIVDEGMAHGAQLMVDMVREDMEKHPGGQLYIEQRV-Y 105
                 D ++YIG       +   VDE M    Q  +D V    +   G  L +E+RV +
Sbjct: 69  RRTYGFDVKEYIGTYPLLKANSPQVDEEMIEHVQTYIDTV---WQLANGKMLQVEERVDF 125

Query: 106 MPDIHPDN-WGTLDYAYVLPAIKRVILGDYKHGH-LEVSPERNLQLVDYLKGLENLVGYT 163
              I  DN +GT D   ++ +   + + D K+G  ++V  + N QL+ Y  G  +     
Sbjct: 126 SAVIGVDNSFGTAD--AIIVSSDELQIHDLKYGKGVKVDAQNNEQLMLYALGALHQFDLV 183

Query: 164 FDGWRIDLKVCQP 176
           +D   + L + QP
Sbjct: 184 YDFKTVRLFIHQP 196
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.317 0.135 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 254,503,813 Number of Sequences: 1028106 Number of extensions: 10752186 Number of successful extensions: 25572 Number of sequences better than 5.0e-02: 3 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 25572 Number of HSP's gapped (non-prelim): 3 length of query: 375 length of database: 324,535,921 effective HSP length: 123 effective length of query: 252 effective length of database: 198,078,883 effective search space: 49915878516 effective search space used: 49915878516 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 92 (40.0 bits) Query= orf43 [11954-12238] (283 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.309 0.127 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 185,208,683 Number of Sequences: 1028106 Number of extensions: 7804335 Number of successful extensions: 27615 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 27615 Number of HSP's gapped (non-prelim): 0 length of query: 283 length of database: 324,535,921 effective HSP length: 120 effective length of query: 163 effective length of database: 201,163,201 effective search space: 32789601763 effective search space used: 32789601763 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.7 bits) S2: 90 (39.3 bits) Query= orf44 [12267-12335] (67 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.323 0.134 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 31,807,219 Number of Sequences: 1028106 Number of extensions: 669762 Number of successful extensions: 2653 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2653 Number of HSP's gapped (non-prelim): 0 length of query: 67 length of database: 324,535,921 effective HSP length: 43 effective length of query: 24 effective length of database: 280,327,363 effective search space: 6727856712 effective search space used: 6727856712 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 85 (37.4 bits) Query= orf45 [12332-13051] (718 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAB63670.1|  (AJ251789) hypothetical protein [Lactobacil...   202  9e-051
ref|NP_511031.1|  (NC_003291) helicase [Listeria phage 2389]...   197  5e-049
ref|NP_050128.1|  (NC_000896) putative DEAH-family helicase ...   196  6e-049
ref|NP_049952.1|  (NC_000871) putative helicase [Streptococc...   190  6e-047
ref|NP_049999.1|  (NC_000872) putative helicase [Streptococc...   190  6e-047
pir||T13299  probable helicase - Streptococcus phage phi-O12...   190  6e-047
ref|NP_478500.1|  (NC_003267) ORF_ID:all8075~probable helica...   189  1e-046
ref|NP_471919.1|  (NC_003212) similar to DEAH-family helicas...   188  2e-046
gb|AAL74161.1|AF442520_1  (AF442520) putative helicase [Stre...   187  3e-046
sp|P24125|V51K_BPL79  51.5 kDa protein >gi|76364|pir||JU0445...   179  1e-043
ref|NP_490263.1|  (NC_003276) putative helicase [Nostoc sp. ...   170  6e-041
ref|NP_607398.1|  (NC_003485) hypothetical phage protein [St...   169  1e-040
ref|NP_459877.1|  (NC_003197) Fels-1 prophage; putative DNA ...   120  8e-026
ref|NP_142208.1|  (NC_000961) hypothetical protein [Pyrococc...    85  3e-015
ref|NP_599062.1|  (NC_003444) unknown [Shigella flexneri bac...    83  1e-014
ref|NP_602261.1|  (NC_003450) COG1061:DNA or RNA helicases o...    66  2e-009
ref|XP_132906.1|  (XM_132906) similar to KIAA0111 gene produ...    57  6e-007
ref|NP_248512.1|  (NC_000909) ATP dependent RNA helicase, pu...    55  3e-006
sp|O02494|IF4A_CRYPV  Eukaryotic initiation factor 4A (eIF-4...    54  5e-006
ref|NP_540200.1|  (NC_003317) Hypothetical Cytosolic Protein...    53  1e-005
ref|NP_531749.1|  (NC_003304) conserved hypothetical protein...    49  3e-004
ref|NP_354075.1|  (NC_003062) AGR_C_1942p [Agrobacterium tum...    49  3e-004
gb|AAB67607.1|  (U73459) translational initiation factor eIF...    49  3e-004
ref|NP_385209.1|  (NC_003047) CONSERVED HYPOTHETICAL PROTEIN...    47  6e-004
emb|CAB38640.1|  (Y17288) RNA helicase [Plasmodium falciparum]     45  0.004
>emb|CAB63670.1| (AJ251789) hypothetical protein [Lactobacillus casei bacteriophage
           A2]
          Length = 455

 Score =  202 bits (515), Expect = 9e-051
 Identities = 144/422 (34%), Positives = 214/422 (50%), Gaps = 52/422 (12%)

Query: 1   MFKLRDYQQLAYNRI---MAAWEKYRSVLAVLPTGAGKTVIFSKIIH---DHTGAAAAIV 54
           MF+L  YQ+   N+    +AA  K  SVL V P G+GK+VI ++I     + +G    +V
Sbjct: 1   MFQLHPYQKELVNQAREKLAAGNK--SVLLVSPAGSGKSVIIAEIARLAVERSGHVMFMV 58

Query: 55  HRREIVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLT 114
           HR+E+V QI  +  +      +   A T+ T+ K                          
Sbjct: 59  HRQELVNQIVQTFQA----DEIDLQATTIMTVGK-------------------------- 88

Query: 115 SKSTLNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVG 174
             + LN       + + +L + DE HH + +  + K    + +   L  +A+P R +G G
Sbjct: 89  IANRLNR------LPRPSLIITDESHHSLAK-TYRKIYSFYADVPRLGFSASPWRMNGQG 141

Query: 175 LGKGEGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKAR 234
           LG       E M+EGP+  WLI++ +L+ + Y AP T ID++ L T   GD+  K++ A 
Sbjct: 142 LGN----VYESMVEGPSVKWLIDHQYLAPYDYYAP-TLIDVQKLQTSSTGDYTNKSMDAA 196

Query: 235 VVESHIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERD 294
           V ++ I GDVV HY+     ++AIV+A  +E ++Q+ E F+ AG TA     +T +G+RD
Sbjct: 197 VPKA-IFGDVVSHYQHLAGGRQAIVYAHSIEASKQVVEAFQSAGITAVHADAKTPKGQRD 255

Query: 295 HLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEY 354
            ++A F+ G I +L NVDL  EGF+VP V  +I+ RPT SL   +Q   R +R   GK  
Sbjct: 256 RIMADFKAGKITILSNVDLISEGFNVPDVGVIIMLRPTASLVLDIQQSMRGMRYKPGKR- 314

Query: 355 AIIIDPVRNWERHGPPTAVRNWSLMGREKGQGSGGDTIPQKTCDACTVPYEAYYKACPYC 414
           +IIID V N  R G P     W+L  R K +       P KTCDAC        K CP C
Sbjct: 315 SIIIDHVANAYRFGLPDTEHEWTLNDRPKKKKRDNTGPPIKTCDACYAVIPVQCKICPIC 374

Query: 415 GN 416
           G+
Sbjct: 375 GH 376
>ref|NP_511031.1| (NC_003291) helicase [Listeria phage 2389]
 emb|CAC85606.1| (AJ312240) helicase [Bacteriophage PSA]
          Length = 418

 Score =  197 bits (500), Expect = 5e-049
 Identities = 138/431 (32%), Positives = 213/431 (49%), Gaps = 47/431 (10%)

Query: 32  GAGKTVIFSKIIHDHT---GAAAAIVHRREIVAQISLSLASFGVKHRVIAPAKTLKTIRK 88
           GAGK+VI S+II   T        +VHR+E++ QI  +L    V    +           
Sbjct: 9   GAGKSVILSEIIRMTTHNKNYVLFLVHRKELIDQIRNTLIMNEVDMSYV----------- 57

Query: 89  KHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLNNAAIMRWVNQVTLGVYDEGHHYVEQGIW 148
               K G              VQT+  +        +   ++ +L + DE HH +    +
Sbjct: 58  ----KLGM-------------VQTIVRR--------LNKTSEPSLIIIDESHHVLANS-Y 91

Query: 149 AKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVMIEGPTTHWLIENGFLSKFVYRA 208
            K +H F  AK++  TATP R +G GLG       + +IE     WLIEN FL+ + Y A
Sbjct: 92  KKIIHHFSKAKVVGFTATPVRINGGGLGD----INDTLIEKVNVKWLIENQFLAHYKYYA 147

Query: 209 PQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVETAE 268
           P+T +    L   + G+F+  +L  +  +  I GDV++HY+K  + ++AI++A+ +  +E
Sbjct: 148 PET-VQTETLNVKRTGEFDMTSLDDQFNKRMIWGDVIKHYQKLADGEQAILYASSIYQSE 206

Query: 269 QMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVIL 328
           +MA  F  AG ++A + G+T +  RD ++ QF EG ++VL N+DL  EGFDVP    VI+
Sbjct: 207 KMAASFNAAGISSAHIDGKTPKPIRDDIIKQFREGELKVLCNLDLIGEGFDVPDCSTVIM 266

Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRNWSLMGREKGQGSG 388
            RPTQSL+ ++Q   R +R  +GK  A IID V N +R G P   R WSL  R KG  S 
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYKQGKT-ATIIDHVGNVKRFGLPDMERTWSLEPR-KGSNST 324

Query: 389 GDTIPQKTCDACTVPYEAYYKACPYCGNVNEIADRSAPDKVDGDLVELDLDAWNALFAEI 448
               P K C  C +      K C +CG+  ++  +        +L E+    +   ++  
Sbjct: 325 KAEAPVKICKECFMTVSQTAKKCEHCGHEFKVEVKPIQIDEAAELQEITEAVFKVNYSSP 384

Query: 449 ERAKMSDDDYE 459
           +  K   + YE
Sbjct: 385 DECKNMKELYE 395
>ref|NP_050128.1| (NC_000896) putative DEAH-family helicase [Lactobacillus
           bacteriophage phi adh]
 emb|CAB52498.1| (AJ131519) putative DEAH-family helicase [Lactobacillus
           bacteriophage phi adh]
          Length = 455

 Score =  196 bits (499), Expect = 6e-049
 Identities = 138/443 (31%), Positives = 220/443 (49%), Gaps = 50/443 (11%)

Query: 4   LRDYQQLAYNRIMAAWEK-YRSVLAVLPTGAGKTVIFSKI---IHDHTGAAAAIVHRREI 59
           +R YQ+   + +  + +  +++VL   P G+GK++  +++   I  +      +VHRRE+
Sbjct: 5   IRSYQKKLIDDLRLSLKHGHKNVLIQSPAGSGKSITMAELAKRITSNGERVLFVVHRREL 64

Query: 60  VAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTL 119
           V QI  + +S+GV   +                            C +  VQT + +  L
Sbjct: 65  VKQIKNTFSSWGVNMNL----------------------------CEIYMVQTASRR--L 94

Query: 120 NNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKGE 179
                  ++      + DE HH + +  + K  + F+NA +L  TATP R  G    KG 
Sbjct: 95  EKLMTPNYI------LVDEAHHSLAK-TYKKINNHFKNAHILGFTATPVRLSG----KGF 143

Query: 180 GGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVESH 239
               + ++ GP   WLIEN +L+ + Y +    ID   L     GD+  K+++    ++ 
Sbjct: 144 KDIYDDLVLGPKISWLIENHYLAPYTYYSVNL-IDQTKLKKSSTGDYTHKSIE-NAGKNI 201

Query: 240 IVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQ 299
           + GDV+Q Y K+  N +AI+++  V + +Q+A+EF +    A  V G+T++ +RD  +  
Sbjct: 202 VYGDVIQSYHKFANNTKAIIYSYSVHSCQQIAKEFNKNNIPAKEVDGKTKKEDRDKAMQD 261

Query: 300 FEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
           F +G I++LVN +L+ EG DVP  E VI+ RPTQSL+ F+Q   R +R    K+ AIIID
Sbjct: 262 FRDGKIKILVNAELYGEGVDVPDCETVIMLRPTQSLSLFIQQSMRCMRYQPDKQ-AIIID 320

Query: 360 PVRNWERHGPPTAVRNWSLMGREK--GQGSGGDTIPQKTCDACTVPYEAYYKACPYCGNV 417
            V N+ R G P   R W+L  R K   +  G D I  KTC  C     A Y  CP CG  
Sbjct: 321 QVANYTRFGLPDMDRVWTLEDRAKHPQREGGSDGIAIKTCPNCFGVIMASYHKCPLCGYS 380

Query: 418 NEIADRSAPDKVDGDLVELDLDA 440
            E   R   +    +L +++LDA
Sbjct: 381 FEAEFRKLAEDKRAELEKINLDA 403
>ref|NP_049952.1| (NC_000871) putative helicase [Streptococcus thermophilus
           bacteriophage Sfi19]
 gb|AAC97921.1| (AF077306) putative helicase; gp443 [Streptococcus thermophilus
           bacteriophage Sfi19]
 gb|AAD44071.1|AF115102_30 (AF115102) orf443 gp [Streptococcus thermophilus bacteriophage
           Sfi19]
          Length = 443

 Score =  190 bits (482), Expect = 6e-047
 Identities = 146/481 (30%), Positives = 232/481 (47%), Gaps = 62/481 (12%)

Query: 3   KLRDYQQLAYNRI-MAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HRRE 58
           +LR+YQ    N I  +     + ++   P  +GKTV+ + I    T     I+   HR+E
Sbjct: 2   ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61

Query: 59  IVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKST 118
           I  Q+  +                        FK+ G            V +  +T  S 
Sbjct: 62  INEQVVNT------------------------FKRNG------------VDMNLVTIDSV 85

Query: 119 LNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKG 178
              A  +  + + ++ + DE HH V+   + K +  + N+ +L  T TP R DG G    
Sbjct: 86  TKIARNLDRIQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143

Query: 179 EGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVES 238
               A+ ++ G    WL +NG ++ F Y AP   ID  +L   + G+F  K++    ++ 
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197

Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLA 298
            I GDV++HY K  + K+AIV+   VE +E ++  F   GYT+ ++SG+T    RD  + 
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257

Query: 299 QFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
            F +G + ++VN +LF EG D+P V+  I+ RPTQSL+ +LQ   RAL   EGK  AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316

Query: 359 DPVRNWERHGPPTAVRNWSLMGREKGQGS---GGDTIPQKTCDACTVPYEAYYKACPYCG 415
           D V N +RHG P A R WSL G  K +     G  T   +TCD C   + +  + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPT--TRTCDECYATFWSAERICPMCG 374

Query: 416 NVNEIADRSAPDKVDGDLV-ELDL-DAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRG 473
           + N+      P K + +++ E++L +    + +++E    SD    ++ +K    Q G  
Sbjct: 375 HENK------PTKEEVEIIREIELEERRQEVASKVETFVTSDQCQTVEELKEFAKQHGYK 428

Query: 474 P 474
           P
Sbjct: 429 P 429
>ref|NP_049999.1| (NC_000872) putative helicase [Streptococcus thermophilus
           bacteriophage Sfi21]
 ref|NP_056709.1| (NC_002214) putative helicase [Streptococcus thermophilus
           bacteriophage Sfi11]
 gb|AAC72435.1| (AF004379) orf443 [Streptococcus thermophilus bacteriophage Sfi21]
 gb|AAD44104.1|AF115103_34 (AF115103) orf443 gp [Streptococcus thermophilus bacteriophage
           Sfi21]
 gb|AAF63056.1|AF158600_10 (AF158600) putative helicase [Streptococcus thermophilus
           bacteriophage Sfi11]
 gb|AAF63085.1|AF158601_13 (AF158601) putative helicase [Streptococcus thermophilus
           bacteriophage SFi18]
          Length = 443

 Score =  190 bits (482), Expect = 6e-047
 Identities = 146/481 (30%), Positives = 232/481 (47%), Gaps = 62/481 (12%)

Query: 3   KLRDYQQLAYNRI-MAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HRRE 58
           +LR+YQ    N I  +     + ++   P  +GKTV+ + I    T     I+   HR+E
Sbjct: 2   ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61

Query: 59  IVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKST 118
           I  Q+  +                        FK+ G            V +  +T  S 
Sbjct: 62  INEQVVNT------------------------FKRNG------------VDMNLVTIDSV 85

Query: 119 LNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKG 178
              A  +  + + ++ + DE HH V+   + K +  + N+ +L  T TP R DG G    
Sbjct: 86  TKIARNLDRIQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143

Query: 179 EGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVES 238
               A+ ++ G    WL +NG ++ F Y AP   ID  +L   + G+F  K++    ++ 
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197

Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLA 298
            I GDV++HY K  + K+AIV+   VE +E ++  F   GYT+ ++SG+T    RD  + 
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257

Query: 299 QFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
            F +G + ++VN +LF EG D+P V+  I+ RPTQSL+ +LQ   RAL   EGK  AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316

Query: 359 DPVRNWERHGPPTAVRNWSLMGREKGQGS---GGDTIPQKTCDACTVPYEAYYKACPYCG 415
           D V N +RHG P A R WSL G  K +     G  T   +TCD C   + +  + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPT--TRTCDECYATFWSAERICPMCG 374

Query: 416 NVNEIADRSAPDKVDGDLV-ELDL-DAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRG 473
           + N+      P K + +++ E++L +    + +++E    SD    ++ +K    Q G  
Sbjct: 375 HENK------PTKEEVEIIREIELEERRQEVASKVETFVTSDQCQTVEELKEFAKQHGYK 428

Query: 474 P 474
           P
Sbjct: 429 P 429
>pir||T13299 probable helicase - Streptococcus phage phi-O1205
 gb|AAC79526.1| (U88974) ORF10 [Streptococcus thermophilus temperate bacteriophage
           O1205]
          Length = 443

 Score =  190 bits (482), Expect = 6e-047
 Identities = 145/481 (30%), Positives = 233/481 (48%), Gaps = 62/481 (12%)

Query: 3   KLRDYQQLAYNRI-MAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HRRE 58
           +LR+YQ    N I  +     + ++   P  +GKTV+ + I    T     I+   HR+E
Sbjct: 2   ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61

Query: 59  IVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKST 118
           I  Q+  +                        FK+ G            V +  +T  S 
Sbjct: 62  INEQVVNT------------------------FKRNG------------VDMNLVTIDSV 85

Query: 119 LNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKG 178
              A  +  + + ++ + DE HH V+   + K +  + N+ +L  T TP R DG G    
Sbjct: 86  TKIARNLDRIQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143

Query: 179 EGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVES 238
               A+ ++ G +  WL +NG ++ F Y AP   ID  +L   + G+F  K++    ++ 
Sbjct: 144 ---IADDIVLGKSVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197

Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLA 298
            I GDV++HY K  + K+AIV+   VE +E ++  F   GYT+ ++SG+T    RD  + 
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257

Query: 299 QFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
            F +G + ++VN +LF EG D+P V+  I+ RPTQSL+ +LQ   RAL   EGK  AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316

Query: 359 DPVRNWERHGPPTAVRNWSLMGREKGQGS---GGDTIPQKTCDACTVPYEAYYKACPYCG 415
           D V N +RHG P A R WS+ G  K +     G  T   +TCD C   + +  + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSIKGVNKTKKKLKLGEPT--TRTCDECYATFWSAERICPMCG 374

Query: 416 NVNEIADRSAPDKVDGDLV-ELDL-DAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRG 473
           + N+      P K + +++ E++L +    + +++E    SD    ++ +K    Q G  
Sbjct: 375 HENK------PTKEEVEIIREIELEERRQEVASKVETFVTSDQCQTVEELKEFAKQHGYK 428

Query: 474 P 474
           P
Sbjct: 429 P 429
>ref|NP_478500.1| (NC_003267) ORF_ID:all8075~probable helicase [Nostoc sp. PCC 7120]
 dbj|BAB77405.1| (AP003603) ORF_ID:all8075~probable helicase [Nostoc sp. PCC 7120]
          Length = 681

 Score =  189 bits (480), Expect = 1e-046
 Identities = 138/452 (30%), Positives = 206/452 (45%), Gaps = 51/452 (11%)

Query: 1   MFKLRDYQQLAYNRIMAAWEK-YRSVLAVLPTGAGKTVIFSKIIH---DHTGAAAAIVHR 56
           +++LR+YQ      I  +W++  R VLA LPTGAGKT+ F+ I             I HR
Sbjct: 7   IYELRNYQHQWIKDIWNSWKRGNRRVLAQLPTGAGKTICFAHICQKFFQKQQKVLVIAHR 66

Query: 57  REIVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSK 116
            E++ Q +  L        V  P   +K          G     P     V S+QTL  +
Sbjct: 67  IELITQAAEKLEQI-----VGEPVGIIK----------GGCAAHPERRIQVASIQTLARR 111

Query: 117 STLNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLG 176
                  I+     + L ++DE HH      + + +  ++ A++L VTATP+R DG G  
Sbjct: 112 D------ILELPLNIGLLLFDEAHHSSASS-YRRLIEHYQEAQILGVTATPQRIDGQGFQ 164

Query: 177 KGEGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKN-GDFNAKALKARV 235
           +      + ++ G +T  LI+ G+LSKF  R   T+  +  +   K+ GDF AK L   V
Sbjct: 165 E----LFDDLVVGISTSTLIKEGYLSKF--RLFTTNQTISTIGVKKSRGDFRAKELAVAV 218

Query: 236 VESHIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDH 295
                V ++ Q+Y K+ +N R ++FA  +E +  +A +FRR G  A  + GET    R  
Sbjct: 219 TSQIGVDEIFQNYLKYAKNLRTVIFACSLEHSRALAAKFRRNGIKAEHLDGETPPELRVQ 278

Query: 296 LLAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYA 355
           +L +F  G  QV+ N ++  EG+D P +EC+   RPT+S   +LQM GR LR    K  A
Sbjct: 279 ILERFRGGETQVITNYEILTEGYDCPNIECIYCVRPTESSTLWLQMTGRVLRTHTLKPTA 338

Query: 356 IIIDPVRNWERHGPPTAVRNWSLMGREKGQGSGGDTIPQKTCDACTVP------------ 403
           +IID   NW++HG P   R WSL  +     S    I    C     P            
Sbjct: 339 VIIDVTDNWKKHGLPDEQRQWSLEAKTLTASSSLGLIQCPHCTHAFKPLSHELAIVDGEI 398

Query: 404 ------YEAYYKACPYCGNVNEIADRSAPDKV 429
                  E +   CP CG   E   +   + +
Sbjct: 399 GEDGLLIEHHEAICPNCGQAVEFTTKETTEPI 430
>ref|NP_471919.1| (NC_003212) similar to DEAH-family helicase [Listeria innocua]
 emb|CAC97816.1| (AL596173) similar to DEAH-family helicase [Listeria innocua]
          Length = 418

 Score =  188 bits (477), Expect = 2e-046
 Identities = 129/388 (33%), Positives = 194/388 (49%), Gaps = 47/388 (12%)

Query: 32  GAGKTVIFSKIIHDHT---GAAAAIVHRREIVAQISLSLASFGVKHRVIAPAKTLKTIRK 88
           GAGK+VI S+II   T        +VHR+E++ QI  +L    V  + +           
Sbjct: 9   GAGKSVILSEIIRMTTRNKNNVLFLVHRKELIDQIRNTLTMNDVDMKFV----------- 57

Query: 89  KHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLNNAAIMRWVNQVTLGVYDEGHHYVEQGIW 148
                             +  VQT+  +        +   ++  L + DE HH +    +
Sbjct: 58  -----------------NLGMVQTVVRR--------LEKTSEPALIIIDESHHVLANS-Y 91

Query: 149 AKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVMIEGPTTHWLIENGFLSKFVYRA 208
            K ++ F NAK++  TATP R +G GLG       +++IE     WLIEN FL+ + Y A
Sbjct: 92  KKIINHFSNAKVVGFTATPVRINGGGLGD----INDMLIEKVNVKWLIENQFLAPYKYFA 147

Query: 209 PQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVETAE 268
           P+  +    L   + G+F+   L  +  +  I GDV++HY+K    ++AI++A+ +  +E
Sbjct: 148 PEI-VQTETLDIKRTGEFDMTGLDDQFNKRMIWGDVIKHYQKLANGEQAILYASSLYQSE 206

Query: 269 QMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVIL 328
           +MA  F   G T+A + G+T +  RD ++ +F EG I+VL N+DL  EGFDVP    VI+
Sbjct: 207 KMAMSFESVGITSAHIDGKTPKSIRDDIIQRFREGEIKVLCNLDLIGEGFDVPDCSTVIM 266

Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRNWSLMGREKGQGSG 388
            RPTQSL+ ++Q   R +R   GK  AIIID V N  R G P   R WSL  + KG  S 
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYRTGKT-AIIIDHVGNVNRFGLPDMERKWSLEAK-KGSNSN 324

Query: 389 GDTIPQKTCDACTVPYEAYYKACPYCGN 416
               P K C  C +   +    C +CG+
Sbjct: 325 KAEAPVKICPDCFMTVLSTNIKCSHCGH 352
>gb|AAL74161.1|AF442520_1 (AF442520) putative helicase [Streptococcus thermophilus
           bacteriophage k3]
          Length = 443

 Score =  187 bits (476), Expect = 3e-046
 Identities = 144/481 (29%), Positives = 235/481 (47%), Gaps = 62/481 (12%)

Query: 3   KLRDYQQLAYNRI-MAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HRRE 58
           +LR+YQ    N I  +     + ++   P  +GKTV+ + I    T     ++   HR+E
Sbjct: 2   ELRNYQNDLVNGIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTVLFFSHRKE 61

Query: 59  IVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKST 118
           I  Q+  +                        FK+ G            V +  +T  S 
Sbjct: 62  INEQVVNT------------------------FKRNG------------VDMNLVTIDSV 85

Query: 119 LNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKG 178
              A  +  +++ ++ + DE HH V+   + K +  + N+ +L  T TP R DG G    
Sbjct: 86  TKVARNLDRISEPSIILIDEAHH-VKAKTYLKIIEYYINSIVLMFTGTPARLDGSGFDD- 143

Query: 179 EGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVES 238
               A+ ++ G +  WL ENG ++ F Y AP   ID  +L   + G+F  K++    ++ 
Sbjct: 144 ---IADDIVLGKSVKWLQENGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197

Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLA 298
            I GDV++HY K  + K+AIV+   VE +E ++  F++ GYT+ ++SG+T    R+  + 
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFKKNGYTSIAISGKTPPEVRERAMQ 257

Query: 299 QFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
            F +  + ++VN +LF EG D+P V+  I+ RPTQSL+ +LQ   RAL   EGK  AIII
Sbjct: 258 TFRDRELTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316

Query: 359 DPVRNWERHGPPTAVRNWSLMGREKGQGS---GGDTIPQKTCDACTVPYEAYYKACPYCG 415
           D V N +RHG P A R WSL G  K +     G  T   +TCD C   + +  + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGINKTKKKLKLGEPT--TRTCDECYATFWSAERICPLCG 374

Query: 416 NVNEIADRSAPDKVDGDLV-ELDL-DAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRG 473
           + N+      P K + +++ E++L +    + +++E    SD    ++ +K    Q G  
Sbjct: 375 HENQ------PTKEEIEIIREIELEERRQEVASKVETFVTSDQCQSVEELKEFAKQHGYK 428

Query: 474 P 474
           P
Sbjct: 429 P 429
>sp|P24125|V51K_BPL79 51.5 kDa protein
 pir||JU0445 51K protein GP51C - Lactococcus phage mi7-9
 gb|AAA32363.1| (M36388) conserved bacteriophage encoded protein [Lactococcus
           bacteriophage (ISOLATE 7-9)]
          Length = 452

 Score =  179 bits (454), Expect = 1e-043
 Identities = 133/410 (32%), Positives = 195/410 (47%), Gaps = 53/410 (12%)

Query: 30  PTGAGKTVIFS---KIIHDHTGAAAAIVHRREIVAQISLSLASFGVKHRVIAPAKTLKTI 86
           P G+GK+V+ S   K+  +  G    +VHR+E++ QI+ S    GV              
Sbjct: 5   PPGSGKSVVISEITKLATEKGGRVLFLVHRKELIDQITNSFKVHGVP------------- 51

Query: 87  RKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLNNAAIMRWVNQVTLGVYDEGHHYVEQG 146
                               +  V+ LT     N  +++R   + TL + DEGHH  +  
Sbjct: 52  --------------------LNQVELLTVGKAKNRLSVLR---KPTLIITDEGHHG-KAA 87

Query: 147 IWAKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVMIEGPTTHWLIENGFLSKFVY 206
            + K    + +   L  TATP R  G G         + MIEG T  WLI N  L+ + Y
Sbjct: 88  TYQKIYEFYADVPRLGFTATPWRMSGDGFKDTY----DYMIEGKTVEWLINNKRLAPYQY 143

Query: 207 RA-PQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVE 265
            + P  DI    +   KNGD++ +++   + ++ I GDVVQ Y K    ++AI++A  VE
Sbjct: 144 YSLPSIDISKLRI---KNGDYSNQSIDDALGKT-IFGDVVQEYVKHANGQKAILYAHSVE 199

Query: 266 TAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFDEGFDVPAVEC 325
            ++  A+EF+ AG  A  V  +T + ERD L+  F  G I+VL NVDL  EGFDVP    
Sbjct: 200 ASQSFAKEFQEAGIKAVHVDAKTPKNERDKLMLDFRNGRIKVLCNVDLISEGFDVPDCTV 259

Query: 326 VILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRNWS--LMGREK 383
            IL RPT+SL  FLQ   R++R    K  AIIID V NW  HG P    +W     G  K
Sbjct: 260 TILCRPTKSLVLFLQQSMRSMRYQHRKT-AIIIDNVMNWHAHGLPDTHHDWKDYFEGGWK 318

Query: 384 GQGSGGDTIPQKTCDACTVPYEAYYKACPYCGNVNEIADRSAPDKVDGDL 433
            +G   + +  K C  C+  +    K C  CG+   I ++    +++ +L
Sbjct: 319 KKGQ-KNIVQAKQCPDCSAMWPLSQKMCNLCGHDFSIEEKHEKLRLEAEL 367
>ref|NP_490263.1| (NC_003276) putative helicase [Nostoc sp. PCC 7120]
 dbj|BAB78241.1| (AP003600) ORF_ID:alr7157~putative helicase [Nostoc sp. PCC 7120]
          Length = 629

 Score =  170 bits (430), Expect = 6e-041
 Identities = 129/421 (30%), Positives = 194/421 (45%), Gaps = 56/421 (13%)

Query: 1   MFKLRDYQQLAYNRIMAAWEK-YRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HR 56
           MF LRDYQQ   ++  AAW    R VL  L TG GKTVIF++I    T     ++   HR
Sbjct: 1   MFNLRDYQQDLISKTFAAWSSGIRKVLLQLSTGGGKTVIFAEIASKMTAQGEGVLVVAHR 60

Query: 57  REIVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSK 116
            E++ Q +          ++ A  K    I K  +K         ++P  + S+QTL  +
Sbjct: 61  EELILQAA---------EKLTAVTKLQPGIIKAGYKS-------TDSPLQIASIQTLARR 104

Query: 117 STLNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLG 176
            T  +A ++         + DE HH      + K +  + +A +L +TATP R DG GL 
Sbjct: 105 QTYPSAQLV---------IIDEAHHSSANS-YRKLLDAYPHALVLGLTATPRREDGYGLR 154

Query: 177 KGEGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVV 236
                  + +I   +T  LI  G+L+ +   A             +  DF  K L+  V 
Sbjct: 155 D----IFDHLICSVSTKELITLGYLTDYKLIA---GFKYSRHKVPQKRDFTRKELE-EVA 206

Query: 237 ESHIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHL 296
             +   +V++ ++ +   K+ ++FA +V  ++Q+A  F   G T   + GET   ER  +
Sbjct: 207 SDYKPSEVLKQWQNFCAGKKTVIFAVNVMHSKQIAAAFCADGITCEHLDGETPHNERQAI 266

Query: 297 LAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAI 356
           L +F  G  QV+ N  +  EGFD P     ++ARPT S+  +LQM+GR LR   GK+YA 
Sbjct: 267 LNRFRSGQTQVISNCAILTEGFDCPDSSAAVIARPTSSVTLWLQMIGRVLRPAPGKDYAT 326

Query: 357 IIDPVRNWERHGPPTAVRNWSLMGREKGQGSGGDTIPQKTCDACTVPYEAYYKACPYCGN 416
           I+D   NW R G P   R WSL                 +CD    P     + CP+C +
Sbjct: 327 ILDMTDNWFRLGRPCDNRKWSL--------------EPVSCD----PDTQGSRCCPHCHH 368

Query: 417 V 417
           V
Sbjct: 369 V 369
>ref|NP_607398.1| (NC_003485) hypothetical phage protein [Streptococcus pyogenes
           MGAS8232]
 gb|AAL97897.1| (AE010051) hypothetical phage protein [Streptococcus pyogenes
           MGAS8232]
          Length = 455

 Score =  169 bits (428), Expect = 1e-040
 Identities = 124/414 (29%), Positives = 196/414 (46%), Gaps = 50/414 (12%)

Query: 25  VLAVLPTGAGKTVIFSKIIHDHT---GAAAAIVHRREIVAQISLSLASFGVKHRVIAPAK 81
           ++ V P G+GK+V+ S I    T   G    +VHR+E++ QI+ S    G+    +    
Sbjct: 1   MMIVSPPGSGKSVVISDIAKSATQKNGHVLFLVHRKELIDQITNSFKFHGIDMNKV---- 56

Query: 82  TLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLNNAAIMRWVNQVTLGVYDEGHH 141
            L T+ K                          +K+ L+       + + TL + DEGHH
Sbjct: 57  DLMTVGK--------------------------AKNRLDK------LTKPTLIITDEGHH 84

Query: 142 YVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVMIEGPTTHWLIENGFL 201
             +   +      F +   +  TATP R  G G         +VM+ G T  WLI N  L
Sbjct: 85  G-KASTYQIIYEYFSDVPRIGFTATPWRLSGDGFTDTY----DVMVLGKTVEWLINNNKL 139

Query: 202 SKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFA 261
           + + Y +  + ID   L   +NGD++ K++     +  I GDVVQ Y K    ++AI++A
Sbjct: 140 APYDYYSVLS-IDTAKLKV-QNGDYSNKSIDESFGKK-IFGDVVQEYIKKANGQKAILYA 196

Query: 262 TDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFDEGFDVP 321
             VE ++  A+EF+  G  A     +T + +RD ++  F +G IQV+ NVDL  EGFDVP
Sbjct: 197 HSVEASQAFAKEFQSMGINAIHADAKTPKAKRDKIMKDFRDGKIQVICNVDLISEGFDVP 256

Query: 322 AVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRNWS--LM 379
                IL RPT+SL  FLQ   R++R    K+ AII+D V NW  HG P    +W     
Sbjct: 257 DCTVTILCRPTKSLVLFLQQSMRSMRYQPNKK-AIILDHVGNWNIHGLPDTPHHWENYFR 315

Query: 380 GREKGQGSGGDTIPQKTCDACTVPYEAYYKACPYCGNVNEIADRSAPDKVDGDL 433
           G  K + +  +T+  K C  C+  +    + C  C +   + ++   ++++ +L
Sbjct: 316 GGWKKESNKTNTVHAKECPVCSALWPLSQQLCELCNHDFGLKEKQEKERIEAEL 369
>ref|NP_459877.1| (NC_003197) Fels-1 prophage; putative DNA or RNA helicases of
           superfamily II [Salmonella typhimurium LT2]
 gb|AAL19836.1| (AE008738) Fels-1 prophage; putative DNA or RNA helicases of
           superfamily II [Salmonella typhimurium LT2]
          Length = 527

 Score =  120 bits (300), Expect = 8e-026
 Identities = 111/421 (26%), Positives = 178/421 (41%), Gaps = 62/421 (14%)

Query: 19  WEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIVHRREIVAQISLSLASFGVKHRVIA 78
           W+ + + +   PTG+GK           TG AA IV              S G++     
Sbjct: 6   WKSHNTFMVYAPTGSGK-----------TGLAAFIVD----------GFVSRGMRVMFCV 44

Query: 79  PAKTLKTIRKKHFKKYGKS-----FI-------DPNAPCGVVSVQTLTSKSTLNNAAIMR 126
           P + L T     F +YG S     ++       +P     + S  TL  +   +N     
Sbjct: 45  PYQILITQTASRFIEYGLSGDEIGYVWASHPNHNPALKIQIASADTLIRREFPDN----- 99

Query: 127 WVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVM 186
               + L + DE H   ++ +           K++ ++ TP            G + + +
Sbjct: 100 ----IDLLIIDEAHLRKKRILQDIERLRASGVKVIGLSGTP-------FSPFLGKYYDRL 148

Query: 187 IEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKN----GDFNAKALKARVVESHIVG 242
           I+  T   LI+ G LSK+ + AP T  DL+ + T  +     D+N   L   +  S +VG
Sbjct: 149 IKPTTIGELIQRGDLSKYEFYAP-TKPDLKGVKTKSSLEYGSDYNEAQLAEIMCGSTLVG 207

Query: 243 DVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEE 302
           D+VQ++ + G +   I F  +V  A  +  +F  AG  A  ++ +T   ER  ++ +FE 
Sbjct: 208 DIVQNWLENGRDLPTIAFCVNVAHANYLTIQFNLAGINAEVMTADTPVDERQTIIHRFET 267

Query: 303 GAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVR 362
           GA +++V+V +   GFD   V C+I ARPT+S  ++LQ +GR LR   GKE  +I D   
Sbjct: 268 GATKIIVSVGVLVAGFD-SDVRCIIYARPTKSEIRWLQAIGRGLRTAPGKESCLIFDHSG 326

Query: 363 NWERHGPPTAVRNWSLMGREKGQGSGGDTIPQ-------KTCDACTVPYEAYYKACPYCG 415
              R G P ++    L G+  G         Q       + C  C     A    CP CG
Sbjct: 327 TVHRLGYPDSIEYDDLPGKSDGMEESARRSSQERAEKLPRECSQCHYMKPAGVYVCPKCG 386

Query: 416 N 416
           +
Sbjct: 387 H 387
>ref|NP_142208.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
 pir||H71243 probable helicase protein PH0210 - Pyrococcus horikoshii
 dbj|BAA29279.1| (AP000001) 444aa long hypothetical protein [Pyrococcus horikoshii]
          Length = 444

 Score = 85.1 bits (209), Expect = 3e-015
 Identities = 99/365 (27%), Positives = 164/365 (44%), Gaps = 49/365 (13%)

Query: 2   FKLRDYQQLAYNRIMAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIVHRREIVA 61
           +KLR YQ+ A   +  A ++   VLA LP G+GKTV+  +IIH+   +A  IVH +E++ 
Sbjct: 93  YKLRKYQKKA---VKLAIKEKMGVLA-LPVGSGKTVVGLRIIHEINKSALVIVHTKELLY 148

Query: 62  QISLSLAS-FGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLN 120
           Q +  +    GV+  +I   K                      P  V  +QTL S+ T  
Sbjct: 149 QWANKVREILGVEPGIIGDNKW------------------SEGPITVAMIQTLLSRGT-- 188

Query: 121 NAAIMRWVNQVTLGVYDEGHH------YVEQGIWAKGVHVFENAKLLFVTATPERADGVG 174
                +  N+  + ++DE H       + + GI    V+ F       ++ATP R     
Sbjct: 189 ----DKLQNKYAIVMFDECHRTSAAEKFYKVGISLPQVYRFG------LSATPWRRLRGE 238

Query: 175 LGKGEGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKAR 234
             K EG    ++ E      LI+ GFL+K  +   + D  +  LA  K  +   +A+   
Sbjct: 239 EMKIEGVVGPIIYEVKAED-LIKEGFLAKPKFEVIEYDSKMPALA-DKYKELYEEAVMEN 296

Query: 235 VVESHIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERD 294
              +  + +      K G   R ++    ++  E + +  R  G  A  +S ++    R 
Sbjct: 297 EERNRAIVEKAIELAKQGH--RVLIDVKRIDHGEILVKMLRDRGINAEFLSSQSP--NRW 352

Query: 295 HLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEY 354
            +L ++++G I VLV+  L  EG D+P +  +ILA   +S    +Q +GRALR   G E 
Sbjct: 353 EILEKYKKGEIPVLVST-LLKEGVDIPEISAIILAGGGKSDVMTIQTIGRALRPKPGGE- 410

Query: 355 AIIID 359
           A+I+D
Sbjct: 411 AVIVD 415
>ref|NP_599062.1| (NC_003444) unknown [Shigella flexneri bacteriophage V]
 gb|AAL89432.1| (U82619) unknown [Shigella flexneri bacteriophage V]
          Length = 178

 Score = 82.8 bits (203), Expect = 1e-014
 Identities = 53/125 (42%), Positives = 68/125 (54%), Gaps = 4/125 (3%)

Query: 569 DKFDLPIKTIAYHLSHINRFNGAVGQ-YSVAQHCVQVXXXXXXXXXXXXXXHDATEAVLC 627
           +K D+ I  IA  LS+I RF G +   YSVAQH V                HDATEA   
Sbjct: 18  NKDDIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 77

Query: 628 DVPAPLKRMLPDYQAIENRLQDAVDARFKV-KTRHKRVREADLSMLAAEARDFGLDLGPL 686
           D+PAPLKR+LPDY+ +E ++   +  ++ +       V+ ADL MLA E RD GLD G  
Sbjct: 78  DIPAPLKRLLPDYKQMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDG-- 135

Query: 687 GFEPV 691
            F PV
Sbjct: 136 SFWPV 140
>ref|NP_602261.1| (NC_003450) COG1061:DNA or RNA helicases of superfamily II
           [Corynebacterium glutamicum]
 dbj|BAC00463.1| (AP005283) Restriction enzymes type I helicase subunits and related
           helicases [Corynebacterium glutamicum ATCC 13032]
          Length = 1643

 Score = 65.9 bits (159), Expect = 2e-009
 Identities = 72/255 (28%), Positives = 100/255 (38%), Gaps = 52/255 (20%)

Query: 158 AKLLFVTATPERADGVGLGKGEGGFAEVM------IEGPTTHWL-----IENGFLSKFVY 206
           AK L++TATP   D    GK     AEV       I GP  H L     +E G L+ +  
Sbjct: 360 AKRLYMTATPRLFDDSVKGKAADHSAEVSSMDDEAIYGPEFHRLGFGEAVEKGLLTDYKV 419

Query: 207 RAPQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENK----------- 255
                D  +   A    G    + L   +  S I+G      ++ G+ +           
Sbjct: 420 VVMTVDEQVAASALTVLGSTPGEELTLDMT-SAIIGAWNGLAKRSGKEQDTKTGFSSSDA 478

Query: 256 ---RAIVFATDVETAEQMAEEFRRA--GYTAA--------------------SVSGETEQ 290
              RA+ FA D++T++Q+AE F R    YT                       V G    
Sbjct: 479 AMERAVAFARDIKTSQQIAESFPRVVNAYTTELEVKNDDVDEHNLNLSVACQHVDGSMNA 538

Query: 291 GERDHLL----AQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRAL 346
            ER+  L    A  +    ++L N     EG DVPA++ VI   P  S+   +Q VGR +
Sbjct: 539 LERNSRLTWLKAPTQSMETKILTNARCLSEGVDVPALDSVIFFNPRNSMVDVVQSVGRVM 598

Query: 347 RIMEGKEYAIIIDPV 361
           R   GK Y  II PV
Sbjct: 599 RKSPGKNYGYIILPV 613
>ref|XP_132906.1| (XM_132906) similar to KIAA0111 gene product [Mus musculus]
          Length = 411

 Score = 57.4 bits (137), Expect = 6e-007
 Identities = 26/92 (28%), Positives = 51/92 (55%)

Query: 256 RAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFD 315
           +A++F       + + E+ R A +T +S+ G+  Q ER+ ++ +F  GA +VL++ D++ 
Sbjct: 279 QAVIFCITKRKVDWLTEKMREANFTVSSMHGDMPQKERESIMKEFRSGASRVLISTDVWA 338

Query: 316 EGFDVPAVECVILARPTQSLAKFLQMVGRALR 347
            G DVP V  +I      +   ++  +GR+ R
Sbjct: 339 RGLDVPQVSLIINYDLPNNRELYIHRIGRSGR 370
>ref|NP_248512.1| (NC_000909) ATP dependent RNA helicase, putative [Methanococcus
           jannaschii] [Methanocaldococcus jannaschii]
 sp|Q58900|YF05_METJA Putative ATP-dependent RNA helicase MJ1505
 gb|AAB99518.1| (U67591) ATP dependent RNA helicase, putative [Methanococcus
           jannaschii] [Methanocaldococcus jannaschii]
          Length = 778

 Score = 55.1 bits (131), Expect = 3e-006
 Identities = 36/124 (29%), Positives = 64/124 (51%), Gaps = 8/124 (6%)

Query: 241 VGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETE-------QGER 293
           V D+V++  +  +++R I+FA   +T E++     + G  A    G+         Q E+
Sbjct: 339 VVDMVKNILEKNKDERIIIFAQYRDTVEKIVNLLTQNGIKAIRFIGQANKEGKGMSQKEQ 398

Query: 294 DHLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKE 353
              + +F++    VLV+  + +EG D+P+V  +I   P  S  +F+Q  GRA+R   GK 
Sbjct: 399 IEAIERFKKEG-SVLVSTSVSEEGIDIPSVNYIIFYEPVPSEIRFIQRRGRAMRGEGGKV 457

Query: 354 YAII 357
           Y +I
Sbjct: 458 YVLI 461
>sp|O02494|IF4A_CRYPV Eukaryotic initiation factor 4A (eIF-4A) (eIF4A)
 gb|AAB58726.1| (AF001211) translation initiation factor [Cryptosporidium parvum]
 gb|AAB58799.1| (AF001378) translation initiation factor [Cryptosporidium parvum]
          Length = 405

 Score = 54.3 bits (129), Expect = 5e-006
 Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 5/117 (4%)

Query: 256 RAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFD 315
           +AI++       +Q+ ++ R   +T +S+ G+ +Q +R+ ++ QF  G+ +VL+  DL  
Sbjct: 272 QAIIYCNTRRRVDQLTKQMRERDFTCSSMHGDMDQKDREVIMRQFRSGSSRVLITTDLLA 331

Query: 316 EGFDVPAVECVILARPTQSLAKFLQMVGRALRI-MEGKEYAIIIDP----VRNWERH 367
            G DV  V  VI      S   ++  +GR+ R   +G     + D     +R+ ERH
Sbjct: 332 RGIDVQQVSLVINYDLPVSPETYIHRIGRSGRFGKKGVSINFVTDDDIVCLRDIERH 388
>ref|NP_540200.1| (NC_003317) Hypothetical Cytosolic Protein [Brucella melitensis]
 gb|AAL52464.1| (AE009566) Hypothetical Cytosolic Protein [Brucella melitensis]
          Length = 206

 Score = 53.1 bits (126), Expect = 1e-005
 Identities = 51/173 (29%), Positives = 74/173 (42%), Gaps = 25/173 (14%)

Query: 566 LSGDKFDLPIKTIAYHLSHINRFNG-AVGQ--YSVAQHCVQVXXX-------XXXXXXXX 615
           L     D+ I+ IA+ L+ + R+NG  VG+  +SVAQH + V                  
Sbjct: 26  LDPSPLDVEIEDIAHGLARVARWNGQTVGEHAFSVAQHSLLVDQIFNRLVPGASIEWQML 85

Query: 616 XXXHDATEAVLCDVPAPLKRMLP-DYQAIENRLQDAVDARF--------KVKTRHKRVRE 666
              HDA E V+ D+ +P K ++  +Y+ IE RL+ A+  RF        K+KT  KR  +
Sbjct: 86  ALLHDAPEYVIGDMISPFKAVMGGNYKLIETRLESAIHLRFSLPVSPPAKLKTLIKRADQ 145

Query: 667 ADLSMLAAEARDFGLDLG------PLGFEPVSATIKPWPAKTAEQAWLAAFHA 713
                 A     F    G      P G  PV   I P P +  +  +L  F A
Sbjct: 146 VAAFFEATRLAGFTEAEGVKYFGRPRGLNPVGLDIVPRPTQDVQADFLKRFSA 198
>ref|NP_531749.1| (NC_003304) conserved hypothetical protein [Agrobacterium
           tumefaciens str. C58 (U. Washington)]
 gb|AAL42065.1| (AE009069) conserved hypothetical protein [Agrobacterium
           tumefaciens str. C58 (U. Washington)]
          Length = 207

 Score = 48.5 bits (114), Expect = 3e-004
 Identities = 48/176 (27%), Positives = 75/176 (42%), Gaps = 31/176 (17%)

Query: 566 LSGDKFDLPIKTIAYHLSHINRFNGAVG---QYSVAQHCVQVXXX-------XXXXXXXX 615
           L     D+ I  IA+ L+ + R+NG       ++VAQHC+ V                  
Sbjct: 22  LDPSPLDVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQM 81

Query: 616 XXXHDATEAVLCDVPAPLKRML-PDYQAIENRLQDAVDARFKV-----KTRHKRVREADL 669
              HDA E V+ D+ +P K ++   Y+ +E RL+ AV  RF +     +    R+++AD 
Sbjct: 82  ALLHDAPEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADT 141

Query: 670 SMLAAEARD------------FGLDLGPLGFEPVSATIKPWPAKTAEQAWLAAFHA 713
                EA +            FGL   P G       I P P+  A++ ++A F A
Sbjct: 142 VAAFFEATELAGFSTAEAQKFFGL---PRGITRDMFDIIPLPSTEAQRLFIARFEA 194
>ref|NP_354075.1| (NC_003062) AGR_C_1942p [Agrobacterium tumefaciens] [Agrobacterium
           tumefaciens str. C58 (Cereon)]
 gb|AAK86860.1| (AE008035) AGR_C_1942p [Agrobacterium tumefaciens str. C58
           (Cereon)]
          Length = 226

 Score = 48.5 bits (114), Expect = 3e-004
 Identities = 48/176 (27%), Positives = 75/176 (42%), Gaps = 31/176 (17%)

Query: 566 LSGDKFDLPIKTIAYHLSHINRFNGAVG---QYSVAQHCVQVXXX-------XXXXXXXX 615
           L     D+ I  IA+ L+ + R+NG       ++VAQHC+ V                  
Sbjct: 41  LDPSPLDVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQM 100

Query: 616 XXXHDATEAVLCDVPAPLKRML-PDYQAIENRLQDAVDARFKV-----KTRHKRVREADL 669
              HDA E V+ D+ +P K ++   Y+ +E RL+ AV  RF +     +    R+++AD 
Sbjct: 101 ALLHDAPEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADT 160

Query: 670 SMLAAEARD------------FGLDLGPLGFEPVSATIKPWPAKTAEQAWLAAFHA 713
                EA +            FGL   P G       I P P+  A++ ++A F A
Sbjct: 161 VAAFFEATELAGFSTAEAQKFFGL---PRGITRDMFDIIPLPSTEAQRLFIARFEA 213
>gb|AAB67607.1| (U73459) translational initiation factor eIF-4A [Zea mays]
          Length = 414

 Score = 48.5 bits (114), Expect = 3e-004
 Identities = 26/93 (27%), Positives = 48/93 (50%), Gaps = 2/93 (2%)

Query: 256 RAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFD 315
           ++++F       + + ++ R   +T ++  G+ +Q  RD ++ +F  G+ +VL+  DL  
Sbjct: 282 QSVIFVNTRRKVDWLTDKMRSRDHTVSATHGDMDQNTRDIIMREFRSGSSRVLITTDLLA 341

Query: 316 EGFDVPAVECVI-LARPTQSLAKFLQMVGRALR 347
            G DV  V  VI    PTQ    +L  +GR+ R
Sbjct: 342 RGIDVQQVSLVINYDLPTQP-ENYLHRIGRSGR 373
>ref|NP_385209.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
 emb|CAC45682.1| (AL591786) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
          Length = 202

 Score = 47.4 bits (111), Expect = 6e-004
 Identities = 47/174 (27%), Positives = 74/174 (42%), Gaps = 28/174 (16%)

Query: 566 LSGDKFDLPIKTIAYHLSHINRFNGAVG---QYSVAQHCVQVXXXXXXXXX------XXX 616
           L     D+ +  IA+ L+ + R+NG       +SVAQH + V                  
Sbjct: 19  LDPSPLDVELSDIAHGLARVARWNGQTSGDHAFSVAQHSLVVEEIFRRTNRCDAQDCLMA 78

Query: 617 XXHDATEAVLCDVPAPLKRML-PDYQAIENRLQDAVDARFKV-----KTRHKRVREAD-- 668
             HDA E V+ D+ +P K ++   Y+ +E RL++AV  RF +     +   +R+++AD  
Sbjct: 79  LLHDAPEYVIGDMISPFKAVVGGGYKTVEKRLENAVHLRFGLPAHTPRELKERIKKADRI 138

Query: 669 ---------LSMLAAEARDFGLDLGPLGFEPVSATIKPWPAKTAEQAWLAAFHA 713
                        A EAR F     P G       I P PA  A++ +   F+A
Sbjct: 139 AAYFEATELAGFSAEEARKFFGQ--PRGITREMLLIDPLPAVEAQRLFATRFNA 190
>emb|CAB38640.1| (Y17288) RNA helicase [Plasmodium falciparum]
          Length = 182

 Score = 44.7 bits (104), Expect = 0.004
 Identities = 20/72 (27%), Positives = 39/72 (53%)

Query: 256 RAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFD 315
           ++I++    +  + + +E     +T + + G+ +Q +RD ++ +F  G+ +VLV  DL  
Sbjct: 95  QSIIYCNTRKKVDILTQEMHNRLFTVSCMHGDMDQKDRDLIMREFRSGSTRVLVTTDLLA 154

Query: 316 EGFDVPAVECVI 327
            G DV  V  VI
Sbjct: 155 RGIDVQQVSLVI 166
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.321 0.137 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 449,517,236 Number of Sequences: 1028106 Number of extensions: 18833824 Number of successful extensions: 44756 Number of sequences better than 5.0e-02: 765 Number of HSP's better than 0.1 without gapping: 596 Number of HSP's successfully gapped in prelim test: 169 Number of HSP's that attempted gapping in prelim test: 43629 Number of HSP's gapped (non-prelim): 1156 length of query: 718 length of database: 324,535,921 effective HSP length: 128 effective length of query: 590 effective length of database: 192,938,353 effective search space: 113833628270 effective search space used: 113833628270 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 95 (41.2 bits) Query= orf46 [13051-13221] (169 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.135 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 105,108,614 Number of Sequences: 1028106 Number of extensions: 3728264 Number of successful extensions: 7212 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 7212 Number of HSP's gapped (non-prelim): 0 length of query: 169 length of database: 324,535,921 effective HSP length: 112 effective length of query: 57 effective length of database: 209,388,049 effective search space: 11935118793 effective search space used: 11935118793 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 87 (38.1 bits) Query= orf47 [13236-13348] (111 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_215195.1|  (NC_000962) hypothetical protein Rv0681 [M...    47  4e-005
ref|NP_335121.1|  (NC_002755) transcriptional regulator, put...    47  4e-005
emb|CAC95193.1|  (AJ416905) hypothetical transcriptional reg...    43  6e-004
ref|NP_266283.1|  (NC_002662) transcription regulator [Lacto...    43  8e-004
ref|NP_386558.1|  (NC_003047) PUTATIVE TRANSCRIPTION REGULAT...    41  0.002
ref|NP_421917.1|  (NC_002696) transcriptional regulator, Tet...    41  0.003
ref|NP_626492.1|  (NC_003888) putative TetR-family transcrip...    40  0.004
sp|P51561|TER8_PASMU  TETRACYCLINE REPRESSOR PROTEIN CLASS H...    40  0.005
gb|AAD12754.1|  (AF038993) repressor protein [Proteus mirabi...    40  0.005
ref|NP_421590.1|  (NC_002696) transcriptional regulator, Tet...    40  0.005
ref|NP_149192.1|  (NC_001988) HTH transcriptional regulator ...    40  0.006
ref|NP_355336.1|  (NC_003062) AGR_C_4325p [Agrobacterium tum...    40  0.006
emb|CAA75662.1|  (Y15510) tetracycline resistance repressor ...    39  0.008
ref|NP_511232.1|  (NC_003292) tetracycline repressor protein...    39  0.014
ref|NP_628554.1|  (NC_003888) putative tetR-family transcrip...    38  0.019
ref|NP_630227.1|  (NC_003888) putative transcriptional regul...    37  0.032
>ref|NP_215195.1| (NC_000962) hypothetical protein Rv0681 [Mycobacterium tuberculosis
           H37Rv]
 pir||B70827 hypothetical protein Rv0681 - Mycobacterium tuberculosis  (strain
           H37RV)
 emb|CAA17464.1| (AL021943) hypothetical protein Rv0681 [Mycobacterium tuberculosis
           H37Rv]
          Length = 196

 Score = 47.0 bits (110), Expect = 4e-005
 Identities = 33/95 (34%), Positives = 49/95 (50%), Gaps = 6/95 (6%)

Query: 8   RPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRG 67
           RPA   R E IV+  L    R GW+ +T  A+A  +G   PSL  H  ++  L+RA+   
Sbjct: 3   RPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAVRIR 61

Query: 68  AIKADDGLVVAMAVKAGDFQRAKVDELLYIRGEHQ 102
            I  DD + +   V AG   RA+ D +L + G ++
Sbjct: 62  VI--DDIITMLNRVGAG---RARDDAVLVMAGAYR 91
>ref|NP_335121.1| (NC_002755) transcriptional regulator, putative [Mycobacterium
           tuberculosis CDC1551]
 gb|AAK44935.1| (AE006965) transcriptional regulator, putative [Mycobacterium
           tuberculosis CDC1551]
          Length = 218

 Score = 47.0 bits (110), Expect = 4e-005
 Identities = 33/95 (34%), Positives = 49/95 (50%), Gaps = 6/95 (6%)

Query: 8   RPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRG 67
           RPA   R E IV+  L    R GW+ +T  A+A  +G   PSL  H  ++  L+RA+   
Sbjct: 25  RPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAVRIR 83

Query: 68  AIKADDGLVVAMAVKAGDFQRAKVDELLYIRGEHQ 102
            I  DD + +   V AG   RA+ D +L + G ++
Sbjct: 84  VI--DDIITMLNRVGAG---RARDDAVLVMAGAYR 113
>emb|CAC95193.1| (AJ416905) hypothetical transcriptional regulator [Rhizobium
          leguminosarum]
          Length = 216

 Score = 43.1 bits (100), Expect = 6e-004
 Identities = 21/64 (32%), Positives = 38/64 (58%)

Query: 2  AKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
          AK + +      ++EQI++A L V    G+ G T   +AE  G++ P+L+Y++RT   + 
Sbjct: 7  AKTLRRTRIQEEKEEQILEAALDVFSASGFRGSTIDQIAEVAGMSKPNLLYYFRTKEAMH 66

Query: 62 RAIV 65
          RA++
Sbjct: 67 RALI 70
>ref|NP_266283.1| (NC_002662) transcription regulator [Lactococcus lactis subsp.
          lactis]
 gb|AAK04225.1|AE006251_1 (AE006251) transcription regulator [Lactococcus lactis subsp.
          lactis]
          Length = 191

 Score = 42.7 bits (99), Expect = 8e-004
 Identities = 18/52 (34%), Positives = 36/52 (68%)

Query: 12 AVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRA 63
          A+ +++IVQA + +A ++G   +++  +AE+ GI APSL  H++ M +++ A
Sbjct: 5  ALTRDKIVQATIELAGKIGLSNVSFPRLAEYFGIKAPSLYNHFKNMEEVRVA 56
>ref|NP_386558.1| (NC_003047) PUTATIVE TRANSCRIPTION REGULATOR PROTEIN
          [Sinorhizobium meliloti]
 emb|CAC47031.1| (AL591790) PUTATIVE TRANSCRIPTION REGULATOR PROTEIN
          [Sinorhizobium meliloti]
          Length = 213

 Score = 41.2 bits (95), Expect = 0.002
 Identities = 18/52 (34%), Positives = 34/52 (64%)

Query: 14 RKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
          ++EQI++A L V    G+ G T   +A+  G++ P+L+Y++RT   + RA++
Sbjct: 19 KEEQILEAALEVFSANGFRGSTIDQIADVAGMSKPNLLYYFRTKEAMHRALI 70
>ref|NP_421917.1| (NC_002696) transcriptional regulator, TetR family [Caulobacter
          crescentus CB15]
 gb|AAK25085.1| (AE005976) transcriptional regulator, TetR family [Caulobacter
          crescentus CB15]
          Length = 202

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 25/91 (27%), Positives = 42/91 (45%)

Query: 1  MAKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQL 60
          M +    R AP VR++ +V A   V  R G  G + +A+    G++   L +++  +  L
Sbjct: 9  MTRAAFTREAPDVRRQALVAAAEAVLAREGVGGTSVRAICAEAGVSPGLLRHYFEGVDDL 68

Query: 61 KRAIVRGAIKADDGLVVAMAVKAGDFQRAKV 91
            A      +  DG + A    AGD  RA++
Sbjct: 69 IAAAYEAVSQRIDGALDAALAAAGDDPRARL 99
>ref|NP_626492.1| (NC_003888) putative TetR-family transcriptional regulator.
          [Streptomyces coelicolor A3(2)]
 emb|CAB66177.1| (AL136500) putative TetR-family transcriptional regulator.
          [Streptomyces coelicolor A3(2)]
          Length = 192

 Score = 40.4 bits (93), Expect = 0.004
 Identities = 29/96 (30%), Positives = 48/96 (49%), Gaps = 5/96 (5%)

Query: 1  MAKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQL 60
          M+ Q  K+   A R+  IV     +A + GW+ +T + +AE +  + P L  H+R   ++
Sbjct: 1  MSVQERKQREWAERERLIVATARELAEQQGWDAVTTRRLAERIEYSQPVLYSHFRGKREI 60

Query: 61 KRAIVRGAIKADDGLVVAM--AVKAGDFQRAKVDEL 94
            A+   A++    + VAM  A  A D  RA+V  L
Sbjct: 61 IGAV---ALQGATEMAVAMRAATAAADGPRARVHAL 93
>sp|P51561|TER8_PASMU TETRACYCLINE REPRESSOR PROTEIN CLASS H
 gb|AAC43249.1| (U00792) repressor protein [Pasteurella multocida]
          Length = 207

 Score = 40.0 bits (92), Expect = 0.005
 Identities = 17/50 (34%), Positives = 30/50 (60%)

Query: 15 KEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAI 64
          KEQ++   L++   VG EG+T + VA+ +G+  P+L +H +    L  A+
Sbjct: 6  KEQVIDDALILLNEVGIEGLTTRNVAQKIGVEQPTLYWHVKNKRALLDAL 55
>gb|AAD12754.1| (AF038993) repressor protein [Proteus mirabilis]
          Length = 207

 Score = 40.0 bits (92), Expect = 0.005
 Identities = 18/66 (27%), Positives = 36/66 (54%)

Query: 15 KEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRGAIKADDG 74
          KEQ++   L++   VG EG+T + +A+ +G+  P+L +H +    L  A+    ++    
Sbjct: 6  KEQVIDNALILLNEVGMEGLTTRKLAQKLGVEQPTLYWHVKNKRALLDALAETILQKHHH 65

Query: 75 LVVAMA 80
           V+ +A
Sbjct: 66 HVLPLA 71
>ref|NP_421590.1| (NC_002696) transcriptional regulator, TetR family [Caulobacter
           crescentus CB15]
 gb|AAK24758.1| (AE005944) transcriptional regulator, TetR family [Caulobacter
           crescentus CB15]
          Length = 255

 Score = 40.0 bits (92), Expect = 0.005
 Identities = 19/56 (33%), Positives = 35/56 (61%)

Query: 10  APAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
           A   R+  I++A L V  R+G EG + +A+A+  G++ P+L+Y+Y +   L  A++
Sbjct: 58  AGEARRAAILKAALAVFARLGPEGASVEAIAQAAGVSKPNLLYYYPSKDALYLAVL 113
>ref|NP_149192.1| (NC_001988) HTH transcriptional regulator TetR family
          [Clostridium acetobutylicum]
 gb|AAK76774.1|AE001438_27 (AE001438) HTH transcriptional regulator TetR family [Clostridium
          acetobutylicum]
          Length = 187

 Score = 39.7 bits (91), Expect = 0.006
 Identities = 17/47 (36%), Positives = 30/47 (63%)

Query: 15 KEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
          KE+I+Q    +A  +G + +T++ +AE + I +PSL  H+  M+ LK
Sbjct: 8  KEKIIQIAFSLADEIGLDKVTFQKLAEKLDIKSPSLYNHFSNMSDLK 54
>ref|NP_355336.1| (NC_003062) AGR_C_4325p [Agrobacterium tumefaciens]
          [Agrobacterium tumefaciens str. C58 (Cereon)]
 ref|NP_533056.1| (NC_003304) transcriptional regulator, TetR family [Agrobacterium
          tumefaciens str. C58 (U. Washington)]
 gb|AAK88121.1| (AE008152) AGR_C_4325p [Agrobacterium tumefaciens str. C58
          (Cereon)]
 gb|AAL43372.1| (AE009186) transcriptional regulator, TetR family [Agrobacterium
          tumefaciens str. C58 (U. Washington)]
          Length = 213

 Score = 39.7 bits (91), Expect = 0.006
 Identities = 17/52 (32%), Positives = 33/52 (62%)

Query: 14 RKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
          ++E I++A L V    G+ G T   +AE  G++ P+++Y++RT   + RA++
Sbjct: 19 KQETILEAALSVFSTNGFRGSTIDQIAEAAGMSKPNVLYYFRTKEAMHRALI 70
>emb|CAA75662.1| (Y15510) tetracycline resistance repressor protein [Pasteurella
          multocida]
 emb|CAA76068.1| (Y16103) tetracycline resistance repressor protein [Mannheimia
          haemolytica]
 emb|CAC08219.1| (AJ245947) tet repressor protein [Pasteurella aerogenes]
          Length = 207

 Score = 39.3 bits (90), Expect = 0.008
 Identities = 16/50 (32%), Positives = 30/50 (60%)

Query: 15 KEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAI 64
          KEQ++   L++   VG EG+T + +A+ +G+  P+L +H +    L  A+
Sbjct: 6  KEQVIDNALILLNEVGIEGLTTRKLAQKIGVEQPTLYWHVKNKRALLDAL 55
>ref|NP_511232.1| (NC_003292) tetracycline repressor protein [IncN plasmid R46]
 sp|P03039|TER3_ECOLI TETRACYCLINE REPRESSOR PROTEIN CLASS C
 pir||RPECYS tetracycline repressor - Escherichia coli plasmid pSC101
 gb|AAA25677.1| (M36272) tetracycline resistance element repressor protein (tetR)
          [Plasmid pSC101]
 gb|AAG43219.1| (AY046276) tetracycline repressor protein [IncN plasmid R46]
 prf||1101269A repressor tet [Salmonella enterica subsp. enterica serovar
          Panama]
          Length = 219

 Score = 38.5 bits (88), Expect = 0.014
 Identities = 15/57 (26%), Positives = 33/57 (57%)

Query: 13 VRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRGAI 69
          +++E +++  L +   VG EG+T + +AE +G+  P+L +H++    L  A+    +
Sbjct: 4  LQREAVIRTALELLNDVGMEGLTTRRLAERLGVQQPALYWHFKNKRALLDALAEAML 60
>ref|NP_628554.1| (NC_003888) putative tetR-family transcriptional regulator
           [Streptomyces coelicolor A3(2)]
 emb|CAB95896.1| (AL359988) putative tetR-family transcriptional regulator
           [Streptomyces coelicolor A3(2)]
          Length = 202

 Score = 38.1 bits (87), Expect = 0.019
 Identities = 31/121 (25%), Positives = 48/121 (39%), Gaps = 18/121 (14%)

Query: 3   KQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKR 62
           ++V K+      ++++++A +      GW G T   VAEH G++  +  +H+ T   L  
Sbjct: 8   ERVPKQDRSRATRQRLLEAAVACLAERGWAGSTVSVVAEHAGVSRGAAQHHFPTREDLFT 67

Query: 63  AIV----------------RGAIKADDGLVVAMAVK--AGDFQRAKVDELLYIRGEHQLE 104
           A V                 GA   D   VVA  V    G   RA +   +    E QL 
Sbjct: 68  AAVEYVAEERSIALRALFPEGAAADDRRAVVAALVDLYTGPLFRAALHLWVAASNEEQLR 127

Query: 105 G 105
           G
Sbjct: 128 G 128
>ref|NP_630227.1| (NC_003888) putative transcriptional regulator [Streptomyces
          coelicolor A3(2)]
 pir||T35904 probable transcription regulator - Streptomyces coelicolor
 emb|CAA22788.1| (AL035212) putative transcriptional regulator [Streptomyces
          coelicolor A3(2)]
          Length = 192

 Score = 37.4 bits (85), Expect = 0.032
 Identities = 25/82 (30%), Positives = 41/82 (49%), Gaps = 5/82 (6%)

Query: 1  MAKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQL 60
          M+ Q  K+   A R+  IV     +A + GW+ +T + +AE +  + P L  H+R     
Sbjct: 1  MSVQERKQRERAERERLIVATARELAEQQGWDAVTTRRLAERIEYSQPVLYSHFRG---- 56

Query: 61 KRAIVRGAIKADDGLVVAMAVK 82
          KR I+ GA+       +A AV+
Sbjct: 57 KREII-GAVALQGATELAAAVR 77
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.322 0.135 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 62,251,575 Number of Sequences: 1028106 Number of extensions: 2011169 Number of successful extensions: 4713 Number of sequences better than 5.0e-02: 26 Number of HSP's better than 0.1 without gapping: 21 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 4692 Number of HSP's gapped (non-prelim): 26 length of query: 111 length of database: 324,535,921 effective HSP length: 87 effective length of query: 24 effective length of database: 235,090,699 effective search space: 5642176776 effective search space used: 5642176776 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 84 (37.0 bits) Query= orf48 [13343-13405] (61 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.321 0.138 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 41,204,087 Number of Sequences: 1028106 Number of extensions: 1243960 Number of successful extensions: 2506 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2506 Number of HSP's gapped (non-prelim): 0 length of query: 61 length of database: 324,535,921 effective HSP length: 37 effective length of query: 24 effective length of database: 286,495,999 effective search space: 6875903976 effective search space used: 6875903976 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 85 (37.4 bits) Query= orf49 [13467-13520] (52 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.336 0.152 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 28,421,289 Number of Sequences: 1028106 Number of extensions: 716841 Number of successful extensions: 3399 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3399 Number of HSP's gapped (non-prelim): 0 length of query: 52 length of database: 324,535,921 effective HSP length: 28 effective length of query: 24 effective length of database: 295,748,953 effective search space: 7097974872 effective search space used: 7097974872 T: 11 A: 40 X1: 15 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.6 bits) S2: 85 (37.4 bits) Query= orf50 [13526-13637] (110 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.323 0.138 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 59,713,363 Number of Sequences: 1028106 Number of extensions: 1994436 Number of successful extensions: 5179 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 5179 Number of HSP's gapped (non-prelim): 0 length of query: 110 length of database: 324,535,921 effective HSP length: 86 effective length of query: 24 effective length of database: 236,118,805 effective search space: 5666851320 effective search space used: 5666851320 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 84 (37.0 bits) Query= orf51 [13636-13787] (150 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.134 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 94,576,630 Number of Sequences: 1028106 Number of extensions: 3400143 Number of successful extensions: 7870 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 7870 Number of HSP's gapped (non-prelim): 0 length of query: 150 length of database: 324,535,921 effective HSP length: 110 effective length of query: 40 effective length of database: 211,444,261 effective search space: 8457770440 effective search space used: 8457770440 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 85 (37.4 bits) Query= orf52 [13788-13858] (69 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.321 0.140 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 46,310,366 Number of Sequences: 1028106 Number of extensions: 1561078 Number of successful extensions: 3725 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3725 Number of HSP's gapped (non-prelim): 0 length of query: 69 length of database: 324,535,921 effective HSP length: 45 effective length of query: 24 effective length of database: 278,271,151 effective search space: 6678507624 effective search space used: 6678507624 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 84 (37.0 bits) Query= orf53 [13859-13922] (62 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.324 0.139 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 36,707,921 Number of Sequences: 1028106 Number of extensions: 1078469 Number of successful extensions: 2415 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2415 Number of HSP's gapped (non-prelim): 0 length of query: 62 length of database: 324,535,921 effective HSP length: 38 effective length of query: 24 effective length of database: 285,467,893 effective search space: 6851229432 effective search space used: 6851229432 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 85 (37.4 bits) Query= orf54 [13941-14001] (59 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.322 0.139 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 36,225,386 Number of Sequences: 1028106 Number of extensions: 1064755 Number of successful extensions: 2330 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2330 Number of HSP's gapped (non-prelim): 0 length of query: 59 length of database: 324,535,921 effective HSP length: 35 effective length of query: 24 effective length of database: 288,552,211 effective search space: 6925253064 effective search space used: 6925253064 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 85 (37.4 bits) Query= orf55 [13999-14088] (88 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.321 0.135 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 52,183,734 Number of Sequences: 1028106 Number of extensions: 1776028 Number of successful extensions: 3371 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3371 Number of HSP's gapped (non-prelim): 0 length of query: 88 length of database: 324,535,921 effective HSP length: 64 effective length of query: 24 effective length of database: 258,737,137 effective search space: 6209691288 effective search space used: 6209691288 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 84 (37.0 bits) Query= orf56 [14087-14190] (102 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.316 0.134 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 59,829,153 Number of Sequences: 1028106 Number of extensions: 2127616 Number of successful extensions: 16132 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 16132 Number of HSP's gapped (non-prelim): 0 length of query: 102 length of database: 324,535,921 effective HSP length: 78 effective length of query: 24 effective length of database: 244,343,653 effective search space: 5864247672 effective search space used: 5864247672 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 84 (37.0 bits) Query= orf57 [14234-14372] (137 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_213407.1|  (NC_000918) polypeptide deformylase [Aquif...    74  2e-013
ref|NP_600815.1|  (NC_003450) COG0242:N-formylmethionyl-tRNA...    72  1e-012
ref|NP_625500.1|  (NC_003888) putative polypeptide deformyla...    70  3e-012
ref|NP_562660.1|  (NC_003366) polypeptide deformylase [Clost...    69  1e-011
ref|NP_246498.1|  (NC_002663) Def [Pasteurella multocida] >g...    67  3e-011
ref|NP_541241.1|  (NC_003318) POLYPEPTIDE DEFORMYLASE [Bruce...    67  4e-011
ref|NP_601934.1|  (NC_003450) COG0242:N-formylmethionyl-tRNA...    66  6e-011
ref|NP_389454.1|  (NC_000964) polypeptide deformylase [Bacil...    66  7e-011
ref|NP_269070.1|  (NC_002737) putative polypeptide deformyla...    64  3e-010
ref|NP_607075.1|  (NC_003485) putative polypeptide deformyla...    64  3e-010
pdb|2DEF|   Peptide Deformylase Catalytic Core (Residues 1 -...    60  3e-009
ref|NP_266716.1|  (NC_002662) polypeptide deformylase [Lacto...    59  9e-009
sp|Q48661|DEF_LACLA  Peptide deformylase (PDF) (Polypeptide ...    59  9e-009
ref|NP_359915.1|  (NC_003103) polypeptide deformylase [EC:3....    55  1e-007
sp|Q9FV53|DEFM_ARATH  Peptide deformylase, mitochondrial pre...    52  9e-007
ref|NP_345996.1|  (NC_003028) polypeptide deformylase [Strep...    52  1e-006
gb|EAA02964.1|  (AAAB01007479) agCP10822 [Anopheles gambiae ...    45  2e-004
gb|AAG09295.1|AF177768_1  (AF177768) ORF260 [Trypanosoma bru...    39  0.010
>ref|NP_213407.1| (NC_000918) polypeptide deformylase [Aquifex aeolicus]
 sp|O66847|DEF_AQUAE Peptide deformylase (PDF) (Polypeptide deformylase)
 pir||C70352 polypeptide deformylase - Aquifex aeolicus
 gb|AAC06802.1| (AE000696) polypeptide deformylase [Aquifex aeolicus]
          Length = 169

 Score = 74.3 bits (181), Expect = 2e-013
 Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 7/97 (7%)

Query: 46  NQLGVLKRVIVVR------APKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHN 99
           NQ+GV   V+V+       AP  K  ++NP I +   G +  +EGCLSFPG +V+ +R  
Sbjct: 49  NQIGVPLSVMVIDTSPKEDAPPLKLVLINPEI-KEGEGKIKYKEGCLSFPGLSVEVERFQ 107

Query: 100 KITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNGVT 136
           K+ V   + H +P+++  +G  A   QHE+DHL G+T
Sbjct: 108 KVKVNALNEHGEPVELTLEGFPAIVFQHELDHLKGIT 144
>ref|NP_600815.1| (NC_003450) COG0242:N-formylmethionyl-tRNA deformylase
           [Corynebacterium glutamicum]
 dbj|BAB98994.1| (AP005279) N-formylmethionyl-tRNA deformylase [Corynebacterium
           glutamicum ATCC 13032]
          Length = 169

 Score = 72.0 bits (175), Expect = 1e-012
 Identities = 44/136 (32%), Positives = 61/136 (44%), Gaps = 4/136 (2%)

Query: 4   LTGDAYELHAIAADVPHGEDVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAPK-- 61
           L GD   +      V   E ++ L+D                NQ+GVL+RV V       
Sbjct: 8   LFGDPVLVSRADEVVDFDESLSTLIDDMFDTMEDAGGVGLAANQVGVLRRVFVFDTSHQE 67

Query: 62  --FKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKG 119
              +G ++NPV    T      +EGCLS P  + +  R+  + + G D    P+   A G
Sbjct: 68  GGLRGHVINPVWEPLTEDTQTGKEGCLSIPDVSAETTRYETVRLSGQDRDGNPVGFVANG 127

Query: 120 LTAFCIQHEIDHLNGV 135
           L A CIQHE DHL+GV
Sbjct: 128 LLARCIQHETDHLDGV 143
>ref|NP_625500.1| (NC_003888) putative polypeptide deformylase [Streptomyces
           coelicolor A3(2)]
 emb|CAC01493.1| (AL391017) putative polypeptide deformylase [Streptomyces
           coelicolor A3(2)]
          Length = 217

 Score = 70.5 bits (171), Expect = 3e-012
 Identities = 46/137 (33%), Positives = 61/137 (43%), Gaps = 7/137 (5%)

Query: 4   LTGDAYELHAIAADVPH-GEDVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAP-- 60
           L GD   LHA  A+V   G ++  LV+                NQ+G   RV V   P  
Sbjct: 56  LLGDPV-LHARCAEVTDFGPELAALVEDLFATMYAAHGVGLAANQVGEAVRVFVYDCPDD 114

Query: 61  ---KFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEA 117
              +  G +VNP +       V   EGCLS PG     +R+++  V GF    +P+ V  
Sbjct: 115 EDERHLGHVVNPRLVETGGVVVRGPEGCLSLPGLEAGTERYDEAVVTGFTVAGEPVTVRG 174

Query: 118 KGLTAFCIQHEIDHLNG 134
            G  A C+QHE DHL G
Sbjct: 175 TGFFARCLQHECDHLEG 191
>ref|NP_562660.1| (NC_003366) polypeptide deformylase [Clostridium perfringens]
 dbj|BAB81450.1| (AP003191) polypeptide deformylase [Clostridium perfringens str.
           13]
          Length = 147

 Score = 68.6 bits (166), Expect = 1e-011
 Identities = 39/91 (42%), Positives = 50/91 (54%), Gaps = 3/91 (3%)

Query: 47  QLGVLKRVIVVRAPKFKGC--IVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVE 104
           Q+G+LKR+ VV A    G    +NP I    SG     EGCLS PG+    KR NKI ++
Sbjct: 51  QVGILKRIFVVDAMDGAGSRVFINPEILEK-SGEQTDEEGCLSLPGRHKPVKRANKIKIK 109

Query: 105 GFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
             D +     ++A+G  A  IQHE DHL GV
Sbjct: 110 ALDVNGNEFVLDAEGFLARAIQHEYDHLEGV 140
>ref|NP_246498.1| (NC_002663) Def [Pasteurella multocida]
 sp|P57948|DEF_PASMU Peptide deformylase (PDF) (Polypeptide deformylase)
 gb|AAK03643.1| (AE006193) Def [Pasteurella multocida]
          Length = 170

 Score = 67.0 bits (162), Expect = 3e-011
 Identities = 37/117 (31%), Positives = 56/117 (47%), Gaps = 4/117 (3%)

Query: 22  EDVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAPKFKG---CIVNPVITRHTSGH 78
           +++  +VD                 Q+ + +R+I +     K     ++NP I   + G 
Sbjct: 26  DEIREIVDNMFETMYLEEGIGLAATQVNIHQRIITIDVEGTKENQYVLINPEII-DSCGE 84

Query: 79  VNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
               EGCLS PG      R  K+T++  D H +   + A+GL A CIQHEIDHLNG+
Sbjct: 85  TGIEEGCLSLPGFRGFVPRKEKVTIKALDRHGEEYTLSAEGLLAICIQHEIDHLNGI 141
>ref|NP_541241.1| (NC_003318) POLYPEPTIDE DEFORMYLASE [Brucella melitensis]
 gb|AAL53505.1| (AE009665) POLYPEPTIDE DEFORMYLASE [Brucella melitensis]
          Length = 187

 Score = 66.6 bits (161), Expect = 4e-011
 Identities = 34/76 (44%), Positives = 42/76 (54%)

Query: 60  PKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKG 119
           PK     VNP I + +       EGCLS P    + +R   + V  FDA  +P  +EA G
Sbjct: 81  PKAPHIFVNPTIVQSSDKRSTYEEGCLSIPDYYAEVERPATVKVNYFDADGKPQSMEADG 140

Query: 120 LTAFCIQHEIDHLNGV 135
           L A C+QHEIDHLNGV
Sbjct: 141 LMATCLQHEIDHLNGV 156
>ref|NP_601934.1| (NC_003450) COG0242:N-formylmethionyl-tRNA deformylase
           [Corynebacterium glutamicum]
 dbj|BAC00133.1| (AP005282) N-formylmethionyl-tRNA deformylase [Corynebacterium
           glutamicum ATCC 13032]
          Length = 193

 Score = 66.2 bits (160), Expect = 6e-011
 Identities = 45/144 (31%), Positives = 63/144 (43%), Gaps = 24/144 (16%)

Query: 2   KVLTGDAYELHAIAADVPHGEDVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAPK 61
           +++T D  EL  + AD+    DV   V                 NQ+GV KR+ V   P 
Sbjct: 19  QLVTEDVSELQELIADMYETMDVANGVGLAA-------------NQIGVSKRIFVYDCPD 65

Query: 62  -----FKGCIVNPVITRHTSGHV------NSREGCLSFPGKTVDKKRHNKITVEGFDAHW 110
                 KGC +NPV+              +  EGCLS PG+     R +   V G +   
Sbjct: 66  DEGVMHKGCFINPVLETSEIPETMPADDGSDEEGCLSVPGEGFPTGRAHWAKVTGLNEKG 125

Query: 111 QPIKVEAKGLTAFCIQHEIDHLNG 134
           + + VEA+G  A C QHE+ HL+G
Sbjct: 126 EEVSVEAEGFLARCFQHEVGHLDG 149
>ref|NP_389454.1| (NC_000964) polypeptide deformylase [Bacillus subtilis]
 sp|P94462|DEF1_BACSU Peptide deformylase 1 (PDF 1) (Polypeptide deformylase 1)
 pir||F69613 polypeptide deformylase def - Bacillus subtilis
 emb|CAA71349.1| (Y10304) polypeptide deformylase [Bacillus subtilis]
 emb|CAA74262.1| (Y13937) putative Def protein [Bacillus subtilis]
 emb|CAB13445.1| (Z99112) polypeptide deformylase [Bacillus subtilis]
          Length = 160

 Score = 65.9 bits (159), Expect = 7e-011
 Identities = 39/91 (42%), Positives = 51/91 (55%), Gaps = 3/91 (3%)

Query: 47  QLGVLKRVIVVRAPKFKGCI--VNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVE 104
           Q+G+LKR  VV     +G I  VNP I    SG     EGCLSFP    D  R + + V 
Sbjct: 51  QIGILKRAAVVEIGDDRGRIDLVNPEILEK-SGEQTGIEGCLSFPNVYGDVTRADYVKVR 109

Query: 105 GFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
            F+   +P  +EA+G  A  +QHE+DHL+GV
Sbjct: 110 AFNRQGKPFILEARGFLARAVQHEMDHLDGV 140
>ref|NP_269070.1| (NC_002737) putative polypeptide deformylase [Streptococcus
           pyogenes] [Streptococcus pyogenes M1 GAS]
 gb|AAK33791.1| (AE006536) putative polypeptide deformylase [Streptococcus pyogenes
           M1 GAS]
          Length = 136

 Score = 63.9 bits (154), Expect = 3e-010
 Identities = 35/92 (38%), Positives = 52/92 (56%), Gaps = 2/92 (2%)

Query: 46  NQLGVLKRVIVVRAPKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEG 105
           N +G  KR+++V        + NPV+     G   ++E CLS  G     +R++KITVE 
Sbjct: 47  NMIGEQKRIVIVSMGFIDLVMFNPVMVSK-KGIYQTKESCLSLSGYR-KTQRYDKITVEY 104

Query: 106 FDAHWQPIKVEAKGLTAFCIQHEIDHLNGVTI 137
            D +W+P ++   GLTA   QHE+DHL G+ I
Sbjct: 105 LDHNWRPKRLSLTGLTAQICQHELDHLEGILI 136
>ref|NP_607075.1| (NC_003485) putative polypeptide deformylase [Streptococcus
           pyogenes MGAS8232]
 gb|AAL97574.1| (AE010021) putative polypeptide deformylase [Streptococcus pyogenes
           MGAS8232]
          Length = 136

 Score = 63.9 bits (154), Expect = 3e-010
 Identities = 35/92 (38%), Positives = 52/92 (56%), Gaps = 2/92 (2%)

Query: 46  NQLGVLKRVIVVRAPKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEG 105
           N +G  KR+++V        + NPV+     G   ++E CLS  G     +R++KITVE 
Sbjct: 47  NMIGEQKRIVIVSMGFIDLVMFNPVMVSK-KGIYQTKESCLSLSGYR-KTQRYDKITVEY 104

Query: 106 FDAHWQPIKVEAKGLTAFCIQHEIDHLNGVTI 137
            D +W+P ++   GLTA   QHE+DHL G+ I
Sbjct: 105 LDHNWRPKRLSLTGLTAQICQHELDHLEGILI 136
>pdb|2DEF|  Peptide Deformylase Catalytic Core (Residues 1 - 147), Nmr, 20
           Structures
          Length = 147

 Score = 60.5 bits (145), Expect = 3e-009
 Identities = 34/115 (29%), Positives = 54/115 (46%), Gaps = 4/115 (3%)

Query: 23  DVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAPKFKG---CIVNPVITRHTSGHV 79
           ++  +VD                 Q+ + +R+IV+   + +     ++NP +    SG  
Sbjct: 26  EIQRIVDDMFETMYAEEGIGLAATQVDIHQRIIVIDVSENRDERLVLINPELLEK-SGET 84

Query: 80  NSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNG 134
              EGCLS P +     R  K+ +   D   +P ++EA GL A CIQHE+DHL G
Sbjct: 85  GIEEGCLSIPEQRALVPRAEKVKIRALDRDGKPFELEADGLLAICIQHEMDHLVG 139
>ref|NP_266716.1| (NC_002662) polypeptide deformylase [Lactococcus lactis subsp.
           lactis]
 gb|AAK04658.1|AE006290_3 (AE006290) polypeptide deformylase [Lactococcus lactis subsp.
           lactis]
          Length = 211

 Score = 58.9 bits (141), Expect = 9e-009
 Identities = 40/114 (35%), Positives = 53/114 (46%), Gaps = 27/114 (23%)

Query: 46  NQLGVLKRVIVVRAP------------------KFKGCIVNPVITRHT--SGHVNSREGC 85
           NQLG+LK+VI V  P                  K +  + N  +  H+     V   EGC
Sbjct: 79  NQLGLLKKVIAVLIPNEPEVDEDGNEIPPKEAYKMREIMYNAKVVSHSVQDAAVEGGEGC 138

Query: 86  LSF----PGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
           LS     PG  V   RH ++TVE ++   +  K+  K   A C+QHEIDH NGV
Sbjct: 139 LSVDREVPGYVV---RHARVTVEYYNKEGEKKKIRLKDFPAICVQHEIDHTNGV 189
>sp|Q48661|DEF_LACLA Peptide deformylase (PDF) (Polypeptide deformylase)
          Length = 196

 Score = 58.9 bits (141), Expect = 9e-009
 Identities = 40/114 (35%), Positives = 53/114 (46%), Gaps = 27/114 (23%)

Query: 46  NQLGVLKRVIVVRAP------------------KFKGCIVNPVITRHT--SGHVNSREGC 85
           NQLG+LK+VI V  P                  K +  + N  +  H+     V   EGC
Sbjct: 64  NQLGLLKKVIAVLIPNEPEVDEDGNEIPPKEAYKMREIMYNAKVVSHSVQDAAVEGGEGC 123

Query: 86  LSF----PGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
           LS     PG  V   RH ++TVE ++   +  K+  K   A C+QHEIDH NGV
Sbjct: 124 LSVDREVPGYVV---RHARVTVEYYNKEGEKKKIRLKDFPAICVQHEIDHTNGV 174
>ref|NP_359915.1| (NC_003103) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
           conorii]
 sp|Q92IZ1|DEF_RICCN Peptide deformylase (PDF) (Polypeptide deformylase)
 gb|AAL02816.1| (AE008593) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
           conorii]
          Length = 175

 Score = 55.5 bits (132), Expect = 1e-007
 Identities = 35/98 (35%), Positives = 49/98 (49%), Gaps = 10/98 (10%)

Query: 47  QLGVLKRVIV--------VRAPK--FKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKK 96
           Q+GVLKR++V        V  PK  +   IVNP I       V + EGC+S P + ++  
Sbjct: 51  QVGVLKRILVIDIQDHDSVARPKDFYPLFIVNPEIIEKAEELVTANEGCISLPEQRIEVA 110

Query: 97  RHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNG 134
           R   I +   D H +  +++A    A  IQHE DHL G
Sbjct: 111 RPESIKIRYLDYHGKSRELKANDWLARVIQHEYDHLEG 148
>sp|Q9FV53|DEFM_ARATH Peptide deformylase, mitochondrial precursor (PDF) (Polypeptide
           deformylase)
 gb|AAD39667.1|AC007591_32 (AC007591) Simalar to gi|4377403 Polypeptide Deformylase from
           Chlamydia pneumoniae genome gb|AE001687. [Arabidopsis
           thaliana]
 gb|AAK32873.1|AF361861_1 (AF361861) At1g15390/F9L1_34 [Arabidopsis thaliana]
          Length = 259

 Score = 52.4 bits (124), Expect = 9e-007
 Identities = 45/155 (29%), Positives = 67/155 (43%), Gaps = 24/155 (15%)

Query: 3   VLTGDAYELHAIAADVPHGE----DVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVR 58
           V +GD   LH  A +V  GE     +  ++D                 Q+GV  R+IV+ 
Sbjct: 74  VASGDPV-LHEKAREVDPGEIGSERIQKIIDDMIKVMRLAPGVGLAAPQIGVPLRIIVLE 132

Query: 59  --------APKFK-----------GCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHN 99
                   APK +             +VNPV+   ++      EGCLS  G     +R+ 
Sbjct: 133 DTKEYISYAPKEEILAQERRHFDLMVMVNPVLKERSNKKALFFEGCLSVDGFRAAVERYL 192

Query: 100 KITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNG 134
           ++ V G+D   + I+V A G  A  +QHE DHL+G
Sbjct: 193 EVVVTGYDRQGKRIEVNASGWQARILQHECDHLDG 227
>ref|NP_345996.1| (NC_003028) polypeptide deformylase [Streptococcus pneumoniae
           TIGR4]
 ref|NP_359001.1| (NC_003098) Formylmethionine deformylase [Streptococcus pneumoniae
           R6]
 gb|AAK75636.1| (AE007449) polypeptide deformylase [Streptococcus pneumoniae TIGR4]
 gb|AAL00212.1| (AE008510) Formylmethionine deformylase [Streptococcus pneumoniae
           R6]
 gb|AAK13237.1| (AY014508) peptide deformylase-like protein [Streptococcus
           pneumoniae]
          Length = 136

 Score = 52.0 bits (123), Expect = 1e-006
 Identities = 32/89 (35%), Positives = 42/89 (46%), Gaps = 2/89 (2%)

Query: 46  NQLGVLKRVIVVRAPKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEG 105
           N +GV KRVI+         + NPV+     G   + EGCLS  G     KR+  I +  
Sbjct: 47  NMIGVQKRVIIFNLGLVPVVMFNPVLLSF-EGSYEAEEGCLSLVGVR-STKRYETIRLAY 104

Query: 106 FDAHWQPIKVEAKGLTAFCIQHEIDHLNG 134
            D+ WQ   +   G  A   QHE+DHL G
Sbjct: 105 RDSKWQEQTITLTGFPAQICQHELDHLEG 133
>gb|EAA02964.1| (AAAB01007479) agCP10822 [Anopheles gambiae str. PEST]
          Length = 232

 Score = 44.7 bits (104), Expect = 2e-004
 Identities = 24/70 (34%), Positives = 35/70 (49%)

Query: 66  IVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCI 125
           ++NP +       V   E C S  G   D  R+ +I ++GFDA     ++   G  A   
Sbjct: 130 LLNPELKVLNYEKVIHTEACESVRGYRADVPRYREILLQGFDATGNRQELRLSGWNARIA 189

Query: 126 QHEIDHLNGV 135
           QHE+DHLNG+
Sbjct: 190 QHEMDHLNGI 199
>gb|AAG09295.1|AF177768_1 (AF177768) ORF260 [Trypanosoma brucei]
          Length = 260

 Score = 38.9 bits (89), Expect = 0.010
 Identities = 24/76 (31%), Positives = 32/76 (41%), Gaps = 5/76 (6%)

Query: 67  VNPVITRHTSGH-----VNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLT 121
           VNP +  +   H         E C+S    T    R   IT  G D +         G+ 
Sbjct: 113 VNPTVPGYDDRHSIAPMYGMWENCISCGACTAWVIRPQSITCSGLDEYGNEKTEVLDGMR 172

Query: 122 AFCIQHEIDHLNGVTI 137
           A C+ HE+DHL+G TI
Sbjct: 173 ARCLMHELDHLSGKTI 188
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.321 0.139 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 80,229,907 Number of Sequences: 1028106 Number of extensions: 2754842 Number of successful extensions: 3540 Number of sequences better than 5.0e-02: 119 Number of HSP's better than 0.1 without gapping: 103 Number of HSP's successfully gapped in prelim test: 16 Number of HSP's that attempted gapping in prelim test: 3412 Number of HSP's gapped (non-prelim): 122 length of query: 137 length of database: 324,535,921 effective HSP length: 113 effective length of query: 24 effective length of database: 208,359,943 effective search space: 5000638632 effective search space used: 5000638632 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 83 (36.6 bits) Query= orf58 [14371-14472] (100 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.133 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 52,888,008 Number of Sequences: 1028106 Number of extensions: 1611870 Number of successful extensions: 4417 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4417 Number of HSP's gapped (non-prelim): 0 length of query: 100 length of database: 324,535,921 effective HSP length: 76 effective length of query: 24 effective length of database: 246,399,865 effective search space: 5913596760 effective search space used: 5913596760 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 84 (37.0 bits) Query= orf59 [14471-14595] (123 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.325 0.141 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 89,577,650 Number of Sequences: 1028106 Number of extensions: 3520258 Number of successful extensions: 6513 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 6513 Number of HSP's gapped (non-prelim): 0 length of query: 123 length of database: 324,535,921 effective HSP length: 99 effective length of query: 24 effective length of database: 222,753,427 effective search space: 5346082248 effective search space used: 5346082248 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 84 (37.0 bits) Query= orf60 [14593-15385] (791 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_489430.1|  (NC_003273) hypothetical protein [Nostoc s...   101  3e-020
ref|NP_646225.1|  (NC_003923) hypothetical protein [Staphylo...    86  1e-015
ref|NP_371315.1|  (NC_002758) hypothetical protein [Staphylo...    84  7e-015
gb|AAL67615.1|  (U93688) orf15 [Staphylococcus aureus]             84  7e-015
gb|AAL04136.1|AF410775_12  (AF410775) unknown [Staphylococcu...    84  7e-015
gb|AAB00938.1|  (L31763) virulence-associated protein E [Dic...    83  2e-014
ref|NP_569125.1|  (NC_003374) hypothetical protein [Gluconob...    80  7e-014
pir||S49113  hypothetical protein 2 - Microcystis aeruginosa...    74  9e-012
ref|NP_279342.1|  (NC_002607) Vng0215c [Halobacterium sp. NR...    59  2e-007
>ref|NP_489430.1| (NC_003273) hypothetical protein [Nostoc sp. PCC 7120]
 dbj|BAB77438.1| (AP003604) ORF_ID:all8519~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 836

 Score =  101 bits (252), Expect = 3e-020
 Identities = 92/338 (27%), Positives = 161/338 (47%), Gaps = 23/338 (6%)

Query: 467 KSDIKGMIAYMAHRYAHNPVKAWVESAP--WDGLDHVGLLFSHITLTPDEDRPMCEHLFR 524
           K   K ++  +A   +++PV  +++S    +  +D V LL          + P+   L +
Sbjct: 341 KEQAKSVVLELALANSYSPVVEYLQSVERRYPNVD-VSLLDKLAERYFGTNDPLHAALMK 399

Query: 525 KWMRAAVSAGVGDQEGC--EPVLVFVDEQGGVGKTRFFRTLCPEPFRADSVLLDVKDKDS 582
           + + AAV+     + GC  + + +   +Q  + K+ F+  L  E F  D   L+  +KD 
Sbjct: 400 RTLIAAVARAF--EPGCKHDEITILQGKQKSL-KSTFWEILAGEEFFTDD--LNGTEKDE 454

Query: 583 VKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVN 642
           V +   YW++E  E + ++ + + + LKAFLSR KD +R PYG     +PR +  VG+ N
Sbjct: 455 VLKISQYWILEYAEFENSYKKKDVSQLKAFLSRKKDSMRRPYGTDIEDFPRPSILVGTTN 514

Query: 643 EAEFLVDTTSNRRFIPMKVAALNHQHRVNTQQ---AWAQALAEARSG---AVTYVEAHEV 696
             EFL D T  RRF  +KV       ++  Q+    WA A+A  R+G    +T VE   +
Sbjct: 515 LDEFLYDPTGERRFWVIKVLFKKIPIKMLQQERDLIWAAAVAAYRNGEQWRLTEVEDEWL 574

Query: 697 AERNASFQATSAIDDVLSSRLNEATGERNVHMTVTDLLKRCGMHNPTKRDLNDAGKWLRS 756
              N  +Q+T   ++ + +   E   E +V   ++D+LK   +   +K + N     LRS
Sbjct: 575 DAANKQYQSTDTWEEAVMA-WAELQKEVSVGEILSDVLK-IELAKQSKAEQNRVAAILRS 632

Query: 757 NGYEKRIRSGVRGFML-----PDMSIGAQAFGGPQLEV 789
           +G+ +  R  V G ++     P   +  +   G + EV
Sbjct: 633 HGWTRGDRKRVNGRLIRPWVRPQQEVEQEVERGVEQEV 670
>ref|NP_646225.1| (NC_003923) hypothetical protein [Staphylococcus aureus subsp.
           aureus MW2]
 dbj|BAB95273.1| (AP004827) hypothetical protein [Staphylococcus aureus subsp.
           aureus MW2]
          Length = 815

 Score = 86.3 bits (212), Expect = 1e-015
 Identities = 78/284 (27%), Positives = 133/284 (46%), Gaps = 29/284 (10%)

Query: 449 DEAALSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHI 508
           D +  S IE I +++H  K+  K  I  +A + A++PV+ ++    WDG   +  LF   
Sbjct: 437 DSSLRSYIEKIYDIHHSGKT--KDAIISVAMQNAYHPVRDYLNKISWDGHKRLEKLFIKY 494

Query: 509 TLTPDEDRPMCEHLFRKWMRAAVSAGVGD--QEGCE-PVLVFVDEQGGVGKTRFFRTLCP 565
               D +      + R   + A++AG+    + GC+   ++ +    GVGK+   + L  
Sbjct: 495 LGVEDTE------VNRTTTKKALTAGIARVMEPGCKFDYMLTLYGPQGVGKSALLKKLGG 548

Query: 566 EPFRADSVLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYG 625
             F +DS L+ V  K++ +     WL+E+ EL  T  ++E  A+K F+S+  D  R+ YG
Sbjct: 549 AWF-SDS-LVSVTGKEAYEALQGVWLMEMAELAAT-RKAEVEAIKHFISKQVDRFRVAYG 605

Query: 626 RTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMKV------AALNHQHRVNTQQAWAQA 679
                +PR   ++G+ N+ +FL D T  RRF PM V         +   +    Q WA+A
Sbjct: 606 HYIEDFPRQCIFIGTTNKVDFLRDETGGRRFWPMTVNPERVEVNWSKLTKEEIDQIWAEA 665

Query: 680 LAEARSGAVTYVEAHEVAERNASFQA--------TSAIDDVLSS 715
                 G   ++   E+ E   S Q+        T  ID+ L++
Sbjct: 666 KYYYEQGEELFLNP-ELEEEMRSIQSKHTEESPYTGIIDEYLNT 708
>ref|NP_371315.1| (NC_002758) hypothetical protein [Staphylococcus aureus subsp.
           aureus Mu50]
 dbj|BAB56953.1| (AP003360) hypothetical protein [Staphylococcus aureus subsp.
           aureus Mu50]
          Length = 489

 Score = 84.0 bits (206), Expect = 7e-015
 Identities = 59/207 (28%), Positives = 97/207 (46%), Gaps = 7/207 (3%)

Query: 453 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 512
           +S I+ + N+  + +  I  +I   A++   +P+K+ +ES  WDG+  +  LF  I    
Sbjct: 95  ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 151

Query: 513 DEDRPMCEHLFRKWMRAAVSAGVGDQEGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRADS 572
            ED      + +KWM  AV+         + +++    QG VGK+     L    +    
Sbjct: 152 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 209

Query: 573 VLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 632
            +   K  +  K+    W+ E+ EL   F +S    +K F+S   D  R  YG+   ++P
Sbjct: 210 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 267

Query: 633 RMTAYVGSVNEAEFLVDTTSNRRFIPM 659
           R   +VG+ N  EFL D T NRRF P+
Sbjct: 268 RQCVFVGTTNNYEFLKDQTGNRRFFPI 294
>gb|AAL67615.1| (U93688) orf15 [Staphylococcus aureus]
          Length = 477

 Score = 84.0 bits (206), Expect = 7e-015
 Identities = 59/207 (28%), Positives = 97/207 (46%), Gaps = 7/207 (3%)

Query: 453 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 512
           +S I+ + N+  + +  I  +I   A++   +P+K+ +ES  WDG+  +  LF  I    
Sbjct: 87  ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 143

Query: 513 DEDRPMCEHLFRKWMRAAVSAGVGDQEGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRADS 572
            ED      + +KWM  AV+         + +++    QG VGK+     L    +    
Sbjct: 144 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 201

Query: 573 VLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 632
            +   K  +  K+    W+ E+ EL   F +S    +K F+S   D  R  YG+   ++P
Sbjct: 202 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 259

Query: 633 RMTAYVGSVNEAEFLVDTTSNRRFIPM 659
           R   +VG+ N  EFL D T NRRF P+
Sbjct: 260 RQCVFVGTTNNYEFLKDQTGNRRFFPI 286
>gb|AAL04136.1|AF410775_12 (AF410775) unknown [Staphylococcus aureus]
          Length = 474

 Score = 84.0 bits (206), Expect = 7e-015
 Identities = 59/207 (28%), Positives = 97/207 (46%), Gaps = 7/207 (3%)

Query: 453 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 512
           +S I+ + N+  + +  I  +I   A++   +P+K+ +ES  WDG+  +  LF  I    
Sbjct: 84  ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 140

Query: 513 DEDRPMCEHLFRKWMRAAVSAGVGDQEGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRADS 572
            ED      + +KWM  AV+         + +++    QG VGK+     L    +    
Sbjct: 141 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 198

Query: 573 VLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 632
            +   K  +  K+    W+ E+ EL   F +S    +K F+S   D  R  YG+   ++P
Sbjct: 199 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 256

Query: 633 RMTAYVGSVNEAEFLVDTTSNRRFIPM 659
           R   +VG+ N  EFL D T NRRF P+
Sbjct: 257 RQCVFVGTTNNYEFLKDQTGNRRFFPI 283
>gb|AAB00938.1| (L31763) virulence-associated protein E [Dichelobacter nodosus]
          Length = 437

 Score = 82.8 bits (203), Expect = 2e-014
 Identities = 57/194 (29%), Positives = 94/194 (48%), Gaps = 6/194 (3%)

Query: 466 PKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTPDEDRPMCEHLFRK 525
           P   I   I  +A  +   P++ ++ES  WDG   +   F        E       +   
Sbjct: 96  PIDSIDMGIKMLAREHIFCPIQQYLESLQWDGKPRIATAFHDFLGV--ERNTYTIGISTL 153

Query: 526 WMRAAVSAGVGDQEGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRADSVLLDVKDKDSVKQ 585
           + RA V+         + V+V   +QG +GK+ F R L  E +     +   + K++++ 
Sbjct: 154 FFRALVARVYQPAIKFDHVIVLQGDQG-IGKSSFLRILGGEWY--SDGIRKFEGKEAIEA 210

Query: 586 AISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEAE 645
                + E+ EL G FS++E   +KAF++RT+D  R+ YGR    +PR T +VG+ N  +
Sbjct: 211 IQGVLIGEIPELQG-FSKAEIEEIKAFITRTEDRARMAYGRRVQSFPRRTVFVGTTNADD 269

Query: 646 FLVDTTSNRRFIPM 659
           +L D T NRRF P+
Sbjct: 270 YLRDATGNRRFFPI 283
>ref|NP_569125.1| (NC_003374) hypothetical protein [Gluconobacter oxydans]
 emb|CAD21834.1| (AJ428837) hypothetical protein [Gluconobacter oxydans]
          Length = 416

 Score = 80.5 bits (197), Expect = 7e-014
 Identities = 68/233 (29%), Positives = 102/233 (43%), Gaps = 27/233 (11%)

Query: 464 HYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGL-LFSHITLTPDEDRPMCEHL 522
           ++    +   I  +A   A++P++ ++    WDG+  +   L   +     E       L
Sbjct: 99  NFSTQQVSAAIQSVAEENAYHPIRDYLTGLEWDGVQRLDTWLHDFLGAEDSEYTAGVGRL 158

Query: 523 FRKWMRAAVSAGVGDQEGC--EPVLVFVDEQGG---------VGKTRFFRTLCPEPFRAD 571
           F   M A V      + GC  + VL+F   QG           G +R+F    P P R  
Sbjct: 159 FLIGMVARVM-----EPGCKNDHVLIFEGNQGIRKSTACKVLAGGSRYFSDSLP-PIREG 212

Query: 572 SVLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKY 631
           S       KD        WLVE+ E+    SR+++ ALKAFL+RT +  R  YGR  +  
Sbjct: 213 S-------KDLASHLAGKWLVEVAEMSAA-SRADTEALKAFLTRTHETYRPAYGRNEITQ 264

Query: 632 PRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVN-TQQAWAQALAEA 683
           PR    VG+ N  E+L D+T +RR  P+K         ++  Q+A  Q  AEA
Sbjct: 265 PRQCVLVGTTNRKEYLRDSTGDRRVWPVKCGVNGEPCDIDGLQEARDQLFAEA 317
>pir||S49113 hypothetical protein 2 - Microcystis aeruginosa
 emb|CAA82191.1| (Z28337) ORF2 [Microcystis aeruginosa]
          Length = 502

 Score = 73.6 bits (179), Expect = 9e-012
 Identities = 68/228 (29%), Positives = 105/228 (45%), Gaps = 30/228 (13%)

Query: 556 KTRFFRTLCPEPFRADSVLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSR 615
           K+ FF+TL  E F  D+++    DKD +      W++E  ELD   +++    +K+FLSR
Sbjct: 278 KSTFFKTLGGEYF-TDAMMST--DKDGLLTMAKNWIIEWSELDHFTAKTYHGIIKSFLSR 334

Query: 616 TKDDIRLPYGRTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVNTQQA 675
           + D  R PY R  M +PR +  VGS N   FL D T NRRF    +  +  +H++N    
Sbjct: 335 SDDFYREPYAREAMSHPRCSMIVGSTNRDTFLNDPTGNRRF---WIIPIPKKHKINID-- 389

Query: 676 WAQALAEARSGAVT--YVEAHEV----------AERNASFQATSAIDDVLSSRLNEATGE 723
           + QA  E   G     Y++              AE N  ++   + +D L+  L     +
Sbjct: 390 YLQAYREKLLGWAVWRYLDGEPCQLPEEFKALQAEANKQWENNDSWEDELAEFL-----D 444

Query: 724 RNVHMTVTDLLKRCGMH----NPTKRDLNDAGKWLRSNGY-EKRIRSG 766
           R   ++V + L+R        N  + D    G  LR  G+  KRI+ G
Sbjct: 445 RESDLSVKECLQRLEDRGYPINFGRSDEMRMGDILRKAGFSRKRIQRG 492
>ref|NP_279342.1| (NC_002607) Vng0215c [Halobacterium sp. NRC-1]
 gb|AAG18822.1| (AE004986) Vng0215c [Halobacterium sp. NRC-1]
          Length = 857

 Score = 59.3 bits (142), Expect = 2e-007
 Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 6/99 (6%)

Query: 188 DAKTIREALRHIDPGETREEWVKVGLSLKSGFGDDPQGLSLFDEWSSGALWRDGDEPANY 247
           D   I +AL H++P    +EW+K+G ++   +     G  LF++WS G     G +  N 
Sbjct: 220 DKSDIEDALSHVNPDCGYDEWLKLGFAVHD-WDSGSTGKRLFEQWSKG-----GSKWDNQ 273

Query: 248 VPEHIETQWDSFKAEGGRTIATVYHKAIEGGWQPPAGIN 286
              +I+  WDS     G T+ T+ H A +GGW  P   N
Sbjct: 274 AERNIQDIWDSASEGQGVTVGTLIHYAKDGGWTVPTRSN 312
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.316 0.134 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 516,679,759 Number of Sequences: 1028106 Number of extensions: 22303856 Number of successful extensions: 57628 Number of sequences better than 5.0e-02: 15 Number of HSP's better than 0.1 without gapping: 11 Number of HSP's successfully gapped in prelim test: 4 Number of HSP's that attempted gapping in prelim test: 57601 Number of HSP's gapped (non-prelim): 20 length of query: 791 length of database: 324,535,921 effective HSP length: 129 effective length of query: 662 effective length of database: 191,910,247 effective search space: 127044583514 effective search space used: 127044583514 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 96 (41.6 bits) Query= orf61 [15399-15457] (57 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.320 0.134 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 19,570,331 Number of Sequences: 1028106 Number of extensions: 295251 Number of successful extensions: 533 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 533 Number of HSP's gapped (non-prelim): 0 length of query: 57 length of database: 324,535,921 effective HSP length: 33 effective length of query: 24 effective length of database: 290,608,423 effective search space: 6974602152 effective search space used: 6974602152 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 85 (37.4 bits) Query= orf62 [15563-15814] (250 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 1,028,106 sequences; 324,535,921 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Jul 12, 2002 12:04 AM Number of letters in database: 324,535,921 Number of sequences in database: 1,028,106 Lambda K H 0.318 0.130 0.000 Gapped Lambda K H 0.267 0.0410 4.65e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 133,621,696 Number of Sequences: 1028106 Number of extensions: 4833468 Number of successful extensions: 15642 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 15642 Number of HSP's gapped (non-prelim): 0 length of query: 250 length of database: 324,535,921 effective HSP length: 118 effective length of query: 132 effective length of database: 203,219,413 effective search space: 26824962516 effective search space used: 26824962516 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 90 (39.3 bits)