National Center for Biotechnology Information (NCBI)

welcome to the blast network service.


PEPTIDE SEQUENCE DATABASES

 nr        Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
 month     All new or revised GenBank CDS translation+PDB+SwissProt+PIR
           sequences released in the last 30 days
 pdb       PDB protein sequences
 yeast     Yeast (Saccharomyces cerevisiae) protein sequences.
 kabat     Kabat Sequences of Proteins of Immunological Interest
 alu *     Translations of Select Alu Repeats from REPBASE
 swissprot SwissProt sequences


NUCLEOTIDE SEQUENCE DATABASES

 nr       Non-redundant GenBank+EMBL+DDBJ+PDB sequences
          (but no EST, STS, GSS, or HTGS sequences) 
 month    All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
          the last 30 days
 yeast    Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
 est +    Non-redundant Database of GenBank+EMBL+DDBJ EST Division
 sts +    Non-redundant Database of GenBank+EMBL+DDBJ STS Division
 htgs     High Throughput Genomic Sequences 
 pdb      PDB nucleotide sequences
 vector   Vector subset of GenBank
 mito *   Database of mitochondrial sequences, Rel. 1.0, July 1995
 gss      Genome Survey Sequences (includes single-pass genomic data, exon-
          trapped sequences, and Alu PCR primers.
 kabat    Kabat Sequences of Nucleic Acid of Immunological Interest
 epd      Eukaryotic Promotor Database
 alu *+   Select Alu Repeats from REPBASE



BLASTP 2.2.2 [Dec-14-2001]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= orf1 [1-217]
         (213 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
           884,779 sequences; 277,083,050 total letters

Searching done
  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF
    Posted date:  Feb 23, 2002  3:08 AM
  Number of letters in database: 277,083,050
  Number of sequences in database:  884,779
  
Lambda     K      H
   0.314    0.129    0.000 

Gapped
Lambda     K      H
   0.267   0.0410    0.000 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 90,918,893
Number of Sequences: 884779
Number of extensions: 2942183
Number of successful extensions: 6784
Number of sequences better than 5.0e-02: 0
Number of HSP's better than  0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6784
Number of HSP's gapped (non-prelim): 0
length of query: 213
length of database: 277,083,050
effective HSP length: 115
effective length of query: 98
effective length of database: 175,333,465
effective search space: 17182679570
effective search space used: 17182679570
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 88 (38.5 bits)
Query= orf2 [211-887]
         (675 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
           884,779 sequences; 277,083,050 total letters

Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_518974.1|  (NC_003295) PROBABLE BACTERIOPHAGE-RELATED...   205  1e-051
dbj|BAA89640.1|  (AB036666) similar to terminase large subun...   203  5e-051
dbj|BAA89621.1|  (AB036665) similar to terminase large subun...   201  3e-050
ref|NP_297798.1|  (NC_002488) phage-related terminase large ...   115  2e-024
ref|NP_298001.1|  (NC_002488) phage-related terminase large ...    96  1e-018
ref|NP_461544.1|  (NC_003197) Gifsy-1 prophage: similar to t...    93  1e-017
ref|NP_310206.1|  (NC_002695) putative terminase large subun...    92  1e-017
ref|NP_287368.1|  (NC_002655) putative DNA packaging protein...    92  2e-017
ref|NP_040581.1|  (NC_001416) A (DNA packaging;641) [bacteri...    91  3e-017
sp|P36693|TERL_BPP21  TERMINASE LARGE SUBUNIT (GP2) >gi|5407...    89  1e-016
>ref|NP_518974.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD14555.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 660

 Score =  205 bits (521), Expect = 1e-051
 Identities = 178/621 (28%), Positives = 282/621 (44%), Gaps = 59/621 (9%)

Query: 29  LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
           L  SEW++R R +  + S  PG +     PY R ++DC  P SP+  V   K AQ+GAT 
Sbjct: 26  LTVSEWSDRHRMLSSKASAEPGRWRTGRTPYLRAIMDCLSPTSPIERVVFMKAAQLGAT- 84

Query: 89  SVLENIIAYGIDYVKTASM-IFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGT 147
            +  N I Y I +     M ++ TV  ++ KR     I P++E S     + A   +   
Sbjct: 85  EMGSNWIGYVIHHAPGPMMAVWPTV--EMAKRNSKQRIDPLIEESA----VLAERIAPAR 138

Query: 148 KR-SGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLT 206
            R SG T    E+  GGVL   GA S   +RS PV +L  DEV G+P  V  +GD + L 
Sbjct: 139 SRDSGNTILAKEF-RGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLA 197

Query: 207 ETRTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKK 266
           E RT +F A RK+  +STP  AG  TI + ++  DQR + VPC HC   Q L F      
Sbjct: 198 EARTRTF-ARRKIFIVSTPTIAGASTIEREYDASDQRRYFVPCPHCDHRQWLRFE----- 251

Query: 267 GQGRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHL 326
              +L W T+G    P++  Y+C  C   +    K   + +G W    +        +HL
Sbjct: 252 ---QLRW-TKG---EPETAAYICEACFEPIHEHHKAWMLAQGEWRAMVETGGRT-AGFHL 303

Query: 327 SAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREV 386
           S++Y+P   RSW+EIA AW      +  +A     ++ F N +LGE + ++       + 
Sbjct: 304 SSLYSPVGWRSWREIAAAWESAVSKESGSAA---AIKTFRNTELGETW-VEEGEAPDWQR 359

Query: 387 YAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYV 446
              RR+ Y +G VP    +  AG  ++   + V V           WA      G   ++
Sbjct: 360 LLERREDYPIGTVPAGGLLLSAGADVQKDRIEVSV-----------WAFG---RGKEAWL 405

Query: 447 IDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYCNDW 506
           +++  + GD  T     WK+L E++  +   + G    +AR+ +D    T   Y +    
Sbjct: 406 VEHRVLMGD--TARDAVWKRLAELVEEQWTHASGATMPLARLALDTGFATQEAYAFVRAC 463

Query: 507 GE-NVIPIRG--RDLPIKGAQIKY---FNRQMSEKGVEYLSVTVDLYKDRWSPTLRK--E 558
           G+  V+ ++G  R   + G         N +   +G++  +V V + K  +   LRK  +
Sbjct: 464 GDARVMAVKGTARGAALIGTPTAVDVTRNGKKLRRGIKVFTVAVGIAKLEFYNNLRKAAD 523

Query: 559 WSGQG---EMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRNEM 615
            +  G     P G  + P  I++++L++L  E     RD +   I  + W +    RNE 
Sbjct: 524 VAKDGATIAFPTGFVHLP-KIDAEFLQQLCAEQLITRRDRSGFPI--RAWQKM-RERNEA 579

Query: 616 WDTLIYNTAIFESMVLEACED 636
            D  +Y  A   +  L+  E+
Sbjct: 580 LDCYVYARAAASAAGLDRFEE 600
>dbj|BAA89640.1| (AB036666) similar to terminase large subunit of phage lambda
           [Wolbachia sp. wKue]
          Length = 609

 Score =  203 bits (516), Expect = 5e-051
 Identities = 177/634 (27%), Positives = 280/634 (43%), Gaps = 83/634 (13%)

Query: 29  LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
           L  SEWA   R +    +  PG +  +  PY +E++D   P SP   V   KGAQ+G T 
Sbjct: 17  LKVSEWANEYRVLAPTAASEPGKWRTERTPYLKEIMDSLSPSSPTEKVVFMKGAQIGGTE 76

Query: 89  SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
           +   N I Y ID      M+      ++ KR       P++E +     +++      ++
Sbjct: 77  AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPC---LKSKVKDPRSR 131

Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
            SG T +  E+ GG V+   GA S   +RS PV +L  DE+  +P   G +GDP+ L+  
Sbjct: 132 DSGNTVQSKEFPGGIVVIT-GANSSVALRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSIA 190

Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQ 268
           RTN+F A RK+  +STP   G   I + FE  D+RY+ VPC HC  +Q L +        
Sbjct: 191 RTNTF-ARRKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW-------- 241

Query: 269 GRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSA 328
            ++ WE         +  YVC  C G++ N  K   +  G W PT +  + + + +HLS+
Sbjct: 242 SQIKWENSDS----RTAHYVCTECSGKIENHQKTEMLDRGEWRPTNR-VKGEKKGFHLSS 296

Query: 329 MYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYA 388
           +Y+P    SWQ+         +D  +A +    L+V+ N  LGE +  K      ++++ 
Sbjct: 297 LYSPVGWYSWQQAV-------EDFLHAKESEQLLKVWINTTLGETWVDKGEVPDWKQLF- 348

Query: 389 HRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVID 448
           +RR+ + +G VP    +  AG         VDVQ   L V  + W  S +      + ID
Sbjct: 349 NRREFFQIGTVPRREVVLTAG---------VDVQKDRLEVEVVAWGKSRE-----SWSID 394

Query: 449 YEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYC--NDW 506
           Y+  EGD  T  GE W KL+E+++       G  Y I+ + +DA   T  VY +   +  
Sbjct: 395 YQVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452

Query: 507 GENVIPIRG-------------RDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
              V+ ++G              D+ + G ++K        +G++   V V + K     
Sbjct: 453 SGRVMAVKGVNKALVPLSSPSRVDITVGGQKLK--------RGIKLWPVGVSILKSELFQ 504

Query: 554 TLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRN 613
            L      +G+   G  + P     +Y K+LT E   +          +Q W +    RN
Sbjct: 505 LLNILKEEEGKALPGYCHFP-EYAPEYFKQLTAE---QLVSKVVKGYTKQEWQKV-RERN 559

Query: 614 EMWDTLIYNTAIFESMVLEACEDILGLEALVWPE 647
           E+ D  IY           A    LG++   WPE
Sbjct: 560 EVLDCRIY---------ARAASIALGIDR--WPE 582
>dbj|BAA89621.1| (AB036665) similar to terminase large subunit of lambda
           [bacteriophage WO]
          Length = 610

 Score =  201 bits (510), Expect = 3e-050
 Identities = 175/629 (27%), Positives = 278/629 (43%), Gaps = 75/629 (11%)

Query: 29  LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
           L  SEWA   R +    +  PG +  +  PY +E++D   P SP   V   KGAQ+G T 
Sbjct: 17  LKVSEWANEYRVLAATAASEPGKWRTERTPYLKEIMDSLSPSSPAEKVIFMKGAQIGGTE 76

Query: 89  SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
           +   N I Y ID      M+      ++ KR       P++E +     +++      ++
Sbjct: 77  AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPC---LKSKVKDPRSR 131

Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
            SG T +  E+ GG V+   GA S   +RS PV +L  DE+  +P   G +GDP+ L+  
Sbjct: 132 DSGNTVQSKEFPGGIVVIT-GANSSVGLRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSIA 190

Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQ 268
           RTN+F A RK+  +STP   G   I + FE  D+RY+ VPC HC  +Q L +        
Sbjct: 191 RTNTF-ARRKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW-------- 241

Query: 269 GRLIWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSA 328
            ++ WE +     P++  Y+C  CG ++ N  K   +  G W  T +    + + +HLS+
Sbjct: 242 SQIKWEDKN----PNTAHYICIECGKKIENHQKTEMLDRGEWRAT-EAKEGEKKGFHLSS 296

Query: 329 MYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYA 388
           +Y+P     W     +W    +D  +A +    L+V+ N  LGE +  K      ++++ 
Sbjct: 297 LYSPV---GW----YSWTQAVEDFLHAKESEQLLKVWINTTLGETWVDKGEVPDWKQLF- 348

Query: 389 HRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVID 448
           +RR+ + +G VP    +  AG         VDVQ   L V  + W  S ++     + ID
Sbjct: 349 NRREFFPVGTVPRREVVLTAG---------VDVQEDRLEVEVVAWGKSREN-----WSID 394

Query: 449 YEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEYCNDWGE 508
           Y   EGD  T  GE W KL+E+++       G  Y I+ + +DA   T  VY +      
Sbjct: 395 YRVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452

Query: 509 NVIPIRGRDLPIKGAQIKYFNR----------QMSEKGVEYLSVTVDLYKDRWSPTLRKE 558
           +     GR + +KG  +   +           Q  ++G++   V V + K      L   
Sbjct: 453 S-----GRVMAVKGVALVPLSSPSRVDITVGGQKLKRGIKLWPVGVSILKSELFQLLNVL 507

Query: 559 WSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGSRNEMWDT 618
              +G+   G  + P     +Y K+LT E   +          +Q W +    RNE+ D 
Sbjct: 508 KEEEGKALPGYCHFP-EYAPEYFKQLTAE---QLVSKVVKGYTKQEWQKV-RERNEVLDC 562

Query: 619 LIYNTAIFESMVLEACEDILGLEALVWPE 647
            IY           A    LG++   WPE
Sbjct: 563 RIY---------ARAASIALGIDR--WPE 580
>ref|NP_297798.1| (NC_002488) phage-related terminase large subunit [Xylella
           fastidiosa 9a5c]
 pir||D82798 phage-related terminase large subunit XF0508 [imported] - Xylella
           fastidiosa (strain 9a5c)
 gb|AAF83318.1|AE003899_10 (AE003899) phage-related terminase large subunit [Xylella
           fastidiosa 9a5c]
          Length = 699

 Score =  115 bits (287), Expect = 2e-024
 Identities = 116/472 (24%), Positives = 183/472 (38%), Gaps = 49/472 (10%)

Query: 32  SEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVL 91
           S+WA+  R I +     PG +  D  P  RE+++C   ++PV  V+  K  Q+GAT  + 
Sbjct: 35  SQWADAYRKIARGSGAEPGQWRTDRHPPLREIMNCLSDHTPVQQVSFMKSGQIGAT-EIG 93

Query: 92  ENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSG 151
            N + Y ID    + ++   V D      +  F   +L+   L + +  N+  +      
Sbjct: 94  INWVCYVIDRGIDSMIVTQPVKDLARTWTVAKFDPGVLDMPPLLNKLTTNNTFE------ 147

Query: 152 ATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTN 211
                 +   GG L+   A S  ++R     +   DE+  +P+ + + G   +    R  
Sbjct: 148 ------KQYPGGTLFVKWANSSSQLRQITACYAFLDEIDEYPRNLNNQGTADQQIAARIM 201

Query: 212 SFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
           S     K+    TP  AG   I   F  GDQR++ + C HCG  Q L       +  G  
Sbjct: 202 SHGERGKIYRACTPTVAGGSAIEASFLDGDQRHYHIHCPHCGSEQILDL--EHLQPDGTF 259

Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGHWVPTAKPSRPDFRSYHLSAMYA 331
                G I        +  H    ++ E      G   W P    + PD RSY+L A YA
Sbjct: 260 ACAVNGCI--------IQEHHKNTILKERGTG--GTAFWHPHNPSAPPDHRSYYLWAAYA 309

Query: 332 PY-YARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYELKSNRVKPREVYAHR 390
           P     SW++IA  W +       A +D  +L  F N  LG  ++ + +     EV    
Sbjct: 310 PLGLGLSWKQIADKWAE-------AKRDPSKLPGFTNLILGLPFQGERDARNAHEVATLA 362

Query: 391 RDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAPSADHSGYVPYVIDYE 450
                 G VP    +  AG         VD+ H    V  I     A   G   Y++DY 
Sbjct: 363 EPGVYRGLVPRGGLVLAAG---------VDLAHDRAEVHLI-----ATGRGQRRYIVDYA 408

Query: 451 HVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASELTDVVYEY 502
            ++ D   +  E +  L   +      + G    I+ V ID    T+ V ++
Sbjct: 409 VIDLDPTVL--ESYADLDAYLRGTWKTASGIDMPISAVAIDGGNWTETVAQF 458
>ref|NP_298001.1| (NC_002488) phage-related terminase large subunit [Xylella
           fastidiosa 9a5c]
 ref|NP_299778.1| (NC_002488) phage-related terminase large subunit [Xylella
           fastidiosa 9a5c]
 pir||C82551 phage-related terminase large subunit XF2500 XF0711 [imported] -
           Xylella fastidiosa (strain 9a5c)
 gb|AAF83521.1|AE003914_2 (AE003914) phage-related terminase large subunit [Xylella
           fastidiosa 9a5c]
 gb|AAF85298.1|AE004057_11 (AE004057) phage-related terminase large subunit [Xylella
           fastidiosa 9a5c]
          Length = 580

 Score = 95.9 bits (237), Expect = 1e-018
 Identities = 142/561 (25%), Positives = 225/561 (39%), Gaps = 68/561 (12%)

Query: 76  VAVKKGAQVGATVSVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLA 135
           V +K GAQVG T+  L  +  Y I+    + M+      D+     T F   +  + GL 
Sbjct: 8   VTLKWGAQVGKTMLAL-CVQGYCIEMAPRSQMMLQPSQGDLQAWLETKFSPLIAANQGLQ 66

Query: 136 HLIQANDFSKGTKRSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQK 195
           HLI     +K   R G   + ++   GG L    + SP  MR    P ++ DE+ G+ + 
Sbjct: 67  HLI-----AKPRGRDGVNNQRMKSYPGGFLMFAWSGSPKTMRGRSAPLIVCDEIDGYERT 121

Query: 196 VGSDGDPMKLTETRTNSFSATRKVLDLSTPLPAGNDTITQRFEKGDQRYWEVPCKHCGGF 255
              +G P+ L   R  +F   R +L++STP   G   I   +  GDQR + V C  CG  
Sbjct: 122 --DEGHPVSLLWQRAATFGDERFLLEISTPTIEGASYIDDAYRAGDQRRFYVRCPACGYE 179

Query: 256 QRLYFRGNAKKGQGRLIWETEGGILV-----PDSVRYVCPHCGGEMINEDKVAFMGEGHW 310
           Q L +   +  G+     + +  +       P + RYVC  CG    +  ++A + +  W
Sbjct: 180 QTLEWEHVSWVGRQS---DPDADLAAIDAHQPHTARYVCQGCGVCWDDGQRIAAVRQARW 236

Query: 311 VPTAKPSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDL 370
              +KP      SY L+ +Y+ +  R    I Q ++D             +LQ F N  L
Sbjct: 237 -QASKPFN-GHASYELNELYSTF--RRQSAIVQDYLD--------KLKHQDLQTFTNVSL 284

Query: 371 GEAYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLT 430
              +   + +    ++   RR    L +VP        GG    +T  +D+Q   L V  
Sbjct: 285 ARVWSETAEQADIDDLL--RRLETYLADVP-------MGGVF--LTAGIDMQTDRLEVEI 333

Query: 431 IGWAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTR-QYESGGRLYDIARVG 489
           + W    +      + I    + GD   + G+ W  L   +ST  Q+ESG RL  I    
Sbjct: 334 VAWGIDEE-----SWSIHTAVLYGD--PLLGDVWDALDRYLSTTWQHESGLRL-SIQAAC 385

Query: 490 ID---ASELTDVVYEYCND-WGENVIPIR-----GRDLPIKGAQIKYFNRQMSEKGVEYL 540
           +D    S  T   Y+Y     G  +  I+     GR + +  AQ K+  R      +   
Sbjct: 386 LDTGGTSGYTQAAYQYLRTRTGRRLFGIKGVGGWGRPI-VDKAQRKHSGRNAPR--INLF 442

Query: 541 SVTVDLYKDRWSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKI 600
           +V VD  K      L     G      G  + P++    +  +LT E  R        + 
Sbjct: 443 TVGVDEAKLIVMRRLAITQPGP-----GYSHFPADRSPDWFAQLTAEKLRTR--YLKGQP 495

Query: 601 IRQVWHRPGGSRNEMWDTLIY 621
           +RQ W +P  + NE  D  +Y
Sbjct: 496 VRQ-WTKPDKTPNEALDCRVY 515
>ref|NP_461544.1| (NC_003197) Gifsy-1 prophage: similar to terminase large chain gp2
           of N15 [Salmonella typhimurium LT2]
 gb|AAL21503.1| (AE008818) Gifsy-1 prophage: similar to terminase large chain gp2
           of N15 [Salmonella typhimurium LT2]
          Length = 643

 Score = 92.8 bits (229), Expect = 1e-017
 Identities = 146/628 (23%), Positives = 246/628 (38%), Gaps = 90/628 (14%)

Query: 29  LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
           L   +WA+   ++P++ S  PG   ++  P+   +++    Y  +  V + K A+VG T 
Sbjct: 28  LTTVQWADEYYYLPKESSYTPG--KWETLPFQVAIMNAMG-YELIRVVNLIKSARVGYTK 84

Query: 89  SVLENIIAYGIDYVKTASMIFATVD---DDVTKRRLTNFI--LPMLEHSGLAHLIQANDF 143
            +L  +  Y I++    S++F   D   +D  K  +   I  +P+L       L  A  F
Sbjct: 85  MLL-GVEGYFIEHKSRNSLLFQPTDSSAEDFMKSHVEPTIRDVPVL-------LELAPWF 136

Query: 144 SKGTKRSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPM 203
            +  + +  T K   +  G   +  G  +    R   V  +  DE+S +   V  +G P 
Sbjct: 137 GRKHRDNTLTLK--RFSSGVGFWCLGGAAAKNYREKSVDVVCYDELSSFEPDVEKEGSPT 194

Query: 204 KLTETRTNSFSATRKVLDLSTPLPAGNDTITQRF-EKGDQRYWEVPCKHCGGFQRLYFRG 262
            L + R    S   K +  STP   G+  I +   E      + VPC HCG  Q L F G
Sbjct: 195 LLGDKRIEG-SVWPKSIRGSTPKVKGSCQIEKAANESAHFMRFYVPCPHCGEEQYLKF-G 252

Query: 263 NAKKGQGRLIWETEGGILVPDSVRYVCPHCG--------------------GEMINEDKV 302
           +     G L WE       P++V Y+C H G                    G    +   
Sbjct: 253 DGSTPFG-LKWEKSK----PETVYYLCEHNGCVIRQSELDQKAGRWICDNTGMWTRDGLA 307

Query: 303 AFMGEGHWVPTAKPSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDEL 362
            F   G  VP      P   ++H+   Y+P+   +W +I   W+D       A KD + +
Sbjct: 308 YFSASGEEVPP-----PRSITFHIWTAYSPF--TTWIQIIYDWLD-------ALKDPNGV 353

Query: 363 QVFYNNDLGEAYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQ 422
           + F N  LGE YE         E  A +     L E   ++A A     +  +T  +D Q
Sbjct: 354 KTFINTTLGEPYE---------EAVAEKLSHELLLEKVIHYA-APVPERVVYLTAGIDSQ 403

Query: 423 HTWLAVLTIGWAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRL 482
                +   GWAP     G   ++ID + + G  +  + +  +++  +I+ +   + G  
Sbjct: 404 RNRYEMYVWGWAP-----GEEAFLIDKQIIMG--RHDDEDTLQRVDAVINKKYRHADGTD 456

Query: 483 YDIARV-----GIDASELTDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKG 536
             I+R+     GIDA    ++VY+     G   V+P++G    + G  +    ++ ++ G
Sbjct: 457 ISISRICWDIGGIDA----EIVYKRSKKHGIFRVLPVKGAS--VYGKPVITMPKKRNQSG 510

Query: 537 VEYLSVTVDLYKDRWSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAA 596
           V    +  D  K+     +    +   E        P N +     E       E  +  
Sbjct: 511 VFLCEIGTDTAKEMLYARMGAVTAPADEATPYAIRFPDNPDVFTEVEAKQLVAEELVEKL 570

Query: 597 TNKIIRQVWHRPGGSRNEMWDTLIYNTA 624
            N   R +W    G RNE  D L+Y +A
Sbjct: 571 VNGKFRLLWDAK-GRRNEALDCLVYASA 597
>ref|NP_310206.1| (NC_002695) putative terminase large subunit [Escherichia coli
           O157:H7]
 dbj|BAB35602.1| (AP002557) putative terminase large subunit [Escherichia coli
           O157:H7]
          Length = 641

 Score = 92.4 bits (228), Expect = 1e-017
 Identities = 142/643 (22%), Positives = 251/643 (38%), Gaps = 69/643 (10%)

Query: 33  EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
           EWA+   ++P++ +   G   ++  P+ R +++       +  V V K A+VG +  +L 
Sbjct: 32  EWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87

Query: 93  NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
            + AY I++ +  ++I+   D D      T+    + +   L  L  A  + K  + +  
Sbjct: 88  GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLSL--APWYGKKHRDNTL 145

Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
           T K   +  G   +  G ++    R   V     DE++ + + +  +G P  L + R   
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDKRIEG 203

Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
            S   K +  STP   G   I +   +      + V C HCG  Q L F    K+    L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260

Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
            W  +     P SV Y+C H    +I + ++ F    +              W  ++  +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315

Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
              PD  ++H+   Y+P+   +W +I + W+           D  + + F N  LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366

Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
            K       EV A R++ Y+   VP+  A   AG         +D Q     +   GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-ASVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416

Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
                G   ++ID + + G  +  + +   ++ E I+       G    ++R+  D   +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTGGI 469

Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
              +VYE     G   VIPI+G    + G  +    R+ ++ GV    +  D  K++   
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527

Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
                  G   +P  +   N P   +    ++LT E + E+      KI+     R    
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583

Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
           RNE  D  +Y  A     +     D+  L A +  E  A   K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_287368.1| (NC_002655) putative DNA packaging protein of prophage CP-933X
           [Escherichia coli O157:H7 EDL933]
 ref|NP_309657.1| (NC_002695) terminase large subunit [Escherichia coli O157:H7]
 gb|AAG55980.1|AE005330_12 (AE005330) putative DNA packaging protein of prophage CP-933X
           [Escherichia coli O157:H7 EDL933]
 dbj|BAB35053.1| (AP002555) terminase large subunit [Escherichia coli O157:H7]
          Length = 641

 Score = 92.0 bits (227), Expect = 2e-017
 Identities = 142/643 (22%), Positives = 251/643 (38%), Gaps = 69/643 (10%)

Query: 33  EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
           EWA+   ++P++ +   G   ++  P+ R +++       +  V V K A+VG +  +L 
Sbjct: 32  EWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87

Query: 93  NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
            + AY I++ +  ++I+   D D      T+    + +   L  L  A  + K  + +  
Sbjct: 88  GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLAL--APWYGKKHRDNTL 145

Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
           T K   +  G   +  G ++    R   V     DE++ + + +  +G P  L + R   
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDKRIEG 203

Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
            S   K +  STP   G   I +   +      + V C HCG  Q L F    K+    L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260

Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
            W  +     P SV Y+C H    +I + ++ F    +              W  ++  +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315

Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
              PD  ++H+   Y+P+   +W +I + W+           D  + + F N  LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366

Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
            K       EV A R++ Y+   VP+  A   AG         +D Q     +   GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-ASVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416

Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
                G   ++ID + + G  +  + +   ++ E I+       G    ++R+  D   +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTGGI 469

Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
              +VYE     G   VIPI+G    + G  +    R+ ++ GV    +  D  K++   
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527

Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
                  G   +P  +   N P   +    ++LT E + E+      KI+     R    
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583

Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
           RNE  D  +Y  A     +     D+  L A +  E  A   K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_040581.1| (NC_001416) A (DNA packaging;641) [bacteriophage lambda]
 sp|P03708|TERL_LAMBD TERMINASE LARGE SUBUNIT (DNA PACKAGING PROTEIN A) (GPA)
 pir||JVBPAL DNA-packaging protein A - phage lambda
 gb|AAA96534.1| (J02459) A (DNA packaging;641) [bacteriophage lambda]
          Length = 641

 Score = 91.3 bits (225), Expect = 3e-017
 Identities = 143/643 (22%), Positives = 250/643 (38%), Gaps = 69/643 (10%)

Query: 33  EWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVLE 92
           EWA+   ++P++ +   G   ++  P+ R +++       +  V V K A+VG +  +L 
Sbjct: 32  EWADANYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKMLL- 87

Query: 93  NIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTKRSGA 152
            + AY I++ +  ++I+   D D      T+    + +   L  L  A  + K  + +  
Sbjct: 88  GVYAYFIEHKQRNTLIWLPTDGDAENFMKTHVEPTIRDIPSLLAL--APWYGKKHRDNTL 145

Query: 153 TAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTETRTNS 212
           T K   +  G   +  G ++    R   V     DE++ +   +  +G P  L + R   
Sbjct: 146 TMK--RFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEQEGSPTFLGDKRIEG 203

Query: 213 FSATRKVLDLSTPLPAGNDTITQRFEKGDQ-RYWEVPCKHCGGFQRLYFRGNAKKGQGRL 271
            S   K +  STP   G   I +   +      + V C HCG  Q L F    K+    L
Sbjct: 204 -SVWPKSIRGSTPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF--GDKETPFGL 260

Query: 272 IWETEGGILVPDSVRYVCPHCGGEMINEDKVAFMGEGH--------------WVPTA--K 315
            W  +     P SV Y+C H    +I + ++ F    +              W  ++  +
Sbjct: 261 KWTPDD----PSSVFYLCEH-NACVIRQQELDFTDARYICEKTGIWTRDGILWFSSSGEE 315

Query: 316 PSRPDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGEAYE 375
              PD  ++H+   Y+P+   +W +I + W+           D  + + F N  LGE +E
Sbjct: 316 IEPPDSVTFHIWTAYSPF--TTWVQIVKDWM-------KTKGDTGKRKTFVNTTLGETWE 366

Query: 376 LKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIGWAP 435
            K       EV A R++ Y+   VP+  A   AG         +D Q     +   GW P
Sbjct: 367 AKIGERPDAEVMAERKEHYS-APVPDRVAYLTAG---------IDSQLDRYEMRVWGWGP 416

Query: 436 SADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDASEL 495
                G   ++ID + + G  +  + +   ++ E I+       G    I+R+  D   +
Sbjct: 417 -----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTGGI 469

Query: 496 -TDVVYEYCNDWG-ENVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDRWSP 553
              +VYE     G   VIPI+G    + G  +    R+ ++ GV    +  D  K++   
Sbjct: 470 DPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQIYN 527

Query: 554 TLRKEWSGQGEMPRGMF--NAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGGS 611
                  G   +P  +   N P   +    ++LT E + E+      KI+     R    
Sbjct: 528 RFTLTPEGDEPLPGAVHFPNNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR---- 583

Query: 612 RNEMWDTLIYNTAIFESMVLEACEDILGLEALVWPEFWAIAGK 654
           RNE  D  +Y  A     +     D+  L A +  E  A   K
Sbjct: 584 RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>sp|P36693|TERL_BPP21 TERMINASE LARGE SUBUNIT (GP2)
 pir||B49849 terminase large chain gp2 - phage 21
 gb|AAA32340.1| (M81255) terminase large subunit [Bacteriophage 21]
          Length = 642

 Score = 89.0 bits (219), Expect = 1e-016
 Identities = 138/611 (22%), Positives = 238/611 (38%), Gaps = 62/611 (10%)

Query: 29  LAPSEWAERKRHIPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
           L   +WA +  ++P++ S  PG   ++  P+   +++C      +  V + K A+VG T 
Sbjct: 28  LTTVQWANKHYYLPKESSYTPG--RWETLPFQVGIMNCMGN-DLIRTVNLIKSARVGYTK 84

Query: 89  SVLENIIAYGIDYVKTASMIFATVDDDVTKRRLTNFILPMLEHSGLAHLIQANDFSKGTK 148
            +L  + AY I++    S++F   D    +  + + + P +     A L  A  F +  +
Sbjct: 85  MLL-GVEAYFIEHKSRNSLLFQPTDS-AAEDFMKSHVEPTIRDVP-AMLELAPWFGRKHR 141

Query: 149 RSGATAKGLEWVGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQKVGSDGDPMKLTET 208
            +  T K   +  G   +  G  +    R   V  +  DE+S +   V  +G P  L + 
Sbjct: 142 DNTLTLK--RFSSGVGFWCLGGAAAKNYREKSVDVVCYDELSSFEPDVEKEGSPTLLGDK 199

Query: 209 RTNSFSATRKVLDLSTPLPAGNDTITQRF-EKGDQRYWEVPCKHCGGFQRLYFRGNAKKG 267
           R    S   K +  STP   G+  I +   E      + VPC HCG  Q L F  +A   
Sbjct: 200 RIEG-SVWPKSIRGSTPKIKGSCQIEKAANESAHFMRFYVPCPHCGEEQYLKFGDDASPF 258

Query: 268 QGRLIWETEGGILVPDSVRYVCPHCG------------GEMINEDKVAFMGEGHWVPTAK 315
              L WE       P+SV Y+C H G            G  I E+   +  +G    +A+
Sbjct: 259 G--LKWEKNK----PESVFYLCEHHGCVIHQSELDQSNGRWICENTGMWTRDGLMFFSAR 312

Query: 316 PSR---PDFRSYHLSAMYAPYYARSWQEIAQAWIDCWDDDRNAAKDLDELQVFYNNDLGE 372
                 P   ++H+   Y+P+   +W +I   W+D       A KD + L+ F N  LGE
Sbjct: 313 GDEIPPPRSITFHIWTAYSPF--TTWVQIVYDWLD-------ALKDPNGLKTFVNTTLGE 363

Query: 373 AYELKSNRVKPREVYAHRRDAYNLGEVPNNHAMAHAGGPIEAITMAVDVQHTWLAVLTIG 432
            +E         +V   +   Y           A     +  +T  +D Q     +   G
Sbjct: 364 TWEEAVGEKLDHQVLMDKVVRYT----------AAVPARVVYLTAGIDSQRNRFEMYVWG 413

Query: 433 WAPSADHSGYVPYVIDYEHVEGDCKTIEGEGWKKLTEIISTRQYESGGRLYDIARVGIDA 492
           WAP     G   +++D   + G  +  E E   ++   I+ +   + G    I+RV  D 
Sbjct: 414 WAP-----GEEAFLVDKIIIMG--RPDEEETLLRVDAAINKKYRHADGTEMTISRVCWDI 466

Query: 493 SELT-DVVYEYCNDWGE-NVIPIRGRDLPIKGAQIKYFNRQMSEKGVEYLSVTVDLYKDR 550
             +  ++VY+     G   V+P++G    + G  +    +  +++GV    V  D  K+ 
Sbjct: 467 GGIDGEIVYQRSKKHGVFRVLPVKGAS--VYGKPVITMPKTRNQRGVYLCEVGTDTAKEI 524

Query: 551 WSPTLRKEWSGQGEMPRGMFNAPSNIESKYLKELTVEYKREERDAATNKIIRQVWHRPGG 610
               ++ + +   E        P + E     E       E  +      +R +W     
Sbjct: 525 LYARMKADPTPVDEATSYAIRFPDDPEIFSQTEAQQLVAEELVEKWEKGKMRLLWDNK-K 583

Query: 611 SRNEMWDTLIY 621
            RNE  D L+Y
Sbjct: 584 RRNEALDCLVY 594
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.136 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 412,883,444 Number of Sequences: 884779 Number of extensions: 18875352 Number of successful extensions: 42466 Number of sequences better than 5.0e-02: 21 Number of HSP's better than 0.1 without gapping: 4 Number of HSP's successfully gapped in prelim test: 17 Number of HSP's that attempted gapping in prelim test: 42391 Number of HSP's gapped (non-prelim): 28 length of query: 675 length of database: 277,083,050 effective HSP length: 127 effective length of query: 548 effective length of database: 164,716,117 effective search space: 90264432116 effective search space used: 90264432116 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 94 (40.8 bits) Query= orf3 [900-993] (92 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.320 0.136 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 45,264,236 Number of Sequences: 884779 Number of extensions: 1464462 Number of successful extensions: 3295 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3295 Number of HSP's gapped (non-prelim): 0 length of query: 92 length of database: 277,083,050 effective HSP length: 68 effective length of query: 24 effective length of database: 216,918,078 effective search space: 5206033872 effective search space used: 5206033872 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 84 (37.0 bits) Query= orf4 [1015-1245] (229 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.136 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 131,181,179 Number of Sequences: 884779 Number of extensions: 5289705 Number of successful extensions: 9328 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 9328 Number of HSP's gapped (non-prelim): 0 length of query: 229 length of database: 277,083,050 effective HSP length: 116 effective length of query: 113 effective length of database: 174,448,686 effective search space: 19712701518 effective search space used: 19712701518 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 89 (38.9 bits) Query= orf5 [1245-1561] (315 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_459888.1|  (NC_003197) Fels-1 prophage [Salmonella ty...    72  9e-012
ref|NP_460007.1|  (NC_003197) Gifsy-2 prophage [Salmonella t...    70  2e-011
ref|NP_518978.1|  (NC_003295) PROBABLE BACTERIOPHAGE-RELATED...    68  1e-010
sp|P36272|VG04_BPP21  PORTAL PROTEIN (HEAD PROTEIN GP4) >gi|...    64  2e-009
emb|CAC83156.1|  (AJ304858) putative portal protein [Escheri...    63  3e-009
ref|NP_288458.1|  (NC_002655) putative head-tail preconnecto...    63  3e-009
ref|NP_287794.1|  (NC_002655) putative capsid protein of pro...    62  9e-009
ref|NP_461542.1|  (NC_003197) Gifsy-1 prophage: similar to h...    59  5e-008
ref|NP_297800.1|  (NC_002488) conserved hypothetical protein...    45  0.001
>ref|NP_459888.1| (NC_003197) Fels-1 prophage [Salmonella typhimurium LT2]
 gb|AAL19847.1| (AE008738) Fels-1 prophage [Salmonella typhimurium LT2]
          Length = 511

 Score = 71.6 bits (174), Expect = 9e-012
 Identities = 54/189 (28%), Positives = 82/189 (42%), Gaps = 24/189 (12%)

Query: 7   KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSG 66
           +R  +ARG  LL  V+  + ++ +  D+ +  A + A +  FI++Q              
Sbjct: 233 RRLNQARGASLLAPVIIRLMDLKEYEDSERIAARIAASLGMFIKKQ-------------- 278

Query: 67  AQRRVQDQDFVGDGTVRQTNIAETPF--GMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAI 124
                   D   DG V      ET    GM+ +GL  GE+I    +D  +     F    
Sbjct: 279 --------DVGTDGYVAPEKRKETQIQPGMLFDGLNPGEDIGMIKSDRPNAGLESFRMGQ 330

Query: 125 LRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTM 184
           LRAVA  L   FS +A  +D +YSA R E+ E +     ++D       RP+YR WL   
Sbjct: 331 LRAVAAGLRGSFSSIARNYDGTYSAQRQELVEAQEGYSILQDSFIAAFTRPLYRRWLAAA 390

Query: 185 VLTRRIDAP 193
           V +  I+ P
Sbjct: 391 VASGAIEVP 399
>ref|NP_460007.1| (NC_003197) Gifsy-2 prophage [Salmonella typhimurium LT2]
 gb|AAL19966.1| (AE008744) Gifsy-2 prophage [Salmonella typhimurium LT2]
          Length = 437

 Score = 70.5 bits (171), Expect = 2e-011
 Identities = 69/266 (25%), Positives = 110/266 (40%), Gaps = 29/266 (10%)

Query: 7   KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSG 66
           +R  +ARG  LL  V+  + ++ +  D+ +  A + A    FI+R               
Sbjct: 155 RRLNQARGVTLLAPVIIRLLDLKEYEDSERLAARISAAFAMFIRR--------------- 199

Query: 67  AQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 126
           +   VQD D   D   +  ++   P G +++ L  GE+I    +D  +     F    LR
Sbjct: 200 SDAMVQDGD-APDYADKDRDLDIEP-GTILKDLLPGEDIGTIKSDRPNANLESFRMGQLR 257

Query: 127 AVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTMVL 186
           AVA  +   FS +A  +D +YSA R E+ E +     ++D     + RP+YR WL T + 
Sbjct: 258 AVAAGVRGSFSSIARNYDGTYSAQRQELVEAQEGYAILQDNFIAAVSRPVYRRWLATAIT 317

Query: 187 TRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTRTK 246
              ID P               A    + + G V   +D  KE       I+ GA T + 
Sbjct: 318 AGVIDVP----------TDTDMATLFNAVYSGPVMPWIDPLKEANAWRILIRGGAATESD 367

Query: 247 AARMLTGTSFKTNTRQLAK--ENQLL 270
             R   G   +   R+ A+  EN+ L
Sbjct: 368 WVRARGGAPAEVKRRRKAEIDENRKL 393
>ref|NP_518978.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD14559.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 508

 Score = 67.8 bits (164), Expect = 1e-010
 Identities = 66/266 (24%), Positives = 110/266 (40%), Gaps = 33/266 (12%)

Query: 8   RHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSGA 67
           R  + RGEP L+  +  + E+D+  DA   +    A   GF+ RQ               
Sbjct: 232 RPGQIRGEPWLSRALVKLNELDQYDDAELVRKKTAAMFAGFVTRQ--------------- 276

Query: 68  QRRVQDQDFVGDGTVRQTNIAETPFGM-MVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 126
                + + +G+G   +  I+        ++ L  GE+IK          +GEF     R
Sbjct: 277 ---SPEDNLMGEGLPDEAGISLVGLEPGTLQILEPGEDIKFSDPADVGGSYGEFLRTQFR 333

Query: 127 AVAWALGMPFSVLAMQFDS-SYSASRGEIKEF----ESVIHEMRDRDADTLLRPIYRSWL 181
           AVA ALG+ +  L       +YS+ R  + EF    E V H +       + RP++ +W+
Sbjct: 334 AVAAALGITYEQLTGDLTGVNYSSIRAGLLEFRRRCEMVQHSVL---VHQMCRPVWAAWM 390

Query: 182 RTMVLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGA 241
           +  VL+  + APGF       +       W    W     + VD EKE +     I+ G 
Sbjct: 391 KQAVLSGALVAPGFARGGAARRRQYLQVKWIPQGW-----QWVDPEKEFKAMLLAIRAGL 445

Query: 242 LTRTKAARMLTGTSFKTNTRQLAKEN 267
           ++R++A     G   +   R++A +N
Sbjct: 446 MSRSEAISTF-GYDAEDIDREIAADN 470
>sp|P36272|VG04_BPP21 PORTAL PROTEIN (HEAD PROTEIN GP4)
 pir||JN0538 head protein gp4 - phage 21
 gb|AAA32342.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
          Length = 530

 Score = 63.9 bits (154), Expect = 2e-009
 Identities = 61/263 (23%), Positives = 113/263 (42%), Gaps = 16/263 (6%)

Query: 11  EARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSGAQRR 70
           + RG      VM+ +  +D ++    + A + A     I+ ++   T     + +GA + 
Sbjct: 241 QTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESELD--TEKAFEYIAGAPQE 298

Query: 71  VQDQDFVGD----GTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 126
            +D   +       +   TN   T  G+ +  L  G+++K  T   +D  F   E+A+LR
Sbjct: 299 QKDNPLINILEKFSSWYDTNNV-TLGGVKIPHLFPGDDLKLQTAQDSDNGFSALEQALLR 357

Query: 127 AVAWALGMPFSVLAMQFDS-SYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTMV 185
            +A  LG+ +  L+  +   SYS++R    E        R   A  L   ++  WL   +
Sbjct: 358 YIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARLATQMFSCWLEEAL 417

Query: 186 LTRRIDAPGFLAAARNPKEF-PTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTR 244
           L       G +   R   +F    +AW+R++W G  + A+D  KE +    +I+ G  T 
Sbjct: 418 LR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQESVMRIEAGLSTY 471

Query: 245 TKAARMLTGTSFKTNTRQLAKEN 267
            K   ++ G  ++   RQ  +E+
Sbjct: 472 EKGLALM-GEDYQDIFRQQVRES 493
>emb|CAC83156.1| (AJ304858) putative portal protein [Escherichia coli]
          Length = 500

 Score = 63.2 bits (152), Expect = 3e-009
 Identities = 65/245 (26%), Positives = 104/245 (41%), Gaps = 33/245 (13%)

Query: 7   KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGG--KTPGTKPFT 64
           +R  + RG  +L+ V+  I+ + +  DA    A + A +  +I++  G   + PG K   
Sbjct: 231 RRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYIRKGDGQDYEDPGIKE-- 288

Query: 65  SGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAI 124
                           T R+ +I  TP G++ + L +GE+I    +D  +     F    
Sbjct: 289 ----------------TEREVHI--TP-GIIYDDLRKGEDIGMVKSDRPNPNLETFRNGQ 329

Query: 125 LRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTM 184
           LRAVA    + FS  A  ++ +YSA R E+ E       ++D     + RP+YR+WL  +
Sbjct: 330 LRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDCFIGAVTRPVYRTWLNMV 389

Query: 185 VLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTR 244
           V    +  P  +       E  T    T S   G V   +D  KE      QI+ GA T 
Sbjct: 390 VAAGLLKIPADV-------EMKTLYNATYS---GPVMPWIDPVKEAEAWRIQIRGGAATE 439

Query: 245 TKAAR 249
           +   R
Sbjct: 440 SDWVR 444
>ref|NP_288458.1| (NC_002655) putative head-tail preconnector protein of prophage
           CP-933U [Escherichia coli O157:H7 EDL933]
 ref|NP_310988.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
 dbj|BAB19579.1| (AP000400) portal protein [Escherichia coli O157:H7]
 gb|AAG57012.1|AE005420_17 (AE005420) putative head-tail preconnector protein of prophage
           CP-933U [Escherichia coli O157:H7 EDL933]
 dbj|BAB36384.1| (AP002560) putative portal protein [Escherichia coli O157:H7]
          Length = 500

 Score = 63.2 bits (152), Expect = 3e-009
 Identities = 65/245 (26%), Positives = 104/245 (41%), Gaps = 33/245 (13%)

Query: 7   KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGG--KTPGTKPFT 64
           +R  + RG  +L+ V+  I+ + +  DA    A + A +  +I++  G   + PG K   
Sbjct: 231 RRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYIRKGDGQDYEDPGIKE-- 288

Query: 65  SGAQRRVQDQDFVGDGTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAI 124
                           T R+ +I  TP G++ + L +GE+I    +D  +     F    
Sbjct: 289 ----------------TEREVHI--TP-GIIYDDLRKGEDIGMVKSDRPNPNLETFRNGQ 329

Query: 125 LRAVAWALGMPFSVLAMQFDSSYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTM 184
           LRAVA    + FS  A  ++ +YSA R E+ E       ++D     + RP+YR+WL  +
Sbjct: 330 LRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDCFIGAVTRPVYRTWLNMV 389

Query: 185 VLTRRIDAPGFLAAARNPKEFPTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTR 244
           V    +  P  +       E  T    T S   G V   +D  KE      QI+ GA T 
Sbjct: 390 VAAGLLKIPADV-------EMKTLYNATYS---GPVMPWIDPVKEAEAWRIQIRGGAATE 439

Query: 245 TKAAR 249
           +   R
Sbjct: 440 SDWVR 444
>ref|NP_287794.1| (NC_002655) putative capsid protein of prophage CP-933R
           [Escherichia coli O157:H7 EDL933]
 ref|NP_309134.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
 ref|NP_310760.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
 gb|AAG56408.1|AE005368_14 (AE005368) putative capsid protein of prophage CP-933R [Escherichia
           coli O157:H7 EDL933]
 dbj|BAB34530.1| (AP002554) putative portal protein [Escherichia coli O157:H7]
 dbj|BAB36156.1| (AP002559) putative portal protein [Escherichia coli O157:H7]
          Length = 530

 Score = 61.6 bits (148), Expect = 9e-009
 Identities = 61/263 (23%), Positives = 112/263 (42%), Gaps = 16/263 (6%)

Query: 11  EARGEPLLTLVMQNIAEIDKMRDATQRKATLGAQIVGFIQRQIGGKTPGTKPFTSGAQRR 70
           + RG      VM+ +  +D ++    + A + A     I+  +   T     + +GA + 
Sbjct: 241 QTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESDLD--TEKAFEYIAGAPQG 298

Query: 71  VQDQDFVGD----GTVRQTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 126
            +D   +       +   TN   T  G+ +  L  G+++K  T   +D  F   E+A+LR
Sbjct: 299 QKDNPLINILEKFSSWYDTNNV-TLGGVKIPHLFPGDDLKLQTAQDSDNGFSALEQALLR 357

Query: 127 AVAWALGMPFSVLAMQFDS-SYSASRGEIKEFESVIHEMRDRDADTLLRPIYRSWLRTMV 185
            +A  LG+ +  L+  +   SYS++R    E        R   A  L   ++  WL   +
Sbjct: 358 YIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARLATQMFSYWLEEAL 417

Query: 186 LTRRIDAPGFLAAARNPKEF-PTFAAWTRSQWFGMVKEAVDLEKETRGRGEQIKLGALTR 244
           L       G +   R   +F    +AW+R++W G  + A+D  KE +    +I+ G  T 
Sbjct: 418 LR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQESVMRIEAGLSTY 471

Query: 245 TKAARMLTGTSFKTNTRQLAKEN 267
            K   ++ G  ++   RQ  +E+
Sbjct: 472 EKEPALM-GEDYQDIFRQQVRES 493
>ref|NP_461542.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp4
           of phage 21 [Salmonella typhimurium LT2]
 gb|AAL21501.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp4
           of phage 21 [Salmonella typhimurium LT2]
          Length = 526

 Score = 59.3 bits (142), Expect = 5e-008
 Identities = 45/177 (25%), Positives = 81/177 (45%), Gaps = 7/177 (3%)

Query: 93  GMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDS-SYSASR 151
           G+ +  L  G+E+   T   +D  F   E+A+LR +A  LG+ +  L+  +   SYS++R
Sbjct: 322 GVKIPHLYPGDELNLQTAQDSDNGFSALEQALLRYIAAGLGVSYEQLSRDYSQVSYSSAR 381

Query: 152 GEIKEFESVIHEMRDRDADTLLRPIYRSWLRTMVLTRRIDAPGFLAAARNPKEFPTFAAW 211
               E        R   A  L   ++  WL   ++   I AP    +    +     ++W
Sbjct: 382 ASANESWRYFLGRRRFIAGRLATQMFSCWLEEALIRGVIRAPRARFSFWEAR-----SSW 436

Query: 212 TRSQWFGMVKEAVDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKENQ 268
           +RS+W G  + A+D  KE +    +I+ G  T  K   ++ G  ++   RQ  +E++
Sbjct: 437 SRSEWIGAGRMAIDGLKEVQESVMRIEAGLSTYEKELAIM-GEDYQEIFRQQVRESE 492
>ref|NP_297800.1| (NC_002488) conserved hypothetical protein [Xylella fastidiosa
           9a5c]
 pir||F82798 conserved hypothetical protein XF0510 [imported] - Xylella
           fastidiosa (strain 9a5c)
 gb|AAF83320.1|AE003899_12 (AE003899) conserved hypothetical protein [Xylella fastidiosa 9a5c]
          Length = 520

 Score = 44.7 bits (104), Expect = 0.001
 Identities = 41/181 (22%), Positives = 70/181 (38%), Gaps = 15/181 (8%)

Query: 94  MMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILRAVAWALGMPFSVLAMQFDSSYSASRGE 153
           M+ +GL  GE I    +   +     + +  LRA+A  + + FS LA  ++ +YSA R E
Sbjct: 318 MIFDGLNPGERIDPIDSTRPNPNLESYRDGQLRAIAAPMRISFSSLAKNYNGTYSAQRQE 377

Query: 154 IKEFESVIHEMRDRDADTLLRPIYRSWLRTMVLTRRIDAPGF--LAAARNPKEFPTFAAW 211
           + E       +       ++RP+Y   +   V +  +  P    LA A      P    W
Sbjct: 378 LVEQYGAYGVLAHEFISQMVRPLYERLIAIAVTSGALTLPPNVPLANALGADYLPPSMPW 437

Query: 212 TRSQWFGMVKEAVDLEKETRGRGEQIKLGALTRTKAARMLTGTSFKTNTRQLAKENQLLA 271
                       +D  KE      Q++ G  + +       G  + T   QL+ + Q  A
Sbjct: 438 ------------IDPLKEINALTAQVQAGVRSLSSVIAERGGRMYDT-VEQLSIDKQWAA 484

Query: 272 D 272
           +
Sbjct: 485 E 485
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.134 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 166,160,910 Number of Sequences: 884779 Number of extensions: 6555490 Number of successful extensions: 14605 Number of sequences better than 5.0e-02: 15 Number of HSP's better than 0.1 without gapping: 6 Number of HSP's successfully gapped in prelim test: 9 Number of HSP's that attempted gapping in prelim test: 14588 Number of HSP's gapped (non-prelim): 15 length of query: 315 length of database: 277,083,050 effective HSP length: 120 effective length of query: 195 effective length of database: 170,909,570 effective search space: 33327366150 effective search space used: 33327366150 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 91 (39.7 bits) Query= orf6 [1555-1951] (395 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

gb|AAB65791.1|  (AF009900) Z2Z3 protein [Vibrio shilonii]          93  5e-018
dbj|BAA89626.1|  (AB036665) similar to GPC of lambda [bacter...    64  3e-009
ref|NP_536361.1|  (NC_003309) gp5 [Bacteriophage phiE125] >g...    57  3e-007
dbj|BAA89643.1|  (AB036666) similar to minor capsid protein ...    56  7e-007
ref|NP_461541.1|  (NC_003197) Gifsy-1 prophage: similar to h...    52  1e-005
pir||T03571  probable peptidase - Rhodobacter capsulatus >gi...    51  2e-005
gb|AAC24140.1|  (AF071201) unknown [bacteriophage Felix 01]        49  1e-004
sp|P36273|VG05_BPP21  HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CO...    48  1e-004
ref|NP_287793.1|  (NC_002655) putative capsid assembly prote...    47  3e-004
ref|NP_287588.1|  (NC_002655) putative head-tail preconnecto...    47  3e-004
ref|NP_470963.1|  (NC_003212) similar to proteases [Listeria...    43  0.005
ref|NP_540710.1|  (NC_003317) PUTATIVE PROTEASE IV [Brucella...    43  0.006
ref|NP_110548.1|  (NC_002689) Secreted serine protease (ClpP...    42  0.014
>gb|AAB65791.1| (AF009900) Z2Z3 protein [Vibrio shilonii]
          Length = 330

 Score = 92.8 bits (229), Expect = 5e-018
 Identities = 67/202 (33%), Positives = 101/202 (49%), Gaps = 10/202 (4%)

Query: 55  AQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAA-INAAESDDSVKNIEYYIDSPGGEAEG 113
           A I++ GV +   + F  + GG +T   +V A   NAA ++ SVK +   +DSPGGEA G
Sbjct: 40  AIIHVNGVISRYANLFHAVCGGVST---EVLAKEFNAALNESSVKAVILNVDSPGGEASG 96

Query: 114 AIKLGDMIRGT--KKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLS-MRRP 170
             +L +MI  +  KKP                      V     +R G+IG V+S ++RP
Sbjct: 97  IHELSEMIHASRGKKPVRAYVGGDGCSAAYWITTACDRVTMDATARVGSIGTVVSFVKRP 156

Query: 171 SQ--SSYIDVTSTDAPNKRPDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADVNQ 228
               +   +  S+ +PNKR DPE+E+G+  I+ Q LD +  +F   VA   G T   V  
Sbjct: 157 DAEGAKRFEFVSSQSPNKRLDPESEQGQTAIQTQ-LDAMAEVFISRVARNMGVTSDKVKS 215

Query: 229 NFGRGGSLFAETALKMGMIDEI 250
           +FG+GG    + A+  GM  E+
Sbjct: 216 DFGQGGVKIGQQAVDAGMAHEL 237
>dbj|BAA89626.1| (AB036665) similar to GPC of lambda [bacteriophage WO]
          Length = 353

 Score = 63.5 bits (153), Expect = 3e-009
 Identities = 57/208 (27%), Positives = 87/208 (41%), Gaps = 17/208 (8%)

Query: 54  TAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEG 113
           TA I + G+ T  P  F    G   T Y  +   I  A S+  ++ I   IDSPGGE  G
Sbjct: 47  TAVIAINGILTKKPGAFDDFLG--MTSYEKIQEEIEEALSNKDIETILLDIDSPGGEVNG 104

Query: 114 AIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLS---M 167
              L D I   R  K+                    +K V  SR S  G+IG + S    
Sbjct: 105 IFDLADFIYESRAKKRIIAIANDDAYSAAYAIASSAEK-VFVSRTSGVGSIGVIASHIDQ 163

Query: 168 RRPSQSSYIDVTSTDAPNKRPD-----PETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
            R  +   I  T+  A +++ D     P T E    ++++ +  L+ MF Q +A  RG +
Sbjct: 164 SRFDEKQGIKYTTIFAGSRKNDLNPHEPMTSESLESLQKE-VGRLYEMFLQLIARNRGLS 222

Query: 223 LADVNQNFGRGGSLFAETALKMGMIDEI 250
           +  +       G  F E A+++G+ D +
Sbjct: 223 IEKIRST--EAGLYFGEKAVEIGLADGV 248
>ref|NP_536361.1| (NC_003309) gp5 [Bacteriophage phiE125]
 gb|AAL40278.1|AF447491_5 (AF447491) gp5 [Bacteriophage phiE125]
          Length = 301

 Score = 57.0 bits (136), Expect = 3e-007
 Identities = 51/184 (27%), Positives = 79/184 (42%), Gaps = 17/184 (9%)

Query: 79  TLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXXXX 136
           T Y  + +A+N A +D +V++I   IDS GG A GA +L D IR     KP         
Sbjct: 80  TSYEGLRSAVNQAVADPAVEHIVLDIDSNGGSATGAFELADDIRAASLVKPITAIVNFSA 139

Query: 137 XXXXXXXXXXDKIVATSRVSRAGNIGAV---LSMRRPSQSSYIDVTSTDAPNKRPDPETE 193
                        V  SR S  G+IG +   L + +  +   I VTS  A + + D    
Sbjct: 140 FSGGYLIAAAASKVIVSRTSGVGSIGVIANHLDVSKRDEQQGIKVTSVFAGDHKNDLTPH 199

Query: 194 EGRRVIREQALDPLHNM-------FAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGM 246
           E    + +Q+L  L +M       F  A+A+ RG +   V     + G  F +  ++ G+
Sbjct: 200 E---PLSDQSLTFLTSMVQNSYKQFVDAIANFRGLSTQAVKDT--QAGIFFGQKGVEAGL 254

Query: 247 IDEI 250
            D +
Sbjct: 255 ADSV 258
>dbj|BAA89643.1| (AB036666) similar to minor capsid protein GPC of phage lambda
           [Wolbachia sp. wKue]
          Length = 350

 Score = 55.8 bits (133), Expect = 7e-007
 Identities = 54/208 (25%), Positives = 86/208 (40%), Gaps = 17/208 (8%)

Query: 54  TAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEG 113
           TA I + G+ T  P+ F    G   T Y  +   I  A  D+ V+ I   IDSPGGE  G
Sbjct: 49  TAIIPIHGILTKKPEAFDDFLG--MTSYEKMQEEIEEALEDEKVEIILLDIDSPGGEVNG 106

Query: 114 AIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLSMRRP 170
              L D I   R  K+                    +KI  T R S  G+IG + S    
Sbjct: 107 IFDLADFIYNARSKKRIIAIANDDAYSAAYAIASSAEKIFLT-RTSGVGSIGVIASHIDQ 165

Query: 171 S---QSSYIDVTSTDAPNKRPD-----PETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
           S   +   I  ++  A +++ D     P T E    ++ + ++ L+ M  + +A  R  +
Sbjct: 166 SGFDEKQGIKYSTIFAGSRKNDLNPHEPITSESLENLKSE-VNRLYGMLVELIARNRSLS 224

Query: 223 LADVNQNFGRGGSLFAETALKMGMIDEI 250
           +  +       G  F E A+++G+ D +
Sbjct: 225 IEAIKST--EAGLYFGEKAVEIGLADGV 250
>ref|NP_461541.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp5
           of phage 21 [Salmonella typhimurium LT2]
 gb|AAL21500.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp5
           of phage 21 [Salmonella typhimurium LT2]
          Length = 498

 Score = 51.6 bits (122), Expect = 1e-005
 Identities = 51/183 (27%), Positives = 70/183 (37%), Gaps = 11/183 (6%)

Query: 77  GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMI--RGTKKPXXXXXXX 134
           G T Y  V A +  A SD  VK I   IDSPGG+A GA    DMI     +KP       
Sbjct: 99  GMTGYDGVTARLQQAVSDPEVKGILLDIDSPGGQAAGAFDCADMIYRMREQKPVWALANE 158

Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVT----STDAPNKR 187
                             ++ SR G+IG V+   S     +   ID+T         +  
Sbjct: 159 TACSAAMLLAAACSHRLVTQTSRMGSIGVVMAHTSYAEKLKQEGIDITLIYSGAHKADLT 218

Query: 188 PDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADV--NQNFGRGGSLFAETALKMG 245
           P  +  E      +Q +D    MFA+ VA   G ++  V   +     G     T L  G
Sbjct: 219 PSQKLPESVYADYQQRMDEARKMFAEKVARYTGLSVDAVMATEAAVYDGQAIITTGLADG 278

Query: 246 MID 248
           M++
Sbjct: 279 MVN 281
>pir||T03571 probable peptidase - Rhodobacter capsulatus
 gb|AAC16224.1| (AF010496) potential peptidase [Rhodobacter capsulatus]
          Length = 303

 Score = 50.8 bits (120), Expect = 2e-005
 Identities = 54/190 (28%), Positives = 78/190 (40%), Gaps = 18/190 (9%)

Query: 77  GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEA---EGAIKLGDMIRGTKKPXXXXXX 133
           G   Y  +  A     +DD+V  +    D+PGG     E A+     +   K        
Sbjct: 77  GWATYRGIAEACADLAADDAVSAVVVEFDTPGGYTLGIEDAVTALRDLAAVKPVHALAAP 136

Query: 134 XXXXXXXXXXXXXDKIVATSRVSRAGNIGAVLSMRR---PSQSSYI---DVTSTDAPNKR 187
                         ++V T R +  G+IG  ++      P  +S +   D TS  A  K 
Sbjct: 137 LAASAGYWLASQARELVMTPRAA-VGSIGVAVTAAANVAPGAASGVQLFDFTSRHARAKW 195

Query: 188 PDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLAD------VNQNFGRGGSLF-AET 240
           PDP TE+G+  IR + LD     F  AVA+GRG    D      V+ +   GG++F A  
Sbjct: 196 PDPATEDGKAEIR-RGLDRTEARFHAAVAEGRGIAPEDLAARLSVSDDPEDGGAVFEAAE 254

Query: 241 ALKMGMIDEI 250
           AL  G+ D I
Sbjct: 255 ALARGLADRI 264
>gb|AAC24140.1| (AF071201) unknown [bacteriophage Felix 01]
          Length = 442

 Score = 48.5 bits (114), Expect = 1e-004
 Identities = 50/222 (22%), Positives = 90/222 (40%), Gaps = 13/222 (5%)

Query: 40  PTANADRVMVRMGDTAQINMQGVFTSAPDFFAMLFGGGNTLYGDVFAAINAAESDDSVKN 99
           P   A  ++    D A I + G  T    F   +  GG + Y  +    + A +D+S+K 
Sbjct: 48  PKGEARSLLKVKDDIAIIPIMGGLTHRMTFIDAMCTGGLSSYEGLRRGFDEALADESIKT 107

Query: 100 IEYYIDSPGGEAEGAIKLGDMI---RGTKKPXXXXXXXXXXXXXXXXXXXDKIVATSRVS 156
           I  +IDS GGEA G  +L   I   RG KK                    ++I+A S  +
Sbjct: 108 IVLHIDSGGGEASGCFELARHIMASRGQKKIIAYVDEFACSAAYALASSAEEIIA-SPDA 166

Query: 157 RAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKR----PDPETEEGRRVIREQALDPLHN 209
             G+IG ++    + +  + + + +    A   +    P     E  +   ++ ++  + 
Sbjct: 167 DVGSIGVIMVHQELTKAFEKNGVTINVIKAGEFKGMGSPFQALSEESKERLQKRINDTYA 226

Query: 210 MFAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGMIDEIL 251
            F   VA+ R  +   V           A+ AL++G+I+ I+
Sbjct: 227 TFTGFVAESRNLSEEAVKNT--EANVYSAQEALELGLINSIM 266
>sp|P36273|VG05_BPP21 HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CONTAINS: SCAFFOLD PROTEIN
           (HEAD PROTEIN GP6)]
 pir||JN0539 head protein gp5 - phage 21
 gb|AAA32343.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
          Length = 501

 Score = 48.1 bits (113), Expect = 1e-004
 Identities = 46/185 (24%), Positives = 77/185 (40%), Gaps = 13/185 (7%)

Query: 77  GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
           G T Y  + A +  A +D  V+ +   IDSPGG+A GA    DMI   +  KP       
Sbjct: 99  GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158

Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
                             ++ SR G+IG ++   S       + +D+T   +   + D  
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218

Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
             E      R   +Q +D    MFA+ VA   G ++  V    G   ++F  ++ ++ G+
Sbjct: 219 QFEALPAEVRQDMQQRIDAARRMFAEKVAMYTGLSVDAVT---GTEAAVFEGQSGIEAGL 275

Query: 247 IDEIL 251
            DE++
Sbjct: 276 ADELI 280
>ref|NP_287793.1| (NC_002655) putative capsid assembly protein of prophage CP-933R
           [Escherichia coli O157:H7 EDL933]
 ref|NP_309135.1| (NC_002695) putative head-tail preconnector protein [Escherichia
           coli O157:H7]
 ref|NP_310759.1| (NC_002695) putative head-tail preconnector protein [Escherichia
           coli O157:H7]
 gb|AAG56407.1|AE005368_13 (AE005368) putative capsid assembly protein of prophage CP-933R
           [Escherichia coli O157:H7 EDL933]
 dbj|BAB34531.1| (AP002554) putative head-tail preconnector protein [Escherichia
           coli O157:H7]
 dbj|BAB36155.1| (AP002559) putative head-tail preconnector protein [Escherichia
           coli O157:H7]
          Length = 501

 Score = 47.0 bits (110), Expect = 3e-004
 Identities = 46/185 (24%), Positives = 76/185 (40%), Gaps = 13/185 (7%)

Query: 77  GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
           G T Y  + A +  A +D  V+ +   IDSPGG+A GA    DMI   +  KP       
Sbjct: 99  GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158

Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
                             ++ SR G+IG ++   S       + +D+T   +   + D  
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218

Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
             E      R   +Q +D    MFA+ VA   G ++  V    G   ++F  ++ +  G+
Sbjct: 219 QFEALPAEVRQNMQQRIDAARRMFAEKVAMFTGLSVDAVT---GTEAAVFEGQSGIDAGL 275

Query: 247 IDEIL 251
            DE++
Sbjct: 276 ADELV 280
>ref|NP_287588.1| (NC_002655) putative head-tail preconnector protein of prophage
           CP-933O [Escherichia coli O157:H7 EDL933]
 gb|AAG56200.1|AE005349_4 (AE005349) putative head-tail preconnector protein of prophage
           CP-933O [Escherichia coli O157:H7 EDL933]
          Length = 501

 Score = 47.0 bits (110), Expect = 3e-004
 Identities = 46/185 (24%), Positives = 76/185 (40%), Gaps = 13/185 (7%)

Query: 77  GNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK--KPXXXXXXX 134
           G T Y  + A +  A +D  V+ +   IDSPGG+A GA    DMI   +  KP       
Sbjct: 99  GMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVWALCND 158

Query: 135 XXXXXXXXXXXXDKIVATSRVSRAGNIGAVL---SMRRPSQSSYIDVTSTDAPNKRPDPE 191
                             ++ SR G+IG ++   S       + +D+T   +   + D  
Sbjct: 159 TACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAHKVDGN 218

Query: 192 TEEGR----RVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLF-AETALKMGM 246
             E      R   +Q +D    MFA+ VA   G ++  V    G   ++F  ++ +  G+
Sbjct: 219 QFEALPAEVRQNMQQRIDAARRMFAEKVAMFTGLSVDAVT---GTEAAVFEGQSGIDAGL 275

Query: 247 IDEIL 251
            DE++
Sbjct: 276 ADELV 280
>ref|NP_470963.1| (NC_003212) similar to proteases [Listeria innocua]
 emb|CAC96858.1| (AL596169) similar to proteases [Listeria innocua]
          Length = 337

 Score = 43.1 bits (100), Expect = 0.005
 Identities = 57/263 (21%), Positives = 96/263 (35%), Gaps = 33/263 (12%)

Query: 10  LRLMADAPNLTQAQIDQCMASSPLF-QGPGGPTANADRVMVRMG--DT-AQINMQGVFTS 65
           L +++     T +QI     SSP F +     T      ++  G  DT A +++ G    
Sbjct: 15  LLIVSALAKFTSSQIASTEESSPTFVESLFADTGELTETVIEEGGDDTIAVLSVDGTIQD 74

Query: 66  APDFFAMLFGGGNTLYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIRGTK 125
             D    LFG     +      +    +DD ++ +  Y++SPGG    + ++ D I   +
Sbjct: 75  TGDS-GSLFGDAGYDHSFFMQQLEQVRNDDYIQGVLLYVNSPGGGVMESAQIRDKILQIQ 133

Query: 126 KP-----XXXXXXXXXXXXXXXXXXXDKIVATSRVSRAGNIGAVL--------------S 166
           K                         DKI A S+ +  G++G ++              S
Sbjct: 134 KERNIPFYVSMGSMAASGGYYISAPADKIFA-SKETLTGSLGVIMQGYDYSELMKKLGVS 192

Query: 167 MRRPSQSSYIDVTSTDAPNKRPDPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADV 226
                   Y D+ S   P         E  + I +  +D  +N F + VA GRG + A+ 
Sbjct: 193 DNTIKSGEYKDIMSGTRP-------MTEDEKKIMQSMIDDSYNEFVKVVAKGRGMS-AEK 244

Query: 227 NQNFGRGGSLFAETALKMGMIDE 249
            +    G       A + G+IDE
Sbjct: 245 VRKIADGRIYDGRQAKENGLIDE 267
>ref|NP_540710.1| (NC_003317) PUTATIVE PROTEASE IV [Brucella melitensis]
 gb|AAL52974.1| (AE009612) PUTATIVE PROTEASE IV [Brucella melitensis]
          Length = 301

 Score = 42.7 bits (99), Expect = 0.006
 Identities = 36/152 (23%), Positives = 59/152 (38%), Gaps = 15/152 (9%)

Query: 83  DVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIR--GTKKPXXXXXXXXXXXXX 140
           ++   ++    DD+VK +   +DSPGG   G   + D +R   TKKP             
Sbjct: 39  ELLKRLDKIAGDDAVKGVILLLDSPGGTTVGGEAIYDAVRKIATKKPVVTQVGTLAASAG 98

Query: 141 XXXXXXDKIVATSRVSRAGNIGAVLSMRRPSQSSYIDV------TSTDAPNKRP----DP 190
                    +   + S  G+IG +   + P  S  +D       T   +P K       P
Sbjct: 99  YMIASASDHIVARQTSIVGSIGVL--FQYPDLSKLLDTLGVKVETIKSSPLKAEPNYFSP 156

Query: 191 ETEEGRRVIREQALDPLHNMFAQAVADGRGTT 222
            +EE + +IR   +D  ++ F   V + R  T
Sbjct: 157 ASEEAKNMIRSMIMDS-YDWFVGIVQERRAFT 187
>ref|NP_110548.1| (NC_002689) Secreted serine protease (ClpP class) [Thermoplasma
           volcanium]
 dbj|BAB59171.1| (AP000991) proteinase IV [Thermoplasma volcanium]
          Length = 230

 Score = 41.6 bits (96), Expect = 0.014
 Identities = 41/182 (22%), Positives = 73/182 (39%), Gaps = 14/182 (7%)

Query: 80  LYGDVFAAINAAESDDSVKNIEYYIDSPGGEAEGAIKLGDMIR--GTKKPXXXXXXXXXX 137
           LY  ++ A+ AAE   S+  +    +S GG+A  +  + D+++    KKP          
Sbjct: 16  LYKYLYPALKAAEGKKSIAGLILVFNSGGGDAASSQLIHDLVKKIRKKKPVYSLALGICA 75

Query: 138 XXXXXXXXXDKIVATSRVSRAGNIGAVLSMRRPSQSSYIDVTSTDA---------PNKRP 188
                       +     S  G+IG V+S+ RP+    ++    D              P
Sbjct: 76  SGAYWIASASTKIYAIDTSLIGSIG-VISI-RPNVKKLMEKIGVDVMVYKSGKYKDMTSP 133

Query: 189 DPETEEGRRVIREQALDPLHNMFAQAVADGRGTTLADVNQNFGRGGSLFAETALKMGMID 248
             E  E  + + ++ LD +   F ++VA+ RG     +++    G    A+ A   G+ID
Sbjct: 134 FSEPNEEEKSVYQRLLDDIFEKFKRSVAEDRGIPSEKIDE-IANGMVYSAKMAADNGLID 192

Query: 249 EI 250
            I
Sbjct: 193 RI 194
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.315 0.131 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 176,519,459 Number of Sequences: 884779 Number of extensions: 6309375 Number of successful extensions: 15521 Number of sequences better than 5.0e-02: 17 Number of HSP's better than 0.1 without gapping: 2 Number of HSP's successfully gapped in prelim test: 15 Number of HSP's that attempted gapping in prelim test: 15513 Number of HSP's gapped (non-prelim): 17 length of query: 395 length of database: 277,083,050 effective HSP length: 122 effective length of query: 273 effective length of database: 169,140,012 effective search space: 46175223276 effective search space used: 46175223276 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 92 (40.0 bits) Query= orf7 [1956-2091] (134 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_518980.1|  (NC_003295) PROBABLE BACTERIOPHAGE-RELATED...    37  0.025
>ref|NP_518980.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
          solanacearum]
 emb|CAD14561.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
          solanacearum]
          Length = 125

 Score = 37.4 bits (85), Expect = 0.025
 Identities = 18/41 (43%), Positives = 27/41 (64%)

Query: 33 AAETWPAGAVLGRVTATGRYVRYNASASDGSQVPSAILTEA 73
          A ++   G VLG VTATG+  + + SA+DGSQ  + +L +A
Sbjct: 31 AGQSLVLGTVLGMVTATGKVKQLDPSATDGSQYAAGVLMQA 71
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.319 0.135 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 59,690,860 Number of Sequences: 884779 Number of extensions: 1917181 Number of successful extensions: 4161 Number of sequences better than 5.0e-02: 1 Number of HSP's better than 0.1 without gapping: 1 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4160 Number of HSP's gapped (non-prelim): 1 length of query: 134 length of database: 277,083,050 effective HSP length: 110 effective length of query: 24 effective length of database: 179,757,360 effective search space: 4314176640 effective search space used: 4314176640 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 83 (36.6 bits) Query= orf8 [2096-2461] (364 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_056680.1|  (NC_002214) gp348 [Streptococcus thermophi...    46  5e-004
pir||T13320  hypothetical protein 31 - Streptococcus phage p...    44  0.002
>ref|NP_056680.1| (NC_002214) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
 pir||T13625 hypothetical protein gp348 - Streptococcus phage phi-Sfi11
 gb|AAC34402.1| (AF158600) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
          Length = 348

 Score = 46.2 bits (108), Expect = 5e-004
 Identities = 42/167 (25%), Positives = 68/167 (40%), Gaps = 16/167 (9%)

Query: 37  GKTVEIDVKRFAENVAVVVTKLSGPNFNDASLISTKEFEPPEYGEAFATDVDDLLQRLLG 96
           G++V +    F  NV V           D       + + P + EA     +D  Q  L 
Sbjct: 51  GQSVALKAAAFDTNVTV----------RDRVSAEMHDEQMPFFKEAMLVKENDRQQLNL- 99

Query: 97  VNPYDDANIAYSSKLVGRLMDYFMEANDMIMRGIELQASQILQTGRLSLIDRAGEVAYEI 156
               D  N A  + +V  + +  +   +     +E    Q+L TG+++       V  +I
Sbjct: 100 --VKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSDG--VNKDI 155

Query: 157 DYSPKATHFPTTATAWSDDGADPLADLESLFGVIRADGKVNPDMIIM 203
           DY  K  H    + +W++ GA PLADLE      R  G +NP+  +M
Sbjct: 156 DYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELG-LNPERAVM 201
>pir||T13320 hypothetical protein 31 - Streptococcus phage phi-O1205
 gb|AAC79546.1| (U88974) ORF31 [Streptococcus thermophilus temperate bacteriophage
           O1205]
          Length = 348

 Score = 44.3 bits (103), Expect = 0.002
 Identities = 25/74 (33%), Positives = 38/74 (50%), Gaps = 3/74 (4%)

Query: 130 IELQASQILQTGRLSLIDRAGEVAYEIDYSPKATHFPTTATAWSDDGADPLADLESLFGV 189
           +E    Q+L TG+++       V  +IDY  K  H    + +W++ GA PLADLE     
Sbjct: 131 LEAMRMQVLATGKIAFTSDG--VNKDIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIET 188

Query: 190 IRADGKVNPDMIIM 203
            R  G +NP+  +M
Sbjct: 189 ARELG-LNPERAVM 201
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.137 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 205,071,628 Number of Sequences: 884779 Number of extensions: 8701858 Number of successful extensions: 24210 Number of sequences better than 5.0e-02: 2 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 2 Number of HSP's that attempted gapping in prelim test: 24209 Number of HSP's gapped (non-prelim): 3 length of query: 364 length of database: 277,083,050 effective HSP length: 121 effective length of query: 243 effective length of database: 170,024,791 effective search space: 41316024213 effective search space used: 41316024213 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 91 (39.7 bits) Query= orf9 [2477-2607] (129 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.309 0.137 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 63,642,766 Number of Sequences: 884779 Number of extensions: 2055492 Number of successful extensions: 2316 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2316 Number of HSP's gapped (non-prelim): 0 length of query: 129 length of database: 277,083,050 effective HSP length: 105 effective length of query: 24 effective length of database: 184,181,255 effective search space: 4420350120 effective search space used: 4420350120 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.6 bits) S2: 83 (36.6 bits) Query= orf10 [2609-2738] (128 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.314 0.133 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 67,790,511 Number of Sequences: 884779 Number of extensions: 2607104 Number of successful extensions: 4764 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4764 Number of HSP's gapped (non-prelim): 0 length of query: 128 length of database: 277,083,050 effective HSP length: 104 effective length of query: 24 effective length of database: 185,066,034 effective search space: 4441584816 effective search space used: 4441584816 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 83 (36.6 bits) Query= orf11 [2736-2965] (228 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.132 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 112,919,182 Number of Sequences: 884779 Number of extensions: 4236100 Number of successful extensions: 9074 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 9074 Number of HSP's gapped (non-prelim): 0 length of query: 228 length of database: 277,083,050 effective HSP length: 116 effective length of query: 112 effective length of database: 174,448,686 effective search space: 19538252832 effective search space used: 19538252832 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 89 (38.9 bits) Query= orf12 [2969-3470] (500 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_050643.1|  (NC_000929) L [Enterobacteria phage Mu] >g...    96  9e-019
ref|NP_404841.1|  (NC_003143) putative bacteriophage tail sh...    85  2e-015
ref|NP_284552.1|  (NC_003116) hypothetical protein [Neisseri...    77  5e-013
ref|NP_439661.1|  (NC_000907) sheath protein gpL (muL) [Haem...    66  7e-010
ref|NP_543096.1|  (NC_003356) putative sheath protein [Bacte...    65  2e-009
ref|NP_274135.1|  (NC_003112) phage sheath protein [Neisseri...    63  8e-009
ref|NP_313006.1|  (NC_002695) putative tail sheath protein [...    61  3e-008
>ref|NP_050643.1| (NC_000929) L [Enterobacteria phage Mu]
 sp|P79678|VPL_BPMU Tail sheath protein (gpL)
 dbj|BAA19195.1| (AB000833) sheath protein gpL [Enterobacteria phage Mu]
 gb|AAF01117.1|AF083977_36 (AF083977) L [Enterobacteria phage Mu]
          Length = 495

 Score = 95.9 bits (237), Expect = 9e-019
 Identities = 128/483 (26%), Positives = 193/483 (39%), Gaps = 29/483 (6%)

Query: 5   NAIDNSVRARVLGIKTEFRNFNTGRTFFLP-QHVALLGQ-GNTAATYALTPFRATSAAQV 62
           NAI + VR  +  I  EF N N       P Q V + GQ G+ A+     P R  S +Q 
Sbjct: 7   NAIPSDVRVPLTYI--EFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQA 64

Query: 63  GQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATV 122
              FG GS L L A   L  N    R   L   P G N  G AA G + LSGTA    ++
Sbjct: 65  SAAFGQGSMLALMADAFLNAN----RVAELWCIPQG-NGTGNAAVGEISLSGTAGENGSL 119

Query: 123 RVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTAAVDATNDNE-----VNVAA 177
              I  QR + V++  G T    A LLVA I+G P +P+TA V A + ++     V ++A
Sbjct: 120 VTYIAGQRLA-VSVAAGATGAALADLLVARIKGQPDLPVTAEVRADSGDDDTHADVVLSA 178

Query: 178 KWKG-LSGNDL---VVSIEGAVAGLTVAITQXXXXXXXXXXXXXXXXFGENVWYTQIVSC 233
           K+ G LS  D+     + E    G+  A                    G+ + Y  IV  
Sbjct: 179 KFTGALSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGD-LQYKYIVMP 237

Query: 234 FNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAIGDARRAQRTNLITPV 293
           +   +   LN   T  + RW P+ +      T  + T  + ++  G +R     + +  +
Sbjct: 238 YT--DEPNLNLLRTELQERWGPVNQADGFAVTVLSGTYGD-ISTFGVSRNDHLISCMG-I 293

Query: 294 PGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWSHTQRDLLVKAGI 353
            GA                   + DPA     LTLPG  P     +++ ++R+ L+  GI
Sbjct: 294 AGAPEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALLFDGI 353

Query: 354 STSIVRD-GVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLYNLDLIFNTREWDGA 410
           ST  V D G  +I   +T Y     G+ +P YL        S + Y+L     T+++   
Sbjct: 354 STFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRI-TQKFPNY 412

Query: 411 PLIPD-DQPTTNARAKRPKXXXXXXXXXXXXXXXXXIISDVPFTLENIRASINDQNPNRL 469
            L  D  +  T      P                  ++ D     E +  + N  + +RL
Sbjct: 413 KLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNKDDKDRL 472

Query: 470 DII 472
           D++
Sbjct: 473 DVL 475
>ref|NP_404841.1| (NC_003143) putative bacteriophage tail sheath protein [Yersinia
           pestis]
 emb|CAC90076.1| (AJ414147) putative bacteriophage tail sheath protein [Yersinia
           pestis]
          Length = 502

 Score = 84.7 bits (208), Expect = 2e-015
 Identities = 95/371 (25%), Positives = 150/371 (39%), Gaps = 33/371 (8%)

Query: 21  EFRNFNTGRTFFLPQHVALLGQGNTAATY-ALTPFRATSAAQVGQRFGFGSPLHLAALQL 79
           EF N +   T    Q   ++GQ    +T  A  P   +S A V    G GS LH      
Sbjct: 20  EFDN-SQANTASTTQRTLIIGQTRAGSTLPANIPLLVSSTATVAGLSGAGSMLHGQMAAY 78

Query: 80  LPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTG 139
           L ++  G     + + P+ D D+ VAA G + ++  AS T  + + IG  R     + T 
Sbjct: 79  LANDTAG----EIYLLPLSDADSMVAAIGKITINTAASETGVISLYIGGIRVQTTVVATD 134

Query: 140 TTAEQAAALLVAGIQGNPFMPMTAAVDATNDNE---VNVAAKWKGLSGN--DLVVSIEGA 194
           + +  AAAL  A I+  P +P+T        +E   V + AK KG  GN  DL ++  G 
Sbjct: 135 SVSTIAAAL-AAAIENQPELPVTVVHTGDMVSEGAVVVLEAKNKGAHGNNIDLRLNYLGN 193

Query: 195 VAG------LTVAITQXXXXXXXXXXXXXXXXFGENVWYTQIVSCFNTANTDALNAFETF 248
             G      L + IT                   +  +   I    +TA+ DA+  F + 
Sbjct: 194 AGGEITPESLVLMITPMAGGAGAPELADGLANLQDRTFDFIINPYTDTASLDAIKDFLSD 253

Query: 249 GEGRWN---PILRRPLAVFTGTNETNANTLAAIGDARRAQRTNLITPVPGAQNLPCEI-- 303
             GRW+    +     A  +GT       L A G+ R  Q  +L+    G  N P     
Sbjct: 254 STGRWSYRQQLYGHSFAAQSGTY----GQLTAAGELRNDQHASLL----GIHNSPTPAHI 305

Query: 304 -XXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWSHTQRDLLVKAGIST-SIVRDG 361
                         NDP      L + G+     + +++ T+R+ L+ +GIST ++  D 
Sbjct: 306 WSAAYVGAIAQSLRNDPGRPLQTLAIRGVLAPPLSSRFTLTERNNLLHSGISTVTVADDS 365

Query: 362 VAEISDTVTTY 372
             ++ + +TTY
Sbjct: 366 TVQVENIITTY 376
>ref|NP_284552.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
 pir||E81810 hypothetical protein NMA1841 [imported] - Neisseria meningitidis
           (group A strain Z2491)
 emb|CAB71993.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
 emb|CAB85065.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
          Length = 475

 Score = 76.6 bits (187), Expect = 5e-013
 Identities = 59/186 (31%), Positives = 84/186 (44%), Gaps = 13/186 (6%)

Query: 19  KTEFRNFNTGRTFFLPQHVALLGQGNTAATYALTPFRAT-SAAQVGQRFGFGSPLHLAAL 77
           K   RN  T +     Q V ++ Q N  A   LT      SAA    ++G GS  HL   
Sbjct: 27  KLAVRNLPTNK-----QRVLIVAQHNNPALGELTELENVFSAADAAAKYGAGSMAHLMVT 81

Query: 78  QLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIP 137
             +           L++  + DN AGVAA G + LSGTA+    +RV I +  +  + I 
Sbjct: 82  AAIK----AYAYADLSLITVADNKAGVAAGGKITLSGTANTQGVLRVSIANADTLTIGIG 137

Query: 138 TGTTAEQAAALLVAGIQGNPFMPMTAAVDATNDNEVNVAAKWKGLSGNDLVVSIEGAVAG 197
              TA   AA + A I   P +P+TA V    +  V + AK KG +GN + +       G
Sbjct: 138 AEDTAATVAAAVKAAIDAVPDLPVTATV---AEAVVTLTAKNKGTAGNAIRIKTSNTAEG 194

Query: 198 LTVAIT 203
           +T A+T
Sbjct: 195 ITAAVT 200
>ref|NP_439661.1| (NC_000907) sheath protein gpL (muL) [Haemophilus influenzae Rd]
 sp|P44233|VPL_HAEIN Mu-like prophage FluMu tail sheath protein
 pir||I64033 hypothetical protein HI1511 - Haemophilus influenzae (strain Rd
           KW20)
 gb|AAC23158.1| (U32827) sheath protein gpL (muL) [Haemophilus influenzae Rd]
          Length = 487

 Score = 66.2 bits (160), Expect = 7e-010
 Identities = 81/364 (22%), Positives = 141/364 (38%), Gaps = 30/364 (8%)

Query: 45  TAATYALTPFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGV 104
           T +  A    R ++ AQ    FG GS L          N+    ++ L + P+ +  +G 
Sbjct: 46  TGSAKAGEAVRVSAYAQAKTLFGRGSQLAEMVKTFKAHNS----TLDLWVLPLDEAPSGA 101

Query: 105 AAEGVLDLSGTASATATVRVRI-GSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTA 163
            A G + ++GTA+   T  + I G+   + VT  +G TA+  A  L   I  +  +P+ A
Sbjct: 102 KATGSVQITGTATQAGTFSLMIAGNNYKTAVT--SGDTADVVAGKLQKLIAADQDVPVVA 159

Query: 164 AVDATNDNEVNVAAKWKGLSGNDLVV-----SIEGAVAGLTVAITQXXXXXXXXXXXXXX 218
            V     N + +  ++KG +GN++ +     S E    GL   IT               
Sbjct: 160 TVAG---NTITLTCRFKGETGNEIDLRCNYYSGEAFPEGLKATITDMKNGAVNPDMSVAI 216

Query: 219 XXFGENVWYTQIVSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAI 278
              G   W+  I++ F   +T++LN        RW P+ +     F     T+A      
Sbjct: 217 TGLGAE-WWNYIINPFT--DTESLNLLRADLVKRWGPLKQIDGICFMAKRGTHAEVTTFA 273

Query: 279 GDARRAQRTNLITPVPGAQNLPCEIXXXXXXXXXXXXNN---DPASDFARLTLPGLTPGT 335
                 QR + +  +      P                +   DPA     L +  L P +
Sbjct: 274 -----EQRNDYLFSLLATHKAPQPAYLWASAYAAVVAGSLAIDPARPVQTLVMD-LLPPS 327

Query: 336 DAQQWSHTQRDLLVKAGISTSIVRDG-VAEISDTVTTYHPT--GEVNPGYLFYKDTVKAS 392
            + +W   +R+ L+ +G+ST  V  G   ++   +T Y     G+ +  YL+ +     S
Sbjct: 328 MSDRWDLPERNTLLYSGVSTYTVNAGSQPQVEAAITMYRKNAFGDNDESYLYVETIATLS 387

Query: 393 NVLY 396
            + Y
Sbjct: 388 YLRY 391
>ref|NP_543096.1| (NC_003356) putative sheath protein [Bacteriophage P27]
 emb|CAC83562.1| (AJ298298) putative sheath protein [Bacteriophage P27]
          Length = 498

 Score = 65.1 bits (157), Expect = 2e-009
 Identities = 85/389 (21%), Positives = 142/389 (35%), Gaps = 29/389 (7%)

Query: 103 GVAAEGVLDLSGTASATATVRVRIGSQRSSL-VTIPTGTTAEQAAALLVAGIQGNPFMPM 161
           G AA   + ++G A  + T+ + +G  RSS+ V +  G  A   A  +   + G   +P 
Sbjct: 97  GAAATVRVTVTGEAEESGTLSLYVG--RSSVQVPVVNGDDATAVATAIKEAVNGVITLPF 154

Query: 162 TAAVDATNDNEVNVAAKWKGLSGNDLVVSI--------EGAVAGLTVAITQXXXXXXXXX 213
            A+ DA     V + A+ KGL GN+L V +        E   AGL V             
Sbjct: 155 AASSDA---GVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPD 211

Query: 214 XXXXXXXFGENVWYTQIVSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNAN 273
                   G+  +    +   + A+ + +        GRW+   +    V+T    T + 
Sbjct: 212 LTAAVAAMGDEAFDFIGLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGTLSE 271

Query: 274 TLAAIGDARRAQRTNLITPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTP 333
            + A GD    Q   L       Q+   E+             NDPA       L G+ P
Sbjct: 272 LVNA-GDMHNQQHITLAGYEKETQSPVDELVASRLAREAVFIRNDPARPTQTGELVGMLP 330

Query: 334 GTDAQQWSHTQRDLLVKAGISTSIVRDGVAEISDTVTTY--HPTGEVNPGYLFYKDTVKA 391
               +++  T++  L+  G++T+ V  G   I  +VTTY  +  G  +  YL  +    +
Sbjct: 331 APKGKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTS 390

Query: 392 SNVLYNLDLIFNTREW------DGAPLIPDDQPTTNARAKRPKXXXXXXXXXXXXXXXXX 445
           + VL  L  +  ++        DG    P     T      P                  
Sbjct: 391 AYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVT------PAVIKGELLATYRQMERAG 444

Query: 446 IISDVPFTLENIRASINDQNPNRLDIIFP 474
           I+ +     + +    +  NPNRL+ +FP
Sbjct: 445 IVENYDLFKQYLIVERDADNPNRLNTLFP 473
>ref|NP_274135.1| (NC_003112) phage sheath protein [Neisseria meningitidis MC58]
 pir||E81121 phage sheath protein NMB1104 [imported] - Neisseria meningitidis
           (group B strain MD58)
 gb|AAF41495.1| (AE002460) phage sheath protein [Neisseria meningitidis MC58]
          Length = 469

 Score = 62.8 bits (151), Expect = 8e-009
 Identities = 53/167 (31%), Positives = 74/167 (43%), Gaps = 10/167 (5%)

Query: 34  PQHVALLGQGNTAATY-ALTPFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPL 92
           PQ V ++    TA    AL P +  S A+    FG GS  HL   Q   +N      + L
Sbjct: 35  PQKVLMVAPMLTAGIQPALEPVQLFSDAEAADLFGQGSLAHLMVRQAFANNP----YLDL 90

Query: 93  TIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAG 152
           T+  + D+ AGV A   + LSGTA+A   V + IG ++ S   + TG TA   A  L   
Sbjct: 91  TVIGIADHSAGVQATATVTLSGTATAPGVVEITIGGKQVS-TAVNTGETAATVADRLKTA 149

Query: 153 IQGNPFMPMTAAVDATNDNEVNVAAKWKGLSGNDLVVSIEGAVAGLT 199
           I         +   A     V + AK KG  GN+  +++     GLT
Sbjct: 150 ITAADVTVTASGSGAA----VTLTAKHKGEIGNESGLTVSTGNTGLT 192
>ref|NP_313006.1| (NC_002695) putative tail sheath protein [Escherichia coli O157:H7]
 dbj|BAB38402.1| (AP002567) putative tail sheath protein [Escherichia coli O157:H7]
          Length = 500

 Score = 60.8 bits (146), Expect = 3e-008
 Identities = 96/435 (22%), Positives = 156/435 (35%), Gaps = 27/435 (6%)

Query: 53  PFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDL 112
           P R T A+Q  + +G GS + L   + +  N D      L     G N  G A    ++L
Sbjct: 58  PVRITRASQATELWGRGSMIALMVAEFIAINPDA----ELYAIAQG-NGTGQATAAAMNL 112

Query: 113 SGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTAAVDA-TNDN 171
           +GT +    V   +G +R +L   P G   ++    L A I  +P  P TA+  A + DN
Sbjct: 113 TGTVTRDGIVYACVGGRRYTLPA-PKGKKGKELTDELAALINADPDAPFTASSGAGSGDN 171

Query: 172 ------EVNVAAKWKG-LSGNDLVVSI---EGAVAGLTVAITQXXXXXXXXXXXXXXXXF 221
                  + + A++ G  S +D+ ++    E    G+ VAI                   
Sbjct: 172 GAGLKGSLGITARFTGECSVHDVRLNYHDGEATPEGIQVAIAYPKQKAANPDITRSVAGM 231

Query: 222 GENVWYTQIVSCFNTANTDALNAFETFGEGRWNPILRRPLAVFTGTNETNANTLAAIGDA 281
           G+  +   ++   + AN   L         RW P       ++     T     A    A
Sbjct: 232 GDRQYNYVVMPYKDDAN---LKIISDELLKRWGPAKMSDGVLWLAHTGTFGEVQAF--GA 286

Query: 282 RRAQRTNLITPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWS 341
           +R       T +P A                     DPA     L LP       A + +
Sbjct: 287 KRNDFLATCTSIPKAPEPDYLWAAAVCATCAPSLGADPARPLQTLALPSRMAPAPADRLT 346

Query: 342 HTQRDLLVKAGIST-SIVRDGVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLYNL 398
             +R+ L+  GI+T S+    V +I   VT Y     GE +P YL  +     S + Y+L
Sbjct: 347 REERNSLLWGGIATVSVAAGDVVQIERQVTMYRQNVYGESDPSYLDVETIYTLSWLRYSL 406

Query: 399 DLIFNTREWDGAPLIPDDQPT-TNARAKRPKXXXXXXXXXXXXXXXXXIISDVPFTLENI 457
              F T+ +    L  D  P         P+                 ++ ++    +N+
Sbjct: 407 -RTFITQRFPRHKLADDGTPVRAGQNIVTPEIMKLQLISLGEEWVELGLVENLDTFKKNL 465

Query: 458 RASINDQNPNRLDII 472
               N  + NRLD++
Sbjct: 466 LVERNASDRNRLDVL 480
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.317 0.134 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 248,521,712 Number of Sequences: 884779 Number of extensions: 9678405 Number of successful extensions: 20487 Number of sequences better than 5.0e-02: 7 Number of HSP's better than 0.1 without gapping: 1 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 20478 Number of HSP's gapped (non-prelim): 7 length of query: 500 length of database: 277,083,050 effective HSP length: 124 effective length of query: 376 effective length of database: 167,370,454 effective search space: 62931290704 effective search space used: 62931290704 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 93 (40.4 bits) Query= orf13 [3474-3602] (127 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.310 0.128 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 65,088,242 Number of Sequences: 884779 Number of extensions: 2494057 Number of successful extensions: 7057 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 7057 Number of HSP's gapped (non-prelim): 0 length of query: 127 length of database: 277,083,050 effective HSP length: 103 effective length of query: 24 effective length of database: 185,950,813 effective search space: 4462819512 effective search space used: 4462819512 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.7 bits) S2: 83 (36.6 bits) Query= orf14 [3627-3761] (133 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.135 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 70,441,945 Number of Sequences: 884779 Number of extensions: 2796944 Number of successful extensions: 7351 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 7351 Number of HSP's gapped (non-prelim): 0 length of query: 133 length of database: 277,083,050 effective HSP length: 109 effective length of query: 24 effective length of database: 180,642,139 effective search space: 4335411336 effective search space used: 4335411336 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 83 (36.6 bits) Query= orf15 [3793-3828] (34 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.326 0.144 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 15,999,558 Number of Sequences: 884779 Number of extensions: 270277 Number of successful extensions: 549 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 549 Number of HSP's gapped (non-prelim): 0 length of query: 34 length of database: 277,083,050 effective HSP length: 10 effective length of query: 24 effective length of database: 268,235,260 effective search space: 6437646240 effective search space used: 6437646240 T: 11 A: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 84 (37.0 bits) Query= orf16 [3866-4448] (581 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_058453.1|  (NC_002321) orf 15 [Staphylococcus aureus ...   172  9e-042
ref|NP_061640.1|  (NC_002486) phi PVL ORF 15 and 16 homologu...   172  9e-042
ref|NP_310668.1|  (NC_002695) putative tail protein [Escheri...   163  4e-039
ref|NP_288365.1|  (NC_002655) putative tail fiber component ...   161  2e-038
pir||T13525  hypothetical protein 36 - Bacillus phage phi-10...   153  4e-036
ref|NP_108725.1|  (NC_002703) unknown [Bacteriophage Tuc2009...   153  5e-036
ref|NP_112708.1|  (NC_002747) TMP [Lactococcus lactis bacter...   152  9e-036
ref|NP_075512.1|  (NC_002661) orf1374 [Staphylococcus aureus...   145  1e-033
ref|NP_490625.1|  (NC_003278) orf25; similar to T gene of P2...   132  1e-029
ref|NP_046782.1|  (NC_001895) gpT [Enterobacteria phage P2] ...   120  4e-026
ref|NP_052273.1|  (NC_001317) G protein [Enterobacteria phag...   112  8e-024
ref|NP_471714.1|  (NC_003212) similar to hypothetical protei...   111  2e-023
gb|AAK85310.1|  (AY043263) phage-related tail protein [Wolba...    48  2e-004
>ref|NP_058453.1| (NC_002321) orf 15 [Staphylococcus aureus bacteriophage PVL]
 pir||T00148 hypothetical protein 15 - Staphylococcus aureus phage phi PVL
 dbj|BAA31888.1| (AB009866) orf 15 [Staphylococcus aureus bacteriophage PVL]
          Length = 694

 Score =  172 bits (436), Expect = 9e-042
 Identities = 115/384 (29%), Positives = 191/384 (48%), Gaps = 25/384 (6%)

Query: 37  SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
           ++ GIEF+  + +     G       + ++ L   A+++G             + ++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234

Query: 97  GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
           G+D++Q +  L  ++ LA A+G +L   +DI TD L AFGL  KDS  LA       DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHLA-------DVL 287

Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
           A+TS+ ANT +  + E  +  AP A A G +IE  S  IG+M+N GIK E AGTA++  F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347

Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
             L++P       + RLGI + DS G M     ++  L      + + Q+ +    +FG 
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407

Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
           E ++G L +INA  +   +   ++ S+ GA+ RMA  +   + G LRTLRS +E +A+  
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467

Query: 337 FELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 395
           ++    +++ +V   +             V  ++  +  +I   V G  L   V   L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVVGFGLFVAVLGPLVF 515

Query: 396 AFNTIVSTITGAITLLN-LVIAAN 418
            F   +S +  A+T+L  L+I  N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_061640.1| (NC_002486) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
           prophage phiPV83]
 dbj|BAA97857.1| (AB044554) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
           prophage phiPV83]
          Length = 1313

 Score =  172 bits (436), Expect = 9e-042
 Identities = 115/384 (29%), Positives = 191/384 (48%), Gaps = 25/384 (6%)

Query: 37  SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
           ++ GIEF+  + +     G       + ++ L   A+++G             + ++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234

Query: 97  GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
           G+D++Q +  L  ++ LA A+G +L   +DI TD L AFGL  KDS   A       DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHFA-------DVL 287

Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
           A+TS+ ANT +  + E  +  AP A A G +IE  S  IG+M+N GIK E AGTA++  F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347

Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
             L++P       + RLGI + DS G M     ++  L      + + Q+ +    +FG 
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407

Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
           E ++G L +INA  +   +   ++ S+ GA+ RMA  +   + G LRTLRS +E +A+  
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467

Query: 337 FELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 395
           ++    +++ +V   +             V  ++  +  +I   V G  L A V   L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVIGFGLFAAVLGPLVF 515

Query: 396 AFNTIVSTITGAITLLN-LVIAAN 418
            F   +S +  A+T+L  L+I  N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_310668.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
 dbj|BAB36064.1| (AP002559) putative tail protein [Escherichia coli O157:H7]
          Length = 696

 Score =  163 bits (413), Expect = 4e-039
 Identities = 122/399 (30%), Positives = 182/399 (45%), Gaps = 47/399 (11%)

Query: 89  GIEFLAMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 148
           G  FLAMAGF  Q   AALP ++++A A G++L    DI ++ L  F L        A  
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299

Query: 149 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIA 208
           + RV D L    T  NT +  + ET++   P A   G S+E  +AM G++ANNG++   A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359

Query: 209 GTAVQNFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLA 268
           GTA++    RLA+P   A   L+ LG+ VAD+ G MR   DV+ DL  A  + G+  +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419

Query: 269 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 327
             + + G E   G    V  AG   L +    L  A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEEAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479

Query: 328 TVESVAIRFFELSGGSMRDVVDQATAWLR-----------ANRELIAQNVA--------- 367
             E + IR  +L  G +R V    T  L              R+L+    A         
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539

Query: 368 ----------GFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVSTITGAITLLNLVIAA 417
                     G L T+    D + R + ++  V   LW        +TG+++LL  VI A
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVV-RVLPALWGM------VTGSVSLLGGVIGA 592

Query: 418 NPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
                                + +W+PI++FF  ++ GI
Sbjct: 593 --LFSPVGLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>ref|NP_288365.1| (NC_002655) putative tail fiber component of prophage CP-933T
           [Escherichia coli O157:H7 EDL933]
 gb|AAG56919.1|AE005413_13 (AE005413) putative tail fiber component of prophage CP-933T
           [Escherichia coli O157:H7 EDL933]
          Length = 696

 Score =  161 bits (407), Expect = 2e-038
 Identities = 121/399 (30%), Positives = 181/399 (45%), Gaps = 47/399 (11%)

Query: 89  GIEFLAMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 148
           G  FLAMAGF  Q   AALP ++++A A G++L    DI ++ L  F L        A  
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299

Query: 149 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIA 208
           + RV D L    T  NT +  + ET++   P A   G S+E  +AM G++ANNG++   A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359

Query: 209 GTAVQNFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLA 268
           GTA++    RLA+P   A   L+ LG+ VAD+ G MR   DV+ DL  A  + G+  +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419

Query: 269 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 327
             + + G     G    V  AG   L +    L  A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEXAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479

Query: 328 TVESVAIRFFELSGGSMRDVVDQATAWLR-----------ANRELIAQNVA--------- 367
             E + IR  +L  G +R V    T  L              R+L+    A         
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539

Query: 368 ----------GFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVSTITGAITLLNLVIAA 417
                     G L T+    D + R + ++  V   LW        +TG+++LL  VI A
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVV-RVLPALWGM------VTGSVSLLGGVIGA 592

Query: 418 NPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
                                + +W+PI++FF  ++ GI
Sbjct: 593 --LFSPVGLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>pir||T13525 hypothetical protein 36 - Bacillus phage phi-105
 dbj|BAA36642.1| (AB016282) ORF36 [bacteriophage phi-105]
          Length = 1018

 Score =  153 bits (387), Expect = 4e-036
 Identities = 105/373 (28%), Positives = 179/373 (47%), Gaps = 16/373 (4%)

Query: 38  RAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAG 97
           + GI+FE  +++     G +  +  K    L + AK++G               FLAMAG
Sbjct: 173 QVGIDFEKQMSKVQAISGGSAAEIAK----LREQAKELGATTVFTASQAADAQGFLAMAG 228

Query: 98  FDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLA 157
           F       A+P ++SLA A  ++L  A DI ++ + AF L  K+S          +DV+A
Sbjct: 229 FKVNDIYDAMPGMLSLAAAGQLELGMAADITSNIMSAFALKAKES-------GHASDVIA 281

Query: 158 KTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFL 217
             + +ANT ++ M E ++  AP A + G  +E   A I    + G++  IAG A     +
Sbjct: 282 YAAANANTNVEQMGEAMKFLAPNANSLGWGMEESPAAIMAFGDAGLQGSIAGQAFGTSLI 341

Query: 218 RLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAE 277
           RLA+P G+A K++++LG    D+AGNM+   +VV ++   L  M + Q+ A ++ + GAE
Sbjct: 342 RLASPTGKASKLVKKLGFNFFDAAGNMKSMPEVVEEMEKGLKGMTKEQQAAALKTIVGAE 401

Query: 278 GLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFF 337
                  ++  G  AL +    L  ++GAA +MA  + D+  GS+   +S +E   I+  
Sbjct: 402 AYKHWAVLLQKGSKALGDNTKALEKSDGAAKKMADTMLDNAHGSIVAFQSALEGAKIKLT 461

Query: 338 ELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATLWAF 397
           E    ++ D+ ++ +  +     L +  V     T V     +  G+  + T  ATL A 
Sbjct: 462 ESLLPALGDLANKGSDLIMMFNNLDSGTVQTIAKTAV-----LATGVLGVTTAVATLTAG 516

Query: 398 NTIVSTITGAITL 410
              +   TG + L
Sbjct: 517 IGALLAFTGPVGL 529
>ref|NP_108725.1| (NC_002703) unknown [Bacteriophage Tuc2009]
 gb|AAK19873.1|AF109874_46 (AF109874) unknown [Bacteriophage Tuc2009]
          Length = 1025

 Score =  153 bits (386), Expect = 5e-036
 Identities = 114/441 (25%), Positives = 203/441 (45%), Gaps = 23/441 (5%)

Query: 37  SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
           ++ G +FE  ++R     G   +    ++++L   A  +G            G+E LA A
Sbjct: 69  AKIGGDFESQMSRVKAISGATGS----SFEELRQQAIDLGAKTAFSAKESASGMENLASA 124

Query: 97  GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
           GF+A++ + A+P ++ LA  +G D+  A++ A  +L  F L    S  +A       +V 
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVGLASENAATALRGFNLDASQSGHVA-------NVF 177

Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
           AK + + N  +  M E ++  AP A + G SIE VSA IG+M++ GIK   AGT+++   
Sbjct: 178 AKAAANTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237

Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
            RLA P    +  +  LG+   DS G M+   D +G L  A   +   Q+   +  ++G 
Sbjct: 238 SRLAKPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297

Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
           E L+G + +I+ G D L +   +L +++GAAD+MAK + D+M  SL  +   +ES AI  
Sbjct: 298 ESLSGMIALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGALESAAIVV 357

Query: 337 FELSGGSMRDVVDQATAWLR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 390
            ++   ++R V D  +  +    +     Q +   +  IV  I  ++    + + ++  V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVLVLQRV 417

Query: 391 FATLWAFNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFV 450
                A  + ++ I G+ T ++L +                       Y +W+ I  F  
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471

Query: 451 NLWDGIGIAFDTFAMNLANKW 471
            +W  +       A  +  KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_112708.1| (NC_002747) TMP [Lactococcus lactis bacteriophage TP901-1]
 gb|AAG32164.1|AF252967_1 (AF252967) tape measure protein [Lactococcus lactis bacteriophage
           TP901-1]
 gb|AAK38062.1| (AF304433) TMP [Lactococcus lactis bacteriophage TP901-1]
          Length = 937

 Score =  152 bits (384), Expect = 9e-036
 Identities = 115/441 (26%), Positives = 201/441 (45%), Gaps = 23/441 (5%)

Query: 37  SRAGIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMA 96
           ++ G +FE  ++R     G   +    ++++L   A  +G            G+E LA A
Sbjct: 69  AKIGGDFESQMSRVKAISGATGS----SFEELRQQAIDLGAKTAFSAKESASGMENLASA 124

Query: 97  GFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 156
           GF+A++ + A+P ++ LA  +G D+A A++ A  +L  F L    S  +A       +V 
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVALASENAATALRGFNLDASQSGHVA-------NVF 177

Query: 157 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFF 216
           AK +   N  +  M E ++  AP A + G SIE VSA IG+M++ GIK   AGT+++   
Sbjct: 178 AKAAADTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237

Query: 217 LRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGA 276
            RLA P    +  +  LG+   DS G M+   D +G L  A   +   Q+   +  ++G 
Sbjct: 238 SRLADPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297

Query: 277 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
           E L+G + +I+ G D L +   +L +++GAAD+MAK + D+M  SL  +    ES AI  
Sbjct: 298 ESLSGMMALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGAFESAAIVV 357

Query: 337 FELSGGSMRDVVDQATAWLR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 390
            ++   ++R V D  +  +    +     Q +   +  IV  I  ++    + +  +  V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVVTLQRV 417

Query: 391 FATLWAFNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFV 450
                A  + ++ I G+ T ++L +                       Y +W+ I  F  
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471

Query: 451 NLWDGIGIAFDTFAMNLANKW 471
            +W  +       A  +  KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_075512.1| (NC_002661) orf1374 [Staphylococcus aureus temperate phage phiSLT]
 dbj|BAB21743.1| (AB045978) orf1374 [Staphylococcus aureus temperate phage phiSLT]
          Length = 1374

 Score =  145 bits (365), Expect = 1e-033
 Identities = 89/317 (28%), Positives = 160/317 (50%), Gaps = 18/317 (5%)

Query: 53  KMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFDAQQSVAALPKIVS 112
           ++G      +K  + + + A  +G            G+E LA  GF+A+Q++ A+P ++S
Sbjct: 345 RVGAIAQASSKDLKSMSNQAVDLGAKTSKSANEVAKGMEELAALGFNAKQTMEAMPGVIS 404

Query: 113 LATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFE 172
            A A+G ++A    +   ++ +FGL   D+  +A       D+LA+++  +   I  M +
Sbjct: 405 AAEASGAEMATTATVMASAINSFGLKASDANHVA-------DLLARSANDSAADIQYMGD 457

Query: 173 TIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRLAAPAGEARKILRR 232
            ++ A   A A G SIE  SA I V++N+G++   AGTA++  F+RLA P+    K +++
Sbjct: 458 ALKYAGTPAKALGVSIEDTSAAIEVLSNSGLEGSQAGTALRASFIRLANPSKNTAKEMKK 517

Query: 233 LGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAEGLAGNLGVINAGKDA 292
           LGI ++D+ G      +++      +  M   Q+LA +  + G E  +G L +I AG D 
Sbjct: 518 LGIHLSDAKGQFVGMGELIRQFQDNMKGMTREQKLATVATIVGTEAASGFLALIEAGPDK 577

Query: 293 LVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQAT 352
           +  Y  +L ++ G + + A  + D++ G+L  L    ES+AI           +V    T
Sbjct: 578 INSYSKSLKNSNGESKKAADLMKDNLKGALEQLGGAFESLAI-----------EVGKDLT 626

Query: 353 AWLRANRELIAQNVAGF 369
             +RA  E + + V GF
Sbjct: 627 PMIRAGAEGLTKLVDGF 643
>ref|NP_490625.1| (NC_003278) orf25; similar to T gene of P2 [bacteriophage phi CTX]
 dbj|BAA36253.1| (AB008550) orf25; similar to T gene of P2 [bacteriophage phi CTX]
          Length = 904

 Score =  132 bits (332), Expect = 1e-029
 Identities = 107/457 (23%), Positives = 205/457 (44%), Gaps = 54/457 (11%)

Query: 40  GIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFD 99
           G EF+  ++ AT  +     K     Q L   A+ +             G  FL   G++
Sbjct: 212 GYEFDATMS-ATQAVTRIERKDDPQMQALRQQARTLPLSSKFTDKEVAQGQYFLGRTGYN 270

Query: 100 AQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 159
           A+Q + A+P +++LA A  +DL    DIA++   A G+  +   Q+A       DVL   
Sbjct: 271 AKQILGAMPGMLNLAAAGDMDLGDTADIASNIQTAMGIPAEKMDQVA-------DVLTAA 323

Query: 160 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRL 219
            T  N  I M+ ++++ +A      GQS+ETV+    ++ N G++  +AGT++++   RL
Sbjct: 324 FTRNNVDIRMLGDSLKYSAGVGREYGQSLETVTTATALLGNAGVQGSMAGTSMRSVLTRL 383

Query: 220 AAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKVFGAEGL 279
                   K + +LG++  D+ GNMRD  D++ D+N   + MG  QR A+ + + G   +
Sbjct: 384 G-----TSKAVAKLGVQTKDANGNMRDMLDILKDINKKTAGMGNVQRGAIFKDIAGQYAV 438

Query: 280 AGNLGVINAGKDALVE-YRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFE 338
                ++ A +    +  R +L ++EG A R+A    D++ G +  L + +E++++  F+
Sbjct: 439 TSFGTLMRAVEGGQFQTMRESLNNSEGEAARVAATQLDNLKGDMTMLHAALENISVELFD 498

Query: 339 LSGGSMRDVVDQAT-------AWLRANREL---IAQNVAGFLNTIVENIDSIVRGIK-LI 387
            +   +R++    +        +L+AN ++   I   VA F + ++  + S+   +  ++
Sbjct: 499 KNSPWLRELAADLSHLLHNIGEFLKANPQVSKGIVITVAAF-SALMATVGSLAITLAGIL 557

Query: 388 ATVFATLWAFNTIVSTITGAITLLNLV----------------------------IAANP 419
             + A  +   TI   + G I LL L+                            +AANP
Sbjct: 558 GPMIAVRFMLGTIGIRLPGLIGLLKLLFAPIRMLAGLLIGPLVTALRVVSIALWGLAANP 617

Query: 420 XXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGI 456
                              Y +W+ +K++ + +W+ I
Sbjct: 618 VVLAIAAVVAVLAGAAYLIYRNWDAVKAYLLGMWEEI 654
>ref|NP_046782.1| (NC_001895) gpT [Enterobacteria phage P2]
 gb|AAD03293.1| (AF063097) gpT [Enterobacteria phage P2]
          Length = 815

 Score =  120 bits (301), Expect = 4e-026
 Identities = 108/472 (22%), Positives = 186/472 (38%), Gaps = 46/472 (9%)

Query: 40  GIEFEHVINRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFD 99
           G EF    +     +G  V K +     L   A+Q+G                +A AG D
Sbjct: 212 GYEFAQKNSELQAVIG--VAKDSAEMAALRKQARQLGDNTAASADDAAGAQIIIAKAGGD 269

Query: 100 AQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 159
                AA P  +++A A    +     +      AF L           +A + DVL+ T
Sbjct: 270 VDAIQAATPVTLNMALANRRTMEENAALLMGMKSAFQLSND-------KVAHIGDVLSMT 322

Query: 160 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRL 219
                   D M + +  AAP A  AG SIE  +AM+G + +  I   +AGT  +    RL
Sbjct: 323 MNKTAADFDGMSDALTYAAPVAKNAGVSIEETAAMVGALHDAKITGSMAGTGSRAVLSRL 382

Query: 220 AAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVAL--SQMGERQRLAVIQKVFGAE 277
            AP G+A   L+ LG++ +DS GN R  F ++ ++  +   +++G  Q+   ++ +FG E
Sbjct: 383 QAPTGKAWDALKELGVKTSDSKGNTRPIFTILKEMQASFEKNRLGTAQQAEYMKTIFGEE 442

Query: 278 GLAGNLGVINAGKDALVEYRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 336
             +    ++ A     ++       A +G    +   + D++ G  +  +S  E+V    
Sbjct: 443 ASSAAAVLMTAASTGKLDKLTAAFKASDGKTAELVNIMQDNLGGDFKAFQSAYEAVGTDL 502

Query: 337 FELSGGSMRDVVDQAT-------AWLRANRELI---------AQNVAGFLNTIVENIDSI 380
           F+   G++R +   AT        W++ N+ L          A  + G +  I      +
Sbjct: 503 FDQQEGALRKLTQTATKYVLKLDGWIQKNKSLASTIGIIAGGALALTGIIGAIGLVAWPV 562

Query: 381 VRGIKLIATVFATLWA-FNTIVSTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXY 439
           + GI  I      + A F T+ S +  AI  ++  + A                      
Sbjct: 563 ITGINAIIAAAGAMGAVFTTVGSAVMTAIGAISWPVVA---------VVAAIVAGALLIR 613

Query: 440 MHWEPIKSFFVNLWDGIGIAFDTFAMNLANKWEALIAPIRSSIQWLLEQADA 491
            +WEP+ +FF  + +G+  AF             L  P++    WL E+  A
Sbjct: 614 KYWEPVSAFFGGVVEGLKAAFAPVG--------ELFTPLKPVFDWLGEKLQA 657
>ref|NP_052273.1| (NC_001317) G protein [Enterobacteria phage 186]
 gb|AAC34170.1| (U32222) G protein [Enterobacteria phage 186]
          Length = 812

 Score =  112 bits (281), Expect = 8e-024
 Identities = 105/449 (23%), Positives = 182/449 (40%), Gaps = 37/449 (8%)

Query: 58  VTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAMAGFDAQQSVAALPKIVSLATAA 117
           V K +   Q L   A+ VG                +A +G DA    A     +++A + 
Sbjct: 227 VDKQSPEMQALRKQARHVGDNTAASADDAASAQIIIAKSGGDAAAIQARRRVTLNMALSN 286

Query: 118 GIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFETIRKA 177
              +     + T    AF L           +A + DVL+ T        D + + +  A
Sbjct: 287 RRTMEENAALLTGMKSAFQLSND-------KIAHIGDVLSMTMNKTAADFDGLSDALTYA 339

Query: 178 APTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNFFLRLAAPAGEARKILRRLGIEV 237
           AP A  AG SIE  +AM+G + +  I   +AGT  +    RL  P  +A + ++ LG++ 
Sbjct: 340 APVAKNAGVSIEQTAAMVGALHDAKITGSMAGTGSRAILSRLQLPPEKAFEAIKELGVKT 399

Query: 238 ADSAGNMRDAFDVVGDLNVAL--SQMGERQRLAVIQKVFGAEGLAGNLGVINAGKDALVE 295
           +DS GN R  F ++ ++  +   + +G  QR   ++ +FG E  +    ++ A     ++
Sbjct: 400 SDSKGNTRPIFSILKEMQRSFEKNNLGTSQRGEYMKTIFGEEASSAAAVLMEAASSGKLD 459

Query: 296 YRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQAT-- 352
                  A +G  + + K + D++ G  +  +S  E+V    F+   GS+R +   AT  
Sbjct: 460 RLTAAFKASDGKTEELVKVMQDNLGGDFKEFQSAYEAVGTDLFDQQEGSLRKLTQTATQY 519

Query: 353 -----AWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATLWAFNTIVS----- 402
                 W++ N+ L      G +      +  I+ GI L+A  +  +   N I++     
Sbjct: 520 VLKLDGWIQKNKGL--ATTIGIIAGGALALIGIIGGIGLVA--WPVVMGINAIIAAAGVL 575

Query: 403 ----TITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYMHWEPIKSFFVNLWDGIGI 458
               T+TG+     +V A                        +WEPI +FF  + +GI  
Sbjct: 576 GTVFTVTGSA----IVTALGAITWPIVAVGAAIVAGALLIRKYWEPISAFFSGVIEGIMS 631

Query: 459 AFDTFA---MNLANKWEALIAPIRSSIQW 484
           AF   A     LA  ++ L   +R   QW
Sbjct: 632 AFAPVAEMFAPLAPIFDGLGEKLRGVWQW 660
>ref|NP_471714.1| (NC_003212) similar to hypothetical protein [Lactobacillus casei
           bacteriophage A2] [Listeria innocua]
 emb|CAC97610.1| (AL596172) similar to hypothetical protein [Lactobacillus casei
           bacteriophage A2] [Listeria innocua]
          Length = 1601

 Score =  111 bits (278), Expect = 2e-023
 Identities = 74/340 (21%), Positives = 154/340 (44%), Gaps = 25/340 (7%)

Query: 37  SRAGIEFEHVI---NRATVKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFL 93
           ++A ++F++ +   N    + G++  +    Y  + D A++              G++ +
Sbjct: 100 TKAYLDFDNEVTEVNSLLRESGESAKEFGDRYTQVFDYAQKASVKYGVASEQTMLGMKEM 159

Query: 94  AMAGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVN 153
              G+D  Q++A++P I + A A+G D      + T +L  FG+++KD+ +      +V 
Sbjct: 160 VKKGYDINQTMASMPAIFNAARASGDDFETVMSVTTSTLEQFGMISKDTNKQMEYTNKVA 219

Query: 154 DVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQ 213
           DVL   +         M   +    P + + G S+   +A +G+++N GI+ + AGT ++
Sbjct: 220 DVLTYVADKTAAGFSDMGTAMNYVGPISHSLGYSLTDTAAAVGLLSNRGIEGQKAGTGLR 279

Query: 214 NFFLRLAAPAGEARKILRRLGIEVADSAGNMRDAFDVVGDLNVALSQMGERQRLAVIQKV 273
                L  P+  A + +  +G+ + D+ GNM+    ++ D+N    +M + Q+ + +  +
Sbjct: 280 GMLTSLLKPSKSAAEAMSAVGLTIEDNNGNMKTLPTLLDDINDKTKKMTKTQKNSFLTMI 339

Query: 274 FGAEGLAGNLGVINAGKDALVEYRNTLLSAEG----AADRMAK----------------- 312
           FG E L+    ++ AG D+L +Y      A G     AD M K                 
Sbjct: 340 FGREPLSAVNTLLEAGGDSLRKYSKGADEANGYTKQVADNMRKAGKFGVDQFKASLEVLE 399

Query: 313 -RIGDDMLGSLRTLRSTVESVAIRFFELSGGSMRDVVDQA 351
             +G  ++ +L  +      +  +F +LSG   ++++  A
Sbjct: 400 QNVGQKLMPALTPIIEWANKMIDKFNDLSGEQQQNIIKWA 439
>gb|AAK85310.1| (AY043263) phage-related tail protein [Wolbachia endosymbiont of
           Drosophila melanogaster]
          Length = 680

 Score = 48.1 bits (113), Expect = 2e-004
 Identities = 83/476 (17%), Positives = 170/476 (35%), Gaps = 62/476 (13%)

Query: 38  RAGIEFEHVINRAT--VKMGDNVTKGTKAYQDLVDVAKQVGXXXXXXXXXXXXGIEFLAM 95
           +  I+FE  +   T  V       + TK  + L ++++ +             G      
Sbjct: 110 KVAIDFESAMADVTKVVDFKKGTDEATKFAKKLKEMSRTIPLSAAELAQIAASG----GQ 165

Query: 96  AGFDAQQSVAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDV 155
            G   +        +  ++TA  +   +A D        +G+           +  V +V
Sbjct: 166 LGIKKEDLFMFTETVAKMSTAFDMSAEQAGDSIAKLSNVYGIDVS-------KMEYVGNV 218

Query: 156 LAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMANNGIKAEIAGTAVQNF 215
           +   S +       M E +     TA   G  I+  S+++    + G +   A TA+   
Sbjct: 219 INHLSDNTAAKAKDMVEALAIVGGTAKQFGLDIKETSSLVNAFVSLGKQPAKAATAINAL 278

Query: 216 FLRLAAP---AGEARKILRRLGIEVADSAGNMRD-AFDVVGDLNVALSQMGERQRLAVIQ 271
             +L       G+ +  L ++GI   +    + +   + +     AL +M  ++R  ++ 
Sbjct: 279 LSKLQTAEEQGGDFKAALEQMGITAEEIVQRISENGEEALLYFFQALKKMDNQERSTILM 338

Query: 272 KVFGAE---GLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRST 328
           K+FG E    +A   G  N  +DA+    +T             R+ D     LR LR+ 
Sbjct: 339 KLFGQEYQDDIALLAGSFNKYEDAIRLLSDTEEYKSSLQKEFQNRV-DTTASKLRLLRNA 397

Query: 329 VESVAIRFFELSGGSMRDVVDQATAWLRANRELIAQNVAGFLNTIVENIDSIVRGIKLIA 388
           +  V +    +   +++ + +      R+   L A+        I+  + +++  +K++A
Sbjct: 398 IAEVGMNLGSVMLPTLKSIAEFLQEKTRSIA-LFAEKYPTLTKAIMGTVAALI-SLKVVA 455

Query: 389 ------------TVFAT----LWAFNTIVSTITGAI-TLLNLVIAANPXXXXXXXXXXXX 431
                       T+F+     L  F+ + +T+  A+ T L  V  ANP            
Sbjct: 456 VGLGYGFTLLGSTIFSLKANLLGVFSFLSATVFPAVVTGLRAVTLANPIGLLITGLVTGA 515

Query: 432 XXXXXXXYMHWEPIKSFFVNLWDGIGIAFDTFAMNLANKWEALIAPIRSSIQWLLE 487
                    +W+ +K FF                  ++ W+++I PI  +  W+ E
Sbjct: 516 ALVIT----NWQKVKDFF------------------SSFWKSIIKPIEEAFSWIGE 549
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.319 0.133 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 223,816,781 Number of Sequences: 884779 Number of extensions: 7353514 Number of successful extensions: 20760 Number of sequences better than 5.0e-02: 26 Number of HSP's better than 0.1 without gapping: 17 Number of HSP's successfully gapped in prelim test: 9 Number of HSP's that attempted gapping in prelim test: 20708 Number of HSP's gapped (non-prelim): 39 length of query: 581 length of database: 277,083,050 effective HSP length: 126 effective length of query: 455 effective length of database: 165,600,896 effective search space: 75348407680 effective search space used: 75348407680 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 94 (40.8 bits) Query= orf17 [4448-4866] (417 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_274140.1|  (NC_003112) phage virion protein, putative...    40  0.033
>ref|NP_274140.1| (NC_003112) phage virion protein, putative [Neisseria meningitidis
           MC58]
 pir||B81122 phage virion protein, probable NMB1109 [imported] - Neisseria
           meningitidis (group B strain MD58)
 gb|AAF41500.1| (AE002460) phage virion protein, putative [Neisseria meningitidis
           MC58]
          Length = 443

 Score = 40.4 bits (93), Expect = 0.033
 Identities = 32/129 (24%), Positives = 59/129 (44%), Gaps = 10/129 (7%)

Query: 3   WQTRLREAAYTPPSGNRLTFIYTSVSEEFDQKGGPFDFAGADGTFVQFLGVTGRRYPMTI 62
           W T L++A+Y    G  +  +  S  +   +   PF      G  ++ +G+TGR+  +  
Sbjct: 4   WHTLLQDASYKGV-GFDIEVVDESNGKALAEHARPF----VQGIDLEDMGMTGRQVQINA 58

Query: 63  IVSGDDYDLDAAAWMQALAEQGEAILEHPAYGRL--TVAPVGTVKRSENFVNGAGQATIE 120
           +  G  Y       + AL + G  +L HP +GR+   +A   + +   ++V+ AG   I+
Sbjct: 59  VFWGKGYAGRLKKLLDALEQPGGGVLVHPVWGRMHNMIAASWSYRHEADYVDYAG---ID 115

Query: 121 VTLFETTGA 129
           +T  E   A
Sbjct: 116 ITFREAAEA 124
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.317 0.134 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 175,870,520 Number of Sequences: 884779 Number of extensions: 6030334 Number of successful extensions: 14372 Number of sequences better than 5.0e-02: 1 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 14372 Number of HSP's gapped (non-prelim): 1 length of query: 417 length of database: 277,083,050 effective HSP length: 123 effective length of query: 294 effective length of database: 168,255,233 effective search space: 49467038502 effective search space used: 49467038502 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 92 (40.0 bits) Query= orf18 [4863-5281] (417 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_313011.1|  (NC_002695) putative tail protein [Escheri...    67  3e-010
ref|NP_439666.1|  (NC_000907) 43 kD tail protein [Haemophilu...    66  6e-010
ref|NP_050648.1|  (NC_000929) P [Enterobacteria phage Mu] >g...    65  2e-009
ref|NP_284542.1|  (NC_003116) hypothetical protein [Neisseri...    62  8e-009
ref|NP_543101.1|  (NC_003356) putative tail protein [Bacteri...    52  8e-006
>ref|NP_313011.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
 dbj|BAB38407.1| (AP002567) putative tail protein [Escherichia coli O157:H7]
          Length = 374

 Score = 67.0 bits (162), Expect = 3e-010
 Identities = 75/290 (25%), Positives = 114/290 (38%), Gaps = 20/290 (6%)

Query: 124 QPVSVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTP--PASAFPLQWDE 181
           Q + + + G+ + +G L   + R +A    +  S     G L DC+   P S    QW  
Sbjct: 52  QSLELRIDGQPVITGWLDQVRQRISATRFQITLSGRDKTGDLVDCSAIHPGS----QWRN 107

Query: 182 ATLQTIAADLCRPFGIQV--LAPDGT-GQTFERIAVEPAEKVMAVIARLAAQRNLVVRSD 238
            TL+ IA+DLC PFG+ V     D T  + F    +E +E V   + R A  R ++V S+
Sbjct: 108 RTLEHIASDLCAPFGVTVRWQVNDATAARPFSTFTLENSETVADALTRAARHRGVLVTSN 167

Query: 239 EQGRLVLLRPDT-RGEPVAEFIEGQQPPISVTPTFGNQDYY-------SHVTGITPTIVG 290
             G LV  +  + RG+ +           +V     + +Y            G   T   
Sbjct: 168 AAGELVFTQAGSQRGDTLTLGENLLDLDHNVDHRLRHSEYRVRGHGRGGGHAGDALTAGT 227

Query: 291 LEGPQATVRNPHLEGVLRPYVYNADDMAEAD-LVQAVQSKAGRMFAQSATYDVPVPTWRN 349
           L  P  TV +  +    RP +  AD   +AD   Q    +  R  A+S      V  W  
Sbjct: 228 LAAPVGTVTDSAIHR-YRPKIVLADHAVDADGARQRAVREMRRAVARSVRLTATVRHWFR 286

Query: 350 ANGDLWRVGDFVILEAPGAQVYRRTLMQIKTVRFSATTTERSAVLELIIP 399
            NG LW +     + AP   V  R L+ +  V FS           ++ P
Sbjct: 287 ENGQLWDINLLTAVTAPRTGVEERDLL-VCQVEFSLDANHGETTRLILAP 335
>ref|NP_439666.1| (NC_000907) 43 kD tail protein [Haemophilus influenzae Rd]
          Length = 420

 Score = 66.2 bits (160), Expect = 6e-010
 Identities = 48/188 (25%), Positives = 92/188 (48%), Gaps = 10/188 (5%)

Query: 68  TQDPDEVTLRINGLNFRFWRAVTITQHLDAVS-TVSLHAPFDPNDEQSRDAFRPYSYQPV 126
           T+  +++ L +NG     W+++ + + L+++S    L     P D+ S  A    +  P+
Sbjct: 7   TETNNKIELYLNGKILSGWKSLNLQRSLESMSGRFDLGIAVRPEDDISVLA----AGSPL 62

Query: 127 SVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQT 186
            + +GG+ + +G L   + R + N++T+  S       L DC    +++  Q+   T + 
Sbjct: 63  VLKMGGQTVITGYLDEIKQRVSGNDKTISVSGRDKTCDLVDCAIIHNSY--QFKNQTAKQ 120

Query: 187 IAADLCRPFGIQVLAPDGTGQTFERI---AVEPAEKVMAVIARLAAQRNLVVRSDEQGRL 243
           IA  +C+PFGI V+      +  ERI    VEP E     ++++A  + ++V SD  G L
Sbjct: 121 IAEAICKPFGISVVWQVQAPEANERIPVWQVEPGETAFDNLSKIARHKGVLVTSDVDGNL 180

Query: 244 VLLRPDTR 251
           +   P  +
Sbjct: 181 LFTEPSNK 188
>ref|NP_050648.1| (NC_000929) P [Enterobacteria phage Mu]
 sp|P08558|VPP_BPMU 43 kDa tail protein (gpP)
 pir||ZPBPMU gene P protein - phage Mu
 emb|CAA29956.1| (X06796) 43 kDa tail protein [Enterobacteria phage Mu]
 gb|AAF01122.1|AF083977_41 (AF083977) P [Enterobacteria phage Mu]
          Length = 379

 Score = 64.7 bits (156), Expect = 2e-009
 Identities = 72/310 (23%), Positives = 129/310 (41%), Gaps = 30/310 (9%)

Query: 74  VTLRINGLNFRFWRAVTITQHLDAVSTV-SLHAPFDPNDEQSRDAFRPYSYQPVSVDVGG 132
           VTLR +G  F  W +V++T+ +++V+    L     P  + S  A      +  ++++GG
Sbjct: 5   VTLRADGRLFTGWTSVSVTRSIESVAGYFELGVNVPPGTDLSGLA----PGKKFTLEIGG 60

Query: 133 ERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQTIAADLC 192
           + + +G + + + +  A+   +  +       L DC    S    QW   TL+ IA DLC
Sbjct: 61  QIVCTGYIDSRRRQMTADSMKITVAGRDKTADLIDCAAVYSGG--QWKNRTLEQIARDLC 118

Query: 193 RPFGIQV---LAPDGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLVLLRPD 249
            P+G+ V   L+   +   F    ++ +E V   + R +  R +++ S+  G LV  R  
Sbjct: 119 APYGVTVRWELSDKESSAAFPGFTLDHSETVYEALVRASRARGVLMTSNAAGELVFSR-- 176

Query: 250 TRGEPVAEFIEGQQPPISVTPTFGNQDYYSHVTGIT----PTIVGLEGPQATVRN-PHLE 304
                  E + G+     +T  F  +D+    +  T        G EG     ++    +
Sbjct: 177 AASTATDELVLGEN---LLTLDF-EEDFRDRFSEYTVKGYARANGAEGDDIDAKSIVSRK 232

Query: 305 GV--------LRPYVYNADDMAEADLVQA-VQSKAGRMFAQSATYDVPVPTWRNANGDLW 355
           G          RP +  AD    A   QA    +  R  A+S T++  +  W   +G LW
Sbjct: 233 GTATDSDVTRYRPMIIIADSKITAKDAQARALREQRRRLAKSITFEAEIDGWTRKDGQLW 292

Query: 356 RVGDFVILEA 365
                V ++A
Sbjct: 293 MPNLLVTIDA 302
>ref|NP_284542.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
 pir||C81809 hypothetical protein NMA1830 [imported] - Neisseria meningitidis
           (group A strain Z2491)
 emb|CAB72001.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
 emb|CAB85055.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
          Length = 379

 Score = 62.4 bits (150), Expect = 8e-009
 Identities = 82/313 (26%), Positives = 122/313 (38%), Gaps = 34/313 (10%)

Query: 71  PDE-VTLRINGLNFRFWRAVTITQHLDAVS---TVSLHAPFDPNDEQSRDAFRPYSYQPV 126
           PD  VTL ING     W    I   L   +   +V L  P D   +  R   +      V
Sbjct: 4   PDNTVTLLINGKTHGQWTNYDIVSDLLTPADDFSVMLGRPVDAKPDAVRAGDK------V 57

Query: 127 SVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQT 186
            V VGG+ + SG +   Q  T    +T+        GVL DC+ P         +  L  
Sbjct: 58  EVRVGGDTVLSGRIDRVQTVTEKGGKTLTIQGRDDAGVLLDCSAPL----FNAQDMDLNQ 113

Query: 187 IAADLCRPFGIQVLAPDG--TGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLV 244
           I   + +P G+  +  D   T +T  ++ +EP  +    +   A    L    +  G LV
Sbjct: 114 IIEKIVKPLGLAKIRIDAAKTDKT-HKVQIEPGSRAWDALLEYAEANGLWPWLEPDGTLV 172

Query: 245 LLRPDTRGEPVAEFI---EGQQPPISVTPTFGNQDY---YSHVTGITPTIVGLEGPQATV 298
           +  PD    PVAE +    GQ   I       N+D    YS VT +  +  G    +AT 
Sbjct: 173 VGGPDYTAAPVAELVLRTNGQNNNIKRLEV--NRDMAARYSEVTVLAQSHSGKNNIKATA 230

Query: 299 RNPHLEGVLRPYVYNADDMAEADLVQAVQSKAGRMFAQS----ATYDVPVPTWRNANGDL 354
           ++  ++ + RP +    D+   D     Q KA +  A S     T    V   R  +G L
Sbjct: 231 KDESVK-LHRPLIVTEPDI---DSQAQAQRKAKKRLADSRLEGLTITATVQGHRTDDGTL 286

Query: 355 WRVGDFV-ILEAP 366
           W+ G  + +L  P
Sbjct: 287 WQPGQRINVLSEP 299
>ref|NP_543101.1| (NC_003356) putative tail protein [Bacteriophage P27]
 emb|CAC83567.1| (AJ298298) putative tail protein [Bacteriophage P27]
          Length = 351

 Score = 52.4 bits (124), Expect = 8e-006
 Identities = 75/315 (23%), Positives = 117/315 (36%), Gaps = 42/315 (13%)

Query: 74  VTLRINGLNFRFWRAVTITQHLDAVSTVSLHAPFDPNDEQSRDAFRPYSYQP-------V 126
           V LR++G  +  W +V I+  ++ ++    +         SRD      YQP       V
Sbjct: 5   VLLRVSGREWGGWTSVRISAGINRIAR-DFNVAITTRWPGSRD------YQPRIKNGELV 57

Query: 127 SVDVGGERLFSGTLVNPQPRTAANERTVRASCYSTPGVLDDCTPPASAFPLQWDEATLQT 186
            V +G E + +G +     R  A+  ++     S    L DC    SA PLQ     L  
Sbjct: 58  EVLIGDEPVLTGYVEALPLRYDASSVSMGIVGRSKTADLVDC----SALPLQQSGKNLLR 113

Query: 187 IAADLCRPFGIQVLAPDGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLVLL 246
           I ++L  PFGI V+              E  E V   + RL  Q   +   DE GRLVL 
Sbjct: 114 IVSELAAPFGITVIDAGVPQTAVIDAQPEHGETVADCLNRLLGQVQTLAYDDECGRLVLG 173

Query: 247 RPDTRGEPVAEFIEGQ-----QPPISVTPTF----------GNQDYYSHVTGITPTIVGL 291
           +P T G+     + G+         S+   F          GN D +   T        +
Sbjct: 174 KPGT-GKAATALVLGENILSCDTERSIRERFSEYQVSGQRPGNDDDFGEAT--------I 224

Query: 292 EGPQATVRNPHLEGVLRPYVYNADDMAEADLVQAVQSKAGRMFAQSATYDVPVPTWRNAN 351
              + T+++  +       +  +     A      + +A +  A +      V  WR  +
Sbjct: 225 AAIRQTIQDSGVTRYRPLLIQQSGTATTATCKARCEFEARQRAALTRETTYTVQGWRQGS 284

Query: 352 GDLWRVGDFVILEAP 366
           G LWR G  VI+  P
Sbjct: 285 GALWRPGLSVIVFDP 299
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.317 0.134 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 234,132,124 Number of Sequences: 884779 Number of extensions: 9822749 Number of successful extensions: 20636 Number of sequences better than 5.0e-02: 5 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 20629 Number of HSP's gapped (non-prelim): 7 length of query: 417 length of database: 277,083,050 effective HSP length: 123 effective length of query: 294 effective length of database: 168,255,233 effective search space: 49467038502 effective search space used: 49467038502 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 92 (40.0 bits) Query= orf19 [5293-5489] (195 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_267614.1|  (NC_002662) UNKNOWN PROTEIN [Lactococcus l...    41  0.006
emb|CAC02002.2|  (AL390114) possible l6202.3 [Leishmania major]    40  0.013
gb|AAD39531.2|AF149108_1  (AF149108) outer membrane protein ...    39  0.021
ref|NP_523104.1|  (NC_003296) PUTATIVE HEMAGGLUTININ-RELATED...    39  0.021
ref|NP_519008.1|  (NC_003295) PROBABLE HEMAGGLUTININ-RELATED...    39  0.037
ref|NP_522101.1|  (NC_003296) PROBABLE HEMAGGLUTININ-RELATED...    39  0.037
gb|AAF50719.1|  (AE003564) CG13292 gene product [Drosophila ...    38  0.048
>ref|NP_267614.1| (NC_002662) UNKNOWN PROTEIN [Lactococcus lactis subsp. lactis]
 gb|AAK05556.1|AE006376_2 (AE006376) UNKNOWN PROTEIN [Lactococcus lactis subsp. lactis]
          Length = 1441

 Score = 41.2 bits (95), Expect = 0.006
 Identities = 63/224 (28%), Positives = 81/224 (36%), Gaps = 41/224 (18%)

Query: 4    NPGGGATVDAYHAQPAGDDCHPLPTDTAVL--------------VETPRSNNYAAVG-VV 48
            NPGG  TV     +   DD   L  ++ V+                TP  N     G VV
Sbjct: 1064 NPGGSVTVPGPDGKTGTDDDTTLNPNSPVVPGDNGSVTLPGGGTASTPNGNITLPGGTVV 1123

Query: 49   DPNNRQTAGPGERRVYSRNADGE-----QVAEVFLHDDGQVR-----VSNDEGFVDLAAD 98
            DP+       G+      N DG      Q  +    DDG+++     +  D G V L   
Sbjct: 1124 DPDGTIHLPGGD----IVNPDGTITLPGQDGKTGTGDDGKIKPNGPIIPGDNGSVTLPGG 1179

Query: 99   GTIQASNGQASATLQGTSITLDDGA----GGSISINGGVITLTG----TAIRLIGPVDAN 150
            GT+    G  +  + G S+   DG     GG I    G ITL G    T     G V  N
Sbjct: 1180 GTVTTPGG--TINVPGGSVVDPDGTVHLPGGDIVNPDGTITLPGQDGKTGTGDDGKVKPN 1237

Query: 151  GATISEA-GQITDADGLSVHGHNHTQANDSDNDVQQPTSTAQIP 193
            G +IS   G IT   G +V     T  N     V  P  T  +P
Sbjct: 1238 GPSISNPDGSITLPGGGTVTTPGGT-INVPGGSVVDPDGTVHLP 1280
>emb|CAC02002.2| (AL390114) possible l6202.3 [Leishmania major]
          Length = 1376

 Score = 40.0 bits (92), Expect = 0.013
 Identities = 48/194 (24%), Positives = 72/194 (36%), Gaps = 22/194 (11%)

Query: 4    NPGGGATVDAYHAQPAGDDCHPLPTDTAVLVETPRSNNYAAVGVVDPNNRQTAGPGERRV 63
            +P G     + HA  AG+   P P   A     P   N +   + +  +RQ A  G R  
Sbjct: 832  HPSGSGVRTSSHALAAGEHSQP-PAQRAAAAAAPPLANTSGASLQELQDRQRASSGSRSS 890

Query: 64   YSRNADGEQVAEVFLHDDGQVRVSN-------------DEGFVDLAADGTIQA--SNGQA 108
             S+    E+ A + L     VR SN             D        D T++A  +  +A
Sbjct: 891  SSKEGSEERPA-LHLSGVSSVRPSNGQEERLHSLVEAGDASTASADGDNTVEAFTARPEA 949

Query: 109  SATLQGTSITLDDGAGGSISINGGVITLTGTAIRLIGPVDANGATISEAGQITDADGLSV 168
             A L  TS    DG  GS S+     T T T  +L  P   +    S  G ++    +  
Sbjct: 950  VARLPRTS----DGTNGSASVPLFATTRTETVEQLSAPAH-DSTQASAVGNVSHCTAIRE 1004

Query: 169  HGHNHTQANDSDND 182
                 + ++ SD+D
Sbjct: 1005 FTTITSSSSLSDSD 1018
>gb|AAD39531.2|AF149108_1 (AF149108) outer membrane protein A [Rickettsia australis]
          Length = 2106

 Score = 39.3 bits (90), Expect = 0.021
 Identities = 30/82 (36%), Positives = 40/82 (48%), Gaps = 7/82 (8%)

Query: 91  GFVDLAADGTIQASNGQASA--TLQGTSITLDDGAG---GSISINGGVI--TLTGTAIRL 143
           G V+ AADG + A+NG A A  T    + TL  GAG   G+I +NGG I   +       
Sbjct: 265 GAVNFAADGALTANNGIAGAVTTANNNTGTLTVGAGDVTGAIGVNGGNILKQVLFNGASN 324

Query: 144 IGPVDANGATISEAGQITDADG 165
           +  +DA   TI+ A     A G
Sbjct: 325 VATIDATNVTINNAAANVTAAG 346
>ref|NP_523104.1| (NC_003296) PUTATIVE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD18696.1| (AL646085) PUTATIVE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 2691

 Score = 39.3 bits (90), Expect = 0.021
 Identities = 35/114 (30%), Positives = 49/114 (42%), Gaps = 18/114 (15%)

Query: 95  LAADGTIQA-----SNGQASATLQGT-----SITLDDGAG-----GSISINGGVITLTGT 139
           LA  G + A     +NGQA  TL GT      +T    AG     GS+  + G + LTGT
Sbjct: 611 LATSGAVLAGGNVTANGQAGTTLSGTVYAAKGVTAQSSAGATSATGSVVAHSGKVALTGT 670

Query: 140 AIRLIGPVDANGATISEAGQ-ITDADGLSVHGHNHTQANDSDNDVQQPTSTAQI 192
           ++ + G   + G T   A Q     DG +      T    +  D+  P STA +
Sbjct: 671 SVAVSGSTQSGGDTALRATQGDATIDGQAAALGKLTVT--ASQDITGPGSTASV 722
>ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD14589.1| (AL646061) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 3501

 Score = 38.5 bits (88), Expect = 0.037
 Identities = 28/77 (36%), Positives = 37/77 (47%), Gaps = 3/77 (3%)

Query: 91  GFVDLAADGTIQASNGQASATLQGTS--ITLDDGAGGSISINGGVITLTGTAIRLIGPVD 148
           G +D+ A G +    G+ SA  Q T    +LD+ AGG +   G  IT  G      G V 
Sbjct: 742 GALDVQA-GAVSNQGGKLSAQSQATLNVASLDNSAGGYVGAQGVAITDQGALNNAGGTVA 800

Query: 149 ANGATISEAGQITDADG 165
           A+GA    AG I +A G
Sbjct: 801 ASGALTVSAGAIANAGG 817
>ref|NP_522101.1| (NC_003296) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD17691.1| (AL646079) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 3552

 Score = 38.5 bits (88), Expect = 0.037
 Identities = 28/77 (36%), Positives = 37/77 (47%), Gaps = 3/77 (3%)

Query: 91  GFVDLAADGTIQASNGQASATLQGTS--ITLDDGAGGSISINGGVITLTGTAIRLIGPVD 148
           G +D+ A G +    G+ SA  Q T    +LD+ AGG +   G  IT  G      G V 
Sbjct: 741 GALDVQA-GAVSNQGGKLSAQSQATLNVASLDNSAGGYVGAQGVAITDQGALNNAGGTVA 799

Query: 149 ANGATISEAGQITDADG 165
           A+GA    AG I +A G
Sbjct: 800 ASGALTVSAGAIANAGG 816
>gb|AAF50719.1| (AE003564) CG13292 gene product [Drosophila melanogaster]
          Length = 174

 Score = 38.1 bits (87), Expect = 0.048
 Identities = 21/84 (25%), Positives = 39/84 (46%)

Query: 107 QASATLQGTSITLDDGAGGSISINGGVITLTGTAIRLIGPVDANGATISEAGQITDADGL 166
           +A+  +   ++  D  +  S  I G   T  GTA   +   + NG+T S +     AD  
Sbjct: 5   EAATMITNNNLDYDQASASSTIIAGATSTTAGTAATSVAMSNRNGSTSSSSNSSPTADNS 64

Query: 167 SVHGHNHTQANDSDNDVQQPTSTA 190
           + + ++++ AN + N  Q P S +
Sbjct: 65  NSNSNSNSNANINRNLNQNPNSNS 88
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.310 0.130 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 116,963,488 Number of Sequences: 884779 Number of extensions: 4932970 Number of successful extensions: 11861 Number of sequences better than 5.0e-02: 7 Number of HSP's better than 0.1 without gapping: 1 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 11827 Number of HSP's gapped (non-prelim): 35 length of query: 195 length of database: 277,083,050 effective HSP length: 114 effective length of query: 81 effective length of database: 176,218,244 effective search space: 14273677764 effective search space used: 14273677764 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.7 bits) S2: 87 (38.1 bits) Query= orf20 [5492-5628] (135 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_518999.1|  (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni...    67  4e-011
gb|AAB05794.1|  (U62615) ORF 3 [Synechococcus sp. PCC 7942]        37  0.032
>ref|NP_518999.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
 emb|CAD14580.1| (AL646061) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 132

 Score = 66.6 bits (161), Expect = 4e-011
 Identities = 38/125 (30%), Positives = 59/125 (46%), Gaps = 3/125 (2%)

Query: 3   LAFSQNIRNQRLSVVSTAADAGDGAALIRIYDGTRPPTGGQVTN--LLAQLEMSDPAFNA 60
           +  S  +RN RL V+  A DAG    L+R+Y   RP  G  +    LLA++ +  P   +
Sbjct: 2   ITLSVPVRNSRLVVIGQALDAGTAGGLLRLYSAPRPDVGQALAEQVLLAEVRLPQPCMGS 61

Query: 61  PANGSMTARAITPEGSTPIGGTATWFRIVDSEGNFVLDGDVGLNGSNSEIELSDVNILSN 120
              G +    I  +      G   W R+ DS+G +V D D GL  S +E+EL  + + + 
Sbjct: 62  LEGGRLVFAPIG-QALCRRSGIVAWARLCDSDGRWVADLDAGLPDSGAEVELPKLQVFAG 120

Query: 121 QEVRI 125
             V +
Sbjct: 121 GAVNV 125
>gb|AAB05794.1| (U62615) ORF 3 [Synechococcus sp. PCC 7942]
          Length = 136

 Score = 37.0 bits (84), Expect = 0.032
 Identities = 36/135 (26%), Positives = 61/135 (44%), Gaps = 14/135 (10%)

Query: 1   MTLAFSQNIRNQRLSVVSTAADAGDGAALIRIYDGTRPPTGGQVTNLLAQLEMSDPAFNA 60
           MT+A    +RN    ++   A  G   ALI +Y G  P +  Q+T  + +  ++   F  
Sbjct: 1   MTVALGLELRNSLCELIRDRASNG---ALI-LYAGYDPHSD-QLTEEIVRFPLAAQPFRP 55

Query: 61  PANGSMTARAITPEGSTPIGGTATWFRIVDS------EGNFVLDGDVG--LNGSNSEIEL 112
             +G+M A  I P   +  GG   ++ I  S          +L G +G   N  ++++ L
Sbjct: 56  VESGTMAAAPIAPARCSR-GGALRYWEIRSSPLHDTHPDQMLLRGSIGDITNPGDADLVL 114

Query: 113 SDVNILSNQEVRIAT 127
             V +L NQ + IA+
Sbjct: 115 DTVAVLQNQAIAIAS 129
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.312 0.131 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 74,832,581 Number of Sequences: 884779 Number of extensions: 3151938 Number of successful extensions: 5395 Number of sequences better than 5.0e-02: 2 Number of HSP's better than 0.1 without gapping: 1 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 5393 Number of HSP's gapped (non-prelim): 2 length of query: 135 length of database: 277,083,050 effective HSP length: 111 effective length of query: 24 effective length of database: 178,872,581 effective search space: 4292941944 effective search space used: 4292941944 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 83 (36.6 bits) Query= orf21 [5650-5851] (200 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_404848.1|  (NC_003143) putative bacteriophage protein...    40  0.013
>ref|NP_404848.1| (NC_003143) putative bacteriophage protein GP46 [Yersinia pestis]
 emb|CAC90083.1| (AJ414147) putative bacteriophage protein GP46 [Yersinia pestis]
          Length = 151

 Score = 40.0 bits (92), Expect = 0.013
 Identities = 28/85 (32%), Positives = 39/85 (44%), Gaps = 7/85 (8%)

Query: 87  NGDLYRTDGLETAVYLSLYG---GNPEDNGQDANRLGWWGNANQEEPARQMVSRFQHLVE 143
           NG L   D L+TA+ LSL+       +D     +R GWWG++     A   +     L+ 
Sbjct: 22  NGGLLEGDDLQTAILLSLFTDRLARADDAIDSPDRRGWWGDSG----AASAIGSRLWLLR 77

Query: 144 GIPLTSGNLQRLEDAATADLEWLSE 168
              LT+    + ED AT  L WL E
Sbjct: 78  REKLTTQVAIKAEDYATEALAWLIE 102
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.315 0.134 5.49e-269 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 111,035,066 Number of Sequences: 884779 Number of extensions: 4889814 Number of successful extensions: 8247 Number of sequences better than 5.0e-02: 1 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 8247 Number of HSP's gapped (non-prelim): 1 length of query: 200 length of database: 277,083,050 effective HSP length: 114 effective length of query: 86 effective length of database: 176,218,244 effective search space: 15154768984 effective search space used: 15154768984 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.0 bits) S2: 88 (38.5 bits) Query= orf22 [5854-6047] (192 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.321 0.135 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 96,602,847 Number of Sequences: 884779 Number of extensions: 3693502 Number of successful extensions: 7308 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 7308 Number of HSP's gapped (non-prelim): 0 length of query: 192 length of database: 277,083,050 effective HSP length: 113 effective length of query: 79 effective length of database: 177,103,023 effective search space: 13991138817 effective search space used: 13991138817 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 87 (38.1 bits) Query= orf23 [6108-6251] (142 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.316 0.134 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 71,856,967 Number of Sequences: 884779 Number of extensions: 2632971 Number of successful extensions: 13430 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 13430 Number of HSP's gapped (non-prelim): 0 length of query: 142 length of database: 277,083,050 effective HSP length: 118 effective length of query: 24 effective length of database: 172,679,128 effective search space: 4144299072 effective search space used: 4144299072 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 83 (36.6 bits) Query= orf24 [6249-6490] (240 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

dbj|BAB13413.1|  (AB046807) KIAA1587 protein [Homo sapiens]        40  0.014
ref|XP_034014.1|  (XM_034014) KIAA1587 protein [Homo sapiens]      40  0.014
>dbj|BAB13413.1| (AB046807) KIAA1587 protein [Homo sapiens]
          Length = 991

 Score = 40.4 bits (93), Expect = 0.014
 Identities = 22/65 (33%), Positives = 34/65 (51%), Gaps = 2/65 (3%)

Query: 145 EPLAQCGEPTAQAGNSLQTPPTPQGRLLVNIIQTDTGQVEYAIPTDPAEFP--HIIYVGG 202
           E L+   +PTA  G+S   PPTP G L  ++  T T ++  ++P  P E P   ++ + G
Sbjct: 336 EGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPG 395

Query: 203 EEFGT 207
           E   T
Sbjct: 396 EGLST 400
>ref|XP_034014.1| (XM_034014) KIAA1587 protein [Homo sapiens]
          Length = 957

 Score = 40.4 bits (93), Expect = 0.014
 Identities = 22/65 (33%), Positives = 34/65 (51%), Gaps = 2/65 (3%)

Query: 145 EPLAQCGEPTAQAGNSLQTPPTPQGRLLVNIIQTDTGQVEYAIPTDPAEFP--HIIYVGG 202
           E L+   +PTA  G+S   PPTP G L  ++  T T ++  ++P  P E P   ++ + G
Sbjct: 302 EGLSTSVQPTAGEGSSTSVPPTPGGGLSTSVPPTATEELSTSVPPTPGEGPSTSVLPIPG 361

Query: 203 EEFGT 207
           E   T
Sbjct: 362 EGLST 366
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.320 0.140 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 150,099,548 Number of Sequences: 884779 Number of extensions: 7068811 Number of successful extensions: 14409 Number of sequences better than 5.0e-02: 2 Number of HSP's better than 0.1 without gapping: 2 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 14395 Number of HSP's gapped (non-prelim): 14 length of query: 240 length of database: 277,083,050 effective HSP length: 117 effective length of query: 123 effective length of database: 173,563,907 effective search space: 21348360561 effective search space used: 21348360561 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 89 (38.9 bits) Query= orf25 [6494-6939] (444 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_519813.1|  (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni...    67  3e-010
ref|NP_520042.1|  (NC_003295) PUTATIVE TAIL FIBER-RELATED PR...    65  2e-009
ref|NP_518238.1|  (NC_003295) CONSERVED HYPOTHETICAL PROTEIN...    50  6e-005
ref|NP_298995.1|  (NC_002488) phage-related tail fiber prote...    47  4e-004
ref|NP_488746.1|  (NC_003272) unknown protein [Nostoc sp. PC...    43  0.006
ref|NP_105312.1|  (NC_002678) microcystin dependent protein ...    41  0.021
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
 emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 225

 Score = 67.4 bits (163), Expect = 3e-010
 Identities = 33/46 (71%), Positives = 35/46 (75%)

Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGRPVG 149
           WL  NG AVSRTTYA LF ++GTTFGAGDGSTTFNLP   A  P G
Sbjct: 80  WLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLPELRAEFPRG 125
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 554

 Score = 64.7 bits (156), Expect = 2e-009
 Identities = 29/37 (78%), Positives = 33/37 (88%)

Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
           WL ANG AVSRTTYA+L+A +GTTFGAGDG+ TFNLP
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLP 451
>ref|NP_518238.1| (NC_003295) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
 emb|CAD13645.1| (AL646057) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 177

 Score = 49.7 bits (117), Expect = 6e-005
 Identities = 32/64 (50%), Positives = 37/64 (57%), Gaps = 7/64 (10%)

Query: 101 APQ-WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGRPVGQQFVNLGTAT 159
           APQ WLL  GQ +  ++   L+ALLGTT+G GDG TTF LP      PVGQ     G   
Sbjct: 16  APQGWLLCQGQLIDISSNEILYALLGTTYG-GDGRTTFALPNLQGALPVGQ-----GQGP 69

Query: 160 GLPA 163
           GL A
Sbjct: 70  GLTA 73
>ref|NP_298995.1| (NC_002488) phage-related tail fiber protein [Xylella fastidiosa
           9a5c]
 pir||C82649 phage-related tail fiber protein XF1706 [imported] - Xylella
           fastidiosa (strain 9a5c)
 gb|AAF84515.1|AE003994_14 (AE003994) phage-related tail fiber protein [Xylella fastidiosa
           9a5c]
          Length = 387

 Score = 47.0 bits (110), Expect = 4e-004
 Identities = 21/39 (53%), Positives = 28/39 (70%)

Query: 102 PQWLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
           P  LL +G+AVSR  Y  LF  + T++GAGDG +TFN+P
Sbjct: 151 PGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIP 189
>ref|NP_488746.1| (NC_003272) unknown protein [Nostoc sp. PCC 7120]
 dbj|BAB76405.1| (AP003597) ORF_ID:alr4706~unknown protein [Nostoc sp. PCC 7120]
          Length = 101

 Score = 43.1 bits (100), Expect = 0.006
 Identities = 20/37 (54%), Positives = 26/37 (70%), Gaps = 1/37 (2%)

Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLP 140
           W L NGQ +S   Y +++++LGTTFG GDG  TF LP
Sbjct: 38  WALCNGQLLSIQEYTAVYSILGTTFG-GDGVQTFALP 73
>ref|NP_105312.1| (NC_002678) microcystin dependent protein MdpB [Mesorhizobium loti]
 dbj|BAB51098.1| (AP003004) microcystin dependent protein; MdpB [Mesorhizobium loti]
          Length = 169

 Score = 41.2 bits (95), Expect = 0.021
 Identities = 23/48 (47%), Positives = 31/48 (63%), Gaps = 3/48 (6%)

Query: 104 WLLANGQAVSRTTYASLFALLGTTFGAGDGSTTFNLPADGAGR-PVGQ 150
           W+   GQ +  + Y +LF L+GTT+G GDG +TF LP D  GR P+ Q
Sbjct: 21  WMFCEGQLLPISEYETLFQLIGTTYG-GDGQSTFALP-DLRGRVPIHQ 66
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.315 0.136 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 269,562,679 Number of Sequences: 884779 Number of extensions: 13034613 Number of successful extensions: 19678 Number of sequences better than 5.0e-02: 6 Number of HSP's better than 0.1 without gapping: 3 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 19674 Number of HSP's gapped (non-prelim): 6 length of query: 444 length of database: 277,083,050 effective HSP length: 123 effective length of query: 321 effective length of database: 168,255,233 effective search space: 54009929793 effective search space used: 54009929793 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 92 (40.0 bits) Query= orf26 [6939-7384] (444 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_519813.1|  (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni...    67  5e-010
ref|NP_251770.1|  (NC_002516) hypothetical protein [Pseudomo...    60  3e-008
ref|NP_520042.1|  (NC_003295) PUTATIVE TAIL FIBER-RELATED PR...    60  4e-008
ref|NP_298995.1|  (NC_002488) phage-related tail fiber prote...    55  2e-006
ref|NP_347555.1|  (NC_003030) Probably secreted sialidase; s...    53  5e-006
emb|CAC13065.1|  (AL445503) conserved hypothetical protein [...    50  5e-005
gb|AAK06388.1|  (AF078042) glycosyl hydrolase 5 [Caldicellul...    49  1e-004
pir||T35237  probable secreted cellulase - Streptomyces coel...    47  5e-004
ref|NP_105312.1|  (NC_002678) microcystin dependent protein ...    47  5e-004
ref|NP_487884.1|  (NC_003272) hypothetical protein [Nostoc s...    45  0.002
ref|NP_197703.1|  (NM_122218) photosystem II stability/assem...    42  0.010
pir||UMMS  period clock protein - mouse (fragment) >gi|13341...    40  0.048
ref|NP_032862.1|  (NM_008836) per-hexamer repeat gene 5 [Mus...    40  0.048
ref|NP_189520.1|  (NM_113799) histone-H4-like protein [Arabi...    40  0.048
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
 emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
          Length = 225

 Score = 66.6 bits (161), Expect = 5e-010
 Identities = 42/96 (43%), Positives = 53/96 (54%), Gaps = 9/96 (9%)

Query: 104 YLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPT-------PTPGETFV 156
           +L+ +G+AVSRTTYA LF VIGTT+GAGDGSTTFNLP+   + P           G  F 
Sbjct: 80  WLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLPELRAEFPRGWDDGRGVDAGRVFG 139

Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGTTI 192
             Q+         TAI    DG+ L  +G  + T I
Sbjct: 140 SVQAQALSSHQHKTAIGF--DGSNLFGWGDGNATPI 173
>ref|NP_251770.1| (NC_002516) hypothetical protein [Pseudomonas aeruginosa]
 pir||D83260 hypothetical protein PA3080 [imported] - Pseudomonas aeruginosa
           (strain PAO1)
 gb|AAG06468.1|AE004732_2 (AE004732) hypothetical protein [Pseudomonas aeruginosa]
          Length = 365

 Score = 60.5 bits (145), Expect = 3e-008
 Identities = 60/236 (25%), Positives = 97/236 (40%), Gaps = 17/236 (7%)

Query: 195 SGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTG- 253
           S D G TW       G  +  + I  A++   + +         L  ++DGG++W D   
Sbjct: 138 SDDAGSTWARQFEDLGREAPLLDIWFADEQHGLAV----GAYGALLETRDGGQHWEDVSE 193

Query: 254 -LGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDST 312
            L N    +   IAA  +     V   G++FRS D GA W ++ G    + F +I     
Sbjct: 194 RLDNEDQFHLNAIAAVKDNGLLVVGEAGSLFRSKDWGATWEKLEGPYEGSLFGAIGTADA 253

Query: 313 GAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPD 372
           G V V+        ++RSA+ G +W     + A   L   L+  A  A  +G + ++   
Sbjct: 254 GGVLVYG---LRGHLFRSADFGDSWEEIPLKAASGDLEFGLSDGALLA--DGRIVVVG-H 307

Query: 373 NNQGYRSVDNGASWSPVSGLIADFSFAYVTSTPNGSIYAVAGTFSAGGIFASEGSG 428
                 S D G S+S V       S A V++T +G +  V      GG+  +  +G
Sbjct: 308 GGSVLESTDGGRSFS-VFNRPDRLSLAGVSATGDGHLILV----GQGGVHLAAANG 358
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
           solanacearum]
 emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
           solanacearum]
          Length = 554

 Score = 60.1 bits (144), Expect = 4e-008
 Identities = 27/42 (64%), Positives = 35/42 (83%)

Query: 104 YLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQ 145
           +L+ +G+AVSRTTYA L+A IGTT+GAGDG+ TFNLP   G+
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLPDLRGE 456
>ref|NP_298995.1| (NC_002488) phage-related tail fiber protein [Xylella fastidiosa
           9a5c]
 pir||C82649 phage-related tail fiber protein XF1706 [imported] - Xylella
           fastidiosa (strain 9a5c)
 gb|AAF84515.1|AE003994_14 (AE003994) phage-related tail fiber protein [Xylella fastidiosa
           9a5c]
          Length = 387

 Score = 54.7 bits (130), Expect = 2e-006
 Identities = 29/59 (49%), Positives = 36/59 (60%)

Query: 82  TAIAAPRVGERLRTYGTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLP 140
           TA++    G+ + T G  A P  L  DG AVSR  Y  LF  I T+YGAGDG +TFN+P
Sbjct: 131 TALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRAMYPRLFEEINTSYGAGDGVSTFNIP 189
>ref|NP_347555.1| (NC_003030) Probably secreted sialidase; several ASP-boxes and
           dockerin domain [Clostridium acetobutylicum]
 gb|AAK78895.1|AE007608_1 (AE007608) Probably secreted sialidase; several ASP-boxes and
           dockerin domain [Clostridium acetobutylicum]
          Length = 839

 Score = 53.1 bits (126), Expect = 5e-006
 Identities = 59/204 (28%), Positives = 88/204 (42%), Gaps = 18/204 (8%)

Query: 232 RNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGD--FYYVNNGGAIFRSTDNG 289
           +N   +    S DGG+NW   G         G +AA  +     +    G     STDNG
Sbjct: 526 KNQQDKDCGISYDGGKNWFSAGSNISGVYKAGTVAAGADAKTIVWSPEEGANAAYSTDNG 585

Query: 290 ANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGL 349
             W+  SG LP    A +  D       F  F+ G + Y S + GA +     + +++GL
Sbjct: 586 NKWTPCSG-LPQG--AKVRSDRVNP-KKFYGFLNG-KFYISTDAGATFT----QSSQTGL 636

Query: 350 PDTLNPVAFGA--GRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIADFSFAYVTS-TPN 406
           P T     F    G  GD++I    +   + S D+GA+++ VSG+ A  +     S T +
Sbjct: 637 P-TKGKGIFKTVIGHEGDIWIAGGKDGL-WHSTDSGATFTKVSGVDASDTVGLGKSKTDD 694

Query: 407 G--SIYAVAGTFSAGGIFASEGSG 428
           G  +IY  A      GIF S+  G
Sbjct: 695 GYPAIYMDATIDGTAGIFRSDDEG 718
 Score = 44.3 bits (103), Expect = 0.003
 Identities = 53/221 (23%), Positives = 78/221 (34%), Gaps = 53/221 (23%)

Query: 195 SGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSD-TG 253
           S DGG  W S G  N    Y    +AA      ++      +   + S D G  W+  +G
Sbjct: 536 SYDGGKNWFSAG-SNISGVYKAGTVAAGADAKTIVWSPEEGANAAY-STDNGNKWTPCSG 593

Query: 254 LGNGLTINRGDIAAAP-----NGDFYYVNNGGAIF------------------------- 283
           L  G  +    +         NG FY   + GA F                         
Sbjct: 594 LPQGAKVRSDRVNPKKFYGFLNGKFYISTDAGATFTQSSQTGLPTKGKGIFKTVIGHEGD 653

Query: 284 -----------RSTDNGANWSQVSGQLPDNTFA---SIAVDSTGAVYVFAEFVTGTRVYR 329
                       STD+GA +++VSG    +T     S   D   A+Y+ A       ++R
Sbjct: 654 IWIAGGKDGLWHSTDSGATFTKVSGVDASDTVGLGKSKTDDGYPAIYMDATIDGTAGIFR 713

Query: 330 SANNGANWAIAAEEGAESGLPD---TLNPVAFG---AGRNG 364
           S + GA W    ++  + G PD   T +P  +G    G NG
Sbjct: 714 SDDEGATWVRINDDAHQYGSPDYCITGDPNKYGRVFVGTNG 754
>emb|CAC13065.1| (AL445503) conserved hypothetical protein [Streptomyces coelicolor]
          Length = 384

 Score = 50.1 bits (118), Expect = 5e-005
 Identities = 60/223 (26%), Positives = 94/223 (41%), Gaps = 42/223 (18%)

Query: 202 WLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGN----- 256
           W+  G G GL +    +  A D G++ +     ++  +FR+ DGG +W+ +  G      
Sbjct: 165 WVPGGGGEGLHTV---LTDARDPGSVTVAV---STAGVFRTADGGASWAPSNSGVSAVFL 218

Query: 257 -------GLTINR-GDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDN-TFASI 307
                  G  +++    AA P  D  Y+ N   ++RS D GA+W+ +   LP    FA  
Sbjct: 219 PDPDPEFGQCVHKVARDAATP--DRLYLQNHWGVYRSDDAGAHWTDIGEGLPSTFGFAVA 276

Query: 308 AVDSTG-AVYVF-----AEFVTG---TRVYRSANNGANWAIAAEEGAESGLP--DTLNPV 356
           A    G   YVF     A+ V      RV+R+A+ G +W     E   +GLP  D    V
Sbjct: 277 AHPHRGDTAYVFPINADADRVPADHRCRVFRTADAGKSW-----EPLSAGLPREDHYGTV 331

Query: 357 AFGAGRNGDL----YIISPDNNQGYRSVDNGASWSPVSGLIAD 395
              A    D           N + + S D+G SW  ++  + D
Sbjct: 332 LRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQLASHLPD 374
 Score = 46.2 bits (108), Expect = 7e-004
 Identities = 33/127 (25%), Positives = 56/127 (43%), Gaps = 19/127 (14%)

Query: 192 IRKSGDGGDTWLSTGLGNGL-SSYNIPIMA---ANDSGTILLIDRNNNSRP------LFR 241
           + +S D G  W  T +G GL S++   + A     D+  +  I+ + +  P      +FR
Sbjct: 250 VYRSDDAGAHW--TDIGEGLPSTFGFAVAAHPHRGDTAYVFPINADADRVPADHRCRVFR 307

Query: 242 SQDGGENWSDTGLGNGLTINRGDIAA-------APNGDFYYVNNGGAIFRSTDNGANWSQ 294
           + D G++W     G     + G +         A     Y+ N  G +F S D+G +W Q
Sbjct: 308 TADAGKSWEPLSAGLPREDHYGTVLRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQ 367

Query: 295 VSGQLPD 301
           ++  LPD
Sbjct: 368 LASHLPD 374
>gb|AAK06388.1| (AF078042) glycosyl hydrolase 5 [Caldicellulosiruptor sp. Tok7B.1]
          Length = 996

 Score = 48.9 bits (115), Expect = 1e-004
 Identities = 53/197 (26%), Positives = 81/197 (40%), Gaps = 16/197 (8%)

Query: 233 NNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGD-FYYVNNGGAIFRSTDNGAN 291
           N N+  +  S DGG++W         T   G +AAA +G    +   G  +  STDNG  
Sbjct: 529 NPNTNRIGFSYDGGKSWFQGNTEPQGTSEGGTVAAAADGSAVVWAPKGAKVCYSTDNGNK 588

Query: 292 WSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPD 351
           W + +    +    S  V+       F  F  G + Y SA+ G  +     E   +GLP 
Sbjct: 589 WVECANVPSEAIVYSDRVNPN----KFYAFKNG-KFYISADKGKTFI----ESPAAGLPI 639

Query: 352 TLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIADFSFAY---VTSTPNGS 408
           + N      G  GD++++   NN  + S D G S+  +SG+    S  +          +
Sbjct: 640 SGN-FKTVPGIEGDIWLVG--NNGMWHSTDGGYSFVKISGVEDAASIGFGKPAEGETYPA 696

Query: 409 IYAVAGTFSAGGIFASE 425
           IY  A      GIF S+
Sbjct: 697 IYTYAKINGVRGIFRSD 713
>pir||T35237 probable secreted cellulase - Streptomyces coelicolor
 emb|CAA20642.1| (AL031515) putative secreted cellulase [Streptomyces coelicolor
           A3(2)]
          Length = 890

 Score = 46.6 bits (109), Expect = 5e-004
 Identities = 56/223 (25%), Positives = 84/223 (37%), Gaps = 35/223 (15%)

Query: 198 GGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGL--- 254
           G D W  TG+    S    P       GT    D +  +  + RS D G +W    L   
Sbjct: 94  GWDDWGHTGVVALASDAVDPDRVYAAVGTYTN-DWDPTNGAVLRSADRGASWEKADLPFK 152

Query: 255 --GNGLTINRGD-IAAAPN-GDFYYVN--NGGAIFRSTDNGANWSQVSGQLPDNTFASIA 308
             GN      G+ +A  P+  D  Y+   +G  ++RSTD G  WS+V+       +A   
Sbjct: 153 LGGNMPGRGMGERLAVDPHDNDVLYLGAPSGHGLWRSTDAGVTWSEVTAFPNPGNYAQDP 212

Query: 309 VDSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYI 368
            D++G              Y S N G  W +  +E    G       +  G         
Sbjct: 213 NDTSG--------------YASDNQGITW-VTFDESTGGGAGTATRTLYVG--------- 248

Query: 369 ISPDNNQGYRSVDNGASWSPVSGLIADF-SFAYVTSTPNGSIY 410
           ++   N  YRS D GA+W  ++G    + +   V    NG +Y
Sbjct: 249 VADKENAVYRSTDAGATWERLAGQPTGYLAHKGVLDAENGYLY 291
 Score = 40.4 bits (93), Expect = 0.036
 Identities = 39/167 (23%), Positives = 67/167 (39%), Gaps = 34/167 (20%)

Query: 195 SGDGGDTWL----STGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWS 250
           S + G TW+    STG G G ++  + +  A+    +            +RS D G  W 
Sbjct: 220 SDNQGITWVTFDESTGGGAGTATRTLYVGVADKENAV------------YRSTDAGATWE 267

Query: 251 D-TGLGNGLTINRGDIAAAPNGDFY---------YVNNGGAIFRSTDNGANWSQVSGQLP 300
              G   G   ++G +  A NG  Y         Y    G ++R       W+ +S    
Sbjct: 268 RLAGQPTGYLAHKG-VLDAENGYLYLAYSDTGGPYDGGKGRLYRYATATGTWTDISPAAE 326

Query: 301 DNT---FASIAVD----STGAVYVFAEFVTGTRVYRSANNGANWAIA 340
            +T   F+ + VD     T     ++ +   T+++RS ++GA W+ A
Sbjct: 327 ADTYYGFSGLTVDRQRPGTVMATAYSSWWPDTQIFRSTDSGATWSQA 373
>ref|NP_105312.1| (NC_002678) microcystin dependent protein MdpB [Mesorhizobium loti]
 dbj|BAB51098.1| (AP003004) microcystin dependent protein; MdpB [Mesorhizobium loti]
          Length = 169

 Score = 46.6 bits (109), Expect = 5e-004
 Identities = 26/64 (40%), Positives = 36/64 (55%), Gaps = 1/64 (1%)

Query: 84  IAAPRVGERLRTYGTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEG 143
           +A P +GE     G  A   ++  +G  +  + Y  LF +IGTTYG GDG +TF LP   
Sbjct: 1   MAQPYIGEIRMFAGNFAPAGWMFCEGQLLPISEYETLFQLIGTTYG-GDGQSTFALPDLR 59

Query: 144 GQVP 147
           G+VP
Sbjct: 60  GRVP 63
>ref|NP_487884.1| (NC_003272) hypothetical protein [Nostoc sp. PCC 7120]
 dbj|BAB75543.1| (AP003594) ORF_ID:alr3844~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 339

 Score = 44.7 bits (104), Expect = 0.002
 Identities = 40/161 (24%), Positives = 62/161 (37%), Gaps = 18/161 (11%)

Query: 235 NSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQ 294
           +S  L  + DGG NW    L       R D  +    + + V     +  +TD G +WS+
Sbjct: 62  SSATLLETNDGGNNWQPLNLALDDDRYRFDSVSFAGKEGWIVGEPSLLLHTTDEGRSWSR 121

Query: 295 V--SGQLPDNTFASIAVDSTGAVYVFAEFVTGT-RVYRSANNGANW--AIAAEEGAESGL 349
           +  S +LP N    IA+ + G     AE  T    +Y++ + G NW   + A  G    L
Sbjct: 122 IPLSEKLPGN---PIAIQALGT--DIAEMATDVGAIYKTTDGGKNWKAQVEAAVGVVRNL 176

Query: 350 PDTLNPVAFGAGRNGDLYII--------SPDNNQGYRSVDN 382
             +++         G  Y           P N    R V+N
Sbjct: 177 ERSVDGKYVAVSAKGSFYSTWEAGQNAWVPHNRNSSRRVEN 217
>ref|NP_197703.1| (NM_122218) photosystem II stability/assembly factor HCF136
           (sp|O82660) [Arabidopsis thaliana]
 sp|O82660|H136_ARATH Photosystem II stability/assembly factor HCF136, chloroplast
           precursor
 pir||T51828 probable photosystem II stability protein HCF136 [imported] -
           Arabidopsis thaliana
 emb|CAA75723.1| (Y15628) HCF136 protein [Arabidopsis thaliana]
 dbj|BAB09829.1| (AB006708) photosystem II stability/assembly factor HCF136
           [Arabidopsis thaliana]
 gb|AAK74049.1| (AY045691) AT5g23120/MYJ24_11 [Arabidopsis thaliana]
          Length = 403

 Score = 42.4 bits (98), Expect = 0.010
 Identities = 52/224 (23%), Positives = 83/224 (36%), Gaps = 43/224 (19%)

Query: 239 LFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY--------------VNNGGAIFR 284
           L  ++DGG  W           N   I +A   DF Y              +     +  
Sbjct: 120 LLETKDGGSTW-----------NPRSIPSAEEEDFNYRFNSISFKGKEGWIIGKPAILLY 168

Query: 285 STDNGANWSQV--SGQLP-DNTFASIAVDSTGAVYVFAEFVTGT-RVYRSANNGANWAIA 340
           + D G NW ++  S QLP D  F     D +      AE VT    +Y ++N G NW  A
Sbjct: 169 TADAGENWDRIPLSSQLPGDMVFIKATEDKS------AEMVTDEGAIYVTSNRGYNWKAA 222

Query: 341 AEEGAESGLPDTLNPVAFGAG-RNGDLYIISPDNNQGYRSVDNGA----SWSPVSGLIAD 395
            +E   + L  T++    GA    G    ++   +  Y +V +      +W P       
Sbjct: 223 IQETVSATLNRTVSSGISGASYYTGTFSAVNRSPDGRYVAVSSRGNFFLTWEPGQPYWQP 282

Query: 396 FSFAYVTSTPNGSIYAVAGTF---SAGGIFASEGSGVMPDPTDV 436
            + A      N    A  G +     GG++ S+G+G+  +  +V
Sbjct: 283 HNRAVARRIQNMGWRADGGLWLLVRGGGLYLSKGTGITEEFEEV 326
>pir||UMMS period clock protein - mouse (fragment)
 emb|CAA26710.1| (X02966) unidentified reading frame (first ATG at pos. 210) [Mus
           musculus]
 prf||1111288A gene cp2.2 [Mus musculus]
          Length = 713

 Score = 40.0 bits (92), Expect = 0.048
 Identities = 74/338 (21%), Positives = 108/338 (31%), Gaps = 33/338 (9%)

Query: 97  GTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPTPTPGETFV 156
           GT +     +  G+A    T  D     GT  G G GS +      G    T T  ET  
Sbjct: 391 GTGSGSGTAKVTGTATGTGTGTDTGTGTGTGTGTGTGSGSGTAKVTGTATTTATVTETGT 450

Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGT-TIRKSGDGGDTWLSTGLGNGLSSYN 215
            K +     D  +  + G   GT     G  +GT T   +G G  T   TG G G  +  
Sbjct: 451 AKVTG---TDTGTAKVTGTGTGT-----GTGTGTGTGTGTGTGTGTGTGTGTGTGTGT-G 501

Query: 216 IPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY 275
                 + SGT  +   +  +  +  +  G    + TG G G     G  + + +G    
Sbjct: 502 TGTGTGSGSGTAKVTGTDTGTAKVTGTGTGTGTGTGTGTGTGTGTGTGTGSGSGSGS--- 558

Query: 276 VNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGA 335
             +G      T  G      SG        +  V  TG         TGT     + +G+
Sbjct: 559 -GSGSGSGTGTGTGLGSGSGSGTAKVTGTGTAKVTGTG---------TGTGTGTGSGSGS 608

Query: 336 NWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIAD 395
                +  G+ SG          G+G      + S   +        G      +G    
Sbjct: 609 GSGSGSGSGSGSG---------SGSGTGTGTGLGSGSGSGSGTGTGTGTGTGTGTGTGTG 659

Query: 396 FSFAYVTSTPNGSIYAV-AGTFSAGGIFASEGSGVMPD 432
            S   V  T  G+  A   GT +  G     G+G   D
Sbjct: 660 TSTVTVRGTGTGTATATGTGTGTGTGTGTGTGTGTGTD 697
>ref|NP_032862.1| (NM_008836) per-hexamer repeat gene 5 [Mus musculus]
 sp|P08399|PHX5_MOUSE PER-HEXAMER REPEAT PROTEIN 5
 gb|AAA88320.1| (M12039) ORF starts at 87, first start codon is found at 210.;
           putative [Mus musculus]
          Length = 672

 Score = 40.0 bits (92), Expect = 0.048
 Identities = 74/338 (21%), Positives = 108/338 (31%), Gaps = 33/338 (9%)

Query: 97  GTAAEPQYLRTDGSAVSRTTYADLFAVIGTTYGAGDGSTTFNLPQEGGQVPTPTPGETFV 156
           GT +     +  G+A    T  D     GT  G G GS +      G    T T  ET  
Sbjct: 350 GTGSGSGTAKVTGTATGTGTGTDTGTGTGTGTGTGTGSGSGTAKVTGTATTTATVTETGT 409

Query: 157 EKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSGT-TIRKSGDGGDTWLSTGLGNGLSSYN 215
            K +     D  +  + G   GT     G  +GT T   +G G  T   TG G G  +  
Sbjct: 410 AKVTG---TDTGTAKVTGTGTGT-----GTGTGTGTGTGTGTGTGTGTGTGTGTGTGT-G 460

Query: 216 IPIMAANDSGTILLIDRNNNSRPLFRSQDGGENWSDTGLGNGLTINRGDIAAAPNGDFYY 275
                 + SGT  +   +  +  +  +  G    + TG G G     G  + + +G    
Sbjct: 461 TGTGTGSGSGTAKVTGTDTGTAKVTGTGTGTGTGTGTGTGTGTGTGTGTGSGSGSGS--- 517

Query: 276 VNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAVDSTGAVYVFAEFVTGTRVYRSANNGA 335
             +G      T  G      SG        +  V  TG         TGT     + +G+
Sbjct: 518 -GSGSGSGTGTGTGLGSGSGSGTAKVTGTGTAKVTGTG---------TGTGTGTGSGSGS 567

Query: 336 NWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYIISPDNNQGYRSVDNGASWSPVSGLIAD 395
                +  G+ SG          G+G      + S   +        G      +G    
Sbjct: 568 GSGSGSGSGSGSG---------SGSGTGTGTGLGSGSGSGSGTGTGTGTGTGTGTGTGTG 618

Query: 396 FSFAYVTSTPNGSIYAV-AGTFSAGGIFASEGSGVMPD 432
            S   V  T  G+  A   GT +  G     G+G   D
Sbjct: 619 TSTVTVRGTGTGTATATGTGTGTGTGTGTGTGTGTGTD 656
>ref|NP_189520.1| (NM_113799) histone-H4-like protein [Arabidopsis thaliana]
 dbj|BAB03175.1| (AP002057) gene_id:T19N8.7~unknown protein [Arabidopsis thaliana]
          Length = 614

 Score = 40.0 bits (92), Expect = 0.048
 Identities = 75/298 (25%), Positives = 109/298 (36%), Gaps = 53/298 (17%)

Query: 130 AGDGSTTFNLPQEGGQVPTPTPGETFVEKQSDGYPRDVRSTAIAGLPDGTILAAFGRFSG 189
           +G  S     P +GG     T G++     S  YP D  S + AG P G+         G
Sbjct: 147 SGSPSADTGSPTDGGSYGDTT-GDSGSSAGSPSYPSDDGSGSTAGGPSGSTT------DG 199

Query: 190 TTIRKSGDGGDTWLSTGLGNGLSSYNIPIMAANDSGTILLIDRNNNSRPLFRSQDGGENW 249
           +   +S  GGD   S+  G           AA +SG+    D  + +     +  GG   
Sbjct: 200 SAGGESSMGGD---SSSAG-----------AAGESGSAATADSGDAA----GADSGGAAG 241

Query: 250 SDTGLGNGLTINRGDIAAAPNGDFYYVNNGGAIFRSTDNGANWSQVSGQLPDNTFASIAV 309
           +D+  G   + + G  AA          +GGA    T  G+  ++  G+      AS A 
Sbjct: 242 ADS--GGAASADSGGAAAGETA------SGGAAAADTSGGS--AETGGESASGGAASGAG 291

Query: 310 DSTGAVYVFAEFVTGTRVYRSANNGANWAIAAEEGAESGLPDTLNPVAFGAGRNGDLYII 369
            ++G     A   TG     +A+ G     +AE G ES         + GA   G     
Sbjct: 292 AASG-----ASAKTGGESGEAASGG-----SAETGGES--------ASAGAASGGSAETG 333

Query: 370 SPDNNQGYRSVDNGASWSPVSGLIADFSFAYVTSTPNGSIYAVAGTFSAGGIFASEGS 427
               + G  S    AS    SG   +   +  T   + S  A +G  SA G  AS GS
Sbjct: 334 GESGSGGAASGGESASGGATSGGSPETGGSAETGGESASGGAASGGESASGGAASSGS 391
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.314 0.135 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 290,306,667 Number of Sequences: 884779 Number of extensions: 14619616 Number of successful extensions: 24257 Number of sequences better than 5.0e-02: 22 Number of HSP's better than 0.1 without gapping: 3 Number of HSP's successfully gapped in prelim test: 19 Number of HSP's that attempted gapping in prelim test: 24149 Number of HSP's gapped (non-prelim): 70 length of query: 444 length of database: 277,083,050 effective HSP length: 123 effective length of query: 321 effective length of database: 168,255,233 effective search space: 54009929793 effective search space used: 54009929793 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 92 (40.0 bits) Query= orf27 [7391-7566] (174 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_274047.1|  (NC_003112) hypothetical protein [Neisseri...   125  2e-028
ref|NP_108221.1|  (NC_002678) hypothetical protein [Mesorhiz...    84  4e-016
ref|NP_539912.1|  (NC_003317) SECRETION ACTIVATOR PROTEIN [B...    82  3e-015
ref|NP_108621.1|  (NC_002678) hypothetical protein [Mesorhiz...    79  1e-014
ref|NP_540027.1|  (NC_003317) SECRETION ACTIVATOR PROTEIN [B...    72  2e-012
ref|NP_103651.1|  (NC_002678) hypothetical protein [Mesorhiz...    64  5e-010
ref|NP_437108.1|  (NC_003078) putative protein [Sinorhizobiu...    59  2e-008
>ref|NP_274047.1| (NC_003112) hypothetical protein [Neisseria meningitidis MC58]
 ref|NP_283994.1| (NC_003116) hypothetical protein NMA1230 [Neisseria meningitidis
           Z2491]
 pir||E81131 hypothetical protein NMB1012 [imported] - Neisseria meningitidis
           (group B strain MD58, group A strain Z2491)
 gb|AAF41413.1| (AE002452) hypothetical protein [Neisseria meningitidis MC58]
 emb|CAB84490.1| (AL162755) hypothetical protein NMA1230 [Neisseria meningitidis
           Z2491]
          Length = 167

 Score =  125 bits (313), Expect = 2e-028
 Identities = 68/171 (39%), Positives = 101/171 (58%), Gaps = 8/171 (4%)

Query: 2   DTFDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVD 61
           D F++  +RV+ HEGG+ N  KD     GG+   G  K T    A     +  ++ +T +
Sbjct: 3   DKFNQFINRVLSHEGGYANHPKD----PGGETNWGITKRT----AQANGYNGSMRAMTRE 54

Query: 62  EAKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTI 121
           +A +IYR+ +W+         A+ +Q FDA +NHG  NA +MLQRA+G  DDG+IG  ++
Sbjct: 55  QAISIYRKAFWERYRADQMPEAVAFQFFDACVNHGYGNAARMLQRAAGVPDDGVIGAVSL 114

Query: 122 GAARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRIAGNLRLAAEDN 172
            A  +   +DLL+ F AERL F+T + TF  +G+GW RR+A NL  A+ DN
Sbjct: 115 KAINSLPENDLLLRFNAERLVFYTKLGTFTSFGKGWVRRVAQNLIHASADN 165
>ref|NP_108221.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
 dbj|BAB53682.1| (AP003013) hypothetical protein [Mesorhizobium loti]
          Length = 283

 Score = 84.3 bits (207), Expect = 4e-016
 Identities = 52/158 (32%), Positives = 78/158 (48%), Gaps = 8/158 (5%)

Query: 10  RVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEAKAIYRR 69
           RV+ HEGG+ N   D G  T   V      G + G    T     +K +T DE  AIY R
Sbjct: 12  RVLAHEGGYSNHPNDPGGPTMKGVTQRVYDGYRKGKGLATR---SVKGITTDELNAIYDR 68

Query: 70  DWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQR----ASGAHDDGIIGPKTIGAAR 125
            +WDA+        + Y +FD A+N G   +   LQ+    A     DG++G  T+ A +
Sbjct: 69  QYWDAVRGDDLPAGVDYVVFDGAVNSGPSRSIMWLQQALRPAYTGRIDGVLGMGTLAALK 128

Query: 126 ATDLDDLLMLFLAE-RLEFFTNIKTFDHYGRGWSRRIA 162
           A   +D L+  +   R+ F  ++ TF ++GRGW+ R+A
Sbjct: 129 ADKNNDALIDRICNARMAFLKHLSTFGNFGRGWTARVA 166
>ref|NP_539912.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
 gb|AAL52176.1| (AE009540) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
          Length = 252

 Score = 81.6 bits (200), Expect = 3e-015
 Identities = 47/156 (30%), Positives = 71/156 (45%), Gaps = 4/156 (2%)

Query: 3   TFDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDE 62
           TF KA   V   EGG+ +  KD     GG   +G    T            ++K LT  +
Sbjct: 5   TFAKAMPHVFSEEGGYVDHPKD----PGGATNMGITLATLSAWEGRKVSKAEVKALTKTK 60

Query: 63  AKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTIG 122
           A  IYR ++W+ +        + +   D AI+ G   A KMLQ+  G   DG+IG KT+ 
Sbjct: 61  ATDIYRENYWNKVAGDDLPAGVDHATLDFAIHSGPARALKMLQKVVGVDQDGVIGAKTLA 120

Query: 123 AARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWS 158
           A R    + ++      RL +   + TF  +G+GW+
Sbjct: 121 AVRKMAANRIINELCDARLAWLKGLGTFSTFGKGWT 156
>ref|NP_108621.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
 dbj|BAB54407.1| (AP003014) hypothetical protein [Mesorhizobium loti]
          Length = 283

 Score = 79.3 bits (194), Expect = 1e-014
 Identities = 50/158 (31%), Positives = 76/158 (47%), Gaps = 8/158 (5%)

Query: 10  RVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEAKAIYRR 69
           RV+ HEGG+ N   D G  T   V      G + G    T     +K++T  E   IY R
Sbjct: 12  RVLAHEGGYSNHPNDPGGPTMKGVTQRVYDGFRKGKGLATR---SVKSITTAELNEIYDR 68

Query: 70  DWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQR----ASGAHDDGIIGPKTIGAAR 125
            +WDA+        + Y +FD A+N G   +   LQ+    A     DG++G  T+ A +
Sbjct: 69  QYWDAVQGDALPAGVDYVVFDGAVNSGPGRSIMWLQQALRPAYTGRIDGVLGMGTLAALK 128

Query: 126 ATDLDDLLM-LFLAERLEFFTNIKTFDHYGRGWSRRIA 162
           A   +D L+    + R+ F  ++ TF  +GRGW+ R+A
Sbjct: 129 ADKNNDALIDRICSARMAFLKHLSTFGTFGRGWTARVA 166
>ref|NP_540027.1| (NC_003317) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
 gb|AAL52291.1| (AE009550) SECRETION ACTIVATOR PROTEIN [Brucella melitensis]
          Length = 253

 Score = 72.4 bits (176), Expect = 2e-012
 Identities = 42/158 (26%), Positives = 67/158 (41%), Gaps = 4/158 (2%)

Query: 4   FDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEA 63
           F      +   EGG+ ++  D     GG   +G    T            D+K LT   A
Sbjct: 5   FQTVMPYIFSEEGGYADNPAD----PGGATNMGITINTLSAWEGRQVSPQDVKELTQATA 60

Query: 64  KAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDGIIGPKTIGA 123
             IY+ ++W+ +        + Y +FD A+N G   A K LQ+     +DGIIG +T+ A
Sbjct: 61  TQIYQVEFWNKIDGNDLPSGVDYALFDFAVNSGPGRAAKTLQKILAMPEDGIIGAQTVAA 120

Query: 124 ARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRI 161
           A A   + ++      R  +   + T   +G GW  R+
Sbjct: 121 AAARSPEGIINALCDARAAWLRGLSTAATFGNGWLARV 158
>ref|NP_103651.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
 dbj|BAB49437.1| (AP002999) hypothetical protein [Mesorhizobium loti]
          Length = 409

 Score = 64.3 bits (155), Expect = 5e-010
 Identities = 46/172 (26%), Positives = 78/172 (44%), Gaps = 10/172 (5%)

Query: 4   FDKAFDRVIGHEGGFQNDQKDRGNWTGGKVGVGENKGTKFGLAAMTYPDLDIKNLTVDEA 63
           F++A   V   EGG+ +D KD     GG    G  + T            D+ N++  EA
Sbjct: 49  FERAQGIVREFEGGYSDDPKD----PGGATNFGIARKTLAAWRGKPVSKADVANMSYQEA 104

Query: 64  KAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHD-----DGIIGP 118
           K I+  ++W     G     +   +++ A++ G+R+A   LQ+A   +      DG IG 
Sbjct: 105 KEIFFSEYWSKSSCGAMPGPLALAVYNVAVHAGVRDAATFLQKALNQNGAAVAVDGGIGG 164

Query: 119 KTIGAARATDLDDLLMLFLAERLEFFTNIKTFDHYGRGWSRRIAGNLRLAAE 170
           +T+GA     L D++   +          + ++HY  G++RR+   LRL  E
Sbjct: 165 ETLGAIPKAPLPDVIGDIIDLYDAKLRAHRDYEHYKNGFNRRV-NKLRLETE 215
>ref|NP_437108.1| (NC_003078) putative protein [Sinorhizobium meliloti]
 emb|CAC48968.1| (AL603644) putative protein [Sinorhizobium meliloti]
          Length = 91

 Score = 58.9 bits (141), Expect = 2e-008
 Identities = 25/71 (35%), Positives = 40/71 (56%)

Query: 55  IKNLTVDEAKAIYRRDWWDALGMGTFRPAMQYQMFDAAINHGMRNATKMLQRASGAHDDG 114
           ++ ++ +EA  IYRR +W   G+    P + Y +FD  +N G   A K LQ+  G  +DG
Sbjct: 6   VRAMSREEAADIYRRSYWPQCGVDLLPPGLDYAVFDFGVNSGPARAVKTLQKVVGVREDG 65

Query: 115 IIGPKTIGAAR 125
            +G +T+ A R
Sbjct: 66  HVGEQTLAAVR 76
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.321 0.139 5.49e-269 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 99,344,891 Number of Sequences: 884779 Number of extensions: 4043632 Number of successful extensions: 7365 Number of sequences better than 5.0e-02: 13 Number of HSP's better than 0.1 without gapping: 8 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 7345 Number of HSP's gapped (non-prelim): 13 length of query: 174 length of database: 277,083,050 effective HSP length: 112 effective length of query: 62 effective length of database: 177,987,802 effective search space: 11035243724 effective search space used: 11035243724 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 86 (37.7 bits) Query= orf28 [7565-7764] (198 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.320 0.135 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 87,719,920 Number of Sequences: 884779 Number of extensions: 2728446 Number of successful extensions: 6008 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 6008 Number of HSP's gapped (non-prelim): 0 length of query: 198 length of database: 277,083,050 effective HSP length: 114 effective length of query: 84 effective length of database: 176,218,244 effective search space: 14802332496 effective search space used: 14802332496 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 87 (38.1 bits) Query= orf29 [7799-7878] (78 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.313 0.128 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 39,118,827 Number of Sequences: 884779 Number of extensions: 1273635 Number of successful extensions: 2025 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2025 Number of HSP's gapped (non-prelim): 0 length of query: 78 length of database: 277,083,050 effective HSP length: 54 effective length of query: 24 effective length of database: 229,304,984 effective search space: 5503319616 effective search space used: 5503319616 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.9 bits) S2: 84 (37.0 bits) Query= orf30 [7879-8577] (697 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_496184.1|  (NM_063783) polycistic kidney disease prot...    42  0.022
sp|Q09625|YS8A_CAEEL  Hypothetical 84.3 kDa protein ZK945.10...    42  0.022
ref|NP_506158.1|  (NM_073757) F56H9.1.p [Caenorhabditis eleg...    42  0.028
pir||T29634  hypothetical protein C12D12.1 - Caenorhabditis ...    41  0.048
ref|NP_508603.1|  (NM_076202) C12D12.1a.p [Caenorhabditis el...    41  0.048
>ref|NP_496184.1| (NM_063783) polycistic kidney disease protein 1 like with repeated
           domains at the N-terminus [Caenorhabditis elegans]
 emb|CAB70201.1| (Z48582) N-terminus contains five repeated domains and the
           C-terminus shows similarity to the polycistic kidney
           disease protein 1 (PIR accession number A44604),
           contains similarity to Pfam domain: PF01477 (PLAT/LH2
           domain), Score=130.4, E-value=1.1e-35>
 emb|CAB70192.1| (Z48544) N-terminus contains five repeated domains and the
           C-terminus shows similarity to the polycistic kidney
           disease protein 1 (PIR accession number A44604),
           contains similarity to Pfam domain: PF01477 (PLAT/LH2
           domain), Score=130.4, E-value=1.1e-35>
          Length = 3178

 Score = 42.0 bits (97), Expect = 0.022
 Identities = 55/238 (23%), Positives = 86/238 (36%), Gaps = 27/238 (11%)

Query: 470 ENTWRIFASHDFNETRSGNLGQASLHSLSTSLPDSVNVGTALPTSVTVEYNVT-----NH 524
           E+T   F S     T S +  Q S  ++ TS P S  + T++PT+ T E   T     ++
Sbjct: 321 ESTSTTFTSTASTSTSSTSTTQQSSSTI-TSSPSSTTLSTSIPTTTTPEITSTLSSLPDN 379

Query: 525 ADIRTLQLQIDGQSFTPPLPTQDGPQS---------YTATVTAISTTAPRNIITSLTGFR 575
           A    L       +FT  + T    +           T+T + ++TT P   +T+ T   
Sbjct: 380 AICSYLDETTTSTTFTTTMLTSTTTEEPSTSTTTTEVTSTSSTVTTTEPTTTLTTSTAST 439

Query: 576 RNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTATLQSAPRANRMTVGTGQAA--- 632
                  +   ++    P  S     S+S+   T  T+T +S   +   TV T   A   
Sbjct: 440 STTEPSTSTVTTSPSTSPVTSTVTSSSSSSTTVTTPTST-ESTSTSPSSTVTTSTTAPST 498

Query: 633 --AGDYFIVLAPDSQAISKITDTV------LDQDVLSLFTKTEDVRQIAGLEYDAYVI 682
              G       P S A S ++ T             S  TK+E      G   D Y +
Sbjct: 499 STTGPSSSSSTPSSTASSSVSSTASSTQSSTSTQQSSTTTKSETTTSSDGTNPDFYFV 556
>sp|Q09625|YS8A_CAEEL Hypothetical 84.3 kDa protein ZK945.10 in chromosome II
 pir||T21460 hypothetical protein ZK945.10 - Caenorhabditis elegans
          Length = 796

 Score = 42.0 bits (97), Expect = 0.022
 Identities = 55/238 (23%), Positives = 86/238 (36%), Gaps = 27/238 (11%)

Query: 470 ENTWRIFASHDFNETRSGNLGQASLHSLSTSLPDSVNVGTALPTSVTVEYNVT-----NH 524
           E+T   F S     T S +  Q S  ++ TS P S  + T++PT+ T E   T     ++
Sbjct: 321 ESTSTTFTSTASTSTSSTSTTQQSSSTI-TSSPSSTTLSTSIPTTTTPEITSTLSSLPDN 379

Query: 525 ADIRTLQLQIDGQSFTPPLPTQDGPQS---------YTATVTAISTTAPRNIITSLTGFR 575
           A    L       +FT  + T    +           T+T + ++TT P   +T+ T   
Sbjct: 380 AICSYLDETTTSTTFTTTMLTSTTTEEPSTSTTTTEVTSTSSTVTTTEPTTTLTTSTAST 439

Query: 576 RNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTATLQSAPRANRMTVGTGQAA--- 632
                  +   ++    P  S     S+S+   T  T+T +S   +   TV T   A   
Sbjct: 440 STTEPSTSTVTTSPSTSPVTSTVTSSSSSSTTVTTPTST-ESTSTSPSSTVTTSTTAPST 498

Query: 633 --AGDYFIVLAPDSQAISKITDTV------LDQDVLSLFTKTEDVRQIAGLEYDAYVI 682
              G       P S A S ++ T             S  TK+E      G   D Y +
Sbjct: 499 STTGPSSSSSTPSSTASSSVSSTASSTQSSTSTQQSSTTTKSETTTSSDGTNPDFYFV 556
>ref|NP_506158.1| (NM_073757) F56H9.1.p [Caenorhabditis elegans]
 pir||T22808 hypothetical protein F56H9.1 - Caenorhabditis elegans
 emb|CAA98949.1| (Z74473) predicted using Genefinder [Caenorhabditis elegans]
          Length = 770

 Score = 41.6 bits (96), Expect = 0.028
 Identities = 30/119 (25%), Positives = 47/119 (39%), Gaps = 2/119 (1%)

Query: 498 STSLPDSVNVGTALPTSVTVEYNVTNHADIRTLQLQIDGQSFTPPLPTQDGPQSYTATV- 556
           +T++P S    T+  T+ TV    T+ A   T+       + T  +P      + T TV 
Sbjct: 35  TTTVPVSTTTSTSSTTTTTVPSTTTSSATTTTVPSTTTSSTTTTTVPPTTTSTTTTTTVP 94

Query: 557 -TAISTTAPRNIITSLTGFRRNGSALNTVTDSTAVVVPPPSVFWGRSASNNPATVDTAT 614
            T  STT    +  + T      +   T T ST     PP+     + +  P T  T+T
Sbjct: 95  PTTTSTTTTTTVPPTTTSTTTTTTVPPTTTSSTTTTTVPPTTTSSTTTTTVPPTTTTST 153
>pir||T29634 hypothetical protein C12D12.1 - Caenorhabditis elegans
          Length = 825

 Score = 40.8 bits (94), Expect = 0.048
 Identities = 44/163 (26%), Positives = 64/163 (38%), Gaps = 14/163 (8%)

Query: 498 STSLPDSVNVGTALPTSVTVEYNVTNHADIRTLQLQIDGQSFTPPLPTQDGPQSYTATVT 557
           +T +P S  V T  PT+VTV    T+ A + T        +  P  PT+  P S  +TV 
Sbjct: 640 TTVVPSSATVPTTPPTTVTVAATTTSKAPVVT-----TSPTLAPTSPTK-LPTSPPSTV- 692

Query: 558 AISTTAPRNIITSLTGFRRNGSA-------LNTVTDSTAVVVPPPSVFWGRSASNNPATV 610
             S TAP N+ T  T      S+       +N  + +TA  VPP +     + ++   T 
Sbjct: 693 GTSPTAPANLTTPTTAPVNPTSSTTAPTAPVNPTSPTTAPTVPPVTTTTPTTTTSTTTTT 752

Query: 611 DTATLQSAPRANRMTVGTGQAAAGDYFIVLAPDSQAISKITDT 653
            T T  +        V T          V  P SQ  ++   T
Sbjct: 753 TTTTTTTQTTPTTPVVTTPSTITPTTRPVTQPASQPATQPATT 795
>ref|NP_508603.1| (NM_076202) C12D12.1a.p [Caenorhabditis elegans]
 gb|AAA96080.2| (U51998) Hypothetical protein C12D12.1a [Caenorhabditis elegans]
          Length = 769

 Score = 40.8 bits (94), Expect = 0.048
 Identities = 44/163 (26%), Positives = 64/163 (38%), Gaps = 14/163 (8%)

Query: 498 STSLPDSVNVGTALPTSVTVEYNVTNHADIRTLQLQIDGQSFTPPLPTQDGPQSYTATVT 557
           +T +P S  V T  PT+VTV    T+ A + T        +  P  PT+  P S  +TV 
Sbjct: 584 TTVVPSSATVPTTPPTTVTVAATTTSKAPVVT-----TSPTLAPTSPTK-LPTSPPSTV- 636

Query: 558 AISTTAPRNIITSLTGFRRNGSA-------LNTVTDSTAVVVPPPSVFWGRSASNNPATV 610
             S TAP N+ T  T      S+       +N  + +TA  VPP +     + ++   T 
Sbjct: 637 GTSPTAPANLTTPTTAPVNPTSSTTAPTAPVNPTSPTTAPTVPPVTTTTPTTTTSTTTTT 696

Query: 611 DTATLQSAPRANRMTVGTGQAAAGDYFIVLAPDSQAISKITDT 653
            T T  +        V T          V  P SQ  ++   T
Sbjct: 697 TTTTTTTQTTPTTPVVTTPSTITPTTRPVTQPASQPATQPATT 739
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.315 0.132 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 378,287,469 Number of Sequences: 884779 Number of extensions: 16344170 Number of successful extensions: 39022 Number of sequences better than 5.0e-02: 5 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 5 Number of HSP's that attempted gapping in prelim test: 38976 Number of HSP's gapped (non-prelim): 24 length of query: 697 length of database: 277,083,050 effective HSP length: 127 effective length of query: 570 effective length of database: 164,716,117 effective search space: 93888186690 effective search space used: 93888186690 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 94 (40.8 bits) Query= orf31 [8578-9669] (1090 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.315 0.132 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 594,227,895 Number of Sequences: 884779 Number of extensions: 26169864 Number of successful extensions: 43435 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 43435 Number of HSP's gapped (non-prelim): 0 length of query: 1090 length of database: 277,083,050 effective HSP length: 131 effective length of query: 959 effective length of database: 161,177,001 effective search space: 154568743959 effective search space used: 154568743959 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (22.0 bits) S2: 96 (41.6 bits) Query= orf32 [9748-9906] (157 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.334 0.148 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 55,526,976 Number of Sequences: 884779 Number of extensions: 1426651 Number of successful extensions: 3170 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3170 Number of HSP's gapped (non-prelim): 0 length of query: 157 length of database: 277,083,050 effective HSP length: 110 effective length of query: 47 effective length of database: 179,757,360 effective search space: 8448595920 effective search space used: 8448595920 T: 11 A: 40 X1: 15 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.6 bits) S2: 85 (37.4 bits) Query= orf33 [9910-10000] (89 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.321 0.135 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 32,160,522 Number of Sequences: 884779 Number of extensions: 824397 Number of successful extensions: 2163 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2163 Number of HSP's gapped (non-prelim): 0 length of query: 89 length of database: 277,083,050 effective HSP length: 65 effective length of query: 24 effective length of database: 219,572,415 effective search space: 5269737960 effective search space used: 5269737960 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 84 (37.0 bits) Query= orf34 [10005-10089] (83 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.320 0.136 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 37,086,067 Number of Sequences: 884779 Number of extensions: 880080 Number of successful extensions: 1874 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 1874 Number of HSP's gapped (non-prelim): 0 length of query: 83 length of database: 277,083,050 effective HSP length: 59 effective length of query: 24 effective length of database: 224,881,089 effective search space: 5397146136 effective search space used: 5397146136 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 84 (37.0 bits) Query= orf35 [10080-10160] (79 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.311 0.130 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 34,827,107 Number of Sequences: 884779 Number of extensions: 815998 Number of successful extensions: 1227 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 1227 Number of HSP's gapped (non-prelim): 0 length of query: 79 length of database: 277,083,050 effective HSP length: 55 effective length of query: 24 effective length of database: 228,420,205 effective search space: 5482084920 effective search space used: 5482084920 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 84 (37.0 bits) Query= orf36 [10150-10339] (188 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_059586.1|  (NC_002371) eac [Enterobacteria phage P22]...    39  0.026
>ref|NP_059586.1| (NC_002371) eac [Enterobacteria phage P22]
 sp|Q03545|VEAC_BPP22 Eac protein
 pir||S35280 eaC protein - phage P22
 gb|AAC18880.1| (L06296) EaC [Enterobacteria phage P22]
 gb|AAF75004.1| (AF217253) eac [Enterobacteria phage P22]
          Length = 211

 Score = 38.9 bits (89), Expect = 0.026
 Identities = 33/166 (19%), Positives = 72/166 (42%), Gaps = 11/166 (6%)

Query: 2   FKYFQVNPDVGHYLVDKYNSTVNAERERIVSHFLSAVGAVGIVMVREW-GMPSRIEALVF 60
           + Y+ V  +    L+  Y+ T+N +R  I++     VGA+     R W G    +++ V+
Sbjct: 8   YDYYMVEGEDVKELIQSYD-TINDQRNSILTTAAEKVGAIAWTTARSWGGEGGLLQSFVW 66

Query: 61  PATHEICNMEGVKLTEHRQGQVVTF---DDNAPFAGHYYEAISGL----NAELVAYPDFS 113
              +E      +K  +   G+ V       N      Y + +  +    NA+L + P+++
Sbjct: 67  EKGYEFPCQITIKREDFLDGKRVVIARGKGNTKEGRAYNKELDAIMHNANAKLKSLPEWN 126

Query: 114 DWLHHTMGVTRYALAEKDDEAGTYKKIATH--SHLMADGRILFAVP 157
            ++ +  G+ R  +  +      +  ++T+   H   D  ++FA+P
Sbjct: 127 YYITNHYGIMRTGIGGQSGRGFGFVMLSTYGGKHPQRDDCLIFAIP 172
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.319 0.134 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 103,847,221 Number of Sequences: 884779 Number of extensions: 3974563 Number of successful extensions: 7256 Number of sequences better than 5.0e-02: 1 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 1 Number of HSP's that attempted gapping in prelim test: 7256 Number of HSP's gapped (non-prelim): 1 length of query: 188 length of database: 277,083,050 effective HSP length: 113 effective length of query: 75 effective length of database: 177,103,023 effective search space: 13282726725 effective search space used: 13282726725 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 87 (38.1 bits) Query= orf37 [10337-10513] (175 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.137 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 96,295,591 Number of Sequences: 884779 Number of extensions: 3505451 Number of successful extensions: 6196 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 6196 Number of HSP's gapped (non-prelim): 0 length of query: 175 length of database: 277,083,050 effective HSP length: 112 effective length of query: 63 effective length of database: 177,987,802 effective search space: 11213231526 effective search space used: 11213231526 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 86 (37.7 bits) Query= orf38 [10532-10595] (62 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

sp|P06225|DPOL_BPSP2  DNA POLYMERASE >gi|67052|pir||DJBPS2 D...    51  3e-006
ref|NP_297973.1|  (NC_002488) phage-related DNA polymerase [...    41  0.003
ref|NP_299570.1|  (NC_002488) phage-related protein [Xylella...    41  0.003
ref|NP_299803.1|  (NC_002488) phage-related DNA polymerase [...    41  0.003
ref|NP_051006.1|  (NC_000935) P45 [Acyrthosiphon pisum bacte...    40  0.004
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
 pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
 gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
           SPO2]
 emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
          Length = 648

 Score = 50.8 bits (120), Expect = 3e-006
 Identities = 22/62 (35%), Positives = 35/62 (55%)

Query: 1   MKKCENNGYPVVMHTHDELIAEVPEDFGSVEEMAALMTEREPWRAWWPIRAAGWQDDRYQ 60
           + + +N GY  VMH HDE + +VP     ++++ A+M E   W    P+ A G+  D Y+
Sbjct: 587 LMRLDNAGYKTVMHVHDEAVLDVPRGKNELDKVEAIMGEPISWAKGLPLTADGFVTDYYK 646

Query: 61  KD 62
           KD
Sbjct: 647 KD 648
>ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
 pir||H82774 phage-related DNA polymerase XF0683 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF83493.1|AE003912_5 (AE003912) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
          Length = 726

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)

Query: 1   MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
           M   E  GY +V+  HDE+I E  +    +   +AALM    PW    P+ A G+   RY
Sbjct: 664 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 723

Query: 60  QK 61
           +K
Sbjct: 724 RK 725
>ref|NP_299570.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
 pir||H82574 phage-related protein XF2291 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF85090.1|AE004041_2 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
          Length = 252

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)

Query: 1   MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
           M   E  GY +V+  HDE+I E  +    +   +AALM    PW    P+ A G+   RY
Sbjct: 190 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 249

Query: 60  QK 61
           +K
Sbjct: 250 RK 251
>ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
 pir||C82548 phage-related DNA polymerase XF2525 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF85323.1|AE004059_13 (AE004059) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
          Length = 726

 Score = 40.8 bits (94), Expect = 0.003
 Identities = 22/62 (35%), Positives = 31/62 (49%), Gaps = 1/62 (1%)

Query: 1   MKKCENNGYPVVMHTHDELIAEVPED-FGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
           M   E  GY +V+  HDE+I E  +    +   +AALM    PW    P+ A G+   RY
Sbjct: 664 MPAIEAAGYSIVLTVHDEIITEADDHPCFNAAHLAALMATPPPWAEGLPLAAEGFDTYRY 723

Query: 60  QK 61
           +K
Sbjct: 724 RK 725
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
 sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
 gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
          Length = 993

 Score = 40.4 bits (93), Expect = 0.004
 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 1/63 (1%)

Query: 1   MKKCENNGYPVVMHTHDELIAEVPE-DFGSVEEMAALMTEREPWRAWWPIRAAGWQDDRY 59
           M   E  GY +V+  HDE+I+E P+    S E ++ L++    W    P+ A G++  RY
Sbjct: 931 MPPIEKAGYEIVLTVHDEIISEAPDTPQFSAEGLSKLLSFNSDWAWDLPLSANGFETYRY 990

Query: 60  QKD 62
           +K+
Sbjct: 991 RKE 993
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.317 0.133 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 37,384,509 Number of Sequences: 884779 Number of extensions: 1135657 Number of successful extensions: 3794 Number of sequences better than 5.0e-02: 5 Number of HSP's better than 0.1 without gapping: 1 Number of HSP's successfully gapped in prelim test: 4 Number of HSP's that attempted gapping in prelim test: 3790 Number of HSP's gapped (non-prelim): 5 length of query: 62 length of database: 277,083,050 effective HSP length: 38 effective length of query: 24 effective length of database: 243,461,448 effective search space: 5843074752 effective search space used: 5843074752 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 84 (37.0 bits) Query= orf39 [10593-10898] (304 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

sp|P06225|DPOL_BPSP2  DNA POLYMERASE >gi|67052|pir||DJBPS2 D...   126  2e-028
ref|NP_051006.1|  (NC_000935) P45 [Acyrthosiphon pisum bacte...    80  2e-014
ref|NP_299803.1|  (NC_002488) phage-related DNA polymerase [...    66  5e-010
ref|NP_297973.1|  (NC_002488) phage-related DNA polymerase [...    65  6e-010
ref|NP_299570.1|  (NC_002488) phage-related protein [Xylella...    52  9e-006
sp|Q9S1G2|DPO1_RHILE  DNA polymerase I (POL I) >gi|5596366|g...    44  0.002
ref|NP_540742.1|  (NC_003317) DNA POLYMERASE I [Brucella mel...    43  0.003
ref|NP_254180.1|  (NC_002516) DNA polymerase I [Pseudomonas ...    43  0.003
ref|NP_384281.1|  (NC_003047) PROBABLE DNA POLYMERASE I PROT...    43  0.004
ref|NP_353146.1|  (NC_003062) AGR_C_169p [Agrobacterium tume...    42  0.010
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
 pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
 gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
           SPO2]
 emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
          Length = 648

 Score =  126 bits (317), Expect = 2e-028
 Identities = 96/266 (36%), Positives = 135/266 (50%), Gaps = 35/266 (13%)

Query: 18  NKDWTVDAVRWALKDLTHRNLDLIIAIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIE 77
           NK   +D  R  LK   +  ++L+   +G    +LS  +R  FI + G +F   DFSAIE
Sbjct: 335 NKIEDLDTARNLLKGGHYEAIELL---YGQVPFVLSQLVRTAFIPSEGNEFYVSDFSAIE 391

Query: 78  AVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEEMMAYKEQNKHDHPLRKSLGKVA 137
           A V+A LA  EWRLEVF THGKIYE SA+++  VP E +           PLR+  GKVA
Sbjct: 392 ARVIAWLAGEEWRLEVFNTHGKIYEASAAQMFKVPVESI-------TKGSPLRQK-GKVA 443

Query: 138 ELASGYGGWLGAWKAFGAEAHFADDDEMKKAILGWRDASPEIVEFWGGQYRRDPFTRKWT 197
           ELA GY G  GA    GA      + E+ + +  WR A+ +IV+FW              
Sbjct: 444 ELALGYQGGKGALIQMGALNMGLAEGELPELVKAWRTANKKIVKFW-------------- 489

Query: 198 KEYFGLEGAAIMAILNPGKCYAVHDITYAVHDDVLYCRLPSGRFLQYHQPRLVESDSWKG 257
              + +E AAI A+         H +T+     +L+ +LPSGR L Y +P+L E D   G
Sbjct: 490 ---YDVEAAAIKAVKERKPVKLQHGLTFLYESGILFVQLPSGRRLAYAKPKL-ELDERFG 545

Query: 258 PEYQITFMGYNTNSQKGPVGWVRMDT 283
            E  +T+ G   + +     W R++T
Sbjct: 546 KE-ALTYEGKLESGK-----WGRLNT 565
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
 sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
 gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
          Length = 993

 Score = 80.1 bits (196), Expect = 2e-014
 Identities = 81/268 (30%), Positives = 117/268 (43%), Gaps = 51/268 (19%)

Query: 44  IWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFR--THGK-- 99
           I  D + L S  LRG  IA PGKK V  D S IE  +LA LA   W++  F    +GK  
Sbjct: 359 ICDDIMQLTSSALRGCIIAPPGKKLVISDLSNIEGRMLAWLAGENWKVNAFSEFDNGKGD 418

Query: 100 -IYEMSASKISGVPFEEMMAYKEQNKHDHPLRKSLGKVAELASGYGGWLGAWKAFGAEAH 158
            +Y+++ ++   +  E++   KEQ        + +GKV EL  GYGG + A+  F A A+
Sbjct: 419 DLYKLAYARAFNLLPEDVT--KEQ--------RQIGKVMELGLGYGGGVAAFLTF-ALAY 467

Query: 159 FADDDEM------------KKAILGWRDASPEIVEFWGGQYR----RDPFTRKWTKE--- 199
             D DE+            K+  + W   S E  + +G   +     D   R W      
Sbjct: 468 GLDLDELAEAALPNIPPGVKREAMRWYQKSVETDKTYGLSEKIFVTCDSLKRMWRNAHPQ 527

Query: 200 ----YFGLEGAAIMAILNPGKCYAVHDITYAVHDDVLYCRLPSGRFLQYHQPRLVESDSW 255
               ++ +E A   AI +PG  +    ++       L   LPSGR L Y   R+      
Sbjct: 528 TVSFWYDIEDAVKQAIQSPGIAFKCRKLSVRRDKSWLRICLPSGRSLCYPSARIENG--- 584

Query: 256 KGPEYQITFMGYNTNSQKGPVGWVRMDT 283
                QIT+MG N  S+K    W R+ T
Sbjct: 585 -----QITYMGTNPYSRK----WERLKT 603
>ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
 pir||C82548 phage-related DNA polymerase XF2525 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF85323.1|AE004059_13 (AE004059) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
          Length = 726

 Score = 65.9 bits (159), Expect = 5e-010
 Identities = 78/296 (26%), Positives = 113/296 (37%), Gaps = 62/296 (20%)

Query: 39  DLIIAIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFR--- 95
           D +  ++ D + L S  LR   IA   KK V  D S IE  VLA LA    +L  FR   
Sbjct: 358 DCVDLVFDDVMALTSSALRSCLIAPTHKKLVVADLSNIEGRVLAWLAGETPKLHAFREFD 417

Query: 96  ---------------THGKIYEMSASKISGVPFEEMMA----YKEQNKHDHPL------- 129
                          THG +     +         +      YK    H   +       
Sbjct: 418 TCQGVDGCWHTGAAITHGALRAAPLALQRNADHAPIRRGDDIYKRAYAHSFGIAPQAVSK 477

Query: 130 -RKSLGKVAELASGYGGWLGAWKAFGAEAHFADDDEMKKAILG--WRDASPEIVEFWGGQ 186
            ++ +GKV ELA GYGG +GA+ AF A  H   +    +A+        + E++E+    
Sbjct: 478 EQRQIGKVQELALGYGGGVGAFAAFAALYHIDLEAMAAQAVFPPLLLQEATELLEWTKAN 537

Query: 187 YRR------------DPFTRKWTKE-------YFGLEGAAIMAILNPGKCYAVHDITYAV 227
           +R             D F R W          +  L+ A   AI +P   +    IT   
Sbjct: 538 HRPTFDLSDRAWLACDVFKRAWRNAHPAIAAFWQALQFAVTDAIHHPETTHTCCGITVQY 597

Query: 228 HDDVLYCRLPSGRFLQYHQPRLVESDSWKGPEYQITFMGYNTNSQKGPVGWVRMDT 283
               L  RLPSGR L Y  PR+ E  +       +++MG +  ++K    W R+ T
Sbjct: 598 RRAWLRLRLPSGRVLYYAAPRVDEHGA-------LSYMGTHPITRK----WTRLTT 642
>ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
 pir||H82774 phage-related DNA polymerase XF0683 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF83493.1|AE003912_5 (AE003912) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
          Length = 726

 Score = 65.5 bits (158), Expect = 6e-010
 Identities = 78/291 (26%), Positives = 111/291 (37%), Gaps = 62/291 (21%)

Query: 44  IWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFR-------- 95
           ++ D + L S  LR   IA   KK V  D S IE  VLA LA    +L  FR        
Sbjct: 363 VFDDVMALTSSALRSCLIAPKNKKLVVADLSNIEGRVLAWLAGETPKLHAFREFDTCQGV 422

Query: 96  ----------THGKIYEMSASKISGVPFEEMMA----YKEQNKHDHPL--------RKSL 133
                     THG +     +         +      YK    H   +        ++ +
Sbjct: 423 DGTWHTGAAITHGALRGAPIALQRNADHAPIRRGDDIYKRAYAHSFGIAPQAVSKEQRQI 482

Query: 134 GKVAELASGYGGWLGAWKAFGAEAHFADDDEMKKAILG--WRDASPEIVEFWGGQYRR-- 189
           GKV ELA GYGG +GA+ AF A  H   +    +A L       + E++E+    +R   
Sbjct: 483 GKVQELALGYGGGVGAFAAFAAMYHIDLEAMAAQAALPPLLLQQATELLEWTKANHRPTF 542

Query: 190 ----------DPFTRKWTKE-------YFGLEGAAIMAILNPGKCYAVHDITYAVHDDVL 232
                     D F R W          +  L+ A   AI +P   +    IT       L
Sbjct: 543 DLSDRAWLACDVFKRAWRNAHPAIAAFWQALQFAVTDAIHHPETAHTCCGITVQYRRAWL 602

Query: 233 YCRLPSGRFLQYHQPRLVESDSWKGPEYQITFMGYNTNSQKGPVGWVRMDT 283
             RLPSGR L Y  PR+ E  +       +++MG +  ++K    W R+ T
Sbjct: 603 RLRLPSGRVLYYAAPRVDEHGA-------LSYMGTHPITRK----WTRLTT 642
>ref|NP_299570.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
 pir||H82574 phage-related protein XF2291 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF85090.1|AE004041_2 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
          Length = 252

 Score = 51.6 bits (122), Expect = 9e-006
 Identities = 49/175 (28%), Positives = 75/175 (42%), Gaps = 32/175 (18%)

Query: 130 RKSLGKVAELASGYGGWLGAWKAFGAEAHFADDDEMKKAILG--WRDASPEIVEFWGGQY 187
           ++ +GKV ELA GYGG +GA+ AF A  H   +    +A+        + E++E+    +
Sbjct: 5   QRQIGKVQELALGYGGGVGAFAAFAALYHIDLEAMAAQAVFPPLLLQEATELLEWTKANH 64

Query: 188 RR------------DPFTRKWTKE-------YFGLEGAAIMAILNPGKCYAVHDITYAVH 228
           R             D F R W          +  L+ A   AI +P   +    IT    
Sbjct: 65  RPTFDLSDRAWLACDVFKRAWRNAHPAIAAFWQALQFAVTDAIHHPETTHTCCGITVQYR 124

Query: 229 DDVLYCRLPSGRFLQYHQPRLVESDSWKGPEYQITFMGYNTNSQKGPVGWVRMDT 283
              L  RLPSGR L Y  PR+ E  +       +++MG +  ++K    W R+ T
Sbjct: 125 RAWLRLRLPSGRVLYYAAPRVDEHGA-------LSYMGTHPITRK----WTRLTT 168
>sp|Q9S1G2|DPO1_RHILE DNA polymerase I (POL I)
 gb|AAD45559.1|U86403_1 (U86403) DNA polymerase I [Rhizobium leguminosarum]
          Length = 1016

 Score = 43.9 bits (102), Expect = 0.002
 Identities = 24/61 (39%), Positives = 34/61 (55%)

Query: 56  LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
           +R  FI+ PG K +  D+S IE  VLA +A      + F     I+ M+AS++ GVP E 
Sbjct: 771 IRTAFISTPGHKLISADYSQIELRVLAHVAEIPQLTKAFEDGVDIHAMTASEMFGVPVEG 830

Query: 116 M 116
           M
Sbjct: 831 M 831
>ref|NP_540742.1| (NC_003317) DNA POLYMERASE I [Brucella melitensis]
 gb|AAL53006.1| (AE009615) DNA POLYMERASE I [Brucella melitensis]
          Length = 994

 Score = 43.1 bits (100), Expect = 0.003
 Identities = 24/61 (39%), Positives = 34/61 (55%)

Query: 56  LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
           +R  FIA PG K +  D+S IE  +LA +A      + F     I+ M+AS++ GVP E 
Sbjct: 751 IRTAFIAEPGNKLISADYSQIELRILAHVADIAQLKQAFADGIDIHAMTASEMFGVPVEG 810

Query: 116 M 116
           M
Sbjct: 811 M 811
>ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas aeruginosa]
 pir||F82958 DNA polymerase I PA5493 [imported] - Pseudomonas aeruginosa
           (strain PAO1)
 gb|AAG08878.1|AE004962_2 (AE004962) DNA polymerase I [Pseudomonas aeruginosa]
          Length = 913

 Score = 43.1 bits (100), Expect = 0.003
 Identities = 20/61 (32%), Positives = 37/61 (59%)

Query: 56  LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
           +R  F+A  G K +  D+S IE  ++A LA+ +  L+ FR    ++  +A+++ GVP E+
Sbjct: 674 IRQAFVAPQGYKLLAADYSQIELRIMAHLAKDDGLLDAFRHDLDVHRATAAEVFGVPLED 733

Query: 116 M 116
           +
Sbjct: 734 V 734
>ref|NP_384281.1| (NC_003047) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
           meliloti]
 emb|CAC41562.1| (AL591782) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
           meliloti]
          Length = 1004

 Score = 42.7 bits (99), Expect = 0.004
 Identities = 24/61 (39%), Positives = 34/61 (55%)

Query: 56  LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
           +R  F+A PG K V  D+S IE  VLA +A      + F     I+ M+AS++ GVP + 
Sbjct: 761 IRTAFVATPGHKLVSADYSQIELRVLAHVADIPQLRQAFADGVDIHAMTASEMFGVPVDG 820

Query: 116 M 116
           M
Sbjct: 821 M 821
>ref|NP_353146.1| (NC_003062) AGR_C_169p [Agrobacterium tumefaciens] [Agrobacterium
           tumefaciens str. C58 (Cereon)]
 ref|NP_530819.1| (NC_003304) DNA polymerase I [Agrobacterium tumefaciens str. C58
           (U. Washington)]
 gb|AAK85931.1| (AE007952) AGR_C_169p [Agrobacterium tumefaciens str. C58 (Cereon)]
 gb|AAL41135.1| (AE008985) DNA polymerase I [Agrobacterium tumefaciens str. C58 (U.
           Washington)]
          Length = 1031

 Score = 41.6 bits (96), Expect = 0.010
 Identities = 24/61 (39%), Positives = 33/61 (53%)

Query: 56  LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRLEVFRTHGKIYEMSASKISGVPFEE 115
           +R  FI+ PG K +  D+S IE  VLA +A        F     I+ M+AS++ GVP E 
Sbjct: 788 IRTAFISTPGHKLLSADYSQIELRVLAHVADIPQLRNAFENGIDIHAMTASEMFGVPVEG 847

Query: 116 M 116
           M
Sbjct: 848 M 848
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.322 0.138 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 189,865,443 Number of Sequences: 884779 Number of extensions: 7915968 Number of successful extensions: 14743 Number of sequences better than 5.0e-02: 10 Number of HSP's better than 0.1 without gapping: 7 Number of HSP's successfully gapped in prelim test: 3 Number of HSP's that attempted gapping in prelim test: 14723 Number of HSP's gapped (non-prelim): 12 length of query: 304 length of database: 277,083,050 effective HSP length: 119 effective length of query: 185 effective length of database: 171,794,349 effective search space: 31781954565 effective search space used: 31781954565 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 90 (39.3 bits) Query= orf40 [10968-11308] (339 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

sp|P06225|DPOL_BPSP2  DNA POLYMERASE >gi|67052|pir||DJBPS2 D...    87  3e-016
ref|NP_051006.1|  (NC_000935) P45 [Acyrthosiphon pisum bacte...    74  2e-012
ref|NP_297973.1|  (NC_002488) phage-related DNA polymerase [...    72  1e-011
ref|NP_299803.1|  (NC_002488) phage-related DNA polymerase [...    72  1e-011
gb|AAA23103.1|  (M94320) putative DNA polymerase [Citrus gre...    60  3e-008
gb|AAB25243.1|  DNA polymerase homolog [bacterium-like organ...    60  3e-008
ref|NP_299569.1|  (NC_002488) phage-related protein [Xylella...    44  0.003
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
 pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
 gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
           SPO2]
 emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
          Length = 648

 Score = 86.7 bits (213), Expect = 3e-016
 Identities = 82/274 (29%), Positives = 118/274 (42%), Gaps = 22/274 (8%)

Query: 72  GTPVYAEHPSTEILCLYYDLKDGKGRRAYIPGGPK-PTDLIEHIANGGPI-AAWNITFEF 129
           G   Y E P  EIL   Y   D   +   +  G   P +++  + +   I  A+N  FE 
Sbjct: 20  GVYAYTEAPDFEILLFAYAFDDDPVKIIDLAQGDTLPHEVLVALTSSKVIKTAYNANFE- 78

Query: 130 WIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGSLDAAAQALKTSP-KDKKGKQLINK 188
                 +AK      L   Q  C    A    LPG+LD  A+ALK S  KDK GK LI  
Sbjct: 79  ---RTCIAKHFNLMLLPA-QWRCTAVHATTLGLPGNLDGVAKALKLSAQKDKAGKALIRY 134

Query: 189 LCRPVSATKA-----RPLPRWTLYTAPQDYSDLYAYCDQDVKSEDHVSALTPDMTP--SE 241
              P   TKA     R LP       P+ +     YC QDV+ E  +        P  SE
Sbjct: 135 FSVPCKPTKANGQRVRNLPEHD----PEKWEKFKVYCIQDVEVERAIKNRISKFEPLESE 190

Query: 242 RETWLADQRINARGVQVDVESLDNALDIVDQTTRRFTMELAAITNGAVGSVSEVAKMTDF 301
            + W  DQ IN RGV++DV+ + +A+   +Q       E   +T   + + +  A++  +
Sbjct: 191 HKLWALDQEINDRGVRIDVDLVKHAIACDEQYQAGLIAEAKKLT--GLPNPNSTAQLKKW 248

Query: 302 LAGLGCRMHNLTADSVSEALERDDLNPTARRVLR 335
           L   G  + +L  D + E +E  + + T  RVLR
Sbjct: 249 LEEKGLTISSLAKDKIEELIENTN-DETVHRVLR 281
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
 sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
 gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
          Length = 993

 Score = 74.3 bits (181), Expect = 2e-012
 Identities = 58/234 (24%), Positives = 106/234 (44%), Gaps = 11/234 (4%)

Query: 107 PTDLIEHIANGGPIAAWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGSL 166
           P DL  ++ +   +  W+      +++ V+ KR     L L + +  + +A    LPG+L
Sbjct: 54  PADLKAYLDDSEILTVWH---NGGMFDTVILKRVLNIDLPLSRVHDTLVQALAHGLPGAL 110

Query: 167 DAAAQALKTSP---KDKKGKQLINKLCRPVSATKARPLPRWTLYTAPQDYSDLYAYCDQD 223
                    +    KDK+GK LI+ LC+P    K   + R T  T  +++     Y   D
Sbjct: 111 GLLCDIFNVNSDKAKDKEGKALISLLCKP--RPKNSKIQRATALTHAEEWQRFKDYAGSD 168

Query: 224 VKSEDHVSALTPD--MTPSERETWLADQRINARGVQVDVESLDNALDIVDQTTRRFTMEL 281
           + +   +    P+  M   E E W  DQ+IN RG+ +DVE   +AL  V+   +R +   
Sbjct: 169 ILAMREIYQHLPNWNMNVHETELWQLDQKINRRGMCMDVELAKSALTAVENEQKRLSTVT 228

Query: 282 AAITNGAVGSVSEVAKMTDFLA-GLGCRMHNLTADSVSEALERDDLNPTARRVL 334
             +T+ AV + ++   +   +A   G  + ++ A ++   +   D+ P  R +L
Sbjct: 229 QQLTDNAVQNATQRDALLQHIASAFGITLPDMQASTLQRRINDPDIPPALRELL 282
>ref|NP_297973.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
 pir||H82774 phage-related DNA polymerase XF0683 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF83493.1|AE003912_5 (AE003912) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
          Length = 726

 Score = 71.6 bits (174), Expect = 1e-011
 Identities = 55/196 (28%), Positives = 90/196 (45%), Gaps = 8/196 (4%)

Query: 145 LQLEQCYCDMAKARRFSLPGSLDAAAQALKTS---PKDKKGKQLINKLCRPVSATKARPL 201
           + LE+    MA+A   SLPG+L    + L+      KDK GK+L+   C+P  A     L
Sbjct: 93  IPLERWRDSMAQALAHSLPGALGTLCELLRVPVEHAKDKDGKRLVALFCKPRPAHCT--L 150

Query: 202 PRWTLYTAPQDYSDLYAYCDQDVKSEDHVSALTP--DMTPSERETWLADQRINARGVQVD 259
            R T  T P D++    Y  +DV +   V    P  +   +E   W  DQ IN RGV +D
Sbjct: 151 RRATRDTHPTDWAQFVEYAQRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLID 210

Query: 260 VESLDNALDIVDQTTRRFTMELAAITNGAVGSVSEVAKMTDFL-AGLGCRMHNLTADSVS 318
            + +  A+  V++  +       A+T GAV + ++   +   L    G  + ++   +V+
Sbjct: 211 TDLVQAAIRAVERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVA 270

Query: 319 EALERDDLNPTARRVL 334
             ++   L  T R +L
Sbjct: 271 RCVDDPALPETVRELL 286
>ref|NP_299803.1| (NC_002488) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
 pir||C82548 phage-related DNA polymerase XF2525 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF85323.1|AE004059_13 (AE004059) phage-related DNA polymerase [Xylella fastidiosa 9a5c]
          Length = 726

 Score = 71.6 bits (174), Expect = 1e-011
 Identities = 55/196 (28%), Positives = 90/196 (45%), Gaps = 8/196 (4%)

Query: 145 LQLEQCYCDMAKARRFSLPGSLDAAAQALKTS---PKDKKGKQLINKLCRPVSATKARPL 201
           + LE+    MA+A   SLPG+L    + L+      KDK GK+L+   C+P  A     L
Sbjct: 93  IPLERWRDSMAQALAHSLPGALGTLCELLRVPVEHAKDKDGKRLVALFCKPRPAHCT--L 150

Query: 202 PRWTLYTAPQDYSDLYAYCDQDVKSEDHVSALTP--DMTPSERETWLADQRINARGVQVD 259
            R T  T P D++    Y  +DV +   V    P  +   +E   W  DQ IN RGV +D
Sbjct: 151 RRATRDTHPTDWAQFVEYAQRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLID 210

Query: 260 VESLDNALDIVDQTTRRFTMELAAITNGAVGSVSEVAKMTDFL-AGLGCRMHNLTADSVS 318
            + +  A+  V++  +       A+T GAV + ++   +   L    G  + ++   +V+
Sbjct: 211 TDLVQAAIRAVERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVA 270

Query: 319 EALERDDLNPTARRVL 334
             ++   L  T R +L
Sbjct: 271 RCVDDPALPETVRELL 286
>gb|AAA23103.1| (M94320) putative DNA polymerase [Citrus greening
           disease-associated bacterium]
          Length = 206

 Score = 60.1 bits (144), Expect = 3e-008
 Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 26/166 (15%)

Query: 107 PTDLIEHIANGGPIA-AWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGS 165
           P+DL++++ +   +  A N  FE      +L K+     +  ++  C    AR   LP S
Sbjct: 54  PSDLLQYLRDETVMCVAHNSLFE-----RILFKKTLGIDIPSKRWICTSVLARTNGLPSS 108

Query: 166 LDAAAQALKTSP---KDKKGKQLINKLCR------PVSATKARPLPRWTLYTAPQDYSDL 216
           L  A  AL+      K ++GK LI + C+      P   T+A  +  W L+         
Sbjct: 109 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFRE------- 161

Query: 217 YAYCDQDVKSEDHV-SALTPDMTPSERETWLADQRINARGVQVDVE 261
             YC +DV++   +  +LTP + P ER  WL DQ IN RG ++D++
Sbjct: 162 --YCKRDVEATRELFKSLTP-LNPEERTLWLLDQEINDRGYRIDLD 204
>gb|AAB25243.1| DNA polymerase homolog [bacterium-like organism, citrus greening
           disease-associated, Peptide, 207 aa]
          Length = 207

 Score = 60.1 bits (144), Expect = 3e-008
 Identities = 49/166 (29%), Positives = 79/166 (47%), Gaps = 26/166 (15%)

Query: 107 PTDLIEHIANGGPIA-AWNITFEFWIWNMVLAKREGWPWLQLEQCYCDMAKARRFSLPGS 165
           P+DL++++ +   +  A N  FE      +L K+     +  ++  C    AR   LP S
Sbjct: 55  PSDLLQYLRDETVMCVAHNSLFE-----RILFKKTLGIDIPSKRWICTSVLARTNGLPSS 109

Query: 166 LDAAAQALKTSP---KDKKGKQLINKLCR------PVSATKARPLPRWTLYTAPQDYSDL 216
           L  A  AL+      K ++GK LI + C+      P   T+A  +  W L+         
Sbjct: 110 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFRE------- 162

Query: 217 YAYCDQDVKSEDHV-SALTPDMTPSERETWLADQRINARGVQVDVE 261
             YC +DV++   +  +LTP + P ER  WL DQ IN RG ++D++
Sbjct: 163 --YCKRDVEATRELFKSLTP-LNPEERTLWLLDQEINDRGYRIDLD 205
>ref|NP_299569.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
 pir||G82574 phage-related protein XF2290 [imported] - Xylella fastidiosa
           (strain 9a5c)
 gb|AAF85089.1|AE004041_1 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
          Length = 312

 Score = 43.5 bits (101), Expect = 0.003
 Identities = 30/126 (23%), Positives = 56/126 (43%), Gaps = 3/126 (2%)

Query: 212 DYSDLYAYCDQDVKSEDHVSALTP--DMTPSERETWLADQRINARGVQVDVESLDNALDI 269
           D++    Y  +DV +   V    P  +   +E   W  DQ IN RGV +D + +  A+  
Sbjct: 2   DWAQFVEYARRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLIDTDLVQAAIGA 61

Query: 270 VDQTTRRFTMELAAITNGAVGSVSEVAKMTDFL-AGLGCRMHNLTADSVSEALERDDLNP 328
           V++  +       A+T GAV + ++   +   L    G  + ++   +V+  ++   L  
Sbjct: 62  VERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVARCVDDPALPE 121

Query: 329 TARRVL 334
           T R +L
Sbjct: 122 TVRELL 127
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.317 0.133 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 194,837,355 Number of Sequences: 884779 Number of extensions: 8603425 Number of successful extensions: 18374 Number of sequences better than 5.0e-02: 7 Number of HSP's better than 0.1 without gapping: 3 Number of HSP's successfully gapped in prelim test: 4 Number of HSP's that attempted gapping in prelim test: 18363 Number of HSP's gapped (non-prelim): 7 length of query: 339 length of database: 277,083,050 effective HSP length: 121 effective length of query: 218 effective length of database: 170,024,791 effective search space: 37065404438 effective search space used: 37065404438 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 91 (39.7 bits) Query= orf41 [11307-11478] (170 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.132 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 95,252,776 Number of Sequences: 884779 Number of extensions: 3764828 Number of successful extensions: 8556 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 8556 Number of HSP's gapped (non-prelim): 0 length of query: 170 length of database: 277,083,050 effective HSP length: 111 effective length of query: 59 effective length of database: 178,872,581 effective search space: 10553482279 effective search space used: 10553482279 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 86 (37.7 bits) Query= orf42 [11499-11837] (337 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.319 0.138 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 112,778,215 Number of Sequences: 884779 Number of extensions: 4929411 Number of successful extensions: 8412 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 8412 Number of HSP's gapped (non-prelim): 0 length of query: 337 length of database: 277,083,050 effective HSP length: 120 effective length of query: 217 effective length of database: 170,909,570 effective search space: 37087376690 effective search space used: 37087376690 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 91 (39.7 bits) Query= orf43 [11841-12192] (350 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.135 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 198,552,102 Number of Sequences: 884779 Number of extensions: 8335154 Number of successful extensions: 18488 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 18488 Number of HSP's gapped (non-prelim): 0 length of query: 350 length of database: 277,083,050 effective HSP length: 121 effective length of query: 229 effective length of database: 170,024,791 effective search space: 38935677139 effective search space used: 38935677139 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 91 (39.7 bits) Query= orf44 [12218-12506] (287 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.309 0.127 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 164,641,227 Number of Sequences: 884779 Number of extensions: 7046383 Number of successful extensions: 15184 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 15184 Number of HSP's gapped (non-prelim): 0 length of query: 287 length of database: 277,083,050 effective HSP length: 119 effective length of query: 168 effective length of database: 171,794,349 effective search space: 28861450632 effective search space used: 28861450632 T: 11 A: 40 X1: 16 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.7 bits) S2: 90 (39.3 bits) Query= orf45 [12600-12686] (85 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.325 0.139 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 47,940,177 Number of Sequences: 884779 Number of extensions: 1648020 Number of successful extensions: 3403 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3403 Number of HSP's gapped (non-prelim): 0 length of query: 85 length of database: 277,083,050 effective HSP length: 61 effective length of query: 24 effective length of database: 223,111,531 effective search space: 5354676744 effective search space used: 5354676744 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.6 bits) S2: 84 (37.0 bits) Query= orf46 [12684-13235] (550 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAB63670.1|  (AJ251789) hypothetical protein [Lactobacil...   199  6e-050
ref|NP_511031.1|  (NC_003291) helicase [Bacteriophage PSA] >...   189  6e-047
ref|NP_050128.1|  (NC_000896) putative DEAH-family helicase ...   187  2e-046
ref|NP_049952.1|  (NC_000871) orf443 gp [Streptococcus therm...   185  9e-046
ref|NP_049999.1|  (NC_000872) orf443 gp [Streptococcus therm...   185  9e-046
pir||T13299  probable helicase - Streptococcus phage phi-O12...   185  9e-046
emb|CAC04160.1|  (AJ292531) putative helicase [Lactococcus b...   183  3e-045
gb|AAL74161.1|AF442520_1  (AF442520) putative helicase [Stre...   183  5e-045
ref|NP_471919.1|  (NC_003212) similar to DEAH-family helicas...   177  2e-043
sp|P24125|V51K_BPL79  51.5 kDa protein >gi|76364|pir||JU0445...   174  2e-042
ref|NP_490263.1|  (NC_003276) putative helicase [Nostoc sp. ...   155  8e-037
ref|NP_459877.1|  (NC_003197) Fels-1 prophage; putative DNA ...   114  2e-024
ref|NP_231273.1|  (NC_002505) helicase-related protein [Vibr...   110  5e-023
ref|NP_142208.1|  (NC_000961) hypothetical protein [Pyrococc...    77  5e-013
gb|AAF12716.1|AF066865_14  (AF066865) unknown [bacteriophage...    66  1e-009
ref|NP_559418.1|  (NC_003364) helicase, possible DNA repair ...    65  2e-009
ref|NP_248512.1|  (NC_000909) ATP dependent RNA helicase, pu...    55  2e-006
sp|O02494|IF4A_CRYPV  Eukaryotic initiation factor 4A (eIF-4...    50  6e-005
ref|NP_421775.1|  (NC_002696) excinuclease ABC, subunit B [C...    48  3e-004
ref|NP_375993.1|  (NC_003106) 621aa long conserved hypotheti...    44  0.006
>emb|CAB63670.1| (AJ251789) hypothetical protein [Lactobacillus casei bacteriophage
           A2]
          Length = 455

 Score =  199 bits (506), Expect = 6e-050
 Identities = 139/422 (32%), Positives = 205/422 (47%), Gaps = 52/422 (12%)

Query: 1   MFKLRDYQQLAYNRI---MAAWQEYRSVLAVLPTGAGKTVIFSKIIH---DHVGAAAAIV 54
           MF+L  YQ+   N+    +AA    +SVL V P G+GK+VI ++I     +  G    +V
Sbjct: 1   MFQLHPYQKELVNQAREKLAAGN--KSVLLVSPAGSGKSVIIAEIARLAVERSGHVMFMV 58

Query: 55  HRREIVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLT 114
           HR+E+V QI                                     Q  Q   I +Q  T
Sbjct: 59  HRQELVNQIV------------------------------------QTFQADEIDLQATT 82

Query: 115 SKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXX 174
             +    A  +  + + +L + DE HH + +  + K      +   L  +A+P R +   
Sbjct: 83  IMTVGKIANRLNRLPRPSLIITDESHHSLAK-TYRKIYSFYADVPRLGFSASPWRMNGQG 141

Query: 175 XXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKAR 234
                    ++M+EGP+  WLI+  +L+ + Y AP T ID++ L     GD+  K++ A 
Sbjct: 142 LGNVY----ESMVEGPSVKWLIDHQYLAPYDYYAP-TLIDVQKLQTSSTGDYTNKSMDAA 196

Query: 235 VVESHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERD 294
           V ++ I GDVV HY+     ++AIV+A  ++ ++Q+ E F+ AG TA     +T  G+RD
Sbjct: 197 VPKA-IFGDVVSHYQHLAGGRQAIVYAHSIEASKQVVEAFQSAGITAVHADAKTPKGQRD 255

Query: 295 RLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEY 354
           R++ +F+ G I IL NVDL  EGF+VP V  +I+ RPT SL   +Q   R +R   GK  
Sbjct: 256 RIMADFKAGKITILSNVDLISEGFNVPDVGVIIMLRPTASLVLDIQQSMRGMRYKPGKR- 314

Query: 355 AIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKACPYC 414
           +IIID V N  R G P     WTL  R K +   +   P KTCDAC        K CP C
Sbjct: 315 SIIIDHVANAYRFGLPDTEHEWTLNDRPKKKKRDNTGPPIKTCDACYAVIPVQCKICPIC 374

Query: 415 GH 416
           GH
Sbjct: 375 GH 376
>ref|NP_511031.1| (NC_003291) helicase [Bacteriophage PSA]
 emb|CAC85606.1| (AJ312240) helicase [Bacteriophage PSA]
          Length = 418

 Score =  189 bits (480), Expect = 6e-047
 Identities = 126/388 (32%), Positives = 195/388 (49%), Gaps = 47/388 (12%)

Query: 32  GAGKTVIFSKIIH---DHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXXXX 88
           GAGK+VI S+II     +      +VHR+E++ QI  +L    V                
Sbjct: 9   GAGKSVILSEIIRMTTHNKNYVLFLVHRKELIDQIRNTLIMNEVD--------------- 53

Query: 89  XXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIW 148
                   SY+          VQT+  +        +    + +L + DE HH +    +
Sbjct: 54  -------MSYVKLGM------VQTIVRR--------LNKTSEPSLIIIDESHHVLANS-Y 91

Query: 149 AKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRA 208
            K +H    AK++  TATP R +             T+IE     WLIE  FL+ + Y A
Sbjct: 92  KKIIHHFSKAKVVGFTATPVRINGGGLGDIN----DTLIEKVNVKWLIENQFLAHYKYYA 147

Query: 209 PQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDTAE 268
           P+T +    L V + G+F+  +L  +  +  I GDV++HY+K  + ++AI++A+ +  +E
Sbjct: 148 PET-VQTETLNVKRTGEFDMTSLDDQFNKRMIWGDVIKHYQKLADGEQAILYASSIYQSE 206

Query: 269 QMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVIL 328
           +MA  F  AG ++A + G+T    RD ++++F +G +++L N+DL  EGFDVP    VI+
Sbjct: 207 KMAASFNAAGISSAHIDGKTPKPIRDDIIKQFREGELKVLCNLDLIGEGFDVPDCSTVIM 266

Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSG 388
            RPTQSL+ ++Q   R +R  +GK  A IID V N +R G P   RTW+L+ R KG +S 
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYKQGKT-ATIIDHVGNVKRFGLPDMERTWSLEPR-KGSNST 324

Query: 389 SDTVPQKTCDACTIPYEAYNKACPYCGH 416
               P K C  C +      K C +CGH
Sbjct: 325 KAEAPVKICKECFMTVSQTAKKCEHCGH 352
>ref|NP_050128.1| (NC_000896) putative DEAH-family helicase [Lactobacillus
           bacteriophage phi adh]
 emb|CAB52498.1| (AJ131519) putative DEAH-family helicase [Lactobacillus
           bacteriophage phi adh]
          Length = 455

 Score =  187 bits (475), Expect = 2e-046
 Identities = 132/443 (29%), Positives = 214/443 (47%), Gaps = 50/443 (11%)

Query: 4   LRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIFSKI---IHDHVGAAAAIVHRREI 59
           +R YQ+   + +  + +  +++VL   P G+GK++  +++   I  +      +VHRRE+
Sbjct: 5   IRSYQKKLIDDLRLSLKHGHKNVLIQSPAGSGKSITMAELAKRITSNGERVLFVVHRREL 64

Query: 60  VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
           V QI  + +S+GV                                C +  VQT + +   
Sbjct: 65  VKQIKNTFSSWGVNMNL----------------------------CEIYMVQTASRR--- 93

Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXX 179
                +  +      + DE HH + +  + K  +  +NA +L  TATP R          
Sbjct: 94  -----LEKLMTPNYILVDEAHHSLAK-TYKKINNHFKNAHILGFTATPVRLSGKGFKDIY 147

Query: 180 XXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESH 239
                 ++ GP   WLIE  +L+ + Y +    ID   L     GD+  K+++    ++ 
Sbjct: 148 ----DDLVLGPKISWLIENHYLAPYTYYSVNL-IDQTKLKKSSTGDYTHKSIE-NAGKNI 201

Query: 240 IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
           + GDV+Q Y K+  N +AI+++  V + +Q+A+EF K    A  V G+T+  +RD+ +++
Sbjct: 202 VYGDVIQSYHKFANNTKAIIYSYSVHSCQQIAKEFNKNNIPAKEVDGKTKKEDRDKAMQD 261

Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
           F  G I+IL+N +L+ EG DVP  E VI+ RPTQSL+ F+Q   R +R    K+ AIIID
Sbjct: 262 FRDGKIKILVNAELYGEGVDVPDCETVIMLRPTQSLSLFIQQSMRCMRYQPDKQ-AIIID 320

Query: 360 PVRNWERHGPPTAVRTWTLKGREK--GQSSGSDTVPQKTCDACTIPYEAYNKACPYCGHV 417
            V N+ R G P   R WTL+ R K   +  GSD +  KTC  C     A    CP CG+ 
Sbjct: 321 QVANYTRFGLPDMDRVWTLEDRAKHPQREGGSDGIAIKTCPNCFGVIMASYHKCPLCGYS 380

Query: 418 NVYPDRSAPDKVDGDLVELDLDA 440
                R   +    +L +++LDA
Sbjct: 381 FEAEFRKLAEDKRAELEKINLDA 403
>ref|NP_049952.1| (NC_000871) orf443 gp [Streptococcus thermophilus bacteriophage
           Sfi19]
 gb|AAC97921.1| (AF077306) putative helicase; gp443 [Streptococcus thermophilus
           bacteriophage Sfi19]
 gb|AAD44071.1|AF115102_30 (AF115102) orf443 gp [Streptococcus thermophilus bacteriophage
           Sfi19]
          Length = 443

 Score =  185 bits (470), Expect = 9e-046
 Identities = 132/423 (31%), Positives = 207/423 (48%), Gaps = 54/423 (12%)

Query: 3   KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
           +LR+YQ    N I  +  +  + ++   P  +GKTV+ + I          I+   HR+E
Sbjct: 2   ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61

Query: 59  IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
           I  Q+  +    GV                           D N    ++++ ++T  + 
Sbjct: 62  INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90

Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
             D      +++ ++ + DE HH V+   + K +    N+ +L  T TP R D       
Sbjct: 91  NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143

Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
               A  ++ G    WL + G ++ F Y AP   ID  +L   + G+F  K++    ++ 
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197

Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
            I GDV++HY K  + K+AIV+   V+ +E ++  F + GYT+ A+SG+T    RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257

Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
            F  G + I++N +LF EG D+P V+  I+ RPTQSL+ +LQ   RAL   EGK  AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316

Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
           D V N +RHG P A R W+LKG  K +     G  T   +TCD C   + +  + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374

Query: 416 HVN 418
           H N
Sbjct: 375 HEN 377
>ref|NP_049999.1| (NC_000872) orf443 gp [Streptococcus thermophilus bacteriophage
           Sfi21]
 ref|NP_056709.1| (NC_002214) putative helicase [Streptococcus thermophilus
           bacteriophage Sfi11]
 gb|AAC72435.1| (AF004379) orf443 [Streptococcus thermophilus bacteriophage Sfi21]
 gb|AAD44104.1|AF115103_34 (AF115103) orf443 gp [Streptococcus thermophilus bacteriophage
           Sfi21]
 gb|AAF63056.1|AF158600_10 (AF158600) putative helicase [Streptococcus thermophilus
           bacteriophage Sfi11]
 gb|AAF63085.1|AF158601_13 (AF158601) putative helicase [Streptococcus thermophilus
           bacteriophage SFi18]
          Length = 443

 Score =  185 bits (470), Expect = 9e-046
 Identities = 132/423 (31%), Positives = 207/423 (48%), Gaps = 54/423 (12%)

Query: 3   KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
           +LR+YQ    N I  +  +  + ++   P  +GKTV+ + I          I+   HR+E
Sbjct: 2   ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61

Query: 59  IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
           I  Q+  +    GV                           D N    ++++ ++T  + 
Sbjct: 62  INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90

Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
             D      +++ ++ + DE HH V+   + K +    N+ +L  T TP R D       
Sbjct: 91  NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143

Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
               A  ++ G    WL + G ++ F Y AP   ID  +L   + G+F  K++    ++ 
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197

Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
            I GDV++HY K  + K+AIV+   V+ +E ++  F + GYT+ A+SG+T    RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257

Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
            F  G + I++N +LF EG D+P V+  I+ RPTQSL+ +LQ   RAL   EGK  AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316

Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
           D V N +RHG P A R W+LKG  K +     G  T   +TCD C   + +  + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374

Query: 416 HVN 418
           H N
Sbjct: 375 HEN 377
>pir||T13299 probable helicase - Streptococcus phage phi-O1205
 gb|AAC79526.1| (U88974) ORF10 [Streptococcus thermophilus temperate bacteriophage
           O1205]
          Length = 443

 Score =  185 bits (470), Expect = 9e-046
 Identities = 131/423 (30%), Positives = 208/423 (48%), Gaps = 54/423 (12%)

Query: 3   KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
           +LR+YQ    N I  +  +  + ++   P  +GKTV+ + I          I+   HR+E
Sbjct: 2   ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61

Query: 59  IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
           I  Q+  +    GV                           D N    ++++ ++T  + 
Sbjct: 62  INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKIAR 90

Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
             D      +++ ++ + DE HH V+   + K +    N+ +L  T TP R D       
Sbjct: 91  NLDR-----IQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143

Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
               A  ++ G +  WL + G ++ F Y AP   ID  +L   + G+F  K++    ++ 
Sbjct: 144 ---IADDIVLGKSVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197

Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
            I GDV++HY K  + K+AIV+   V+ +E ++  F + GYT+ A+SG+T    RDR ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257

Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
            F  G + I++N +LF EG D+P V+  I+ RPTQSL+ +LQ   RAL   EGK  AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316

Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
           D V N +RHG P A R W++KG  K +     G  T   +TCD C   + +  + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSIKGVNKTKKKLKLGEPTT--RTCDECYATFWSAERICPMCG 374

Query: 416 HVN 418
           H N
Sbjct: 375 HEN 377
>emb|CAC04160.1| (AJ292531) putative helicase [Lactococcus bacteriophage phi31]
          Length = 448

 Score =  183 bits (465), Expect = 3e-045
 Identities = 145/442 (32%), Positives = 212/442 (47%), Gaps = 56/442 (12%)

Query: 4   LRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIF---SKIIHDHVGAAAAIVHRREI 59
           LRDYQ+     I +++ +  RS++   P  +GKTV+    SK   D         HR+EI
Sbjct: 3   LRDYQEELVESIKSSFLKGNRSIIVQSPPRSGKTVVMADISKGATDKKNHVLFFSHRKEI 62

Query: 60  VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
             Q+   + +F +                         Y+       + SVQ+L  K   
Sbjct: 63  NDQV---VKTFELNQV-------------------NMEYVT------IGSVQSLVRK--- 91

Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXX 179
                +  +    + + DE HH ++   + K +    NA  LF T TP R +        
Sbjct: 92  -----IDELPLPDIILVDEAHH-IKANSYKKILEAFPNALKLFFTGTPIRLNGQGFED-- 143

Query: 180 XXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESH 239
              A  +I G +  WL E G ++ F Y AP   ID   L    +GDF  K++     +  
Sbjct: 144 --MADDLITGKSIKWLQEHGNIAPFKYYAPNI-IDTSQLKK-TSGDFTQKSMD-EAFKRA 198

Query: 240 IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
           I GDV+ HY K  + K+AI +A +V TA+ ++EEF +AG TA  V G+T   ER+ ++ +
Sbjct: 199 IYGDVIAHYNKLSKGKQAICYAHNVATAQHISEEFNQAGITAEVVHGKTPKTEREAIMNK 258

Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
           F  G I +LINV+LF EG D+P V   I+ RPTQSL+ FLQ   R L    GK  AI+ID
Sbjct: 259 FRAGEILVLINVELFTEGVDLPDVTTCIMLRPTQSLSLFLQFAMRPLNPKPGKT-AILID 317

Query: 360 PVRNWERHGPPTAVRTWTLKGREKGQS----SGSDTVPQKTCDACTIPYEAYN-KACPYC 414
            V N+ RHG P   R WTL G  K +S     G  T+  K C+ C   +++ N + CPYC
Sbjct: 318 HVGNYTRHGLPNEDREWTLSGISKKRSEYNTKGELTI--KQCEMCFGCFDSSNARTCPYC 375

Query: 415 GHVNVYPDRSAPDKVDGDLVEL 436
           GH     +R   +  + +L E+
Sbjct: 376 GHEPELTERELENIKEIELQEI 397
>gb|AAL74161.1|AF442520_1 (AF442520) putative helicase [Streptococcus thermophilus
           bacteriophage k3]
          Length = 443

 Score =  183 bits (464), Expect = 5e-045
 Identities = 131/423 (30%), Positives = 207/423 (47%), Gaps = 54/423 (12%)

Query: 3   KLRDYQQLAYNRI-MAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIV---HRRE 58
           +LR+YQ    N I  +  +  + ++   P  +GKTV+ + I          ++   HR+E
Sbjct: 2   ELRNYQNDLVNGIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTVLFFSHRKE 61

Query: 59  IVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKST 118
           I  Q+  +    GV                           D N    ++++ ++T  + 
Sbjct: 62  INEQVVNTFKRNGV---------------------------DMN----LVTIDSVTKVAR 90

Query: 119 MNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXX 178
             D      + + ++ + DE HH V+   + K +    N+ +L  T TP R D       
Sbjct: 91  NLDR-----ISEPSIILIDEAHH-VKAKTYLKIIEYYINSIVLMFTGTPARLDGSGFDD- 143

Query: 179 XXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVES 238
               A  ++ G +  WL E G ++ F Y AP   ID  +L   + G+F  K++    ++ 
Sbjct: 144 ---IADDIVLGKSVKWLQENGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197

Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLE 298
            I GDV++HY K  + K+AIV+   V+ +E ++  F+K GYT+ A+SG+T    R+R ++
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFKKNGYTSIAISGKTPPEVRERAMQ 257

Query: 299 EFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
            F    + I++N +LF EG D+P V+  I+ RPTQSL+ +LQ   RAL   EGK  AIII
Sbjct: 258 TFRDRELTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316

Query: 359 DPVRNWERHGPPTAVRTWTLKGREKGQSS---GSDTVPQKTCDACTIPYEAYNKACPYCG 415
           D V N +RHG P A R W+LKG  K +     G  T   +TCD C   + +  + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGINKTKKKLKLGEPTT--RTCDECYATFWSAERICPLCG 374

Query: 416 HVN 418
           H N
Sbjct: 375 HEN 377
>ref|NP_471919.1| (NC_003212) similar to DEAH-family helicase [Listeria innocua]
 emb|CAC97816.1| (AL596173) similar to DEAH-family helicase [Listeria innocua]
          Length = 418

 Score =  177 bits (450), Expect = 2e-043
 Identities = 121/388 (31%), Positives = 188/388 (48%), Gaps = 47/388 (12%)

Query: 32  GAGKTVIFSKIIH---DHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXXXX 88
           GAGK+VI S+II     +      +VHR+E++ QI  +L    V                
Sbjct: 9   GAGKSVILSEIIRMTTRNKNNVLFLVHRKELIDQIRNTLTMNDVD--------------- 53

Query: 89  XXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQGIW 148
                    +++         VQT+  +        +    +  L + DE HH +    +
Sbjct: 54  -------MKFVNLGM------VQTVVRR--------LEKTSEPALIIIDESHHVLANS-Y 91

Query: 149 AKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRA 208
            K ++   NAK++  TATP R +              +IE     WLIE  FL+ + Y A
Sbjct: 92  KKIINHFSNAKVVGFTATPVRINGGGLGDIN----DMLIEKVNVKWLIENQFLAPYKYFA 147

Query: 209 PQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDTAE 268
           P+  +    L + + G+F+   L  +  +  I GDV++HY+K    ++AI++A+ +  +E
Sbjct: 148 PEI-VQTETLDIKRTGEFDMTGLDDQFNKRMIWGDVIKHYQKLANGEQAILYASSLYQSE 206

Query: 269 QMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVIL 328
           +MA  F   G T+A + G+T    RD +++ F +G I++L N+DL  EGFDVP    VI+
Sbjct: 207 KMAMSFESVGITSAHIDGKTPKSIRDDIIQRFREGEIKVLCNLDLIGEGFDVPDCSTVIM 266

Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSG 388
            RPTQSL+ ++Q   R +R   GK  AIIID V N  R G P   R W+L+ + KG +S 
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYRTGKT-AIIIDHVGNVNRFGLPDMERKWSLEAK-KGSNSN 324

Query: 389 SDTVPQKTCDACTIPYEAYNKACPYCGH 416
               P K C  C +   + N  C +CGH
Sbjct: 325 KAEAPVKICPDCFMTVLSTNIKCSHCGH 352
>sp|P24125|V51K_BPL79 51.5 kDa protein
 pir||JU0445 51K protein GP51C - Lactococcus phage mi7-9
 gb|AAA32363.1| (M36388) conserved bacteriophage encoded protein [Lactococcus
           bacteriophage (ISOLATE 7-9)]
          Length = 452

 Score =  174 bits (442), Expect = 2e-042
 Identities = 129/394 (32%), Positives = 188/394 (46%), Gaps = 55/394 (13%)

Query: 30  PTGAGKTVIFS---KIIHDHVGAAAAIVHRREIVGQISLSLASFGVKHXXXXXXXXXXXX 86
           P G+GK+V+ S   K+  +  G    +VHR+E++ QI+ S    GV              
Sbjct: 5   PPGSGKSVVISEITKLATEKGGRVLFLVHRKELIDQITNSFKVHGVP------------- 51

Query: 87  XXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMRWVKQVTLSVYDEGHHYVEQG 146
                               +  V+ LT     N  +++R   + TL + DEGHH  +  
Sbjct: 52  --------------------LNQVELLTVGKAKNRLSVLR---KPTLIITDEGHHG-KAA 87

Query: 147 IWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLIEQGFLSKFVY 206
            + K      +   L  TATP R                MIEG T  WLI    L+ + Y
Sbjct: 88  TYQKIYEFYADVPRLGFTATPWRMSGDGFKDTYDY----MIEGKTVEWLINNKRLAPYQY 143

Query: 207 RAPQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVDT 266
            +  + ID+  L + KNGD++ +++   + ++ I GDVVQ Y K    ++AI++A  V+ 
Sbjct: 144 YSLPS-IDISKLRI-KNGDYSNQSIDDALGKT-IFGDVVQEYVKHANGQKAILYAHSVEA 200

Query: 267 AEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECV 326
           ++  A+EF++AG  A  V  +T   ERD+L+ +F  G I++L NVDL  EGFDVP     
Sbjct: 201 SQSFAKEFQEAGIKAVHVDAKTPKNERDKLMLDFRNGRIKVLCNVDLISEGFDVPDCTVT 260

Query: 327 ILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWT--LKG--RE 382
           IL RPT+SL  FLQ   R++R    K  AIIID V NW  HG P     W    +G  ++
Sbjct: 261 ILCRPTKSLVLFLQQSMRSMRYQHRKT-AIIIDNVMNWHAHGLPDTHHDWKDYFEGGWKK 319

Query: 383 KGQSSGSDTVPQKTCDACTIPYEAYNKACPYCGH 416
           KGQ    + V  K C  C+  +    K C  CGH
Sbjct: 320 KGQ---KNIVQAKQCPDCSAMWPLSQKMCNLCGH 350
>ref|NP_490263.1| (NC_003276) putative helicase [Nostoc sp. PCC 7120]
 dbj|BAB78241.1| (AP003600) ORF_ID:alr7157~putative helicase [Nostoc sp. PCC 7120]
          Length = 629

 Score =  155 bits (393), Expect = 8e-037
 Identities = 121/421 (28%), Positives = 182/421 (42%), Gaps = 56/421 (13%)

Query: 1   MFKLRDYQQLAYNRIMAAWQE-YRSVLAVLPTGAGKTVIFSKIIHDHVG---AAAAIVHR 56
           MF LRDYQQ   ++  AAW    R VL  L TG GKTVIF++I             + HR
Sbjct: 1   MFNLRDYQQDLISKTFAAWSSGIRKVLLQLSTGGGKTVIFAEIASKMTAQGEGVLVVAHR 60

Query: 57  REIVGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSK 116
            E++ Q +  L +                            Y   ++   + S+QTL  +
Sbjct: 61  EELILQAAEKLTA----------------VTKLQPGIIKAGYKSTDSPLQIASIQTLARR 104

Query: 117 STMNDATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXX 176
            T   A ++         + DE HH      + K +    +A +L +TATP R D     
Sbjct: 105 QTYPSAQLV---------IIDEAHH-SSANSYRKLLDAYPHALVLGLTATPRRED----G 150

Query: 177 XXXXXFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVV 236
                    +I   +T  LI  G+L+ +   A           V +  DF  K L+  V 
Sbjct: 151 YGLRDIFDHLICSVSTKELITLGYLTDYKLIA---GFKYSRHKVPQKRDFTRKELE-EVA 206

Query: 237 ESHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRL 296
             +   +V++ ++ +   K+ ++FA +V  ++Q+A  F   G T   + GET   ER  +
Sbjct: 207 SDYKPSEVLKQWQNFCAGKKTVIFAVNVMHSKQIAAAFCADGITCEHLDGETPHNERQAI 266

Query: 297 LEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAI 356
           L  F  G  Q++ N  +  EGFD P     ++ARPT S+  +LQM+GR LR   GK+YA 
Sbjct: 267 LNRFRSGQTQVISNCAILTEGFDCPDSSAAVIARPTSSVTLWLQMIGRVLRPAPGKDYAT 326

Query: 357 IIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKACPYCGH 416
           I+D   NW R G P   R W+L+                +CD    P    ++ CP+C H
Sbjct: 327 ILDMTDNWFRLGRPCDNRKWSLE--------------PVSCD----PDTQGSRCCPHCHH 368

Query: 417 V 417
           V
Sbjct: 369 V 369
>ref|NP_459877.1| (NC_003197) Fels-1 prophage; putative DNA or RNA helicases of
           superfamily II [Salmonella typhimurium LT2]
 gb|AAL19836.1| (AE008738) Fels-1 prophage; putative DNA or RNA helicases of
           superfamily II [Salmonella typhimurium LT2]
          Length = 527

 Score =  114 bits (286), Expect = 2e-024
 Identities = 100/413 (24%), Positives = 172/413 (41%), Gaps = 46/413 (11%)

Query: 19  WQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAI---VHRREIVGQISLSLASFGVKHX 75
           W+ + + +   PTG+GKT + + I+   V     +   V  + ++ Q +     +G+   
Sbjct: 6   WKSHNTFMVYAPTGSGKTGLAAFIVDGFVSRGMRVMFCVPYQILITQTASRFIEYGLS-- 63

Query: 76  XXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMNDATIMR-WVKQVTLS 134
                              G  +         + +Q  ++     D  I R +   + L 
Sbjct: 64  ---------------GDEIGYVWASHPNHNPALKIQIASA-----DTLIRREFPDNIDLL 103

Query: 135 VYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHW 194
           + DE H   ++ +           K++ ++ TP              +   +I+  T   
Sbjct: 104 IIDEAHLRKKRILQDIERLRASGVKVIGLSGTP-------FSPFLGKYYDRLIKPTTIGE 156

Query: 195 LIEQGFLSKFVYRAPQTDIDLRDLAVGKN----GDFNAKALKARVVESHIVGDVVQHYRK 250
           LI++G LSK+ + AP T  DL+ +    +     D+N   L   +  S +VGD+VQ++ +
Sbjct: 157 LIQRGDLSKYEFYAP-TKPDLKGVKTKSSLEYGSDYNEAQLAEIMCGSTLVGDIVQNWLE 215

Query: 251 WGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILIN 310
            G +   I F  +V  A  +  +F  AG  A  ++ +T   ER  ++  FE G  +I+++
Sbjct: 216 NGRDLPTIAFCVNVAHANYLTIQFNLAGINAEVMTADTPVDERQTIIHRFETGATKIIVS 275

Query: 311 VDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPP 370
           V +   GFD   V C+I ARPT+S  ++LQ +GR LR   GKE  +I D      R G P
Sbjct: 276 VGVLVAGFD-SDVRCIIYARPTKSEIRWLQAIGRGLRTAPGKESCLIFDHSGTVHRLGYP 334

Query: 371 TAVRTWTLKGREKGQSSGSDTVPQ-------KTCDACTIPYEAYNKACPYCGH 416
            ++    L G+  G    +    Q       + C  C     A    CP CGH
Sbjct: 335 DSIEYDDLPGKSDGMEESARRSSQERAEKLPRECSQCHYMKPAGVYVCPKCGH 387
>ref|NP_231273.1| (NC_002505) helicase-related protein [Vibrio cholerae]
 pir||B82176 helicase-related protein VC1636 [imported] - Vibrio cholerae (group
           O1 strain N16961)
 gb|AAF94787.1| (AE004241) helicase-related protein [Vibrio cholerae]
          Length = 657

 Score =  110 bits (274), Expect = 5e-023
 Identities = 111/441 (25%), Positives = 186/441 (42%), Gaps = 74/441 (16%)

Query: 1   MFKLRDYQQLAYNRIMAAWQEYRS-VLAVLPTGAGKTVIFSKIIHDHVGAAAAIVHRREI 59
           M+ LR YQ  +   ++  ++++ +  + VLPTGAGK+++ +++     G    + H +E+
Sbjct: 64  MYTLRPYQADSVKAVVHYFRQHSTPAVIVLPTGAGKSLVIAELARLARGRVLVLAHVKEL 123

Query: 60  VGQISLSLASFGVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTM 119
           V Q       +G+                      G+   DQ  Q    SVQ++      
Sbjct: 124 VEQNHAKYEGYGL-------------TGAIFSAGLGRKETDQ--QVVFASVQSVVRN--- 165

Query: 120 NDATIMRWVKQVTLSVYDEGHHYVEQ--GIWAKAV-HLME---NAKLLFVTATPERADXX 173
               +  +  Q +L V DE H   +     + K + HL+E     K+L +TATP R    
Sbjct: 166 ----LSEFQNQFSLLVIDECHRVPDDKNSSYQKVITHLLELNPGMKVLGLTATPYRLGMG 221

Query: 174 XXXXXXXX----------FAQTMIEGPTTHWLIEQGFLSKF-VYRAPQTDIDLRDLAVGK 222
                             F   + E P  H+L+++GFL+   +   P    D   L    
Sbjct: 222 WIYQYHTRGLVRSEEPRFFRDCIFELPI-HYLLDEGFLTPAQLIDTPVMSYDFSQLKPAN 280

Query: 223 NGDFNAKALKARVVESH-----IVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKA 277
            G +    L   + +S      IV  +++  +     +  ++FA  V  AE++       
Sbjct: 281 TGRYRESELDLVIEQSQRATPQIVAQIIELAKA---RQGVMIFAATVRHAEEIYRLLPTE 337

Query: 278 GYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAK 337
              AA V G+T + ERDR+++ F+Q  I+ L+NV +   GFD P V+ + + RPT+S++ 
Sbjct: 338 H--AALVIGDTPTPERDRIIQAFKQQQIKFLVNVSVLTTGFDAPHVDLIAILRPTESVSL 395

Query: 338 FLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTC 397
           + Q+VGR LR+  GK   +++D   N            + L   E G     D  P    
Sbjct: 396 YQQIVGRGLRLAAGKSDCLVLDYAGN-----------QYDLYQPEVG-----DPKPDSDS 439

Query: 398 DACTIPYEAYNKACPYCGHVN 418
           +  TIP       CP CG  N
Sbjct: 440 EIITIP-------CPACGFNN 453
>ref|NP_142208.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
 pir||H71243 probable helicase protein PH0210 - Pyrococcus horikoshii
 dbj|BAA29279.1| (AP000001) 444aa long hypothetical protein [Pyrococcus horikoshii]
          Length = 444

 Score = 77.0 bits (188), Expect = 5e-013
 Identities = 91/359 (25%), Positives = 154/359 (42%), Gaps = 37/359 (10%)

Query: 2   FKLRDYQQLAYNRIMAAWQEYRSVLAVLPTGAGKTVIFSKIIHDHVGAAAAIVHRREIVG 61
           +KLR YQ+ A   +  A +E   VLA LP G+GKTV+  +IIH+   +A  IVH +E++ 
Sbjct: 93  YKLRKYQKKA---VKLAIKEKMGVLA-LPVGSGKTVVGLRIIHEINKSALVIVHTKELLY 148

Query: 62  QISLSLASF-GVKHXXXXXXXXXXXXXXXXXXXXGQSYIDQNAQCGVISVQTLTSKSTMN 120
           Q +  +    GV+                                 V  +QTL S+ T  
Sbjct: 149 QWANKVREILGVEPGIIGDNKWS------------------EGPITVAMIQTLLSRGTD- 189

Query: 121 DATIMRWVKQVTLSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXX 180
                +   +  + ++DE H       + K    +       ++ATP R           
Sbjct: 190 -----KLQNKYAIVMFDECHRTSAAEKFYKVGISLPQVYRFGLSATPWRRLRGEEMKIEG 244

Query: 181 XFAQTMIEGPTTHWLIEQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESHI 240
                + E      LI++GFL+K  +   + D  +  LA  K  +   +A+      +  
Sbjct: 245 VVGPIIYEVKAED-LIKEGFLAKPKFEVIEYDSKMPALA-DKYKELYEEAVMENEERNRA 302

Query: 241 VGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEF 300
           + +      K G   R ++    +D  E + +  R  G  A  +S  ++S  R  +LE++
Sbjct: 303 IVEKAIELAKQGH--RVLIDVKRIDHGEILVKMLRDRGINAEFLS--SQSPNRWEILEKY 358

Query: 301 EQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
           ++G I +L++  L  EG D+P +  +ILA   +S    +Q +GRALR   G E A+I+D
Sbjct: 359 KKGEIPVLVST-LLKEGVDIPEISAIILAGGGKSDVMTIQTIGRALRPKPGGE-AVIVD 415
>gb|AAF12716.1|AF066865_14 (AF066865) unknown [bacteriophage TPW22]
          Length = 205

 Score = 65.9 bits (159), Expect = 1e-009
 Identities = 46/136 (33%), Positives = 67/136 (48%), Gaps = 8/136 (5%)

Query: 137 DEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXXXFAQTMIEGPTTHWLI 196
           DE HH ++   + K +    NA  LF T TP R +           A  +I G +  WL 
Sbjct: 78  DEAHH-IKANSYKKILEAFPNALKLFFTGTPIRLNGQGFED----MADDLITGKSIKWLQ 132

Query: 197 EQGFLSKFVYRAPQTDIDLRDLAVGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKR 256
           E G ++ F Y AP   ID   L    +GDF  K++     +  I GDV+ HY K  + K+
Sbjct: 133 EHGNIAPFKYYAPNI-IDTSQLKK-TSGDFTQKSID-EAFKRAIYGDVIAHYNKLSKGKQ 189

Query: 257 AIVFATDVDTAEQMAE 272
           AI +A +V TA+ ++E
Sbjct: 190 AICYAHNVATAQHISE 205
>ref|NP_559418.1| (NC_003364) helicase, possible DNA repair rad25 [Pyrobaculum
           aerophilum]
 gb|AAL63600.1| (AE009830) helicase, possible DNA repair rad25 [Pyrobaculum
           aerophilum]
          Length = 662

 Score = 65.1 bits (157), Expect = 2e-009
 Identities = 63/242 (26%), Positives = 104/242 (42%), Gaps = 32/242 (13%)

Query: 133 LSVYDEGHHYVEQGIWAKAVHLMENAKLLFVTATPERADXXXXXXXXX-------XFAQT 185
           L ++DE HH   +     A+ L E+   L ++ATPER D                   ++
Sbjct: 387 LVIFDEAHHVPAETFKEVALSL-ESPFRLALSATPEREDQNEHLIYEAVGPPVYRASYRS 445

Query: 186 MIEG----PTTHWLIEQGFL--SKFVYRA-PQTD-IDLRDLAVGKNGDFNAKALKARVVE 237
           MIE     P  H+ I        + +Y + PQ + I LR+ A        AKA K   + 
Sbjct: 446 MIEAGLVVPVEHYRIYVRMTREEEALYASLPQNNAIVLRNAA--------AKASKKIPIA 497

Query: 238 SHIVGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLL 297
             +V   V          + +VF   +D AE++ +  R++G  A  ++  +E G R+   
Sbjct: 498 VRVVVQEV------ALGSKVLVFTQFIDQAEELYKRLRESGIAAELIT--SEEGNREAAF 549

Query: 298 EEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAII 357
             F  G  ++++   + DEG DVP  E  ++   T S  + +Q  GR +R   GK  A +
Sbjct: 550 RRFSNGLSKVVVTTTVLDEGVDVPDAEVAVIVSGTGSRRQMIQRAGRVVRATAGKRAARV 609

Query: 358 ID 359
            +
Sbjct: 610 YE 611
>ref|NP_248512.1| (NC_000909) ATP dependent RNA helicase, putative [Methanococcus
           jannaschii]
 sp|Q58900|YF05_METJA Putative ATP-dependent RNA helicase MJ1505
 gb|AAB99518.1| (U67591) ATP dependent RNA helicase, putative [Methanococcus
           jannaschii]
          Length = 778

 Score = 55.1 bits (131), Expect = 2e-006
 Identities = 35/124 (28%), Positives = 63/124 (50%), Gaps = 8/124 (6%)

Query: 241 VGDVVQHYRKWGENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESG-------ER 293
           V D+V++  +  +++R I+FA   DT E++     + G  A    G+           E+
Sbjct: 339 VVDMVKNILEKNKDERIIIFAQYRDTVEKIVNLLTQNGIKAIRFIGQANKEGKGMSQKEQ 398

Query: 294 DRLLEEFEQGTIQILINVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKE 353
              +E F++    +L++  + +EG D+P+V  +I   P  S  +F+Q  GRA+R   GK 
Sbjct: 399 IEAIERFKKEG-SVLVSTSVSEEGIDIPSVNYIIFYEPVPSEIRFIQRRGRAMRGEGGKV 457

Query: 354 YAII 357
           Y +I
Sbjct: 458 YVLI 461
>sp|O02494|IF4A_CRYPV Eukaryotic initiation factor 4A (eIF-4A) (eIF4A)
 gb|AAB58726.1| (AF001211) translation initiation factor [Cryptosporidium parvum]
 gb|AAB58799.1| (AF001378) translation initiation factor [Cryptosporidium parvum]
          Length = 405

 Score = 50.1 bits (118), Expect = 6e-005
 Identities = 29/117 (24%), Positives = 59/117 (49%), Gaps = 5/117 (4%)

Query: 256 RAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLFD 315
           +AI++       +Q+ ++ R+  +T +++ G+ +  +R+ ++ +F  G+ ++LI  DL  
Sbjct: 272 QAIIYCNTRRRVDQLTKQMRERDFTCSSMHGDMDQKDREVIMRQFRSGSSRVLITTDLLA 331

Query: 316 EGFDVPAVECVILARPTQSLAKFLQMVGRALRI-MEGKEYAIIIDP----VRNWERH 367
            G DV  V  VI      S   ++  +GR+ R   +G     + D     +R+ ERH
Sbjct: 332 RGIDVQQVSLVINYDLPVSPETYIHRIGRSGRFGKKGVSINFVTDDDIVCLRDIERH 388
>ref|NP_421775.1| (NC_002696) excinuclease ABC, subunit B [Caulobacter crescentus]
 gb|AAK24943.1| (AE005961) excinuclease ABC, subunit B [Caulobacter crescentus]
          Length = 749

 Score = 47.8 bits (112), Expect = 3e-004
 Identities = 59/275 (21%), Positives = 107/275 (38%), Gaps = 29/275 (10%)

Query: 241 VGDVVQHYRKW-GENKRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEE 299
           V DVV   R+   +  R +V       AE + E   + G     +  + ++ ER  ++ +
Sbjct: 478 VDDVVDEIRQTIQKGYRTLVTVLTKKMAEDLTEYLTEQGIRVRYMHSDVDTIERIEIIRD 537

Query: 300 FEQGTIQILINVDLFDEGFDVPAVECVILA-------RPTQSLAKFLQMVGRALRIMEGK 352
              G   +L+ ++L  EG D+P  EC ++A          +S    +Q +GRA R ++GK
Sbjct: 538 LRLGHFDVLVGINLLREGLDIP--ECGLVAILDADKEGFLRSETSLIQTIGRAARNVDGK 595

Query: 353 EYAIIIDPVRNWERHGPPTAVRTWTLKGREKGQSSGSDTVPQKTCDACTIPYEAYNKAC- 411
                     + ER    TA R               ++V +   D    PYE  ++   
Sbjct: 596 VILYADRVTGSMERAMAETARRREKQHAYNLEHGITPESVKRDIKDILNSPYERGDRVLV 655

Query: 412 ----------PYCGHVNVYPDRSAPDKVDGDLVELDLDAWNALFAEIERAKMSDDDYELD 461
                     P+ G       +    K+      L+ +    L  EI+R K+ D ++  +
Sbjct: 656 PMGMSETDDRPFSGDNFKAALKDLEAKMREAAANLEFETAARLRDEIKRMKLMDLEFANE 715

Query: 462 M-------IRRGVPQIGRGQELRRHRAAKYRREVL 489
           +       + + +P+  R  ELR  +A  +R+  L
Sbjct: 716 VLTAPGEAVDKAMPKRVRA-ELRAEQAEAFRKSRL 749
>ref|NP_375993.1| (NC_003106) 621aa long conserved hypothetical protein [Sulfolobus
           tokodaii]
 dbj|BAB65102.1| (AP000981) 621aa long conserved hypothetical protein [Sulfolobus
           tokodaii]
          Length = 621

 Score = 43.5 bits (101), Expect = 0.006
 Identities = 29/107 (27%), Positives = 50/107 (46%), Gaps = 14/107 (13%)

Query: 255 KRAIVFATDVDTAEQMAEEFRKAGYTAAAVSGETESGERDRLLEEFEQGTIQILINVDLF 314
           K A+VF    ++AE +AE  RK    A           RD+++E F +G +++L++    
Sbjct: 215 KNALVFVRSRNSAETLAETLRKFSIPALPYHSGLPYDVRDKVIESFMKGEVRVLVSTTAL 274

Query: 315 DEGFDVPAVECVI--LARPTQ------------SLAKFLQMVGRALR 347
            +G ++P    V   ++ P              S+A+F Q+ GRA R
Sbjct: 275 GQGVNLPVYATVFYDISLPDSDDKGEFKGWRDLSVAEFKQIAGRAGR 321
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.321 0.136 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 278,304,758 Number of Sequences: 884779 Number of extensions: 10979687 Number of successful extensions: 26850 Number of sequences better than 5.0e-02: 632 Number of HSP's better than 0.1 without gapping: 488 Number of HSP's successfully gapped in prelim test: 144 Number of HSP's that attempted gapping in prelim test: 25932 Number of HSP's gapped (non-prelim): 966 length of query: 550 length of database: 277,083,050 effective HSP length: 125 effective length of query: 425 effective length of database: 166,485,675 effective search space: 70756411875 effective search space used: 70756411875 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 93 (40.4 bits) Query= orf47 [13233-13402] (168 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_416862.1|  (NC_000913) orf, hypothetical protein [Esc...    96  1e-019
sp|P76514|YFDR_ECOLI  Hypothetical protein yfdR                    96  1e-019
ref|NP_420264.1|  (NC_002696) conserved hypothetical protein...    58  4e-008
ref|NP_531749.1|  (NC_003304) conserved hypothetical protein...    54  6e-007
ref|NP_354075.1|  (NC_003062) AGR_C_1942p [Agrobacterium tum...    54  6e-007
dbj|BAA34643.1|  (AB020211) unnamed protein product [Roseoba...    54  8e-007
ref|NP_108031.1|  (NC_002678) hypothetical protein [Mesorhiz...    53  1e-006
ref|NP_385209.1|  (NC_003047) CONSERVED HYPOTHETICAL PROTEIN...    52  3e-006
>ref|NP_416862.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
 pir||F65009 hypothetical protein b2361 - Escherichia coli (strain K-12)
 gb|AAC75420.1| (AE000324) orf, hypothetical protein [Escherichia coli K12]
          Length = 187

 Score = 95.9 bits (237), Expect = 1e-019
 Identities = 53/116 (45%), Positives = 73/116 (62%), Gaps = 2/116 (1%)

Query: 19  DKFDMPLATIAYHLSHINRFNGAVGQ-YSVAQHCVQVAALLPQDLKLAGLLHDATEAVLC 77
           +K D+ +  IA  LS+I RF G +   YSVAQH V  + L+PQ+     L+HDATEA   
Sbjct: 27  NKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 86

Query: 78  DVPAPLKRMLPDYQRIEERLQDAVDRRFNV-KTRHKRVREADLSMLAAEARDFNLD 132
           D+PAPLKR+LPDY+R+EE++   +  ++ +       V+ ADL MLA E RD  LD
Sbjct: 87  DIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLD 142
>sp|P76514|YFDR_ECOLI Hypothetical protein yfdR
          Length = 178

 Score = 95.9 bits (237), Expect = 1e-019
 Identities = 53/116 (45%), Positives = 73/116 (62%), Gaps = 2/116 (1%)

Query: 19  DKFDMPLATIAYHLSHINRFNGAVGQ-YSVAQHCVQVAALLPQDLKLAGLLHDATEAVLC 77
           +K D+ +  IA  LS+I RF G +   YSVAQH V  + L+PQ+     L+HDATEA   
Sbjct: 18  NKDDIVINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 77

Query: 78  DVPAPLKRMLPDYQRIEERLQDAVDRRFNV-KTRHKRVREADLSMLAAEARDFNLD 132
           D+PAPLKR+LPDY+R+EE++   +  ++ +       V+ ADL MLA E RD  LD
Sbjct: 78  DIPAPLKRLLPDYKRMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLD 133
>ref|NP_420264.1| (NC_002696) conserved hypothetical protein [Caulobacter crescentus]
 gb|AAK23432.1| (AE005819) conserved hypothetical protein [Caulobacter crescentus]
          Length = 250

 Score = 57.8 bits (138), Expect = 4e-008
 Identities = 44/121 (36%), Positives = 69/121 (56%), Gaps = 16/121 (13%)

Query: 22  DMPLATIAYHLSHINRFNG-AVGQ--YSVAQHCV---QVAALLPQDL----KLAGLLHDA 71
           D+ +  IA+ L+ + R+NG  VG   +SVAQH +   ++AA +  DL    +LA LLHDA
Sbjct: 68  DIEIEDIAHGLARVARWNGQTVGDHGFSVAQHSLVVEEIAAHIKPDLEPRWRLAALLHDA 127

Query: 72  TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNVKTR-----HKRVREADLSMLAAE 125
           +E V+ D+ +P K  L   Y+  E RL+DA+  RF +  +      K +++AD +    E
Sbjct: 128 SEYVIGDMISPFKAALGVSYKDFETRLEDAIHIRFGLPVKTPTPIKKLIKQADRACAFFE 187

Query: 126 A 126
           A
Sbjct: 188 A 188
>ref|NP_531749.1| (NC_003304) conserved hypothetical protein [Agrobacterium
           tumefaciens str. C58 (U. Washington)]
 gb|AAL42065.1| (AE009069) conserved hypothetical protein [Agrobacterium
           tumefaciens str. C58 (U. Washington)]
          Length = 207

 Score = 53.9 bits (128), Expect = 6e-007
 Identities = 39/124 (31%), Positives = 68/124 (54%), Gaps = 18/124 (14%)

Query: 22  DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL--------PQDLKLAGLLHD 70
           D+ +A IA+ L+ + R+NG       ++VAQHC+ V  +         P ++++A LLHD
Sbjct: 28  DVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQMA-LLHD 86

Query: 71  ATEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREADLSMLAA 124
           A E V+ D+ +P K ++   Y+ +E+RL+ AV  RF +     +    R+++AD      
Sbjct: 87  APEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADTVAAFF 146

Query: 125 EARD 128
           EA +
Sbjct: 147 EATE 150
>ref|NP_354075.1| (NC_003062) AGR_C_1942p [Agrobacterium tumefaciens] [Agrobacterium
           tumefaciens str. C58 (Cereon)]
 gb|AAK86860.1| (AE008035) AGR_C_1942p [Agrobacterium tumefaciens str. C58
           (Cereon)]
          Length = 226

 Score = 53.9 bits (128), Expect = 6e-007
 Identities = 39/124 (31%), Positives = 68/124 (54%), Gaps = 18/124 (14%)

Query: 22  DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL--------PQDLKLAGLLHD 70
           D+ +A IA+ L+ + R+NG       ++VAQHC+ V  +         P ++++A LLHD
Sbjct: 47  DVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQMA-LLHD 105

Query: 71  ATEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREADLSMLAA 124
           A E V+ D+ +P K ++   Y+ +E+RL+ AV  RF +     +    R+++AD      
Sbjct: 106 APEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADTVAAFF 165

Query: 125 EARD 128
           EA +
Sbjct: 166 EATE 169
>dbj|BAA34643.1| (AB020211) unnamed protein product [Roseobacter denitrificans]
          Length = 195

 Score = 53.5 bits (127), Expect = 8e-007
 Identities = 35/97 (36%), Positives = 53/97 (54%), Gaps = 11/97 (11%)

Query: 22  DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALL-------PQDLKLAGLLHDA 71
           D+ +  IA  L+ + R+NG       YSVA+H + V  L        P   +LA LLHDA
Sbjct: 23  DVEIEDIARGLAFVARWNGQTSGAHAYSVAEHSLLVETLFTRIADKSPAKWRLAALLHDA 82

Query: 72  TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV 107
            E V+ D+ +P+K  + P Y+ ++ERL  A+  RF +
Sbjct: 83  PEYVIGDMISPVKAAVGPGYEALDERLTAAIHLRFGL 119
>ref|NP_108031.1| (NC_002678) hypothetical protein [Mesorhizobium loti]
 dbj|BAB54176.1| (AP003012) hypothetical protein [Mesorhizobium loti]
          Length = 211

 Score = 53.1 bits (126), Expect = 1e-006
 Identities = 37/97 (38%), Positives = 58/97 (59%), Gaps = 11/97 (11%)

Query: 22  DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAAL----LPQDL---KLAGLLHDA 71
           D+ ++ IA+ L+ + R+NG       +SVAQH + V AL    +PQ     +LA LLHDA
Sbjct: 31  DIEISDIAHGLARVARWNGQTSGEHAFSVAQHSLLVEALFNDLVPQASATDRLAALLHDA 90

Query: 72  TEAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV 107
            E V+ D+ +P K ++   Y+  E RLQ A+ +RF++
Sbjct: 91  PEYVIGDMISPFKSVMGGSYKDCELRLQRAIHQRFSL 127
>ref|NP_385209.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
 emb|CAC45682.1| (AL591786) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
          Length = 202

 Score = 51.6 bits (122), Expect = 3e-006
 Identities = 36/112 (32%), Positives = 63/112 (56%), Gaps = 15/112 (13%)

Query: 22  DMPLATIAYHLSHINRFNGAVG---QYSVAQHCVQVAALLPQDLK------LAGLLHDAT 72
           D+ L+ IA+ L+ + R+NG       +SVAQH + V  +  +  +      L  LLHDA 
Sbjct: 25  DVELSDIAHGLARVARWNGQTSGDHAFSVAQHSLVVEEIFRRTNRCDAQDCLMALLHDAP 84

Query: 73  EAVLCDVPAPLKRML-PDYQRIEERLQDAVDRRFNV-----KTRHKRVREAD 118
           E V+ D+ +P K ++   Y+ +E+RL++AV  RF +     +   +R+++AD
Sbjct: 85  EYVIGDMISPFKAVVGGGYKTVEKRLENAVHLRFGLPAHTPRELKERIKKAD 136
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.324 0.139 2.98e-071 Gapped Lambda K H 0.267 0.0410 -1.21e-305 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 75,839,941 Number of Sequences: 884779 Number of extensions: 2768435 Number of successful extensions: 6282 Number of sequences better than 5.0e-02: 11 Number of HSP's better than 0.1 without gapping: 2 Number of HSP's successfully gapped in prelim test: 9 Number of HSP's that attempted gapping in prelim test: 6272 Number of HSP's gapped (non-prelim): 11 length of query: 168 length of database: 277,083,050 effective HSP length: 111 effective length of query: 57 effective length of database: 178,872,581 effective search space: 10195737117 effective search space used: 10195737117 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 86 (37.7 bits) Query= orf48 [13402-13573] (170 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.319 0.133 -1.21e-305 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 92,549,307 Number of Sequences: 884779 Number of extensions: 3526080 Number of successful extensions: 9240 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 9240 Number of HSP's gapped (non-prelim): 0 length of query: 170 length of database: 277,083,050 effective HSP length: 111 effective length of query: 59 effective length of database: 178,872,581 effective search space: 10553482279 effective search space used: 10553482279 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 86 (37.7 bits) Query= orf49 [13588-13700] (111 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_335121.1|  (NC_002755) transcriptional regulator, put...    48  2e-005
ref|NP_215195.1|  (NC_000962) hypothetical protein Rv0681 [M...    46  6e-005
emb|CAC95193.1|  (AJ416905) hypothetical transcriptional reg...    42  9e-004
ref|NP_266283.1|  (NC_002662) transcription regulator [Lacto...    42  0.001
pir||T34657  probable transcription regulator - Streptomyces...    41  0.002
ref|NP_241219.1|  (NC_002570) BH0353~unknown conserved prote...    40  0.003
ref|NP_252662.1|  (NC_002516) probable transcriptional regul...    39  0.007
ref|NP_149192.1|  (NC_001988) HTH transcriptional regulator ...    39  0.009
ref|NP_355336.1|  (NC_003062) AGR_C_4325p [Agrobacterium tum...    39  0.012
ref|NP_251575.1|  (NC_002516) probable transcriptional regul...    38  0.021
emb|CAC03642.1|  (AL391338) putative TetR-family transcripti...    37  0.027
ref|NP_534677.1|  (NC_003305) transcriptional regulator, Tet...    37  0.027
ref|NP_511232.1|  (NC_003292) tetracycline repressor protein...    37  0.027
pir||RPECR1  tet repressor protein (Tn 1721) - Escherichia c...    37  0.047
>ref|NP_335121.1| (NC_002755) transcriptional regulator, putative [Mycobacterium
           tuberculosis CDC1551]
 gb|AAK44935.1| (AE006965) transcriptional regulator, putative [Mycobacterium
           tuberculosis CDC1551]
          Length = 218

 Score = 47.8 bits (112), Expect = 2e-005
 Identities = 34/101 (33%), Positives = 51/101 (49%), Gaps = 8/101 (7%)

Query: 2   AKKITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
           A K   RPA   R E IV+  L    R GW+S+T  A+A  +G   PSL  H  ++  L+
Sbjct: 19  AVKSVARPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLR 77

Query: 62  RAIVRAAVRANDGLVVAM-AVQAGDWSRDKTDELLYIRGKH 101
           RA+    +R  D ++  +  V AG   R + D +L + G +
Sbjct: 78  RAV---RIRVIDDIITMLNRVGAG---RARDDAVLVMAGAY 112
>ref|NP_215195.1| (NC_000962) hypothetical protein Rv0681 [Mycobacterium tuberculosis
           H37Rv]
 pir||B70827 hypothetical protein Rv0681 - Mycobacterium tuberculosis  (strain
           H37RV)
 emb|CAA17464.1| (AL021943) hypothetical protein Rv0681 [Mycobacterium tuberculosis
           H37Rv]
          Length = 196

 Score = 46.2 bits (108), Expect = 6e-005
 Identities = 32/95 (33%), Positives = 49/95 (50%), Gaps = 8/95 (8%)

Query: 8   RPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRA 67
           RPA   R E IV+  L    R GW+S+T  A+A  +G   PSL  H  ++  L+RA+   
Sbjct: 3   RPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAV--- 58

Query: 68  AVRANDGLVVAM-AVQAGDWSRDKTDELLYIRGKH 101
            +R  D ++  +  V AG   R + D +L + G +
Sbjct: 59  RIRVIDDIITMLNRVGAG---RARDDAVLVMAGAY 90
>emb|CAC95193.1| (AJ416905) hypothetical transcriptional regulator [Rhizobium
          leguminosarum]
          Length = 216

 Score = 42.4 bits (98), Expect = 9e-004
 Identities = 20/64 (31%), Positives = 38/64 (59%)

Query: 2  AKKITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
          AK + +      ++EQI++A L V   +G+   T   +AE  G++ P+L+Y++RT   + 
Sbjct: 7  AKTLRRTRIQEEKEEQILEAALDVFSASGFRGSTIDQIAEVAGMSKPNLLYYFRTKEAMH 66

Query: 62 RAIV 65
          RA++
Sbjct: 67 RALI 70
>ref|NP_266283.1| (NC_002662) transcription regulator [Lactococcus lactis subsp.
          lactis]
 gb|AAK04225.1|AE006251_1 (AE006251) transcription regulator [Lactococcus lactis subsp.
          lactis]
          Length = 191

 Score = 41.6 bits (96), Expect = 0.001
 Identities = 18/52 (34%), Positives = 36/52 (68%)

Query: 12 AVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRA 63
          A+ +++IVQA + +A + G  ++++  +AE+ GI APSL  H++ M +++ A
Sbjct: 5  ALTRDKIVQATIELAGKIGLSNVSFPRLAEYFGIKAPSLYNHFKNMEEVRVA 56
>pir||T34657 probable transcription regulator - Streptomyces coelicolor
 emb|CAA22378.1| (AL034446) putative transcriptional regulator [Streptomyces
          coelicolor A3(2)]
          Length = 189

 Score = 41.2 bits (95), Expect = 0.002
 Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 3/72 (4%)

Query: 9  PAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAA 68
          P   +  +++V A   +A  TG+ES+T  A+A H G+   SL  H R +  L+   VR A
Sbjct: 2  PRVGLTTDRVVAAAADLADETGFESVTVSALARHFGVKDASLYTHVRNLQDLR---VRVA 58

Query: 69 VRANDGLVVAMA 80
          + A   L+  +A
Sbjct: 59 LLAGGELIEEIA 70
>ref|NP_241219.1| (NC_002570) BH0353~unknown conserved protein [Bacillus
          halodurans]
 dbj|BAB04072.1| (AP001508) BH0353~unknown conserved protein [Bacillus halodurans]
          Length = 197

 Score = 40.4 bits (93), Expect = 0.003
 Identities = 22/73 (30%), Positives = 39/73 (53%)

Query: 13 VRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRAN 72
          V KE++++A        G  S+T K+VAE  G+   ++ YH+R+  QL   IV +   + 
Sbjct: 7  VTKEELIEAAKRCIVEKGIHSLTLKSVAEGAGVTQGTVYYHFRSKDQLMLEIVESMCTSA 66

Query: 73 DGLVVAMAVQAGD 85
             + +M+ Q G+
Sbjct: 67 WHSLESMSKQEGE 79
>ref|NP_252662.1| (NC_002516) probable transcriptional regulator [Pseudomonas
          aeruginosa]
 pir||C83149 probable transcription regulator PA3973 [imported] - Pseudomonas
          aeruginosa (strain PAO1)
 gb|AAG07360.1|AE004815_4 (AE004815) probable transcriptional regulator [Pseudomonas
          aeruginosa]
          Length = 215

 Score = 39.3 bits (90), Expect = 0.007
 Identities = 27/82 (32%), Positives = 44/82 (52%), Gaps = 3/82 (3%)

Query: 1  MAKKITKR--PAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMT 58
          M  ++T+R     +  +E+I+Q GL      G+ ++T +A+A+  GIA  SL  H+R   
Sbjct: 1  MVYRVTERRLQRDSALRERILQLGLRRVVEGGFAALTMQALADDAGIATGSLYRHFRGKG 60

Query: 59 QLKRAIV-RAAVRANDGLVVAM 79
          +L   I  RA+ R  D L V +
Sbjct: 61 ELAAEIFRRASQREVDALAVVL 82
>ref|NP_149192.1| (NC_001988) HTH transcriptional regulator TetR family
          [Clostridium acetobutylicum]
 gb|AAK76774.1|AE001438_27 (AE001438) HTH transcriptional regulator TetR family [Clostridium
          acetobutylicum]
          Length = 187

 Score = 38.9 bits (89), Expect = 0.009
 Identities = 17/47 (36%), Positives = 29/47 (61%)

Query: 15 KEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
          KE+I+Q    +A   G + +T++ +AE + I +PSL  H+  M+ LK
Sbjct: 8  KEKIIQIAFSLADEIGLDKVTFQKLAEKLDIKSPSLYNHFSNMSDLK 54
>ref|NP_355336.1| (NC_003062) AGR_C_4325p [Agrobacterium tumefaciens]
          [Agrobacterium tumefaciens str. C58 (Cereon)]
 ref|NP_533056.1| (NC_003304) transcriptional regulator, TetR family [Agrobacterium
          tumefaciens str. C58 (U. Washington)]
 gb|AAK88121.1| (AE008152) AGR_C_4325p [Agrobacterium tumefaciens str. C58
          (Cereon)]
 gb|AAL43372.1| (AE009186) transcriptional regulator, TetR family [Agrobacterium
          tumefaciens str. C58 (U. Washington)]
          Length = 213

 Score = 38.5 bits (88), Expect = 0.012
 Identities = 16/52 (30%), Positives = 32/52 (60%)

Query: 14 RKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
          ++E I++A L V    G+   T   +AE  G++ P+++Y++RT   + RA++
Sbjct: 19 KQETILEAALSVFSTNGFRGSTIDQIAEAAGMSKPNVLYYFRTKEAMHRALI 70
>ref|NP_251575.1| (NC_002516) probable transcriptional regulator [Pseudomonas
          aeruginosa]
 pir||C83286 probable transcriptional regulator PA2885 [imported] -
          Pseudomonas aeruginosa (strain PAO1)
 gb|AAG06273.1|AE004714_10 (AE004714) probable transcriptional regulator [Pseudomonas
          aeruginosa]
          Length = 198

 Score = 37.7 bits (86), Expect = 0.021
 Identities = 18/75 (24%), Positives = 39/75 (52%)

Query: 11 PAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVR 70
          P   + +++Q    + R  G+E  T + +A  VGI + S+ +H+++  ++ R+++   + 
Sbjct: 13 PESARGKLLQTAAHLFRSKGYERTTVRDLASAVGIQSGSIFHHFKSKDEILRSVMEETIL 72

Query: 71 ANDGLVVAMAVQAGD 85
           N  L+ A    A D
Sbjct: 73 YNTALMRAALADAED 87
>emb|CAC03642.1| (AL391338) putative TetR-family transcriptional regulator.
           [Streptomyces coelicolor A3(2)]
          Length = 190

 Score = 37.4 bits (85), Expect = 0.027
 Identities = 32/104 (30%), Positives = 51/104 (48%), Gaps = 10/104 (9%)

Query: 4   KITKRPAPAVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRA 63
           K  +R A A R +QI+ A   +    G ++++ + +AE VGI APSL  H+       +A
Sbjct: 2   KSGERSAVADRAQQIMSAARELLDAEGPDALSMRRIAERVGIRAPSLYKHF-----ADKA 56

Query: 64  IVRAAVRANDGLVVAMAVQAGDWSRDKTDELLYIRG----KHAL 103
            + A ++A  GL +   VQ    +    D  L + G    +HAL
Sbjct: 57  ALVAGLQA-QGLTLLAEVQEAAEAEIGADPPLLVLGYAYRRHAL 99
>ref|NP_534677.1| (NC_003305) transcriptional regulator, TetR family [Agrobacterium
          tumefaciens str. C58 (U. Washington)]
 gb|AAL44993.1| (AE009349) transcriptional regulator, TetR family [Agrobacterium
          tumefaciens str. C58 (U. Washington)]
          Length = 190

 Score = 37.4 bits (85), Expect = 0.027
 Identities = 20/72 (27%), Positives = 37/72 (50%), Gaps = 11/72 (15%)

Query: 12 AVRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAI------- 64
          A R+++I+ A  ++A R GW S+T + +A+ +  + P L  H+     +  A+       
Sbjct: 12 ADREKRIIAAARMLAERDGWASVTVRRLAQEIEYSQPVLYAHFENRDAIVGAVALEGFGE 71

Query: 65 ----VRAAVRAN 72
              +RA+VR N
Sbjct: 72 LGPTLRASVRRN 83
>ref|NP_511232.1| (NC_003292) tetracycline repressor protein [IncN plasmid R46]
 sp|P03039|TER3_ECOLI TETRACYCLINE REPRESSOR PROTEIN CLASS C
 pir||RPECYS tetracycline repressor - Escherichia coli plasmid pSC101
 gb|AAA25677.1| (M36272) tetracycline resistance element repressor protein (tetR)
          [Plasmid pSC101]
 gb|AAG43219.1| (AY046276) tetracycline repressor protein [IncN plasmid R46]
 prf||1101269A repressor tet [Salmonella enterica subsp. enterica serovar
          Panama]
          Length = 219

 Score = 37.4 bits (85), Expect = 0.027
 Identities = 15/60 (25%), Positives = 33/60 (55%)

Query: 13 VRKEQIVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRAN 72
          +++E +++  L +    G E +T + +AE +G+  P+L +H++    L  A+  A +  N
Sbjct: 4  LQREAVIRTALELLNDVGMEGLTTRRLAERLGVQQPALYWHFKNKRALLDALAEAMLTIN 63
>pir||RPECR1 tet repressor protein (Tn 1721) - Escherichia coli plasmid RP1
 emb|CAA24908.1| (X00006) tetR regulatory protein [unidentified bacterium]
 gb|AAB67685.1| (U75324) TetR [Cloning vector pJB3Tc20]
 gb|AAB82322.1| (U82001) TetR [Expression vector pJB866]
          Length = 216

 Score = 36.6 bits (83), Expect = 0.047
 Identities = 19/69 (27%), Positives = 34/69 (48%), Gaps = 2/69 (2%)

Query: 18 IVQAGLLVARRTGWESITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRAAVRANDGLVV 77
          +++A L +    G + +T + +AE +G+  P+L +H+R    L  A+  A +  N    V
Sbjct: 9  VIRAALDLLNEVGVDGLTTRKLAERLGVQQPALYWHFRNKRALLDALAEAMLAENHSTSV 68

Query: 78 AMAVQAGDW 86
            A    DW
Sbjct: 69 PRA--DDDW 75
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.321 0.131 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 53,834,554 Number of Sequences: 884779 Number of extensions: 1672650 Number of successful extensions: 3826 Number of sequences better than 5.0e-02: 27 Number of HSP's better than 0.1 without gapping: 21 Number of HSP's successfully gapped in prelim test: 6 Number of HSP's that attempted gapping in prelim test: 3805 Number of HSP's gapped (non-prelim): 27 length of query: 111 length of database: 277,083,050 effective HSP length: 87 effective length of query: 24 effective length of database: 200,107,277 effective search space: 4802574648 effective search space used: 4802574648 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 83 (36.6 bits) Query= orf50 [13719-13803] (83 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.323 0.137 0.000 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 42,149,018 Number of Sequences: 884779 Number of extensions: 1364954 Number of successful extensions: 2024 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2024 Number of HSP's gapped (non-prelim): 0 length of query: 83 length of database: 277,083,050 effective HSP length: 59 effective length of query: 24 effective length of database: 224,881,089 effective search space: 5397146136 effective search space used: 5397146136 T: 11 A: 40 X1: 16 ( 7.5 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 84 (37.0 bits) Query= orf51 [13820-13891] (70 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.135 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 39,140,978 Number of Sequences: 884779 Number of extensions: 1138345 Number of successful extensions: 1923 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 1923 Number of HSP's gapped (non-prelim): 0 length of query: 70 length of database: 277,083,050 effective HSP length: 46 effective length of query: 24 effective length of database: 236,383,216 effective search space: 5673197184 effective search space used: 5673197184 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 84 (37.0 bits) Query= orf52 [13890-13943] (52 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.340 0.156 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 23,870,899 Number of Sequences: 884779 Number of extensions: 624806 Number of successful extensions: 3386 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 3386 Number of HSP's gapped (non-prelim): 0 length of query: 52 length of database: 277,083,050 effective HSP length: 28 effective length of query: 24 effective length of database: 252,309,238 effective search space: 6055421712 effective search space used: 6055421712 T: 11 A: 40 X1: 15 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 39 (21.8 bits) S2: 84 (37.0 bits) Query= orf53 [13938-14059] (120 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.326 0.139 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 53,994,972 Number of Sequences: 884779 Number of extensions: 1833223 Number of successful extensions: 6142 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 6142 Number of HSP's gapped (non-prelim): 0 length of query: 120 length of database: 277,083,050 effective HSP length: 96 effective length of query: 24 effective length of database: 192,144,266 effective search space: 4611462384 effective search space used: 4611462384 T: 11 A: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.7 bits) S2: 83 (36.6 bits) Query= orf54 [14058-14205] (146 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.327 0.141 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 69,897,528 Number of Sequences: 884779 Number of extensions: 2206116 Number of successful extensions: 4882 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4882 Number of HSP's gapped (non-prelim): 0 length of query: 146 length of database: 277,083,050 effective HSP length: 108 effective length of query: 38 effective length of database: 181,526,918 effective search space: 6898022884 effective search space used: 6898022884 T: 11 A: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.7 bits) S2: 85 (37.4 bits) Query= orf55 [14203-14366] (162 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.323 0.141 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 84,006,742 Number of Sequences: 884779 Number of extensions: 3123567 Number of successful extensions: 4673 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4673 Number of HSP's gapped (non-prelim): 0 length of query: 162 length of database: 277,083,050 effective HSP length: 110 effective length of query: 52 effective length of database: 179,757,360 effective search space: 9347382720 effective search space used: 9347382720 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 86 (37.7 bits) Query= orf56 [14371-14438] (66 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.322 0.141 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 37,685,106 Number of Sequences: 884779 Number of extensions: 1219724 Number of successful extensions: 2261 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2261 Number of HSP's gapped (non-prelim): 0 length of query: 66 length of database: 277,083,050 effective HSP length: 42 effective length of query: 24 effective length of database: 239,922,332 effective search space: 5758135968 effective search space used: 5758135968 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 84 (37.0 bits) Query= orf57 [14438-14502] (63 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.327 0.139 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 23,620,112 Number of Sequences: 884779 Number of extensions: 438664 Number of successful extensions: 856 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 856 Number of HSP's gapped (non-prelim): 0 length of query: 63 length of database: 277,083,050 effective HSP length: 39 effective length of query: 24 effective length of database: 242,576,669 effective search space: 5821840056 effective search space used: 5821840056 T: 11 A: 40 X1: 15 ( 7.1 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 40 (21.7 bits) S2: 84 (37.0 bits) Query= orf58 [14503-14581] (77 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.321 0.136 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 39,661,005 Number of Sequences: 884779 Number of extensions: 1249866 Number of successful extensions: 2792 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2792 Number of HSP's gapped (non-prelim): 0 length of query: 77 length of database: 277,083,050 effective HSP length: 53 effective length of query: 24 effective length of database: 230,189,763 effective search space: 5524554312 effective search space used: 5524554312 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 84 (37.0 bits) Query= orf59 [14600-14648] (47 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.321 0.140 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 23,039,454 Number of Sequences: 884779 Number of extensions: 544599 Number of successful extensions: 1239 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 1239 Number of HSP's gapped (non-prelim): 0 length of query: 47 length of database: 277,083,050 effective HSP length: 23 effective length of query: 24 effective length of database: 256,733,133 effective search space: 6161595192 effective search space used: 6161595192 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 84 (37.0 bits) Query= orf60 [14647-14740] (92 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.319 0.130 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 47,667,396 Number of Sequences: 884779 Number of extensions: 1565823 Number of successful extensions: 2931 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 2931 Number of HSP's gapped (non-prelim): 0 length of query: 92 length of database: 277,083,050 effective HSP length: 68 effective length of query: 24 effective length of database: 216,918,078 effective search space: 5206033872 effective search space used: 5206033872 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 84 (37.0 bits) Query= orf61 [14739-14855] (115 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.318 0.136 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 53,313,387 Number of Sequences: 884779 Number of extensions: 1819600 Number of successful extensions: 14774 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 14774 Number of HSP's gapped (non-prelim): 0 length of query: 115 length of database: 277,083,050 effective HSP length: 91 effective length of query: 24 effective length of database: 196,568,161 effective search space: 4717635864 effective search space used: 4717635864 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.7 bits) S2: 83 (36.6 bits) Query= orf62 [14853-14991] (137 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

emb|CAC01493.1|  (AL391017) putative polypeptide deformylase...    80  2e-015
ref|NP_213407.1|  (NC_000918) polypeptide deformylase [Aquif...    69  1e-011
ref|NP_438782.1|  (NC_000907) polypeptide deformylase (def) ...    68  1e-011
ref|NP_248709.1|  (NC_002516) polypeptide deformylase [Pseud...    68  2e-011
sp|Q9XAQ2|DEF_STRCO  Polypeptide deformylase (PDF) (Formylme...    67  2e-011
ref|NP_273168.1|  (NC_003112) polypeptide deformylase [Neiss...    67  4e-011
ref|NP_297006.1|  (NC_002620) polypeptide deformylase [Chlam...    66  5e-011
ref|NP_219861.1|  (NC_000117) Polypeptide Deformylase [Chlam...    66  6e-011
emb|CAB94606.1|  (AL359214) putative polypeptide deformylase...    63  5e-010
ref|NP_246498.1|  (NC_002663) Def [Pasteurella multocida] >g...    63  5e-010
ref|NP_487119.1|  (NC_003272) polypeptide deformylase [Nosto...    62  7e-010
ref|NP_354552.1|  (NC_003062) AGR_C_2856p [Agrobacterium tum...    59  8e-009
ref|NP_269070.1|  (NC_002737) putative polypeptide deformyla...    57  3e-008
ref|NP_563974.1|  (NM_101408) expressed protein [Arabidopsis...    56  5e-008
sp|Q9FV53|DEFM_ARATH  Polypeptide deformylase, mitochondrial...    56  5e-008
ref|NP_518191.1|  (NC_003295) PROBABLE POLYPEPTIDE DEFORMYLA...    55  1e-007
ref|NP_345996.1|  (NC_003028) polypeptide deformylase [Strep...    52  7e-007
ref|NP_360311.1|  (NC_003103) polypeptide deformylase [EC:3....    51  2e-006
ref|NP_229461.1|  (NC_000853) polypeptide deformylase [Therm...    50  4e-006
ref|NP_561676.1|  (NC_003366) polypeptide deformylase [Clost...    47  4e-005
sp|O08450|DEF_CLOBE  Polypeptide deformylase (PDF) (Formylme...    44  3e-004
>emb|CAC01493.1| (AL391017) putative polypeptide deformylase [Streptomyces
           coelicolor A3(2)]
          Length = 217

 Score = 80.5 bits (197), Expect = 2e-015
 Identities = 49/132 (37%), Positives = 63/132 (47%), Gaps = 6/132 (4%)

Query: 9   PELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFK---- 63
           P LHA  AEV   G ++  LV D               NQVG   R+ V  CP  +    
Sbjct: 60  PVLHARCAEVTDFGPELAALVEDLFATMYAAHGVGLAANQVGEAVRVFVYDCPDDEDERH 119

Query: 64  -GCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTA 122
            G V+NP +       V  PEGCLS PG     +R ++ VV G+ +  +P+T+   G  A
Sbjct: 120 LGHVVNPRLVETGGVVVRGPEGCLSLPGLEAGTERYDEAVVTGFTVAGEPVTVRGTGFFA 179

Query: 123 FCLQHEIDHLNG 134
            CLQHE DHL G
Sbjct: 180 RCLQHECDHLEG 191
>ref|NP_213407.1| (NC_000918) polypeptide deformylase [Aquifex aeolicus]
 sp|O66847|DEF_AQUAE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 pir||C70352 polypeptide deformylase - Aquifex aeolicus
 gb|AAC06802.1| (AE000696) polypeptide deformylase [Aquifex aeolicus]
          Length = 169

 Score = 68.6 bits (166), Expect = 1e-011
 Identities = 39/121 (32%), Positives = 61/121 (50%), Gaps = 7/121 (5%)

Query: 22  EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLR------CPTFKGCVINPIITRHT 75
           ++VK+L+ D               NQ+GV   ++V+        P  K  +INP I +  
Sbjct: 25  KEVKNLIRDMFDTMYEAEGVGLAANQIGVPLSVMVIDTSPKEDAPPLKLVLINPEI-KEG 83

Query: 76  DGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
           +G +   EGCLS+PG +V  +R  KV V   +   +P+ +  +G  A   QHE+DHL G+
Sbjct: 84  EGKIKYKEGCLSFPGLSVEVERFQKVKVNALNEHGEPVELTLEGFPAIVFQHELDHLKGI 143

Query: 136 T 136
           T
Sbjct: 144 T 144
>ref|NP_438782.1| (NC_000907) polypeptide deformylase (def) [Haemophilus influenzae
           Rd]
 sp|P44786|DEF_HAEIN Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 pir||D64082 N-formylmethionylaminoacyl-tRNA deformylase (EC 3.5.1.27) -
           Haemophilus influenzae (strain Rd KW20)
 gb|AAC22282.1| (U32745) polypeptide deformylase (def) [Haemophilus influenzae Rd]
          Length = 169

 Score = 68.2 bits (165), Expect = 1e-011
 Identities = 40/92 (43%), Positives = 51/92 (54%), Gaps = 4/92 (4%)

Query: 47  QVGVLKRIIVLRCPTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVV 103
           QV +L+RII +     K     +INP I   ++G     EGCLS PG      R+ KV V
Sbjct: 51  QVDILQRIITIDVEGDKQNQFVLINPEILA-SEGETGIEEGCLSIPGFRALVPRKEKVTV 109

Query: 104 EGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
              D D +  T+ A GL A C+QHEIDHLNG+
Sbjct: 110 RALDRDGKEFTLDADGLLAICIQHEIDHLNGI 141
>ref|NP_248709.1| (NC_002516) polypeptide deformylase [Pseudomonas aeruginosa]
 sp|Q9I7A8|DEF_PSEAE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 pir||H83643 polypeptide deformylase PA0019 [imported] - Pseudomonas aeruginosa
           (strain PAO1)
 gb|AAG03409.1|AE004441_10 (AE004441) polypeptide deformylase [Pseudomonas aeruginosa]
          Length = 168

 Score = 67.8 bits (164), Expect = 2e-011
 Identities = 49/138 (35%), Positives = 61/138 (43%), Gaps = 4/138 (2%)

Query: 1   MKILKDDAPELHAIAAEVPHGED-VKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC 59
           + IL+   P L  IA  V   +D V+ L+ D                QV V KRI+V+  
Sbjct: 4   LNILEFPDPRLRTIAKPVEVVDDAVRQLIDDMFETMYEAPGIGLAATQVNVHKRIVVMDL 63

Query: 60  PTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
              K      INP     T+      EGCLS PG      R  KV ++  D D  P    
Sbjct: 64  SEDKSEPRVFINPEFEPLTEDMDQYQEGCLSVPGFYENVDRPQKVRIKALDRDGNPFEEV 123

Query: 117 AKGLTAFCLQHEIDHLNG 134
           A+GL A C+QHE DHLNG
Sbjct: 124 AEGLLAVCIQHECDHLNG 141
>sp|Q9XAQ2|DEF_STRCO Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 pir||T34626 probable polypetide deformylase - Streptomyces coelicolor
 emb|CAB44533.1| (AL078618) putative polypetide deformylase [Streptomyces
           coelicolor]
          Length = 208

 Score = 67.4 bits (163), Expect = 2e-011
 Identities = 41/122 (33%), Positives = 56/122 (45%), Gaps = 8/122 (6%)

Query: 21  GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-----VINPIITRHT 75
           G D+  L+ D               NQVGV  R+ V  CP  +G      ++NP++    
Sbjct: 48  GPDLAALIDDMFRTMYVAEGAGLAANQVGVDLRLFVYDCPDDEGVRHVGHLVNPVLDALD 107

Query: 76  DGH---VYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHL 132
                 +   EGCLS PG  +A  R ++ VV G D D  P+ +   G  A CL HE DH+
Sbjct: 108 PAARRLLDEGEGCLSVPGAVMAVPRPDRAVVRGLDKDGVPLLVEGTGYFARCLAHETDHV 167

Query: 133 NG 134
           NG
Sbjct: 168 NG 169
>ref|NP_273168.1| (NC_003112) polypeptide deformylase [Neisseria meningitidis MC58]
 ref|NP_283013.1| (NC_003116) polypeptide deformylase [Neisseria meningitidis Z2491]
 sp|Q9JQN0|DEF_NEIMA Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 pir||E81238 formylmethionine deformylase (EC 3.5.1.31) NMA0164 [imported] -
           Neisseria meningitidis (group B strain MD58, group A
           strain Z2491)
 gb|AAF40569.1| (AE002369) polypeptide deformylase [Neisseria meningitidis MC58]
 emb|CAB83478.1| (AL162752) polypeptide deformylase [Neisseria meningitidis Z2491]
          Length = 167

 Score = 66.6 bits (161), Expect = 4e-011
 Identities = 44/139 (31%), Positives = 63/139 (44%), Gaps = 5/139 (3%)

Query: 1   MKILKDDAPELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC 59
           + IL+     LH +A  V    E ++ L+ D                QV V +R++V+  
Sbjct: 4   LNILQYPDERLHTVAKPVEQVDERIRKLIADMFETMYESRGIGLAATQVDVHERVVVMDL 63

Query: 60  PTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
              +      INP+I    DG     EGCLS PG      R  +V VE  +   +  T+ 
Sbjct: 64  TEDRSEPRVFINPVIVEK-DGETTYEEGCLSVPGIYDTVTRAERVKVEALNEKGEKFTLE 122

Query: 117 AKGLTAFCLQHEIDHLNGV 135
           A GL A C+QHE+DHL G+
Sbjct: 123 ADGLLAICVQHELDHLMGI 141
>ref|NP_297006.1| (NC_002620) polypeptide deformylase [Chlamydia muridarum]
 sp|Q9PK41|DEF_CHLMU Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 pir||C81680 polypeptide deformylase TC0632 [imported] - Chlamydia muridarum
           (strain Nigg)
 gb|AAF39461.1| (AE002332) polypeptide deformylase [Chlamydia muridarum]
          Length = 181

 Score = 66.2 bits (160), Expect = 5e-011
 Identities = 42/141 (29%), Positives = 65/141 (45%), Gaps = 12/141 (8%)

Query: 7   DAPELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPT---- 61
           D+P L  +AA +    ++++ LVLD                QVG    + ++        
Sbjct: 9   DSPILRKVAAPIDEITDELRQLVLDMSETMTFYKGVGLAAPQVGHSVALFIMGVEKELDD 68

Query: 62  -------FKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPIT 114
                  F    INP+IT+ ++  VY  EGCLS PG      R +K+ V   ++D QP +
Sbjct: 69  GELIFCDFPKVFINPVITQKSEQLVYGNEGCLSIPGLRGEVARPDKITVTAKNLDGQPFS 128

Query: 115 IAAKGLTAFCLQHEIDHLNGV 135
           +  +G  A  + HE DHL+GV
Sbjct: 129 MTLEGFLARIVMHETDHLHGV 149
>ref|NP_219861.1| (NC_000117) Polypeptide Deformylase [Chlamydia trachomatis]
 sp|O84357|DEF_CHLTR Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 pir||B71526 probable polypeptide deformylase - Chlamydia trachomatis  (serotype
           D, strain UW3/Cx)
 gb|AAC67948.1| (AE001308) Polypeptide Deformylase [Chlamydia trachomatis]
          Length = 181

 Score = 65.9 bits (159), Expect = 6e-011
 Identities = 43/141 (30%), Positives = 65/141 (45%), Gaps = 12/141 (8%)

Query: 7   DAPELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPT---- 61
           D+P L  +AA V    ++++ LVLD                QVG    + ++        
Sbjct: 9   DSPILRKVAAPVTEITDELRQLVLDMSETMAFYKGVGLAAPQVGQSISLFIMGVERELED 68

Query: 62  -------FKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPIT 114
                  F    INP+IT+ ++  VY  EGCLS PG      R +K+ V   ++D Q  +
Sbjct: 69  GELVFCDFPRVFINPVITQKSEQLVYGNEGCLSIPGLRGEVARPDKITVSAKNLDGQQFS 128

Query: 115 IAAKGLTAFCLQHEIDHLNGV 135
           +A +G  A  + HE DHL+GV
Sbjct: 129 LALEGFLARIVMHETDHLHGV 149
>emb|CAB94606.1| (AL359214) putative polypeptide deformylase [Streptomyces
           coelicolor A3(2)]
          Length = 216

 Score = 62.8 bits (151), Expect = 5e-010
 Identities = 46/138 (33%), Positives = 57/138 (40%), Gaps = 15/138 (10%)

Query: 9   PELHAIAAEVPH-GEDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-- 65
           P LH    +V   GE+ + LV D               NQ+GV K++ V  CP  +G   
Sbjct: 50  PVLHKECEDVTDFGEEFQQLVADMFASQRTAEGVGLAANQIGVSKKVFVYDCPDDEGVRH 109

Query: 66  ---VINPIIT------RHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
              V NP +       R  D    S EGCLS P       R +   V G D    P+ + 
Sbjct: 110 VGVVCNPRLVELPADRRRLDD---SNEGCLSVPTAYAPLARPDYAEVTGQDEKGNPVKVR 166

Query: 117 AKGLTAFCLQHEIDHLNG 134
             G  A CLQHE DHL G
Sbjct: 167 GTGYFARCLQHETDHLYG 184
>ref|NP_246498.1| (NC_002663) Def [Pasteurella multocida]
 sp|P57948|DEF_PASMU Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 gb|AAK03643.1| (AE006193) Def [Pasteurella multocida]
          Length = 170

 Score = 62.8 bits (151), Expect = 5e-010
 Identities = 38/117 (32%), Positives = 59/117 (49%), Gaps = 4/117 (3%)

Query: 22  EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKG---CVINPIITRHTDGH 78
           ++++++V +                QV + +RII +     K     +INP I   + G 
Sbjct: 26  DEIREIVDNMFETMYLEEGIGLAATQVNIHQRIITIDVEGTKENQYVLINPEII-DSCGE 84

Query: 79  VYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
               EGCLS PG      R+ KV ++  D   +  T++A+GL A C+QHEIDHLNG+
Sbjct: 85  TGIEEGCLSLPGFRGFVPRKEKVTIKALDRHGEEYTLSAEGLLAICIQHEIDHLNGI 141
>ref|NP_487119.1| (NC_003272) polypeptide deformylase [Nostoc sp. PCC 7120]
 dbj|BAB74778.1| (AP003591) polypeptide deformylase [Nostoc sp. PCC 7120]
          Length = 187

 Score = 62.4 bits (150), Expect = 7e-010
 Identities = 39/119 (32%), Positives = 57/119 (47%), Gaps = 5/119 (4%)

Query: 22  EDVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRCPTFKGC-----VINPIITRHTD 76
           ++ + L+ D                QVG+ K++IV+ C           +INP I + + 
Sbjct: 39  DETRQLIRDMLQTMYSSDGIGLAAPQVGINKQLIVIDCEPDNPANPPLILINPTIKQVSR 98

Query: 77  GHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGV 135
               + EGCLS PG  +  KR   V V   D + +P T+ A  L   C+QHE+DHLNGV
Sbjct: 99  EICSAQEGCLSIPGVYMDVKRPEVVEVAYKDENGRPQTLKATDLLGRCIQHEMDHLNGV 157
>ref|NP_354552.1| (NC_003062) AGR_C_2856p [Agrobacterium tumefaciens] [Agrobacterium
           tumefaciens str. C58 (Cereon)]
 ref|NP_532239.1| (NC_003304) polypeptide deformylase [Agrobacterium tumefaciens str.
           C58 (U. Washington)]
 gb|AAK87337.1| (AE008078) AGR_C_2856p [Agrobacterium tumefaciens str. C58
           (Cereon)]
 gb|AAL42555.1| (AE009113) polypeptide deformylase [Agrobacterium tumefaciens str.
           C58 (U. Washington)]
          Length = 164

 Score = 58.9 bits (141), Expect = 8e-009
 Identities = 43/127 (33%), Positives = 60/127 (46%), Gaps = 2/127 (1%)

Query: 11  LHAIAAEVPHGED-VKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC-PTFKGCVIN 68
           L  I A V   +D +++LV D                 +GVL+R+ VL   P      IN
Sbjct: 14  LSGICAPVTVFDDHLRELVTDLIDTMRAAPGVGITAAHIGVLQRVFVLELTPGTILTYIN 73

Query: 69  PIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIAAKGLTAFCLQHE 128
           P IT H+   +   EG +S PG T   +R + V V   D+     T  A+G  A C+QHE
Sbjct: 74  PEITSHSPQTMRHVEGSVSMPGFTDEVERPSTVEVRFQDITGAEQTETAEGFHAICIQHE 133

Query: 129 IDHLNGV 135
           ID L+G+
Sbjct: 134 IDQLDGI 140
>ref|NP_269070.1| (NC_002737) putative polypeptide deformylase [Streptococcus
           pyogenes] [Streptococcus pyogenes M1 GAS]
 gb|AAK33791.1| (AE006536) putative polypeptide deformylase [Streptococcus pyogenes
           M1 GAS]
          Length = 136

 Score = 57.0 bits (136), Expect = 3e-008
 Identities = 32/92 (34%), Positives = 49/92 (52%), Gaps = 2/92 (2%)

Query: 46  NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEG 105
           N +G  KRI+++        + NP++     G   + E CLS  G    + R +K+ VE 
Sbjct: 47  NMIGEQKRIVIVSMGFIDLVMFNPVMVSKK-GIYQTKESCLSLSGYRKTQ-RYDKITVEY 104

Query: 106 YDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
            D +W+P  ++  GLTA   QHE+DHL G+ I
Sbjct: 105 LDHNWRPKRLSLTGLTAQICQHELDHLEGILI 136
>ref|NP_563974.1| (NM_101408) expressed protein [Arabidopsis thaliana]
 gb|AAG33973.1|AF250959_1 (AF250959) peptide deformylase-like protein [Arabidopsis thaliana]
          Length = 269

 Score = 56.2 bits (134), Expect = 5e-008
 Identities = 47/156 (30%), Positives = 66/156 (42%), Gaps = 23/156 (14%)

Query: 2   KILKDDAPELHAIAAEVPHGE----DVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVL 57
           +I+    P LH  A EV  GE     ++ ++ D                Q+GV  RIIVL
Sbjct: 82  EIVASGDPVLHEKAREVDPGEIGSERIQKIIDDMIKVMRLAPGVGLAAPQIGVPLRIIVL 141

Query: 58  R--------CPTFK-----------GCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRR 98
                     P  +             ++NP++   ++      EGCLS  G   A +R 
Sbjct: 142 EDTKEYISYAPKEEILAQERRHFDLMVMVNPVLKERSNKKALFFEGCLSVDGFRAAVERY 201

Query: 99  NKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNG 134
            +VVV GYD   + I + A G  A  LQHE DHL+G
Sbjct: 202 LEVVVTGYDRQGKRIEVNASGWQARILQHECDHLDG 237
>sp|Q9FV53|DEFM_ARATH Polypeptide deformylase, mitochondrial precursor (PDF)
           (Formylmethionine deformylase)
 gb|AAD39667.1|AC007591_32 (AC007591) Simalar to gi|4377403 Polypeptide Deformylase from
           Chlamydia pneumoniae genome gb|AE001687. [Arabidopsis
           thaliana]
 gb|AAK32873.1|AF361861_1 (AF361861) At1g15390/F9L1_34 [Arabidopsis thaliana]
          Length = 259

 Score = 56.2 bits (134), Expect = 5e-008
 Identities = 47/156 (30%), Positives = 66/156 (42%), Gaps = 23/156 (14%)

Query: 2   KILKDDAPELHAIAAEVPHGE----DVKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVL 57
           +I+    P LH  A EV  GE     ++ ++ D                Q+GV  RIIVL
Sbjct: 72  EIVASGDPVLHEKAREVDPGEIGSERIQKIIDDMIKVMRLAPGVGLAAPQIGVPLRIIVL 131

Query: 58  R--------CPTFK-----------GCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRR 98
                     P  +             ++NP++   ++      EGCLS  G   A +R 
Sbjct: 132 EDTKEYISYAPKEEILAQERRHFDLMVMVNPVLKERSNKKALFFEGCLSVDGFRAAVERY 191

Query: 99  NKVVVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNG 134
            +VVV GYD   + I + A G  A  LQHE DHL+G
Sbjct: 192 LEVVVTGYDRQGKRIEVNASGWQARILQHECDHLDG 227
>ref|NP_518191.1| (NC_003295) PROBABLE POLYPEPTIDE DEFORMYLASE PROTEIN [Ralstonia
           solanacearum]
 emb|CAD13598.1| (AL646057) PROBABLE POLYPEPTIDE DEFORMYLASE PROTEIN [Ralstonia
           solanacearum]
          Length = 169

 Score = 55.1 bits (131), Expect = 1e-007
 Identities = 40/138 (28%), Positives = 60/138 (42%), Gaps = 4/138 (2%)

Query: 1   MKILKDDAPELHAIAAEVPHGED-VKDLVLDXXXXXXXXXXXXXXXNQVGVLKRIIVLRC 59
           + IL+   P LH +A  V   +D ++ LV D                QV V +R+I +  
Sbjct: 4   LNILQYPDPRLHKVAKPVAVVDDRIRKLVADMAETMYDAPGIGLAATQVDVHERVITIDV 63

Query: 60  PTFKG---CVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGYDMDWQPITIA 116
              +      INP I   ++      EGCLS P      +R ++V V   +   +   + 
Sbjct: 64  SESRDELRVFINPEIVWASEARKVWDEGCLSVPDIYDKVERPDRVRVRALNEKGESFELE 123

Query: 117 AKGLTAFCLQHEIDHLNG 134
             GL A C+QHE+DHL G
Sbjct: 124 TDGLLAVCIQHEMDHLMG 141
>ref|NP_345996.1| (NC_003028) polypeptide deformylase [Streptococcus pneumoniae
           TIGR4]
 ref|NP_359001.1| (NC_003098) Formylmethionine deformylase [Streptococcus pneumoniae
           R6]
 gb|AAK75636.1| (AE007449) polypeptide deformylase [Streptococcus pneumoniae TIGR4]
 gb|AAL00212.1| (AE008510) Formylmethionine deformylase [Streptococcus pneumoniae
           R6]
 gb|AAK13237.1| (AY014508) peptide deformylase-like protein [Streptococcus
           pneumoniae]
          Length = 136

 Score = 52.4 bits (124), Expect = 7e-007
 Identities = 31/89 (34%), Positives = 43/89 (47%), Gaps = 2/89 (2%)

Query: 46  NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEG 105
           N +GV KR+I+         + NP++    +G   + EGCLS  G   + KR   + +  
Sbjct: 47  NMIGVQKRVIIFNLGLVPVVMFNPVLLSF-EGSYEAEEGCLSLVGVR-STKRYETIRLAY 104

Query: 106 YDMDWQPITIAAKGLTAFCLQHEIDHLNG 134
            D  WQ  TI   G  A   QHE+DHL G
Sbjct: 105 RDSKWQEQTITLTGFPAQICQHELDHLEG 133
>ref|NP_360311.1| (NC_003103) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
           conorii]
 gb|AAL03212.1| (AE008626) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
           conorii]
          Length = 183

 Score = 50.8 bits (120), Expect = 2e-006
 Identities = 34/95 (35%), Positives = 49/95 (50%), Gaps = 4/95 (4%)

Query: 46  NQVGVLKRIIVLRC----PTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKV 101
           N VG+LKRI V+       +     INP IT  ++      EG LS+PG   +  R   +
Sbjct: 54  NMVGILKRIAVVDLHENNKSSPIVFINPNITYFSEEKQTFIEGSLSFPGIEASITRSKAI 113

Query: 102 VVEGYDMDWQPITIAAKGLTAFCLQHEIDHLNGVT 136
            V+  D +     +AA+G  A  +QHEI++LNG T
Sbjct: 114 KVKYLDYNGNKQELAAEGFLATVIQHEIEYLNGKT 148
>ref|NP_229461.1| (NC_000853) polypeptide deformylase [Thermotoga maritima]
 sp|P96113|DEF_THEMA Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 pir||C72224 polypeptide deformylase - Thermotoga maritima (strain MSB8)
 emb|CAA71356.1| (Y10306) polypeptide deformylase [Thermotoga maritima]
 gb|AAD36728.1|AE001808_3 (AE001808) polypeptide deformylase [Thermotoga maritima]
          Length = 164

 Score = 50.1 bits (118), Expect = 4e-006
 Identities = 31/91 (34%), Positives = 44/91 (48%)

Query: 47  QVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVVVEGY 106
           QVG+ +R  V+        VINP I         + EGCLS+P   V  +R  ++ V+  
Sbjct: 49  QVGISQRFFVMDVGNGPVAVINPEILEIDPETEVAEEGCLSFPEIFVEIERSKRIKVKYQ 108

Query: 107 DMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
           +   + +    +G  A   QHE DHLNGV I
Sbjct: 109 NTRGEYVEEELEGYAARVFQHEFDHLNGVLI 139
>ref|NP_561676.1| (NC_003366) polypeptide deformylase [Clostridium perfringens]
 dbj|BAB80466.1| (AP003188) polypeptide deformylase [Clostridium perfringens]
          Length = 136

 Score = 46.6 bits (109), Expect = 4e-005
 Identities = 30/93 (32%), Positives = 47/93 (50%), Gaps = 4/93 (4%)

Query: 46  NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPGKTVAKKRRNKVV-VE 104
           N +GV KRI+V         ++NP+I +    +  + E CLS  G    K  R +++ VE
Sbjct: 47  NMIGVKKRILVFSIGNIIVPMVNPVILKKEKSYE-TEESCLSLTG--FRKTTRYEIIEVE 103

Query: 105 GYDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
             D +++       G TA  +QHE+DH  G+ I
Sbjct: 104 YLDRNFKKHKETFSGFTAQIIQHEVDHFEGIII 136
>sp|O08450|DEF_CLOBE Polypeptide deformylase (PDF) (Formylmethionine deformylase)
 emb|CAB09662.1| (Z96934) formylmethionine deformylase; peptide deformylase
           [Clostridium beijerinckii]
          Length = 136

 Score = 43.9 bits (102), Expect = 3e-004
 Identities = 32/94 (34%), Positives = 45/94 (47%), Gaps = 6/94 (6%)

Query: 46  NQVGVLKRIIVLRCPTFKGCVINPIITRHTDGHVYSPEGCLSYPG--KTVAKKRRNKVVV 103
           N +GV KRI+V         +INP+I +    +  + E CLS  G  KT   KR   + V
Sbjct: 47  NMIGVKKRILVFTVGNLIVPMINPVILKKEKPYE-TEESCLSLIGFRKT---KRYETIEV 102

Query: 104 EGYDMDWQPITIAAKGLTAFCLQHEIDHLNGVTI 137
              D ++        G TA  +QHE+DH  G+ I
Sbjct: 103 TYLDRNFNKKKQVFNGFTAQIIQHEMDHFEGIII 136
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.322 0.141 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 70,984,581 Number of Sequences: 884779 Number of extensions: 2425202 Number of successful extensions: 3955 Number of sequences better than 5.0e-02: 98 Number of HSP's better than 0.1 without gapping: 83 Number of HSP's successfully gapped in prelim test: 15 Number of HSP's that attempted gapping in prelim test: 3851 Number of HSP's gapped (non-prelim): 99 length of query: 137 length of database: 277,083,050 effective HSP length: 113 effective length of query: 24 effective length of database: 177,103,023 effective search space: 4250472552 effective search space used: 4250472552 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.9 bits) S2: 83 (36.6 bits) Query= orf63 [14990-15085] (94 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.320 0.137 7.19e+252 Gapped Lambda K H 0.267 0.0410 8.76e+252 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 42,933,908 Number of Sequences: 884779 Number of extensions: 1333473 Number of successful extensions: 4398 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 4398 Number of HSP's gapped (non-prelim): 0 length of query: 94 length of database: 277,083,050 effective HSP length: 70 effective length of query: 24 effective length of database: 215,148,520 effective search space: 5163564480 effective search space used: 5163564480 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 84 (37.0 bits) Query= orf64 [15068-15204] (135 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.324 0.140 8.76e+252 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 92,315,289 Number of Sequences: 884779 Number of extensions: 3835217 Number of successful extensions: 6623 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 6623 Number of HSP's gapped (non-prelim): 0 length of query: 135 length of database: 277,083,050 effective HSP length: 111 effective length of query: 24 effective length of database: 178,872,581 effective search space: 4292941944 effective search space used: 4292941944 T: 11 A: 40 X1: 15 ( 7.0 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (22.0 bits) S2: 83 (36.6 bits) Query= orf65 [15212-16004] (791 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done


                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|NP_489430.1|  (NC_003273) ORF_ID:all8519~hypothetical pr...    99  2e-019
gb|AAB00938.1|  (L31763) virulence-associated protein E [Dic...    85  3e-015
ref|NP_371315.1|  (NC_002758) hypothetical protein [Staphylo...    85  3e-015
gb|AAL67615.1|  (U93688) orf15 [Staphylococcus aureus]             85  3e-015
gb|AAL04136.1|AF410775_12  (AF410775) unknown [Staphylococcu...    84  6e-015
ref|NP_569125.1|  (NC_003374) hypothetical protein [Gluconob...    83  1e-014
pir||S49113  hypothetical protein 2 - Microcystis aeruginosa...    81  5e-014
ref|NP_279342.1|  (NC_002607) Vng0215c [Halobacterium sp. NR...    55  2e-006
>ref|NP_489430.1| (NC_003273) ORF_ID:all8519~hypothetical protein [Nostoc sp. PCC
           7120]
 dbj|BAB77438.1| (AP003604) ORF_ID:all8519~hypothetical protein [Nostoc sp. PCC
           7120]
          Length = 836

 Score = 98.6 bits (244), Expect = 2e-019
 Identities = 87/317 (27%), Positives = 147/317 (45%), Gaps = 20/317 (6%)

Query: 466 KSDIKGMIAYMAHRYAHNPVKAWVESAP--WDGLDHVGLLFSHITLTPDEDRPLCEHLFR 523
           K   K ++  +A   +++PV  +++S    +  +D V LL          + PL   L +
Sbjct: 341 KEQAKSVVLELALANSYSPVVEYLQSVERRYPNVD-VSLLDKLAERYFGTNDPLHAALMK 399

Query: 524 KWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDSVLLDVKDKDSVK 583
           + + AAV+         + + +   +Q  + K+ F+  L  E F TD   L+  +KD V 
Sbjct: 400 RTLIAAVARAFEPGCKHDEITILQGKQKSL-KSTFWEILAGEEFFTDD--LNGTEKDEVL 456

Query: 584 TAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEA 643
               YW++E  E + ++ + + + LKAFLSR KD +R PYG     +PR +  VG+ N  
Sbjct: 457 KISQYWILEYAEFENSYKKKDVSQLKAFLSRKKDSMRRPYGTDIEDFPRPSILVGTTNLD 516

Query: 644 EFLVDTTSNRRFIPMKVAALNHQHRVNVQQ---VWAQALQEARSG---MATYVEAHEVAQ 697
           EFL D T  RRF  +KV       ++  Q+   +WA A+   R+G     T VE   +  
Sbjct: 517 EFLYDPTGERRFWVIKVLFKKIPIKMLQQERDLIWAAAVAAYRNGEQWRLTEVEDEWLDA 576

Query: 698 RNASFQATSAIDDLLSERLNEATGERNVHMTVTALLK---RCGLHNPTKRDLNDAGKWLR 754
            N  +Q+T   ++ +      A  E    ++V  +L    +  L   +K + N     LR
Sbjct: 577 ANKQYQSTDTWEEAVM-----AWAELQKEVSVGEILSDVLKIELAKQSKAEQNRVAAILR 631

Query: 755 SNGYERCVRTGVRGFML 771
           S+G+ R  R  V G ++
Sbjct: 632 SHGWTRGDRKRVNGRLI 648
>gb|AAB00938.1| (L31763) virulence-associated protein E [Dichelobacter nodosus]
          Length = 437

 Score = 85.1 bits (209), Expect = 3e-015
 Identities = 71/250 (28%), Positives = 121/250 (48%), Gaps = 20/250 (8%)

Query: 465 PKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTPDEDRPLCEHLFRK 524
           P   I   I  +A  +   P++ ++ES  WDG   +   F        E       +   
Sbjct: 96  PIDSIDMGIKMLAREHIFCPIQQYLESLQWDGKPRIATAFHDFLGV--ERNTYTIGISTL 153

Query: 525 WMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDSVLLDVKDKDSVKT 584
           + RA V+         + V+V   +QG +GK+ F R L  E +     +   + K++++ 
Sbjct: 154 FFRALVARVYQPAIKFDHVIVLQGDQG-IGKSSFLRILGGEWYSDG--IRKFEGKEAIEA 210

Query: 585 AISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEAE 644
                + E+ EL G FS++E   +KAF++RT+D  R+ YGR    +PR T +VG+ N  +
Sbjct: 211 IQGVLIGEIPELQG-FSKAEIEEIKAFITRTEDRARMAYGRRVQSFPRRTVFVGTTNADD 269

Query: 645 FLVDTTSNRRFIPMKV-AALNHQHRVNV-QQVWAQALQEARSGMATYV----------EA 692
           +L D T NRRF P+    AL+ +    V +Q++A+A+   ++  A YV          E 
Sbjct: 270 YLRDATGNRRFFPIICHKALDFEALAEVREQLFAEAVVHYKA--APYVPLILQGKEALEQ 327

Query: 693 HEVAQRNASF 702
            ++AQ+ AS+
Sbjct: 328 AKLAQQAASY 337
>ref|NP_371315.1| (NC_002758) hypothetical protein [Staphylococcus aureus subsp.
           aureus Mu50]
 dbj|BAB56953.1| (AP003360) hypothetical protein [Staphylococcus aureus subsp.
           aureus Mu50]
          Length = 489

 Score = 84.7 bits (208), Expect = 3e-015
 Identities = 65/234 (27%), Positives = 107/234 (44%), Gaps = 14/234 (5%)

Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
           +S I+ + N+  + +  I  +I   A++   +P+K+ +ES  WDG+  +  LF  I    
Sbjct: 95  ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 151

Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
            ED      + +KWM  AV+         + +++    QG VGK+     L    +    
Sbjct: 152 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 209

Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
            +   K  +  K     W+ E+ EL   F +S    +K F+S   D  R  YG+   ++P
Sbjct: 210 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 267

Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPMKV-------AALNHQHRVNVQQVWAQA 678
           R   +VG+ N  EFL D T NRRF P+         +  +    V VQQ++A+A
Sbjct: 268 RQCVFVGTTNNYEFLKDQTGNRRFFPITTDKNKATKSPFDDLTPVVVQQMFAEA 321
>gb|AAL67615.1| (U93688) orf15 [Staphylococcus aureus]
          Length = 477

 Score = 84.7 bits (208), Expect = 3e-015
 Identities = 65/234 (27%), Positives = 107/234 (44%), Gaps = 14/234 (5%)

Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
           +S I+ + N+  + +  I  +I   A++   +P+K+ +ES  WDG+  +  LF  I    
Sbjct: 87  ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 143

Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
            ED      + +KWM  AV+         + +++    QG VGK+     L    +    
Sbjct: 144 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 201

Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
            +   K  +  K     W+ E+ EL   F +S    +K F+S   D  R  YG+   ++P
Sbjct: 202 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 259

Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPMKV-------AALNHQHRVNVQQVWAQA 678
           R   +VG+ N  EFL D T NRRF P+         +  +    V VQQ++A+A
Sbjct: 260 RQCVFVGTTNNYEFLKDQTGNRRFFPITTDKNKATKSPFDDLTPVVVQQMFAEA 313
>gb|AAL04136.1|AF410775_12 (AF410775) unknown [Staphylococcus aureus]
          Length = 474

 Score = 84.0 bits (206), Expect = 6e-015
 Identities = 59/207 (28%), Positives = 96/207 (45%), Gaps = 7/207 (3%)

Query: 452 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 511
           +S I+ + N+  + +  I  +I   A++   +P+K+ +ES  WDG+  +  LF  I    
Sbjct: 84  ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 140

Query: 512 DEDRPLCEHLFRKWMRAAVSAGVGDQAGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRTDS 571
            ED      + +KWM  AV+         + +++    QG VGK+     L    +    
Sbjct: 141 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 198

Query: 572 VLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 631
            +   K  +  K     W+ E+ EL   F +S    +K F+S   D  R  YG+   ++P
Sbjct: 199 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 256

Query: 632 RMTAYVGSVNEAEFLVDTTSNRRFIPM 658
           R   +VG+ N  EFL D T NRRF P+
Sbjct: 257 RQCVFVGTTNNYEFLKDQTGNRRFFPI 283
>ref|NP_569125.1| (NC_003374) hypothetical protein [Gluconobacter oxydans]
 emb|CAD21834.1| (AJ428837) hypothetical protein [Gluconobacter oxydans]
          Length = 416

 Score = 82.8 bits (203), Expect = 1e-014
 Identities = 73/248 (29%), Positives = 108/248 (43%), Gaps = 35/248 (14%)

Query: 463 HYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGL-LFSHITLTPDEDRPLCEHL 521
           ++    +   I  +A   A++P++ ++    WDG+  +   L   +     E       L
Sbjct: 99  NFSTQQVSAAIQSVAEENAYHPIRDYLTGLEWDGVQRLDTWLHDFLGAEDSEYTAGVGRL 158

Query: 522 FRKWMRAAVSAGVGDQAGC--EPVLVFVDEQGG---------VGKTRFFRTLCPEPFRTD 570
           F   M A V      + GC  + VL+F   QG           G +R+F    P P R  
Sbjct: 159 FLIGMVARVM-----EPGCKNDHVLIFEGNQGIRKSTACKVLAGGSRYFSDSLP-PIREG 212

Query: 571 SVLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKY 630
           S       KD        WLVE+ E+    SR+++ ALKAFL+RT +  R  YGR  +  
Sbjct: 213 S-------KDLASHLAGKWLVEVAEMSAA-SRADTEALKAFLTRTHETYRPAYGRNEITQ 264

Query: 631 PRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVNVQQVWAQALQEARSGMATYV 690
           PR    VG+ N  E+L D+T +RR  P+K         VN +      LQEAR  +  + 
Sbjct: 265 PRQCVLVGTTNRKEYLRDSTGDRRVWPVKCG-------VNGEPCDIDGLQEARDQL--FA 315

Query: 691 EAHEVAQR 698
           EA  + QR
Sbjct: 316 EAFHLYQR 323
>pir||S49113 hypothetical protein 2 - Microcystis aeruginosa
 emb|CAA82191.1| (Z28337) ORF2 [Microcystis aeruginosa]
          Length = 502

 Score = 80.9 bits (198), Expect = 5e-014
 Identities = 67/218 (30%), Positives = 106/218 (47%), Gaps = 21/218 (9%)

Query: 555 KTRFFRTLCPEPFRTDSVLLDVKDKDSVKTAISYWLVELGELDGTFSRSESNALKAFLSR 614
           K+ FF+TL  E F TD+++    DKD + T    W++E  ELD   +++    +K+FLSR
Sbjct: 278 KSTFFKTLGGEYF-TDAMMST--DKDGLLTMAKNWIIEWSELDHFTAKTYHGIIKSFLSR 334

Query: 615 TKDDIRLPYGRTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVNVQQV 674
           + D  R PY R  M +PR +  VGS N   FL D T NRRF    +  +  +H++N+   
Sbjct: 335 SDDFYREPYAREAMSHPRCSMIVGSTNRDTFLNDPTGNRRF---WIIPIPKKHKINID-- 389

Query: 675 WAQALQEARSGMAT--YVEAHEV----------AQRNASFQATSAIDDLLSERLNEATGE 722
           + QA +E   G A   Y++              A+ N  ++   + +D L+E L+  + +
Sbjct: 390 YLQAYREKLLGWAVWRYLDGEPCQLPEEFKALQAEANKQWENNDSWEDELAEFLDRES-D 448

Query: 723 RNVHMTVTALLKRCGLHNPTKRDLNDAGKWLRSNGYER 760
            +V   +  L  R    N  + D    G  LR  G+ R
Sbjct: 449 LSVKECLQRLEDRGYPINFGRSDEMRMGDILRKAGFSR 486
>ref|NP_279342.1| (NC_002607) Vng0215c [Halobacterium sp. NRC-1]
 gb|AAG18822.1| (AE004986) Vng0215c [Halobacterium sp. NRC-1]
          Length = 857

 Score = 55.5 bits (132), Expect = 2e-006
 Identities = 33/112 (29%), Positives = 55/112 (48%), Gaps = 11/112 (9%)

Query: 175 EHDSAERVEITDQEAQTVREALSYIDPGDKREEWVKVGLALKSGFGDDPQGLSLFDEWSS 234
           ++D  E ++ +D     + +ALS+++P    +EW+K+G A+   +     G  LF++WS 
Sbjct: 212 DYDGDEWLDKSD-----IEDALSHVNPDCGYDEWLKLGFAVHD-WDSGSTGKRLFEQWSK 265

Query: 235 GALWQDGDMPANYVPEHIETQWHSFKAEGGRTIATVYHKAIEGGWQPPAGIN 286
           G    D     N    +I+  W S     G T+ T+ H A +GGW  P   N
Sbjct: 266 GGSKWD-----NQAERNIQDIWDSASEGQGVTVGTLIHYAKDGGWTVPTRSN 312
Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.317 0.134 5.49e-269 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 449,803,643 Number of Sequences: 884779 Number of extensions: 19827664 Number of successful extensions: 52924 Number of sequences better than 5.0e-02: 14 Number of HSP's better than 0.1 without gapping: 10 Number of HSP's successfully gapped in prelim test: 4 Number of HSP's that attempted gapping in prelim test: 52901 Number of HSP's gapped (non-prelim): 17 length of query: 791 length of database: 277,083,050 effective HSP length: 128 effective length of query: 663 effective length of database: 163,831,338 effective search space: 108620177094 effective search space used: 108620177094 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 95 (41.2 bits) Query= orf66 [16203-16434] (230 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.319 0.132 5.03e+175 Gapped Lambda K H 0.267 0.0410 0.000 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 104,471,067 Number of Sequences: 884779 Number of extensions: 3759278 Number of successful extensions: 11290 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 11290 Number of HSP's gapped (non-prelim): 0 length of query: 230 length of database: 277,083,050 effective HSP length: 116 effective length of query: 114 effective length of database: 174,448,686 effective search space: 19887150204 effective search space used: 19887150204 T: 11 A: 40 X1: 16 ( 7.4 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.8 bits) S2: 89 (38.9 bits) Query= orf67 [16507-16558] (51 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF 884,779 sequences; 277,083,050 total letters Searching done Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF Posted date: Feb 23, 2002 3:08 AM Number of letters in database: 277,083,050 Number of sequences in database: 884,779 Lambda K H 0.315 0.128 0.000 Gapped Lambda K H 0.267 0.0410 5.49e-269 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 19,334,521 Number of Sequences: 884779 Number of extensions: 414668 Number of successful extensions: 1433 Number of sequences better than 5.0e-02: 0 Number of HSP's better than 0.1 without gapping: 0 Number of HSP's successfully gapped in prelim test: 0 Number of HSP's that attempted gapping in prelim test: 1433 Number of HSP's gapped (non-prelim): 0 length of query: 51 length of database: 277,083,050 effective HSP length: 27 effective length of query: 24 effective length of database: 253,194,017 effective search space: 6076656408 effective search space used: 6076656408 T: 11 A: 40 X1: 16 ( 7.3 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 41 (21.6 bits) S2: 84 (37.0 bits)