National Center for Biotechnology Information (NCBI)
welcome to the blast network service.
PEPTIDE SEQUENCE DATABASES
nr Non-redundant GenBank CDS translations+PDB+SwissProt+PIR
month All new or revised GenBank CDS translation+PDB+SwissProt+PIR
sequences released in the last 30 days
pdb PDB protein sequences
yeast Yeast (Saccharomyces cerevisiae) protein sequences.
kabat Kabat Sequences of Proteins of Immunological Interest
alu * Translations of Select Alu Repeats from REPBASE
swissprot SwissProt sequences
NUCLEOTIDE SEQUENCE DATABASES
nr Non-redundant GenBank+EMBL+DDBJ+PDB sequences
(but no EST, STS, GSS, or HTGS sequences)
month All new or revised GenBank+EMBL+DDBJ+PDB sequences released in
the last 30 days
yeast Yeast (Saccharomyces cerevisiae) genomic nucleotide sequences.
est + Non-redundant Database of GenBank+EMBL+DDBJ EST Division
sts + Non-redundant Database of GenBank+EMBL+DDBJ STS Division
htgs High Throughput Genomic Sequences
pdb PDB nucleotide sequences
vector Vector subset of GenBank
mito * Database of mitochondrial sequences, Rel. 1.0, July 1995
gss Genome Survey Sequences (includes single-pass genomic data, exon-
trapped sequences, and Alu PCR primers.
kabat Kabat Sequences of Nucleic Acid of Immunological Interest
epd Eukaryotic Promotor Database
alu *+ Select Alu Repeats from REPBASE
BLASTP 2.2.3 [Apr-24-2002]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= orf1 [1-28]
(27 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.321 0.140 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 21,062,022
Number of Sequences: 1028106
Number of extensions: 378603
Number of successful extensions: 459
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 459
Number of HSP's gapped (non-prelim): 0
length of query: 27
length of database: 324,535,921
effective HSP length: 3
effective length of query: 24
effective length of database: 321,451,603
effective search space: 7714838472
effective search space used: 7714838472
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf2 [41-279]
(237 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.319 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 103,298,903
Number of Sequences: 1028106
Number of extensions: 3022093
Number of successful extensions: 8164
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8164
Number of HSP's gapped (non-prelim): 0
length of query: 237
length of database: 324,535,921
effective HSP length: 118
effective length of query: 119
effective length of database: 203,219,413
effective search space: 24183110147
effective search space used: 24183110147
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 89 (38.9 bits)
Query= orf3 [273-947]
(673 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAA89640.1| (AB036666) similar to terminase large subun... 210 5e-053
dbj|BAA89621.1| (AB036665) similar to terminase large subun... 206 1e-051
ref|NP_518974.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 205 2e-051
ref|NP_297798.1| (NC_002488) phage-related terminase large ... 123 6e-027
ref|NP_461544.1| (NC_003197) Gifsy-1 prophage: similar to t... 112 1e-023
sp|P36693|TERL_BPP21 TERMINASE LARGE SUBUNIT (GP2) >gi|5407... 102 1e-020
ref|NP_310206.1| (NC_002695) putative terminase large subun... 97 6e-019
ref|NP_287368.1| (NC_002655) putative DNA packaging protein... 96 1e-018
ref|NP_040581.1| (NC_001416) DNA packaging protein [Bacteri... 95 2e-018
ref|NP_046897.1| (NC_001901) gp2 [Bacteriophage N15] >gi|74... 93 9e-018
emb|CAA26343.1| (X02501) gp 2 (aa 1-309) [Bacteriophage 21]... 73 1e-011
>dbj|BAA89640.1| (AB036666) similar to terminase large subunit of phage lambda
[Wolbachia sp. wKue]
Length = 609
Score = 210 bits (534), Expect = 5e-053
Identities = 172/615 (27%), Positives = 276/615 (43%), Gaps = 80/615 (13%)
Query: 29 LTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L S+WA + R L + PG + + PY +E++D P SP V KGAQ+G T
Sbjct: 17 LKVSEWANEYRVLAPTAASEPGKWRTERTPYLKEIMDSLSPSSPTEKVVFMKGAQIGGTE 76
Query: 89 SVLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGA 148
+ N IGY ID M+ ++ KR P++ + +S ++ +
Sbjct: 77 AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPCLKSKVK----DPRS 130
Query: 149 KRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTE 208
+ G + + E+ GG V+ GA S +RS PV +L DE+ +P G +GDP+ L+
Sbjct: 131 RDSGNTVQSKEFPGGIVVIT-GANSSVALRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSI 189
Query: 209 TRTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQRYYEVPCKHCGEYQRLYFRGNAKNG 268
RTN+FA RKI +STP + G I + FE D+RY+ VPC HC YQ L +
Sbjct: 190 ARTNTFAR-RKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW------- 241
Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIMGEGNWVPTAKPTRPDFRSYHLS 328
++ WE + YVC C G++ N K ++ G W PT + + + + +HLS
Sbjct: 242 -SQIKWENSDS----RTAHYVCTECSGKIENHQKTEMLDRGEWRPTNR-VKGEKKGFHLS 295
Query: 329 AMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREVY 388
++Y+P SWQ+ + ++ + E+ L+V+ N LGE + K ++++
Sbjct: 296 SLYSPVGWYSWQQAVEDFLHAKESEQ-------LLKVWINTTLGETWVDKGEVPDWKQLF 348
Query: 389 AHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYIID 448
R + G VP + +T VDVQ L V + W S + + ID
Sbjct: 349 NRREFFQIGTVPRREVV---------LTAGVDVQKDRLEVEVVAWGKSRE-----SWSID 394
Query: 449 YEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDASELTDVVYEYC--NDW 506
Y+ EGD T GE W KL+E+++ G Y I+ + +DA T VY + +
Sbjct: 395 YQVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452
Query: 507 GENVMPIRG-------------RDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRWSP 553
VM ++G D+ + G ++K +G++ V V + K
Sbjct: 453 SGRVMAVKGVNKALVPLSSPSRVDITVGGQKLK--------RGIKLWPVGVSILKSELFQ 504
Query: 554 TLRKEWSGQGEMPRGMLSAPVDMPDKHLKELTVEYKREERDPDTNKII----RQVWHRPG 609
L +G+ G P P+ + K+LT E +K++ +Q W +
Sbjct: 505 LLNILKEEEGKALPGYCHFPEYAPE-YFKQLTAEQL-------VSKVVKGYTKQEWQKV- 555
Query: 610 GSRNELWDTLIYNTA 624
RNE+ D IY A
Sbjct: 556 RERNEVLDCRIYARA 570
>dbj|BAA89621.1| (AB036665) similar to terminase large subunit of lambda
[bacteriophage WO]
Length = 610
Score = 206 bits (523), Expect = 1e-051
Identities = 170/613 (27%), Positives = 275/613 (44%), Gaps = 78/613 (12%)
Query: 29 LTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
L S+WA + R L + PG + + PY +E++D P SP V KGAQ+G T
Sbjct: 17 LKVSEWANEYRVLAATAASEPGKWRTERTPYLKEIMDSLSPSSPAEKVIFMKGAQIGGTE 76
Query: 89 SVLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGA 148
+ N IGY ID M+ ++ KR P++ + +S ++ +
Sbjct: 77 AG-NNWIGYIIDQTP-GPMLVVQPTVEMGKRWSKGRFAPLIESTPCLKSKVK----DPRS 130
Query: 149 KRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTE 208
+ G + + E+ GG V+ GA S +RS PV +L DE+ +P G +GDP+ L+
Sbjct: 131 RDSGNTVQSKEFPGGIVVIT-GANSSVGLRSMPVKYLFLDEIDAYPGDSGGEGDPVLLSI 189
Query: 209 TRTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQRYYEVPCKHCGEYQRLYFRGNAKNG 268
RTN+FA RKI +STP + G I + FE D+RY+ VPC HC YQ L +
Sbjct: 190 ARTNTFAR-RKIFLVSTPTIHGISRIEKEFEATDKRYFFVPCPHCNYYQVLKW------- 241
Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIMGEGNWVPTAKPTRPDFRSYHLS 328
++ WE + P++ Y+C CG ++ N K ++ G W T + + + +HLS
Sbjct: 242 -SQIKWEDKN----PNTAHYICIECGKKIENHQKTEMLDRGEWRAT-EAKEGEKKGFHLS 295
Query: 329 AMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREVY 388
++Y+P SW + + ++ + E+ L+V+ N LGE + K ++++
Sbjct: 296 SLYSPVGWYSWTQAVEDFLHAKESEQ-------LLKVWINTTLGETWVDKGEVPDWKQLF 348
Query: 389 AHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYIID 448
R + G VP + +T VDVQ L V + W S + + ID
Sbjct: 349 NRREFFPVGTVPRREVV---------LTAGVDVQEDRLEVEVVAWGKSREN-----WSID 394
Query: 449 YEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDASELTDVVYEYC--NDW 506
Y EGD T GE W KL+E+++ G Y I+ + +DA T VY + +
Sbjct: 395 YRVFEGD--TGGGEVWGKLSELLNHHFIGENGLEYMISMMAVDAGYATQEVYNWVRGHQG 452
Query: 507 GENVMPIRG-----------RDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRWSPTL 555
VM ++G D+ + G ++K +G++ V V + K L
Sbjct: 453 SGRVMAVKGVALVPLSSPSRVDITVGGQKLK--------RGIKLWPVGVSILKSELFQLL 504
Query: 556 RKEWSGQGEMPRGMLSAPVDMPDKHLKELTVEYKREERDPDTNKII----RQVWHRPGGS 611
+G+ G P P+ + K+LT E +K++ +Q W +
Sbjct: 505 NVLKEEEGKALPGYCHFPEYAPE-YFKQLTAEQL-------VSKVVKGYTKQEWQKV-RE 555
Query: 612 RNELWDTLIYNTA 624
RNE+ D IY A
Sbjct: 556 RNEVLDCRIYARA 568
>ref|NP_518974.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14555.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 660
Score = 205 bits (521), Expect = 2e-051
Identities = 176/620 (28%), Positives = 275/620 (43%), Gaps = 57/620 (9%)
Query: 29 LTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATV 88
LT S+W+++ R L + S PG + PY R ++DC P SP+ V K AQ+GAT
Sbjct: 26 LTVSEWSDRHRMLSSKASAEPGRWRTGRTPYLRAIMDCLSPTSPIERVVFMKAAQLGAT- 84
Query: 89 SVLENLIGYGIDYVKTASM-IFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKG 147
+ N IGY I + M ++ TV ++ KR I P++ S + A +
Sbjct: 85 EMGSNWIGYVIHHAPGPMMAVWPTV--EMAKRNSKQRIDPLI-----EESAVLAERIAPA 137
Query: 148 AKRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLT 207
R + + GGVL GA S +RS PV +L DEV G+P V +GD + L
Sbjct: 138 RSRDSGNTILAKEFRGGVLVMTGANSAVGLRSMPVRYLFLDEVDGYPLDVEGEGDAISLA 197
Query: 208 ETRTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQRYYEVPCKHCGEYQRLYFRGNAKN 267
E RT +FA RKI +STP +AG TI + ++ DQR Y VPC HC Q L F
Sbjct: 198 EARTRTFAR-RKIFIVSTPTIAGASTIEREYDASDQRRYFVPCPHCDHRQWLRFE----- 251
Query: 268 GQGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIMGEGNWVPTAKPTRPDFRSYHL 327
+L W T+G P++ Y+C C + K ++ +G W + T +HL
Sbjct: 252 ---QLRW-TKG---EPETAAYICEACFEPIHEHHKAWMLAQGEWRAMVE-TGGRTAGFHL 303
Query: 328 SAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREV 387
S++Y+P RSW+EIA AW E A ++ F N +LGE + + + +
Sbjct: 304 SSLYSPVGWRSWREIAAAWESAVSKESGSAA---AIKTFRNTELGETWVEEGEAPDWQRL 360
Query: 388 YAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYII 447
R DY G VP + G ++ + V V WA G +++
Sbjct: 361 LERREDYPIGTVPAGGLLLSAGADVQKDRIEVSV-----------WAFG---RGKEAWLV 406
Query: 448 DYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDASELTDVVYEYCNDWG 507
++ + GD T W++L E++ + + G +AR+ +D T Y + G
Sbjct: 407 EHRVLMGD--TARDAVWKRLAELVEEQWTHASGATMPLARLALDTGFATQEAYAFVRACG 464
Query: 508 E-NVMPIRG--RDLPIKGAQIKY---FNRQVNEKGVEYLSVTVDLYKDRWSPTLRK--EW 559
+ VM ++G R + G N + +G++ +V V + K + LRK +
Sbjct: 465 DARVMAVKGTARGAALIGTPTAVDVTRNGKKLRRGIKVFTVAVGIAKLEFYNNLRKAADV 524
Query: 560 SGQG---EMPRGMLSAPVDMPDKHLKELTVEYKREERDPDTNKIIRQVWHRPGGSRNELW 616
+ G P G + P + + L++L E RD I + W + RNE
Sbjct: 525 AKDGATIAFPTGFVHLP-KIDAEFLQQLCAEQLITRRDRSGFPI--RAWQKM-RERNEAL 580
Query: 617 DTLIYNTAIFESMVLEACED 636
D +Y A + L+ E+
Sbjct: 581 DCYVYARAAASAAGLDRFEE 600
>ref|NP_297798.1| (NC_002488) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
pir||D82798 phage-related terminase large subunit XF0508 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF83318.1|AE003899_10 (AE003899) phage-related terminase large subunit [Xylella
fastidiosa 9a5c]
Length = 699
Score = 123 bits (309), Expect = 6e-027
Identities = 160/647 (24%), Positives = 237/647 (35%), Gaps = 122/647 (18%)
Query: 32 SQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVSVL 91
SQWA+ R + + PG + D P RE+++C ++PV V+ K Q+GAT +
Sbjct: 35 SQWADAYRKIARGSGAEPGQWRTDRHPPLREIMNCLSDHTPVQQVSFMKSGQIGAT-EIG 93
Query: 92 ENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGAKRR 151
N + Y ID + ++ V D V F +L L N F K
Sbjct: 94 INWVCYVIDRGIDSMIVTQPVKDLARTWTVAKFDPGVLDMPPLLNKLTTNNTFEKQYP-- 151
Query: 152 GASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTETRT 211
GG L+ A S ++R + DE+ +P+++ G + R
Sbjct: 152 -----------GGTLFVKWANSSSQLRQITACYAFLDEIDEYPRNLNNQGTADQQIAARI 200
Query: 212 NSFANSRKILDLSTPLLAGTDTITQRFEKGDQRYYEVPCKHCGEYQRLYFRGNAKNGQGR 271
S KI TP +AG I F GDQR+Y + C HCG Q L
Sbjct: 201 MSHGERGKIYRACTPTVAGGSAIEASFLDGDQRHYHIHCPHCGSEQILDLEH-------- 252
Query: 272 LIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIM------GEGNWVPTAKPTRPDFRSY 325
L PD + C G + K TI+ G W P PD RSY
Sbjct: 253 ---------LQPDGT-FACAVNGCIIQEHHKNTILKERGTGGTAFWHPHNPSAPPDHRSY 302
Query: 326 HLSAMYAPY-YARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKP 384
+L A YAP SW++IA W E +D +L F N LG ++ +
Sbjct: 303 YLWAAYAPLGLGLSWKQIADKWAEA-------KRDPSKLPGFTNLILGLPFQGERDARNA 355
Query: 385 REVYA-HRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYA 443
EV RG VP GG + + VD+ H V I A G
Sbjct: 356 HEVATLAEPGVYRGLVPR-------GGLV--LAAGVDLAHDRAEVHLI-----ATGRGQR 401
Query: 444 PYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDASELTDVVYEYC 503
YI+DY ++ D + E + L + T+ G I+ V ID T+ V ++
Sbjct: 402 RYIVDYAVIDLDPTVL--ESYADLDAYLRGTWKTASGIDMPISAVAIDGGNWTETVAQFV 459
Query: 504 ND---WG--------------ENVMPIRGR-DLPIKGAQIKYFNRQVNEKGVEYLSVTVD 545
W + + +RGR ++ A + V+E+G + L+ V
Sbjct: 460 KQHVGWSGQSRILDTPQGFLKQTLYLVRGRAEVKSDRAVYRPSKTTVDERG-KTLARDVG 518
Query: 546 LY---------------------KDRWSPT----------LRKEWSGQGEMPRGMLSAPV 574
++ KD + T LR ++P + P
Sbjct: 519 VWGVGTSVLKHMIYGWLGAALTAKDNAAQTGTVEDLSARMLRFPGGRGDDIPDPLHPDPG 578
Query: 575 DMPDKHLKELTVEYKREERDPDTNKIIRQVWHRPGGSRNELWDTLIY 621
+P+ + LT EY D D + W +P G RNE DT +Y
Sbjct: 579 ALPEHYFAGLTAEY----FDKDAGR-----WIKPRGVRNEPLDTAVY 616
>ref|NP_461544.1| (NC_003197) Gifsy-1 prophage: similar to terminase large chain gp2
of N15 [Salmonella typhimurium LT2]
gb|AAL21503.1| (AE008818) Gifsy-1 prophage: similar to terminase large chain gp2
of N15 [Salmonella typhimurium LT2]
Length = 643
Score = 112 bits (280), Expect = 1e-023
Identities = 157/653 (24%), Positives = 266/653 (40%), Gaps = 100/653 (15%)
Query: 9 AESAERVIEGLLATTDEREFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFD 68
A +A R I L LT QWA++ +LP++ S PG ++ P+ +++
Sbjct: 8 ANNANRAITNGLIALHIPVPLTTVQWADEYYYLPKESSYTPG--KWETLPFQVAIMNAMG 65
Query: 69 PYSPVHFVAVKKGAQVGATVSVLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPM 128
Y + V + K A+VG T +L + GY I++ S++F D + + + + P
Sbjct: 66 -YELIRVVNLIKSARVGYTKMLL-GVEGYFIEHKSRNSLLFQPTDSSA-EDFMKSHVEPT 122
Query: 129 LRAAQASRSLIQANDFSKGAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRD 188
+R L A F G K R + ++ G + G + R V + D
Sbjct: 123 IRDVPVLLEL--APWF--GRKHRDNTLTLKRFSSGVGFWCLGGAAAKNYREKSVDVVCYD 178
Query: 189 EVSGWPQSVGKDGDPMKLTETRTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQ--RYY 246
E+S + V K+G P L + R + I STP + G+ I + + R+Y
Sbjct: 179 ELSSFEPDVEKEGSPTLLGDKRIEGSVWPKSIRG-STPKVKGSCQIEKAANESAHFMRFY 237
Query: 247 EVPCKHCGEYQRLYFRGNAKNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIM 306
VPC HCGE Q L F G+ G L WE P++V Y+C H G +I + ++
Sbjct: 238 -VPCPHCGEEQYLKF-GDGSTPFG-LKWEKS----KPETVYYLCEH-NGCVIRQSELD-Q 288
Query: 307 GEGNWV--PTAKPTR---------------PDFRSYHLSAMYAPYYARSWQEIAQAWIEC 349
G W+ T TR P ++H+ Y+P+ +W +I W++
Sbjct: 289 KAGRWICDNTGMWTRDGLAYFSASGEEVPPPRSITFHIWTAYSPF--TTWIQIIYDWLDA 346
Query: 350 WDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTG 409
KD + ++ F N LGE YE + E+ + + VP
Sbjct: 347 -------LKDPNGVKTFINTTLGEPYEEAVAEKLSHELLLEKVIHYAAPVPER------- 392
Query: 410 GPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTE 469
+ +T +D Q + GWAP G ++ID + + G + + + Q++
Sbjct: 393 --VVYLTAGIDSQRNRYEMYVWGWAP-----GEEAFLIDKQIIMG--RHDDEDTLQRVDA 443
Query: 470 IISTRQYTSGGRVYDIARV-----GIDASELTDVVYEYCNDWG-ENVMPIRGRDLPIKGA 523
+I+ + + G I+R+ GIDA ++VY+ G V+P++G + G
Sbjct: 444 VINKKYRHADGTDISISRICWDIGGIDA----EIVYKRSKKHGIFRVLPVKGAS--VYGK 497
Query: 524 QIKYFNRQVNEKGVEYLSVTVDLYKDRWSPTLRKEWSGQGEMPRGMLSAPVD-------- 575
+ ++ N+ GV + D K+ + G ++AP D
Sbjct: 498 PVITMPKKRNQSGVFLCEIGTDTAKEMLYARM------------GAVTAPADEATPYAIR 545
Query: 576 MPDKHLKELTVEYKREERDPDTNKII----RQVWHRPGGSRNELWDTLIYNTA 624
PD VE K+ + K++ R +W G RNE D L+Y +A
Sbjct: 546 FPDNPDVFTEVEAKQLVAEELVEKLVNGKFRLLWDAK-GRRNEALDCLVYASA 597
>sp|P36693|TERL_BPP21 TERMINASE LARGE SUBUNIT (GP2)
pir||B49849 terminase large chain gp2 - phage 21
gb|AAA32340.1| (M81255) terminase large subunit [Bacteriophage 21]
Length = 642
Score = 102 bits (255), Expect = 1e-020
Identities = 153/645 (23%), Positives = 259/645 (39%), Gaps = 75/645 (11%)
Query: 1 MYSASDYYAESAERVIEGLLATTDEREFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYW 60
M S + A +A + GLL+ LT QWA K +LP++ S PG ++ P+
Sbjct: 1 MISDAQKAANAAGAIATGLLSLIIPVP-LTTVQWANKHYYLPKESSYTPG--RWETLPFQ 57
Query: 61 REVVDCFDPYSPVHFVAVKKGAQVGATVSVLENLIGYGIDYVKTASMIFATVDDDVTKRR 120
+++C + V + K A+VG T +L + Y I++ S++F D +
Sbjct: 58 VGIMNCMGN-DLIRTVNLIKSARVGYTKMLL-GVEAYFIEHKSRNSLLFQPTDS-AAEDF 114
Query: 121 VTNFILPMLRAAQASRSLIQANDFSKGAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSF 180
+ + + P +R A L A F G K R + ++ G + G + R
Sbjct: 115 MKSHVEPTIRDVPAMLEL--APWF--GRKHRDNTLTLKRFSSGVGFWCLGGAAAKNYREK 170
Query: 181 PVPWLLRDEVSGWPQSVGKDGDPMKLTETRTNSFANSRKILDLSTPLLAGTDTITQRFEK 240
V + DE+S + V K+G P L + R + I STP + G+ I + +
Sbjct: 171 SVDVVCYDELSSFEPDVEKEGSPTLLGDKRIEGSVWPKSIRG-STPKIKGSCQIEKAANE 229
Query: 241 GDQ--RYYEVPCKHCGEYQRLYFRGNAKNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMI 298
R+Y VPC HCGE Q L F +A L WE P+SV Y+C H G +I
Sbjct: 230 SAHFMRFY-VPCPHCGEEQYLKFGDDA--SPFGLKWEKN----KPESVFYLCEH-HGCVI 281
Query: 299 NEDKVTIMGEGNWV--PTAKPTR---------------PDFRSYHLSAMYAPYYARSWQE 341
++ ++ G W+ T TR P ++H+ Y+P+ +W +
Sbjct: 282 HQSELD-QSNGRWICENTGMWTRDGLMFFSARGDEIPPPRSITFHIWTAYSPF--TTWVQ 338
Query: 342 IAQAWIECWDDERNQAKDLDELQVFYNNDLGEAYELKSSRVKPREVYAHRRDYLRGEVPN 401
I W++ KD + L+ F N LGE +E +V + VP
Sbjct: 339 IVYDWLDA-------LKDPNGLKTFVNTTLGETWEEAVGEKLDHQVLMDKVVRYTAAVP- 390
Query: 402 EHAMAHTGGPIEAITMSVDVQHTWLAVCTIGWAPSADRSGYAPYIIDYEHVEGDCKTIEG 461
+ +T +D Q + GWAP G +++D + G + E
Sbjct: 391 --------ARVVYLTAGIDSQRNRFEMYVWGWAP-----GEEAFLVDKIIIMG--RPDEE 435
Query: 462 EGWQKLTEIISTRQYTSGGRVYDIARVGIDASELT-DVVYEYCNDWGE-NVMPIRGRDLP 519
E ++ I+ + + G I+RV D + ++VY+ G V+P++G
Sbjct: 436 ETLLRVDAAINKKYRHADGTEMTISRVCWDIGGIDGEIVYQRSKKHGVFRVLPVKGAS-- 493
Query: 520 IKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRWSPTLRKEWSGQGEMPRGMLSAPVD---M 576
+ G + + N++GV V D K+ ++ + + E + P D
Sbjct: 494 VYGKPVITMPKTRNQRGVYLCEVGTDTAKEILYARMKADPTPVDEATSYAIRFPDDPEIF 553
Query: 577 PDKHLKELTVEYKREERDPDTNKIIRQVWHRPGGSRNELWDTLIY 621
++L E E+ + +R +W RNE D L+Y
Sbjct: 554 SQTEAQQLVAEELVEKWEKGK---MRLLWDNK-KRRNEALDCLVY 594
>ref|NP_310206.1| (NC_002695) putative terminase large subunit [Escherichia coli
O157:H7]
dbj|BAB35602.1| (AP002557) putative terminase large subunit [Escherichia coli
O157:H7]
Length = 641
Score = 97.1 bits (240), Expect = 6e-019
Identities = 143/647 (22%), Positives = 254/647 (39%), Gaps = 71/647 (10%)
Query: 30 TPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVS 89
T +WA+ +LP++ + G ++ P+ R +++ + V V K A+VG +
Sbjct: 29 TAVEWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKM 85
Query: 90 VLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGAK 149
+L + Y I++ + ++I+ D D T+ + P +R + SL G K
Sbjct: 86 LL-GVYAYFIEHKQRNTLIWLPTDGDAENFMKTH-VEPTIRDIPSLLSLAPWY----GKK 139
Query: 150 RRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTET 209
R + + G + G ++ R V DE++ + + + ++G P L +
Sbjct: 140 HRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDK 199
Query: 210 RTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQ-RYYEVPCKHCGEYQRLYFRGNAKNG 268
R + I STP + GT I + + + V C HCGE Q L F G+ +
Sbjct: 200 RIEGSVWPKSIRG-STPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF-GDKETP 257
Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINED------KVTIMGEGNWVPTA------- 315
G L W + P SV Y+C H + ++ + G W
Sbjct: 258 FG-LKWTPDD----PSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSS 312
Query: 316 --KPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGE 373
+ PD ++H+ Y+P+ +W +I + W++ D + + F N LGE
Sbjct: 313 GEEIEPPDSVTFHIWTAYSPF--TTWVQIVKDWMKTKGDTGKR-------KTFVNTTLGE 363
Query: 374 AYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGW 433
+E K EV A R+++ VP+ + +T +D Q + GW
Sbjct: 364 TWEAKIGERPDAEVMAERKEHYSASVPDR---------VAYLTAGIDSQLDRYEMRVWGW 414
Query: 434 APSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDAS 493
P G ++ID + + G + + + ++ E I+ G ++R+ D
Sbjct: 415 GP-----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTG 467
Query: 494 EL-TDVVYEYCNDWG-ENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRW 551
+ +VYE G V+PI+G + G + R+ N+ GV + D K++
Sbjct: 468 GIDPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQI 525
Query: 552 SPTLRKEWSGQGEMPRGMLSAPVDMPD----KHLKELTVEYKREERDPDTNKIIRQVWHR 607
G +P G + P + PD ++LT E + E+ KI+ R
Sbjct: 526 YNRFTLTPEGDEPLP-GAVHFP-NNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR 583
Query: 608 PGGSRNELWDTLIYNTAIFESMVLEACEDIVGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 ----RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_287368.1| (NC_002655) putative DNA packaging protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
ref|NP_309657.1| (NC_002695) terminase large subunit [Escherichia coli O157:H7]
gb|AAG55980.1|AE005330_12 (AE005330) putative DNA packaging protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
dbj|BAB35053.1| (AP002555) terminase large subunit [Escherichia coli O157:H7]
Length = 641
Score = 95.9 bits (237), Expect = 1e-018
Identities = 142/647 (21%), Positives = 254/647 (38%), Gaps = 71/647 (10%)
Query: 30 TPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVS 89
T +WA+ +LP++ + G ++ P+ R +++ + V V K A+VG +
Sbjct: 29 TAVEWADASYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKM 85
Query: 90 VLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGAK 149
+L + Y I++ + ++I+ D D T+ + P +R + +L G K
Sbjct: 86 LL-GVYAYFIEHKQRNTLIWLPTDGDAENFMKTH-VEPTIRDIPSLLALAPWY----GKK 139
Query: 150 RRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTET 209
R + + G + G ++ R V DE++ + + + ++G P L +
Sbjct: 140 HRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDEDIEQEGSPTFLGDK 199
Query: 210 RTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQ-RYYEVPCKHCGEYQRLYFRGNAKNG 268
R + I STP + GT I + + + V C HCGE Q L F G+ +
Sbjct: 200 RIEGSVWPKSIRG-STPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF-GDKETP 257
Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINED------KVTIMGEGNWVPTA------- 315
G L W + P SV Y+C H + ++ + G W
Sbjct: 258 FG-LKWTPDD----PSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSS 312
Query: 316 --KPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGE 373
+ PD ++H+ Y+P+ +W +I + W++ D + + F N LGE
Sbjct: 313 GEEIEPPDSVTFHIWTAYSPF--TTWVQIVKDWMKTKGDTGKR-------KTFVNTTLGE 363
Query: 374 AYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGW 433
+E K EV A R+++ VP+ + +T +D Q + GW
Sbjct: 364 TWEAKIGERPDAEVMAERKEHYSASVPDR---------VAYLTAGIDSQLDRYEMRVWGW 414
Query: 434 APSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDAS 493
P G ++ID + + G + + + ++ E I+ G ++R+ D
Sbjct: 415 GP-----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSVSRICWDTG 467
Query: 494 EL-TDVVYEYCNDWG-ENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRW 551
+ +VYE G V+PI+G + G + R+ N+ GV + D K++
Sbjct: 468 GIDPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQI 525
Query: 552 SPTLRKEWSGQGEMPRGMLSAPVDMPD----KHLKELTVEYKREERDPDTNKIIRQVWHR 607
G +P G + P + PD ++LT E + E+ KI+ R
Sbjct: 526 YNRFTLTPEGDEPLP-GAVHFP-NNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR 583
Query: 608 PGGSRNELWDTLIYNTAIFESMVLEACEDIVGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 ----RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_040581.1| (NC_001416) DNA packaging protein [Bacteriophage lambda]
sp|P03708|TERL_LAMBD TERMINASE LARGE SUBUNIT (DNA PACKAGING PROTEIN A) (GPA)
pir||JVBPAL DNA-packaging protein A - phage lambda
gb|AAA96534.1| (J02459) A (DNA packaging;641) [bacteriophage lambda]
Length = 641
Score = 95.1 bits (235), Expect = 2e-018
Identities = 143/647 (22%), Positives = 253/647 (39%), Gaps = 71/647 (10%)
Query: 30 TPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGATVS 89
T +WA+ +LP++ + G ++ P+ R +++ + V V K A+VG +
Sbjct: 29 TAVEWADANYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIREVNVVKSARVGYSKM 85
Query: 90 VLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSKGAK 149
+L + Y I++ + ++I+ D D T+ + P +R + +L G K
Sbjct: 86 LL-GVYAYFIEHKQRNTLIWLPTDGDAENFMKTH-VEPTIRDIPSLLALAPWY----GKK 139
Query: 150 RRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKLTET 209
R + + G + G ++ R V DE++ + + ++G P L +
Sbjct: 140 HRDNTLTMKRFTNGRGFWCLGGKAAKNYREKSVDVAGYDELAAFDDDIEQEGSPTFLGDK 199
Query: 210 RTNSFANSRKILDLSTPLLAGTDTITQRFEKGDQ-RYYEVPCKHCGEYQRLYFRGNAKNG 268
R + I STP + GT I + + + V C HCGE Q L F G+ +
Sbjct: 200 RIEGSVWPKSIRG-STPKVRGTCQIERAASESPHFMRFHVACPHCGEEQYLKF-GDKETP 257
Query: 269 QGRLIWETEGGVLVPDSVRYVCPHCGGEMINED------KVTIMGEGNWVPTA------- 315
G L W + P SV Y+C H + ++ + G W
Sbjct: 258 FG-LKWTPDD----PSSVFYLCEHNACVIRQQELDFTDARYICEKTGIWTRDGILWFSSS 312
Query: 316 --KPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNNDLGE 373
+ PD ++H+ Y+P+ +W +I + W++ D + + F N LGE
Sbjct: 313 GEEIEPPDSVTFHIWTAYSPF--TTWVQIVKDWMKTKGDTGKR-------KTFVNTTLGE 363
Query: 374 AYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVCTIGW 433
+E K EV A R+++ VP+ + +T +D Q + GW
Sbjct: 364 TWEAKIGERPDAEVMAERKEHYSAPVPDR---------VAYLTAGIDSQLDRYEMRVWGW 414
Query: 434 APSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVGIDAS 493
P G ++ID + + G + + + ++ E I+ G I+R+ D
Sbjct: 415 GP-----GEESWLIDRQIIMG--RHDDEQTLLRVDEAINKTYTRRNGAEMSISRICWDTG 467
Query: 494 EL-TDVVYEYCNDWG-ENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLYKDRW 551
+ +VYE G V+PI+G + G + R+ N+ GV + D K++
Sbjct: 468 GIDPTIVYERSKKHGLFRVIPIKGAS--VYGKPVASMPRKRNKNGVYLTEIGTDTAKEQI 525
Query: 552 SPTLRKEWSGQGEMPRGMLSAPVDMPD----KHLKELTVEYKREERDPDTNKIIRQVWHR 607
G +P G + P + PD ++LT E + E+ KI+ R
Sbjct: 526 YNRFTLTPEGDEPLP-GAVHFP-NNPDIFDLTEAQQLTAEEQVEKWVDGRKKILWDSKKR 583
Query: 608 PGGSRNELWDTLIYNTAIFESMVLEACEDIVGLEALVWPEFWAIAGK 654
RNE D +Y A + D+ L A + E A K
Sbjct: 584 ----RNEALDCFVYALAALRISISRWQLDLSALLASLQEEDGAATNK 626
>ref|NP_046897.1| (NC_001901) gp2 [Bacteriophage N15]
pir||T13088 DNA-packaging protein - phage N15
gb|AAC19038.1| (AF064539) gp2 [Bacteriophage N15]
Length = 640
Score = 93.2 bits (230), Expect = 9e-018
Identities = 142/624 (22%), Positives = 245/624 (38%), Gaps = 79/624 (12%)
Query: 27 EFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYWREVVDCFDPYSPVHFVAVKKGAQVGA 86
E +T +WA+ +LP++ + G ++ P+ R +++ + V V K A+VG
Sbjct: 26 EPMTAVEWADAHYYLPKESAYQEG--RWETLPFQRAIMNAMGS-DYIRIVNVIKSARVGY 82
Query: 87 TVSVLENLIGYGIDYVKTASMIFATVDDDVTKRRVTNFILPMLRAAQASRSLIQANDFSK 146
+ +L +I Y I++ + +++ D D + + + P +R + SL
Sbjct: 83 SKMLL-GVIAYFIEHKQRNELLWLPTDGDADNF-MKSHVEPTIRDVPSLLSLAPWY---- 136
Query: 147 GAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSFPVPWLLRDEVSGWPQSVGKDGDPMKL 206
G K R + + G + G ++ R V + DE++ + + K+G P L
Sbjct: 137 GKKHRDNTLSMKRFTNGRGFWCLGGKAAKNYREKSVDVVGYDELAAFDADIEKEGSPTFL 196
Query: 207 TETRTNSFANSRKILDLSTPLLAGTDTITQRF-EKGDQRYYEVPCKHCGEYQRLYFRGNA 265
+ R + I STP L GT I + E G + V C HCGE Q L F G+
Sbjct: 197 GDKRIEGSVWPKSIRG-STPKLRGTCQIERAAKESGHFMRFHVACPHCGEEQYLKF-GDR 254
Query: 266 KNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMINEDKVTIMGE-------GNWV------ 312
G WE E ++V Y+C H +I + ++ G W
Sbjct: 255 DTPFG-FKWEPEQA----ETVYYLCEH-NACVIKQHELDFSNARYICELTGIWTRDGLRW 308
Query: 313 ---PTAKPTRPDFRSYHLSAMYAPYYARSWQEIAQAWIECWDDERNQAKDLDELQVFYNN 369
A+ P+ ++H+ Y+P+ +W +I + W + D + + F N
Sbjct: 309 FSSSNAEIDPPESVTFHIWTAYSPF--TTWVQIVKDWFKTKGDTGKR-------KTFVNT 359
Query: 370 DLGEAYELKSSRVKPREVYAHRRDYLRGEVPNEHAMAHTGGPIEAITMSVDVQHTWLAVC 429
LGE +E K +V A R+++ VP + +T +D Q +
Sbjct: 360 TLGETWEAKIGDRPDADVLAERKEHFDAAVPER---------VAYLTAGIDSQLDRYEMR 410
Query: 430 TIGWAPSADRSGYAPYIIDYEHVEGDCKTIEGEGWQKLTEIISTRQYTSGGRVYDIARVG 489
GW P G ++ID + + G + + ++ E I+ G I+R+
Sbjct: 411 VWGWGP-----GEESWLIDRQIIMG--RHDDESTLARVDEAINKTYTRRNGVEMSISRIC 463
Query: 490 IDASEL-TDVVYEYCNDWG-ENVMPIRGRDLPIKGAQIKYFNRQVNEKGVEYLSVTVDLY 547
D + +VY G V+PI+G + G + R+ N+ GV V D
Sbjct: 464 WDIGGIDPTIVYNRSKKHGLFRVIPIKGAS--VYGKPVANMPRKRNKNGVYLTEVGTDTA 521
Query: 548 KDRWSPTLRKEWSGQGEMPRGMLSAPVDMPD-------KHLKELTVEYKREERDPDTNKI 600
K++ R +G+ P L+ V P+ ++LT E E+ KI
Sbjct: 522 KEQIYN--RFTLIVEGDEP---LAGAVHFPNNPDIYDLSEAQQLTAEELVEKWVDGKRKI 576
Query: 601 IRQVWHRPGGSRNELWDTLIYNTA 624
I R RNE D +Y A
Sbjct: 577 IWDSKKR----RNEALDCFVYALA 596
>emb|CAA26343.1| (X02501) gp 2 (aa 1-309) [Bacteriophage 21]
gb|AAA32338.1| (M23775) glycoprotein 2 [Bacteriophage 21]
Length = 309
Score = 73.2 bits (178), Expect = 1e-011
Identities = 86/314 (27%), Positives = 138/314 (43%), Gaps = 22/314 (7%)
Query: 1 MYSASDYYAESAERVIEGLLATTDEREFLTPSQWAEKKRHLPQQHSPMPGPFSFDDAPYW 60
M S + A +A + GLL+ LT QWA K +LP++ S PG ++ P+
Sbjct: 1 MISDAQKAANAAGAIATGLLSLIIPVP-LTTVQWANKHYYLPKESSYTPG--RWETLPFQ 57
Query: 61 REVVDCFDPYSPVHFVAVKKGAQVGATVSVLENLIGYGIDYVKTASMIFATVDDDVTKRR 120
+++C + V + K A+VG T +L + Y I++ S++F D +
Sbjct: 58 VGIMNCMGN-DLIRTVNLIKSARVGYTKMLL-GVEAYFIEHKSRNSLLFQPTDS-AAEDF 114
Query: 121 VTNFILPMLRAAQASRSLIQANDFSKGAKRRGASAKGIEWAGGGVLYPFGARSPGKMRSF 180
+ + + P +R A L A F G K R + ++ G + G + R
Sbjct: 115 MKSHVEPTIRDVPALLEL--APWF--GRKHRDNTLTLKRFSSGVGFWCLGGAAAKNYREK 170
Query: 181 PVPWLLRDEVSGWPQSVGKDGDPMKLTETRTNSFANSRKILDLSTPLLAGTDTITQRFEK 240
V + DE+S + V K+G P L + R + I STP + G+ I + +
Sbjct: 171 SVDVVCYDELSSFEPDVEKEGSPTLLGDKRIEGSVWPKSIRG-STPKIKGSCQIEKAANE 229
Query: 241 GDQ--RYYEVPCKHCGEYQRLYFRGNAKNGQGRLIWETEGGVLVPDSVRYVCPHCGGEMI 298
R+Y VPC HCGE Q L F +A L WE P+SV Y+C H G +I
Sbjct: 230 SAHFMRFY-VPCPHCGEEQYLKFGDDA--SPFGLKWEKN----KPESVFYLCEH-HGCVI 281
Query: 299 NEDKVTIMGEGNWV 312
++ ++ G W+
Sbjct: 282 HQSELD-QSNGRWI 294
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 482,511,288
Number of Sequences: 1028106
Number of extensions: 22114366
Number of successful extensions: 47607
Number of sequences better than 5.0e-02: 21
Number of HSP's better than 0.1 without gapping: 5
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 47521
Number of HSP's gapped (non-prelim): 34
length of query: 673
length of database: 324,535,921
effective HSP length: 128
effective length of query: 545
effective length of database: 192,938,353
effective search space: 105151402385
effective search space used: 105151402385
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 95 (41.2 bits)
Query= orf4 [961-1054]
(92 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.320 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 52,111,724
Number of Sequences: 1028106
Number of extensions: 1659105
Number of successful extensions: 3987
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3987
Number of HSP's gapped (non-prelim): 0
length of query: 92
length of database: 324,535,921
effective HSP length: 68
effective length of query: 24
effective length of database: 254,624,713
effective search space: 6110993112
effective search space used: 6110993112
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf5 [1075-1621]
(545 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_040583.1| (NC_001416) capsid component [Bacteriophag... 82 2e-014
ref|NP_287370.1| (NC_002655) putative capsid structural pro... 81 4e-014
ref|NP_046899.1| (NC_001901) gp4 [Bacteriophage N15] >gi|74... 80 5e-014
ref|NP_518978.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 79 1e-013
sp|P36272|VG04_BPP21 PORTAL PROTEIN (HEAD PROTEIN GP4) >gi|... 77 4e-013
ref|NP_287794.1| (NC_002655) putative capsid protein of pro... 75 2e-012
ref|NP_461542.1| (NC_003197) Gifsy-1 prophage: similar to h... 74 6e-012
ref|NP_459888.1| (NC_003197) Fels-1 prophage [Salmonella ty... 72 2e-011
ref|NP_460007.1| (NC_003197) Gifsy-2 prophage [Salmonella t... 69 1e-010
emb|CAC83156.1| (AJ304858) putative portal protein [Escheri... 65 2e-009
ref|NP_288458.1| (NC_002655) putative head-tail preconnecto... 65 2e-009
>ref|NP_040583.1| (NC_001416) capsid component [Bacteriophage lambda]
sp|P03710|VMCB_LAMBD PORTAL PROTEIN (GPB) (MINOR CAPSID PROTEIN B)
pir||VHBPBL minor capsid protein B - phage lambda
gb|AAA96536.1| (J02459) B (capsid component;533) [bacteriophage lambda]
Length = 533
Score = 81.6 bits (200), Expect = 2e-014
Identities = 103/489 (21%), Positives = 192/489 (39%), Gaps = 30/489 (6%)
Query: 1 MPVVADPSGPVMSVENTSRFNGRGDLAHAGYRYTRHDGEKYEGGLGPIEILYTDYWSQRR 60
+P + P G + S+ + ++G G R E + L P ++
Sbjct: 6 IPTLLGPDG-MTSLREYAGYHGGGSGFGGQLRSWNPPSESVDAALLP------NFTRGNA 58
Query: 61 RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
R++ + N YA I+ ++V + L+ P+ L I E+ A ++ V ++ +
Sbjct: 59 RADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEF 118
Query: 121 GELPEVC-DFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGL---PTYELIDGDMVQ 176
E C D ++ TF R A G++ V ++++ + ++ +
Sbjct: 119 AEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTSSSRLFRTQFRMVSPKRIS 178
Query: 177 TPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKA-----TGANGRRIAWL 231
P N G S GV++ G + ++V ++G+ + G R +++
Sbjct: 179 NP----NNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFI 234
Query: 232 YYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTK 291
+ + RG + VM+ + +D +++ + A + A I+ ++ ++A
Sbjct: 235 HVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSA--M 292
Query: 292 PFSSGAQRRVQDTDFVGDGTERRTNIAETPF---GMMVEGLAEGEEIKAFTNDTTDEKFG 348
F GA + Q G E A P G V L G+ + T TD +
Sbjct: 293 DFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYS 352
Query: 349 EFEEAILRAVAWALGMPYSVLAMQF-DSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVY 407
FE+++LR +A LG+ Y L+ + SYS +R E A R A ++
Sbjct: 353 VFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMF 412
Query: 408 RSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQI 467
WL ++ R + P + AR + A +AW W G + A+D KE + I
Sbjct: 413 LCWLEEAIVRRVVTLP---SKARFSFQEAR-SAWGNCDWIGSGRMAIDGLKEVQEAVMLI 468
Query: 468 KLGALTRTK 476
+ G T K
Sbjct: 469 EAGLSTYEK 477
>ref|NP_287370.1| (NC_002655) putative capsid structural protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
ref|NP_309659.1| (NC_002695) portal protein [Escherichia coli O157:H7]
ref|NP_310204.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
gb|AAG55982.1|AE005330_14 (AE005330) putative capsid structural protein of prophage CP-933X
[Escherichia coli O157:H7 EDL933]
dbj|BAB35055.1| (AP002555) portal protein [Escherichia coli O157:H7]
dbj|BAB35600.1| (AP002557) putative portal protein [Escherichia coli O157:H7]
Length = 533
Score = 80.9 bits (198), Expect = 4e-014
Identities = 104/489 (21%), Positives = 190/489 (38%), Gaps = 30/489 (6%)
Query: 1 MPVVADPSGPVMSVENTSRFNGRGDLAHAGYRYTRHDGEKYEGGLGPIEILYTDYWSQRR 60
+P + P G + S+ + ++G G R GE + L P ++
Sbjct: 6 IPTLLGPDG-MTSLREYAGYHGGGSGFGGQLRAWNPPGESVDAALLP------NFTRGNA 58
Query: 61 RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
R++ + N YA I+ ++V + L+ P+ L I E+ A ++ V ++ +
Sbjct: 59 RADDLVRNNGYAANAIQLHQDHIVGSFFRLSHRPSWRYLGIGEEEARAFSREVEAAWKEF 118
Query: 121 GELPEVC-DFKKLYTFGQWQRVAKAEALIAGDVLVV---EHHNSATGLPTYELIDGDMVQ 176
E C D ++ TF R A G++ V + S + ++ +
Sbjct: 119 AEDDCCCIDVERKRTFTMMIREGVAMHAFNGELFVQATWDTRPSRLFRTQFRMVSPKRIS 178
Query: 177 TPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKA-----TGANGRRIAWL 231
P N S GV++ G + ++V ++G+ + G R +++
Sbjct: 179 NP----NNTSDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFI 234
Query: 232 YYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTK 291
+ + RG + VM+ + +D +++ + A + A I+ ++ ++A
Sbjct: 235 HVFEPVEDGQTRGANVFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSA--M 292
Query: 292 PFSSGAQRRVQDTDFVGDGTERRTNIAETPF---GMMVEGLAEGEEIKAFTNDTTDEKFG 348
F GA + Q G E A P G V L G+ + T TD +
Sbjct: 293 DFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGAKVPHLMPGDSLNLQTAQDTDNGYS 352
Query: 349 EFEEAILRAVAWALGMPYSVLAMQF-DSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVY 407
FE+++LR +A LG+ Y L+ + SYS +R E A R A ++
Sbjct: 353 VFEQSLLRYIAAGLGVSYEQLSRNYAQMSYSTARASANESWAYFMGRRKFVASRQASQMF 412
Query: 408 RSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQI 467
WL ++ R + P + AR + A +AW W G + A+D KE + I
Sbjct: 413 LCWLEEAIVRRVVTLP---SKARFSFQEAR-SAWGNCDWIGSGRMAIDGLKEVQEAVMLI 468
Query: 468 KLGALTRTK 476
+ G T K
Sbjct: 469 EAGLSTYEK 477
>ref|NP_046899.1| (NC_001901) gp4 [Bacteriophage N15]
pir||T13090 probable minor capsid protein gp4 - phage N15
gb|AAC19040.1| (AF064539) gp4 [Bacteriophage N15]
Length = 530
Score = 80.5 bits (197), Expect = 5e-014
Identities = 98/442 (22%), Positives = 180/442 (40%), Gaps = 29/442 (6%)
Query: 51 LYTDYWSQRRRSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWA 110
L +Y R++ + N YA ++ ++V + L+ P+ L I+E+ + ++
Sbjct: 46 LLPNYSRGNARADDLVRNNGYAANAVQLHQDHIVGSFFRLSYRPSWRYLGINEEDSRAFS 105
Query: 111 ENVNDLFEAWGELPE--VC--DFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNS-ATGL- 164
+V AW E E C D ++ TF R A G++ V +S +T L
Sbjct: 106 RDVE---AAWNEYAEDDFCGIDAERKRTFTMMIREGVAMHAFNGELCVQATWDSDSTRLF 162
Query: 165 -PTYELIDGDMVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGF-----KHKRI 218
++++ V P N IG + GV++ G + ++V +G+ ++
Sbjct: 163 RTQFKMVSPKRVSNP----NNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTY 218
Query: 219 KATGANGRRIAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGF 278
G R ++++ + RG VM+ + +D +++ + A + A
Sbjct: 219 IPRELPGGRPSFIHVFEPMEDGQTRGANAFYSVMEQMKMLDTLQNTQLQSAIVKAMYAAT 278
Query: 279 IQRQIGGKTAGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPF---GMMVEGLAEGEEI 335
I+ ++ ++A F GA + Q + G E + P G V L G+ +
Sbjct: 279 IESELDTQSA--MDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGARVPHLLPGDSL 336
Query: 336 KAFTNDTTDEKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEFEAVIHEM 394
+ TD + FE+++LR +A LG+ Y L+ + SYS +R E A
Sbjct: 337 NLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYSTARASANESWAYFMGR 396
Query: 395 RDRDADTLLRPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAV 454
R A ++ WL ++ R + P + AR + A AW + W G + A+
Sbjct: 397 RKFVASRQACQMFLCWLEEAIVRRVVTLP---SKARFSFQEAR-TAWGNANWIGSGRMAI 452
Query: 455 DLEKETRGRGEQIKLGALTRTK 476
D KE + I+ G T K
Sbjct: 453 DGLKEVQEAVMLIEAGLSTYEK 474
>ref|NP_518978.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14559.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 508
Score = 79.3 bits (194), Expect = 1e-013
Identities = 104/454 (22%), Positives = 168/454 (36%), Gaps = 62/454 (13%)
Query: 43 GGLGPIEILYTDYWSQRRRSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPID 102
G G + L R +S +RN +A+ I V N V TG + P
Sbjct: 41 GNPGAVAALLASGEDLRIKSRDLVRRNAWAQAGIEAFVANAVGTG----------IKPQS 90
Query: 103 EDAADTWAENVNDLFEAWGELPEVCDFKKLYTFGQWQRVAKAEALIAGDVLV-VEHHNSA 161
D + +V L+ W + Y Q +A L G+ L+ +
Sbjct: 91 LSTDDAFKADVQALWRDWTAEADAAGQTDFYGL---QALACRAMLEGGECLIRLRPRRPE 147
Query: 162 TGLPT---YELIDGDMVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRI 218
GL +L++ + + P + + + +G+E G VA+ + + R+
Sbjct: 148 DGLTVPLQLQLLEAEHL--PMTLNVDLPSGNAVRSGIEFDGLGRRVAYHLYRSHPDDGRL 205
Query: 219 KATGANG---------RRIAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKA 269
G I LY R + RGEP L+ + + E+D+ DA +
Sbjct: 206 APMSGQGGLDTVRVDASEIIHLYRVL--RPGQIRGEPWLSRALVKLNELDQYDDAELVRK 263
Query: 270 SLGAQIVGFIQRQIGGKTAGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPFGM-MVEG 328
A GF+ RQ + + +G+G I+ ++
Sbjct: 264 KTAAMFAGFVTRQ------------------SPEDNLMGEGLPDEAGISLVGLEPGTLQI 305
Query: 329 LAEGEEIKAFTNDTTDEKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEF 387
L GE+IK +GEF RAVA ALG+ Y L +YS+ R + EF
Sbjct: 306 LEPGEDIKFSDPADVGGSYGEFLRTQFRAVAAALGITYEQLTGDLTGVNYSSIRAGLLEF 365
Query: 388 ----EAVIHEMRDRDADTLLRPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTR 443
E V H + + RPV+ +W++ VL+ + APGF R W
Sbjct: 366 RRRCEMVQHSVL---VHQMCRPVWAAWMKQAVLSGALVAPGFARGGAARRRQYLQVKWIP 422
Query: 444 SQWYGMVKEAVDLEKETRGRGEQIKLGALTRTKA 477
W + VD EKE + I+ G ++R++A
Sbjct: 423 QGW-----QWVDPEKEFKAMLLAIRAGLMSRSEA 451
>sp|P36272|VG04_BPP21 PORTAL PROTEIN (HEAD PROTEIN GP4)
pir||JN0538 head protein gp4 - phage 21
gb|AAA32342.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
Length = 530
Score = 77.4 bits (189), Expect = 4e-013
Identities = 101/468 (21%), Positives = 193/468 (40%), Gaps = 39/468 (8%)
Query: 61 RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
R++ + N A + ++V +++ PN L + E AA ++ + V AW
Sbjct: 56 RADDLVRNNGIAANAVALHKDHIVGHMFLISYRPNWRWLGMRETAAKSFVDEVE---AAW 112
Query: 121 GELPE----VCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGL---PTYELIDGD 173
E E D + TF ++ R G++ V ++ T ++ +
Sbjct: 113 SEYAEGMFGEIDVEGKRTFTEFIREGVGVHAFNGEIFVQPVWDTETTQLFRTRFKAVSPK 172
Query: 174 MVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGF------KHKRIKATGANGRR 227
V TP + + GVE+ + G VA+ + ++ F + +RI GR
Sbjct: 173 RVDTPGHGMG----NRFLRAGVEVDRYGRAVAYHICEDDFPFSGSGRWERIPRELPTGRP 228
Query: 228 IAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKT 287
A L+ + RG VM+ + +D ++ + A + A I+ ++ +
Sbjct: 229 -AMLHIFEPVEDGQTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESELDTEK 287
Query: 288 AGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPF----GMMVEGLAEGEEIKAFTNDTT 343
A + +GA + +D + E+ ++ +T G+ + L G+++K T +
Sbjct: 288 AFE--YIAGAPQEQKDNPLINI-LEKFSSWYDTNNVTLGGVKIPHLFPGDDLKLQTAQDS 344
Query: 344 DEKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEFEAVIHEMRDRDADTL 402
D F E+A+LR +A LG+ Y L+ + SYS++R E R A L
Sbjct: 345 DNGFSALEQALLRYIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARL 404
Query: 403 LRPVYRSWLRAMVLNRRIDAPGFLAAARNPREF-ATYAAWTRSQWYGMVKEAVDLEKETR 461
++ WL +L G + R +F +AW+R++W G + A+D KE +
Sbjct: 405 ATQMFSCWLEEALLR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQ 458
Query: 462 GRGEQIKLGALTRTKAARMLTGTSFKTNMRKLAKEN--QMLADAMRPI 507
+I+ G T K ++ G ++ R+ +E+ + A RP+
Sbjct: 459 ESVMRIEAGLSTYEKGLALM-GEDYQDIFRQQVRESAERQKAGLSRPV 505
>ref|NP_287794.1| (NC_002655) putative capsid protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
ref|NP_309134.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
ref|NP_310760.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
gb|AAG56408.1|AE005368_14 (AE005368) putative capsid protein of prophage CP-933R [Escherichia
coli O157:H7 EDL933]
dbj|BAB34530.1| (AP002554) putative portal protein [Escherichia coli O157:H7]
dbj|BAB36156.1| (AP002559) putative portal protein [Escherichia coli O157:H7]
Length = 530
Score = 75.1 bits (183), Expect = 2e-012
Identities = 101/468 (21%), Positives = 192/468 (40%), Gaps = 39/468 (8%)
Query: 61 RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
R++ + N A + ++V +++ PN L + E AA ++ + V AW
Sbjct: 56 RADDLVRNNGIAANAVALHKDHIVGHMFLISYRPNWRWLGMRETAAKSFVDEVE---AAW 112
Query: 121 GELPE----VCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGL---PTYELIDGD 173
E E D + TF ++ R G++ V ++ T ++ +
Sbjct: 113 SEYAEGMSGEIDVEGKRTFTEFIREGVGVHAFNGEIFVQPVWDTETTQLFRTRFKAVSPK 172
Query: 174 MVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGF------KHKRIKATGANGRR 227
V TP + + GVE+ + G VA+ + ++ F + +RI GR
Sbjct: 173 RVDTPGHGMG----NRFLRAGVEVDRYGRAVAYHICEDDFPRSGSGRWERIPRELPTGRP 228
Query: 228 IAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKT 287
A L+ + RG VM+ + +D ++ + A + A I+ + +
Sbjct: 229 -AMLHIFEPVEDGQTRGANQFYSVMERLKMLDSLQATQLQSAIVKAMYAATIESDLDTEK 287
Query: 288 AGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPF----GMMVEGLAEGEEIKAFTNDTT 343
A + +GA + +D + E+ ++ +T G+ + L G+++K T +
Sbjct: 288 AFE--YIAGAPQGQKDNPLINI-LEKFSSWYDTNNVTLGGVKIPHLFPGDDLKLQTAQDS 344
Query: 344 DEKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEFEAVIHEMRDRDADTL 402
D F E+A+LR +A LG+ Y L+ + SYS++R E R A L
Sbjct: 345 DNGFSALEQALLRYIAAGLGVSYEQLSRDYSKVSYSSARASANESWRYFMGRRKFIAARL 404
Query: 403 LRPVYRSWLRAMVLNRRIDAPGFLAAARNPREF-ATYAAWTRSQWYGMVKEAVDLEKETR 461
++ WL +L G + R +F +AW+R++W G + A+D KE +
Sbjct: 405 ATQMFSYWLEEALLR------GIIRPPRARFDFYQARSAWSRAEWIGAGRMAIDGLKEVQ 458
Query: 462 GRGEQIKLGALTRTKAARMLTGTSFKTNMRKLAKEN--QMLADAMRPI 507
+I+ G T K ++ G ++ R+ +E+ + A RP+
Sbjct: 459 ESVMRIEAGLSTYEKEPALM-GEDYQDIFRQQVRESAERQKAGLSRPV 505
>ref|NP_461542.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp4
of phage 21 [Salmonella typhimurium LT2]
gb|AAL21501.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp4
of phage 21 [Salmonella typhimurium LT2]
Length = 526
Score = 73.6 bits (179), Expect = 6e-012
Identities = 100/466 (21%), Positives = 186/466 (39%), Gaps = 37/466 (7%)
Query: 61 RSNTFFKRNLYARGIIRRIVGNVVNTGQVLAANPNAAMLPIDEDAADTWAENVNDLFEAW 120
R++ + N A + ++V +++ PN L + E AA ++ + V AW
Sbjct: 56 RADDLVRNNGIAANAVALHKDHIVGHLFLISYRPNWRYLGMRESAAKSFVDEVE---AAW 112
Query: 121 GELPE----VCDFKKLYTFGQWQRVAKAEALIAGDVL---VVEHHNSATGLPTYELIDGD 173
E + D + TF ++ R G++ V + + ++ +
Sbjct: 113 TEYCDGIFGEMDAEGKRTFTEFIREGVGVHAFNGEIFLQPVWDAETTQVFRTRFKAVSPK 172
Query: 174 MVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAF------WVVQEGFKHKRIKATGANGRR 227
V TP A+ + ++ GVE ++G+ +A+ W V G + RI +GR
Sbjct: 173 RVDTPGYARG----NRQLRAGVETDRNGKALAYHVCDDDWPVAGGERWTRIPRFLPSGRP 228
Query: 228 IAWLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKT 287
A L+ + RG VM+ + +D ++ + A + A I+ ++ +
Sbjct: 229 -AMLHIFEPVEDGQTRGANQFYSVMERLKMLDTLQATQLQSAIVKAMYAATIESELDSE- 286
Query: 288 AGTKPFSSGAQRRVQDTDFVGDGTERRTNIAETPF---GMMVEGLAEGEEIKAFTNDTTD 344
K F +DT V + G+ + L G+E+ T +D
Sbjct: 287 ---KAFEYITAADNKDTPLVNMLANYARYYSTNSIKLGGVKIPHLYPGDELNLQTAQDSD 343
Query: 345 EKFGEFEEAILRAVAWALGMPYSVLAMQFDS-SYSASRGEIKEFEAVIHEMRDRDADTLL 403
F E+A+LR +A LG+ Y L+ + SYS++R E R A L
Sbjct: 344 NGFSALEQALLRYIAAGLGVSYEQLSRDYSQVSYSSARASANESWRYFLGRRRFIAGRLA 403
Query: 404 RPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGR 463
++ WL ++ I AP + R ++W+RS+W G + A+D KE +
Sbjct: 404 TQMFSCWLEEALIRGVIRAPRARFSFWEAR-----SSWSRSEWIGAGRMAIDGLKEVQES 458
Query: 464 GEQIKLGALTRTKAARMLTGTSFKTNMRKLAKENQ--MLADAMRPI 507
+I+ G T K ++ G ++ R+ +E++ A RP+
Sbjct: 459 VMRIEAGLSTYEKELAIM-GEDYQEIFRQQVRESEERRAAGLSRPV 503
>ref|NP_459888.1| (NC_003197) Fels-1 prophage [Salmonella typhimurium LT2]
gb|AAL19847.1| (AE008738) Fels-1 prophage [Salmonella typhimurium LT2]
Length = 511
Score = 72.0 bits (175), Expect = 2e-011
Identities = 66/262 (25%), Positives = 106/262 (40%), Gaps = 30/262 (11%)
Query: 237 KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTKPFSSG 296
+R +ARG LL V+ + ++ + D+ + A + A + FI++Q G P
Sbjct: 233 RRLNQARGASLLAPVIIRLMDLKEYEDSERIAARIAASLGMFIKKQDVGTDGYVAP---- 288
Query: 297 AQRRVQDTDFVGDGTERRTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 356
E+R P GM+ +GL GE+I +D + F LR
Sbjct: 289 ---------------EKRKETQIQP-GMLFDGLNPGEDIGMIKSDRPNAGLESFRMGQLR 332
Query: 357 AVAWALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVYRSWLRAMVL 416
AVA L +S +A +D +YSA R E+ E + ++D RP+YR WL A V
Sbjct: 333 AVAAGLRGSFSSIARNYDGTYSAQRQELVEAQEGYSILQDSFIAAFTRPLYRRWLAAAVA 392
Query: 417 NRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQIKLGALTRTK 476
+ I+ P ++ + + G V +D KE I+ GA T
Sbjct: 393 SGAIEVPA----------GTDMSSLFNAVYSGPVMPWIDPLKEANAWRVLIRGGAATEGD 442
Query: 477 AARMLTGTSFKTNMRKLAKENQ 498
R G R+ A+ ++
Sbjct: 443 WVRARGGAPADVKRRRKAETDE 464
>ref|NP_460007.1| (NC_003197) Gifsy-2 prophage [Salmonella typhimurium LT2]
gb|AAL19966.1| (AE008744) Gifsy-2 prophage [Salmonella typhimurium LT2]
Length = 437
Score = 68.9 bits (167), Expect = 1e-010
Identities = 69/266 (25%), Positives = 112/266 (41%), Gaps = 29/266 (10%)
Query: 237 KRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFIQRQIGGKTAGTKPFSSG 296
+R +ARG LL V+ + ++ + D+ + A + A FI+R
Sbjct: 155 RRLNQARGVTLLAPVIIRLLDLKEYEDSERLAARISAAFAMFIRR--------------- 199
Query: 297 AQRRVQDTDFVGDGTERRTNIAETPFGMMVEGLAEGEEIKAFTNDTTDEKFGEFEEAILR 356
+ VQD D D ++ ++ P G +++ L GE+I +D + F LR
Sbjct: 200 SDAMVQDGD-APDYADKDRDLDIEP-GTILKDLLPGEDIGTIKSDRPNANLESFRMGQLR 257
Query: 357 AVAWALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRDADTLLRPVYRSWLRAMVL 416
AVA + +S +A +D +YSA R E+ E + ++D + RPVYR WL +
Sbjct: 258 AVAAGVRGSFSSIARNYDGTYSAQRQELVEAQEGYAILQDNFIAAVSRPVYRRWLATAIT 317
Query: 417 NRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEKETRGRGEQIKLGALTRTK 476
ID P + AT + + G V +D KE I+ GA T +
Sbjct: 318 AGVIDVP-------TDTDMATL---FNAVYSGPVMPWIDPLKEANAWRILIRGGAATESD 367
Query: 477 AARMLTGTSFKTNMRKLAK--ENQML 500
R G + R+ A+ EN+ L
Sbjct: 368 WVRARGGAPAEVKRRRKAEIDENRKL 393
>emb|CAC83156.1| (AJ304858) putative portal protein [Escherichia coli]
Length = 500
Score = 65.1 bits (157), Expect = 2e-009
Identities = 92/381 (24%), Positives = 151/381 (39%), Gaps = 51/381 (13%)
Query: 110 AENVNDLFEAWGELPEVCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGLPT--- 166
AE D+ W E D YT +R+ L G+V + GL
Sbjct: 104 AELAADIRRLWAEWSVSPDVTGQYTRPVLERLLLRTWLRDGEVFAQMVSGAGNGLERTAG 163
Query: 167 ----YELIDGDMVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKATG 222
E ++ D V D +S ++ GV + + G + V +K+ +
Sbjct: 164 VPFWLEAMEPDFVPMRTD------ESAGLNQGVFLDEWGRPKKYLV----YKNYPVSGRQ 213
Query: 223 ANGRRIA---WLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFI 279
++ + IA ++ +R + RG +L+ V+ I+ + + DA A + A + +I
Sbjct: 214 SDTKEIAAGKMIHLKFTRRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYI 273
Query: 280 QRQIGGKTAGTKPFSSGAQRRVQDTDFVG-DGTERRTNIAETPFGMMVEGLAEGEEIKAF 338
++ G QD + G TER +I TP G++ + L +GE+I
Sbjct: 274 RKGDG-----------------QDYEDPGIKETEREVHI--TP-GIIYDDLRKGEDIGMV 313
Query: 339 TNDTTDEKFGEFEEAILRAVAWALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRD 398
+D + F LRAVA + +S A ++ +YSA R E+ E ++D
Sbjct: 314 KSDRPNPNLETFRNGQLRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDCF 373
Query: 399 ADTLLRPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEK 458
+ RPVYR+WL +V A G L + Y A + G V +D K
Sbjct: 374 IGAVTRPVYRTWLNMVV------AAGLLKIPADVEMKTLYNA----TYSGPVMPWIDPVK 423
Query: 459 ETRGRGEQIKLGALTRTKAAR 479
E QI+ GA T + R
Sbjct: 424 EAEAWRIQIRGGAATESDWVR 444
>ref|NP_288458.1| (NC_002655) putative head-tail preconnector protein of prophage
CP-933U [Escherichia coli O157:H7 EDL933]
ref|NP_310988.1| (NC_002695) putative portal protein [Escherichia coli O157:H7]
dbj|BAB19579.1| (AP000400) portal protein [Escherichia coli O157:H7]
gb|AAG57012.1|AE005420_17 (AE005420) putative head-tail preconnector protein of prophage
CP-933U [Escherichia coli O157:H7 EDL933]
dbj|BAB36384.1| (AP002560) putative portal protein [Escherichia coli O157:H7]
Length = 500
Score = 65.1 bits (157), Expect = 2e-009
Identities = 92/381 (24%), Positives = 151/381 (39%), Gaps = 51/381 (13%)
Query: 110 AENVNDLFEAWGELPEVCDFKKLYTFGQWQRVAKAEALIAGDVLVVEHHNSATGLPT--- 166
AE D+ W E D YT +R+ L G+V + GL
Sbjct: 104 AELAADIRRLWAEWSVSPDVTGQYTRPVLERLLLRTWLRDGEVFAQMVSGAGNGLERTAG 163
Query: 167 ----YELIDGDMVQTPYDAQNKIGKSHRIDNGVEMTKDGEHVAFWVVQEGFKHKRIKATG 222
E ++ D V D +S ++ GV + + G + V +K+ +
Sbjct: 164 VPFWLEAMEPDFVPMRTD------ESAGLNQGVFLDEWGRPKKYLV----YKNYPVSGRQ 213
Query: 223 ANGRRIA---WLYYGTDKRHTEARGEPLLTLVMQNIAEIDKMRDATQRKASLGAQIVGFI 279
++ + IA ++ +R + RG +L+ V+ I+ + + DA A + A + +I
Sbjct: 214 SDTKEIAAGKMIHLKFTRRLHQTRGSSMLSGVLMRISALKEYEDAELTAARIAAALGLYI 273
Query: 280 QRQIGGKTAGTKPFSSGAQRRVQDTDFVG-DGTERRTNIAETPFGMMVEGLAEGEEIKAF 338
++ G QD + G TER +I TP G++ + L +GE+I
Sbjct: 274 RKGDG-----------------QDYEDPGIKETEREVHI--TP-GIIYDDLRKGEDIGMV 313
Query: 339 TNDTTDEKFGEFEEAILRAVAWALGMPYSVLAMQFDSSYSASRGEIKEFEAVIHEMRDRD 398
+D + F LRAVA + +S A ++ +YSA R E+ E ++D
Sbjct: 314 KSDRPNPNLETFRNGQLRAVAAGSRLSFSSAARNYNGTYSAQRQELVESTDGYLILQDCF 373
Query: 399 ADTLLRPVYRSWLRAMVLNRRIDAPGFLAAARNPREFATYAAWTRSQWYGMVKEAVDLEK 458
+ RPVYR+WL +V A G L + Y A + G V +D K
Sbjct: 374 IGAVTRPVYRTWLNMVV------AAGLLKIPADVEMKTLYNA----TYSGPVMPWIDPVK 423
Query: 459 ETRGRGEQIKLGALTRTKAAR 479
E QI+ GA T + R
Sbjct: 424 EAEAWRIQIRGGAATESDWVR 444
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 355,839,259
Number of Sequences: 1028106
Number of extensions: 15123247
Number of successful extensions: 32458
Number of sequences better than 5.0e-02: 15
Number of HSP's better than 0.1 without gapping: 6
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 32431
Number of HSP's gapped (non-prelim): 17
length of query: 545
length of database: 324,535,921
effective HSP length: 126
effective length of query: 419
effective length of database: 194,994,565
effective search space: 81702722735
effective search space used: 81702722735
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 94 (40.8 bits)
Query= orf6 [1616-2012]
(395 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAB65791.1| (AF009900) Z2Z3 protein [Vibrio shilonii] 94 4e-018
dbj|BAA89626.1| (AB036665) similar to GPC of lambda [bacter... 65 2e-009
gb|AAC24140.1| (AF071201) unknown [bacteriophage Felix 01] 60 6e-008
dbj|BAA89643.1| (AB036666) similar to minor capsid protein ... 59 1e-007
ref|NP_536361.1| (NC_003309) gp5 [Bacteriophage phiE125] >g... 56 8e-007
ref|NP_518979.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 53 5e-006
pir||T03571 probable peptidase - Rhodobacter capsulatus >gi... 52 1e-005
ref|NP_208226.1| (NC_000915) protease IV (PspA) [Helicobact... 51 2e-005
ref|NP_224046.1| (NC_000921) putative PROTEASE IV [Helicoba... 50 4e-005
ref|NP_540710.1| (NC_003317) PUTATIVE PROTEASE IV [Brucella... 50 6e-005
ref|NP_470963.1| (NC_003212) similar to proteases [Listeria... 49 1e-004
ref|NP_461541.1| (NC_003197) Gifsy-1 prophage: similar to h... 48 2e-004
sp|P36273|VG05_BPP21 HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CO... 45 0.001
ref|NP_287793.1| (NC_002655) putative capsid assembly prote... 44 0.003
ref|NP_275945.1| (NC_000916) protease IV [Methanothermobact... 44 0.004
>gb|AAB65791.1| (AF009900) Z2Z3 protein [Vibrio shilonii]
Length = 330
Score = 93.6 bits (231), Expect = 4e-018
Identities = 67/214 (31%), Positives = 106/214 (49%), Gaps = 8/214 (3%)
Query: 54 ARINMQGVMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGA 113
A I++ GV++ + F + GG + L + +AA N+ S+K + +DSPGGEA G
Sbjct: 40 AIIHVNGVISRYANLFHAVCGGVSTEV--LAKEFNAALNESSVKAVILNVDSPGGEASGI 97
Query: 114 IKLGDKIR--NAKKPXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLS-MRRPS 170
+L + I KKP + V +RVG+IG V+S ++RP
Sbjct: 98 HELSEMIHASRGKKPVRAYVGGDGCSAAYWITTACDRVTMDATARVGSIGTVVSFVKRPD 157
Query: 171 T--SVYVDVTSTDAPNKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSDVNQN 228
+ + S+ +PNKR DPESE+G+ I+ Q LD + +F VA G T V +
Sbjct: 158 AEGAKRFEFVSSQSPNKRLDPESEQGQTAIQTQ-LDAMAEVFISRVARNMGVTSDKVKSD 216
Query: 229 FGRGGSLFAETALKMGMIDKILTAETESATSTGG 262
FG+GG + A+ GM ++ + E A+ + G
Sbjct: 217 FGQGGVKIGQQAVDAGMAHELGSLEALIASLSNG 250
>dbj|BAA89626.1| (AB036665) similar to GPC of lambda [bacteriophage WO]
Length = 353
Score = 64.7 bits (156), Expect = 2e-009
Identities = 58/209 (27%), Positives = 90/209 (42%), Gaps = 15/209 (7%)
Query: 53 TARINMQGVMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEG 112
TA I + G++T P F G + Y + E I+ A ++ I+ I IDSPGGE G
Sbjct: 47 TAVIAINGILTKKPGAFDDFLGMTS--YEKIQEEIEEALSNKDIETILLDIDSPGGEVNG 104
Query: 113 AIKLGDKI--RNAKKPXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLS---MR 167
L D I AKK E V SR S VG+IG + S
Sbjct: 105 IFDLADFIYESRAKKRIIAIANDDAYSAAYAIASSAEKVFVSRTSGVGSIGVIASHIDQS 164
Query: 168 RPSTSVYVDVTSTDAPNKRPD-----PESEEGRRVIREQSLDPLHNMFAQAVADGRGTTL 222
R + T+ A +++ D P + E ++++ + L+ MF Q +A RG ++
Sbjct: 165 RFDEKQGIKYTTIFAGSRKNDLNPHEPMTSESLESLQKE-VGRLYEMFLQLIARNRGLSI 223
Query: 223 SDVNQNFGRGGSLFAETALKMGMIDKILT 251
+ G F E A+++G+ D + T
Sbjct: 224 EKIRST--EAGLYFGEKAVEIGLADGVTT 250
>gb|AAC24140.1| (AF071201) unknown [bacteriophage Felix 01]
Length = 442
Score = 59.7 bits (143), Expect = 6e-008
Identities = 72/290 (24%), Positives = 120/290 (40%), Gaps = 26/290 (8%)
Query: 39 PTANADRVMTVAGDTARINMQGVMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKR 98
P A ++ V D A I + G +T F + GG Y L D A D+SIK
Sbjct: 48 PKGEARSLLKVKDDIAIIPIMGGLTHRMTFIDAMCTGGLSSYEGLRRGFDEALADESIKT 107
Query: 99 IEYAIDSPGGEAEGAIKLGDKI---RNAKKPXXXXXXXXXXXXXXXXXXXDEIVAASRVS 155
I IDS GGEA G +L I R KK +EI+ AS +
Sbjct: 108 IVLHIDSGGGEASGCFELARHIMASRGQKKIIAYVDEFACSAAYALASSAEEII-ASPDA 166
Query: 156 RVGNIGAV-----LSMRRPSTSVYVDVTSTDAPNKRPDP---ESEEGRRVIREQSLDPLH 207
VG+IG + L+ V ++V P SEE + ++++ ++ +
Sbjct: 167 DVGSIGVIMVHQELTKAFEKNGVTINVIKAGEFKGMGSPFQALSEESKERLQKR-INDTY 225
Query: 208 NMFAQAVADGRGTTLSDVNQNFGRGGSLFAETALKMGMIDKILTAETESATSTGGAEA-- 265
F VA+ R LS+ A+ AL++G+I+ I++ + G EA
Sbjct: 226 ATFTGFVAESR--NLSEEAVKNTEANVYSAQEALELGLINSIMSQDDFLNYLQGSEEAPV 283
Query: 266 -----NAQDDTTGVQTMDLEKLKSEHPAVYAQAIEEGKQIGLTEERNRVA 310
N+ ++ T + +LE L+ + AQ + ++ L++ N+++
Sbjct: 284 SLNVNNSGEEMTEQEKQELEALRLQ----VAQMKAKEQEAALSDLTNKIS 329
>dbj|BAA89643.1| (AB036666) similar to minor capsid protein GPC of phage lambda
[Wolbachia sp. wKue]
Length = 350
Score = 58.9 bits (141), Expect = 1e-007
Identities = 54/212 (25%), Positives = 88/212 (41%), Gaps = 21/212 (9%)
Query: 53 TARINMQGVMTSTPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEG 112
TA I + G++T P+ F G + Y + E I+ A D+ ++ I IDSPGGE G
Sbjct: 49 TAIIPIHGILTKKPEAFDDFLGMTS--YEKMQEEIEEALEDEKVEIILLDIDSPGGEVNG 106
Query: 113 AIKLGDKIRNA--KKPXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRRPS 170
L D I NA KK E + +R S VG+IG + S S
Sbjct: 107 IFDLADFIYNARSKKRIIAIANDDAYSAAYAIASSAEKIFLTRTSGVGSIGVIASHIDQS 166
Query: 171 ----------TSVYVDVTSTDA-PNKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRG 219
++++ D P++ ES E + ++ L+ M + +A R
Sbjct: 167 GFDEKQGIKYSTIFAGSRKNDLNPHEPITSESLENLK----SEVNRLYGMLVELIARNRS 222
Query: 220 TTLSDVNQNFGRGGSLFAETALKMGMIDKILT 251
++ + G F E A+++G+ D + T
Sbjct: 223 LSIEAIKST--EAGLYFGEKAVEIGLADGVTT 252
>ref|NP_536361.1| (NC_003309) gp5 [Bacteriophage phiE125]
gb|AAL40278.1|AF447491_5 (AF447491) gp5 [Bacteriophage phiE125]
Length = 301
Score = 55.8 bits (133), Expect = 8e-007
Identities = 52/186 (27%), Positives = 78/186 (40%), Gaps = 17/186 (9%)
Query: 80 YCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK--KPXXXXXXXXXXX 137
Y L A++ A D +++ I IDS GG A GA +L D IR A KP
Sbjct: 82 YEGLRSAVNQAVADPAVEHIVLDIDSNGGSATGAFELADDIRAASLVKPITAIVNFSAFS 141
Query: 138 XXXXXXXXDEIVAASRVSRVGNIGAV---LSMRRPSTSVYVDVTSTDAPNKRPDPESEEG 194
V SR S VG+IG + L + + + VTS A + + D E
Sbjct: 142 GGYLIAAAASKVIVSRTSGVGSIGVIANHLDVSKRDEQQGIKVTSVFAGDHKNDLTPHE- 200
Query: 195 RRVIREQSLDPLHNM-------FAQAVADGRGTTLSDVNQNFGRGGSLFAETALKMGMID 247
+ +QSL L +M F A+A+ RG + V + G F + ++ G+ D
Sbjct: 201 --PLSDQSLTFLTSMVQNSYKQFVDAIANFRGLSTQAVKDT--QAGIFFGQKGVEAGLAD 256
Query: 248 KILTAE 253
+ T +
Sbjct: 257 SVETPQ 262
>ref|NP_518979.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14560.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 416
Score = 53.1 bits (126), Expect = 5e-006
Identities = 61/252 (24%), Positives = 93/252 (36%), Gaps = 25/252 (9%)
Query: 76 GNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK--KPXXXXXXX 133
G Y + E +DAA D I I +DSPGGE+ G L D+IR A KP
Sbjct: 80 GLASYTAIGEQLDAALADPGIAAILLDVDSPGGESGGVFDLADRIRVAAAIKPVWAAAND 139
Query: 134 XXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRRPS----------TSVYVDVTSTDAP 183
V SR VG+IG + S T+V+ D
Sbjct: 140 MAFSAAYALACAASRVFVSRTGGVGSIGVIAMHVDQSVKDAKDGVRYTAVFAGARKNDLN 199
Query: 184 NKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSDVNQNFGRGGSLFAETALKM 243
P + + + + + ++ +F V RG + V G F + A+
Sbjct: 200 PHEPITDEAQAQ---LQAEVSRIYRLFVATVGSYRGLSAEAVTAT--EAGLFFGQDAVAA 254
Query: 244 GMIDKILT-----AETESATSTGGAEANAQDDTTGVQTMDLEKLKSEHPAVYAQAIEEGK 298
G+ D + T A+ ++ S A+A+ + +Q MD PA A ++
Sbjct: 255 GLADAVGTFDDALAQLTASLSVPAPVASARTPSLNLQ-MDCSMNTQPDPA--AVSVPAAD 311
Query: 299 QIGLTEERNRVA 310
IG T + VA
Sbjct: 312 AIGTTAQPPVVA 323
>pir||T03571 probable peptidase - Rhodobacter capsulatus
gb|AAC16224.1| (AF010496) potential peptidase [Rhodobacter capsulatus]
Length = 303
Score = 52.0 bits (123), Expect = 1e-005
Identities = 54/191 (28%), Positives = 79/191 (41%), Gaps = 16/191 (8%)
Query: 76 GNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRN--AKKPXXXXXXX 133
G Y + EA DD++ + D+PGG G +R+ A KP
Sbjct: 77 GWATYRGIAEACADLAADDAVSAVVVEFDTPGGYTLGIEDAVTALRDLAAVKPVHALAAP 136
Query: 134 XXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRR---PSTSVYV---DVTSTDAPNKRP 187
+ + + VG+IG ++ P + V D TS A K P
Sbjct: 137 LAASAGYWLASQARELVMTPRAAVGSIGVAVTAAANVAPGAASGVQLFDFTSRHARAKWP 196
Query: 188 DPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSD------VNQNFGRGGSLF-AETA 240
DP +E+G+ IR + LD F AVA+GRG D V+ + GG++F A A
Sbjct: 197 DPATEDGKAEIR-RGLDRTEARFHAAVAEGRGIAPEDLAARLSVSDDPEDGGAVFEAAEA 255
Query: 241 LKMGMIDKILT 251
L G+ D+I T
Sbjct: 256 LARGLADRIET 266
>ref|NP_208226.1| (NC_000915) protease IV (PspA) [Helicobacter pylori 26695]
pir||C64699 proteinase IV (EC 3.4.-.-) - Helicobacter pylori (strain 26695)
gb|AAD08477.1| (AE000644) protease IV (PspA) [Helicobacter pylori 26695]
Length = 292
Score = 51.2 bits (121), Expect = 2e-005
Identities = 52/198 (26%), Positives = 76/198 (38%), Gaps = 11/198 (5%)
Query: 64 STPDFFAMLFGGGNCLYCDLFEA-IDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRN 122
S P A L+ G + F+ +D SIK + IDSPGG +++L +KI +
Sbjct: 45 SAPPNLAKLYLNGAIFSTEDFDKEVDKILKTPSIKGVLLLIDSPGGAVSASVELSEKIAD 104
Query: 123 AKK--PXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRRPSTSVYVDVTST 180
K+ P V AS+ S +G+IG + S + +T
Sbjct: 105 LKQKMPVLAYARGVMASGSYYAGMQASEVYASKASLIGSIGVIFSGANVENLLNKVGVAT 164
Query: 181 DAPNKRPDPESEEGRRVIREQSLDPLHN-------MFAQAVADGRGTTLSDVNQNFGRGG 233
+ E R + D L N MF VA R D ++F G
Sbjct: 165 QGVHAGEYKEIGTFTRAWKPNEKDFLQNLVNEQYQMFVNDVAKARKLDAKDY-KDFAEGK 223
Query: 234 SLFAETALKMGMIDKILT 251
A+ ALK+ +IDKI T
Sbjct: 224 VFSAQKALKLKLIDKIST 241
>ref|NP_224046.1| (NC_000921) putative PROTEASE IV [Helicobacter pylori J99]
pir||F71819 probable proteinase IV - Helicobacter pylori (strain J99)
gb|AAD06917.1| (AE001556) putative PROTEASE IV [Helicobacter pylori J99]
Length = 292
Score = 50.4 bits (119), Expect = 4e-005
Identities = 51/198 (25%), Positives = 76/198 (37%), Gaps = 11/198 (5%)
Query: 64 STPDFFAMLFGGGNCLYCDLFEA-IDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRN 122
S P A L+ G + F+ +D SIK + IDSPGG +++L +KI +
Sbjct: 45 SEPPNLAKLYLNGAIFSTEDFDKEVDKILKTPSIKGVLLLIDSPGGAVSASVELSEKIAD 104
Query: 123 AKK--PXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLSMRRPSTSVYVDVTST 180
K+ P V AS+ S +G+IG + S + +T
Sbjct: 105 LKQKMPVLAYARGVMASGSYYAGMQASEVYASKASLIGSIGVIFSSANVENLLNKVGVAT 164
Query: 181 DAPNKRPDPESEEGRRVIREQSLDPLHN-------MFAQAVADGRGTTLSDVNQNFGRGG 233
+ E R + + L N MF VA R D ++F G
Sbjct: 165 QGVHAGEYKEIGTFTRAWKPNEKEFLQNLVNEQYQMFVDDVAKARKLNAKDY-KDFAEGK 223
Query: 234 SLFAETALKMGMIDKILT 251
A+ ALK+ +IDKI T
Sbjct: 224 VFSAQKALKLKLIDKIST 241
>ref|NP_540710.1| (NC_003317) PUTATIVE PROTEASE IV [Brucella melitensis]
gb|AAL52974.1| (AE009612) PUTATIVE PROTEASE IV [Brucella melitensis]
Length = 301
Score = 49.7 bits (117), Expect = 6e-005
Identities = 38/150 (25%), Positives = 62/150 (41%), Gaps = 11/150 (7%)
Query: 82 DLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRN--AKKPXXXXXXXXXXXXX 139
+L + +D DD++K + +DSPGG G + D +R KKP
Sbjct: 39 ELLKRLDKIAGDDAVKGVILLLDSPGGTTVGGEAIYDAVRKIATKKPVVTQVGTLAASAG 98
Query: 140 XXXXXXDEIVAASRVSRVGNIGAV-----LSMRRPSTSVYVDVTSTDAPNKRPD---PES 191
+ + A + S VG+IG + LS + V V+ + P+ P S
Sbjct: 99 YMIASASDHIVARQTSIVGSIGVLFQYPDLSKLLDTLGVKVETIKSSPLKAEPNYFSPAS 158
Query: 192 EEGRRVIREQSLDPLHNMFAQAVADGRGTT 221
EE + +IR +D ++ F V + R T
Sbjct: 159 EEAKNMIRSMIMDS-YDWFVGIVQERRAFT 187
>ref|NP_470963.1| (NC_003212) similar to proteases [Listeria innocua]
emb|CAC96858.1| (AL596169) similar to proteases [Listeria innocua]
Length = 337
Score = 48.5 bits (114), Expect = 1e-004
Identities = 59/262 (22%), Positives = 103/262 (38%), Gaps = 32/262 (12%)
Query: 10 LQLMADAPDLTQEQVDKCMASSPLFLGNGPTANADRVMTV---AGDT--ARINMQGVMTS 64
L +++ T Q+ SSP F+ + + TV GD A +++ G +
Sbjct: 15 LLIVSALAKFTSSQIASTEESSPTFVESLFADTGELTETVIEEGGDDTIAVLSVDGTIQD 74
Query: 65 TPDFFAMLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK 124
T D LFG + + ++ NDD I+ + ++SPGG + ++ DKI +
Sbjct: 75 TGDS-GSLFGDAGYDHSFFMQQLEQVRNDDYIQGVLLYVNSPGGGVMESAQIRDKILQIQ 133
Query: 125 K----PXXXXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVLS-------MRRPSTS- 172
K P + + AS+ + G++G ++ M++ S
Sbjct: 134 KERNIPFYVSMGSMAASGGYYISAPADKIFASKETLTGSLGVIMQGYDYSELMKKLGVSD 193
Query: 173 ------VYVDVTSTDAPNKRPDPESEEGRRVIREQSLDPLHNMFAQAVADGRGTTLSDVN 226
Y D+ S RP E E + I + +D +N F + VA GRG + V
Sbjct: 194 NTIKSGEYKDIMS----GTRPMTEDE---KKIMQSMIDDSYNEFVKVVAKGRGMSAEKV- 245
Query: 227 QNFGRGGSLFAETALKMGMIDK 248
+ G A + G+ID+
Sbjct: 246 RKIADGRIYDGRQAKENGLIDE 267
>ref|NP_461541.1| (NC_003197) Gifsy-1 prophage: similar to head-tail preconnector gp5
of phage 21 [Salmonella typhimurium LT2]
gb|AAL21500.1| (AE008818) Gifsy-1 prophage: similar to head-tail preconnector gp5
of phage 21 [Salmonella typhimurium LT2]
Length = 498
Score = 47.8 bits (112), Expect = 2e-004
Identities = 48/203 (23%), Positives = 79/203 (38%), Gaps = 17/203 (8%)
Query: 87 IDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKI--RNAKKPXXXXXXXXXXXXXXXXXX 144
+ A +D +K I IDSPGG+A GA D I +KP
Sbjct: 110 LQQAVSDPEVKGILLDIDSPGGQAAGAFDCADMIYRMREQKPVWALANETACSAAMLLAA 169
Query: 145 XDEIVAASRVSRVGNIGAVLS-------MRRPSTSVYVDVTSTDAPNKRPDPESEEGRRV 197
++ SR+G+IG V++ +++ + + + + P + E
Sbjct: 170 ACSHRLVTQTSRMGSIGVVMAHTSYAEKLKQEGIDITLIYSGAHKADLTPSQKLPESVYA 229
Query: 198 IREQSLDPLHNMFAQAVADGRGTTLSDV--NQNFGRGGSLFAETALKMGMIDKI----LT 251
+Q +D MFA+ VA G ++ V + G T L GM++ +
Sbjct: 230 DYQQRMDEARKMFAEKVARYTGLSVDAVMATEAAVYDGQAIITTGLADGMVNAADAIGVM 289
Query: 252 AETESATSTGGA--EANAQDDTT 272
AE ++ TGG E +A D T
Sbjct: 290 AEAINSNKTGGTMPELSAADAVT 312
>sp|P36273|VG05_BPP21 HEAD-TAIL PRECONNECTOR PROTEIN GP5 [CONTAINS: SCAFFOLD PROTEIN
(HEAD PROTEIN GP6)]
pir||JN0539 head protein gp5 - phage 21
gb|AAA32343.1| (M81255) head-tail preconnector gp5 [Bacteriophage 21]
Length = 501
Score = 45.1 bits (105), Expect = 0.001
Identities = 61/271 (22%), Positives = 101/271 (36%), Gaps = 25/271 (9%)
Query: 11 QLMADAPDLTQEQVDKCMASSPLFLGNGPTANADRVMTVAGDTARINMQGVMTSTPDFFA 70
Q+ DAP + E D+ MA G P RV V A + + G +
Sbjct: 46 QVQFDAPGMLAE-TDEYMAG-----GKRPA----RVYRVVNGIAVLPVTGTLVHR--LGG 93
Query: 71 MLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK--KPXX 128
M G Y + + A D ++ + IDSPGG+A GA D I + KP
Sbjct: 94 MRPFSGMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVW 153
Query: 129 XXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVL---SMRRPSTSVYVDVTSTDAPNK 185
++ SR+G+IG ++ S VD+T +
Sbjct: 154 ALCNDTACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAH 213
Query: 186 RPDPESEEGR----RVIREQSLDPLHNMFAQAVADGRGTTLSDVNQNFGRGGSLF-AETA 240
+ D E R +Q +D MFA+ VA G ++ V G ++F ++
Sbjct: 214 KVDGNQFEALPAEVRQDMQQRIDAARRMFAEKVAMYTGLSVDAVT---GTEAAVFEGQSG 270
Query: 241 LKMGMIDKILTAETESATSTGGAEANAQDDT 271
++ G+ D+++ A + +N + T
Sbjct: 271 IEAGLADELINASDAISVMATALNSNVRGGT 301
>ref|NP_287793.1| (NC_002655) putative capsid assembly protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
ref|NP_309135.1| (NC_002695) putative head-tail preconnector protein [Escherichia
coli O157:H7]
ref|NP_310759.1| (NC_002695) putative head-tail preconnector protein [Escherichia
coli O157:H7]
gb|AAG56407.1|AE005368_13 (AE005368) putative capsid assembly protein of prophage CP-933R
[Escherichia coli O157:H7 EDL933]
dbj|BAB34531.1| (AP002554) putative head-tail preconnector protein [Escherichia
coli O157:H7]
dbj|BAB36155.1| (AP002559) putative head-tail preconnector protein [Escherichia
coli O157:H7]
Length = 501
Score = 44.3 bits (103), Expect = 0.003
Identities = 61/271 (22%), Positives = 100/271 (36%), Gaps = 25/271 (9%)
Query: 11 QLMADAPDLTQEQVDKCMASSPLFLGNGPTANADRVMTVAGDTARINMQGVMTSTPDFFA 70
Q+ DAP + E D+ MA G P RV V A + + G +
Sbjct: 46 QVQLDAPGMLAE-TDEYMAG-----GKRPA----RVYRVVNGIAVLPVTGTLVHR--LGG 93
Query: 71 MLFGGGNCLYCDLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAK--KPXX 128
M G Y + + A D ++ + IDSPGG+A GA D I + KP
Sbjct: 94 MRPFSGMTGYDGIVACLQQAMADSQVRGVLLDIDSPGGQAAGAFDCADMIYRLRQQKPVW 153
Query: 129 XXXXXXXXXXXXXXXXXDEIVAASRVSRVGNIGAVL---SMRRPSTSVYVDVTSTDAPNK 185
++ SR+G+IG ++ S VD+T +
Sbjct: 154 ALCNDTACSAAMLLASACSRRLVTQTSRIGSIGVMMSHVSYAGHLAQAGVDITLIYSGAH 213
Query: 186 RPDPESEEGR----RVIREQSLDPLHNMFAQAVADGRGTTLSDVNQNFGRGGSLF-AETA 240
+ D E R +Q +D MFA+ VA G ++ V G ++F ++
Sbjct: 214 KVDGNQFEALPAEVRQNMQQRIDAARRMFAEKVAMFTGLSVDAVT---GTEAAVFEGQSG 270
Query: 241 LKMGMIDKILTAETESATSTGGAEANAQDDT 271
+ G+ D+++ A + +N + T
Sbjct: 271 IDAGLADELVNASDAISVMATALNSNVRGGT 301
>ref|NP_275945.1| (NC_000916) protease IV [Methanothermobacter thermautotrophicus]
[Methanothermobacter thermautotrophicus str. Delta H]
pir||F69207 proteinase IV - Methanobacterium thermoautotrophicum (strain Delta
H)
gb|AAB85306.1| (AE000858) protease IV [Methanothermobacter thermautotrophicus str.
Delta H]
Length = 287
Score = 43.5 bits (101), Expect = 0.004
Identities = 25/85 (29%), Positives = 39/85 (45%)
Query: 82 DLFEAIDAAENDDSIKRIEYAIDSPGGEAEGAIKLGDKIRNAKKPXXXXXXXXXXXXXXX 141
D+ E I A +D S+K I I+SPGG + +L D I ++KP
Sbjct: 59 DIKELIAEANSDPSVKAIVLDINSPGGTPVASEELMDAINKSEKPVVSWISDSGTSGAYL 118
Query: 142 XXXXDEIVAASRVSRVGNIGAVLSM 166
+ + AS + VG+IG +L +
Sbjct: 119 AASASDRIVASPSAWVGSIGVILDL 143
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.312 0.129 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 221,328,797
Number of Sequences: 1028106
Number of extensions: 8485333
Number of successful extensions: 19869
Number of sequences better than 5.0e-02: 27
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 25
Number of HSP's that attempted gapping in prelim test: 19859
Number of HSP's gapped (non-prelim): 28
length of query: 395
length of database: 324,535,921
effective HSP length: 123
effective length of query: 272
effective length of database: 198,078,883
effective search space: 53877456176
effective search space used: 53877456176
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 92 (40.0 bits)
Query= orf7 [2017-2153]
(135 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_518980.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED... 41 0.003
pir||S34243 16.5K structural protein - actinophage VWB >gi|... 38 0.022
>ref|NP_518980.1| (NC_003295) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14561.1| (AL646061) PROBABLE BACTERIOPHAGE-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 125
Score = 40.8 bits (94), Expect = 0.003
Identities = 19/42 (45%), Positives = 29/42 (68%)
Query: 32 AAAETWPAGAVLGRVTATGRYVRYNPAASDGSQVPSAILTEA 73
AA ++ G VLG VTATG+ + +P+A+DGSQ + +L +A
Sbjct: 30 AAGQSLVLGTVLGMVTATGKVKQLDPSATDGSQYAAGVLMQA 71
>pir||S34243 16.5K structural protein - actinophage VWB
emb|CAA50984.1| (X72092) ORFp14 [Bacteriophage VWB]
Length = 129
Score = 37.7 bits (86), Expect = 0.022
Identities = 20/50 (40%), Positives = 29/50 (58%)
Query: 39 AGAVLGRVTATGRYVRYNPAASDGSQVPSAILTEAVTQSAAGNITYSVAV 88
+G LG++TA+G+Y YN AASDG++ L A+ A I S A+
Sbjct: 50 SGIPLGKITASGKYGPYNNAASDGTETLVGFLYSALQAPAVNTIDPSGAM 99
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 9.75e+199
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 69,411,284
Number of Sequences: 1028106
Number of extensions: 2206067
Number of successful extensions: 4372
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4370
Number of HSP's gapped (non-prelim): 2
length of query: 135
length of database: 324,535,921
effective HSP length: 111
effective length of query: 24
effective length of database: 210,416,155
effective search space: 5049987720
effective search space used: 5049987720
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf8 [2157-2522]
(364 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_056680.1| (NC_002214) gp348 [Streptococcus thermophi... 43 0.005
pir||T13320 hypothetical protein 31 - Streptococcus phage p... 41 0.019
>ref|NP_056680.1| (NC_002214) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
pir||T13625 hypothetical protein gp348 - Streptococcus phage phi-Sfi11
gb|AAC34402.1| (AF158600) gp348 [Streptococcus thermophilus bacteriophage Sfi11]
Length = 348
Score = 43.1 bits (100), Expect = 0.005
Identities = 41/167 (24%), Positives = 68/167 (40%), Gaps = 16/167 (9%)
Query: 37 GKTVEIDVKRFSENVAVVITKLSGPNFNDASLISTKEFEPPEYGEAFATDVDDLLQRLIG 96
G++V + F NV V D + + P + EA +D Q +
Sbjct: 51 GQSVALKAAAFDTNVTV----------RDRVSAEMHDEQMPFFKEAMLVKENDRQQLNL- 99
Query: 97 VNPYDDANIAYSSKLVGRLMDYFMEANDMIMRGVEIQASQILQTGRLNLLDRDGETAYEI 156
D N A + +V + + + + +E Q+L TG++ DG +I
Sbjct: 100 --VKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTS-DGVNK-DI 155
Query: 157 DYAPKATHFPTVTTAWSDDGADPIDDLRALFEVIRADGKVNPDMIIM 203
DY K H V+ +W++ GA P+ DL E R G +NP+ +M
Sbjct: 156 DYGVKPDHKKQVSKSWAEPGATPLADLEDAIETARELG-LNPERAVM 201
>pir||T13320 hypothetical protein 31 - Streptococcus phage phi-O1205
gb|AAC79546.1| (U88974) ORF31 [Streptococcus thermophilus temperate bacteriophage
O1205]
Length = 348
Score = 41.2 bits (95), Expect = 0.019
Identities = 25/74 (33%), Positives = 38/74 (50%), Gaps = 3/74 (4%)
Query: 130 VEIQASQILQTGRLNLLDRDGETAYEIDYAPKATHFPTVTTAWSDDGADPIDDLRALFEV 189
+E Q+L TG++ DG +IDY K H V+ +W++ GA P+ DL E
Sbjct: 131 LEAMRMQVLATGKIAFTS-DGVNK-DIDYGVKPDHKKQVSKSWAEPGATPLADLEDAIET 188
Query: 190 IRADGKVNPDMIIM 203
R G +NP+ +M
Sbjct: 189 ARELG-LNPERAVM 201
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.319 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 239,165,556
Number of Sequences: 1028106
Number of extensions: 10169464
Number of successful extensions: 26545
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 2
Number of HSP's that attempted gapping in prelim test: 26544
Number of HSP's gapped (non-prelim): 3
length of query: 364
length of database: 324,535,921
effective HSP length: 122
effective length of query: 242
effective length of database: 199,106,989
effective search space: 48183891338
effective search space used: 48183891338
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 92 (40.0 bits)
Query= orf9 [2651-2682]
(30 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.313 0.143 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 23,737,498
Number of Sequences: 1028106
Number of extensions: 438023
Number of successful extensions: 682
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 682
Number of HSP's gapped (non-prelim): 0
length of query: 30
length of database: 324,535,921
effective HSP length: 6
effective length of query: 24
effective length of database: 318,367,285
effective search space: 7640814840
effective search space used: 7640814840
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf10 [2682-2811]
(128 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.313 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 62,409,598
Number of Sequences: 1028106
Number of extensions: 2086195
Number of successful extensions: 3050
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3050
Number of HSP's gapped (non-prelim): 0
length of query: 128
length of database: 324,535,921
effective HSP length: 104
effective length of query: 24
effective length of database: 217,612,897
effective search space: 5222709528
effective search space used: 5222709528
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf11 [2808-3038]
(229 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 134,864,601
Number of Sequences: 1028106
Number of extensions: 5071371
Number of successful extensions: 10485
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 10485
Number of HSP's gapped (non-prelim): 0
length of query: 229
length of database: 324,535,921
effective HSP length: 117
effective length of query: 112
effective length of database: 204,247,519
effective search space: 22875722128
effective search space used: 22875722128
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 89 (38.9 bits)
Query= orf12 [3042-3543]
(500 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_050643.1| (NC_000929) major tail subunit [Enterobact... 105 1e-021
ref|NP_404841.1| (NC_003143) putative bacteriophage tail sh... 104 2e-021
ref|NP_284552.1| (NC_003116) hypothetical protein [Neisseri... 97 3e-019
ref|NP_599043.1| (NC_003444) tail sheath protein [Shigella ... 84 3e-015
ref|NP_313006.1| (NC_002695) putative tail sheath protein [... 76 8e-013
ref|NP_543096.1| (NC_003356) putative sheath protein [Bacte... 74 4e-012
ref|NP_274135.1| (NC_003112) phage sheath protein [Neisseri... 74 5e-012
>ref|NP_050643.1| (NC_000929) major tail subunit [Enterobacteria phage Mu]
sp|P79678|VPL_BPMU Tail sheath protein (gpL)
dbj|BAA19195.1| (AB000833) sheath protein gpL [Enterobacteria phage Mu]
gb|AAF01117.1|AF083977_36 (AF083977) L [Enterobacteria phage Mu]
Length = 495
Score = 105 bits (262), Expect = 1e-021
Identities = 136/487 (27%), Positives = 208/487 (41%), Gaps = 37/487 (7%)
Query: 5 NAIDNSVRARVLGIKTEFRNFNTGRTFFLP-QHVALLGQ-GNTAATYELTPFRATSAAQV 62
NAI + VR + I EF N N P Q V + GQ G+ A+ P R S +Q
Sbjct: 7 NAIPSDVRVPLTYI--EFDNSNAVSGTPAPRQRVLMFGQSGSKASAAPNVPVRIRSGSQA 64
Query: 63 GQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATV 122
FG GS L L A L N R L P G N G AA G + LSGTA ++
Sbjct: 65 SAAFGQGSMLALMADAFLNAN----RVAELWCIPQG-NGTGNAAVGEISLSGTAGENGSL 119
Query: 123 RVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTAAV------DGTNANEVDVT 176
I QR + V++ G T A LLVA I+G P +P+TA V D T+A+ V ++
Sbjct: 120 VTYIAGQRLA-VSVAAGATGAALADLLVARIKGQPDLPVTAEVRADSGDDDTHADVV-LS 177
Query: 177 AKWQG-LSGNDL---VVSIEGSIPGITVAITQPTGGAADPEVADTLALFGENVWYTQIVS 232
AK+ G LS D+ + E + GI A +G +P+++ ++A G+ + Y IV
Sbjct: 178 AKFTGALSAVDVRWNYYAGETTPYGIITAFKAASGKNGNPDISASIAGMGD-LQYKYIVM 236
Query: 233 CFNTANTDALDAFETFGEGRWDPILRRP---LAVFTGTNETDPNTLAAIGDARRAQRTNL 289
+ + L+ T + RW P+ + + V +GT ++ G +R +
Sbjct: 237 PYT--DEPNLNLLRTELQERWGPVNQADGFAVTVLSGTY----GDISTFGVSRNDHLISC 290
Query: 290 VTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWTHTQRDLLV 349
+ + GA + DPA LTLPG P ++T ++R+ L+
Sbjct: 291 MG-IAGAPEPSYLYAATLCAVASQALSIDPARPLQTLTLPGRMPPAVGDRFTWSERNALL 349
Query: 350 KAGISTSIVRD-GVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLFNLDLIFNTRE 406
GIST V D G +I +T Y G+ +P YL S + ++L T++
Sbjct: 350 FDGISTFNVNDGGEMQIERMITMYRTNKYGDSDPSYLNVNTIATLSYLRYSLRTRI-TQK 408
Query: 407 WDGAPLIPD-DQPTTNARAKRPKDAVAVLSVLANNLGLDAIISDVPFTLENIRASINDQN 465
+ L D + T P L L ++ D E + + N +
Sbjct: 409 FPNYKLASDGTRFATGQAVVTPSVIKTELLALFEEWENAGLVEDFDTFKEELYVARNKDD 468
Query: 466 PNRLDII 472
+RLD++
Sbjct: 469 KDRLDVL 475
>ref|NP_404841.1| (NC_003143) putative bacteriophage tail sheath protein [Yersinia
pestis]
emb|CAC90076.1| (AJ414147) putative bacteriophage tail sheath protein [Yersinia
pestis]
Length = 502
Score = 104 bits (260), Expect = 2e-021
Identities = 103/371 (27%), Positives = 160/371 (42%), Gaps = 33/371 (8%)
Query: 21 EFRNFNTGRTFFLPQHVALLGQGNTAATYELT-PFRATSAAQVGQRFGFGSPLHLAALQL 79
EF N + T Q ++GQ +T P +S A V G GS LH
Sbjct: 20 EFDN-SQANTASTTQRTLIIGQTRAGSTLPANIPLLVSSTATVAGLSGAGSMLHGQMAAY 78
Query: 80 LPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTG 139
L ++ G + + P+ D D+ VAA G + ++ AS T + + IG R + T
Sbjct: 79 LANDTAG----EIYLLPLSDADSMVAAIGKITINTAASETGVISLYIGGIRVQTTVVATD 134
Query: 140 TTAEQAAALLVAGIQGNPFMPMTAAVDGTNANE---VDVTAKWQGLSGN--DLVVSIEGS 194
+ + AAAL A I+ P +P+T G +E V + AK +G GN DL ++ G+
Sbjct: 135 SVSTIAAAL-AAAIENQPELPVTVVHTGDMVSEGAVVVLEAKNKGAHGNNIDLRLNYLGN 193
Query: 195 IPG------ITVAITQPTGGAADPEVADTLALFGENVWYTQIVSCFNTANTDALDAFETF 248
G + + IT GGA PE+AD LA + + I +TA+ DA+ F +
Sbjct: 194 AGGEITPESLVLMITPMAGGAGAPELADGLANLQDRTFDFIINPYTDTASLDAIKDFLSD 253
Query: 249 GEGRW---DPILRRPLAVFTGTNETDPNTLAAIGDARRAQRTNLVTPVPGAQNLPCEI-- 303
GRW + A +GT L A G+ R Q +L+ G N P
Sbjct: 254 STGRWSYRQQLYGHSFAAQSGTY----GQLTAAGELRNDQHASLL----GIHNSPTPAHI 305
Query: 304 -XXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWTHTQRDLLVKAGIST-SIVRDG 361
NDP L + G+ + ++T T+R+ L+ +GIST ++ D
Sbjct: 306 WSAAYVGAIAQSLRNDPGRPLQTLAIRGVLAPPLSSRFTLTERNNLLHSGISTVTVADDS 365
Query: 362 VAEISDTVTTY 372
++ + +TTY
Sbjct: 366 TVQVENIITTY 376
>ref|NP_284552.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
pir||E81810 hypothetical protein NMA1841 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB71993.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
emb|CAB85065.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
Length = 475
Score = 97.4 bits (241), Expect = 3e-019
Identities = 105/369 (28%), Positives = 156/369 (41%), Gaps = 40/369 (10%)
Query: 19 KTEFRNFNTGRTFFLPQHVALLGQGNTAATYELTPFRAT-SAAQVGQRFGFGSPLHLAAL 77
K RN T + Q V ++ Q N A ELT SAA ++G GS HL
Sbjct: 27 KLAVRNLPTNK-----QRVLIVAQHNNPALGELTELENVFSAADAAAKYGAGSMAHLMVT 81
Query: 78 QLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIP 137
+ L++ + DN AGVAA G + LSGTA+ +RV I + + + I
Sbjct: 82 AAIK----AYAYADLSLITVADNKAGVAAGGKITLSGTANTQGVLRVSIANADTLTIGIG 137
Query: 138 TGTTAEQAAALLVAGIQGNPFMPMTAAVDGTNANEVDVTAKWQGLSGNDLVVSIEGSIPG 197
TA AA + A I P +P+TA V V +TAK +G +GN + + + G
Sbjct: 138 AEDTAATVAAAVKAAIDAVPDLPVTATVAEA---VVTLTAKNKGTAGNAIRIKTSNTAEG 194
Query: 198 ITVAITQPTGGAADPEVADTL-ALFGENVWYTQIVSCFNTANTDA------LDAFETFGE 250
IT A+T TGG A+P++A L A+ E I++C T+ LD + E
Sbjct: 195 ITAAVTAMTGGDANPDIAAALNAVVAEG---HHIIACGINDETNLLKLRAHLDTVASPME 251
Query: 251 GRWDPILRRPLAVFTGTNETDPNTLA-AIGDARRAQRTNLVTP-VPGAQNLPCEIXXXXX 308
RW + V+ T TLA A A R ++V+ G +LPCE+
Sbjct: 252 KRW------AICVYGQT-----GTLAQATTLAGRLNHGHIVSAWYRGTPSLPCEL--AAA 298
Query: 309 XXXXXXXNNDPASDFARLTLPGLTPGTDAQQWTHTQRDLLVKAGIS--TSIVRDGVAEIS 366
DPA L L + + T+++ + G++ + A+I
Sbjct: 299 FAAVMASEEDPARPLNTLALNSIGVCESKDKTMRTEQENALYNGVTPIETSPAGTQAQIV 358
Query: 367 DTVTTYHPT 375
+TTY T
Sbjct: 359 RAITTYTKT 367
>ref|NP_599043.1| (NC_003444) tail sheath protein [Shigella flexneri bacteriophage V]
gb|AAL89420.1| (U82619) tail sheath protein [Shigella flexneri bacteriophage V]
Length = 498
Score = 84.3 bits (207), Expect = 3e-015
Identities = 92/388 (23%), Positives = 150/388 (37%), Gaps = 27/388 (6%)
Query: 103 GVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMT 162
G AA L ++G A+ + TV V +G R + G A+ + I P +P T
Sbjct: 97 GAAATVTLTVTGEATESGTVNVYVGRTRVQ-APVTNGDNVTTIASSIQDAINAVPTLPFT 155
Query: 163 AAVDGTNANEVDVTAKWQGLSGNDLVVSIE------GSI--PGITVAITQPTGGAADPEV 214
A+ ++A V +TA+ +GL GN++ VS+ G + G+ +A+ T G P +
Sbjct: 156 AS---SSAGVVTLTARHKGLCGNEIPVSLNYYGFGGGEVLPAGVQIAVATGTAGTGAPVL 212
Query: 215 ADTLALFGENVWYTQIVSCFNTANTDALDAFETFGEGRWDPILRRPLAVFTGTNETDPNT 274
+A + + + +TA+ + L GRW + V+T T +
Sbjct: 213 TGAVAAMADEPFDYIGLPFNDTASVNTLVTEMNDTSGRWSYARQLYGHVYTAKTGT-LSE 271
Query: 275 LAAIGDARRAQRTNLVTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPG 334
L GD Q L Q E+ NDPA L G+ P
Sbjct: 272 LVNAGDQFNQQHITLAGYEKETQTPADELAASRTARAAVFIRNDPARPTQTGELVGMLPA 331
Query: 335 TDAQQWTHTQRDLLVKAGISTSIVRDGVAEISDTVTTYHPT--GEVNPGYLFYKDTVKAS 392
+++T T++ L+ G++T+ V GV I VTTY G + YL + ++
Sbjct: 332 PKGKRFTMTEQQTLLSHGVATAYVESGVLRIQRDVTTYRKNAYGVADNSYLDSETLHTSA 391
Query: 393 NVLFNLDLIFNTR------EWDGAPLIPDDQPTTNARAKRPKDAVAVLSVLANNLGLDAI 446
VL L + ++ DG P T A K L L I
Sbjct: 392 YVLRKLKSVITSKYGRHKLASDGTRFGPGQAIVTPAVIK------GELLATYRQLERAGI 445
Query: 447 ISDVPFTLENIRASINDQNPNRLDIIFP 474
+ + + + + PNRL+ +FP
Sbjct: 446 VENYELFKQYLVVERDASVPNRLNTLFP 473
>ref|NP_313006.1| (NC_002695) putative tail sheath protein [Escherichia coli O157:H7]
dbj|BAB38402.1| (AP002567) putative tail sheath protein [Escherichia coli O157:H7]
Length = 500
Score = 76.3 bits (186), Expect = 8e-013
Identities = 107/438 (24%), Positives = 176/438 (39%), Gaps = 33/438 (7%)
Query: 53 PFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPLTIFPMGDNDAGVAAEGVLDL 112
P R T A+Q + +G GS + L + + N D L G N G A ++L
Sbjct: 58 PVRITRASQATELWGRGSMIALMVAEFIAINPDA----ELYAIAQG-NGTGQATAAAMNL 112
Query: 113 SGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAGIQGNPFMPMTA---AVDGTN 169
+GT + V +G +R +L P G ++ L A I +P P TA A G N
Sbjct: 113 TGTVTRDGIVYACVGGRRYTLPA-PKGKKGKELTDELAALINADPDAPFTASSGAGSGDN 171
Query: 170 A----NEVDVTAKWQG-LSGNDLVVSI---EGSIPGITVAITQPTGGAADPEVADTLALF 221
+ +TA++ G S +D+ ++ E + GI VAI P AA+P++ ++A
Sbjct: 172 GAGLKGSLGITARFTGECSVHDVRLNYHDGEATPEGIQVAIAYPKQKAANPDITRSVAGM 231
Query: 222 GENVWYTQIVSCFNTANTDALDAFETFGEGRWDPILRRPLAVFTGTNETDPNTLAAIGDA 281
G+ + ++ + AN L RW P ++ T + A G A
Sbjct: 232 GDRQYNYVVMPYKDDAN---LKIISDELLKRWGPAKMSDGVLWLAHTGTF-GEVQAFG-A 286
Query: 282 RRAQRTNLVTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTPGTDAQQWT 341
+R T +P A DPA L LP A + T
Sbjct: 287 KRNDFLATCTSIPKAPEPDYLWAAAVCATCAPSLGADPARPLQTLALPSRMAPAPADRLT 346
Query: 342 HTQRDLLVKAGIST-SIVRDGVAEISDTVTTYHPT--GEVNPGYLFYKDTVKASNVLFNL 398
+R+ L+ GI+T S+ V +I VT Y GE +P YL + S + ++L
Sbjct: 347 REERNSLLWGGIATVSVAAGDVVQIERQVTMYRQNVYGESDPSYLDVETIYTLSWLRYSL 406
Query: 399 DLIFNTREWDGAPLIPDDQPTTNARAKRPKDAVAVLSVLANNLGLD----AIISDVPFTL 454
F T+ + L D P RA + ++ + +LG + ++ ++
Sbjct: 407 -RTFITQRFPRHKLADDGTP---VRAGQNIVTPEIMKLQLISLGEEWVELGLVENLDTFK 462
Query: 455 ENIRASINDQNPNRLDII 472
+N+ N + NRLD++
Sbjct: 463 KNLLVERNASDRNRLDVL 480
>ref|NP_543096.1| (NC_003356) putative sheath protein [Bacteriophage P27]
emb|CAC83562.1| (AJ298298) putative sheath protein [Bacteriophage P27]
Length = 498
Score = 73.9 bits (180), Expect = 4e-012
Identities = 90/389 (23%), Positives = 152/389 (38%), Gaps = 29/389 (7%)
Query: 103 GVAAEGVLDLSGTASATATVRVRIGSQRSSL-VTIPTGTTAEQAAALLVAGIQGNPFMPM 161
G AA + ++G A + T+ + +G RSS+ V + G A A + + G +P
Sbjct: 97 GAAATVRVTVTGEAEESGTLSLYVG--RSSVQVPVVNGDDATAVATAIKEAVNGVITLPF 154
Query: 162 TAAVDGTNANEVDVTAKWQGLSGNDLVVSIE------GSI--PGITVAITQPTGGAADPE 213
A+ D A V +TA+ +GL GN+L V + G I G+ V T G+ P+
Sbjct: 155 AASSD---AGVVTLTARHKGLYGNELPVCLNYYGSGGGEILPAGLQVVTEAGTAGSGAPD 211
Query: 214 VADTLALFGENVWYTQIVSCFNTANTDALDAFETFGEGRWDPILRRPLAVFTGTNETDPN 273
+ +A G+ + + + A+ + + GRW + V+T T +
Sbjct: 212 LTAAVAAMGDEAFDFIGLPFNDAASINMMMTEMNDSSGRWSYARQLYGHVYTAKLGT-LS 270
Query: 274 TLAAIGDARRAQRTNLVTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLTLPGLTP 333
L GD Q L Q+ E+ NDPA L G+ P
Sbjct: 271 ELVNAGDMHNQQHITLAGYEKETQSPVDELVASRLAREAVFIRNDPARPTQTGELVGMLP 330
Query: 334 GTDAQQWTHTQRDLLVKAGISTSIVRDGVAEISDTVTTY--HPTGEVNPGYLFYKDTVKA 391
+++ T++ L+ G++T+ V G I +VTTY + G + YL + +
Sbjct: 331 APKGKRFIMTEQQTLLSHGVATAYVEGGTLRIQRSVTTYKKNAYGVADNSYLDSETLHTS 390
Query: 392 SNVLFNLDLIFNTREW------DGAPLIPDDQPTTNARAKRPKDAVAVLSVLANNLGLDA 445
+ VL L + ++ DG P T A K L +
Sbjct: 391 AYVLRKLKSVITSKYGRHKLANDGTRFGPGQAIVTPAVIK------GELLATYRQMERAG 444
Query: 446 IISDVPFTLENIRASINDQNPNRLDIIFP 474
I+ + + + + NPNRL+ +FP
Sbjct: 445 IVENYDLFKQYLIVERDADNPNRLNTLFP 473
>ref|NP_274135.1| (NC_003112) phage sheath protein [Neisseria meningitidis MC58]
pir||E81121 phage sheath protein NMB1104 [imported] - Neisseria meningitidis
(group B strain MD58)
gb|AAF41495.1| (AE002460) phage sheath protein [Neisseria meningitidis MC58]
Length = 469
Score = 73.6 bits (179), Expect = 5e-012
Identities = 88/345 (25%), Positives = 131/345 (37%), Gaps = 27/345 (7%)
Query: 34 PQHVALLGQGNTAATYE-LTPFRATSAAQVGQRFGFGSPLHLAALQLLPDNNDGLRSIPL 92
PQ V ++ TA L P + S A+ FG GS HL Q +N + L
Sbjct: 35 PQKVLMVAPMLTAGIQPALEPVQLFSDAEAADLFGQGSLAHLMVRQAFANNP----YLDL 90
Query: 93 TIFPMGDNDAGVAAEGVLDLSGTASATATVRVRIGSQRSSLVTIPTGTTAEQAAALLVAG 152
T+ + D+ AGV A + LSGTA+A V + IG ++ S + TG TA A L
Sbjct: 91 TVIGIADHSAGVQATATVTLSGTATAPGVVEITIGGKQVS-TAVNTGETAATVADRLKTA 149
Query: 153 IQGNPFMPMTAAVDGTNANEVDVTAKWQGLSGNDLVVSIEGSIPGITVAITQPTGGAADP 212
I +T G+ A V +TAK +G GN+ +++ G+T TGGA +
Sbjct: 150 ITA---ADVTVTASGSGA-AVTLTAKHKGEIGNESGLTVSTGNTGLTYQANAFTGGAKNA 205
Query: 213 EVADTLALFGENVWYTQIVSCFNTANTDALDAFETFGEGRWDP-----ILRRPLAVFTGT 267
++A L+ ++ + AN AL T + +L A+ T T
Sbjct: 206 DIATALSKVAGKHYHIICSPFSDDANAKALSNHITNVSNAIEQRGCIGVLGMSAALSTAT 265
Query: 268 NETDPNTLAAIGDARRAQRTNLVTPVPGAQNLPCEIXXXXXXXXXXXXNNDPASDFARLT 327
T I D R P DPA L
Sbjct: 266 TAT-----GEINDGRMTCAWYKGAVEPNG-------IIAAGYAAVLAFEEDPAKPLNTLE 313
Query: 328 LPGLTPGTDAQQWTHTQRDLLVKAGISTSIVRDGVAEISDTVTTY 372
+ GL DAQ + + + G++ V + +I V+TY
Sbjct: 314 IKGLAVTPDAQWPLFAECNNALYNGLTPLTVVNNRVQIMRAVSTY 358
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 316,395,816
Number of Sequences: 1028106
Number of extensions: 13381695
Number of successful extensions: 29261
Number of sequences better than 5.0e-02: 7
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 29247
Number of HSP's gapped (non-prelim): 8
length of query: 500
length of database: 324,535,921
effective HSP length: 125
effective length of query: 375
effective length of database: 196,022,671
effective search space: 73508501625
effective search space used: 73508501625
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 93 (40.4 bits)
Query= orf13 [3605-3675]
(69 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.307 0.125 0.000
Gapped
Lambda K H
0.267 0.0410 2.50e+262
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 38,340,155
Number of Sequences: 1028106
Number of extensions: 1211910
Number of successful extensions: 4970
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4970
Number of HSP's gapped (non-prelim): 0
length of query: 69
length of database: 324,535,921
effective HSP length: 45
effective length of query: 24
effective length of database: 278,271,151
effective search space: 6678507624
effective search space used: 6678507624
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf14 [3703-3837]
(133 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 84,081,426
Number of Sequences: 1028106
Number of extensions: 3350964
Number of successful extensions: 8801
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8801
Number of HSP's gapped (non-prelim): 0
length of query: 133
length of database: 324,535,921
effective HSP length: 109
effective length of query: 24
effective length of database: 212,472,367
effective search space: 5099336808
effective search space used: 5099336808
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 83 (36.6 bits)
Query= orf15 [3868-3903]
(34 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.325 0.142 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 18,686,389
Number of Sequences: 1028106
Number of extensions: 313471
Number of successful extensions: 679
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 679
Number of HSP's gapped (non-prelim): 0
length of query: 34
length of database: 324,535,921
effective HSP length: 10
effective length of query: 24
effective length of database: 314,254,861
effective search space: 7542116664
effective search space used: 7542116664
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 85 (37.4 bits)
Query= orf16 [3919-4523]
(603 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_058453.1| (NC_002321) orf 15 [Staphylococcus aureus ... 169 7e-041
ref|NP_061640.1| (NC_002486) phi PVL ORF 15 and 16 homologu... 169 7e-041
gb|AAL82371.1| (AF424783) tail length tape measure protein ... 169 1e-040
ref|NP_646712.1| (NC_003923) hypothetical protein [Staphylo... 166 1e-039
ref|NP_310668.1| (NC_002695) putative tail protein [Escheri... 163 5e-039
ref|NP_288365.1| (NC_002655) putative tail fiber component ... 161 2e-038
ref|NP_108725.1| (NC_002703) unknown [Bacteriophage Tuc2009... 157 5e-037
ref|NP_112708.1| (NC_002747) TMP [Lactococcus lactis bacter... 156 8e-037
ref|NP_075512.1| (NC_002661) orf1374 [Staphylococcus aureus... 145 1e-033
ref|NP_490625.1| (NC_003278) orf25; similar to T gene of P2... 130 5e-029
ref|NP_046782.1| (NC_001895) gpT [Enterobacteria phage P2] ... 114 3e-024
ref|NP_052273.1| (NC_001317) G protein [Enterobacteria phag... 111 2e-023
ref|NP_471714.1| (NC_003212) similar to hypothetical protei... 110 6e-023
gb|AAK85310.1| (AY043263) phage-related tail protein [Wolba... 49 2e-004
>ref|NP_058453.1| (NC_002321) orf 15 [Staphylococcus aureus bacteriophage PVL]
pir||T00148 hypothetical protein 15 - Staphylococcus aureus phage phi PVL
dbj|BAA31888.1| (AB009866) orf 15 [Staphylococcus aureus bacteriophage PVL]
Length = 694
Score = 169 bits (429), Expect = 7e-041
Identities = 114/384 (29%), Positives = 191/384 (49%), Gaps = 25/384 (6%)
Query: 59 SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
++ GIEF+ + + G + ++ L A+E+G ++++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
G++++Q M L ++ LA A+G +L +DI TD L AFGL KDS LA DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHLA-------DVL 287
Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
A+TS+ ANT + + E + AP A A G +IE S IG+M++ GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 359 FELSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 417
++ ++ +V + V ++ + +I V G L V L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVVGFGLFVAVLGPLVF 515
Query: 418 AFNTIVTTITGAITLLN-LVIAAN 440
F ++ + A+T+L L+I N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_061640.1| (NC_002486) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
prophage phiPV83]
dbj|BAA97857.1| (AB044554) phi PVL ORF 15 and 16 homologue [Staphylococcus aureus
prophage phiPV83]
Length = 1313
Score = 169 bits (429), Expect = 7e-041
Identities = 114/384 (29%), Positives = 191/384 (49%), Gaps = 25/384 (6%)
Query: 59 SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
++ GIEF+ + + G + ++ L A+E+G ++++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
G++++Q M L ++ LA A+G +L +DI TD L AFGL KDS A DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHFA-------DVL 287
Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
A+TS+ ANT + + E + AP A A G +IE S IG+M++ GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 359 FELSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 417
++ ++ +V + V ++ + +I V G L A V L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVIGFGLFAAVLGPLVF 515
Query: 418 AFNTIVTTITGAITLLN-LVIAAN 440
F ++ + A+T+L L+I N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>gb|AAL82371.1| (AF424783) tail length tape measure protein [Staphylococcus aureus
phage phi 13]
Length = 1550
Score = 169 bits (427), Expect = 1e-040
Identities = 113/384 (29%), Positives = 191/384 (49%), Gaps = 25/384 (6%)
Query: 59 SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
++ GIEF+ + + G + ++ L A+E+G ++++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
G++++Q M L ++ LA A+G +L +DI TD L AFGL KDS A D+L
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELEAVSDIVTDGLTAFGLKAKDSGHFA-------DIL 287
Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
A+TS+ ANT + + E + AP A A G +IE S IG+M++ GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 359 FELSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIATVFATL-W 417
++ ++ +V + V ++ + +I V G L A V L +
Sbjct: 468 YDRIEPALKIIVSAFS------------KVVTWVTKLPTSIQLAVVGFGLFAAVLGPLVF 515
Query: 418 AFNTIVTTITGAITLLN-LVIAAN 440
F ++ + A+T+L L+I N
Sbjct: 516 MFGLFISVMGNAMTVLGPLLINVN 539
>ref|NP_646712.1| (NC_003923) hypothetical protein [Staphylococcus aureus subsp.
aureus MW2]
dbj|BAB95760.1| (AP004828) hypothetical protein [Staphylococcus aureus subsp.
aureus MW2]
Length = 1549
Score = 166 bits (419), Expect = 1e-039
Identities = 98/302 (32%), Positives = 160/302 (52%), Gaps = 11/302 (3%)
Query: 59 SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
++ GIEF+ + + G + ++ L A+E+G ++++A+A
Sbjct: 179 AKKGIEFDDSMRKVKATSGAT----GEEFEALKKKAREMGATTKFSASDSAEALNYMALA 234
Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
G++++Q M L ++ LA A+G +L +DI TD L AFGL KDS A DVL
Sbjct: 235 GWDSKQMMEGLSGVMDLAAASGEELGAVSDIVTDGLTAFGLKAKDSGHFA-------DVL 287
Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
A+TS+ ANT + + E + AP A A G +IE S IG+M++ GIK E AGTA++ F
Sbjct: 288 AQTSSKANTDVRGLGEAFKYVAPVAGALGYTIEDTSIAIGLMSNAGIKGEKAGTALRTMF 347
Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
L++P + RLGI + DS G M ++ L + + Q+ + +FG
Sbjct: 348 TNLSSPTRAMGNEMERLGISITDSNGKMIPMRKLLDQLREKFKHLSKDQQASSAATIFGK 407
Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
E ++G L +INA + + ++ S+ GA+ RMA + + G LRTLRS +E +A+
Sbjct: 408 EAMSGALAIINASDEDYQKLTKSIDSSTGASKRMADTMESGLGGKLRTLRSQLEELALTI 467
Query: 359 FE 360
++
Sbjct: 468 YD 469
>ref|NP_310668.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
dbj|BAB36064.1| (AP002559) putative tail protein [Escherichia coli O157:H7]
Length = 696
Score = 163 bits (413), Expect = 5e-039
Identities = 122/392 (31%), Positives = 187/392 (47%), Gaps = 33/392 (8%)
Query: 111 GIDFLAMAGFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 170
G FLAMAGF + AALP ++++A A G++L DI ++ L F L A
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299
Query: 171 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVA 230
+ RV D L T NT + + ET++ P A G S+E +AM G++A+NG++ A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359
Query: 231 GTAVQNFFLRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLA 290
GTA++ RLA+P A L+ LG+ VAD+ G MR DV+ DL A G+ +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419
Query: 291 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 349
+ + G E G V AG L + L A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEEAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479
Query: 350 TVESVAIRFFELSGGPMRD-------VVDQATAWIRANRELIAQNV--AGFLNTIVENID 400
E + IR +L GP+R V+++ T+ +A+ L Q + G L + I
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539
Query: 401 SIVRGIKLIATVFATLWAFNTIVT--------------TITGAITLLNLVIAANPXXXXX 446
S+ I ++ ATL I+T +TG+++LL VI A
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVVRVLPALWGMVTGSVSLLGGVIGA--LFSPV 597
Query: 447 XXXXXXXXXXXXXXYLHWEPIKTFFTDLWDGI 478
+ +W+PI+ FF ++ GI
Sbjct: 598 GLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>ref|NP_288365.1| (NC_002655) putative tail fiber component of prophage CP-933T
[Escherichia coli O157:H7 EDL933]
gb|AAG56919.1|AE005413_13 (AE005413) putative tail fiber component of prophage CP-933T
[Escherichia coli O157:H7 EDL933]
Length = 696
Score = 161 bits (407), Expect = 2e-038
Identities = 121/392 (30%), Positives = 186/392 (46%), Gaps = 33/392 (8%)
Query: 111 GIDFLAMAGFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALN 170
G FLAMAGF + AALP ++++A A G++L DI ++ L F L A
Sbjct: 247 GQSFLAMAGFTPQAIQAALPGVLNMALAGGVELGETADIGSNILTQFNLT-------ADQ 299
Query: 171 LARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVA 230
+ RV D L T NT + + ET++ P A G S+E +AM G++A+NG++ A
Sbjct: 300 MDRVGDTLTAAFTRTNTDLRALGETMKYTGPVAAKLGISLEEAAAMAGMLANNGLRGSDA 359
Query: 231 GTAVQNFFLRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLA 290
GTA++ RLA+P A L+ LG+ VAD+ G MR DV+ DL A G+ +++
Sbjct: 360 GTAMRASLSRLASPPKAAADALKELGVSVADARGKMRPMEDVLLDLYKATQKYGQVDQVS 419
Query: 291 VIQKVFGAEGLAG-NLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRS 349
+ + G G V AG L + L A G ADR+AK + D++ G L+ L S
Sbjct: 420 FFKDIAGEXAFVGLQTLVAAAGSGELQKLTRELQGARGEADRVAKVMADNLDGDLKNLDS 479
Query: 350 TVESVAIRFFELSGGPMRD-------VVDQATAWIRANRELIAQNV--AGFLNTIVENID 400
E + IR +L GP+R V+++ T+ +A+ L Q + G L + I
Sbjct: 480 AWEGLRIRISDLVDGPLRSVTQWLTRVLEKITSLAQAHPVLTRQLLIAGGALLAMTATIG 539
Query: 401 SIVRGIKLIATVFATLWAFNTIVT--------------TITGAITLLNLVIAANPXXXXX 446
S+ I ++ ATL I+T +TG+++LL VI A
Sbjct: 540 SLSLVIGVLYGKLATLRLGFDILTRSMNVVRVLPALWGMVTGSVSLLGGVIGA--LFSPV 597
Query: 447 XXXXXXXXXXXXXXYLHWEPIKTFFTDLWDGI 478
+ +W+PI+ FF ++ GI
Sbjct: 598 GLIVAALAGAAVLIWKYWDPIRAFFAGVFSGI 629
>ref|NP_108725.1| (NC_002703) unknown [Bacteriophage Tuc2009]
gb|AAK19873.1|AF109874_46 (AF109874) unknown [Bacteriophage Tuc2009]
Length = 1025
Score = 157 bits (396), Expect = 5e-037
Identities = 115/441 (26%), Positives = 199/441 (45%), Gaps = 23/441 (5%)
Query: 59 SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
++ G +FE ++R G + + Q +D+ G G++ LA A
Sbjct: 69 AKIGGDFESQMSRVKAISGATGSSFEELRQQAIDL----GAKTAFSAKESASGMENLASA 124
Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
GFNA++ M A+P ++ LA +G D+ A++ A +L F L S +A +V
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVGLASENAATALRGFNLDASQSGHVA-------NVF 177
Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
AK + + N + M E ++ AP A + G SIE VSA IG+M+ GIK AGT+++
Sbjct: 178 AKAAANTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237
Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
RLA P + + LG+ DS G M+ D +G L A + Q+ + ++G
Sbjct: 238 SRLAKPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297
Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
E L+G + +I+ G D L + +L +++GAAD+MAK + D+M SL + +ES AI
Sbjct: 298 ESLSGMIALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGALESAAIVV 357
Query: 359 FELSGGPMRDVVDQATAWIR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 412
++ +R V D + + + Q + + IV I ++ + + ++ V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVLVLQRV 417
Query: 413 FATLWAFNTIVTTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYLHWEPIKTFFT 472
A + + I G+ T ++L + Y +W+ I F
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471
Query: 473 DLWDGIGNAFDTFATNLANKW 493
++W + A + KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_112708.1| (NC_002747) TMP [Lactococcus lactis bacteriophage TP901-1]
gb|AAG32164.1|AF252967_1 (AF252967) tape measure protein [Lactococcus lactis bacteriophage
TP901-1]
gb|AAK38062.1| (AF304433) TMP [Lactococcus lactis bacteriophage TP901-1]
Length = 937
Score = 156 bits (394), Expect = 8e-037
Identities = 116/441 (26%), Positives = 197/441 (44%), Gaps = 23/441 (5%)
Query: 59 SRAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMA 118
++ G +FE ++R G + + Q +D+ G G++ LA A
Sbjct: 69 AKIGGDFESQMSRVKAISGATGSSFEELRQQAIDL----GAKTAFSAKESASGMENLASA 124
Query: 119 GFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVL 178
GFNA++ M A+P ++ LA +G D+A A++ A +L F L S +A +V
Sbjct: 125 GFNAKEIMEAMPGLLDLAAVSGGDVALASENAATALRGFNLDASQSGHVA-------NVF 177
Query: 179 AKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFF 238
AK + N + M E ++ AP A + G SIE VSA IG+M+ GIK AGT+++
Sbjct: 178 AKAAADTNAEVGDMGEAMKYIAPVANSMGLSIEEVSAAIGIMSDAGIKGSQAGTSLRGAL 237
Query: 239 LRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGA 298
RLA P + + LG+ DS G M+ D +G L A + Q+ + ++G
Sbjct: 238 SRLADPTDAMQAKMDELGLSFYDSEGKMKPLKDQIGMLKDAFKGLTPEQQQNALVTLYGQ 297
Query: 299 EGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
E L+G + +I+ G D L + +L +++GAAD+MAK + D+M SL + ES AI
Sbjct: 298 ESLSGMMALIDKGPDKLGKLTESLKNSDGAADKMAKTMQDNMNSSLEQMMGAFESAAIVV 357
Query: 359 FELSGGPMRDVVDQATAWIR--ANRELIAQNVAGFLNTIVENIDSIV----RGIKLIATV 412
++ +R V D + + + Q + + IV I ++ + + + V
Sbjct: 358 QKILAPAVRKVADSISGLVDKFVSAPEPVQKMIVTIGLIVAAIGPLLVIFGQAVVTLQRV 417
Query: 413 FATLWAFNTIVTTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYLHWEPIKTFFT 472
A + + I G+ T ++L + Y +W+ I F
Sbjct: 418 KVGFLALRSGLALIGGSFTAISLPVLG------IIAAIAAVIAIGILVYKNWDKISKFGK 471
Query: 473 DLWDGIGNAFDTFATNLANKW 493
++W + A + KW
Sbjct: 472 EVWANVKKFASDAAEVIKEKW 492
>ref|NP_075512.1| (NC_002661) orf1374 [Staphylococcus aureus temperate phage phiSLT]
dbj|BAB21743.1| (AB045978) orf1374 [Staphylococcus aureus temperate phage phiSLT]
Length = 1374
Score = 145 bits (366), Expect = 1e-033
Identities = 88/298 (29%), Positives = 153/298 (50%), Gaps = 18/298 (6%)
Query: 94 AKEVGXXXXXXXXXXXXGIDFLAMAGFNAEQSMAALPKIVSLATAAGIDLARATDIATDS 153
A ++G G++ LA GFNA+Q+M A+P ++S A A+G ++A + +
Sbjct: 364 AVDLGAKTSKSANEVAKGMEELAALGFNAKQTMEAMPGVISAAEASGAEMATTATVMASA 423
Query: 154 LGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETV 213
+ +FGL D+ +A D+LA+++ + I M + ++ A A A G SIE
Sbjct: 424 INSFGLKASDANHVA-------DLLARSANDSAADIQYMGDALKYAGTPAKALGVSIEDT 476
Query: 214 SAMIGVMASNGIKAEVAGTAVQNFFLRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVV 273
SA I V++++G++ AGTA++ F+RLA P+ K +++LGI ++D+ G +++
Sbjct: 477 SAAIEVLSNSGLEGSQAGTALRASFIRLANPSKNTAKEMKKLGIHLSDAKGQFVGMGELI 536
Query: 274 GDLNGALATMGERQRLAVIQKVFGAEGLAGNLGVINAGKDALVEYRNTLLSAEGAADRMA 333
+ M Q+LA + + G E +G L +I AG D + Y +L ++ G + + A
Sbjct: 537 RQFQDNMKGMTREQKLATVATIVGTEAASGFLALIEAGPDKINSYSKSLKNSNGESKKAA 596
Query: 334 KRIGDDMLGSLRTLRSTVESVAIRFFELSGGPMRDVVDQATAWIRANRELIAQNVAGF 391
+ D++ G+L L ES+AI +V T IRA E + + V GF
Sbjct: 597 DLMKDNLKGALEQLGGAFESLAI-----------EVGKDLTPMIRAGAEGLTKLVDGF 643
>ref|NP_490625.1| (NC_003278) orf25; similar to T gene of P2 [Bacteriophage phi CTX]
[bacteriophage phi CTX]
dbj|BAA36253.1| (AB008550) orf25; similar to T gene of P2 [bacteriophage phi CTX]
Length = 904
Score = 130 bits (327), Expect = 5e-029
Identities = 107/457 (23%), Positives = 202/457 (43%), Gaps = 54/457 (11%)
Query: 62 GIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMAGFN 121
G EF+ ++ AT + K Q L A+ + G FL G+N
Sbjct: 212 GYEFDATMS-ATQAVTRIERKDDPQMQALRQQARTLPLSSKFTDKEVAQGQYFLGRTGYN 270
Query: 122 AEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 181
A+Q + A+P +++LA A +DL DIA++ A G+ + Q+A DVL
Sbjct: 271 AKQILGAMPGMLNLAAAGDMDLGDTADIASNIQTAMGIPAEKMDQVA-------DVLTAA 323
Query: 182 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFLRL 241
T N I M+ ++++ +A GQS+ETV+ ++ + G++ +AGT++++ RL
Sbjct: 324 FTRNNVDIRMLGDSLKYSAGVGREYGQSLETVTTATALLGNAGVQGSMAGTSMRSVLTRL 383
Query: 242 AAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKVFGAEGL 301
K + +LG+ D+ GNMRD D++ D+N A MG QR A+ + + G +
Sbjct: 384 GTS-----KAVAKLGVQTKDANGNMRDMLDILKDINKKTAGMGNVQRGAIFKDIAGQYAV 438
Query: 302 AGNLGVINAGKDALVE-YRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFE 360
++ A + + R +L ++EG A R+A D++ G + L + +E++++ F+
Sbjct: 439 TSFGTLMRAVEGGQFQTMRESLNNSEGEAARVAATQLDNLKGDMTMLHAALENISVELFD 498
Query: 361 LSGGPMRDVVDQAT-------AWIRANREL---IAQNVAGFLNTIVENIDSIVRGIK-LI 409
+ +R++ + +++AN ++ I VA F + ++ + S+ + ++
Sbjct: 499 KNSPWLRELAADLSHLLHNIGEFLKANPQVSKGIVITVAAF-SALMATVGSLAITLAGIL 557
Query: 410 ATVFATLWAFNTIVTTITGAITLLNLV----------------------------IAANP 441
+ A + TI + G I LL L+ +AANP
Sbjct: 558 GPMIAVRFMLGTIGIRLPGLIGLLKLLFAPIRMLAGLLIGPLVTALRVVSIALWGLAANP 617
Query: 442 XXXXXXXXXXXXXXXXXXXYLHWEPIKTFFTDLWDGI 478
Y +W+ +K + +W+ I
Sbjct: 618 VVLAIAAVVAVLAGAAYLIYRNWDAVKAYLLGMWEEI 654
>ref|NP_046782.1| (NC_001895) gpT [Enterobacteria phage P2]
gb|AAD03293.1| (AF063097) gpT [Enterobacteria phage P2]
Length = 815
Score = 114 bits (285), Expect = 3e-024
Identities = 105/470 (22%), Positives = 179/470 (37%), Gaps = 42/470 (8%)
Query: 62 GIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMAGFN 121
G EF + +G V K + L A+++G +A AG +
Sbjct: 212 GYEFAQKNSELQAVIG--VAKDSAEMAALRKQARQLGDNTAASADDAAGAQIIIAKAGGD 269
Query: 122 AEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKT 181
+ AA P +++A A + + AF L +A + DVL+ T
Sbjct: 270 VDAIQAATPVTLNMALANRRTMEENAALLMGMKSAFQLSND-------KVAHIGDVLSMT 322
Query: 182 STSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFLRL 241
D M + + AAP A AG SIE +AM+G + I +AGT + RL
Sbjct: 323 MNKTAADFDGMSDALTYAAPVAKNAGVSIEETAAMVGALHDAKITGSMAGTGSRAVLSRL 382
Query: 242 AAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALA--TMGERQRLAVIQKVFGAE 299
AP G+A L+ LG+ +DS GN R F ++ ++ + +G Q+ ++ +FG E
Sbjct: 383 QAPTGKAWDALKELGVKTSDSKGNTRPIFTILKEMQASFEKNRLGTAQQAEYMKTIFGEE 442
Query: 300 GLAGNLGVINAGKDALVEYRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRF 358
+ ++ A ++ A +G + + D++ G + +S E+V
Sbjct: 443 ASSAAAVLMTAASTGKLDKLTAAFKASDGKTAELVNIMQDNLGGDFKAFQSAYEAVGTDL 502
Query: 359 FELSGGPMRDVVDQAT-------AWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIA- 410
F+ G +R + AT WI+ N+ L + G + + I+ I L+A
Sbjct: 503 FDQQEGALRKLTQTATKYVLKLDGWIQKNKSL--ASTIGIIAGGALALTGIIGAIGLVAW 560
Query: 411 -------TVFATLWAFNTIVTTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYLH 463
+ A A + TT+ A+ + A +
Sbjct: 561 PVITGINAIIAAAGAMGAVFTTVGSAV-----MTAIGAISWPVVAVVAAIVAGALLIRKY 615
Query: 464 WEPIKTFFTDLWDGIGNAFDTFATNLANKWEALIAPIRSSIQWLLEQADA 513
WEP+ FF + +G+ AF L P++ WL E+ A
Sbjct: 616 WEPVSAFFGGVVEGLKAAFAPVG--------ELFTPLKPVFDWLGEKLQA 657
>ref|NP_052273.1| (NC_001317) G protein [Enterobacteria phage 186]
gb|AAC34170.1| (U32222) G protein [Enterobacteria phage 186]
Length = 812
Score = 111 bits (278), Expect = 2e-023
Identities = 106/456 (23%), Positives = 180/456 (39%), Gaps = 51/456 (11%)
Query: 80 VTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMAGFNAEQSMAALPKIVSLATAA 139
V K + Q L A+ VG +A +G +A A +++A +
Sbjct: 227 VDKQSPEMQALRKQARHVGDNTAASADDAASAQIIIAKSGGDAAAIQARRRVTLNMALSN 286
Query: 140 GIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLAKTSTSANTTIDMMFETIRKA 199
+ + T AF L +A + DVL+ T D + + + A
Sbjct: 287 RRTMEENAALLTGMKSAFQLSND-------KIAHIGDVLSMTMNKTAADFDGLSDALTYA 339
Query: 200 APTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFLRLAAPAGEARKILRRLGIDV 259
AP A AG SIE +AM+G + I +AGT + RL P +A + ++ LG+
Sbjct: 340 APVAKNAGVSIEQTAAMVGALHDAKITGSMAGTGSRAILSRLQLPPEKAFEAIKELGVKT 399
Query: 260 ADSAGNMRDAFDVVGDLNGALA--TMGERQRLAVIQKVFGAEGLAGNLGVINAGKDALVE 317
+DS GN R F ++ ++ + +G QR ++ +FG E + ++ A ++
Sbjct: 400 SDSKGNTRPIFSILKEMQRSFEKNNLGTSQRGEYMKTIFGEEASSAAAVLMEAASSGKLD 459
Query: 318 YRNTLLSA-EGAADRMAKRIGDDMLGSLRTLRSTVESVAIRFFELSGGPMRDVVDQAT-- 374
A +G + + K + D++ G + +S E+V F+ G +R + AT
Sbjct: 460 RLTAAFKASDGKTEELVKVMQDNLGGDFKEFQSAYEAVGTDLFDQQEGSLRKLTQTATQY 519
Query: 375 -----AWIRANREL------IAQNVAGFLNTI----------VENIDSIVRGIKLIATVF 413
WI+ N+ L IA + I V I++I+ ++ TVF
Sbjct: 520 VLKLDGWIQKNKGLATTIGIIAGGALALIGIIGGIGLVAWPVVMGINAIIAAAGVLGTVF 579
Query: 414 ATLWAFNTIVTTITGAITLLNLVIAANPXXXXXXXXXXXXXXXXXXXYLHWEPIKTFFTD 473
+ + T GAIT + + A +WEPI FF+
Sbjct: 580 TVT---GSAIVTALGAITWPIVAVGA------------AIVAGALLIRKYWEPISAFFSG 624
Query: 474 LWDGIGNAFDTFA---TNLANKWEALIAPIRSSIQW 506
+ +GI +AF A LA ++ L +R QW
Sbjct: 625 VIEGIMSAFAPVAEMFAPLAPIFDGLGEKLRGVWQW 660
>ref|NP_471714.1| (NC_003212) similar to hypothetical protein [Lactobacillus casei
bacteriophage A2] [Listeria innocua]
emb|CAC97610.1| (AL596172) similar to hypothetical protein [Lactobacillus casei
bacteriophage A2] [Listeria innocua]
Length = 1601
Score = 110 bits (274), Expect = 6e-023
Identities = 74/340 (21%), Positives = 153/340 (44%), Gaps = 25/340 (7%)
Query: 59 SRAGIEFEHVI---NRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFL 115
++A ++F++ + N + G++ + D Y + D A++ G+ +
Sbjct: 100 TKAYLDFDNEVTEVNSLLRESGESAKEFGDRYTQVFDYAQKASVKYGVASEQTMLGMKEM 159
Query: 116 AMAGFNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVN 175
G++ Q+MA++P I + A A+G D + T +L FG+++KD+ + +V
Sbjct: 160 VKKGYDINQTMASMPAIFNAARASGDDFETVMSVTTSTLEQFGMISKDTNKQMEYTNKVA 219
Query: 176 DVLAKTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQ 235
DVL + M + P + + G S+ +A +G++++ GI+ + AGT ++
Sbjct: 220 DVLTYVADKTAAGFSDMGTAMNYVGPISHSLGYSLTDTAAAVGLLSNRGIEGQKAGTGLR 279
Query: 236 NFFLRLAAPAGEARKILRRLGIDVADSAGNMRDAFDVVGDLNGALATMGERQRLAVIQKV 295
L P+ A + + +G+ + D+ GNM+ ++ D+N M + Q+ + + +
Sbjct: 280 GMLTSLLKPSKSAAEAMSAVGLTIEDNNGNMKTLPTLLDDINDKTKKMTKTQKNSFLTMI 339
Query: 296 FGAEGLAGNLGVINAGKDALVEYRNTLLSAEG----AADRMAK----------------- 334
FG E L+ ++ AG D+L +Y A G AD M K
Sbjct: 340 FGREPLSAVNTLLEAGGDSLRKYSKGADEANGYTKQVADNMRKAGKFGVDQFKASLEVLE 399
Query: 335 -RIGDDMLGSLRTLRSTVESVAIRFFELSGGPMRDVVDQA 373
+G ++ +L + + +F +LSG ++++ A
Sbjct: 400 QNVGQKLMPALTPIIEWANKMIDKFNDLSGEQQQNIIKWA 439
>gb|AAK85310.1| (AY043263) phage-related tail protein [Wolbachia endosymbiont of
Drosophila melanogaster]
Length = 680
Score = 48.9 bits (115), Expect = 2e-004
Identities = 87/474 (18%), Positives = 163/474 (34%), Gaps = 58/474 (12%)
Query: 60 RAGIEFEHVINRATVKMGDNVTKGTDAYQGLVDIAKEVGXXXXXXXXXXXXGIDFLAMAG 119
+ I+FE + A V + KGTD KE+ G
Sbjct: 110 KVAIDFESAM--ADVTKVVDFKKGTDEATKFAKKLKEMSRTIPLSAAELAQIAASGGQLG 167
Query: 120 FNAEQSMAALPKIVSLATAAGIDLARATDIATDSLGAFGLMTKDSTQLALNLARVNDVLA 179
E + ++TA + +A D +G+ + V +V+
Sbjct: 168 IKKEDLFMFTETVAKMSTAFDMSAEQAGDSIAKLSNVYGIDVS-------KMEYVGNVIN 220
Query: 180 KTSTSANTTIDMMFETIRKAAPTATAAGQSIETVSAMIGVMASNGIKAEVAGTAVQNFFL 239
S + M E + TA G I+ S+++ S G + A TA+
Sbjct: 221 HLSDNTAAKAKDMVEALAIVGGTAKQFGLDIKETSSLVNAFVSLGKQPAKAATAINALLS 280
Query: 240 RLAAP---AGEARKILRRLGIDVADSAGNMRD-AFDVVGDLNGALATMGERQRLAVIQKV 295
+L G+ + L ++GI + + + + + AL M ++R ++ K+
Sbjct: 281 KLQTAEEQGGDFKAALEQMGITAEEIVQRISENGEEALLYFFQALKKMDNQERSTILMKL 340
Query: 296 FGAE---GLAGNLGVINAGKDALVEYRNTLLSAEGAADRMAKRIGDDMLGSLRTLRSTVE 352
FG E +A G N +DA+ +T R+ D LR LR+ +
Sbjct: 341 FGQEYQDDIALLAGSFNKYEDAIRLLSDTEEYKSSLQKEFQNRV-DTTASKLRLLRNAIA 399
Query: 353 SVAIRFFELSGGPMRDVVDQATAWIRANRELIAQNVAGFLNTIVENIDSIVRGIKLIA-- 410
V + + ++ + + R+ L A+ I+ + +++ +K++A
Sbjct: 400 EVGMNLGSVMLPTLKSIAEFLQEKTRSIA-LFAEKYPTLTKAIMGTVAALI-SLKVVAVG 457
Query: 411 ----------TVFAT----LWAFNTIVTTITGAI-TLLNLVIAANPXXXXXXXXXXXXXX 455
T+F+ L F+ + T+ A+ T L V ANP
Sbjct: 458 LGYGFTLLGSTIFSLKANLLGVFSFLSATVFPAVVTGLRAVTLANPIGLLITGLVTGAAL 517
Query: 456 XXXXXYLHWEPIKTFFTDLWDGIGNAFDTFATNLANKWEALIAPIRSSIQWLLE 509
+W+ +K FF+ W+++I PI + W+ E
Sbjct: 518 VIT----NWQKVKDFFSSF------------------WKSIIKPIEEAFSWIGE 549
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.319 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 269,999,329
Number of Sequences: 1028106
Number of extensions: 8713787
Number of successful extensions: 24707
Number of sequences better than 5.0e-02: 34
Number of HSP's better than 0.1 without gapping: 23
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 24614
Number of HSP's gapped (non-prelim): 74
length of query: 603
length of database: 324,535,921
effective HSP length: 127
effective length of query: 476
effective length of database: 193,966,459
effective search space: 92328034484
effective search space used: 92328034484
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 94 (40.8 bits)
Query= orf17 [4523-4941]
(417 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_274140.1| (NC_003112) phage virion protein, putative... 40 0.039
>ref|NP_274140.1| (NC_003112) phage virion protein, putative [Neisseria meningitidis
MC58]
pir||B81122 phage virion protein, probable NMB1109 [imported] - Neisseria
meningitidis (group B strain MD58)
gb|AAF41500.1| (AE002460) phage virion protein, putative [Neisseria meningitidis
MC58]
Length = 443
Score = 40.4 bits (93), Expect = 0.039
Identities = 33/129 (25%), Positives = 59/129 (45%), Gaps = 10/129 (7%)
Query: 3 WQTRLREAAYTPPSGNRLTFIYTGVSEEFDQKGGPFDFAGAQGTYVQFLGVTGRRYPMTI 62
W T L++A+Y G + + + + PF QG ++ +G+TGR+ +
Sbjct: 4 WHTLLQDASYKGV-GFDIEVVDESNGKALAEHARPF----VQGIDLEDMGMTGRQVQINA 58
Query: 63 IVSGDDYDLDAAAWMRALAEQGEAVLEHPAYGRL--TVAPVGTVKRSENFVNGAGQATIE 120
+ G Y + AL + G VL HP +GR+ +A + + ++V+ AG I+
Sbjct: 59 VFWGKGYAGRLKKLLDALEQPGGGVLVHPVWGRMHNMIAASWSYRHEADYVDYAG---ID 115
Query: 121 VTLFETTGA 129
+T E A
Sbjct: 116 ITFREAAEA 124
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.317 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 2.92e-152
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 200,742,413
Number of Sequences: 1028106
Number of extensions: 6748262
Number of successful extensions: 13026
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 1
Number of HSP's that attempted gapping in prelim test: 13026
Number of HSP's gapped (non-prelim): 1
length of query: 417
length of database: 324,535,921
effective HSP length: 124
effective length of query: 293
effective length of database: 197,050,777
effective search space: 57735877661
effective search space used: 57735877661
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 93 (40.4 bits)
Query= orf18 [4938-5356]
(417 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_439666.1| (NC_000907) 43 kD tail protein [Haemophilu... 63 6e-009
ref|NP_050648.1| (NC_000929) putative tail protein [Enterob... 62 2e-008
ref|NP_284542.1| (NC_003116) hypothetical protein [Neisseri... 60 4e-008
ref|NP_313011.1| (NC_002695) putative tail protein [Escheri... 60 4e-008
ref|NP_599048.1| (NC_003444) tail protein [Shigella flexner... 51 3e-005
>ref|NP_439666.1| (NC_000907) 43 kD tail protein [Haemophilus influenzae Rd]
Length = 420
Score = 63.2 bits (152), Expect = 6e-009
Identities = 46/188 (24%), Positives = 90/188 (47%), Gaps = 10/188 (5%)
Query: 68 TQDPDEVTLRINGLNFRFWRAVTITQHLDAVS-TVSLHAPFDPNDQQSRDAFRPYSYQPV 126
T+ +++ L +NG W+++ + + L+++S L P D S A + P+
Sbjct: 7 TETNNKIELYLNGKILSGWKSLNLQRSLESMSGRFDLGIAVRPEDDISVLA----AGSPL 62
Query: 127 AVDVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWDEATLQT 186
+ +GG+ + +G L + + N++T+ S L DC +++ Q+ T +
Sbjct: 63 VLKMGGQTVITGYLDEIKQRVSGNDKTISVSGRDKTCDLVDCAIIHNSY--QFKNQTAKQ 120
Query: 187 IAADLCRPFGVQVLAPNGTGQTFERI---AVEPAEKVMAVIARLAAQRNLVVRSDEQGRL 243
IA +C+PFG+ V+ + ERI VEP E ++++A + ++V SD G L
Sbjct: 121 IAEAICKPFGISVVWQVQAPEANERIPVWQVEPGETAFDNLSKIARHKGVLVTSDVDGNL 180
Query: 244 VLLRPDTQ 251
+ P +
Sbjct: 181 LFTEPSNK 188
>ref|NP_050648.1| (NC_000929) putative tail protein [Enterobacteria phage Mu]
sp|P08558|VPP_BPMU 43 kDa tail protein (gpP)
pir||ZPBPMU gene P protein - phage Mu
emb|CAA29956.1| (X06796) 43 kDa tail protein [Enterobacteria phage Mu]
gb|AAF01122.1|AF083977_41 (AF083977) P [Enterobacteria phage Mu]
Length = 379
Score = 61.6 bits (148), Expect = 2e-008
Identities = 72/311 (23%), Positives = 124/311 (39%), Gaps = 32/311 (10%)
Query: 74 VTLRINGLNFRFWRAVTITQHLDAVSTVSLHAPFDPNDQQSRDAFRPYSYQPVAVDVGGE 133
VTLR +G F W +V++T+ +++V+ + N D + +++GG+
Sbjct: 5 VTLRADGRLFTGWTSVSVTRSIESVAG---YFELGVNVPPGTDLSGLAPGKKFTLEIGGQ 61
Query: 134 RLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWDEATLQTIAADLCR 193
+ +G + + + A+ + + L DC S QW TL+ IA DLC
Sbjct: 62 IVCTGYIDSRRRQMTADSMKITVAGRDKTADLIDCAAVYSGG--QWKNRTLEQIARDLCA 119
Query: 194 PFGVQV---LAPNGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLVLLRPDT 250
P+GV V L+ + F ++ +E V + R + R +++ S+ G LV R +
Sbjct: 120 PYGVTVRWELSDKESSAAFPGFTLDHSETVYEALVRASRARGVLMTSNAAGELVFSRAAS 179
Query: 251 Q-------GEPV--AEFIEGQQPPISVTPTFGNQDYYSHVTGI------TPTIVGLEGPQ 295
GE + +F E + S G Y+ G +IV +G
Sbjct: 180 TATDELVLGENLLTLDFEEDFRDRFSEYTVKG----YARANGAEGDDIDAKSIVSRKGTA 235
Query: 296 ATVRNPRLEGVLRPFVYNADDMDEADLVQA-VQSKAGRMFAAAATYDVPVPTWRNANGGL 354
R RP + AD A QA + R A + T++ + W +G L
Sbjct: 236 TDSDVTR----YRPMIIIADSKITAKDAQARALREQRRRLAKSITFEAEIDGWTRKDGQL 291
Query: 355 WRVGDFVILEA 365
W V ++A
Sbjct: 292 WMPNLLVTIDA 302
>ref|NP_284542.1| (NC_003116) hypothetical protein [Neisseria meningitidis Z2491]
pir||C81809 hypothetical protein NMA1830 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB72001.1| (AJ391256) hypothetical protein [Neisseria meningitidis]
emb|CAB85055.1| (AL162757) hypothetical protein [Neisseria meningitidis Z2491]
Length = 379
Score = 60.5 bits (145), Expect = 4e-008
Identities = 79/310 (25%), Positives = 122/310 (38%), Gaps = 28/310 (9%)
Query: 71 PDE-VTLRINGLNFRFWRAVTITQHLDAVS---TVSLHAPFDPNDQQSRDAFRPYSYQPV 126
PD VTL ING W I L + +V L P D DA R + V
Sbjct: 4 PDNTVTLLINGKTHGQWTNYDIVSDLLTPADDFSVMLGRPVDAKP----DAVR--AGDKV 57
Query: 127 AVDVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWDEATLQT 186
V VGG+ + SG + Q T +T+ GVL DC+ P + L
Sbjct: 58 EVRVGGDTVLSGRIDRVQTVTEKGGKTLTIQGRDDAGVLLDCSAPL----FNAQDMDLNQ 113
Query: 187 IAADLCRPFGVQVLAPNG--TGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLV 244
I + +P G+ + + T +T ++ +EP + + A L + G LV
Sbjct: 114 IIEKIVKPLGLAKIRIDAAKTDKT-HKVQIEPGSRAWDALLEYAEANGLWPWLEPDGTLV 172
Query: 245 LLRPDTQGEPVAEFI---EGQQPPISVTPTFGNQDY---YSHVTGITPTIVGLEGPQATV 298
+ PD PVAE + GQ I N+D YS VT + + G +AT
Sbjct: 173 VGGPDYTAAPVAELVLRTNGQNNNIKRLEV--NRDMAARYSEVTVLAQSHSGKNNIKATA 230
Query: 299 RNPRLEGVLRPFVYNADDMD-EADLVQAVQSKAGRMFAAAATYDVPVPTWRNANGGLWRV 357
++ ++ + RP + D+D +A + + + T V R +G LW+
Sbjct: 231 KDESVK-LHRPLIVTEPDIDSQAQAQRKAKKRLADSRLEGLTITATVQGHRTDDGTLWQP 289
Query: 358 GDFV-ILEAP 366
G + +L P
Sbjct: 290 GQRINVLSEP 299
>ref|NP_313011.1| (NC_002695) putative tail protein [Escherichia coli O157:H7]
dbj|BAB38407.1| (AP002567) putative tail protein [Escherichia coli O157:H7]
Length = 374
Score = 60.5 bits (145), Expect = 4e-008
Identities = 70/268 (26%), Positives = 105/268 (39%), Gaps = 21/268 (7%)
Query: 124 QPVAVDVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTP--PASAFPLQWDE 181
Q + + + G+ + +G L + +A + S G L DC+ P S QW
Sbjct: 52 QSLELRIDGQPVITGWLDQVRQRISATRFQITLSGRDKTGDLVDCSAIHPGS----QWRN 107
Query: 182 ATLQTIAADLCRPFGVQV---LAPNGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSD 238
TL+ IA+DLC PFGV V + + F +E +E V + R A R ++V S+
Sbjct: 108 RTLEHIASDLCAPFGVTVRWQVNDATAARPFSTFTLENSETVADALTRAARHRGVLVTSN 167
Query: 239 EQGRLVLLRPDTQ-------GEPVAEFIEGQQPPI--SVTPTFGNQDYYSHVTGITPTIV 289
G LV + +Q GE + + + S G+ H G T
Sbjct: 168 AAGELVFTQAGSQRGDTLTLGENLLDLDHNVDHRLRHSEYRVRGHGRGGGH-AGDALTAG 226
Query: 290 GLEGPQATVRNPRLEGVLRPFVYNADDMDEAD-LVQAVQSKAGRMFAAAATYDVPVPTWR 348
L P TV + + RP + AD +AD Q + R A + V W
Sbjct: 227 TLAAPVGTVTDSAIHR-YRPKIVLADHAVDADGARQRAVREMRRAVARSVRLTATVRHWF 285
Query: 349 NANGGLWRVGDFVILEAPGAQVYRRTLM 376
NG LW + + AP V R L+
Sbjct: 286 RENGQLWDINLLTAVTAPRTGVEERDLL 313
>ref|NP_599048.1| (NC_003444) tail protein [Shigella flexneri bacteriophage V]
gb|AAL89425.1| (U82619) tail protein [Shigella flexneri bacteriophage V]
Length = 359
Score = 50.8 bits (120), Expect = 3e-005
Identities = 81/367 (22%), Positives = 132/367 (35%), Gaps = 59/367 (16%)
Query: 72 DEVTLRINGLNFRFWRAVTITQHLDAVS---TVSLHAPFDPNDQQSRDAFRPYSYQPVAV 128
D VTLR+NG + W +V I ++ ++ +V + + ++ + R + V V
Sbjct: 3 DNVTLRVNGREWNGWTSVRIGAGIERLARDFSVDITRQWPGDEGITTLQPRIKNGSKVEV 62
Query: 129 DVGGERLFSGTLVNPQPTTAANERTVRASCYSTPGVLGDCTPPASAFPLQWDEATLQTIA 188
+G E + +G + A + + S L DC +A P Q++ +L IA
Sbjct: 63 LIGDELVITGWVEATPVRYDARSVSTGIAGRSLTADLIDC----AAEPTQFNGRSLVQIA 118
Query: 189 ADLCRPFGVQVLAPNGTGQTFERIAVEPAEKVMAVIARLAAQRNLVVRSDEQGRLV---- 244
L PFG++V+ + + E V+ VI ++ Q+ + D GRLV
Sbjct: 119 QALAAPFGIEVVNNGAPSGVIPDVQPDHGETVIEVINKILGQQQALAYDDPHGRLVIGGI 178
Query: 245 --------------LLRPDTQGEPVAEF----IEGQQPPISVTPTFGNQDYYSHVTGITP 286
+L DT+ F + GQ+ GN D + T
Sbjct: 179 GSTRAHTALVLGENILSCDTEKSIRERFSVYQVAGQRA--------GNDDDFGEAT---- 226
Query: 287 TIVGLEGPQATVRNPRLEGVL----RP-FVYNADDMDEADLVQAVQSKAGRMFAAAATYD 341
T R E RP ++ A + +A + A
Sbjct: 227 ---------TTALRARTEDAFIARYRPMYIRQTGQATGAGCIARADFEARQRAARTDETT 277
Query: 342 VPVPTWRNANGGLWRVGDFVILEAPGAQVYRRTLMQIKTVRFTATPAERTAVLELIIPGS 401
V WR NG LW+ VI+ P + T + + V FT ++ L I G
Sbjct: 278 YVVQGWRQGNGTLWQPNQRVIVFDPVCG-FDNTELLVSEVTFT---QDQNGTLTEIRVGP 333
Query: 402 LSGQLPE 408
LPE
Sbjct: 334 PDAYLPE 340
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.317 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 278,634,757
Number of Sequences: 1028106
Number of extensions: 11907888
Number of successful extensions: 25791
Number of sequences better than 5.0e-02: 6
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 6
Number of HSP's that attempted gapping in prelim test: 25782
Number of HSP's gapped (non-prelim): 9
length of query: 417
length of database: 324,535,921
effective HSP length: 124
effective length of query: 293
effective length of database: 197,050,777
effective search space: 57735877661
effective search space used: 57735877661
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 93 (40.4 bits)
Query= orf19 [5349-5564]
(214 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.312 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 2.09e-115
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 139,201,419
Number of Sequences: 1028106
Number of extensions: 5630057
Number of successful extensions: 9531
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 9531
Number of HSP's gapped (non-prelim): 0
length of query: 214
length of database: 324,535,921
effective HSP length: 116
effective length of query: 98
effective length of database: 205,275,625
effective search space: 20117011250
effective search space used: 20117011250
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 89 (38.9 bits)
Query= orf20 [5568-5704]
(135 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_518999.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 47 3e-005
>ref|NP_518999.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD14580.1| (AL646061) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 132
Score = 47.4 bits (111), Expect = 3e-005
Identities = 33/125 (26%), Positives = 51/125 (40%), Gaps = 3/125 (2%)
Query: 3 LAFSQNVRNQRLXXXXXXXXXGSGPALIRIYNXXX--XXXXXXXXXLLAQLEMSDPAFDA 60
+ S VRN RL G+ L+R+Y+ LLA++ + P +
Sbjct: 2 ITLSVPVRNSRLVVIGQALDAGTAGGLLRLYSAPRPDVGQALAEQVLLAEVRLPQPCMGS 61
Query: 61 PANGTMTARAITPEGSTPIGGTATWFRITDSEGNFVLDGDVGLDGSNAELELGDVNFLSN 120
G + I + G W R+ DS+G +V D D GL S AE+EL + +
Sbjct: 62 LEGGRLVFAPIG-QALCRRSGIVAWARLCDSDGRWVADLDAGLPDSGAEVELPKLQVFAG 120
Query: 121 QEVRI 125
V +
Sbjct: 121 GAVNV 125
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.314 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 72,014,635
Number of Sequences: 1028106
Number of extensions: 2549127
Number of successful extensions: 3840
Number of sequences better than 5.0e-02: 1
Number of HSP's better than 0.1 without gapping: 1
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3838
Number of HSP's gapped (non-prelim): 1
length of query: 135
length of database: 324,535,921
effective HSP length: 111
effective length of query: 24
effective length of database: 210,416,155
effective search space: 5049987720
effective search space used: 5049987720
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 83 (36.6 bits)
Query= orf21 [5705-5927]
(221 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.316 0.138 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 133,914,063
Number of Sequences: 1028106
Number of extensions: 5487482
Number of successful extensions: 10382
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 10382
Number of HSP's gapped (non-prelim): 0
length of query: 221
length of database: 324,535,921
effective HSP length: 117
effective length of query: 104
effective length of database: 204,247,519
effective search space: 21241741976
effective search space used: 21241741976
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 89 (38.9 bits)
Query= orf22 [5930-6320]
(389 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_313014.1| (NC_002695) hypothetical protein [Escheric... 65 2e-009
ref|NP_415670.1| (NC_000913) orf, hypothetical protein [Esc... 58 2e-007
dbj|BAA35978.1| (D90749) Hypothetical protein HI1520 [Esche... 58 2e-007
ref|NP_599051.1| (NC_003444) tail protein [Shigella flexner... 57 5e-007
ref|NP_284072.1| (NC_003116) hypothetical protein NMA1323 [... 56 8e-007
ref|NP_404849.1| (NC_003143) putative bacteriophage protein... 49 1e-004
ref|NP_439669.1| (NC_000907) conserved hypothetical protein... 48 2e-004
ref|NP_050651.1| (NC_000929) Hypothetical protein [Enteroba... 47 3e-004
ref|NP_470623.1| (NC_003212) similar to a B. subtilis PBSX ... 45 0.001
>ref|NP_313014.1| (NC_002695) hypothetical protein [Escherichia coli O157:H7]
dbj|BAB38410.1| (AP002567) hypothetical protein [Escherichia coli O157:H7]
Length = 361
Score = 64.7 bits (156), Expect = 2e-009
Identities = 69/308 (22%), Positives = 116/308 (37%), Gaps = 26/308 (8%)
Query: 20 ISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQMFIRTASNTPVTINGQTVT 79
IS +L P + +A AL+G+ + + ++ Q+ A + +
Sbjct: 20 ISQRLPGSQPGVNETTLNAIAYALAGLSAQEHEHLAWISRQIIPTEADEA------ELLK 73
Query: 80 PLAEWGRILGAGPQRAAVQTQLDVSVAVVTQGGVLASGEQLQGPNGIIYITTGDTLLNXX 139
A WG I ++ A + V + + T G+ G LQ +G++Y TG
Sbjct: 74 HCAFWGVI-----RKPASRADGPVQLMLTTDAGI-TEGVLLQRSDGVVYRITGSA----- 122
Query: 140 XXXXXXXXXXXXXXXXXXXXXXNLADGAVLNFVQPLAALQPTATVSSTARTG-IDAETTE 198
N G L+F+ P A + TATV+ T TG D ET
Sbjct: 123 --TGKAGTLNVNVEAESAGRAGNTPTGTRLSFITPQAGINQTATVTGTGLTGGADVETVP 180
Query: 199 AYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAY-PYTGDTPGTVAVYIESSTEADGI 257
SR+ R Q P GG D++ WA V A+ G+V V D
Sbjct: 181 ELLSRLVFRVQNPPSGGTQYDFERWAREVPGVTRAWCKPEWPEAGSVGVTFVQDNNPDIF 240
Query: 258 PTNGQLLEARTAI---NQSPNGRRNRAPVGTLVNTFPITRE--TYNVTVSGLNVDNPADV 312
P G + I + G+ P+G ++ F +T + + + + +N A V
Sbjct: 241 PGEGDVKRVADYIRSHDDPATGQPVGQPLGPTISVFKLTNKPVAFEIRIVPKTPENQAAV 300
Query: 313 RRDISNAL 320
++ +++ L
Sbjct: 301 KQALTDLL 308
>ref|NP_415670.1| (NC_000913) orf, hypothetical protein [Escherichia coli K12]
sp|P75981|YMFP_ECOLI Hypothetical protein ymfP in lambdoid prophage e14 region
pir||E64860 hypothetical protein b1152 - Escherichia coli (strain K-12)
gb|AAC74236.1| (AE000214) orf, hypothetical protein [Escherichia coli K12]
Length = 263
Score = 57.8 bits (138), Expect = 2e-007
Identities = 48/164 (29%), Positives = 66/164 (39%), Gaps = 9/164 (5%)
Query: 108 VTQGGVLASGEQLQGPNGIIYITTGDTLLNXXXXXXXXXXXXXXXXXXXXXXXXNLADGA 167
V+ G + +G +Q + + Y TT D N DG
Sbjct: 9 VSDGLKVTAGSVIQRDDLVQYTTTDDA-------TSSGGVLRVPIACSSAGAVGNADDGT 61
Query: 168 VLNFVQPLAALQPTATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETS 227
L V P+ L P++ V+ T G D E E +R+R+ ER P GGA DY +WA+
Sbjct: 62 ALILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEV 120
Query: 228 SAVLNAYPYTG-DTPGTVAVYIESSTEADGIPTNGQLLEARTAI 270
+ A+ Y GTV V I SS + IP AR I
Sbjct: 121 PGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHI 164
>dbj|BAA35978.1| (D90749) Hypothetical protein HI1520 [Escherichia coli]
dbj|BAA35990.1| (D90750) Hypothetical protein HI1520 [Escherichia coli]
Length = 276
Score = 57.8 bits (138), Expect = 2e-007
Identities = 48/164 (29%), Positives = 66/164 (39%), Gaps = 9/164 (5%)
Query: 108 VTQGGVLASGEQLQGPNGIIYITTGDTLLNXXXXXXXXXXXXXXXXXXXXXXXXNLADGA 167
V+ G + +G +Q + + Y TT D N DG
Sbjct: 22 VSDGLKVTAGSVIQRDDLVQYTTTDDA-------TSSGGVLRVPIACSSAGAVGNADDGT 74
Query: 168 VLNFVQPLAALQPTATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETS 227
L V P+ L P++ V+ T G D E E +R+R+ ER P GGA DY +WA+
Sbjct: 75 ALILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYVVWAKEV 133
Query: 228 SAVLNAYPYTG-DTPGTVAVYIESSTEADGIPTNGQLLEARTAI 270
+ A+ Y GTV V I SS + IP AR I
Sbjct: 134 PGITRAWTYRHLMGTGTVGVMIASSDLINPIPEESTETAARQHI 177
>ref|NP_599051.1| (NC_003444) tail protein [Shigella flexneri bacteriophage V]
gb|AAL89428.1| (U82619) tail protein [Shigella flexneri bacteriophage V]
Length = 352
Score = 56.6 bits (135), Expect = 5e-007
Identities = 40/110 (36%), Positives = 52/110 (46%), Gaps = 2/110 (1%)
Query: 162 NLADGAVLNFVQPLAALQPTATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQ 221
N DG L V P+ L P++ V+ T G D E E +R+R+ ER P GGA DY
Sbjct: 145 NADDGTSLILVTPVNGL-PSSGVADTLTGGFDTEELETWRARVIERYYWTPQGGADGDYV 203
Query: 222 LWAETSSAVLNAYPYT-GDTPGTVAVYIESSTEADGIPTNGQLLEARTAI 270
+WA+ + A+ Y GTV V I SS + IP AR I
Sbjct: 204 VWAKEVPGITRAWTYRHWMGTGTVGVMIASSDLINPIPEESTETAARQHI 253
>ref|NP_284072.1| (NC_003116) hypothetical protein NMA1323 [Neisseria meningitidis
Z2491]
pir||C81901 hypothetical protein NMA1323 [imported] - Neisseria meningitidis
(group A strain Z2491)
emb|CAB84574.1| (AL162755) hypothetical protein NMA1323 [Neisseria meningitidis
Z2491]
Length = 351
Score = 55.8 bits (133), Expect = 8e-007
Identities = 51/174 (29%), Positives = 82/174 (46%), Gaps = 14/174 (8%)
Query: 187 TARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPY-TGDTPGTVA 245
T + G D E+ + +R+ E +R P GG DY+ WA + V +AY Y GTV
Sbjct: 162 TVQGGTDRESDASLLARLLEIIRRPPAGGNRYDYKNWALSVDGVTSAYVYPLRRGLGTVD 221
Query: 246 VYIESSTEADGIPTNGQLLEARTAINQ-SPNGRRNRAPVGTLVNTFPITRETYNVTVSGL 304
+ I T ADG+P+ + + I++ P +N + V P+ T V + G+
Sbjct: 222 IAI---TSADGVPSEETVRRVQAYIDEMRPVTAKNALVLKPTVTAVPV---TVQVKLDGI 275
Query: 305 NVDNPADVRRDISNALAEYFLQREPFITGVTTGVRRDQITQIAVGGVVEGIVTA 358
++D + +R I AL EYF + I G V + + V GV++ +TA
Sbjct: 276 DLD---EAKRRIRTALKEYF---DTLIPGDGLTVSQIEAAISNVDGVIDLRLTA 323
>ref|NP_404849.1| (NC_003143) putative bacteriophage protein [Yersinia pestis]
emb|CAC90084.1| (AJ414147) putative bacteriophage protein [Yersinia pestis]
Length = 378
Score = 48.9 bits (115), Expect = 1e-004
Identities = 58/246 (23%), Positives = 90/246 (36%), Gaps = 15/246 (6%)
Query: 7 PTTQSISDNIVAQISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQMFIRTA 66
PT + + I S+L G + + V++ A +G+ Y Y ++ Q
Sbjct: 6 PTLSELRQRNQSYIQSELKTGGNLLRFSNIGVISDADAGMAHLHYGYLDYIARQA----- 60
Query: 67 SNTPVTINGQTVTPLAEWGRILGAGPQRAAVQTQLDVSVAVVTQGGVLASGEQLQGPNGI 126
TP + LA WG + + A T +V + + G V+ +G L +G
Sbjct: 61 --TPYNATDEY---LAAWGALKDVFRKAANPATSNEVRFSGIA-GRVIPAGRLLNRADGY 114
Query: 127 IYITTGDTLLNXXXXXXXXXXXXXXXXXXXXXXXXNLAD---GAVLNFVQPLAALQPTAT 183
Y + ++ N + G VL + +Q TAT
Sbjct: 115 QYQLNKEVIIAEQGSALGEITAILPSPLDDATGGGNRGNSPAGTVLTLDIAIDGVQATAT 174
Query: 184 VSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPYTG-DTPG 242
+ G D E+ +A+RSR+ Q P GG DYQ WA V + G
Sbjct: 175 ALTKISGGADIESEDAFRSRMLLAYQNVPQGGNDTDYQSWALAVPGVTRCWVKRRLMGAG 234
Query: 243 TVAVYI 248
TV VYI
Sbjct: 235 TVGVYI 240
>ref|NP_439669.1| (NC_000907) conserved hypothetical protein [Haemophilus influenzae
Rd]
sp|P44240|VG47_HAEIN Mu-like prophage FluMu protein gp47
pir||G64034 hypothetical protein HI1520 - Haemophilus influenzae (strain Rd
KW20)
gb|AAC23162.1| (U32827) conserved hypothetical protein [Haemophilus influenzae Rd]
Length = 355
Score = 48.1 bits (113), Expect = 2e-004
Identities = 58/241 (24%), Positives = 87/241 (36%), Gaps = 23/241 (9%)
Query: 94 RAAVQTQLDVSVAVVT----QGGVLASGEQLQGPN-GIIYITTGDTLLNXXXXXXXXXXX 148
+ V+ Q + +VT + +G + N G+ +ITT +T++
Sbjct: 74 KGIVRKQASTATGLVTVTAANDTTIPAGTVFEDTNTGLTFITTQETVVKAGTADIAVKCE 133
Query: 149 XXXXXXXXXXXXXNLADGAVLNFVQPLAALQPTATVSSTARTGIDAETTEAYRSRIDERT 208
NL G L+ + L PTATV + G D E+ +R+ R
Sbjct: 134 TTGVEG-------NLKAGTSLSLTSAILGLLPTATVKVMSG-GADIESLSRLLARLIYRV 185
Query: 209 QRRPLGGAPVDYQLWAETSSAVLNAY---PYTGDTPGTVAVYIESSTEADGIPTNGQLLE 265
Q P GGA DY WA V A+ Y G GTV V D +PT +
Sbjct: 186 QYPPAGGASHDYIRWATEVPGVTRAWCFERYYGG--GTVGVAFACDEREDILPTPEDIAR 243
Query: 266 ARTAINQSPN---GRRNRAPVGTLVNTFPITRETYNVTVSGLNVDNPADVRRDISNALAE 322
R I N G+ P + F + N + N +R+ + +LA
Sbjct: 244 VRAYIEGHKNEVTGQFEGMPANVELYVFAPQFQAVNFKIR--LAPNTPTLRQAVRKSLAA 301
Query: 323 Y 323
Y
Sbjct: 302 Y 302
>ref|NP_050651.1| (NC_000929) Hypothetical protein [Enterobacteria phage Mu]
sp|Q9T1V2|VG47_BPMU Protein gp47
gb|AAF01125.1|AF083977_44 (AF083977) gp47 [Enterobacteria phage Mu]
Length = 360
Score = 47.4 bits (111), Expect = 3e-004
Identities = 55/247 (22%), Positives = 91/247 (36%), Gaps = 21/247 (8%)
Query: 2 VAPTTPTTQSISDNIVAQISSQLGQGAPIFAKAFTRVLARALSGVVVTLYRYGGFMFLQM 61
+A + PT S+ I +L P + +A A +G+ + + ++ Q+
Sbjct: 1 MAYSPPTLSSLIARTEQNIEQRLPGSWPQAREKTLSAIAYAQAGLAAGCHEHISWVGRQI 60
Query: 62 FIRTASNTPVTINGQTVTPLAEWGRILGAGPQRA-AVQTQLDVSVAVVTQGGVLASGEQL 120
TA L E R G ++A A L V+ + T + +G +
Sbjct: 61 IPSTADEDE----------LLEHCRFWGVRRKQATAASGPLTVTTSAAT---TIPAGTRW 107
Query: 121 QGPNGIIYITTGDTLLNXXXXXXXXXXXXXXXXXXXXXXXXNLADGAVLNFVQPLAALQP 180
Q +G++Y + DT++ N + +L + P+A +
Sbjct: 108 QRADGVVY-SLADTIV-----IDRAGTTEITVTALAAGEAGNTGENTLLTLITPVACVVS 161
Query: 181 TATVSSTARTGIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPY-TGD 239
A G D E+ SR++ R Q P GG DY WA S V A+ + T
Sbjct: 162 DAITVKGFSGGADIESAAELLSRLEYRVQYPPFGGNQFDYVRWAREVSGVTRAWCFPTWK 221
Query: 240 TPGTVAV 246
GTV V
Sbjct: 222 GGGTVGV 228
>ref|NP_470623.1| (NC_003212) similar to a B. subtilis PBSX prophage protein
[Listeria innocua]
emb|CAC96518.1| (AL596168) similar to a B. subtilis PBSX prophage protein [Listeria
innocua]
Length = 361
Score = 45.4 bits (106), Expect = 0.001
Identities = 47/194 (24%), Positives = 82/194 (42%), Gaps = 15/194 (7%)
Query: 191 GIDAETTEAYRSRIDERTQRRPLGGAPVDYQLWAETSSAVLNAYPY-TGDTPGTVAVYIE 249
G D E E +R R + Q+ G Y+LWAE V A Y G GTV + I
Sbjct: 166 GSDEEELETFRKRFFTKIQKDAFSGNKNHYKLWAEEVLGVGKAKIYPLGYGEGTVKIVI- 224
Query: 250 SSTEADGIPTNGQLLE-ARTAINQSPNGRRNRAPVGTLVNTFPITRETYNVTVSGLNVDN 308
T+ + P + L+E R I+ P AP+G V +T N++ + D
Sbjct: 225 --TDTNLHPASPTLVEQVRNYIDPLPQKGEGVAPIGAKVFIESAAWKTINISGQLVLKDG 282
Query: 309 PADVRRDISNALAEYFLQREPFITGVTTGVRRDQITQIAVGGVVEGIVTAAGGTLT--GV 366
+ +++ + E + +T D++T + + V+ I+ +A L +
Sbjct: 283 -----KSVNDVIEEITMDINKLFKSIT--FPEDELTSVKL-AVISNIIFSAPSVLDNFNI 334
Query: 367 SVTNEAGDAVVSLN 380
+ NEA + +++ N
Sbjct: 335 KLNNEAANIILAEN 348
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.315 0.132 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 219,559,145
Number of Sequences: 1028106
Number of extensions: 8329913
Number of successful extensions: 23385
Number of sequences better than 5.0e-02: 10
Number of HSP's better than 0.1 without gapping: 5
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 23379
Number of HSP's gapped (non-prelim): 10
length of query: 389
length of database: 324,535,921
effective HSP length: 123
effective length of query: 266
effective length of database: 198,078,883
effective search space: 52688982878
effective search space used: 52688982878
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 92 (40.0 bits)
Query= orf23 [6324-6565]
(240 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.319 0.139 0.000
Gapped
Lambda K H
0.267 0.0410 1.51e-154
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 174,536,852
Number of Sequences: 1028106
Number of extensions: 8124398
Number of successful extensions: 23542
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 23542
Number of HSP's gapped (non-prelim): 0
length of query: 240
length of database: 324,535,921
effective HSP length: 118
effective length of query: 122
effective length of database: 203,219,413
effective search space: 24792768386
effective search space used: 24792768386
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 89 (38.9 bits)
Query= orf24 [6569-7014]
(444 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 63 8e-009
ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PR... 59 1e-007
gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger] 53 6e-006
ref|NP_630626.1| (NC_003888) putative secreted cellulase [S... 46 0.001
ref|NP_637119.1| (NC_003902) cellulase [Xanthomonas campest... 45 0.002
ref|NP_518238.1| (NC_003295) CONSERVED HYPOTHETICAL PROTEIN... 42 0.011
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 225
Score = 62.8 bits (151), Expect = 8e-009
Identities = 28/37 (75%), Positives = 30/37 (80%)
Query: 104 WLLANGQAVSRTTYSELFAVLGTRFGQGDGSTTFNLP 140
WL NG AVSRTTY+ LF V+GT FG GDGSTTFNLP
Sbjct: 80 WLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLP 116
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 554
Score = 58.9 bits (141), Expect = 1e-007
Identities = 26/37 (70%), Positives = 30/37 (80%)
Query: 104 WLLANGQAVSRTTYSELFAVLGTRFGQGDGSTTFNLP 140
WL ANG AVSRTTY+ L+A +GT FG GDG+ TFNLP
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLP 451
>gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger]
Length = 857
Score = 53.1 bits (126), Expect = 6e-006
Identities = 53/197 (26%), Positives = 82/197 (40%), Gaps = 19/197 (9%)
Query: 106 LANGQAVSRTTYSELFAVLGTRFGQ----GDGSTTFNLPADGASKEPGQEFVNLGTGTGL 161
LA + + Y++ A T GQ D T + +DGA + N T + +
Sbjct: 513 LALSSSFGESWYADYAASSSTATGQVALSADADTILLMNSDGAYRS-----ANSATLSAV 567
Query: 162 PTYNSEARFARAPNGNMYFVATGPTRLYRSTDNGLTWT-TTNIGSGLPANFDNPAIMVGQ 220
+ S A A N YF + Y S+D T+T TT +GS AN +
Sbjct: 568 SSLPSGAVIASDKANNTYFYGASGSSFYLSSDTAATFTVTTTLGSSTTANAIRAQPSLAG 627
Query: 221 NNYVYFIDRTNADLRRSTDDGVSFTNVGVGGGLPSSIGAPFLASAPTTTGSNPEILYFID 280
+ +V T+ L ST+ G SFT +G G + G F P++ G P + F
Sbjct: 628 DVWV----STDTGLFHSTNYGKSFTQIGSG----CTEGWSFGFGKPSSDGDYPVLFGFFT 679
Query: 281 SANDT-LYVSTNEGINW 296
T L+ + ++G+NW
Sbjct: 680 VDGVTGLFKTEDQGVNW 696
Score = 45.8 bits (107), Expect = 0.001
Identities = 65/252 (25%), Positives = 100/252 (38%), Gaps = 48/252 (19%)
Query: 188 LYRSTDNGLTWTTTN----IGSGLPANF--DNPAIMVGQNNYVYFIDRTNADLRRSTDDG 241
+ RSTD G TW T +G +P + A+ N+ +YF R+ L +STD G
Sbjct: 118 ILRSTDQGDTWEETKLPFKVGGNMPGRGVGERLAVDPNDNSILYFGARSGNGLWKSTDYG 177
Query: 242 VSFTNVGV--GGGLPSSIGAPFLASAP---------TTTGSN----PEILYFIDSANDTL 286
+++NV G + S P +T+GS+ P I + +++
Sbjct: 178 ETWSNVTAFKWTGTYFQDSSSTYTSDPVGIAWVTFDSTSGSSGSPTPRIFVGVVDTGESV 237
Query: 287 YVSTNEGINWTTFGVGD---GLPP-----------------NGLGEMIGAPDGYFYGIDI 326
+VS + G WT G+ G P NG G G +G + +I
Sbjct: 238 FVSEDAGETWTWVS-GEPMYGFLPHKGILSPSEHTLYISYSNGAGPYDGT-NGTVHKYNI 295
Query: 327 VSDVLYRQVPPSTTWSSVGIGNGLPSTISSPSMSVDPDGTLYFIDGADRDRLYRSTDRGV 386
S V P S T + G G GL + P + ++ D + ++RSTD G
Sbjct: 296 TSGVWTDISPTSMTDTYYGYG-GLAVDLQVPGTVMVAALNCWWPD----ELIWRSTDSGG 350
Query: 387 TWSSQFLGQGLP 398
TWS + G P
Sbjct: 351 TWSPIWAWNGYP 362
>ref|NP_630626.1| (NC_003888) putative secreted cellulase [Streptomyces coelicolor
A3(2)]
pir||T35237 probable secreted cellulase - Streptomyces coelicolor
emb|CAA20642.1| (AL031515) putative secreted cellulase [Streptomyces coelicolor
A3(2)]
Length = 890
Score = 45.8 bits (107), Expect = 0.001
Identities = 64/266 (24%), Positives = 93/266 (34%), Gaps = 56/266 (21%)
Query: 158 GTGLPTYNSEARFARAPNGN--MYFVATGPTRLYRSTDNGLTWTTTNIGSGLPANFDNPA 215
G +P R A P+ N +Y A L+RSTD G+TW+ F NP
Sbjct: 154 GGNMPGRGMGERLAVDPHDNDVLYLGAPSGHGLWRSTDAGVTWSEVTA-------FPNPG 206
Query: 216 IMVGQNNYVYFIDRTNADLRRSTDDGVSFTNV--GVGGGLPSSIGAPFLASAPTTTGSNP 273
NY D + S + G+++ GGG G+
Sbjct: 207 ------NYAQ--DPNDTSGYASDNQGITWVTFDESTGGG----------------AGTAT 242
Query: 274 EILYF-IDSANDTLYVSTNEGINWTTFGVGDGLPPNGLGE--MIGAPDGYFY-------- 322
LY + + +Y ST+ G W G P L ++ A +GY Y
Sbjct: 243 RTLYVGVADKENAVYRSTDAGATWERLA---GQPTGYLAHKGVLDAENGYLYLAYSDTGG 299
Query: 323 GIDIVSDVLYRQVPPSTTWSSVGIGNGLPSTISSPSMSVDPD------GTLYFIDGADRD 376
D LYR + TW+ + + ++VD T Y D
Sbjct: 300 PYDGGKGRLYRYATATGTWTDISPAAEADTYYGFSGLTVDRQRPGTVMATAYSSWWPDT- 358
Query: 377 RLYRSTDRGVTWSSQFLGQGLPDGGN 402
+++RSTD G TWS + PD N
Sbjct: 359 QIFRSTDSGATWSQAWSYTSYPDREN 384
>ref|NP_637119.1| (NC_003902) cellulase [Xanthomonas campestris pv. campestris str.
ATCC 33913]
gb|AAM41043.1| (AE012276) cellulase [Xanthomonas campestris pv. campestris str.
ATCC 33913]
Length = 751
Score = 44.7 bits (104), Expect = 0.002
Identities = 51/201 (25%), Positives = 81/201 (39%), Gaps = 41/201 (20%)
Query: 224 VYFIDRTNADLRRSTDDGV------SFTNVGVGG-------GLPSSIGAPFL---ASAPT 267
V + +A L RS D G SF + + G G ++G F+ A++
Sbjct: 179 VLLLGSRDAGLWRSDDRGAHWAKVASFPDAALAGATARNHVGREQAVGIAFVVFDAASGN 238
Query: 268 TTGSNPEILYFIDSANDTLYVSTNEGINWTTF-GVGDGLPPNGLGEMIGAPDGYFY---- 322
T P I + + +LYVS + G +W G GL P+ M G DG++Y
Sbjct: 239 TGTPTPRIYVGVSTEQTSLYVSEDAGRSWAPVAGQPRGLRPS---HMAGGSDGHWYLSYG 295
Query: 323 ---GIDIVSD-VLYRQVPPSTTWSSVGIGNGLPSTISSP-----SMSVDPDGTLYFIDGA 373
G D+++ L++ P W + + +P S +++VDP +
Sbjct: 296 DQPGPDLMAGGALWKFTPAQGRWREI---SPIPQPASGDGFGWGAVAVDPQQPQVLLAST 352
Query: 374 -----DRDRLYRSTDRGVTWS 389
RD LYRS D G W+
Sbjct: 353 FRRRTPRDELYRSVDGGKHWA 373
>ref|NP_518238.1| (NC_003295) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD13645.1| (AL646057) CONSERVED HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 177
Score = 42.4 bits (98), Expect = 0.011
Identities = 28/62 (45%), Positives = 35/62 (56%), Gaps = 7/62 (11%)
Query: 101 APQ-WLLANGQAVSRTTYSELFAVLGTRFGQGDGSTTFNLPADGASKEPGQEFVNLGTGT 159
APQ WLL GQ + ++ L+A+LGT +G GDG TTF LP + GQ G G
Sbjct: 16 APQGWLLCQGQLIDISSNEILYALLGTTYG-GDGRTTFALPNLQGALPVGQ-----GQGP 69
Query: 160 GL 161
GL
Sbjct: 70 GL 71
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.313 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 337,325,147
Number of Sequences: 1028106
Number of extensions: 17038337
Number of successful extensions: 27753
Number of sequences better than 5.0e-02: 11
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 8
Number of HSP's that attempted gapping in prelim test: 27714
Number of HSP's gapped (non-prelim): 31
length of query: 444
length of database: 324,535,921
effective HSP length: 124
effective length of query: 320
effective length of database: 197,050,777
effective search space: 63056248640
effective search space used: 63056248640
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 93 (40.4 bits)
Query= orf25 [7015-7465]
(449 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstoni... 67 4e-010
ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PR... 59 1e-007
ref|NP_298995.1| (NC_002488) phage-related tail fiber prote... 57 6e-007
ref|NP_251770.1| (NC_002516) hypothetical protein [Pseudomo... 47 5e-004
ref|NP_625406.1| (NC_003888) conserved hypothetical protein... 45 0.002
ref|NP_640423.1| (NC_003919) microcystin dependent protein ... 44 0.003
ref|NP_640422.1| (NC_003919) microcystin dependent protein ... 44 0.004
gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger] 44 0.004
ref|NP_440411.1| (NC_000911) hypothetical protein [Synechoc... 44 0.005
ref|NP_105312.1| (NC_002678) microcystin dependent protein ... 43 0.007
ref|NP_347555.1| (NC_003030) Probably secreted sialidase; s... 43 0.009
gb|AAK06388.1| (AF078042) glycosyl hydrolase 5 [Caldicellul... 43 0.009
emb|CAC02964.1| (AJ292929) CEL6 protein [Agaricus bisporus] 41 0.033
ref|NP_642098.1| (NC_003919) cellulase [Xanthomonas axonopo... 41 0.033
>ref|NP_519813.1| (NC_003295) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
emb|CAD15394.1| (AL646065) HYPOTHETICAL PROTEIN [Ralstonia solanacearum]
Length = 225
Score = 67.0 bits (162), Expect = 4e-010
Identities = 41/100 (41%), Positives = 52/100 (52%), Gaps = 9/100 (9%)
Query: 100 PEPQWLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQEPT-------PTPG 152
P WL+ +G AVSRT YA LF V+GT +GAGDGSTTFN+P+ + P G
Sbjct: 76 PPAGWLKGNGAAVSRTTYARLFGVIGTTFGAGDGSTTFNLPELRAEFPRGWDDGRGVDAG 135
Query: 153 ETFVEKQSDGYPRDVRSTAIAGLPDGTILGAFGRFSGTPI 192
F Q+ TAI DG+ L +G + TPI
Sbjct: 136 RVFGSVQAQALSSHQHKTAIGF--DGSNLFGWGDGNATPI 173
>ref|NP_520042.1| (NC_003295) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD15623.1| (AL646067) PUTATIVE TAIL FIBER-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 554
Score = 58.9 bits (141), Expect = 1e-007
Identities = 47/141 (33%), Positives = 66/141 (46%), Gaps = 31/141 (21%)
Query: 104 WLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQEPTPTPGETFVEKQSDGY 163
WL+ +G AVSRT YA L+A +GT +GAGDG+ TFN+P G+ F+ DG
Sbjct: 415 WLKANGAAVSRTTYAALYAEIGTTFGAGDGAATFNLPDLRGE---------FLRGWDDG- 464
Query: 164 PRDVRSTAIAGLPDGTILGAFGRFSGTPIRKSGDGG-DTWLSTGLGNGL----SSYNIPV 218
G+ G G + SG+P+ GG ++ T LG+G S+ P
Sbjct: 465 ---------RGVDSGRGFGTWQ--SGSPVVHDDVGGIASFNITALGDGTNVAWSNIADPW 513
Query: 219 MAA-----NDSGTILLIDINN 234
+ A DS +D NN
Sbjct: 514 VGAFPLTMYDSSAATFVDANN 534
>ref|NP_298995.1| (NC_002488) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
pir||C82649 phage-related tail fiber protein XF1706 [imported] - Xylella
fastidiosa (strain 9a5c)
gb|AAF84515.1|AE003994_14 (AE003994) phage-related tail fiber protein [Xylella fastidiosa
9a5c]
Length = 387
Score = 56.6 bits (135), Expect = 6e-007
Identities = 34/75 (45%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 66 NNTPDSVGNPQYLDALTAITAPRVGQTLRTYGVLPEPQWLRTDGRAVSRTEYADLFAVVG 125
NNT + N + TA++ GQ + T G P L DGRAVSR Y LF +
Sbjct: 117 NNTDTPLQNEP--NRWTALSRYEPGQIVYTAGKRALPGTLLCDGRAVSRAMYPRLFEEIN 174
Query: 126 TQYGAGDGSTTFNIP 140
T YGAGDG +TFNIP
Sbjct: 175 TSYGAGDGVSTFNIP 189
>ref|NP_251770.1| (NC_002516) hypothetical protein [Pseudomonas aeruginosa]
pir||D83260 hypothetical protein PA3080 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG06468.1|AE004732_2 (AE004732) hypothetical protein [Pseudomonas aeruginosa]
Length = 365
Score = 47.0 bits (110), Expect = 5e-004
Identities = 52/240 (21%), Positives = 92/240 (37%), Gaps = 22/240 (9%)
Query: 195 SGDGGDTWLSTGLGNGLSSYNIPVMAANDSGTILLIDINNDSRPLFRSQDGGENWSXXXX 254
S D G TW G + + + A++ + + L ++DGG++W
Sbjct: 138 SDDAGSTWARQFEDLGREAPLLDIWFADEQHGLAV----GAYGALLETRDGGQHWEDVSE 193
Query: 255 XXXXXXXXX--XXAAAPNGDFYYINRDETIFRSTDNGENWAQVSGQLPSHSFAGITVDST 312
AA + + ++FRS D G W ++ G F I
Sbjct: 194 RLDNEDQFHLNAIAAVKDNGLLVVGEAGSLFRSKDWGATWEKLEGPYEGSLFGAIGTADA 253
Query: 313 GAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEGDGSGLP-EVSNRSSLGAGRNGDLYI 371
G V V+ + ++RS + G +W + + L +S+ + L GR +
Sbjct: 254 GGVLVYGLRG------HLFRSADFGDSWEEIPLKAASGDLEFGLSDGALLADGR----IV 303
Query: 372 IAPDRNQGYRSVDNGASWSPVSGLIDDFSFAYVTSTPNGNIYVVAGTFSTGGVYASEGSG 431
+ S D G S+S V D S A V++T +G++ +V GGV+ + +G
Sbjct: 304 VVGHGGSVLESTDGGRSFS-VFNRPDRLSLAGVSATGDGHLILV----GQGGVHLAAANG 358
>ref|NP_625406.1| (NC_003888) conserved hypothetical protein [Streptomyces coelicolor
A3(2)]
emb|CAC13065.1| (AL445503) conserved hypothetical protein [Streptomyces coelicolor
A3(2)]
Length = 384
Score = 45.1 bits (105), Expect = 0.002
Identities = 55/223 (24%), Positives = 88/223 (38%), Gaps = 39/223 (17%)
Query: 202 WLSTGLGNGLSSYNIPVMAANDSGTILLIDINNDSRPLFRSQDGGENWSXXXXXXXXXXX 261
W+ G G GL + + A D G++ + + +FR+ DGG +W+
Sbjct: 165 WVPGGGGEGLHTV---LTDARDPGSVT---VAVSTAGVFRTADGGASWAPSNSGVSAVFL 218
Query: 262 -------------XXXXAAAPNGDFYYINRDETIFRSTDNGENWAQVSGQLPS-HSFAGI 307
AA P D Y+ ++RS D G +W + LPS FA
Sbjct: 219 PDPDPEFGQCVHKVARDAATP--DRLYLQNHWGVYRSDDAGAHWTDIGEGLPSTFGFAVA 276
Query: 308 TVDSTG-AVYVFAV---FDVVPS--VTRVYRSVNNGANWTLVAEEGDGSGLPEVSN---- 357
G YVF + D VP+ RV+R+ + G +W E +GLP +
Sbjct: 277 AHPHRGDTAYVFPINADADRVPADHRCRVFRTADAGKSW-----EPLSAGLPREDHYGTV 331
Query: 358 -RSSLGAGRNGDLYIIAPDRN-QGYRSVDNGASWSPVSGLIDD 398
R +L + +RN + + S D+G SW ++ + D
Sbjct: 332 LRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQLASHLPD 374
Score = 42.4 bits (98), Expect = 0.011
Identities = 32/126 (25%), Positives = 53/126 (41%), Gaps = 19/126 (15%)
Query: 192 IRKSGDGGDTWLSTGLGNGL-SSYNIPVMA---ANDSGTILLIDINNDSRP------LFR 241
+ +S D G W T +G GL S++ V A D+ + I+ + D P +FR
Sbjct: 250 VYRSDDAGAHW--TDIGEGLPSTFGFAVAAHPHRGDTAYVFPINADADRVPADHRCRVFR 307
Query: 242 SQDGGENWSXXXXXX-------XXXXXXXXXAAAPNGDFYYINRDETIFRSTDNGENWAQ 294
+ D G++W A Y+ NR+ +F S D+G++W Q
Sbjct: 308 TADAGKSWEPLSAGLPREDHYGTVLRDALSTDDAEQAGVYFGNRNGEVFASADDGDSWRQ 367
Query: 295 VSGQLP 300
++ LP
Sbjct: 368 LASHLP 373
>ref|NP_640423.1| (NC_003919) microcystin dependent protein [Xanthomonas axonopodis
pv. citri str. 306]
gb|AAM34959.1| (AE011629) microcystin dependent protein [Xanthomonas axonopodis
pv. citri str. 306]
Length = 174
Score = 44.3 bits (103), Expect = 0.003
Identities = 21/44 (47%), Positives = 28/44 (62%), Gaps = 1/44 (2%)
Query: 104 WLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQEP 147
WL DGR + T L +++GTQ+G GDG TTFN+P G+ P
Sbjct: 16 WLPCDGRTLPITSNVALASLLGTQFG-GDGKTTFNLPDLRGRTP 58
>ref|NP_640422.1| (NC_003919) microcystin dependent protein [Xanthomonas axonopodis
pv. citri str. 306]
gb|AAM34958.1| (AE011629) microcystin dependent protein [Xanthomonas axonopodis
pv. citri str. 306]
Length = 183
Score = 43.9 bits (102), Expect = 0.004
Identities = 22/46 (47%), Positives = 29/46 (62%), Gaps = 2/46 (4%)
Query: 101 EPQ-WLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQ 145
EPQ W DGR + Y L++++GT YG GDG TTFN+P G+
Sbjct: 16 EPQGWAFCDGRQLQINTYMALYSLIGTTYG-GDGRTTFNLPDLRGR 60
>gb|AAK77227.1| (AY040839) endoglucanase C [Aspergillus niger]
Length = 857
Score = 43.9 bits (102), Expect = 0.004
Identities = 39/172 (22%), Positives = 67/172 (38%), Gaps = 10/172 (5%)
Query: 204 STGLGNGLSSYNIPVMAANDSGTILLIDINNDS-------RPLFRSQDGGENWSXXXXXX 256
S G +S + +++ SG ++ D N++ + S D ++
Sbjct: 552 SDGAYRSANSATLSAVSSLPSGAVIASDKANNTYFYGASGSSFYLSSDTAATFTVTTTLG 611
Query: 257 XXXXXXXXXAAAPNGDFYYINRDETIFRSTDNGENWAQV-SGQLPSHSFAGITVDSTGAV 315
A +++ D +F ST+ G+++ Q+ SG SF S G
Sbjct: 612 SSTTANAIRAQPSLAGDVWVSTDTGLFHSTNYGKSFTQIGSGCTEGWSFGFGKPSSDGDY 671
Query: 316 YVFAVFDVVPSVTRVYRSVNNGANWTLV--AEEGDGSGLPEVSNRSSLGAGR 365
V F V VT ++++ + G NW ++ AE G GS V N GR
Sbjct: 672 PVLFGFFTVDGVTGLFKTEDQGVNWQIISDAEHGFGSASANVVNGDLQNYGR 723
>ref|NP_440411.1| (NC_000911) hypothetical protein [Synechocystis sp. PCC 6803]
sp|P73069|Y48L_SYNY3 Ycf48-like protein
pir||S75177 hypothetical protein slr2034 - Synechocystis sp. (strain PCC 6803)
dbj|BAA17091.1| (D90903) ORF_ID:slr2034~hypothetical protein [Synechocystis sp. PCC
6803]
Length = 342
Score = 43.5 bits (101), Expect = 0.005
Identities = 31/156 (19%), Positives = 60/156 (37%), Gaps = 18/156 (11%)
Query: 239 LFRSQDGGENWSXXXXXXXXXXXXXXXAAAPNGDFYYINRDETIFRSTDNGENWAQV--S 296
+F + DGG+ W + + + + + +TD G+ WA++ S
Sbjct: 72 IFETTDGGDTWEQKLIDLGEEKASFSAVSFSGNEGWITGKPSILLHTTDGGQTWARIPLS 131
Query: 297 GQLPSHSFAGITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEGDGSGLPEVS 356
+LP ++ I + A +++ + +Y++ N G NW + E G+
Sbjct: 132 EKLPGAPYSIIALGPQTA-------EMITDLGAIYKTTNGGKNWKALVE--GAVGVARTI 182
Query: 357 NRSSLGAGRNGDLYIIAPDRNQGYRSVDNG-ASWSP 391
RS+ G Y+ R Y + G W+P
Sbjct: 183 QRSTDGR------YVAVSARGNFYSTWAPGQTEWTP 212
>ref|NP_105312.1| (NC_002678) microcystin dependent protein MdpB [Mesorhizobium loti]
dbj|BAB51098.1| (AP003004) microcystin dependent protein; MdpB [Mesorhizobium loti]
Length = 169
Score = 43.1 bits (100), Expect = 0.007
Identities = 21/61 (34%), Positives = 33/61 (53%), Gaps = 1/61 (1%)
Query: 87 PRVGQTLRTYGVLPEPQWLRTDGRAVSRTEYADLFAVVGTQYGAGDGSTTFNIPQEAGQE 146
P +G+ G W+ +G+ + +EY LF ++GT YG GDG +TF +P G+
Sbjct: 4 PYIGEIRMFAGNFAPAGWMFCEGQLLPISEYETLFQLIGTTYG-GDGQSTFALPDLRGRV 62
Query: 147 P 147
P
Sbjct: 63 P 63
>ref|NP_347555.1| (NC_003030) Probably secreted sialidase; several ASP-boxes and
dockerin domain [Clostridium acetobutylicum]
gb|AAK78895.1|AE007608_1 (AE007608) Probably secreted sialidase; several ASP-boxes and
dockerin domain [Clostridium acetobutylicum]
Length = 839
Score = 42.7 bits (99), Expect = 0.009
Identities = 51/198 (25%), Positives = 81/198 (40%), Gaps = 23/198 (11%)
Query: 242 SQDGGENWSXXXXXXXXXXXXXXXAAAPNGDFYYINRDE--TIFRSTDNGENWAQVSGQL 299
S DGG+NW AA + + +E STDNG W SG L
Sbjct: 536 SYDGGKNWFSAGSNISGVYKAGTVAAGADAKTIVWSPEEGANAAYSTDNGNKWTPCSG-L 594
Query: 300 PSHSFAGITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEG---DGSGLPEVS 356
P A + D + + + Y S + GA +T ++ G G G+ +
Sbjct: 595 PQG--AKVRSDRVNPKKFYGFLN-----GKFYISTDAGATFTQSSQTGLPTKGKGIFKTV 647
Query: 357 NRSSLGAGRNGDLYIIAPDRNQGYRSVDNGASWSPVSGL-IDDFSFAYVTSTPNG--NIY 413
G GD++ IA ++ + S D+GA+++ VSG+ D + T +G IY
Sbjct: 648 ------IGHEGDIW-IAGGKDGLWHSTDSGATFTKVSGVDASDTVGLGKSKTDDGYPAIY 700
Query: 414 VVAGTFSTGGVYASEGSG 431
+ A T G++ S+ G
Sbjct: 701 MDATIDGTAGIFRSDDEG 718
>gb|AAK06388.1| (AF078042) glycosyl hydrolase 5 [Caldicellulosiruptor sp. Tok7B.1]
Length = 996
Score = 42.7 bits (99), Expect = 0.009
Identities = 44/191 (23%), Positives = 72/191 (37%), Gaps = 19/191 (9%)
Query: 242 SQDGGENWSXXXXXXXXXXXXXXXAAAPNGD-FYYINRDETIFRSTDNGENWAQVSGQLP 300
S DGG++W AAA +G + + + STDNG W + + +P
Sbjct: 538 SYDGGKSWFQGNTEPQGTSEGGTVAAAADGSAVVWAPKGAKVCYSTDNGNKWVECA-NVP 596
Query: 301 SHSFAGITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLVAEEGDGSGLPEVSNRSS 360
S + + Y F + Y S + G + E +GLP +S
Sbjct: 597 SEAIVYSDRVNPNKFYAF-------KNGKFYISADKGKTFI----ESPAAGLP-ISGNFK 644
Query: 361 LGAGRNGDLYIIAPDRNQGYRSVDNGASWSPVSGLIDDFSFAYVTSTPNGN---IYVVAG 417
G GD++++ N + S D G S+ +SG+ D S + IY A
Sbjct: 645 TVPGIEGDIWLVG--NNGMWHSTDGGYSFVKISGVEDAASIGFGKPAEGETYPAIYTYAK 702
Query: 418 TFSTGGVYASE 428
G++ S+
Sbjct: 703 INGVRGIFRSD 713
>emb|CAC02964.1| (AJ292929) CEL6 protein [Agaricus bisporus]
Length = 806
Score = 40.8 bits (94), Expect = 0.033
Identities = 60/275 (21%), Positives = 99/275 (35%), Gaps = 27/275 (9%)
Query: 181 LGAFGRFSGTPIRKSGDGGDTWLST-GLGNGLSSYNIPVMAAND------SGTILLIDIN 233
+G SG + S D G TW G + + + + A D +G +++ N
Sbjct: 510 IGTGDSTSGKQVAISSDYGVTWNQHFGAPDNVQGGKVAISADADIILWRTNGNGVMVSRN 569
Query: 234 N---------DSRPLFRSQDGGENWSXXXXXXXXXXXXXXXAAAPNGDFYYINRDETIFR 284
+ + S D G+ +S + + D+ +F
Sbjct: 570 QATFNIFYGASGKTFYVSTDNGKTFSAHGSLGSATSVFDITVHPSVSGDIWASTDKGLFH 629
Query: 285 STDNGENWAQVSGQLPSHSFA-GITVDSTGAVYVFAVFDVVPSVTRVYRSVNNGANWTLV 343
STD+G ++ +SG + A G + G VFA + V +RS + G NW +
Sbjct: 630 STDSGATFSAISGITQAWGVALGAPRSTGGYPAVFAAANYGGEVA-YFRSDDRGVNWVKI 688
Query: 344 AEEGDGSGLPEVSNRSSLGAGRNGDLYIIAPDRNQGYRSVDNGASWSPVSGLIDDFSFAY 403
+ G G +N + G +YI R Y V G++ P S S
Sbjct: 689 NDAAHGFGAAS-ANCMAADPRVYGRVYIGTNGRGIFYGDV-AGSAPPPTSSATSTTS--- 743
Query: 404 VTSTPNGNIYVVAGTFSTGGVYASEGSGGPAPGTG 438
V++T + T +T S G+G P G G
Sbjct: 744 VSTTTTRSTTTTRSTTTT----TSSGNGSPVSGFG 774
>ref|NP_642098.1| (NC_003919) cellulase [Xanthomonas axonopodis pv. citri str. 306]
gb|AAM36634.1| (AE011809) cellulase [Xanthomonas axonopodis pv. citri str. 306]
Length = 751
Score = 40.8 bits (94), Expect = 0.033
Identities = 49/178 (27%), Positives = 69/178 (38%), Gaps = 43/178 (24%)
Query: 277 NRDETIFRSTDNGENWAQVSGQLPSHSFAGIT----VDSTGAVYV-FAVFD--------- 322
+RD ++RS D G +WA+V G P+ + AG T V S AV + F VFD
Sbjct: 185 SRDAGLWRSDDRGAHWARVEG-FPADALAGATARNHVGSEQAVGIAFVVFDAASGHAGAA 243
Query: 323 -------VVPSVTRVYRSVNNGANWTLVAEEGDGSGLPEVSNRSSLGAGRNGDLYIIAPD 375
V + T +Y S + G W+ VA G GL R S G N + ++
Sbjct: 244 TPRIYVGVSTAQTSLYVSEDAGRTWSAVA--GQPKGL-----RPSHMVGHNAGQWYLSYG 296
Query: 376 RNQGYRSVDNGASWS--------------PVSGLIDDFSFAYVTSTPNGNIYVVAGTF 419
G + GA W+ P D F + V P ++A TF
Sbjct: 297 DRPGPDLMAGGALWNYDATQGRWREISPIPQPATGDGFGWGAVAVDPQHPQVLLASTF 354
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.314 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 339,263,776
Number of Sequences: 1028106
Number of extensions: 16873702
Number of successful extensions: 30518
Number of sequences better than 5.0e-02: 17
Number of HSP's better than 0.1 without gapping: 3
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 30426
Number of HSP's gapped (non-prelim): 80
length of query: 449
length of database: 324,535,921
effective HSP length: 124
effective length of query: 325
effective length of database: 197,050,777
effective search space: 64041502525
effective search space used: 64041502525
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 93 (40.4 bits)
Query= orf26 [7519-7603]
(83 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.143 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 67,709,073
Number of Sequences: 1028106
Number of extensions: 2777088
Number of successful extensions: 4718
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4718
Number of HSP's gapped (non-prelim): 0
length of query: 83
length of database: 324,535,921
effective HSP length: 59
effective length of query: 24
effective length of database: 263,877,667
effective search space: 6333064008
effective search space used: 6333064008
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf27 [7603-7802]
(198 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.321 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 119,367,929
Number of Sequences: 1028106
Number of extensions: 4571982
Number of successful extensions: 12946
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 12946
Number of HSP's gapped (non-prelim): 0
length of query: 198
length of database: 324,535,921
effective HSP length: 115
effective length of query: 83
effective length of database: 206,303,731
effective search space: 17123209673
effective search space used: 17123209673
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 88 (38.5 bits)
Query= orf28 [7841-7919]
(77 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.311 0.126 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 43,039,292
Number of Sequences: 1028106
Number of extensions: 1312202
Number of successful extensions: 2139
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2139
Number of HSP's gapped (non-prelim): 0
length of query: 77
length of database: 324,535,921
effective HSP length: 53
effective length of query: 24
effective length of database: 270,046,303
effective search space: 6481111272
effective search space used: 6481111272
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf29 [7921-8761]
(839 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED... 44 0.011
ref|NP_522101.1| (NC_003296) PROBABLE HEMAGGLUTININ-RELATED... 44 0.011
>ref|NP_519008.1| (NC_003295) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD14589.1| (AL646061) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 3501
Score = 43.5 bits (101), Expect = 0.011
Identities = 37/128 (28%), Positives = 53/128 (40%), Gaps = 32/128 (25%)
Query: 443 ENKIGTDRGNIDFSGDL-----SQFSPANKSDYWTAVIASGSESIGGLTFNNGDRLIAVQ 497
+ +G + N+ SG L S S A+ D +V+A GS S G G+ +
Sbjct: 379 QQNLGINANNVASSGTLGAGVNSDGSFAHAGDL--SVVAGGSMSATGQNVAGGNATL--- 433
Query: 498 SNDVVPTTLNSPSWRVDPKTVPQASATTLGGVKISTTSALEINASTGNLDVKAATANQAG 557
Q ++ L G + S L +NA TGNLD+ ATA+ G
Sbjct: 434 ----------------------QGASVNLAGSQTSANGNLNLNAQTGNLDLTGATASAGG 471
Query: 558 ALKADAVG 565
AL A+A G
Sbjct: 472 ALSANAQG 479
>ref|NP_522101.1| (NC_003296) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
emb|CAD17691.1| (AL646079) PROBABLE HEMAGGLUTININ-RELATED PROTEIN [Ralstonia
solanacearum]
Length = 3552
Score = 43.5 bits (101), Expect = 0.011
Identities = 51/203 (25%), Positives = 75/203 (36%), Gaps = 67/203 (33%)
Query: 371 TKTADNQPLQVQGS-DNTGPNPGGPHRAVDYQ-------PLAYRQSLAVGDYELIFDELN 422
T + N + QG DNTG G LA +Q+LAV + + +
Sbjct: 335 TNASGNLSVSAQGGIDNTGTTYGRQSVTASTSGDLTNSGTLAAQQNLAVNAHNVA----S 390
Query: 423 NAPEGPGLNAEKVKGIEAFIENKIGTDRGNIDFSGDLSQFSPANKSDYWTAVIASGSESI 482
+ G G+N++ G++ +GDLS V+A GS S
Sbjct: 391 SGTLGAGVNSD-----------------GSLAHAGDLS-------------VVAGGSMSA 420
Query: 483 GGLTFNNGDRLIAVQSNDVVPTTLNSPSWRVDPKTVPQASATTLGGVKISTTSALEINAS 542
G G+ + Q ++ L G + S L +NA
Sbjct: 421 TGQNVAGGNATL-------------------------QGASVNLAGSQTSANGNLSLNAQ 455
Query: 543 TGNLDVKAATANQAGALKADAVG 565
TGNLD+ ATA+ GAL A+A G
Sbjct: 456 TGNLDLTGATASAGGALSANAQG 478
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.314 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 527,885,665
Number of Sequences: 1028106
Number of extensions: 23443597
Number of successful extensions: 39838
Number of sequences better than 5.0e-02: 2
Number of HSP's better than 0.1 without gapping: 2
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 39829
Number of HSP's gapped (non-prelim): 7
length of query: 839
length of database: 324,535,921
effective HSP length: 130
effective length of query: 709
effective length of database: 190,882,141
effective search space: 135335437969
effective search space used: 135335437969
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 96 (41.6 bits)
Query= orf30 [8762-9136]
(373 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.311 0.128 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 182,819,854
Number of Sequences: 1028106
Number of extensions: 5879262
Number of successful extensions: 10720
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 10720
Number of HSP's gapped (non-prelim): 0
length of query: 373
length of database: 324,535,921
effective HSP length: 123
effective length of query: 250
effective length of database: 198,078,883
effective search space: 49519720750
effective search space used: 49519720750
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 92 (40.0 bits)
Query= orf31 [9212-9372]
(159 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.332 0.140 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 73,066,036
Number of Sequences: 1028106
Number of extensions: 2164437
Number of successful extensions: 8179
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8179
Number of HSP's gapped (non-prelim): 0
length of query: 159
length of database: 324,535,921
effective HSP length: 111
effective length of query: 48
effective length of database: 210,416,155
effective search space: 10099975440
effective search space used: 10099975440
T: 11
A: 40
X1: 15 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (22.0 bits)
S2: 86 (37.7 bits)
Query= orf32 [9375-9464]
(88 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.323 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 41,268,036
Number of Sequences: 1028106
Number of extensions: 1081384
Number of successful extensions: 2718
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2718
Number of HSP's gapped (non-prelim): 0
length of query: 88
length of database: 324,535,921
effective HSP length: 64
effective length of query: 24
effective length of database: 258,737,137
effective search space: 6209691288
effective search space used: 6209691288
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 84 (37.0 bits)
Query= orf33 [9470-9574]
(103 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.141 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 77,005,740
Number of Sequences: 1028106
Number of extensions: 3028339
Number of successful extensions: 4946
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4946
Number of HSP's gapped (non-prelim): 0
length of query: 103
length of database: 324,535,921
effective HSP length: 79
effective length of query: 24
effective length of database: 243,315,547
effective search space: 5839573128
effective search space used: 5839573128
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf34 [9570-9645]
(74 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.314 0.131 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 41,016,557
Number of Sequences: 1028106
Number of extensions: 1225595
Number of successful extensions: 2134
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2134
Number of HSP's gapped (non-prelim): 0
length of query: 74
length of database: 324,535,921
effective HSP length: 50
effective length of query: 24
effective length of database: 273,130,621
effective search space: 6555134904
effective search space used: 6555134904
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf35 [9636-9823]
(186 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.320 0.136 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 115,113,120
Number of Sequences: 1028106
Number of extensions: 4308382
Number of successful extensions: 7861
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7861
Number of HSP's gapped (non-prelim): 0
length of query: 186
length of database: 324,535,921
effective HSP length: 114
effective length of query: 72
effective length of database: 207,331,837
effective search space: 14927892264
effective search space used: 14927892264
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 87 (38.1 bits)
Query= orf36 [9820-9880]
(59 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.312 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 44,848,135
Number of Sequences: 1028106
Number of extensions: 1497367
Number of successful extensions: 1995
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 1995
Number of HSP's gapped (non-prelim): 0
length of query: 59
length of database: 324,535,921
effective HSP length: 35
effective length of query: 24
effective length of database: 288,552,211
effective search space: 6925253064
effective search space used: 6925253064
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf37 [9877-10062]
(184 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.322 0.139 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 106,714,148
Number of Sequences: 1028106
Number of extensions: 4067607
Number of successful extensions: 8466
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8466
Number of HSP's gapped (non-prelim): 0
length of query: 184
length of database: 324,535,921
effective HSP length: 114
effective length of query: 70
effective length of database: 207,331,837
effective search space: 14513228590
effective search space used: 14513228590
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 87 (38.1 bits)
Query= orf38 [10064-10226]
(161 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 100,547,735
Number of Sequences: 1028106
Number of extensions: 3769955
Number of successful extensions: 8101
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 8101
Number of HSP's gapped (non-prelim): 0
length of query: 161
length of database: 324,535,921
effective HSP length: 111
effective length of query: 50
effective length of database: 210,416,155
effective search space: 10520807750
effective search space used: 10520807750
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 86 (37.7 bits)
Query= orf39 [10245-10932]
(686 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
sp|P06225|DPOL_BPSP2 DNA POLYMERASE >gi|67052|pir||DJBPS2 D... 264 3e-069
gb|AAL82293.1| (AF424782) DNA polymerase A domain [Staphylo... 253 5e-066
ref|NP_646242.1| (NC_003923) DNA polymerase [Staphylococcus... 252 9e-066
ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacte... 159 1e-037
ref|NP_299569.1| (NC_002488) phage-related protein [Xylella... 90 1e-016
gb|AAA23103.1| (M94320) putative DNA polymerase [Citrus gre... 62 2e-008
gb|AAB25243.1| DNA polymerase homolog [bacterium-like organ... 62 2e-008
sp|Q9S1G2|DPO1_RHILE DNA polymerase I (POL I) >gi|5596366|g... 44 0.007
ref|NP_384281.1| (NC_003047) PROBABLE DNA POLYMERASE I PROT... 42 0.019
ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas ... 42 0.019
ref|NP_353146.1| (NC_003062) AGR_C_169p [Agrobacterium tume... 41 0.043
>sp|P06225|DPOL_BPSP2 DNA POLYMERASE
pir||DJBPS2 DNA-directed DNA polymerase (EC 2.7.7.7) - phage SPO2
gb|AAA32600.1| (K02752) DNA polymerase (gene L; ttg start codon) [Bacteriophage
SPO2]
emb|CAA25691.1| (X01458) SP02 DNA polymerase (aa 1-648) [Bacteriophage SPO2]
Length = 648
Score = 264 bits (674), Expect = 3e-069
Identities = 201/633 (31%), Positives = 296/633 (46%), Gaps = 83/633 (13%)
Query: 59 QCYCDMAKARRFSLPGSLDTAAKALR-TAEKDKKGKQLIGKLCRPVSATKARPEPRWTLY 117
Q C A LPG+LD AKAL+ +A+KDK GK LI P TKA + L
Sbjct: 94 QWRCTAVHATTLGLPGNLDGVAKALKLSAQKDKAGKALIRYFSVPCKPTKANGQRVRNLP 153
Query: 118 TA-PQDYADLYRYCDQDVKSEDHVSALVPDMTPAERE--TWLADQRINARGVLVDVESLD 174
P+ + YC QDV+ E + + P E E W DQ IN RGV +DV+ +
Sbjct: 154 EHDPEKWEKFKVYCIQDVEVERAIKNRISKFEPLESEHKLWALDQEINDRGVRIDVDLVK 213
Query: 175 NALDIVGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATVGCRMHNLKSETVAETLERD 234
+A+ Q E +T + + + A+L +L G + +L + + E +E
Sbjct: 214 HAIACDEQYQAGLIAEAKKLTG--LPNPNSTAQLKKWLEEKGLTISSLAKDKIEELIENT 271
Query: 235 DLNPTARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAG-TGRWSAGGVQLQ 293
+ + T R+L +R+ +A +VKK +++ + D R+R F+GA TGRW+ VQ+Q
Sbjct: 272 N-DETVHRVLRLRQEMAKTSVKKYLAMEKALCPDNRVRGLLQFYGASRTGRWAGRLVQVQ 330
Query: 294 NLTSKGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWALKDL 353
NL NK +D R LK
Sbjct: 331 NLPQ----------------------------------------NKIEDLDTARNLLKGG 350
Query: 354 KHRNLDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEV 413
+ ++L+ +G +LS +R FI + G +F DFSAIEA V+A LA EWR+EV
Sbjct: 351 HYEAIELL---YGQVPFVLSQLVRTAFIPSEGNEFYVSDFSAIEARVIAWLAGEEWRLEV 407
Query: 414 FRTHGKIYEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASGYGGWIGAWKAF 473
F THGKIYE SA+++ VP E + PLR+ GKVAELA GY G GA
Sbjct: 408 FNTHGKIYEASAAQMFKVPVESI-------TKGSPLRQK-GKVAELALGYQGGKGALIQM 459
Query: 474 GAEEHFEDDDAMKKAILGWRDASPEIVEFWGGQYRRDPYTRKWVREYYGLEGAVIMAILN 533
GA + + + + WR A+ +IV+FW Y +E A I A+
Sbjct: 460 GALNMGLAEGELPELVKAWRTANKKIVKFW-----------------YDVEAAAIKAVKE 502
Query: 534 PGKCYAVGDITYAVFDDVLYCRLPSGRFLQYHQPKLIESDSWKGPEYQVTFMGYNTNSQK 593
+T+ +L+ +LPSGR L Y +PKL E D G E +T+ G + +
Sbjct: 503 RKPVKLQHGLTFLYESGILFVQLPSGRRLAYAKPKL-ELDERFGKE-ALTYEGKLESGK- 559
Query: 594 GPVGWVRMDTYGGRLAENVTQXXXXXXXXXXMKQCENNGYPVVMHTHDELIAEVPGDFGS 653
W R++TYGG+L EN+ Q + + +N GY VMH HDE + +VP
Sbjct: 560 ----WGRLNTYGGKLVENIVQATARDCLAITLMRLDNAGYKTVMHVHDEAVLDVPRGKNE 615
Query: 654 VEEMAALMTGREPWREWWPIRAAGWEDDRYQKD 686
++++ A+M W + P+ A G+ D Y+KD
Sbjct: 616 LDKVEAIMGEPISWAKGLPLTADGFVTDYYKKD 648
>gb|AAL82293.1| (AF424782) DNA polymerase A domain [Staphylococcus aureus phage phi
12]
Length = 650
Score = 253 bits (646), Expect = 5e-066
Identities = 197/638 (30%), Positives = 302/638 (46%), Gaps = 91/638 (14%)
Query: 58 EQCYCDMAKARRFSLPGSLDTAAKALRT-AEKDKKGKQLIGKL---CRP--VSATKARPE 111
E+ C M + R LP SLD + LR ++KDK GK LI C+P V+ + R
Sbjct: 95 EEWICTMVNSMRIGLPASLDKVGEVLRLQSQKDKAGKNLIRYFSIPCKPTKVNGGRTRNL 154
Query: 112 PRWTLYTAPQDYADLYRYCDQDVKSEDHVSALVPD--MTPAERETWLADQRINARGVLVD 169
P L + + YC +DV+ E ++ + D +T E+ W+ DQ IN RG+ +
Sbjct: 155 PEHDL----EKWQQFIDYCIRDVEVEMAIANKIKDFPVTAIEQTYWVFDQHINDRGIKLS 210
Query: 170 VESLDNALDIVGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATVGCRMHNLKSETVAE 229
+SL +++ + ++ ++ A G S L G + NL+ +TV E
Sbjct: 211 -KSLMLGANVLDKQSKEELLKQAKHITGLENPNSPTQLLAWLKDEQGLDIPNLQKKTVQE 269
Query: 230 TLERDDLNPTARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAGTGRWSAGG 289
L+ + A+++LEIR ++ +VKK + + SD R+R + F+GAGTGRW+ G
Sbjct: 270 YLK--EATGKAKKMLEIRLQMSKTSVKKYNKMHDMMCSDERVRGLFQFYGAGTGRWAGRG 327
Query: 290 VQLQNLTSKGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWA 349
VQLQNLT D ++ R
Sbjct: 328 VQLQNLTKHYIS--------------------------------------DTELEIARDL 349
Query: 350 LKDLKHRNLDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEW 409
+K+ + +LDL++++ P LLS +R F A G + DFSAIEA V+A A+ +W
Sbjct: 350 IKEQRFDDLDLLLNV--HPQDLLSQLVRTTFTAEEGNELAVSDFSAIEARVIAWYAKEQW 407
Query: 410 RIEVFRTHGKIYEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASGYGGWIGA 469
R++VF THGKIYE SAS++ VP E + K D PLR+ GKV+ELA GY G GA
Sbjct: 408 RLDVFNTHGKIYEASASQMFNVPVESI------TKGD-PLRQK-GKVSELALGYQGGAGA 459
Query: 470 WKAFGAEEHFEDDDAMKKAILGWRDASPEIVEFWGGQYRRDPYTRKWVREYYGLEGAVIM 529
KA GA E +++ ++ + WR+A+P IV FW + A I
Sbjct: 460 LKAMGALEMGIEENELQGLVDSWRNANPNIVNFWKA-----------------CQEAAIN 502
Query: 530 AILNPGKCYAVGDITYAVFDDVLYCRLPSGRFLQYHQPK-LIESDSWKGPEYQVTFMGYN 588
+ + K + + + + L LPSGR L Y PK L+ +SW V FMG +
Sbjct: 503 TVKSR-KTHHTHGLRFYMKKGFLMIELPSGRALAY--PKALVGENSWGSQV--VEFMGLD 557
Query: 589 TNSQKGPVGWVRMDTYGGRLAENVTQXXXXXXXXXXMKQCENNGYPVVMHTHDELIAEVP 648
N + W ++ TYGG+L EN+ Q + + E G+ +V H HDE+I E+P
Sbjct: 558 LNRK-----WSKLKTYGGKLVENIVQATARDLLAISIARLEALGFKIVGHVHDEVIVEIP 612
Query: 649 GDFGSVEEMAALMTGREPWREWWPIRAAGWEDDRYQKD 686
++E+ +M W + + + G+ Y KD
Sbjct: 613 RGSNGLKEIETIMNKPVDWAKGLNLNSDGFTSPFYMKD 650
>ref|NP_646242.1| (NC_003923) DNA polymerase [Staphylococcus aureus subsp. aureus
MW2]
dbj|BAB95290.1| (AP004827) DNA polymerase [Staphylococcus aureus subsp. aureus MW2]
Length = 653
Score = 252 bits (644), Expect = 9e-066
Identities = 195/637 (30%), Positives = 299/637 (46%), Gaps = 89/637 (13%)
Query: 58 EQCYCDMAKARRFSLPGSLDTAAKALRTA-EKDKKGKQLIGKL---CRP--VSATKARPE 111
E+ C M + R LP SLD + LR +KDK GK LI C+P V+ + R
Sbjct: 98 EEWICTMVNSMRIGLPASLDKVGEVLRLQNQKDKAGKNLIRYFSIPCKPTKVNGGRTRNL 157
Query: 112 PRWTLYTAPQDYADLYRYCDQDVKSEDHVSALVPD--MTPAERETWLADQRINARGVLVD 169
P L + + YC +DV+ E ++ + D +T E+ W+ DQ IN RG+ +
Sbjct: 158 PEHDL----EKWQQFIDYCIRDVEVEMTIAHKIKDFPVTAIEQAYWVFDQHINDRGIKLS 213
Query: 170 VESLDNALDIVGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATVGCRMHNLKSETVAE 229
+SL +++ + ++ + A G S L G + NL+ +TV E
Sbjct: 214 -KSLMLGANVLDKQSKEELLNQAKHITGLENPNSPTQLLAWLKDDQGLDIPNLQKKTVQE 272
Query: 230 TLERDDLNPTARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAGTGRWSAGG 289
L+ + A+++LEIR ++ +VKK + + SD R+R + F+GAGTGRW+ G
Sbjct: 273 YLK--EATGKAKKMLEIRLQMSKTSVKKYNKMHDMMCSDERVRGLFQFYGAGTGRWAGRG 330
Query: 290 VQLQNLTSKGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWA 349
VQLQNLT D ++ R
Sbjct: 331 VQLQNLTKHYIS--------------------------------------DTELEIARDL 352
Query: 350 LKDLKHRNLDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEW 409
+K+ + +LDL++++ P LLS +R F A G + DFSAIEA V+A A+ +W
Sbjct: 353 IKEQRFDDLDLLLNV--HPQDLLSQLVRTTFTAEEGNELAVSDFSAIEARVIAWYAKEQW 410
Query: 410 RIEVFRTHGKIYEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASGYGGWIGA 469
R++VF THGKIYE SAS++ VP E + K D PLR+ GKV+ELA GY G GA
Sbjct: 411 RLDVFNTHGKIYEASASQMFNVPVESI------TKGD-PLRQK-GKVSELALGYQGGAGA 462
Query: 470 WKAFGAEEHFEDDDAMKKAILGWRDASPEIVEFWGGQYRRDPYTRKWVREYYGLEGAVIM 529
KA GA E +++ ++ + WR+A+P IV FW + A I
Sbjct: 463 LKAMGALEMGIEENELQGLVDSWRNANPNIVNFW-----------------KACQEAAIN 505
Query: 530 AILNPGKCYAVGDITYAVFDDVLYCRLPSGRFLQYHQPKLIESDSWKGPEYQVTFMGYNT 589
+ K + + + + L LPSGR L Y + + E +SW V FMG +
Sbjct: 506 TV-KSRKTHHTHGVRFYMKKGFLMIELPSGRALAYPKASVGE-NSW--GSQVVEFMGLDL 561
Query: 590 NSQKGPVGWVRMDTYGGRLAENVTQXXXXXXXXXXMKQCENNGYPVVMHTHDELIAEVPG 649
N + W ++ TYGG+L EN+ Q + + E +G+ +V H HDE+I E+P
Sbjct: 562 NRK-----WSKLKTYGGKLVENIVQATARDLLAISIARLEASGFKIVGHVHDEVIVEIPR 616
Query: 650 DFGSVEEMAALMTGREPWREWWPIRAAGWEDDRYQKD 686
++E+ +M W + + + G+ Y KD
Sbjct: 617 GSNGLKEIETIMNKPVDWAKGLNLNSDGFTSPFYMKD 653
>ref|NP_051006.1| (NC_000935) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
sp|Q9T1Q3|DPOL_BPAPS Probable DNA polymerase (P45)
gb|AAF03988.1|AF157835_45 (AF157835) P45 [Acyrthosiphon pisum bacteriophage APSE-1]
Length = 993
Score = 159 bits (401), Expect = 1e-037
Identities = 159/587 (27%), Positives = 246/587 (41%), Gaps = 102/587 (17%)
Query: 55 LQLEQCYCDMAKARRFSLPGSLDTAAKALRTAE---KDKKGKQLIGKLCRPVSATKARPE 111
L L + + + +A LPG+L KDK+GK LI LC+P K
Sbjct: 89 LPLSRVHDTLVQALAHGLPGALGLLCDIFNVNSDKAKDKEGKALISLLCKP--RPKNSKI 146
Query: 112 PRWTLYTAPQDYADLYRYCDQDVKSEDHVSALVPD--MTPAERETWLADQRINARGVLVD 169
R T T +++ Y D+ + + +P+ M E E W DQ+IN RG+ +D
Sbjct: 147 QRATALTHAEEWQRFKDYAGSDILAMREIYQHLPNWNMNVHETELWQLDQKINRRGMCMD 206
Query: 170 VESLDNALDIVGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATV-GCRMHNLKSETVA 228
VE +AL V +R + +T+ AV + ++ L +A+ G + ++++ T+
Sbjct: 207 VELAKSALTAVENEQKRLSTVTQQLTDNAVQNATQRDALLQHIASAFGITLPDMQASTLQ 266
Query: 229 ETLERDDLNPTARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAG-TGRWSA 287
+ D+ P R +L +R + K +L +S+DGRLR F GA TGRW+
Sbjct: 267 RRINDPDIPPALRELLSVRLQSCTTSTSKYKALLKSVSADGRLRGTKQFCGASRTGRWAG 326
Query: 288 GGVQLQNLTSKGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVR 347
Q NL P N T+D
Sbjct: 327 RVFQPDNLPR------------------------------PTLDPN--------TIDN-- 346
Query: 348 WALKDLKHRNLDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARC 407
++ LK +LI D D + L S LRG IA PGKK V D S IE +LA LA
Sbjct: 347 -GIEALKAGCAELICD---DIMQLTSSALRGCIIAPPGKKLVISDLSNIEGRMLAWLAGE 402
Query: 408 EWRIEVFR--THGK---IYEMSASKISGVPFEEMMAYKEKNKHDHPLRKSLGKVAELASG 462
W++ F +GK +Y+++ ++ + E++ KE ++ +GKV EL G
Sbjct: 403 NWKVNAFSEFDNGKGDDLYKLAYARAFNLLPEDVT--KE--------QRQIGKVMELGLG 452
Query: 463 YGGWIGAWKAFGAEEHFEDDD-----------AMKKAILGWRDASPEIVEFWGGQYR--- 508
YGG + A+ F + D+ +K+ + W S E + +G +
Sbjct: 453 YGGGVAAFLTFALAYGLDLDELAEAALPNIPPGVKREAMRWYQKSVETDKTYGLSEKIFV 512
Query: 509 -RDPYTRKW-------VREYYGLEGAVIMAILNPGKCYAVGDITYAVFDDVLYCRLPSGR 560
D R W V +Y +E AV AI +PG + ++ L LPSGR
Sbjct: 513 TCDSLKRMWRNAHPQTVSFWYDIEDAVKQAIQSPGIAFKCRKLSVRRDKSWLRICLPSGR 572
Query: 561 FLQYHQPKLIESDSWKGPEYQVTFMGYNTNSQKGPVGWVRMDTYGGR 607
L Y ++ Q+T+MG N S+K W R+ TYGG+
Sbjct: 573 SLCYPSARIENG--------QITYMGTNPYSRK----WERLKTYGGK 607
Score = 45.1 bits (105), Expect = 0.003
Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 610 ENVTQXXXXXXXXXXMKQCENNGYPVVMHTHDELIAEVPG-DFGSVEEMAALMTGREPWR 668
EN+ Q M E GY +V+ HDE+I+E P S E ++ L++ W
Sbjct: 916 ENICQAAARDVLAYNMPPIEKAGYEIVLTVHDEIISEAPDTPQFSAEGLSKLLSFNSDWA 975
Query: 669 EWWPIRAAGWEDDRYQKD 686
P+ A G+E RY+K+
Sbjct: 976 WDLPLSANGFETYRYRKE 993
>ref|NP_299569.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
pir||G82574 phage-related protein XF2290 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85089.1|AE004041_1 (AE004041) phage-related protein [Xylella fastidiosa 9a5c]
Length = 312
Score = 89.7 bits (221), Expect = 1e-016
Identities = 86/298 (28%), Positives = 122/298 (40%), Gaps = 48/298 (16%)
Query: 122 DYADLYRYCDQDVKSEDHVSALVP--DMTPAERETWLADQRINARGVLVDVESLDNALDI 179
D+A Y +DV + V +P + AE W DQ IN RGVL+D + + A+
Sbjct: 2 DWAQFVEYARRDVAAMRDVVKRLPSHNYAGAELALWFLDQTINDRGVLIDTDLVQAAIGA 61
Query: 180 VGQTTRRFTMELAAITNGAVGSVSEVAKLTDFLATV-GCRMHNLKSETVAETLERDDLNP 238
V + + A+T GAV + ++ L L T G + +++ TVA ++ L
Sbjct: 62 VERAKQTLAQRTEALTGGAVQAATQRDALLQHLNTAHGVALPDMQQHTVARCVDDPALPE 121
Query: 239 TARRILEIREALAGANVKKLFSLKAQISSDGRLRNQYNFFGAG-TGRWSAGGVQLQNLTS 297
T R +L IR + + K +L S DGRLR F GA TGRW+ Q NL
Sbjct: 122 TVRELLSIRRQASSTSTAKYQALLHCTSPDGRLRGTLQFNGASRTGRWAGRLFQPHNL-- 179
Query: 298 KGPKSKTCNGCGRIVGKSCRVEVMGAAGLCPECGANDWTDNKDWTVDAVRWALKDLKHRN 357
P+ S V +G +DA++ DL
Sbjct: 180 --PRPTL----------SQEVIAVG--------------------IDAMKAGCVDL---- 203
Query: 358 LDLIIDIWGDPITLLSGCLRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFR 415
++ D + L S LR IA KK V D S IE VLA LA ++ FR
Sbjct: 204 ------VFDDVMALTSSALRSCLIAPTHKKLVVADLSIIEGRVLAWLAGETPKLHAFR 255
>gb|AAA23103.1| (M94320) putative DNA polymerase [Citrus greening
disease-associated bacterium]
Length = 206
Score = 62.4 bits (150), Expect = 2e-008
Identities = 47/165 (28%), Positives = 73/165 (43%), Gaps = 24/165 (14%)
Query: 17 PTDLIDHIANGGPIA-AWNITFEFWIWNMVLAKSEGWPWLQLEQCYCDMAKARRFSLPGS 75
P+DL+ ++ + + A N FE ++ L + ++ C AR LP S
Sbjct: 54 PSDLLQYLRDETVMCVAHNSLFERILFKKTLGID-----IPSKRWICTSVLARTNGLPSS 108
Query: 76 LDTAAKALRTAE---KDKKGKQLIGKLCR------PVSATKARPEPRWTLYTAPQDYADL 126
L A ALR E K ++GK LI + C+ P T+A W L+
Sbjct: 109 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFR-------- 160
Query: 127 YRYCDQDVKSEDHVSALVPDMTPAERETWLADQRINARGVLVDVE 171
YC +DV++ + + + P ER WL DQ IN RG +D++
Sbjct: 161 -EYCKRDVEATRELFKSLTPLNPEERTLWLLDQEINDRGYRIDLD 204
>gb|AAB25243.1| DNA polymerase homolog [bacterium-like organism, citrus greening
disease-associated, Peptide, 207 aa]
Length = 207
Score = 62.4 bits (150), Expect = 2e-008
Identities = 47/165 (28%), Positives = 73/165 (43%), Gaps = 24/165 (14%)
Query: 17 PTDLIDHIANGGPIA-AWNITFEFWIWNMVLAKSEGWPWLQLEQCYCDMAKARRFSLPGS 75
P+DL+ ++ + + A N FE ++ L + ++ C AR LP S
Sbjct: 55 PSDLLQYLRDETVMCVAHNSLFERILFKKTLGID-----IPSKRWICTSVLARTNGLPSS 109
Query: 76 LDTAAKALRTAE---KDKKGKQLIGKLCR------PVSATKARPEPRWTLYTAPQDYADL 126
L A ALR E K ++GK LI + C+ P T+A W L+
Sbjct: 110 LKNACLALRFPEHLTKKEEGKALIARFCKGSIDSPPYDCTRANHVQAWALFR-------- 161
Query: 127 YRYCDQDVKSEDHVSALVPDMTPAERETWLADQRINARGVLVDVE 171
YC +DV++ + + + P ER WL DQ IN RG +D++
Sbjct: 162 -EYCKRDVEATRELFKSLTPLNPEERTLWLLDQEINDRGYRIDLD 205
>sp|Q9S1G2|DPO1_RHILE DNA polymerase I (POL I)
gb|AAD45559.1|U86403_1 (U86403) DNA polymerase I [Rhizobium leguminosarum]
Length = 1016
Score = 43.9 bits (102), Expect = 0.007
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 376 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKIYEMSASKISGVPFEE 435
+R FI+ PG K + D+S IE VLA +A + F I+ M+AS++ GVP E
Sbjct: 771 IRTAFISTPGHKLISADYSQIELRVLAHVAEIPQLTKAFEDGVDIHAMTASEMFGVPVEG 830
Query: 436 M 436
M
Sbjct: 831 M 831
>ref|NP_384281.1| (NC_003047) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
meliloti]
emb|CAC41562.1| (AL591782) PROBABLE DNA POLYMERASE I PROTEIN [Sinorhizobium
meliloti]
Length = 1004
Score = 42.4 bits (98), Expect = 0.019
Identities = 24/61 (39%), Positives = 34/61 (55%)
Query: 376 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKIYEMSASKISGVPFEE 435
+R F+A PG K V D+S IE VLA +A + F I+ M+AS++ GVP +
Sbjct: 761 IRTAFVATPGHKLVSADYSQIELRVLAHVADIPQLRQAFADGVDIHAMTASEMFGVPVDG 820
Query: 436 M 436
M
Sbjct: 821 M 821
>ref|NP_254180.1| (NC_002516) DNA polymerase I [Pseudomonas aeruginosa]
pir||F82958 DNA polymerase I PA5493 [imported] - Pseudomonas aeruginosa
(strain PAO1)
gb|AAG08878.1|AE004962_2 (AE004962) DNA polymerase I [Pseudomonas aeruginosa]
Length = 913
Score = 42.4 bits (98), Expect = 0.019
Identities = 19/61 (31%), Positives = 37/61 (60%)
Query: 376 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKIYEMSASKISGVPFEE 435
+R F+A G K + D+S IE ++A LA+ + ++ FR ++ +A+++ GVP E+
Sbjct: 674 IRQAFVAPQGYKLLAADYSQIELRIMAHLAKDDGLLDAFRHDLDVHRATAAEVFGVPLED 733
Query: 436 M 436
+
Sbjct: 734 V 734
>ref|NP_353146.1| (NC_003062) AGR_C_169p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
ref|NP_530819.1| (NC_003304) DNA polymerase I [Agrobacterium tumefaciens str. C58
(U. Washington)]
gb|AAK85931.1| (AE007952) AGR_C_169p [Agrobacterium tumefaciens str. C58 (Cereon)]
gb|AAL41135.1| (AE008985) DNA polymerase I [Agrobacterium tumefaciens str. C58 (U.
Washington)]
Length = 1031
Score = 41.2 bits (95), Expect = 0.043
Identities = 24/61 (39%), Positives = 33/61 (53%)
Query: 376 LRGLFIAAPGKKFVCCDFSAIEAVVLAALARCEWRIEVFRTHGKIYEMSASKISGVPFEE 435
+R FI+ PG K + D+S IE VLA +A F I+ M+AS++ GVP E
Sbjct: 788 IRTAFISTPGHKLLSADYSQIELRVLAHVADIPQLRNAFENGIDIHAMTASEMFGVPVEG 847
Query: 436 M 436
M
Sbjct: 848 M 848
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.319 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 473,031,019
Number of Sequences: 1028106
Number of extensions: 20509499
Number of successful extensions: 42979
Number of sequences better than 5.0e-02: 15
Number of HSP's better than 0.1 without gapping: 12
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 42901
Number of HSP's gapped (non-prelim): 36
length of query: 686
length of database: 324,535,921
effective HSP length: 128
effective length of query: 558
effective length of database: 192,938,353
effective search space: 107659600974
effective search space used: 107659600974
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 95 (41.2 bits)
Query= orf40 [10952-11194]
(241 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 164,092,175
Number of Sequences: 1028106
Number of extensions: 6878976
Number of successful extensions: 10673
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 10673
Number of HSP's gapped (non-prelim): 0
length of query: 241
length of database: 324,535,921
effective HSP length: 118
effective length of query: 123
effective length of database: 203,219,413
effective search space: 24995987799
effective search space used: 24995987799
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 89 (38.9 bits)
Query= orf41 [11217-11573]
(355 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.319 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 125,280,771
Number of Sequences: 1028106
Number of extensions: 4585833
Number of successful extensions: 5950
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 5950
Number of HSP's gapped (non-prelim): 0
length of query: 355
length of database: 324,535,921
effective HSP length: 122
effective length of query: 233
effective length of database: 199,106,989
effective search space: 46391928437
effective search space used: 46391928437
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 92 (40.0 bits)
Query= orf42 [11577-11953]
(375 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_297976.1| (NC_002488) phage-related protein [Xylella... 46 8e-004
ref|NP_299800.1| (NC_002488) phage-related protein [Xylella... 46 8e-004
ref|NP_051012.1| (NC_000935) hypothetical protein [Acyrthos... 45 0.002
>ref|NP_297976.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
pir||C82775 phage-related protein XF0686 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF83496.1|AE003912_8 (AE003912) phage-related protein [Xylella fastidiosa 9a5c]
Length = 425
Score = 45.8 bits (107), Expect = 8e-004
Identities = 47/177 (26%), Positives = 73/177 (40%), Gaps = 9/177 (5%)
Query: 4 QHASIPPSGAPIWGHCSGYLGVMHKVADFDTDATRAGTASHWAFSEAVLNDLDAEDYIGH 63
QHA + PS A W HC + + D + GT +H + A+ DA Y+G
Sbjct: 3 QHAMLSPSSAHRWLHCPASVPLTRTCKDDASPFADEGTVAHTVAANALRTGSDASAYVGE 62
Query: 64 A-DPDGTI--VDEGMAHGAQLMVDMVREDMEKHPGGQLYIEQRVYMPDIHPDNWGTLDYA 120
+ +G V MA Q VD VR G +EQ +++ I +
Sbjct: 63 RHEINGRTWEVTAEMAASVQEYVDYVRA-----IAGVRLVEQPLHIASITGEQGAKGTAD 117
Query: 121 YVLPAIKRVILGDYKHGH-LEVSPERNLQLVDYLKGLENLVGYTFDGWRIDLKVCQP 176
V+ A + + D K+G ++V E N QL Y + G+ D ++ L + QP
Sbjct: 118 VVILAGDALTIVDLKYGKGVKVFAEGNEQLQLYALAALSEFGWVEDFQQVRLVIVQP 174
>ref|NP_299800.1| (NC_002488) phage-related protein [Xylella fastidiosa 9a5c]
pir||H82547 phage-related protein XF2522 [imported] - Xylella fastidiosa
(strain 9a5c)
gb|AAF85320.1|AE004059_10 (AE004059) phage-related protein [Xylella fastidiosa 9a5c]
Length = 425
Score = 45.8 bits (107), Expect = 8e-004
Identities = 47/177 (26%), Positives = 73/177 (40%), Gaps = 9/177 (5%)
Query: 4 QHASIPPSGAPIWGHCSGYLGVMHKVADFDTDATRAGTASHWAFSEAVLNDLDAEDYIGH 63
QHA + PS A W HC + + D + GT +H + A+ DA Y+G
Sbjct: 3 QHAMLSPSSAHRWLHCPASVPLTRTCQDDASPFADEGTVAHTVAANALRTGSDASAYVGE 62
Query: 64 A-DPDGTI--VDEGMAHGAQLMVDMVREDMEKHPGGQLYIEQRVYMPDIHPDNWGTLDYA 120
+ +G V MA Q VD VR G +EQ +++ I +
Sbjct: 63 RHEINGRTWEVTAEMAASVQEYVDYVRA-----IAGVRLVEQPLHIASITGEQGAKGTAD 117
Query: 121 YVLPAIKRVILGDYKHGH-LEVSPERNLQLVDYLKGLENLVGYTFDGWRIDLKVCQP 176
V+ A + + D K+G ++V E N QL Y + G+ D ++ L + QP
Sbjct: 118 VVILAGDALTIVDLKYGKGVKVFAEGNEQLQLYALAALSEFGWVEDFQQVRLVIVQP 174
>ref|NP_051012.1| (NC_000935) hypothetical protein [Acyrthosiphon pisum bacteriophage
APSE-1]
sp|Q9T1P7|VP51_BPAPS Putative protein P51
gb|AAF03994.1|AF157835_51 (AF157835) P51 [Acyrthosiphon pisum bacteriophage APSE-1]
Length = 439
Score = 44.7 bits (104), Expect = 0.002
Identities = 50/193 (25%), Positives = 83/193 (42%), Gaps = 26/193 (13%)
Query: 4 QHASIPPSGAPIWGHCSGYLGVMHKVADFDTDATRAGTASHWAFSEAVLN---------- 53
+HA + PS A W C G + + D + G+A+H A +E++L
Sbjct: 10 EHAKLAPSSAHRWLKCPGSTALEATLPDKTSPFAEEGSAAH-ALAESILKMRQNPFSDGR 68
Query: 54 ----DLDAEDYIGH---ADPDGTIVDEGMAHGAQLMVDMVREDMEKHPGGQLYIEQRV-Y 105
D ++YIG + VDE M Q +D V + G L +E+RV +
Sbjct: 69 RRTYGFDVKEYIGTYPLLKANSPQVDEEMIEHVQTYIDTV---WQLANGKMLQVEERVDF 125
Query: 106 MPDIHPDN-WGTLDYAYVLPAIKRVILGDYKHGH-LEVSPERNLQLVDYLKGLENLVGYT 163
I DN +GT D ++ + + + D K+G ++V + N QL+ Y G +
Sbjct: 126 SAVIGVDNSFGTAD--AIIVSSDELQIHDLKYGKGVKVDAQNNEQLMLYALGALHQFDLV 183
Query: 164 FDGWRIDLKVCQP 176
+D + L + QP
Sbjct: 184 YDFKTVRLFIHQP 196
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.317 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 254,503,813
Number of Sequences: 1028106
Number of extensions: 10752186
Number of successful extensions: 25572
Number of sequences better than 5.0e-02: 3
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 3
Number of HSP's that attempted gapping in prelim test: 25572
Number of HSP's gapped (non-prelim): 3
length of query: 375
length of database: 324,535,921
effective HSP length: 123
effective length of query: 252
effective length of database: 198,078,883
effective search space: 49915878516
effective search space used: 49915878516
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 92 (40.0 bits)
Query= orf43 [11954-12238]
(283 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.309 0.127 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 185,208,683
Number of Sequences: 1028106
Number of extensions: 7804335
Number of successful extensions: 27615
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 27615
Number of HSP's gapped (non-prelim): 0
length of query: 283
length of database: 324,535,921
effective HSP length: 120
effective length of query: 163
effective length of database: 201,163,201
effective search space: 32789601763
effective search space used: 32789601763
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 90 (39.3 bits)
Query= orf44 [12267-12335]
(67 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.323 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 31,807,219
Number of Sequences: 1028106
Number of extensions: 669762
Number of successful extensions: 2653
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2653
Number of HSP's gapped (non-prelim): 0
length of query: 67
length of database: 324,535,921
effective HSP length: 43
effective length of query: 24
effective length of database: 280,327,363
effective search space: 6727856712
effective search space used: 6727856712
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 85 (37.4 bits)
Query= orf45 [12332-13051]
(718 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
emb|CAB63670.1| (AJ251789) hypothetical protein [Lactobacil... 202 9e-051
ref|NP_511031.1| (NC_003291) helicase [Listeria phage 2389]... 197 5e-049
ref|NP_050128.1| (NC_000896) putative DEAH-family helicase ... 196 6e-049
ref|NP_049952.1| (NC_000871) putative helicase [Streptococc... 190 6e-047
ref|NP_049999.1| (NC_000872) putative helicase [Streptococc... 190 6e-047
pir||T13299 probable helicase - Streptococcus phage phi-O12... 190 6e-047
ref|NP_478500.1| (NC_003267) ORF_ID:all8075~probable helica... 189 1e-046
ref|NP_471919.1| (NC_003212) similar to DEAH-family helicas... 188 2e-046
gb|AAL74161.1|AF442520_1 (AF442520) putative helicase [Stre... 187 3e-046
sp|P24125|V51K_BPL79 51.5 kDa protein >gi|76364|pir||JU0445... 179 1e-043
ref|NP_490263.1| (NC_003276) putative helicase [Nostoc sp. ... 170 6e-041
ref|NP_607398.1| (NC_003485) hypothetical phage protein [St... 169 1e-040
ref|NP_459877.1| (NC_003197) Fels-1 prophage; putative DNA ... 120 8e-026
ref|NP_142208.1| (NC_000961) hypothetical protein [Pyrococc... 85 3e-015
ref|NP_599062.1| (NC_003444) unknown [Shigella flexneri bac... 83 1e-014
ref|NP_602261.1| (NC_003450) COG1061:DNA or RNA helicases o... 66 2e-009
ref|XP_132906.1| (XM_132906) similar to KIAA0111 gene produ... 57 6e-007
ref|NP_248512.1| (NC_000909) ATP dependent RNA helicase, pu... 55 3e-006
sp|O02494|IF4A_CRYPV Eukaryotic initiation factor 4A (eIF-4... 54 5e-006
ref|NP_540200.1| (NC_003317) Hypothetical Cytosolic Protein... 53 1e-005
ref|NP_531749.1| (NC_003304) conserved hypothetical protein... 49 3e-004
ref|NP_354075.1| (NC_003062) AGR_C_1942p [Agrobacterium tum... 49 3e-004
gb|AAB67607.1| (U73459) translational initiation factor eIF... 49 3e-004
ref|NP_385209.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN... 47 6e-004
emb|CAB38640.1| (Y17288) RNA helicase [Plasmodium falciparum] 45 0.004
>emb|CAB63670.1| (AJ251789) hypothetical protein [Lactobacillus casei bacteriophage
A2]
Length = 455
Score = 202 bits (515), Expect = 9e-051
Identities = 144/422 (34%), Positives = 214/422 (50%), Gaps = 52/422 (12%)
Query: 1 MFKLRDYQQLAYNRI---MAAWEKYRSVLAVLPTGAGKTVIFSKIIH---DHTGAAAAIV 54
MF+L YQ+ N+ +AA K SVL V P G+GK+VI ++I + +G +V
Sbjct: 1 MFQLHPYQKELVNQAREKLAAGNK--SVLLVSPAGSGKSVIIAEIARLAVERSGHVMFMV 58
Query: 55 HRREIVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLT 114
HR+E+V QI + + + A T+ T+ K
Sbjct: 59 HRQELVNQIVQTFQA----DEIDLQATTIMTVGK-------------------------- 88
Query: 115 SKSTLNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVG 174
+ LN + + +L + DE HH + + + K + + L +A+P R +G G
Sbjct: 89 IANRLNR------LPRPSLIITDESHHSLAK-TYRKIYSFYADVPRLGFSASPWRMNGQG 141
Query: 175 LGKGEGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKAR 234
LG E M+EGP+ WLI++ +L+ + Y AP T ID++ L T GD+ K++ A
Sbjct: 142 LGN----VYESMVEGPSVKWLIDHQYLAPYDYYAP-TLIDVQKLQTSSTGDYTNKSMDAA 196
Query: 235 VVESHIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERD 294
V ++ I GDVV HY+ ++AIV+A +E ++Q+ E F+ AG TA +T +G+RD
Sbjct: 197 VPKA-IFGDVVSHYQHLAGGRQAIVYAHSIEASKQVVEAFQSAGITAVHADAKTPKGQRD 255
Query: 295 HLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEY 354
++A F+ G I +L NVDL EGF+VP V +I+ RPT SL +Q R +R GK
Sbjct: 256 RIMADFKAGKITILSNVDLISEGFNVPDVGVIIMLRPTASLVLDIQQSMRGMRYKPGKR- 314
Query: 355 AIIIDPVRNWERHGPPTAVRNWSLMGREKGQGSGGDTIPQKTCDACTVPYEAYYKACPYC 414
+IIID V N R G P W+L R K + P KTCDAC K CP C
Sbjct: 315 SIIIDHVANAYRFGLPDTEHEWTLNDRPKKKKRDNTGPPIKTCDACYAVIPVQCKICPIC 374
Query: 415 GN 416
G+
Sbjct: 375 GH 376
>ref|NP_511031.1| (NC_003291) helicase [Listeria phage 2389]
emb|CAC85606.1| (AJ312240) helicase [Bacteriophage PSA]
Length = 418
Score = 197 bits (500), Expect = 5e-049
Identities = 138/431 (32%), Positives = 213/431 (49%), Gaps = 47/431 (10%)
Query: 32 GAGKTVIFSKIIHDHT---GAAAAIVHRREIVAQISLSLASFGVKHRVIAPAKTLKTIRK 88
GAGK+VI S+II T +VHR+E++ QI +L V +
Sbjct: 9 GAGKSVILSEIIRMTTHNKNYVLFLVHRKELIDQIRNTLIMNEVDMSYV----------- 57
Query: 89 KHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLNNAAIMRWVNQVTLGVYDEGHHYVEQGIW 148
K G VQT+ + + ++ +L + DE HH + +
Sbjct: 58 ----KLGM-------------VQTIVRR--------LNKTSEPSLIIIDESHHVLANS-Y 91
Query: 149 AKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVMIEGPTTHWLIENGFLSKFVYRA 208
K +H F AK++ TATP R +G GLG + +IE WLIEN FL+ + Y A
Sbjct: 92 KKIIHHFSKAKVVGFTATPVRINGGGLGD----INDTLIEKVNVKWLIENQFLAHYKYYA 147
Query: 209 PQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVETAE 268
P+T + L + G+F+ +L + + I GDV++HY+K + ++AI++A+ + +E
Sbjct: 148 PET-VQTETLNVKRTGEFDMTSLDDQFNKRMIWGDVIKHYQKLADGEQAILYASSIYQSE 206
Query: 269 QMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVIL 328
+MA F AG ++A + G+T + RD ++ QF EG ++VL N+DL EGFDVP VI+
Sbjct: 207 KMAASFNAAGISSAHIDGKTPKPIRDDIIKQFREGELKVLCNLDLIGEGFDVPDCSTVIM 266
Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRNWSLMGREKGQGSG 388
RPTQSL+ ++Q R +R +GK A IID V N +R G P R WSL R KG S
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYKQGKT-ATIIDHVGNVKRFGLPDMERTWSLEPR-KGSNST 324
Query: 389 GDTIPQKTCDACTVPYEAYYKACPYCGNVNEIADRSAPDKVDGDLVELDLDAWNALFAEI 448
P K C C + K C +CG+ ++ + +L E+ + ++
Sbjct: 325 KAEAPVKICKECFMTVSQTAKKCEHCGHEFKVEVKPIQIDEAAELQEITEAVFKVNYSSP 384
Query: 449 ERAKMSDDDYE 459
+ K + YE
Sbjct: 385 DECKNMKELYE 395
>ref|NP_050128.1| (NC_000896) putative DEAH-family helicase [Lactobacillus
bacteriophage phi adh]
emb|CAB52498.1| (AJ131519) putative DEAH-family helicase [Lactobacillus
bacteriophage phi adh]
Length = 455
Score = 196 bits (499), Expect = 6e-049
Identities = 138/443 (31%), Positives = 220/443 (49%), Gaps = 50/443 (11%)
Query: 4 LRDYQQLAYNRIMAAWEK-YRSVLAVLPTGAGKTVIFSKI---IHDHTGAAAAIVHRREI 59
+R YQ+ + + + + +++VL P G+GK++ +++ I + +VHRRE+
Sbjct: 5 IRSYQKKLIDDLRLSLKHGHKNVLIQSPAGSGKSITMAELAKRITSNGERVLFVVHRREL 64
Query: 60 VAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTL 119
V QI + +S+GV + C + VQT + + L
Sbjct: 65 VKQIKNTFSSWGVNMNL----------------------------CEIYMVQTASRR--L 94
Query: 120 NNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKGE 179
++ + DE HH + + + K + F+NA +L TATP R G KG
Sbjct: 95 EKLMTPNYI------LVDEAHHSLAK-TYKKINNHFKNAHILGFTATPVRLSG----KGF 143
Query: 180 GGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVESH 239
+ ++ GP WLIEN +L+ + Y + ID L GD+ K+++ ++
Sbjct: 144 KDIYDDLVLGPKISWLIENHYLAPYTYYSVNL-IDQTKLKKSSTGDYTHKSIE-NAGKNI 201
Query: 240 IVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQ 299
+ GDV+Q Y K+ N +AI+++ V + +Q+A+EF + A V G+T++ +RD +
Sbjct: 202 VYGDVIQSYHKFANNTKAIIYSYSVHSCQQIAKEFNKNNIPAKEVDGKTKKEDRDKAMQD 261
Query: 300 FEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIID 359
F +G I++LVN +L+ EG DVP E VI+ RPTQSL+ F+Q R +R K+ AIIID
Sbjct: 262 FRDGKIKILVNAELYGEGVDVPDCETVIMLRPTQSLSLFIQQSMRCMRYQPDKQ-AIIID 320
Query: 360 PVRNWERHGPPTAVRNWSLMGREK--GQGSGGDTIPQKTCDACTVPYEAYYKACPYCGNV 417
V N+ R G P R W+L R K + G D I KTC C A Y CP CG
Sbjct: 321 QVANYTRFGLPDMDRVWTLEDRAKHPQREGGSDGIAIKTCPNCFGVIMASYHKCPLCGYS 380
Query: 418 NEIADRSAPDKVDGDLVELDLDA 440
E R + +L +++LDA
Sbjct: 381 FEAEFRKLAEDKRAELEKINLDA 403
>ref|NP_049952.1| (NC_000871) putative helicase [Streptococcus thermophilus
bacteriophage Sfi19]
gb|AAC97921.1| (AF077306) putative helicase; gp443 [Streptococcus thermophilus
bacteriophage Sfi19]
gb|AAD44071.1|AF115102_30 (AF115102) orf443 gp [Streptococcus thermophilus bacteriophage
Sfi19]
Length = 443
Score = 190 bits (482), Expect = 6e-047
Identities = 146/481 (30%), Positives = 232/481 (47%), Gaps = 62/481 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HRRE 58
+LR+YQ N I + + ++ P +GKTV+ + I T I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKST 118
I Q+ + FK+ G V + +T S
Sbjct: 62 INEQVVNT------------------------FKRNG------------VDMNLVTIDSV 85
Query: 119 LNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKG 178
A + + + ++ + DE HH V+ + K + + N+ +L T TP R DG G
Sbjct: 86 TKIARNLDRIQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 EGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVES 238
A+ ++ G WL +NG ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLA 298
I GDV++HY K + K+AIV+ VE +E ++ F GYT+ ++SG+T RD +
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 QFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F +G + ++VN +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRNWSLMGREKGQGS---GGDTIPQKTCDACTVPYEAYYKACPYCG 415
D V N +RHG P A R WSL G K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPT--TRTCDECYATFWSAERICPMCG 374
Query: 416 NVNEIADRSAPDKVDGDLV-ELDL-DAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRG 473
+ N+ P K + +++ E++L + + +++E SD ++ +K Q G
Sbjct: 375 HENK------PTKEEVEIIREIELEERRQEVASKVETFVTSDQCQTVEELKEFAKQHGYK 428
Query: 474 P 474
P
Sbjct: 429 P 429
>ref|NP_049999.1| (NC_000872) putative helicase [Streptococcus thermophilus
bacteriophage Sfi21]
ref|NP_056709.1| (NC_002214) putative helicase [Streptococcus thermophilus
bacteriophage Sfi11]
gb|AAC72435.1| (AF004379) orf443 [Streptococcus thermophilus bacteriophage Sfi21]
gb|AAD44104.1|AF115103_34 (AF115103) orf443 gp [Streptococcus thermophilus bacteriophage
Sfi21]
gb|AAF63056.1|AF158600_10 (AF158600) putative helicase [Streptococcus thermophilus
bacteriophage Sfi11]
gb|AAF63085.1|AF158601_13 (AF158601) putative helicase [Streptococcus thermophilus
bacteriophage SFi18]
Length = 443
Score = 190 bits (482), Expect = 6e-047
Identities = 146/481 (30%), Positives = 232/481 (47%), Gaps = 62/481 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HRRE 58
+LR+YQ N I + + ++ P +GKTV+ + I T I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKST 118
I Q+ + FK+ G V + +T S
Sbjct: 62 INEQVVNT------------------------FKRNG------------VDMNLVTIDSV 85
Query: 119 LNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKG 178
A + + + ++ + DE HH V+ + K + + N+ +L T TP R DG G
Sbjct: 86 TKIARNLDRIQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 EGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVES 238
A+ ++ G WL +NG ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKPVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLA 298
I GDV++HY K + K+AIV+ VE +E ++ F GYT+ ++SG+T RD +
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 QFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F +G + ++VN +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRNWSLMGREKGQGS---GGDTIPQKTCDACTVPYEAYYKACPYCG 415
D V N +RHG P A R WSL G K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGVNKTKKKLKLGEPT--TRTCDECYATFWSAERICPMCG 374
Query: 416 NVNEIADRSAPDKVDGDLV-ELDL-DAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRG 473
+ N+ P K + +++ E++L + + +++E SD ++ +K Q G
Sbjct: 375 HENK------PTKEEVEIIREIELEERRQEVASKVETFVTSDQCQTVEELKEFAKQHGYK 428
Query: 474 P 474
P
Sbjct: 429 P 429
>pir||T13299 probable helicase - Streptococcus phage phi-O1205
gb|AAC79526.1| (U88974) ORF10 [Streptococcus thermophilus temperate bacteriophage
O1205]
Length = 443
Score = 190 bits (482), Expect = 6e-047
Identities = 145/481 (30%), Positives = 233/481 (48%), Gaps = 62/481 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HRRE 58
+LR+YQ N I + + ++ P +GKTV+ + I T I+ HR+E
Sbjct: 2 ELRNYQSDLVNDIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTILFFSHRKE 61
Query: 59 IVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKST 118
I Q+ + FK+ G V + +T S
Sbjct: 62 INEQVVNT------------------------FKRNG------------VDMNLVTIDSV 85
Query: 119 LNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKG 178
A + + + ++ + DE HH V+ + K + + N+ +L T TP R DG G
Sbjct: 86 TKIARNLDRIQEPSIILIDEAHH-VKAKTYLKIIEYYSNSIVLMFTGTPARLDGSGFDD- 143
Query: 179 EGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVES 238
A+ ++ G + WL +NG ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKSVKWLQDNGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLA 298
I GDV++HY K + K+AIV+ VE +E ++ F GYT+ ++SG+T RD +
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFNEQGYTSIAISGKTPPEVRDRAMQ 257
Query: 299 QFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F +G + ++VN +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 AFRDGKLTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRNWSLMGREKGQGS---GGDTIPQKTCDACTVPYEAYYKACPYCG 415
D V N +RHG P A R WS+ G K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSIKGVNKTKKKLKLGEPT--TRTCDECYATFWSAERICPMCG 374
Query: 416 NVNEIADRSAPDKVDGDLV-ELDL-DAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRG 473
+ N+ P K + +++ E++L + + +++E SD ++ +K Q G
Sbjct: 375 HENK------PTKEEVEIIREIELEERRQEVASKVETFVTSDQCQTVEELKEFAKQHGYK 428
Query: 474 P 474
P
Sbjct: 429 P 429
>ref|NP_478500.1| (NC_003267) ORF_ID:all8075~probable helicase [Nostoc sp. PCC 7120]
dbj|BAB77405.1| (AP003603) ORF_ID:all8075~probable helicase [Nostoc sp. PCC 7120]
Length = 681
Score = 189 bits (480), Expect = 1e-046
Identities = 138/452 (30%), Positives = 206/452 (45%), Gaps = 51/452 (11%)
Query: 1 MFKLRDYQQLAYNRIMAAWEK-YRSVLAVLPTGAGKTVIFSKIIH---DHTGAAAAIVHR 56
+++LR+YQ I +W++ R VLA LPTGAGKT+ F+ I I HR
Sbjct: 7 IYELRNYQHQWIKDIWNSWKRGNRRVLAQLPTGAGKTICFAHICQKFFQKQQKVLVIAHR 66
Query: 57 REIVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSK 116
E++ Q + L V P +K G P V S+QTL +
Sbjct: 67 IELITQAAEKLEQI-----VGEPVGIIK----------GGCAAHPERRIQVASIQTLARR 111
Query: 117 STLNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLG 176
I+ + L ++DE HH + + + ++ A++L VTATP+R DG G
Sbjct: 112 D------ILELPLNIGLLLFDEAHHSSASS-YRRLIEHYQEAQILGVTATPQRIDGQGFQ 164
Query: 177 KGEGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKN-GDFNAKALKARV 235
+ + ++ G +T LI+ G+LSKF R T+ + + K+ GDF AK L V
Sbjct: 165 E----LFDDLVVGISTSTLIKEGYLSKF--RLFTTNQTISTIGVKKSRGDFRAKELAVAV 218
Query: 236 VESHIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDH 295
V ++ Q+Y K+ +N R ++FA +E + +A +FRR G A + GET R
Sbjct: 219 TSQIGVDEIFQNYLKYAKNLRTVIFACSLEHSRALAAKFRRNGIKAEHLDGETPPELRVQ 278
Query: 296 LLAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYA 355
+L +F G QV+ N ++ EG+D P +EC+ RPT+S +LQM GR LR K A
Sbjct: 279 ILERFRGGETQVITNYEILTEGYDCPNIECIYCVRPTESSTLWLQMTGRVLRTHTLKPTA 338
Query: 356 IIIDPVRNWERHGPPTAVRNWSLMGREKGQGSGGDTIPQKTCDACTVP------------ 403
+IID NW++HG P R WSL + S I C P
Sbjct: 339 VIIDVTDNWKKHGLPDEQRQWSLEAKTLTASSSLGLIQCPHCTHAFKPLSHELAIVDGEI 398
Query: 404 ------YEAYYKACPYCGNVNEIADRSAPDKV 429
E + CP CG E + + +
Sbjct: 399 GEDGLLIEHHEAICPNCGQAVEFTTKETTEPI 430
>ref|NP_471919.1| (NC_003212) similar to DEAH-family helicase [Listeria innocua]
emb|CAC97816.1| (AL596173) similar to DEAH-family helicase [Listeria innocua]
Length = 418
Score = 188 bits (477), Expect = 2e-046
Identities = 129/388 (33%), Positives = 194/388 (49%), Gaps = 47/388 (12%)
Query: 32 GAGKTVIFSKIIHDHT---GAAAAIVHRREIVAQISLSLASFGVKHRVIAPAKTLKTIRK 88
GAGK+VI S+II T +VHR+E++ QI +L V + +
Sbjct: 9 GAGKSVILSEIIRMTTRNKNNVLFLVHRKELIDQIRNTLTMNDVDMKFV----------- 57
Query: 89 KHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLNNAAIMRWVNQVTLGVYDEGHHYVEQGIW 148
+ VQT+ + + ++ L + DE HH + +
Sbjct: 58 -----------------NLGMVQTVVRR--------LEKTSEPALIIIDESHHVLANS-Y 91
Query: 149 AKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVMIEGPTTHWLIENGFLSKFVYRA 208
K ++ F NAK++ TATP R +G GLG +++IE WLIEN FL+ + Y A
Sbjct: 92 KKIINHFSNAKVVGFTATPVRINGGGLGD----INDMLIEKVNVKWLIENQFLAPYKYFA 147
Query: 209 PQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVETAE 268
P+ + L + G+F+ L + + I GDV++HY+K ++AI++A+ + +E
Sbjct: 148 PEI-VQTETLDIKRTGEFDMTGLDDQFNKRMIWGDVIKHYQKLANGEQAILYASSLYQSE 206
Query: 269 QMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVIL 328
+MA F G T+A + G+T + RD ++ +F EG I+VL N+DL EGFDVP VI+
Sbjct: 207 KMAMSFESVGITSAHIDGKTPKSIRDDIIQRFREGEIKVLCNLDLIGEGFDVPDCSTVIM 266
Query: 329 ARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRNWSLMGREKGQGSG 388
RPTQSL+ ++Q R +R GK AIIID V N R G P R WSL + KG S
Sbjct: 267 LRPTQSLSLYIQQSMRGMRYRTGKT-AIIIDHVGNVNRFGLPDMERKWSLEAK-KGSNSN 324
Query: 389 GDTIPQKTCDACTVPYEAYYKACPYCGN 416
P K C C + + C +CG+
Sbjct: 325 KAEAPVKICPDCFMTVLSTNIKCSHCGH 352
>gb|AAL74161.1|AF442520_1 (AF442520) putative helicase [Streptococcus thermophilus
bacteriophage k3]
Length = 443
Score = 187 bits (476), Expect = 3e-046
Identities = 144/481 (29%), Positives = 235/481 (47%), Gaps = 62/481 (12%)
Query: 3 KLRDYQQLAYNRI-MAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HRRE 58
+LR+YQ N I + + ++ P +GKTV+ + I T ++ HR+E
Sbjct: 2 ELRNYQNDLVNGIKQSILRGNKRIMVQSPPRSGKTVVMAHIAKGATDKGNTVLFFSHRKE 61
Query: 59 IVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKST 118
I Q+ + FK+ G V + +T S
Sbjct: 62 INEQVVNT------------------------FKRNG------------VDMNLVTIDSV 85
Query: 119 LNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKG 178
A + +++ ++ + DE HH V+ + K + + N+ +L T TP R DG G
Sbjct: 86 TKVARNLDRISEPSIILIDEAHH-VKAKTYLKIIEYYINSIVLMFTGTPARLDGSGFDD- 143
Query: 179 EGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVES 238
A+ ++ G + WL ENG ++ F Y AP ID +L + G+F K++ ++
Sbjct: 144 ---IADDIVLGKSVKWLQENGNIAPFKYYAPSL-IDTTNLKK-RGGEFTKKSVDD-TMKR 197
Query: 239 HIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLA 298
I GDV++HY K + K+AIV+ VE +E ++ F++ GYT+ ++SG+T R+ +
Sbjct: 198 VIYGDVIRHYEKLAKGKQAIVYTHSVEASESVSNTFKKNGYTSIAISGKTPPEVRERAMQ 257
Query: 299 QFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIII 358
F + + ++VN +LF EG D+P V+ I+ RPTQSL+ +LQ RAL EGK AIII
Sbjct: 258 TFRDRELTIMVNCELFTEGIDLPNVDVCIMLRPTQSLSLYLQFAMRALNPREGKT-AIII 316
Query: 359 DPVRNWERHGPPTAVRNWSLMGREKGQGS---GGDTIPQKTCDACTVPYEAYYKACPYCG 415
D V N +RHG P A R WSL G K + G T +TCD C + + + CP CG
Sbjct: 317 DHVGNVDRHGLPNADREWSLKGINKTKKKLKLGEPT--TRTCDECYATFWSAERICPLCG 374
Query: 416 NVNEIADRSAPDKVDGDLV-ELDL-DAWNALFAEIERAKMSDDDYELDMIKRGVPQVGRG 473
+ N+ P K + +++ E++L + + +++E SD ++ +K Q G
Sbjct: 375 HENQ------PTKEEIEIIREIELEERRQEVASKVETFVTSDQCQSVEELKEFAKQHGYK 428
Query: 474 P 474
P
Sbjct: 429 P 429
>sp|P24125|V51K_BPL79 51.5 kDa protein
pir||JU0445 51K protein GP51C - Lactococcus phage mi7-9
gb|AAA32363.1| (M36388) conserved bacteriophage encoded protein [Lactococcus
bacteriophage (ISOLATE 7-9)]
Length = 452
Score = 179 bits (454), Expect = 1e-043
Identities = 133/410 (32%), Positives = 195/410 (47%), Gaps = 53/410 (12%)
Query: 30 PTGAGKTVIFS---KIIHDHTGAAAAIVHRREIVAQISLSLASFGVKHRVIAPAKTLKTI 86
P G+GK+V+ S K+ + G +VHR+E++ QI+ S GV
Sbjct: 5 PPGSGKSVVISEITKLATEKGGRVLFLVHRKELIDQITNSFKVHGVP------------- 51
Query: 87 RKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLNNAAIMRWVNQVTLGVYDEGHHYVEQG 146
+ V+ LT N +++R + TL + DEGHH +
Sbjct: 52 --------------------LNQVELLTVGKAKNRLSVLR---KPTLIITDEGHHG-KAA 87
Query: 147 IWAKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVMIEGPTTHWLIENGFLSKFVY 206
+ K + + L TATP R G G + MIEG T WLI N L+ + Y
Sbjct: 88 TYQKIYEFYADVPRLGFTATPWRMSGDGFKDTY----DYMIEGKTVEWLINNKRLAPYQY 143
Query: 207 RA-PQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFATDVE 265
+ P DI + KNGD++ +++ + ++ I GDVVQ Y K ++AI++A VE
Sbjct: 144 YSLPSIDISKLRI---KNGDYSNQSIDDALGKT-IFGDVVQEYVKHANGQKAILYAHSVE 199
Query: 266 TAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFDEGFDVPAVEC 325
++ A+EF+ AG A V +T + ERD L+ F G I+VL NVDL EGFDVP
Sbjct: 200 ASQSFAKEFQEAGIKAVHVDAKTPKNERDKLMLDFRNGRIKVLCNVDLISEGFDVPDCTV 259
Query: 326 VILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRNWS--LMGREK 383
IL RPT+SL FLQ R++R K AIIID V NW HG P +W G K
Sbjct: 260 TILCRPTKSLVLFLQQSMRSMRYQHRKT-AIIIDNVMNWHAHGLPDTHHDWKDYFEGGWK 318
Query: 384 GQGSGGDTIPQKTCDACTVPYEAYYKACPYCGNVNEIADRSAPDKVDGDL 433
+G + + K C C+ + K C CG+ I ++ +++ +L
Sbjct: 319 KKGQ-KNIVQAKQCPDCSAMWPLSQKMCNLCGHDFSIEEKHEKLRLEAEL 367
>ref|NP_490263.1| (NC_003276) putative helicase [Nostoc sp. PCC 7120]
dbj|BAB78241.1| (AP003600) ORF_ID:alr7157~putative helicase [Nostoc sp. PCC 7120]
Length = 629
Score = 170 bits (430), Expect = 6e-041
Identities = 129/421 (30%), Positives = 194/421 (45%), Gaps = 56/421 (13%)
Query: 1 MFKLRDYQQLAYNRIMAAWEK-YRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIV---HR 56
MF LRDYQQ ++ AAW R VL L TG GKTVIF++I T ++ HR
Sbjct: 1 MFNLRDYQQDLISKTFAAWSSGIRKVLLQLSTGGGKTVIFAEIASKMTAQGEGVLVVAHR 60
Query: 57 REIVAQISLSLASFGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSK 116
E++ Q + ++ A K I K +K ++P + S+QTL +
Sbjct: 61 EELILQAA---------EKLTAVTKLQPGIIKAGYKS-------TDSPLQIASIQTLARR 104
Query: 117 STLNNAAIMRWVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLG 176
T +A ++ + DE HH + K + + +A +L +TATP R DG GL
Sbjct: 105 QTYPSAQLV---------IIDEAHHSSANS-YRKLLDAYPHALVLGLTATPRREDGYGLR 154
Query: 177 KGEGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVV 236
+ +I +T LI G+L+ + A + DF K L+ V
Sbjct: 155 D----IFDHLICSVSTKELITLGYLTDYKLIA---GFKYSRHKVPQKRDFTRKELE-EVA 206
Query: 237 ESHIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHL 296
+ +V++ ++ + K+ ++FA +V ++Q+A F G T + GET ER +
Sbjct: 207 SDYKPSEVLKQWQNFCAGKKTVIFAVNVMHSKQIAAAFCADGITCEHLDGETPHNERQAI 266
Query: 297 LAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAI 356
L +F G QV+ N + EGFD P ++ARPT S+ +LQM+GR LR GK+YA
Sbjct: 267 LNRFRSGQTQVISNCAILTEGFDCPDSSAAVIARPTSSVTLWLQMIGRVLRPAPGKDYAT 326
Query: 357 IIDPVRNWERHGPPTAVRNWSLMGREKGQGSGGDTIPQKTCDACTVPYEAYYKACPYCGN 416
I+D NW R G P R WSL +CD P + CP+C +
Sbjct: 327 ILDMTDNWFRLGRPCDNRKWSL--------------EPVSCD----PDTQGSRCCPHCHH 368
Query: 417 V 417
V
Sbjct: 369 V 369
>ref|NP_607398.1| (NC_003485) hypothetical phage protein [Streptococcus pyogenes
MGAS8232]
gb|AAL97897.1| (AE010051) hypothetical phage protein [Streptococcus pyogenes
MGAS8232]
Length = 455
Score = 169 bits (428), Expect = 1e-040
Identities = 124/414 (29%), Positives = 196/414 (46%), Gaps = 50/414 (12%)
Query: 25 VLAVLPTGAGKTVIFSKIIHDHT---GAAAAIVHRREIVAQISLSLASFGVKHRVIAPAK 81
++ V P G+GK+V+ S I T G +VHR+E++ QI+ S G+ +
Sbjct: 1 MMIVSPPGSGKSVVISDIAKSATQKNGHVLFLVHRKELIDQITNSFKFHGIDMNKV---- 56
Query: 82 TLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLNNAAIMRWVNQVTLGVYDEGHH 141
L T+ K +K+ L+ + + TL + DEGHH
Sbjct: 57 DLMTVGK--------------------------AKNRLDK------LTKPTLIITDEGHH 84
Query: 142 YVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVMIEGPTTHWLIENGFL 201
+ + F + + TATP R G G +VM+ G T WLI N L
Sbjct: 85 G-KASTYQIIYEYFSDVPRIGFTATPWRLSGDGFTDTY----DVMVLGKTVEWLINNNKL 139
Query: 202 SKFVYRAPQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENKRAIVFA 261
+ + Y + + ID L +NGD++ K++ + I GDVVQ Y K ++AI++A
Sbjct: 140 APYDYYSVLS-IDTAKLKV-QNGDYSNKSIDESFGKK-IFGDVVQEYIKKANGQKAILYA 196
Query: 262 TDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFDEGFDVP 321
VE ++ A+EF+ G A +T + +RD ++ F +G IQV+ NVDL EGFDVP
Sbjct: 197 HSVEASQAFAKEFQSMGINAIHADAKTPKAKRDKIMKDFRDGKIQVICNVDLISEGFDVP 256
Query: 322 AVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVRNWERHGPPTAVRNWS--LM 379
IL RPT+SL FLQ R++R K+ AII+D V NW HG P +W
Sbjct: 257 DCTVTILCRPTKSLVLFLQQSMRSMRYQPNKK-AIILDHVGNWNIHGLPDTPHHWENYFR 315
Query: 380 GREKGQGSGGDTIPQKTCDACTVPYEAYYKACPYCGNVNEIADRSAPDKVDGDL 433
G K + + +T+ K C C+ + + C C + + ++ ++++ +L
Sbjct: 316 GGWKKESNKTNTVHAKECPVCSALWPLSQQLCELCNHDFGLKEKQEKERIEAEL 369
>ref|NP_459877.1| (NC_003197) Fels-1 prophage; putative DNA or RNA helicases of
superfamily II [Salmonella typhimurium LT2]
gb|AAL19836.1| (AE008738) Fels-1 prophage; putative DNA or RNA helicases of
superfamily II [Salmonella typhimurium LT2]
Length = 527
Score = 120 bits (300), Expect = 8e-026
Identities = 111/421 (26%), Positives = 178/421 (41%), Gaps = 62/421 (14%)
Query: 19 WEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIVHRREIVAQISLSLASFGVKHRVIA 78
W+ + + + PTG+GK TG AA IV S G++
Sbjct: 6 WKSHNTFMVYAPTGSGK-----------TGLAAFIVD----------GFVSRGMRVMFCV 44
Query: 79 PAKTLKTIRKKHFKKYGKS-----FI-------DPNAPCGVVSVQTLTSKSTLNNAAIMR 126
P + L T F +YG S ++ +P + S TL + +N
Sbjct: 45 PYQILITQTASRFIEYGLSGDEIGYVWASHPNHNPALKIQIASADTLIRREFPDN----- 99
Query: 127 WVNQVTLGVYDEGHHYVEQGIWAKGVHVFENAKLLFVTATPERADGVGLGKGEGGFAEVM 186
+ L + DE H ++ + K++ ++ TP G + + +
Sbjct: 100 ----IDLLIIDEAHLRKKRILQDIERLRASGVKVIGLSGTP-------FSPFLGKYYDRL 148
Query: 187 IEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKN----GDFNAKALKARVVESHIVG 242
I+ T LI+ G LSK+ + AP T DL+ + T + D+N L + S +VG
Sbjct: 149 IKPTTIGELIQRGDLSKYEFYAP-TKPDLKGVKTKSSLEYGSDYNEAQLAEIMCGSTLVG 207
Query: 243 DVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEE 302
D+VQ++ + G + I F +V A + +F AG A ++ +T ER ++ +FE
Sbjct: 208 DIVQNWLENGRDLPTIAFCVNVAHANYLTIQFNLAGINAEVMTADTPVDERQTIIHRFET 267
Query: 303 GAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEYAIIIDPVR 362
GA +++V+V + GFD V C+I ARPT+S ++LQ +GR LR GKE +I D
Sbjct: 268 GATKIIVSVGVLVAGFD-SDVRCIIYARPTKSEIRWLQAIGRGLRTAPGKESCLIFDHSG 326
Query: 363 NWERHGPPTAVRNWSLMGREKGQGSGGDTIPQ-------KTCDACTVPYEAYYKACPYCG 415
R G P ++ L G+ G Q + C C A CP CG
Sbjct: 327 TVHRLGYPDSIEYDDLPGKSDGMEESARRSSQERAEKLPRECSQCHYMKPAGVYVCPKCG 386
Query: 416 N 416
+
Sbjct: 387 H 387
>ref|NP_142208.1| (NC_000961) hypothetical protein [Pyrococcus horikoshii]
pir||H71243 probable helicase protein PH0210 - Pyrococcus horikoshii
dbj|BAA29279.1| (AP000001) 444aa long hypothetical protein [Pyrococcus horikoshii]
Length = 444
Score = 85.1 bits (209), Expect = 3e-015
Identities = 99/365 (27%), Positives = 164/365 (44%), Gaps = 49/365 (13%)
Query: 2 FKLRDYQQLAYNRIMAAWEKYRSVLAVLPTGAGKTVIFSKIIHDHTGAAAAIVHRREIVA 61
+KLR YQ+ A + A ++ VLA LP G+GKTV+ +IIH+ +A IVH +E++
Sbjct: 93 YKLRKYQKKA---VKLAIKEKMGVLA-LPVGSGKTVVGLRIIHEINKSALVIVHTKELLY 148
Query: 62 QISLSLAS-FGVKHRVIAPAKTLKTIRKKHFKKYGKSFIDPNAPCGVVSVQTLTSKSTLN 120
Q + + GV+ +I K P V +QTL S+ T
Sbjct: 149 QWANKVREILGVEPGIIGDNKW------------------SEGPITVAMIQTLLSRGT-- 188
Query: 121 NAAIMRWVNQVTLGVYDEGHH------YVEQGIWAKGVHVFENAKLLFVTATPERADGVG 174
+ N+ + ++DE H + + GI V+ F ++ATP R
Sbjct: 189 ----DKLQNKYAIVMFDECHRTSAAEKFYKVGISLPQVYRFG------LSATPWRRLRGE 238
Query: 175 LGKGEGGFAEVMIEGPTTHWLIENGFLSKFVYRAPQTDIDLRDLATGKNGDFNAKALKAR 234
K EG ++ E LI+ GFL+K + + D + LA K + +A+
Sbjct: 239 EMKIEGVVGPIIYEVKAED-LIKEGFLAKPKFEVIEYDSKMPALA-DKYKELYEEAVMEN 296
Query: 235 VVESHIVGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERD 294
+ + + K G R ++ ++ E + + R G A +S ++ R
Sbjct: 297 EERNRAIVEKAIELAKQGH--RVLIDVKRIDHGEILVKMLRDRGINAEFLSSQSP--NRW 352
Query: 295 HLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKEY 354
+L ++++G I VLV+ L EG D+P + +ILA +S +Q +GRALR G E
Sbjct: 353 EILEKYKKGEIPVLVST-LLKEGVDIPEISAIILAGGGKSDVMTIQTIGRALRPKPGGE- 410
Query: 355 AIIID 359
A+I+D
Sbjct: 411 AVIVD 415
>ref|NP_599062.1| (NC_003444) unknown [Shigella flexneri bacteriophage V]
gb|AAL89432.1| (U82619) unknown [Shigella flexneri bacteriophage V]
Length = 178
Score = 82.8 bits (203), Expect = 1e-014
Identities = 53/125 (42%), Positives = 68/125 (54%), Gaps = 4/125 (3%)
Query: 569 DKFDLPIKTIAYHLSHINRFNGAVGQ-YSVAQHCVQVXXXXXXXXXXXXXXHDATEAVLC 627
+K D+ I IA LS+I RF G + YSVAQH V HDATEA
Sbjct: 18 NKDDIDINDIAVSLSNICRFAGHLSHFYSVAQHAVLCSQLVPQEFAFEALMHDATEAYCQ 77
Query: 628 DVPAPLKRMLPDYQAIENRLQDAVDARFKV-KTRHKRVREADLSMLAAEARDFGLDLGPL 686
D+PAPLKR+LPDY+ +E ++ + ++ + V+ ADL MLA E RD GLD G
Sbjct: 78 DIPAPLKRLLPDYKQMEEKIDAVIREKYGLPPVMSTPVKYADLIMLATERRDLGLDDG-- 135
Query: 687 GFEPV 691
F PV
Sbjct: 136 SFWPV 140
>ref|NP_602261.1| (NC_003450) COG1061:DNA or RNA helicases of superfamily II
[Corynebacterium glutamicum]
dbj|BAC00463.1| (AP005283) Restriction enzymes type I helicase subunits and related
helicases [Corynebacterium glutamicum ATCC 13032]
Length = 1643
Score = 65.9 bits (159), Expect = 2e-009
Identities = 72/255 (28%), Positives = 100/255 (38%), Gaps = 52/255 (20%)
Query: 158 AKLLFVTATPERADGVGLGKGEGGFAEVM------IEGPTTHWL-----IENGFLSKFVY 206
AK L++TATP D GK AEV I GP H L +E G L+ +
Sbjct: 360 AKRLYMTATPRLFDDSVKGKAADHSAEVSSMDDEAIYGPEFHRLGFGEAVEKGLLTDYKV 419
Query: 207 RAPQTDIDLRDLATGKNGDFNAKALKARVVESHIVGDVVQHYRKWGENK----------- 255
D + A G + L + S I+G ++ G+ +
Sbjct: 420 VVMTVDEQVAASALTVLGSTPGEELTLDMT-SAIIGAWNGLAKRSGKEQDTKTGFSSSDA 478
Query: 256 ---RAIVFATDVETAEQMAEEFRRA--GYTAA--------------------SVSGETEQ 290
RA+ FA D++T++Q+AE F R YT V G
Sbjct: 479 AMERAVAFARDIKTSQQIAESFPRVVNAYTTELEVKNDDVDEHNLNLSVACQHVDGSMNA 538
Query: 291 GERDHLL----AQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRAL 346
ER+ L A + ++L N EG DVPA++ VI P S+ +Q VGR +
Sbjct: 539 LERNSRLTWLKAPTQSMETKILTNARCLSEGVDVPALDSVIFFNPRNSMVDVVQSVGRVM 598
Query: 347 RIMEGKEYAIIIDPV 361
R GK Y II PV
Sbjct: 599 RKSPGKNYGYIILPV 613
>ref|XP_132906.1| (XM_132906) similar to KIAA0111 gene product [Mus musculus]
Length = 411
Score = 57.4 bits (137), Expect = 6e-007
Identities = 26/92 (28%), Positives = 51/92 (55%)
Query: 256 RAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFD 315
+A++F + + E+ R A +T +S+ G+ Q ER+ ++ +F GA +VL++ D++
Sbjct: 279 QAVIFCITKRKVDWLTEKMREANFTVSSMHGDMPQKERESIMKEFRSGASRVLISTDVWA 338
Query: 316 EGFDVPAVECVILARPTQSLAKFLQMVGRALR 347
G DVP V +I + ++ +GR+ R
Sbjct: 339 RGLDVPQVSLIINYDLPNNRELYIHRIGRSGR 370
>ref|NP_248512.1| (NC_000909) ATP dependent RNA helicase, putative [Methanococcus
jannaschii] [Methanocaldococcus jannaschii]
sp|Q58900|YF05_METJA Putative ATP-dependent RNA helicase MJ1505
gb|AAB99518.1| (U67591) ATP dependent RNA helicase, putative [Methanococcus
jannaschii] [Methanocaldococcus jannaschii]
Length = 778
Score = 55.1 bits (131), Expect = 3e-006
Identities = 36/124 (29%), Positives = 64/124 (51%), Gaps = 8/124 (6%)
Query: 241 VGDVVQHYRKWGENKRAIVFATDVETAEQMAEEFRRAGYTAASVSGETE-------QGER 293
V D+V++ + +++R I+FA +T E++ + G A G+ Q E+
Sbjct: 339 VVDMVKNILEKNKDERIIIFAQYRDTVEKIVNLLTQNGIKAIRFIGQANKEGKGMSQKEQ 398
Query: 294 DHLLAQFEEGAIQVLVNVDLFDEGFDVPAVECVILARPTQSLAKFLQMVGRALRIMEGKE 353
+ +F++ VLV+ + +EG D+P+V +I P S +F+Q GRA+R GK
Sbjct: 399 IEAIERFKKEG-SVLVSTSVSEEGIDIPSVNYIIFYEPVPSEIRFIQRRGRAMRGEGGKV 457
Query: 354 YAII 357
Y +I
Sbjct: 458 YVLI 461
>sp|O02494|IF4A_CRYPV Eukaryotic initiation factor 4A (eIF-4A) (eIF4A)
gb|AAB58726.1| (AF001211) translation initiation factor [Cryptosporidium parvum]
gb|AAB58799.1| (AF001378) translation initiation factor [Cryptosporidium parvum]
Length = 405
Score = 54.3 bits (129), Expect = 5e-006
Identities = 32/117 (27%), Positives = 59/117 (50%), Gaps = 5/117 (4%)
Query: 256 RAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFD 315
+AI++ +Q+ ++ R +T +S+ G+ +Q +R+ ++ QF G+ +VL+ DL
Sbjct: 272 QAIIYCNTRRRVDQLTKQMRERDFTCSSMHGDMDQKDREVIMRQFRSGSSRVLITTDLLA 331
Query: 316 EGFDVPAVECVILARPTQSLAKFLQMVGRALRI-MEGKEYAIIIDP----VRNWERH 367
G DV V VI S ++ +GR+ R +G + D +R+ ERH
Sbjct: 332 RGIDVQQVSLVINYDLPVSPETYIHRIGRSGRFGKKGVSINFVTDDDIVCLRDIERH 388
>ref|NP_540200.1| (NC_003317) Hypothetical Cytosolic Protein [Brucella melitensis]
gb|AAL52464.1| (AE009566) Hypothetical Cytosolic Protein [Brucella melitensis]
Length = 206
Score = 53.1 bits (126), Expect = 1e-005
Identities = 51/173 (29%), Positives = 74/173 (42%), Gaps = 25/173 (14%)
Query: 566 LSGDKFDLPIKTIAYHLSHINRFNG-AVGQ--YSVAQHCVQVXXX-------XXXXXXXX 615
L D+ I+ IA+ L+ + R+NG VG+ +SVAQH + V
Sbjct: 26 LDPSPLDVEIEDIAHGLARVARWNGQTVGEHAFSVAQHSLLVDQIFNRLVPGASIEWQML 85
Query: 616 XXXHDATEAVLCDVPAPLKRMLP-DYQAIENRLQDAVDARF--------KVKTRHKRVRE 666
HDA E V+ D+ +P K ++ +Y+ IE RL+ A+ RF K+KT KR +
Sbjct: 86 ALLHDAPEYVIGDMISPFKAVMGGNYKLIETRLESAIHLRFSLPVSPPAKLKTLIKRADQ 145
Query: 667 ADLSMLAAEARDFGLDLG------PLGFEPVSATIKPWPAKTAEQAWLAAFHA 713
A F G P G PV I P P + + +L F A
Sbjct: 146 VAAFFEATRLAGFTEAEGVKYFGRPRGLNPVGLDIVPRPTQDVQADFLKRFSA 198
>ref|NP_531749.1| (NC_003304) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAL42065.1| (AE009069) conserved hypothetical protein [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 207
Score = 48.5 bits (114), Expect = 3e-004
Identities = 48/176 (27%), Positives = 75/176 (42%), Gaps = 31/176 (17%)
Query: 566 LSGDKFDLPIKTIAYHLSHINRFNGAVG---QYSVAQHCVQVXXX-------XXXXXXXX 615
L D+ I IA+ L+ + R+NG ++VAQHC+ V
Sbjct: 22 LDPSPLDVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQM 81
Query: 616 XXXHDATEAVLCDVPAPLKRML-PDYQAIENRLQDAVDARFKV-----KTRHKRVREADL 669
HDA E V+ D+ +P K ++ Y+ +E RL+ AV RF + + R+++AD
Sbjct: 82 ALLHDAPEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADT 141
Query: 670 SMLAAEARD------------FGLDLGPLGFEPVSATIKPWPAKTAEQAWLAAFHA 713
EA + FGL P G I P P+ A++ ++A F A
Sbjct: 142 VAAFFEATELAGFSTAEAQKFFGL---PRGITRDMFDIIPLPSTEAQRLFIARFEA 194
>ref|NP_354075.1| (NC_003062) AGR_C_1942p [Agrobacterium tumefaciens] [Agrobacterium
tumefaciens str. C58 (Cereon)]
gb|AAK86860.1| (AE008035) AGR_C_1942p [Agrobacterium tumefaciens str. C58
(Cereon)]
Length = 226
Score = 48.5 bits (114), Expect = 3e-004
Identities = 48/176 (27%), Positives = 75/176 (42%), Gaps = 31/176 (17%)
Query: 566 LSGDKFDLPIKTIAYHLSHINRFNGAVG---QYSVAQHCVQVXXX-------XXXXXXXX 615
L D+ I IA+ L+ + R+NG ++VAQHC+ V
Sbjct: 41 LDPSPLDVEIADIAHGLARVARWNGQTRGDHAFTVAQHCLIVETIFCRMCPGATPDEMQM 100
Query: 616 XXXHDATEAVLCDVPAPLKRML-PDYQAIENRLQDAVDARFKV-----KTRHKRVREADL 669
HDA E V+ D+ +P K ++ Y+ +E RL+ AV RF + + R+++AD
Sbjct: 101 ALLHDAPEYVIGDMISPFKSVVGGGYKTVEKRLEAAVHLRFGLPPHASRELKDRIKKADT 160
Query: 670 SMLAAEARD------------FGLDLGPLGFEPVSATIKPWPAKTAEQAWLAAFHA 713
EA + FGL P G I P P+ A++ ++A F A
Sbjct: 161 VAAFFEATELAGFSTAEAQKFFGL---PRGITRDMFDIIPLPSTEAQRLFIARFEA 213
>gb|AAB67607.1| (U73459) translational initiation factor eIF-4A [Zea mays]
Length = 414
Score = 48.5 bits (114), Expect = 3e-004
Identities = 26/93 (27%), Positives = 48/93 (50%), Gaps = 2/93 (2%)
Query: 256 RAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFD 315
++++F + + ++ R +T ++ G+ +Q RD ++ +F G+ +VL+ DL
Sbjct: 282 QSVIFVNTRRKVDWLTDKMRSRDHTVSATHGDMDQNTRDIIMREFRSGSSRVLITTDLLA 341
Query: 316 EGFDVPAVECVI-LARPTQSLAKFLQMVGRALR 347
G DV V VI PTQ +L +GR+ R
Sbjct: 342 RGIDVQQVSLVINYDLPTQP-ENYLHRIGRSGR 373
>ref|NP_385209.1| (NC_003047) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
emb|CAC45682.1| (AL591786) CONSERVED HYPOTHETICAL PROTEIN [Sinorhizobium meliloti]
Length = 202
Score = 47.4 bits (111), Expect = 6e-004
Identities = 47/174 (27%), Positives = 74/174 (42%), Gaps = 28/174 (16%)
Query: 566 LSGDKFDLPIKTIAYHLSHINRFNGAVG---QYSVAQHCVQVXXXXXXXXX------XXX 616
L D+ + IA+ L+ + R+NG +SVAQH + V
Sbjct: 19 LDPSPLDVELSDIAHGLARVARWNGQTSGDHAFSVAQHSLVVEEIFRRTNRCDAQDCLMA 78
Query: 617 XXHDATEAVLCDVPAPLKRML-PDYQAIENRLQDAVDARFKV-----KTRHKRVREAD-- 668
HDA E V+ D+ +P K ++ Y+ +E RL++AV RF + + +R+++AD
Sbjct: 79 LLHDAPEYVIGDMISPFKAVVGGGYKTVEKRLENAVHLRFGLPAHTPRELKERIKKADRI 138
Query: 669 ---------LSMLAAEARDFGLDLGPLGFEPVSATIKPWPAKTAEQAWLAAFHA 713
A EAR F P G I P PA A++ + F+A
Sbjct: 139 AAYFEATELAGFSAEEARKFFGQ--PRGITREMLLIDPLPAVEAQRLFATRFNA 190
>emb|CAB38640.1| (Y17288) RNA helicase [Plasmodium falciparum]
Length = 182
Score = 44.7 bits (104), Expect = 0.004
Identities = 20/72 (27%), Positives = 39/72 (53%)
Query: 256 RAIVFATDVETAEQMAEEFRRAGYTAASVSGETEQGERDHLLAQFEEGAIQVLVNVDLFD 315
++I++ + + + +E +T + + G+ +Q +RD ++ +F G+ +VLV DL
Sbjct: 95 QSIIYCNTRKKVDILTQEMHNRLFTVSCMHGDMDQKDRDLIMREFRSGSTRVLVTTDLLA 154
Query: 316 EGFDVPAVECVI 327
G DV V VI
Sbjct: 155 RGIDVQQVSLVI 166
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.321 0.137 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 449,517,236
Number of Sequences: 1028106
Number of extensions: 18833824
Number of successful extensions: 44756
Number of sequences better than 5.0e-02: 765
Number of HSP's better than 0.1 without gapping: 596
Number of HSP's successfully gapped in prelim test: 169
Number of HSP's that attempted gapping in prelim test: 43629
Number of HSP's gapped (non-prelim): 1156
length of query: 718
length of database: 324,535,921
effective HSP length: 128
effective length of query: 590
effective length of database: 192,938,353
effective search space: 113833628270
effective search space used: 113833628270
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 95 (41.2 bits)
Query= orf46 [13051-13221]
(169 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 105,108,614
Number of Sequences: 1028106
Number of extensions: 3728264
Number of successful extensions: 7212
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7212
Number of HSP's gapped (non-prelim): 0
length of query: 169
length of database: 324,535,921
effective HSP length: 112
effective length of query: 57
effective length of database: 209,388,049
effective search space: 11935118793
effective search space used: 11935118793
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 87 (38.1 bits)
Query= orf47 [13236-13348]
(111 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_215195.1| (NC_000962) hypothetical protein Rv0681 [M... 47 4e-005
ref|NP_335121.1| (NC_002755) transcriptional regulator, put... 47 4e-005
emb|CAC95193.1| (AJ416905) hypothetical transcriptional reg... 43 6e-004
ref|NP_266283.1| (NC_002662) transcription regulator [Lacto... 43 8e-004
ref|NP_386558.1| (NC_003047) PUTATIVE TRANSCRIPTION REGULAT... 41 0.002
ref|NP_421917.1| (NC_002696) transcriptional regulator, Tet... 41 0.003
ref|NP_626492.1| (NC_003888) putative TetR-family transcrip... 40 0.004
sp|P51561|TER8_PASMU TETRACYCLINE REPRESSOR PROTEIN CLASS H... 40 0.005
gb|AAD12754.1| (AF038993) repressor protein [Proteus mirabi... 40 0.005
ref|NP_421590.1| (NC_002696) transcriptional regulator, Tet... 40 0.005
ref|NP_149192.1| (NC_001988) HTH transcriptional regulator ... 40 0.006
ref|NP_355336.1| (NC_003062) AGR_C_4325p [Agrobacterium tum... 40 0.006
emb|CAA75662.1| (Y15510) tetracycline resistance repressor ... 39 0.008
ref|NP_511232.1| (NC_003292) tetracycline repressor protein... 39 0.014
ref|NP_628554.1| (NC_003888) putative tetR-family transcrip... 38 0.019
ref|NP_630227.1| (NC_003888) putative transcriptional regul... 37 0.032
>ref|NP_215195.1| (NC_000962) hypothetical protein Rv0681 [Mycobacterium tuberculosis
H37Rv]
pir||B70827 hypothetical protein Rv0681 - Mycobacterium tuberculosis (strain
H37RV)
emb|CAA17464.1| (AL021943) hypothetical protein Rv0681 [Mycobacterium tuberculosis
H37Rv]
Length = 196
Score = 47.0 bits (110), Expect = 4e-005
Identities = 33/95 (34%), Positives = 49/95 (50%), Gaps = 6/95 (6%)
Query: 8 RPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRG 67
RPA R E IV+ L R GW+ +T A+A +G PSL H ++ L+RA+
Sbjct: 3 RPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAVRIR 61
Query: 68 AIKADDGLVVAMAVKAGDFQRAKVDELLYIRGEHQ 102
I DD + + V AG RA+ D +L + G ++
Sbjct: 62 VI--DDIITMLNRVGAG---RARDDAVLVMAGAYR 91
>ref|NP_335121.1| (NC_002755) transcriptional regulator, putative [Mycobacterium
tuberculosis CDC1551]
gb|AAK44935.1| (AE006965) transcriptional regulator, putative [Mycobacterium
tuberculosis CDC1551]
Length = 218
Score = 47.0 bits (110), Expect = 4e-005
Identities = 33/95 (34%), Positives = 49/95 (50%), Gaps = 6/95 (6%)
Query: 8 RPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRG 67
RPA R E IV+ L R GW+ +T A+A +G PSL H ++ L+RA+
Sbjct: 25 RPAKLSR-ESIVEGALTFLDREGWDSLTINALATQLGTKGPSLYNHVDSLEDLRRAVRIR 83
Query: 68 AIKADDGLVVAMAVKAGDFQRAKVDELLYIRGEHQ 102
I DD + + V AG RA+ D +L + G ++
Sbjct: 84 VI--DDIITMLNRVGAG---RARDDAVLVMAGAYR 113
>emb|CAC95193.1| (AJ416905) hypothetical transcriptional regulator [Rhizobium
leguminosarum]
Length = 216
Score = 43.1 bits (100), Expect = 6e-004
Identities = 21/64 (32%), Positives = 38/64 (58%)
Query: 2 AKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
AK + + ++EQI++A L V G+ G T +AE G++ P+L+Y++RT +
Sbjct: 7 AKTLRRTRIQEEKEEQILEAALDVFSASGFRGSTIDQIAEVAGMSKPNLLYYFRTKEAMH 66
Query: 62 RAIV 65
RA++
Sbjct: 67 RALI 70
>ref|NP_266283.1| (NC_002662) transcription regulator [Lactococcus lactis subsp.
lactis]
gb|AAK04225.1|AE006251_1 (AE006251) transcription regulator [Lactococcus lactis subsp.
lactis]
Length = 191
Score = 42.7 bits (99), Expect = 8e-004
Identities = 18/52 (34%), Positives = 36/52 (68%)
Query: 12 AVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRA 63
A+ +++IVQA + +A ++G +++ +AE+ GI APSL H++ M +++ A
Sbjct: 5 ALTRDKIVQATIELAGKIGLSNVSFPRLAEYFGIKAPSLYNHFKNMEEVRVA 56
>ref|NP_386558.1| (NC_003047) PUTATIVE TRANSCRIPTION REGULATOR PROTEIN
[Sinorhizobium meliloti]
emb|CAC47031.1| (AL591790) PUTATIVE TRANSCRIPTION REGULATOR PROTEIN
[Sinorhizobium meliloti]
Length = 213
Score = 41.2 bits (95), Expect = 0.002
Identities = 18/52 (34%), Positives = 34/52 (64%)
Query: 14 RKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
++EQI++A L V G+ G T +A+ G++ P+L+Y++RT + RA++
Sbjct: 19 KEEQILEAALEVFSANGFRGSTIDQIADVAGMSKPNLLYYFRTKEAMHRALI 70
>ref|NP_421917.1| (NC_002696) transcriptional regulator, TetR family [Caulobacter
crescentus CB15]
gb|AAK25085.1| (AE005976) transcriptional regulator, TetR family [Caulobacter
crescentus CB15]
Length = 202
Score = 40.8 bits (94), Expect = 0.003
Identities = 25/91 (27%), Positives = 42/91 (45%)
Query: 1 MAKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQL 60
M + R AP VR++ +V A V R G G + +A+ G++ L +++ + L
Sbjct: 9 MTRAAFTREAPDVRRQALVAAAEAVLAREGVGGTSVRAICAEAGVSPGLLRHYFEGVDDL 68
Query: 61 KRAIVRGAIKADDGLVVAMAVKAGDFQRAKV 91
A + DG + A AGD RA++
Sbjct: 69 IAAAYEAVSQRIDGALDAALAAAGDDPRARL 99
>ref|NP_626492.1| (NC_003888) putative TetR-family transcriptional regulator.
[Streptomyces coelicolor A3(2)]
emb|CAB66177.1| (AL136500) putative TetR-family transcriptional regulator.
[Streptomyces coelicolor A3(2)]
Length = 192
Score = 40.4 bits (93), Expect = 0.004
Identities = 29/96 (30%), Positives = 48/96 (49%), Gaps = 5/96 (5%)
Query: 1 MAKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQL 60
M+ Q K+ A R+ IV +A + GW+ +T + +AE + + P L H+R ++
Sbjct: 1 MSVQERKQREWAERERLIVATARELAEQQGWDAVTTRRLAERIEYSQPVLYSHFRGKREI 60
Query: 61 KRAIVRGAIKADDGLVVAM--AVKAGDFQRAKVDEL 94
A+ A++ + VAM A A D RA+V L
Sbjct: 61 IGAV---ALQGATEMAVAMRAATAAADGPRARVHAL 93
>sp|P51561|TER8_PASMU TETRACYCLINE REPRESSOR PROTEIN CLASS H
gb|AAC43249.1| (U00792) repressor protein [Pasteurella multocida]
Length = 207
Score = 40.0 bits (92), Expect = 0.005
Identities = 17/50 (34%), Positives = 30/50 (60%)
Query: 15 KEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAI 64
KEQ++ L++ VG EG+T + VA+ +G+ P+L +H + L A+
Sbjct: 6 KEQVIDDALILLNEVGIEGLTTRNVAQKIGVEQPTLYWHVKNKRALLDAL 55
>gb|AAD12754.1| (AF038993) repressor protein [Proteus mirabilis]
Length = 207
Score = 40.0 bits (92), Expect = 0.005
Identities = 18/66 (27%), Positives = 36/66 (54%)
Query: 15 KEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRGAIKADDG 74
KEQ++ L++ VG EG+T + +A+ +G+ P+L +H + L A+ ++
Sbjct: 6 KEQVIDNALILLNEVGMEGLTTRKLAQKLGVEQPTLYWHVKNKRALLDALAETILQKHHH 65
Query: 75 LVVAMA 80
V+ +A
Sbjct: 66 HVLPLA 71
>ref|NP_421590.1| (NC_002696) transcriptional regulator, TetR family [Caulobacter
crescentus CB15]
gb|AAK24758.1| (AE005944) transcriptional regulator, TetR family [Caulobacter
crescentus CB15]
Length = 255
Score = 40.0 bits (92), Expect = 0.005
Identities = 19/56 (33%), Positives = 35/56 (61%)
Query: 10 APAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
A R+ I++A L V R+G EG + +A+A+ G++ P+L+Y+Y + L A++
Sbjct: 58 AGEARRAAILKAALAVFARLGPEGASVEAIAQAAGVSKPNLLYYYPSKDALYLAVL 113
>ref|NP_149192.1| (NC_001988) HTH transcriptional regulator TetR family
[Clostridium acetobutylicum]
gb|AAK76774.1|AE001438_27 (AE001438) HTH transcriptional regulator TetR family [Clostridium
acetobutylicum]
Length = 187
Score = 39.7 bits (91), Expect = 0.006
Identities = 17/47 (36%), Positives = 30/47 (63%)
Query: 15 KEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLK 61
KE+I+Q +A +G + +T++ +AE + I +PSL H+ M+ LK
Sbjct: 8 KEKIIQIAFSLADEIGLDKVTFQKLAEKLDIKSPSLYNHFSNMSDLK 54
>ref|NP_355336.1| (NC_003062) AGR_C_4325p [Agrobacterium tumefaciens]
[Agrobacterium tumefaciens str. C58 (Cereon)]
ref|NP_533056.1| (NC_003304) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
gb|AAK88121.1| (AE008152) AGR_C_4325p [Agrobacterium tumefaciens str. C58
(Cereon)]
gb|AAL43372.1| (AE009186) transcriptional regulator, TetR family [Agrobacterium
tumefaciens str. C58 (U. Washington)]
Length = 213
Score = 39.7 bits (91), Expect = 0.006
Identities = 17/52 (32%), Positives = 33/52 (62%)
Query: 14 RKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIV 65
++E I++A L V G+ G T +AE G++ P+++Y++RT + RA++
Sbjct: 19 KQETILEAALSVFSTNGFRGSTIDQIAEAAGMSKPNVLYYFRTKEAMHRALI 70
>emb|CAA75662.1| (Y15510) tetracycline resistance repressor protein [Pasteurella
multocida]
emb|CAA76068.1| (Y16103) tetracycline resistance repressor protein [Mannheimia
haemolytica]
emb|CAC08219.1| (AJ245947) tet repressor protein [Pasteurella aerogenes]
Length = 207
Score = 39.3 bits (90), Expect = 0.008
Identities = 16/50 (32%), Positives = 30/50 (60%)
Query: 15 KEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAI 64
KEQ++ L++ VG EG+T + +A+ +G+ P+L +H + L A+
Sbjct: 6 KEQVIDNALILLNEVGIEGLTTRKLAQKIGVEQPTLYWHVKNKRALLDAL 55
>ref|NP_511232.1| (NC_003292) tetracycline repressor protein [IncN plasmid R46]
sp|P03039|TER3_ECOLI TETRACYCLINE REPRESSOR PROTEIN CLASS C
pir||RPECYS tetracycline repressor - Escherichia coli plasmid pSC101
gb|AAA25677.1| (M36272) tetracycline resistance element repressor protein (tetR)
[Plasmid pSC101]
gb|AAG43219.1| (AY046276) tetracycline repressor protein [IncN plasmid R46]
prf||1101269A repressor tet [Salmonella enterica subsp. enterica serovar
Panama]
Length = 219
Score = 38.5 bits (88), Expect = 0.014
Identities = 15/57 (26%), Positives = 33/57 (57%)
Query: 13 VRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKRAIVRGAI 69
+++E +++ L + VG EG+T + +AE +G+ P+L +H++ L A+ +
Sbjct: 4 LQREAVIRTALELLNDVGMEGLTTRRLAERLGVQQPALYWHFKNKRALLDALAEAML 60
>ref|NP_628554.1| (NC_003888) putative tetR-family transcriptional regulator
[Streptomyces coelicolor A3(2)]
emb|CAB95896.1| (AL359988) putative tetR-family transcriptional regulator
[Streptomyces coelicolor A3(2)]
Length = 202
Score = 38.1 bits (87), Expect = 0.019
Identities = 31/121 (25%), Positives = 48/121 (39%), Gaps = 18/121 (14%)
Query: 3 KQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQLKR 62
++V K+ ++++++A + GW G T VAEH G++ + +H+ T L
Sbjct: 8 ERVPKQDRSRATRQRLLEAAVACLAERGWAGSTVSVVAEHAGVSRGAAQHHFPTREDLFT 67
Query: 63 AIV----------------RGAIKADDGLVVAMAVK--AGDFQRAKVDELLYIRGEHQLE 104
A V GA D VVA V G RA + + E QL
Sbjct: 68 AAVEYVAEERSIALRALFPEGAAADDRRAVVAALVDLYTGPLFRAALHLWVAASNEEQLR 127
Query: 105 G 105
G
Sbjct: 128 G 128
>ref|NP_630227.1| (NC_003888) putative transcriptional regulator [Streptomyces
coelicolor A3(2)]
pir||T35904 probable transcription regulator - Streptomyces coelicolor
emb|CAA22788.1| (AL035212) putative transcriptional regulator [Streptomyces
coelicolor A3(2)]
Length = 192
Score = 37.4 bits (85), Expect = 0.032
Identities = 25/82 (30%), Positives = 41/82 (49%), Gaps = 5/82 (6%)
Query: 1 MAKQVTKRPAPAVRKEQIVQAGLLVARRVGWEGITYKAVAEHVGIAAPSLVYHYRTMTQL 60
M+ Q K+ A R+ IV +A + GW+ +T + +AE + + P L H+R
Sbjct: 1 MSVQERKQRERAERERLIVATARELAEQQGWDAVTTRRLAERIEYSQPVLYSHFRG---- 56
Query: 61 KRAIVRGAIKADDGLVVAMAVK 82
KR I+ GA+ +A AV+
Sbjct: 57 KREII-GAVALQGATELAAAVR 77
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.322 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 62,251,575
Number of Sequences: 1028106
Number of extensions: 2011169
Number of successful extensions: 4713
Number of sequences better than 5.0e-02: 26
Number of HSP's better than 0.1 without gapping: 21
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 4692
Number of HSP's gapped (non-prelim): 26
length of query: 111
length of database: 324,535,921
effective HSP length: 87
effective length of query: 24
effective length of database: 235,090,699
effective search space: 5642176776
effective search space used: 5642176776
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf48 [13343-13405]
(61 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.321 0.138 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 41,204,087
Number of Sequences: 1028106
Number of extensions: 1243960
Number of successful extensions: 2506
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2506
Number of HSP's gapped (non-prelim): 0
length of query: 61
length of database: 324,535,921
effective HSP length: 37
effective length of query: 24
effective length of database: 286,495,999
effective search space: 6875903976
effective search space used: 6875903976
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 85 (37.4 bits)
Query= orf49 [13467-13520]
(52 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.336 0.152 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 28,421,289
Number of Sequences: 1028106
Number of extensions: 716841
Number of successful extensions: 3399
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3399
Number of HSP's gapped (non-prelim): 0
length of query: 52
length of database: 324,535,921
effective HSP length: 28
effective length of query: 24
effective length of database: 295,748,953
effective search space: 7097974872
effective search space used: 7097974872
T: 11
A: 40
X1: 15 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 39 (21.6 bits)
S2: 85 (37.4 bits)
Query= orf50 [13526-13637]
(110 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.323 0.138 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,713,363
Number of Sequences: 1028106
Number of extensions: 1994436
Number of successful extensions: 5179
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 5179
Number of HSP's gapped (non-prelim): 0
length of query: 110
length of database: 324,535,921
effective HSP length: 86
effective length of query: 24
effective length of database: 236,118,805
effective search space: 5666851320
effective search space used: 5666851320
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 84 (37.0 bits)
Query= orf51 [13636-13787]
(150 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 94,576,630
Number of Sequences: 1028106
Number of extensions: 3400143
Number of successful extensions: 7870
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 7870
Number of HSP's gapped (non-prelim): 0
length of query: 150
length of database: 324,535,921
effective HSP length: 110
effective length of query: 40
effective length of database: 211,444,261
effective search space: 8457770440
effective search space used: 8457770440
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 85 (37.4 bits)
Query= orf52 [13788-13858]
(69 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.321 0.140 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 46,310,366
Number of Sequences: 1028106
Number of extensions: 1561078
Number of successful extensions: 3725
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3725
Number of HSP's gapped (non-prelim): 0
length of query: 69
length of database: 324,535,921
effective HSP length: 45
effective length of query: 24
effective length of database: 278,271,151
effective search space: 6678507624
effective search space used: 6678507624
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 84 (37.0 bits)
Query= orf53 [13859-13922]
(62 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.324 0.139 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 36,707,921
Number of Sequences: 1028106
Number of extensions: 1078469
Number of successful extensions: 2415
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2415
Number of HSP's gapped (non-prelim): 0
length of query: 62
length of database: 324,535,921
effective HSP length: 38
effective length of query: 24
effective length of database: 285,467,893
effective search space: 6851229432
effective search space used: 6851229432
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 85 (37.4 bits)
Query= orf54 [13941-14001]
(59 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.322 0.139 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 36,225,386
Number of Sequences: 1028106
Number of extensions: 1064755
Number of successful extensions: 2330
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 2330
Number of HSP's gapped (non-prelim): 0
length of query: 59
length of database: 324,535,921
effective HSP length: 35
effective length of query: 24
effective length of database: 288,552,211
effective search space: 6925253064
effective search space used: 6925253064
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 85 (37.4 bits)
Query= orf55 [13999-14088]
(88 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.321 0.135 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 52,183,734
Number of Sequences: 1028106
Number of extensions: 1776028
Number of successful extensions: 3371
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 3371
Number of HSP's gapped (non-prelim): 0
length of query: 88
length of database: 324,535,921
effective HSP length: 64
effective length of query: 24
effective length of database: 258,737,137
effective search space: 6209691288
effective search space used: 6209691288
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 84 (37.0 bits)
Query= orf56 [14087-14190]
(102 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.316 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 59,829,153
Number of Sequences: 1028106
Number of extensions: 2127616
Number of successful extensions: 16132
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 16132
Number of HSP's gapped (non-prelim): 0
length of query: 102
length of database: 324,535,921
effective HSP length: 78
effective length of query: 24
effective length of database: 244,343,653
effective search space: 5864247672
effective search space used: 5864247672
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf57 [14234-14372]
(137 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_213407.1| (NC_000918) polypeptide deformylase [Aquif... 74 2e-013
ref|NP_600815.1| (NC_003450) COG0242:N-formylmethionyl-tRNA... 72 1e-012
ref|NP_625500.1| (NC_003888) putative polypeptide deformyla... 70 3e-012
ref|NP_562660.1| (NC_003366) polypeptide deformylase [Clost... 69 1e-011
ref|NP_246498.1| (NC_002663) Def [Pasteurella multocida] >g... 67 3e-011
ref|NP_541241.1| (NC_003318) POLYPEPTIDE DEFORMYLASE [Bruce... 67 4e-011
ref|NP_601934.1| (NC_003450) COG0242:N-formylmethionyl-tRNA... 66 6e-011
ref|NP_389454.1| (NC_000964) polypeptide deformylase [Bacil... 66 7e-011
ref|NP_269070.1| (NC_002737) putative polypeptide deformyla... 64 3e-010
ref|NP_607075.1| (NC_003485) putative polypeptide deformyla... 64 3e-010
pdb|2DEF| Peptide Deformylase Catalytic Core (Residues 1 -... 60 3e-009
ref|NP_266716.1| (NC_002662) polypeptide deformylase [Lacto... 59 9e-009
sp|Q48661|DEF_LACLA Peptide deformylase (PDF) (Polypeptide ... 59 9e-009
ref|NP_359915.1| (NC_003103) polypeptide deformylase [EC:3.... 55 1e-007
sp|Q9FV53|DEFM_ARATH Peptide deformylase, mitochondrial pre... 52 9e-007
ref|NP_345996.1| (NC_003028) polypeptide deformylase [Strep... 52 1e-006
gb|EAA02964.1| (AAAB01007479) agCP10822 [Anopheles gambiae ... 45 2e-004
gb|AAG09295.1|AF177768_1 (AF177768) ORF260 [Trypanosoma bru... 39 0.010
>ref|NP_213407.1| (NC_000918) polypeptide deformylase [Aquifex aeolicus]
sp|O66847|DEF_AQUAE Peptide deformylase (PDF) (Polypeptide deformylase)
pir||C70352 polypeptide deformylase - Aquifex aeolicus
gb|AAC06802.1| (AE000696) polypeptide deformylase [Aquifex aeolicus]
Length = 169
Score = 74.3 bits (181), Expect = 2e-013
Identities = 37/97 (38%), Positives = 57/97 (58%), Gaps = 7/97 (7%)
Query: 46 NQLGVLKRVIVVR------APKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHN 99
NQ+GV V+V+ AP K ++NP I + G + +EGCLSFPG +V+ +R
Sbjct: 49 NQIGVPLSVMVIDTSPKEDAPPLKLVLINPEI-KEGEGKIKYKEGCLSFPGLSVEVERFQ 107
Query: 100 KITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNGVT 136
K+ V + H +P+++ +G A QHE+DHL G+T
Sbjct: 108 KVKVNALNEHGEPVELTLEGFPAIVFQHELDHLKGIT 144
>ref|NP_600815.1| (NC_003450) COG0242:N-formylmethionyl-tRNA deformylase
[Corynebacterium glutamicum]
dbj|BAB98994.1| (AP005279) N-formylmethionyl-tRNA deformylase [Corynebacterium
glutamicum ATCC 13032]
Length = 169
Score = 72.0 bits (175), Expect = 1e-012
Identities = 44/136 (32%), Positives = 61/136 (44%), Gaps = 4/136 (2%)
Query: 4 LTGDAYELHAIAADVPHGEDVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAPK-- 61
L GD + V E ++ L+D NQ+GVL+RV V
Sbjct: 8 LFGDPVLVSRADEVVDFDESLSTLIDDMFDTMEDAGGVGLAANQVGVLRRVFVFDTSHQE 67
Query: 62 --FKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKG 119
+G ++NPV T +EGCLS P + + R+ + + G D P+ A G
Sbjct: 68 GGLRGHVINPVWEPLTEDTQTGKEGCLSIPDVSAETTRYETVRLSGQDRDGNPVGFVANG 127
Query: 120 LTAFCIQHEIDHLNGV 135
L A CIQHE DHL+GV
Sbjct: 128 LLARCIQHETDHLDGV 143
>ref|NP_625500.1| (NC_003888) putative polypeptide deformylase [Streptomyces
coelicolor A3(2)]
emb|CAC01493.1| (AL391017) putative polypeptide deformylase [Streptomyces
coelicolor A3(2)]
Length = 217
Score = 70.5 bits (171), Expect = 3e-012
Identities = 46/137 (33%), Positives = 61/137 (43%), Gaps = 7/137 (5%)
Query: 4 LTGDAYELHAIAADVPH-GEDVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAP-- 60
L GD LHA A+V G ++ LV+ NQ+G RV V P
Sbjct: 56 LLGDPV-LHARCAEVTDFGPELAALVEDLFATMYAAHGVGLAANQVGEAVRVFVYDCPDD 114
Query: 61 ---KFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEA 117
+ G +VNP + V EGCLS PG +R+++ V GF +P+ V
Sbjct: 115 EDERHLGHVVNPRLVETGGVVVRGPEGCLSLPGLEAGTERYDEAVVTGFTVAGEPVTVRG 174
Query: 118 KGLTAFCIQHEIDHLNG 134
G A C+QHE DHL G
Sbjct: 175 TGFFARCLQHECDHLEG 191
>ref|NP_562660.1| (NC_003366) polypeptide deformylase [Clostridium perfringens]
dbj|BAB81450.1| (AP003191) polypeptide deformylase [Clostridium perfringens str.
13]
Length = 147
Score = 68.6 bits (166), Expect = 1e-011
Identities = 39/91 (42%), Positives = 50/91 (54%), Gaps = 3/91 (3%)
Query: 47 QLGVLKRVIVVRAPKFKGC--IVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVE 104
Q+G+LKR+ VV A G +NP I SG EGCLS PG+ KR NKI ++
Sbjct: 51 QVGILKRIFVVDAMDGAGSRVFINPEILEK-SGEQTDEEGCLSLPGRHKPVKRANKIKIK 109
Query: 105 GFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
D + ++A+G A IQHE DHL GV
Sbjct: 110 ALDVNGNEFVLDAEGFLARAIQHEYDHLEGV 140
>ref|NP_246498.1| (NC_002663) Def [Pasteurella multocida]
sp|P57948|DEF_PASMU Peptide deformylase (PDF) (Polypeptide deformylase)
gb|AAK03643.1| (AE006193) Def [Pasteurella multocida]
Length = 170
Score = 67.0 bits (162), Expect = 3e-011
Identities = 37/117 (31%), Positives = 56/117 (47%), Gaps = 4/117 (3%)
Query: 22 EDVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAPKFKG---CIVNPVITRHTSGH 78
+++ +VD Q+ + +R+I + K ++NP I + G
Sbjct: 26 DEIREIVDNMFETMYLEEGIGLAATQVNIHQRIITIDVEGTKENQYVLINPEII-DSCGE 84
Query: 79 VNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
EGCLS PG R K+T++ D H + + A+GL A CIQHEIDHLNG+
Sbjct: 85 TGIEEGCLSLPGFRGFVPRKEKVTIKALDRHGEEYTLSAEGLLAICIQHEIDHLNGI 141
>ref|NP_541241.1| (NC_003318) POLYPEPTIDE DEFORMYLASE [Brucella melitensis]
gb|AAL53505.1| (AE009665) POLYPEPTIDE DEFORMYLASE [Brucella melitensis]
Length = 187
Score = 66.6 bits (161), Expect = 4e-011
Identities = 34/76 (44%), Positives = 42/76 (54%)
Query: 60 PKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKG 119
PK VNP I + + EGCLS P + +R + V FDA +P +EA G
Sbjct: 81 PKAPHIFVNPTIVQSSDKRSTYEEGCLSIPDYYAEVERPATVKVNYFDADGKPQSMEADG 140
Query: 120 LTAFCIQHEIDHLNGV 135
L A C+QHEIDHLNGV
Sbjct: 141 LMATCLQHEIDHLNGV 156
>ref|NP_601934.1| (NC_003450) COG0242:N-formylmethionyl-tRNA deformylase
[Corynebacterium glutamicum]
dbj|BAC00133.1| (AP005282) N-formylmethionyl-tRNA deformylase [Corynebacterium
glutamicum ATCC 13032]
Length = 193
Score = 66.2 bits (160), Expect = 6e-011
Identities = 45/144 (31%), Positives = 63/144 (43%), Gaps = 24/144 (16%)
Query: 2 KVLTGDAYELHAIAADVPHGEDVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAPK 61
+++T D EL + AD+ DV V NQ+GV KR+ V P
Sbjct: 19 QLVTEDVSELQELIADMYETMDVANGVGLAA-------------NQIGVSKRIFVYDCPD 65
Query: 62 -----FKGCIVNPVITRHTSGHV------NSREGCLSFPGKTVDKKRHNKITVEGFDAHW 110
KGC +NPV+ + EGCLS PG+ R + V G +
Sbjct: 66 DEGVMHKGCFINPVLETSEIPETMPADDGSDEEGCLSVPGEGFPTGRAHWAKVTGLNEKG 125
Query: 111 QPIKVEAKGLTAFCIQHEIDHLNG 134
+ + VEA+G A C QHE+ HL+G
Sbjct: 126 EEVSVEAEGFLARCFQHEVGHLDG 149
>ref|NP_389454.1| (NC_000964) polypeptide deformylase [Bacillus subtilis]
sp|P94462|DEF1_BACSU Peptide deformylase 1 (PDF 1) (Polypeptide deformylase 1)
pir||F69613 polypeptide deformylase def - Bacillus subtilis
emb|CAA71349.1| (Y10304) polypeptide deformylase [Bacillus subtilis]
emb|CAA74262.1| (Y13937) putative Def protein [Bacillus subtilis]
emb|CAB13445.1| (Z99112) polypeptide deformylase [Bacillus subtilis]
Length = 160
Score = 65.9 bits (159), Expect = 7e-011
Identities = 39/91 (42%), Positives = 51/91 (55%), Gaps = 3/91 (3%)
Query: 47 QLGVLKRVIVVRAPKFKGCI--VNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVE 104
Q+G+LKR VV +G I VNP I SG EGCLSFP D R + + V
Sbjct: 51 QIGILKRAAVVEIGDDRGRIDLVNPEILEK-SGEQTGIEGCLSFPNVYGDVTRADYVKVR 109
Query: 105 GFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
F+ +P +EA+G A +QHE+DHL+GV
Sbjct: 110 AFNRQGKPFILEARGFLARAVQHEMDHLDGV 140
>ref|NP_269070.1| (NC_002737) putative polypeptide deformylase [Streptococcus
pyogenes] [Streptococcus pyogenes M1 GAS]
gb|AAK33791.1| (AE006536) putative polypeptide deformylase [Streptococcus pyogenes
M1 GAS]
Length = 136
Score = 63.9 bits (154), Expect = 3e-010
Identities = 35/92 (38%), Positives = 52/92 (56%), Gaps = 2/92 (2%)
Query: 46 NQLGVLKRVIVVRAPKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEG 105
N +G KR+++V + NPV+ G ++E CLS G +R++KITVE
Sbjct: 47 NMIGEQKRIVIVSMGFIDLVMFNPVMVSK-KGIYQTKESCLSLSGYR-KTQRYDKITVEY 104
Query: 106 FDAHWQPIKVEAKGLTAFCIQHEIDHLNGVTI 137
D +W+P ++ GLTA QHE+DHL G+ I
Sbjct: 105 LDHNWRPKRLSLTGLTAQICQHELDHLEGILI 136
>ref|NP_607075.1| (NC_003485) putative polypeptide deformylase [Streptococcus
pyogenes MGAS8232]
gb|AAL97574.1| (AE010021) putative polypeptide deformylase [Streptococcus pyogenes
MGAS8232]
Length = 136
Score = 63.9 bits (154), Expect = 3e-010
Identities = 35/92 (38%), Positives = 52/92 (56%), Gaps = 2/92 (2%)
Query: 46 NQLGVLKRVIVVRAPKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEG 105
N +G KR+++V + NPV+ G ++E CLS G +R++KITVE
Sbjct: 47 NMIGEQKRIVIVSMGFIDLVMFNPVMVSK-KGIYQTKESCLSLSGYR-KTQRYDKITVEY 104
Query: 106 FDAHWQPIKVEAKGLTAFCIQHEIDHLNGVTI 137
D +W+P ++ GLTA QHE+DHL G+ I
Sbjct: 105 LDHNWRPKRLSLTGLTAQICQHELDHLEGILI 136
>pdb|2DEF| Peptide Deformylase Catalytic Core (Residues 1 - 147), Nmr, 20
Structures
Length = 147
Score = 60.5 bits (145), Expect = 3e-009
Identities = 34/115 (29%), Positives = 54/115 (46%), Gaps = 4/115 (3%)
Query: 23 DVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVRAPKFKG---CIVNPVITRHTSGHV 79
++ +VD Q+ + +R+IV+ + + ++NP + SG
Sbjct: 26 EIQRIVDDMFETMYAEEGIGLAATQVDIHQRIIVIDVSENRDERLVLINPELLEK-SGET 84
Query: 80 NSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNG 134
EGCLS P + R K+ + D +P ++EA GL A CIQHE+DHL G
Sbjct: 85 GIEEGCLSIPEQRALVPRAEKVKIRALDRDGKPFELEADGLLAICIQHEMDHLVG 139
>ref|NP_266716.1| (NC_002662) polypeptide deformylase [Lactococcus lactis subsp.
lactis]
gb|AAK04658.1|AE006290_3 (AE006290) polypeptide deformylase [Lactococcus lactis subsp.
lactis]
Length = 211
Score = 58.9 bits (141), Expect = 9e-009
Identities = 40/114 (35%), Positives = 53/114 (46%), Gaps = 27/114 (23%)
Query: 46 NQLGVLKRVIVVRAP------------------KFKGCIVNPVITRHT--SGHVNSREGC 85
NQLG+LK+VI V P K + + N + H+ V EGC
Sbjct: 79 NQLGLLKKVIAVLIPNEPEVDEDGNEIPPKEAYKMREIMYNAKVVSHSVQDAAVEGGEGC 138
Query: 86 LSF----PGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
LS PG V RH ++TVE ++ + K+ K A C+QHEIDH NGV
Sbjct: 139 LSVDREVPGYVV---RHARVTVEYYNKEGEKKKIRLKDFPAICVQHEIDHTNGV 189
>sp|Q48661|DEF_LACLA Peptide deformylase (PDF) (Polypeptide deformylase)
Length = 196
Score = 58.9 bits (141), Expect = 9e-009
Identities = 40/114 (35%), Positives = 53/114 (46%), Gaps = 27/114 (23%)
Query: 46 NQLGVLKRVIVVRAP------------------KFKGCIVNPVITRHT--SGHVNSREGC 85
NQLG+LK+VI V P K + + N + H+ V EGC
Sbjct: 64 NQLGLLKKVIAVLIPNEPEVDEDGNEIPPKEAYKMREIMYNAKVVSHSVQDAAVEGGEGC 123
Query: 86 LSF----PGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNGV 135
LS PG V RH ++TVE ++ + K+ K A C+QHEIDH NGV
Sbjct: 124 LSVDREVPGYVV---RHARVTVEYYNKEGEKKKIRLKDFPAICVQHEIDHTNGV 174
>ref|NP_359915.1| (NC_003103) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
conorii]
sp|Q92IZ1|DEF_RICCN Peptide deformylase (PDF) (Polypeptide deformylase)
gb|AAL02816.1| (AE008593) polypeptide deformylase [EC:3.5.1.31] [Rickettsia
conorii]
Length = 175
Score = 55.5 bits (132), Expect = 1e-007
Identities = 35/98 (35%), Positives = 49/98 (49%), Gaps = 10/98 (10%)
Query: 47 QLGVLKRVIV--------VRAPK--FKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKK 96
Q+GVLKR++V V PK + IVNP I V + EGC+S P + ++
Sbjct: 51 QVGVLKRILVIDIQDHDSVARPKDFYPLFIVNPEIIEKAEELVTANEGCISLPEQRIEVA 110
Query: 97 RHNKITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNG 134
R I + D H + +++A A IQHE DHL G
Sbjct: 111 RPESIKIRYLDYHGKSRELKANDWLARVIQHEYDHLEG 148
>sp|Q9FV53|DEFM_ARATH Peptide deformylase, mitochondrial precursor (PDF) (Polypeptide
deformylase)
gb|AAD39667.1|AC007591_32 (AC007591) Simalar to gi|4377403 Polypeptide Deformylase from
Chlamydia pneumoniae genome gb|AE001687. [Arabidopsis
thaliana]
gb|AAK32873.1|AF361861_1 (AF361861) At1g15390/F9L1_34 [Arabidopsis thaliana]
Length = 259
Score = 52.4 bits (124), Expect = 9e-007
Identities = 45/155 (29%), Positives = 67/155 (43%), Gaps = 24/155 (15%)
Query: 3 VLTGDAYELHAIAADVPHGE----DVTGLVDXXXXXXXXXXXXXXXXNQLGVLKRVIVVR 58
V +GD LH A +V GE + ++D Q+GV R+IV+
Sbjct: 74 VASGDPV-LHEKAREVDPGEIGSERIQKIIDDMIKVMRLAPGVGLAAPQIGVPLRIIVLE 132
Query: 59 --------APKFK-----------GCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHN 99
APK + +VNPV+ ++ EGCLS G +R+
Sbjct: 133 DTKEYISYAPKEEILAQERRHFDLMVMVNPVLKERSNKKALFFEGCLSVDGFRAAVERYL 192
Query: 100 KITVEGFDAHWQPIKVEAKGLTAFCIQHEIDHLNG 134
++ V G+D + I+V A G A +QHE DHL+G
Sbjct: 193 EVVVTGYDRQGKRIEVNASGWQARILQHECDHLDG 227
>ref|NP_345996.1| (NC_003028) polypeptide deformylase [Streptococcus pneumoniae
TIGR4]
ref|NP_359001.1| (NC_003098) Formylmethionine deformylase [Streptococcus pneumoniae
R6]
gb|AAK75636.1| (AE007449) polypeptide deformylase [Streptococcus pneumoniae TIGR4]
gb|AAL00212.1| (AE008510) Formylmethionine deformylase [Streptococcus pneumoniae
R6]
gb|AAK13237.1| (AY014508) peptide deformylase-like protein [Streptococcus
pneumoniae]
Length = 136
Score = 52.0 bits (123), Expect = 1e-006
Identities = 32/89 (35%), Positives = 42/89 (46%), Gaps = 2/89 (2%)
Query: 46 NQLGVLKRVIVVRAPKFKGCIVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEG 105
N +GV KRVI+ + NPV+ G + EGCLS G KR+ I +
Sbjct: 47 NMIGVQKRVIIFNLGLVPVVMFNPVLLSF-EGSYEAEEGCLSLVGVR-STKRYETIRLAY 104
Query: 106 FDAHWQPIKVEAKGLTAFCIQHEIDHLNG 134
D+ WQ + G A QHE+DHL G
Sbjct: 105 RDSKWQEQTITLTGFPAQICQHELDHLEG 133
>gb|EAA02964.1| (AAAB01007479) agCP10822 [Anopheles gambiae str. PEST]
Length = 232
Score = 44.7 bits (104), Expect = 2e-004
Identities = 24/70 (34%), Positives = 35/70 (49%)
Query: 66 IVNPVITRHTSGHVNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLTAFCI 125
++NP + V E C S G D R+ +I ++GFDA ++ G A
Sbjct: 130 LLNPELKVLNYEKVIHTEACESVRGYRADVPRYREILLQGFDATGNRQELRLSGWNARIA 189
Query: 126 QHEIDHLNGV 135
QHE+DHLNG+
Sbjct: 190 QHEMDHLNGI 199
>gb|AAG09295.1|AF177768_1 (AF177768) ORF260 [Trypanosoma brucei]
Length = 260
Score = 38.9 bits (89), Expect = 0.010
Identities = 24/76 (31%), Positives = 32/76 (41%), Gaps = 5/76 (6%)
Query: 67 VNPVITRHTSGH-----VNSREGCLSFPGKTVDKKRHNKITVEGFDAHWQPIKVEAKGLT 121
VNP + + H E C+S T R IT G D + G+
Sbjct: 113 VNPTVPGYDDRHSIAPMYGMWENCISCGACTAWVIRPQSITCSGLDEYGNEKTEVLDGMR 172
Query: 122 AFCIQHEIDHLNGVTI 137
A C+ HE+DHL+G TI
Sbjct: 173 ARCLMHELDHLSGKTI 188
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.321 0.139 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 80,229,907
Number of Sequences: 1028106
Number of extensions: 2754842
Number of successful extensions: 3540
Number of sequences better than 5.0e-02: 119
Number of HSP's better than 0.1 without gapping: 103
Number of HSP's successfully gapped in prelim test: 16
Number of HSP's that attempted gapping in prelim test: 3412
Number of HSP's gapped (non-prelim): 122
length of query: 137
length of database: 324,535,921
effective HSP length: 113
effective length of query: 24
effective length of database: 208,359,943
effective search space: 5000638632
effective search space used: 5000638632
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 83 (36.6 bits)
Query= orf58 [14371-14472]
(100 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.133 0.000
Gapped
Lambda K H
0.267 0.0410 0.000
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 52,888,008
Number of Sequences: 1028106
Number of extensions: 1611870
Number of successful extensions: 4417
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 4417
Number of HSP's gapped (non-prelim): 0
length of query: 100
length of database: 324,535,921
effective HSP length: 76
effective length of query: 24
effective length of database: 246,399,865
effective search space: 5913596760
effective search space used: 5913596760
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 84 (37.0 bits)
Query= orf59 [14471-14595]
(123 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.325 0.141 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 89,577,650
Number of Sequences: 1028106
Number of extensions: 3520258
Number of successful extensions: 6513
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 6513
Number of HSP's gapped (non-prelim): 0
length of query: 123
length of database: 324,535,921
effective HSP length: 99
effective length of query: 24
effective length of database: 222,753,427
effective search space: 5346082248
effective search space used: 5346082248
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 84 (37.0 bits)
Query= orf60 [14593-15385]
(791 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_489430.1| (NC_003273) hypothetical protein [Nostoc s... 101 3e-020
ref|NP_646225.1| (NC_003923) hypothetical protein [Staphylo... 86 1e-015
ref|NP_371315.1| (NC_002758) hypothetical protein [Staphylo... 84 7e-015
gb|AAL67615.1| (U93688) orf15 [Staphylococcus aureus] 84 7e-015
gb|AAL04136.1|AF410775_12 (AF410775) unknown [Staphylococcu... 84 7e-015
gb|AAB00938.1| (L31763) virulence-associated protein E [Dic... 83 2e-014
ref|NP_569125.1| (NC_003374) hypothetical protein [Gluconob... 80 7e-014
pir||S49113 hypothetical protein 2 - Microcystis aeruginosa... 74 9e-012
ref|NP_279342.1| (NC_002607) Vng0215c [Halobacterium sp. NR... 59 2e-007
>ref|NP_489430.1| (NC_003273) hypothetical protein [Nostoc sp. PCC 7120]
dbj|BAB77438.1| (AP003604) ORF_ID:all8519~hypothetical protein [Nostoc sp. PCC
7120]
Length = 836
Score = 101 bits (252), Expect = 3e-020
Identities = 92/338 (27%), Positives = 161/338 (47%), Gaps = 23/338 (6%)
Query: 467 KSDIKGMIAYMAHRYAHNPVKAWVESAP--WDGLDHVGLLFSHITLTPDEDRPMCEHLFR 524
K K ++ +A +++PV +++S + +D V LL + P+ L +
Sbjct: 341 KEQAKSVVLELALANSYSPVVEYLQSVERRYPNVD-VSLLDKLAERYFGTNDPLHAALMK 399
Query: 525 KWMRAAVSAGVGDQEGC--EPVLVFVDEQGGVGKTRFFRTLCPEPFRADSVLLDVKDKDS 582
+ + AAV+ + GC + + + +Q + K+ F+ L E F D L+ +KD
Sbjct: 400 RTLIAAVARAF--EPGCKHDEITILQGKQKSL-KSTFWEILAGEEFFTDD--LNGTEKDE 454
Query: 583 VKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVN 642
V + YW++E E + ++ + + + LKAFLSR KD +R PYG +PR + VG+ N
Sbjct: 455 VLKISQYWILEYAEFENSYKKKDVSQLKAFLSRKKDSMRRPYGTDIEDFPRPSILVGTTN 514
Query: 643 EAEFLVDTTSNRRFIPMKVAALNHQHRVNTQQ---AWAQALAEARSG---AVTYVEAHEV 696
EFL D T RRF +KV ++ Q+ WA A+A R+G +T VE +
Sbjct: 515 LDEFLYDPTGERRFWVIKVLFKKIPIKMLQQERDLIWAAAVAAYRNGEQWRLTEVEDEWL 574
Query: 697 AERNASFQATSAIDDVLSSRLNEATGERNVHMTVTDLLKRCGMHNPTKRDLNDAGKWLRS 756
N +Q+T ++ + + E E +V ++D+LK + +K + N LRS
Sbjct: 575 DAANKQYQSTDTWEEAVMA-WAELQKEVSVGEILSDVLK-IELAKQSKAEQNRVAAILRS 632
Query: 757 NGYEKRIRSGVRGFML-----PDMSIGAQAFGGPQLEV 789
+G+ + R V G ++ P + + G + EV
Sbjct: 633 HGWTRGDRKRVNGRLIRPWVRPQQEVEQEVERGVEQEV 670
>ref|NP_646225.1| (NC_003923) hypothetical protein [Staphylococcus aureus subsp.
aureus MW2]
dbj|BAB95273.1| (AP004827) hypothetical protein [Staphylococcus aureus subsp.
aureus MW2]
Length = 815
Score = 86.3 bits (212), Expect = 1e-015
Identities = 78/284 (27%), Positives = 133/284 (46%), Gaps = 29/284 (10%)
Query: 449 DEAALSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHI 508
D + S IE I +++H K+ K I +A + A++PV+ ++ WDG + LF
Sbjct: 437 DSSLRSYIEKIYDIHHSGKT--KDAIISVAMQNAYHPVRDYLNKISWDGHKRLEKLFIKY 494
Query: 509 TLTPDEDRPMCEHLFRKWMRAAVSAGVGD--QEGCE-PVLVFVDEQGGVGKTRFFRTLCP 565
D + + R + A++AG+ + GC+ ++ + GVGK+ + L
Sbjct: 495 LGVEDTE------VNRTTTKKALTAGIARVMEPGCKFDYMLTLYGPQGVGKSALLKKLGG 548
Query: 566 EPFRADSVLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYG 625
F +DS L+ V K++ + WL+E+ EL T ++E A+K F+S+ D R+ YG
Sbjct: 549 AWF-SDS-LVSVTGKEAYEALQGVWLMEMAELAAT-RKAEVEAIKHFISKQVDRFRVAYG 605
Query: 626 RTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMKV------AALNHQHRVNTQQAWAQA 679
+PR ++G+ N+ +FL D T RRF PM V + + Q WA+A
Sbjct: 606 HYIEDFPRQCIFIGTTNKVDFLRDETGGRRFWPMTVNPERVEVNWSKLTKEEIDQIWAEA 665
Query: 680 LAEARSGAVTYVEAHEVAERNASFQA--------TSAIDDVLSS 715
G ++ E+ E S Q+ T ID+ L++
Sbjct: 666 KYYYEQGEELFLNP-ELEEEMRSIQSKHTEESPYTGIIDEYLNT 708
>ref|NP_371315.1| (NC_002758) hypothetical protein [Staphylococcus aureus subsp.
aureus Mu50]
dbj|BAB56953.1| (AP003360) hypothetical protein [Staphylococcus aureus subsp.
aureus Mu50]
Length = 489
Score = 84.0 bits (206), Expect = 7e-015
Identities = 59/207 (28%), Positives = 97/207 (46%), Gaps = 7/207 (3%)
Query: 453 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 512
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 95 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 151
Query: 513 DEDRPMCEHLFRKWMRAAVSAGVGDQEGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRADS 572
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 152 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 209
Query: 573 VLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 632
+ K + K+ W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 210 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 267
Query: 633 RMTAYVGSVNEAEFLVDTTSNRRFIPM 659
R +VG+ N EFL D T NRRF P+
Sbjct: 268 RQCVFVGTTNNYEFLKDQTGNRRFFPI 294
>gb|AAL67615.1| (U93688) orf15 [Staphylococcus aureus]
Length = 477
Score = 84.0 bits (206), Expect = 7e-015
Identities = 59/207 (28%), Positives = 97/207 (46%), Gaps = 7/207 (3%)
Query: 453 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 512
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 87 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 143
Query: 513 DEDRPMCEHLFRKWMRAAVSAGVGDQEGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRADS 572
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 144 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 201
Query: 573 VLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 632
+ K + K+ W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 202 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 259
Query: 633 RMTAYVGSVNEAEFLVDTTSNRRFIPM 659
R +VG+ N EFL D T NRRF P+
Sbjct: 260 RQCVFVGTTNNYEFLKDQTGNRRFFPI 286
>gb|AAL04136.1|AF410775_12 (AF410775) unknown [Staphylococcus aureus]
Length = 474
Score = 84.0 bits (206), Expect = 7e-015
Identities = 59/207 (28%), Positives = 97/207 (46%), Gaps = 7/207 (3%)
Query: 453 LSEIESIGNLNHYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTP 512
+S I+ + N+ + + I +I A++ +P+K+ +ES WDG+ + LF I
Sbjct: 84 ISHIDKLYNVQ-FSRDLIDTVIEKEAYQNRFHPIKSMIESKSWDGIKRIETLF--IDYLG 140
Query: 513 DEDRPMCEHLFRKWMRAAVSAGVGDQEGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRADS 572
ED + +KWM AV+ + +++ QG VGK+ L +
Sbjct: 141 AEDNHYNREVTKKWMMGAVARIYQPGIKYDSMIILYGGQG-VGKSTAVSKLGGHWYNQS- 198
Query: 573 VLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYP 632
+ K + K+ W+ E+ EL F +S +K F+S D R YG+ ++P
Sbjct: 199 -IKTFKGDEVYKKLQGSWICEIEELSA-FQKSTIEDIKGFISAIVDIYRASYGKRTERHP 256
Query: 633 RMTAYVGSVNEAEFLVDTTSNRRFIPM 659
R +VG+ N EFL D T NRRF P+
Sbjct: 257 RQCVFVGTTNNYEFLKDQTGNRRFFPI 283
>gb|AAB00938.1| (L31763) virulence-associated protein E [Dichelobacter nodosus]
Length = 437
Score = 82.8 bits (203), Expect = 2e-014
Identities = 57/194 (29%), Positives = 94/194 (48%), Gaps = 6/194 (3%)
Query: 466 PKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGLLFSHITLTPDEDRPMCEHLFRK 525
P I I +A + P++ ++ES WDG + F E +
Sbjct: 96 PIDSIDMGIKMLAREHIFCPIQQYLESLQWDGKPRIATAFHDFLGV--ERNTYTIGISTL 153
Query: 526 WMRAAVSAGVGDQEGCEPVLVFVDEQGGVGKTRFFRTLCPEPFRADSVLLDVKDKDSVKQ 585
+ RA V+ + V+V +QG +GK+ F R L E + + + K++++
Sbjct: 154 FFRALVARVYQPAIKFDHVIVLQGDQG-IGKSSFLRILGGEWY--SDGIRKFEGKEAIEA 210
Query: 586 AISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKYPRMTAYVGSVNEAE 645
+ E+ EL G FS++E +KAF++RT+D R+ YGR +PR T +VG+ N +
Sbjct: 211 IQGVLIGEIPELQG-FSKAEIEEIKAFITRTEDRARMAYGRRVQSFPRRTVFVGTTNADD 269
Query: 646 FLVDTTSNRRFIPM 659
+L D T NRRF P+
Sbjct: 270 YLRDATGNRRFFPI 283
>ref|NP_569125.1| (NC_003374) hypothetical protein [Gluconobacter oxydans]
emb|CAD21834.1| (AJ428837) hypothetical protein [Gluconobacter oxydans]
Length = 416
Score = 80.5 bits (197), Expect = 7e-014
Identities = 68/233 (29%), Positives = 102/233 (43%), Gaps = 27/233 (11%)
Query: 464 HYPKSDIKGMIAYMAHRYAHNPVKAWVESAPWDGLDHVGL-LFSHITLTPDEDRPMCEHL 522
++ + I +A A++P++ ++ WDG+ + L + E L
Sbjct: 99 NFSTQQVSAAIQSVAEENAYHPIRDYLTGLEWDGVQRLDTWLHDFLGAEDSEYTAGVGRL 158
Query: 523 FRKWMRAAVSAGVGDQEGC--EPVLVFVDEQGG---------VGKTRFFRTLCPEPFRAD 571
F M A V + GC + VL+F QG G +R+F P P R
Sbjct: 159 FLIGMVARVM-----EPGCKNDHVLIFEGNQGIRKSTACKVLAGGSRYFSDSLP-PIREG 212
Query: 572 SVLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSRTKDDIRLPYGRTNMKY 631
S KD WLVE+ E+ SR+++ ALKAFL+RT + R YGR +
Sbjct: 213 S-------KDLASHLAGKWLVEVAEMSAA-SRADTEALKAFLTRTHETYRPAYGRNEITQ 264
Query: 632 PRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVN-TQQAWAQALAEA 683
PR VG+ N E+L D+T +RR P+K ++ Q+A Q AEA
Sbjct: 265 PRQCVLVGTTNRKEYLRDSTGDRRVWPVKCGVNGEPCDIDGLQEARDQLFAEA 317
>pir||S49113 hypothetical protein 2 - Microcystis aeruginosa
emb|CAA82191.1| (Z28337) ORF2 [Microcystis aeruginosa]
Length = 502
Score = 73.6 bits (179), Expect = 9e-012
Identities = 68/228 (29%), Positives = 105/228 (45%), Gaps = 30/228 (13%)
Query: 556 KTRFFRTLCPEPFRADSVLLDVKDKDSVKQAISYWLVELGELDGTFSRSESNALKAFLSR 615
K+ FF+TL E F D+++ DKD + W++E ELD +++ +K+FLSR
Sbjct: 278 KSTFFKTLGGEYF-TDAMMST--DKDGLLTMAKNWIIEWSELDHFTAKTYHGIIKSFLSR 334
Query: 616 TKDDIRLPYGRTNMKYPRMTAYVGSVNEAEFLVDTTSNRRFIPMKVAALNHQHRVNTQQA 675
+ D R PY R M +PR + VGS N FL D T NRRF + + +H++N
Sbjct: 335 SDDFYREPYAREAMSHPRCSMIVGSTNRDTFLNDPTGNRRF---WIIPIPKKHKINID-- 389
Query: 676 WAQALAEARSGAVT--YVEAHEV----------AERNASFQATSAIDDVLSSRLNEATGE 723
+ QA E G Y++ AE N ++ + +D L+ L +
Sbjct: 390 YLQAYREKLLGWAVWRYLDGEPCQLPEEFKALQAEANKQWENNDSWEDELAEFL-----D 444
Query: 724 RNVHMTVTDLLKRCGMH----NPTKRDLNDAGKWLRSNGY-EKRIRSG 766
R ++V + L+R N + D G LR G+ KRI+ G
Sbjct: 445 RESDLSVKECLQRLEDRGYPINFGRSDEMRMGDILRKAGFSRKRIQRG 492
>ref|NP_279342.1| (NC_002607) Vng0215c [Halobacterium sp. NRC-1]
gb|AAG18822.1| (AE004986) Vng0215c [Halobacterium sp. NRC-1]
Length = 857
Score = 59.3 bits (142), Expect = 2e-007
Identities = 32/99 (32%), Positives = 49/99 (49%), Gaps = 6/99 (6%)
Query: 188 DAKTIREALRHIDPGETREEWVKVGLSLKSGFGDDPQGLSLFDEWSSGALWRDGDEPANY 247
D I +AL H++P +EW+K+G ++ + G LF++WS G G + N
Sbjct: 220 DKSDIEDALSHVNPDCGYDEWLKLGFAVHD-WDSGSTGKRLFEQWSKG-----GSKWDNQ 273
Query: 248 VPEHIETQWDSFKAEGGRTIATVYHKAIEGGWQPPAGIN 286
+I+ WDS G T+ T+ H A +GGW P N
Sbjct: 274 AERNIQDIWDSASEGQGVTVGTLIHYAKDGGWTVPTRSN 312
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.316 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 516,679,759
Number of Sequences: 1028106
Number of extensions: 22303856
Number of successful extensions: 57628
Number of sequences better than 5.0e-02: 15
Number of HSP's better than 0.1 without gapping: 11
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 57601
Number of HSP's gapped (non-prelim): 20
length of query: 791
length of database: 324,535,921
effective HSP length: 129
effective length of query: 662
effective length of database: 191,910,247
effective search space: 127044583514
effective search space used: 127044583514
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 96 (41.6 bits)
Query= orf61 [15399-15457]
(57 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.320 0.134 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 19,570,331
Number of Sequences: 1028106
Number of extensions: 295251
Number of successful extensions: 533
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 533
Number of HSP's gapped (non-prelim): 0
length of query: 57
length of database: 324,535,921
effective HSP length: 33
effective length of query: 24
effective length of database: 290,608,423
effective search space: 6974602152
effective search space used: 6974602152
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 85 (37.4 bits)
Query= orf62 [15563-15814]
(250 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
1,028,106 sequences; 324,535,921 total letters
Searching done
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF
Posted date: Jul 12, 2002 12:04 AM
Number of letters in database: 324,535,921
Number of sequences in database: 1,028,106
Lambda K H
0.318 0.130 0.000
Gapped
Lambda K H
0.267 0.0410 4.65e-269
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 133,621,696
Number of Sequences: 1028106
Number of extensions: 4833468
Number of successful extensions: 15642
Number of sequences better than 5.0e-02: 0
Number of HSP's better than 0.1 without gapping: 0
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 15642
Number of HSP's gapped (non-prelim): 0
length of query: 250
length of database: 324,535,921
effective HSP length: 118
effective length of query: 132
effective length of database: 203,219,413
effective search space: 26824962516
effective search space used: 26824962516
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 90 (39.3 bits)